BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy13967
(379 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|270007946|gb|EFA04394.1| hypothetical protein TcasGA2_TC014693 [Tribolium castaneum]
Length = 385
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/386 (53%), Positives = 261/386 (67%), Gaps = 27/386 (6%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E+L+ DA+ K ED KT G VTI+ ++ L V++ DY + +EELFVD+SR
Sbjct: 6 EKLRRFDAYPKTLEDVRIKTYGGAVVTIISLTIMTLLFWVELVDYLTPNVSEELFVDTSR 65
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
+ I+LDI+VPTISCD+LALDA+DSSGEQHL ++HNIYKRRLDL G+PI+EP+KE +
Sbjct: 66 SPSIQINLDIIVPTISCDFLALDAMDSSGEQHLQIDHNIYKRRLDLQGQPIEEPKKEDIT 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE-LD 183
K+K +TE T + +CGSCYGA + ++CCNTC +V+EAYR ++WA PE +
Sbjct: 126 I--KRKNSTEVATV-----NKTECGSCYGASFDPKRCCNTCEDVREAYRERRWAFPENPE 178
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
I QCK E +EKLK F +GCQIYG L VNRVSGSFHIAPG S+SINHVHVHD+QP++S
Sbjct: 179 NITQCKEERFSEKLKTAFAQGCQIYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSS 238
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS- 302
FNTTH IRHLSFG + D + PL TV AEEGASMF Y+IKI+PT Y +LDG
Sbjct: 239 TEFNTTHKIRHLSFGASI--DSDTHNPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQF 296
Query: 303 ---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
L G+ GMPGIFF YELSPLMVK TE+S+S GH T + I G
Sbjct: 297 ISANQFSVTKHRRVISLMSGESGMPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGG 356
Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
Y L+D +L+ VK I K+E+G
Sbjct: 357 VYTVAGLIDTMLYHSVKLIQKKIELG 382
>gi|189237821|ref|XP_974331.2| PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
Length = 395
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 206/385 (53%), Positives = 260/385 (67%), Gaps = 25/385 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ DA+ K ED KT G VTI+ ++ L V++ DY + +EELFVD+SR
Sbjct: 15 KLRRFDAYPKTLEDVRIKTYGGAVVTIISLTIMTLLFWVELVDYLTPNVSEELFVDTSRS 74
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ I+LDI+VPTISCD+LALDA+DSSGEQHL ++HNIYKRRLDL G+PI+EP+KE +
Sbjct: 75 PSIQINLDIIVPTISCDFLALDAMDSSGEQHLQIDHNIYKRRLDLQGQPIEEPKKEDITI 134
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE-LDT 184
K+K +TE T + +CGSCYGA + ++CCNTC +V+EAYR ++WA PE +
Sbjct: 135 --KRKNSTEVSVATV---NKTECGSCYGASFDPKRCCNTCEDVREAYRERRWAFPENPEN 189
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCK E +EKLK F +GCQIYG L VNRVSGSFHIAPG S+SINHVHVHD+QP++S
Sbjct: 190 ITQCKEERFSEKLKTAFAQGCQIYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSST 249
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
FNTTH IRHLSFG + D + PL TV AEEGASMF Y+IKI+PT Y +LDG
Sbjct: 250 EFNTTHKIRHLSFGASI--DSDTHNPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQFI 307
Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L G+ GMPGIFF YELSPLMVK TE+S+S GH T + I G
Sbjct: 308 SANQFSVTKHRRVISLMSGESGMPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGGV 367
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
Y L+D +L+ VK I K+E+G
Sbjct: 368 YTVAGLIDTMLYHSVKLIQKKIELG 392
>gi|307179776|gb|EFN67966.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Camponotus floridanus]
Length = 385
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 202/385 (52%), Positives = 256/385 (66%), Gaps = 25/385 (6%)
Query: 7 LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ LD K E D +T G VT++ + + L+ ++ Y S +EELFVD+SR
Sbjct: 4 LRQLDVHPKVREEADILVRTFSGAIVTVISTIIMGILLMSEINYYLTPSMSEELFVDTSR 63
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
GSKL I+LDI+VP ISCD L++DA+D++GEQHLH+EHNI+KRRLDL+GKPI++PQ+ +
Sbjct: 64 GSKLRINLDIIVPVISCDLLSIDAMDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRTNIT 123
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K T E E+ CG CYGA TET +CCNTC EV+EAY+ KKWA P+
Sbjct: 124 DSKAVNKTAEKAL---EIGSTESCGDCYGAATETLRCCNTCEEVREAYKLKKWAPPDPAN 180
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCK++ S EK+K+ FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPYTS
Sbjct: 181 IKQCKDDKSMEKIKHAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTST 240
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
FN TH IRHLSFG+ + + P+D T A EGA MF +YIKI+PT Y R DGS
Sbjct: 241 HFNMTHKIRHLSFGLNIPG---KTNPMDDTTVIATEGAMMFYHYIKIVPTTYVRTDGSTL 297
Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L G+ GMPGIFFSYELSPLMVK TEK+KS GH T I G
Sbjct: 298 FTNQFSVTRHAKQVSLFTGESGMPGIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGV 357
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+LL+ V+ I K+E+G
Sbjct: 358 FTVAGLIDSLLYHSVRAIQKKIELG 382
>gi|380016121|ref|XP_003692037.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Apis florea]
Length = 385
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 202/386 (52%), Positives = 255/386 (66%), Gaps = 27/386 (6%)
Query: 7 LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ LD K E D +T G VTI+ + + L +V Y + +EELFVD+SR
Sbjct: 4 LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEVNYYLTPTLSEELFVDTSR 63
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
GSKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+ +
Sbjct: 64 GSKLRINLDIIVPTISCDLLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123
Query: 125 AVKK-KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
K K T + +TTE CG CYGA +E KCCNTC +V+EAYR K WA P L
Sbjct: 124 DTKALSKTTAKTLESTTE----KICGDCYGAASEIIKCCNTCEDVREAYRLKNWAPPVLG 179
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
I QC+N+ S EK+K FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPYTS
Sbjct: 180 NIKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTS 239
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS- 302
FN TH IRHLSFG+ + + P+D T A EGA MF +YIKI+PT Y R DGS
Sbjct: 240 TQFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGST 296
Query: 303 ---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
L G+ GMPGIFF+YELSPLMVK TEK+KS GH T I G
Sbjct: 297 LLTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGG 356
Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+LL+ ++ I K+E+G
Sbjct: 357 VFTVAGLIDSLLYHSLRAIQKKIELG 382
>gi|193627365|ref|XP_001948436.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Acyrthosiphon pisum]
Length = 404
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 198/385 (51%), Positives = 262/385 (68%), Gaps = 28/385 (7%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF KP E+ KTV+GG V++VC+L I +L+ ++ +Y + TEELFVD+SR
Sbjct: 13 LKQFDAFAKPLEEVQIKTVWGGIVSLVCFLTIVFLMVSNLVEYLDNTPTEELFVDTSRNK 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I+ DIVVP ISCD+L LDAVD+SGE HL V+HNIYKRRL+L+G+PI +P+K +
Sbjct: 73 KLQINFDIVVPKISCDFLVLDAVDNSGETHLQVDHNIYKRRLNLEGQPISDPEKS-DDVG 131
Query: 127 KKKKVTTENGTTTTELEDPNK----CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
KK + + + E +D N CGSCYGAE+ T CCNTC++VK AY+ K W
Sbjct: 132 SKKTLNPPSMLKSNETDDANNTEDICGSCYGAESSTIPCCNTCDDVKRAYKMKNWDF-RP 190
Query: 183 DTIVQCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
+I QCKN+ S ++ + F EGCQ+YG L VNRVSGSFHIAPG+S+S NH+HVHD+ P+
Sbjct: 191 SSIEQCKNQSSQNEMYDKAFKEGCQLYGTLLVNRVSGSFHIAPGMSFSFNHMHVHDVHPF 250
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+S++FNTTH IRHLSFG KL+ + PLD T + A EGA+MF YYIKI+PT+Y+R
Sbjct: 251 SSSSFNTTHTIRHLSFGQKLESINTSHGGNPLDSTESIAGEGATMFQYYIKIVPTLYQRR 310
Query: 300 DGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
D S G G PGIFFSYE SP+M+K+TEK + LGHL+T+ +CN
Sbjct: 311 DLSIFSTNQFSVTKHKVQAFDKGPSGAPGIFFSYEFSPIMIKLTEKPRLLGHLFTQFLCN 370
Query: 345 ISGTYITFMLVDALLHSCVKKISKV 369
ISG +I F ++D ++ K+SKV
Sbjct: 371 ISGVFICFWIIDIFMY----KVSKV 391
>gi|383864675|ref|XP_003707803.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Megachile rotundata]
Length = 385
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/385 (51%), Positives = 253/385 (65%), Gaps = 25/385 (6%)
Query: 7 LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ LD K E D +T G VTI+ + ++ L ++ Y + +EELFVD+SR
Sbjct: 4 LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMAILFLTELNYYLTPTLSEELFVDTSR 63
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
GSKL I+LDIVVPTISCD L++DA+D++GEQHL +EHNIYKRRLDL GKPI++PQK +
Sbjct: 64 GSKLRINLDIVVPTISCDLLSIDAMDTTGEQHLQIEHNIYKRRLDLQGKPIEDPQKTDIT 123
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K TT +T +E CG CYGA +E KCCNTC +V++AY K WA P+ +
Sbjct: 124 DTKALSKTTAKSVESTTVE---TCGDCYGAASEKIKCCNTCEDVRKAYSDKNWAPPDPGS 180
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+N+ S EK+K FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPY S
Sbjct: 181 IKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGNSFSVNHVHVHDVQPYMST 240
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
FN TH IRHLSFG+ + + P+D T A EGA MF +YIKI+PT Y R DGS
Sbjct: 241 QFNMTHKIRHLSFGLNIPG---KTNPIDDTTMVAMEGAMMFYHYIKIVPTTYVRADGSTL 297
Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L G+ GMPGIFFSYELSPLMVK TEK+KS GH T + I G
Sbjct: 298 LTNQFSVTRHARQVSLLSGESGMPGIFFSYELSPLMVKYTEKAKSFGHFATNMCAIIGGV 357
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+ L+ V+ I K+E+G
Sbjct: 358 FTVAGLIDSFLYHSVRAIQKKIELG 382
>gi|332024433|gb|EGI64631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Acromyrmex echinatior]
Length = 386
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/390 (51%), Positives = 257/390 (65%), Gaps = 30/390 (7%)
Query: 5 ERLKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+ L+ LD K E D +T G VTI+ + + L ++ Y + +EELFVD+
Sbjct: 2 QMLRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFLSEINYYLTPTMSEELFVDT 61
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRGSKL I+LDI+VP+ISCD L+LDA+D++GEQHLH+EHNI+KRRLDL+G PI++PQ+
Sbjct: 62 SRGSKLRINLDIIVPSISCDLLSLDAMDTTGEQHLHIEHNIFKRRLDLNGNPIEDPQRTN 121
Query: 123 VNAVKKKKVTTENGT---TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ K TTE +TTEL CG CYGA T+T KCCNTC +V EAYR KKWA
Sbjct: 122 ITDAKAMSKTTEKAVEIGSTTEL-----CGDCYGATTDTMKCCNTCEDVWEAYRRKKWAP 176
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
P+ + QC+N+ S +KLK+ FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+Q
Sbjct: 177 PDPADVKQCQNDKSMDKLKHAFTQGCQIYGYMEVNRVGGSFHIAPGASFSVNHVHVHDVQ 236
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
PYTS+ FN TH IRHLSFG+ + + P+DG + A MF +YIKI+PT Y R
Sbjct: 237 PYTSSHFNMTHKIRHLSFGLNIPG---KTNPMDGMTVVDMDAAMMFYHYIKIVPTTYVRA 293
Query: 300 DGS----------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DGS L G+ GMPGIFF+YELSPLMVK TEK+ S GH T
Sbjct: 294 DGSTLLTNQFSVTRHSKKVSLLTGESGMPGIFFNYELSPLMVKYTEKANSFGHFATNTCA 353
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + L+D+LL+ V+ I K+E+G
Sbjct: 354 IIGGVFTVAGLIDSLLYHSVRAIQRKIELG 383
>gi|307193219|gb|EFN76110.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Harpegnathos saltator]
Length = 386
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/385 (51%), Positives = 252/385 (65%), Gaps = 24/385 (6%)
Query: 7 LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ LD K E D +T G VTI+ + + L ++ Y + +EELFVD+SR
Sbjct: 4 LRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFMSEINYYLTPTMSEELFVDTSR 63
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
GSKL I+LD++VPTISCD L++DA+D++G Q+L +EHNI++RRLDL+GKPI++PQ+ N
Sbjct: 64 GSKLRINLDVIVPTISCDLLSVDAMDTTGVQYLQIEHNIFQRRLDLNGKPIEDPQR--TN 121
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K K V T CG CYGA TET +CCNTC++V+ AYR KKWA+P+L
Sbjct: 122 ITKTKAVVKPTDEETQISSTTKVCGDCYGAATETLECCNTCDDVQMAYRLKKWAMPDLAK 181
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+N+ S +K K+ FT+GCQIYGY+EVNRV GSFHIAPG SYS+NHVHVHD+QPY S
Sbjct: 182 IKQCQNDKSADKYKHAFTQGCQIYGYMEVNRVGGSFHIAPGDSYSVNHVHVHDVQPYNSN 241
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
FN TH IRHLSFG+ + + P+D T A EGA MF YYIKI+PT Y R DGS L
Sbjct: 242 HFNMTHKIRHLSFGLNIPG---KTNPMDDTTTVATEGAMMFYYYIKIVPTTYVRADGSTL 298
Query: 305 GGG----------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
D GMPGIFFSYELSPLMVK TEK+KS GH T I G
Sbjct: 299 LTNQFSVTRHSKRMPLYMSDSGMPGIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGV 358
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+LL+ V+ I K+E+G
Sbjct: 359 FTVAGLIDSLLYHSVRAIQKKIELG 383
>gi|350404831|ref|XP_003487234.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Bombus impatiens]
Length = 385
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 200/385 (51%), Positives = 253/385 (65%), Gaps = 25/385 (6%)
Query: 7 LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ LD K E D +T G VTI+ + + L +V Y + +EELFVD+SR
Sbjct: 4 LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEVNYYLTPTLSEELFVDTSR 63
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
GSKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+ +
Sbjct: 64 GSKLRINLDIIVPTISCDVLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K + TT +T + CG CYGA + KCCNTC +V+EAYR K WALP L
Sbjct: 124 DTKARSKTTTKTVESTTEK---ACGDCYGAAGDIIKCCNTCEDVREAYRLKNWALPALGM 180
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCKN+ S EK+K F +GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD++PYTS
Sbjct: 181 IKQCKNDKSVEKMKTAFIQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTST 240
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
FN TH IRHLSFG+ + + P+D T A EGA MF +YIKI+PT Y R DGS
Sbjct: 241 QFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTL 297
Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L G+ GMPGIFF+YELSPLMVK TEK+KS GH T I G
Sbjct: 298 LTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGV 357
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+LL+ V+ I K+E+G
Sbjct: 358 FTVAGLIDSLLYHSVRAIQKKIELG 382
>gi|328786822|ref|XP_393819.4| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Apis mellifera]
Length = 383
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 200/386 (51%), Positives = 255/386 (66%), Gaps = 29/386 (7%)
Query: 7 LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ LD K E D +T G VTI+ + + L ++ Y + +EELFVD+SR
Sbjct: 4 LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEMNYYLTPTLSEELFVDTSR 63
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
GSKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+ +
Sbjct: 64 GSKLRINLDIIVPTISCDLLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123
Query: 125 AVKK-KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
K K T + +TTE CG CYGA +E KCCNTC +V+EAYR K WA+ L
Sbjct: 124 DTKALSKTTAKTLESTTE----KICGDCYGAASEIIKCCNTCEDVREAYRLKNWAV--LG 177
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
I QC+N+ S EK+K FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPYTS
Sbjct: 178 NIKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTS 237
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS- 302
FN TH IRHLSFG+ + + P+D T A EGA MF +YIKI+PT Y R DGS
Sbjct: 238 TQFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGST 294
Query: 303 ---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
L G+ GMPGIFF+YELSPLMVK TEK+KS GH T I G
Sbjct: 295 LLTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGG 354
Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+LL+ ++ I K+E+G
Sbjct: 355 VFTVAGLIDSLLYHSLRAIQKKIELG 380
>gi|340721521|ref|XP_003399168.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Bombus terrestris]
Length = 385
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 202/385 (52%), Positives = 253/385 (65%), Gaps = 25/385 (6%)
Query: 7 LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ LD K E D +T G VTI+ + +S L +V Y + +EELFVD+SR
Sbjct: 4 LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMSILFLSEVNYYLTPTLSEELFVDTSR 63
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
SKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+ +
Sbjct: 64 DSKLRINLDIIVPTISCDVLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K + TTE T E CG CYGA + KCCNTC +V+EAYR K WA P L
Sbjct: 124 DTKARSKTTEK---TVESTTEKACGDCYGAAGDIIKCCNTCEDVREAYRLKNWAPPALGM 180
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCKN+ S EK+K FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD++PYTS
Sbjct: 181 IKQCKNDKSVEKIKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTST 240
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
FN TH IRHLSFG+ + + P+D T A EGA MF +YIKI+PT Y R DGS
Sbjct: 241 QFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTL 297
Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L G+ GMPGIFF+YELSPLMVK TEK+KS GH T I G
Sbjct: 298 LTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGV 357
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+LL+ V+ I K+E+G
Sbjct: 358 FTVAGLIDSLLYHSVRAIQKKIELG 382
>gi|357612408|gb|EHJ67977.1| hypothetical protein KGM_08440 [Danaus plexippus]
Length = 385
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/386 (49%), Positives = 246/386 (63%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+ K LDA+ K EDF KT G +T+ + LI +++ Y + +EELFVD+SRG
Sbjct: 8 KFKQLDAYAKTLEDFRVKTATGAIITVTGAFVMILLIVLELHTYMSPNISEELFVDTSRG 67
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+ DIVVP ISCDYL LDA+DSSGEQHL ++HN++KRRLDLDG PI+EP KE ++
Sbjct: 68 HKLRINFDIVVPRISCDYLVLDAMDSSGEQHLQMDHNVHKRRLDLDGVPIKEPIKEDISL 127
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K + T CGSCYGA +CCNTC +VKEAYR ++WALP+L T+
Sbjct: 128 SSTVKQNSSEIAIVT-------CGSCYGAAFNDSQCCNTCEDVKEAYRLRRWALPDLATV 180
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK++ S E+ EGCQIYGY+EVNRV GSFHIAPG S++INHVHVHD+QP++S+
Sbjct: 181 EQCKDDDSLERTNLALKEGCQIYGYMEVNRVGGSFHIAPGKSFTINHVHVHDVQPFSSSV 240
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FNTTH IRHLSFG ++ + PLDG A+EGA MF YY+KI+PT+Y +LDG+ L
Sbjct: 241 FNTTHIIRHLSFGSDIESANT--APLDGITGLAKEGAVMFQYYLKIVPTMYVKLDGTILH 298
Query: 306 GG----------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
+ GMPG FFSYELSPLMVK T K +S+GH T + + G +
Sbjct: 299 TNQFSVTRHQKSVSNINVESGMPGAFFSYELSPLMVKYTAKGRSIGHFATNVCAIVGGVF 358
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
+ D LL+ + + GK
Sbjct: 359 TVAGIFDTLLYHSLNAFQNKVVLGKA 384
>gi|158300475|ref|XP_320382.3| AGAP012144-PA [Anopheles gambiae str. PEST]
gi|157013177|gb|EAA00591.3| AGAP012144-PA [Anopheles gambiae str. PEST]
Length = 386
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 190/389 (48%), Positives = 247/389 (63%), Gaps = 23/389 (5%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F + L+ LDA+ K +F +TV G A+T++ + I L+ ++ Y + +EELFV
Sbjct: 1 MRFLDSLRRLDAYPKIDNEFSIRTVSGAALTLISSIVIVTLVIGEINAYLSPNVSEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D++RG KL I+LD +P ISCDY++LDA DS+GEQHLH+EHNIYKRRLDL G I+EP+K
Sbjct: 61 DTTRGHKLKINLDFTIPRISCDYVSLDAQDSTGEQHLHIEHNIYKRRLDLQGNQIEEPKK 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
E + A K+ +TE TTT CGSCYGA +CCNTC EV +AYR +KW P
Sbjct: 121 EDIQASTKRISSTEAPATTTV---KPACGSCYGAAKNASQCCNTCQEVIDAYRERKWN-P 176
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
++ QCKN F+EGC IYG +EVNRV G FHIAPG S+SINH+HVHD+QP
Sbjct: 177 NVEDFEQCKNGNGGSVEGKAFSEGCHIYGTMEVNRVEGRFHIAPGKSFSINHIHVHDVQP 236
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
Y+S+ FNTTH I LSFG + R PLDG + +A EGA MF YYIKI+PT++ L+
Sbjct: 237 YSSSRFNTTHRINTLSFGEQFGFGTTR--PLDGLMVEATEGAMMFQYYIKIVPTMFVPLN 294
Query: 301 GSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L G+ GMPGIF +YELSPLMVK TEK SLGH T +
Sbjct: 295 GPTLYTNQFSVTKHQKSVTAMSGETGMPGIFVNYELSPLMVKFTEKRNSLGHFATNVCAI 354
Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++D+LL + + I K+E+G
Sbjct: 355 IGGIFTVAGIIDSLLFTSIHVIKRKIELG 383
>gi|170031960|ref|XP_001843851.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Culex quinquefasciatus]
gi|167871431|gb|EDS34814.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Culex quinquefasciatus]
Length = 391
Score = 364 bits (934), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 190/396 (47%), Positives = 248/396 (62%), Gaps = 32/396 (8%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + + LDA+ K ++F KT+ G A+T + I +LI + + + ++LFV
Sbjct: 1 MRLIDSFRRLDAYPKIDKEFSIKTIGGAALTTISGTIIVFLIYSEFVAFLTPTIEDQLFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D++RG KL I+LD VVP +SCDY++LDA D++GEQHLH++HNI+KRRLDL G PI+ P+K
Sbjct: 61 DATRGQKLRINLDFVVPRVSCDYVSLDAQDATGEQHLHIDHNIFKRRLDLKGNPIEAPKK 120
Query: 121 EVVNAVKKKKVTTE----NGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKK 176
E + A K +K TE N +TT N CGSCYGA+ + CCNTC +V +AYR K+
Sbjct: 121 EDIQAPKPRKDATEAPVVNSSTTA-----NPCGSCYGAQKNSSHCCNTCQDVIDAYREKQ 175
Query: 177 WALPELDTIVQCKNEYSTEKLK---NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
W P L+ QCK E + KL F EGCQIYGY+EVNRV GSFHIAPG S+SI+H+
Sbjct: 176 WN-PTLEEFEQCKTEVAIGKLSLEAKAFNEGCQIYGYMEVNRVGGSFHIAPGKSFSISHI 234
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
HVHD+QP++S+ FN THHI LSFG + + PLDGT AEEGA MF YYIKI+P
Sbjct: 235 HVHDVQPFSSSRFNMTHHINTLSFGEEFGFG--QTSPLDGTDVIAEEGAMMFQYYIKIVP 292
Query: 294 TIYERLDGSKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
T + L G KL GD GMPGIF +YELSPLMVK TEK S H
Sbjct: 293 TEFVPLSGPKLHTNQFSVTTHRKSVSLMSGDSGMPGIFVNYELSPLMVKFTEKRSSFSHF 352
Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
T + I G + +VD LL + + + K+E+G
Sbjct: 353 ATNLCAIIGGIFTVSGIVDTLLFTSIHALKRKIELG 388
>gi|156552683|ref|XP_001599365.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Nasonia vitripennis]
Length = 328
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 229/337 (67%), Gaps = 34/337 (10%)
Query: 56 EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
EELFVD+SRGSKL I+LDIV+ +I+CD L++DA+D++GE HL ++HNI+KRRLDLDGKPI
Sbjct: 3 EELFVDTSRGSKLKINLDIVISSIACDMLSIDAMDTTGETHLEIQHNIFKRRLDLDGKPI 62
Query: 116 QEPQKE-VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAY 172
++P+K + + K + EN T KCG CYGA +E KCCNTC EVKEAY
Sbjct: 63 EDPKKTGIADPKKTTEKPAENATA--------KCGDCYGAASEELGIKCCNTCEEVKEAY 114
Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
R +KWA+ + QCKN+ S E TF EGCQIYG++EVNRV GSFHIAPG S +I+H
Sbjct: 115 RKRKWAVHDTSRFAQCKNDKSREM---TFKEGCQIYGFMEVNRVGGSFHIAPGDSITIDH 171
Query: 233 VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
+HVHD+QPY+S+ FN TH IRHLSFG + + P+D T A EGA+MF++YIKI+
Sbjct: 172 LHVHDVQPYSSSQFNLTHRIRHLSFGTNIPG---KTNPIDNTTVIASEGATMFHHYIKIV 228
Query: 293 PTIYERLDGSKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
PT + RLDGS L G+ GMPG+FFSYELSPLMVK T+ KSLGH
Sbjct: 229 PTTFMRLDGSILHTNQFSLTKHSRSIKQYSGESGMPGLFFSYELSPLMVKYTQTVKSLGH 288
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
L T I GT+ ++DA L+ V+ I K+E+G
Sbjct: 289 LMTNTCAIIGGTFTVASIIDAFLYHSVRAIQKKMELG 325
>gi|242007856|ref|XP_002424735.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
gi|212508228|gb|EEB11997.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
Length = 376
Score = 350 bits (899), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 190/388 (48%), Positives = 241/388 (62%), Gaps = 46/388 (11%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK D + K +D+ +T+ GGAVT+V ++ ++ L ++ Y +EELFVD++R
Sbjct: 10 LKDFDGYPKTLDDYRIRTLGGGAVTVVSYIIMTLLFISELNTYLTPDISEELFVDTTREP 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV---- 122
KL I+L+I VP ISC YL+LDA+DSSGEQHL +EHNIYK LD +G PI+EP+KE
Sbjct: 70 KLQINLNITVPEISCKYLSLDAMDSSGEQHLQIEHNIYKVSLDKNGIPIKEPEKETFVKP 129
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWALP 180
VN K+K KCGSCYGAE+ET CCNTC +VK+AY + W L
Sbjct: 130 VNETKEK-----------------KCGSCYGAESETLNITCCNTCADVKDAYMKRGWGLN 172
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
L+ I QCKN N F EGC IYG +EVNRV GSFHIAPG S+SINHVHVHD+QP
Sbjct: 173 NLELIEQCKN----LSQNNIFNEGCFIYGTMEVNRVGGSFHIAPGQSFSINHVHVHDVQP 228
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
++S AFNT+H I HLSFG + + PLDG VA EGA+MF YYIKI+PTIY D
Sbjct: 229 FSSKAFNTSHKIDHLSFGYNIPG---KTNPLDGIVALTHEGATMFQYYIKIVPTIYYYYD 285
Query: 301 GS--------------KLGGGDGGM-PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
S K G G+ PGIFF+YEL+P+MVK TE+ +S GH T + I
Sbjct: 286 KSGTILTNQFSVTRHQKSGSETIGVPPGIFFNYELAPIMVKYTERKRSFGHFATNVCAII 345
Query: 346 SGTYITFMLVDALLHSCVKKI-SKVEIG 372
G + L+DA L+ V+ K+EIG
Sbjct: 346 GGVFTVASLIDAFLYRSVQAFKKKIEIG 373
>gi|225708964|gb|ACO10328.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Caligus rogercresseyi]
Length = 385
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 176/381 (46%), Positives = 232/381 (60%), Gaps = 30/381 (7%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+SE L+ LDA+ K EDF +T+ GGA+T++ + + +L ++ +Y EELFVD+
Sbjct: 4 WSEALRRLDAYPKTLEDFRIQTLSGGAITLLSGVLMVFLFASEIREYLTPRVQEELFVDT 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S+G KL I+LD+V ++SCD+L LDA+D SGE H+ + HNIYKRRL L+G P++EP++E
Sbjct: 64 SKGGKLKINLDVVFNSVSCDFLVLDAMDVSGESHVDIVHNIYKRRLSLEGSPMEEPRRET 123
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
KK TT + E P CGSCYGAET CCN+C EVKEAYR K W
Sbjct: 124 EVGQKK---TTHAPSPKNETSTP-PCGSCYGAETPGSPCCNSCGEVKEAYRRKGW----- 174
Query: 183 DTIVQCKN---EYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
TIV K E TE ++ + EGCQIYG L VNRV GSFHI PG S+++NH+H+HD+Q
Sbjct: 175 -TIVAAKFEQCEMDTEGIERVYKEGCQIYGSLLVNRVGGSFHIVPGKSFTLNHLHIHDLQ 233
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
P++S FNT+H IRHLSFG K D LD A + +G M+ YY+KI+PT Y R
Sbjct: 234 PFSSGEFNTSHRIRHLSFGSKTALDPGGNA-LDAVSALSPKGGLMYQYYLKIVPTTYSRS 292
Query: 300 DGSKLGGGD----------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG G GGMPG+FF+YEL+PLMVK +EK KS GH T +
Sbjct: 293 DGGTFTGNQYSVTRLEKDVSSSLDSGGMPGVFFNYELAPLMVKYSEKEKSFGHFATGLCA 352
Query: 344 NISGTYITFMLVDALLHSCVK 364
I G + D ++S K
Sbjct: 353 IIGGVFTLASAFDKFIYSSSK 373
>gi|321463520|gb|EFX74535.1| hypothetical protein DAPPUDRAFT_226626 [Daphnia pulex]
Length = 381
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 173/387 (44%), Positives = 241/387 (62%), Gaps = 32/387 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
K +DA+ K EDF +T G VT+ + +++L ++ D+ ++ +E+L+VD++R
Sbjct: 9 FKTIDAYPKTLEDFTIRTATGAMVTVFSSIIMAFLFVIEFRDFLSINVSEQLYVDTTRIP 68
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I+ D+ PTISC YL++DAVDSSGEQ VEHNI+K+RL+L G+P+Q + E +N
Sbjct: 69 NMKINFDVTFPTISCSYLSVDAVDSSGEQQFGVEHNIFKQRLNLLGEPLQAAELEEINKT 128
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL-PELDTI 185
K TE T+TE C SCYGA+ CC TC EV+EAYR K WA PE
Sbjct: 129 HNK---TE---TSTEESASKPCNSCYGAK---EGCCETCAEVREAYRQKNWAFRPE--EF 177
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+NE + + + F EGC++YGYLEVNRVSGSFHIAPG SY+INHVHVHD+QPY+S
Sbjct: 178 EQCRNEKNLTRDYSAFKEGCKLYGYLEVNRVSGSFHIAPGKSYAINHVHVHDVQPYSSED 237
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN THHI LSFG L + PLDG + A++GA MF YYIK++PT Y +LDG +
Sbjct: 238 FNVTHHINSLSFGTSLIG---KENPLDGFLTTADKGAMMFQYYIKVVPTWYVKLDGEEFH 294
Query: 306 ----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GG+ G+PG+FF+YE+SPL + E +S+GH T + I G +
Sbjct: 295 TNQYSVTRHQKVVSSYGGESGVPGVFFTYEMSPLQISYKESKRSIGHFATDVCTIIGGVF 354
Query: 350 ITFMLVDALLHSCVKKI-SKVEIGGKT 375
++D+LL+ K + K+++G T
Sbjct: 355 TVAGIIDSLLYRSSKLLQQKLQLGKAT 381
>gi|157118753|ref|XP_001653244.1| ptx1 protein [Aedes aegypti]
gi|108875623|gb|EAT39848.1| AAEL008391-PA [Aedes aegypti]
Length = 384
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 179/392 (45%), Positives = 238/392 (60%), Gaps = 31/392 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + L+ DA+ K ++F +TV G +T + I LI ++ Y T+ELFV
Sbjct: 1 MTLLDSLRRFDAYPKIDKEFSIRTVGGATLTFISGTIIVVLIYSELIAYLTPVVTDELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
DS+RG KL I+LD +P ISCDY++LDA D++GEQHLH+EH IYKRR+DL G PI+E +K
Sbjct: 61 DSTRGQKLKINLDFYIPRISCDYVSLDAQDATGEQHLHIEHTIYKRRMDLQGNPIEEAKK 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
E ++A K + E E+ KC SCYGAE + CC TC +V +AYR K+W P
Sbjct: 121 EDISAPKPRLEKKE--------ENVKKCRSCYGAEKNSTHCCETCQDVIDAYREKQWN-P 171
Query: 181 ELDTIVQCKNEYSTEKL---KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
LD QC+NE K F+EGCQIYG ++VNRV GSFHIAPG S+SI+H+HVHD
Sbjct: 172 NLDDFEQCQNEVLLGKKSLESKAFSEGCQIYGSMQVNRVGGSFHIAPGKSFSISHIHVHD 231
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
+QP++S+ FNT+H I LSFG + R PLD T A EGA MF YYIKI+PT +
Sbjct: 232 VQPFSSSRFNTSHRINTLSFGEEFGYGQTR--PLDFTEKTAHEGAIMFQYYIKIVPTEFV 289
Query: 298 RLDGSKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
L+G L G+ GMPGIF +YELSPLMV+ TEK S H T +
Sbjct: 290 PLNGPTLHTNQFSVTKHQKSVSVMSGESGMPGIFVNYELSPLMVRFTEKRNSFSHFATNL 349
Query: 342 MCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++D+LL + + + K+E+G
Sbjct: 350 CAIIGGIFTVAGIIDSLLFTSIHALKRKIELG 381
>gi|260815243|ref|XP_002602383.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
gi|229287692|gb|EEN58395.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
Length = 397
Score = 333 bits (854), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 170/403 (42%), Positives = 243/403 (60%), Gaps = 47/403 (11%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
++ +L+ DA+ K +DF KT G AVTI+ F+ L ++ Y + TEELFVD+
Sbjct: 6 WAAKLRRFDAYPKTLDDFRVKTFGGAAVTIISGFFMILLFVSELQYYLTLEVTEELFVDT 65
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE- 121
SRG K+ I++DI+ + C YL++DA+D +GEQ + V+HN++KRR+DL G + EP+KE
Sbjct: 66 SRGEKMRINIDILFHKVPCAYLSIDAMDIAGEQQIDVDHNLFKRRMDLQGNILDEPEKED 125
Query: 122 -------VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
+ A+KK EN T C SCYGAETE KCCNTC +V+EAYR
Sbjct: 126 LGDPSDEFMQAIKK----LENKTADV-------CESCYGAETEDLKCCNTCEDVREAYRR 174
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
K WA DTI QCK E +EKLK EGCQ+YGYLEVN+V+G+FH APG S+ +HVH
Sbjct: 175 KGWAFNNPDTIEQCKREGWSEKLKQQKNEGCQVYGYLEVNKVAGNFHFAPGKSFQQHHVH 234
Query: 235 --------VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN 286
VHD+QP+ FN +HH+ HLSFG D R PLDG + A++G+ M+
Sbjct: 235 VSCFYHPIVHDLQPFGGEKFNLSHHVNHLSFGT---DIPGRVNPLDGHMVAAKQGSMMYQ 291
Query: 287 YYIKIIPTIYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEK 330
Y++KI+PTIY+++ G ++ G+ G+PG+F YELSP+MV+ TEK
Sbjct: 292 YFVKIVPTIYKKISGQEVRTNQFSVTKHQKQVTASSGEQGLPGVFVLYELSPMMVQFTEK 351
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
+S H T + + G + L+D+L++ + I K+++G
Sbjct: 352 QRSFMHFLTGVCAIVGGVFTVAGLIDSLIYHSARAIQQKIDLG 394
>gi|148222292|ref|NP_001091124.1| ERGIC and golgi 3 [Xenopus laevis]
gi|120538715|gb|AAI29573.1| LOC100036873 protein [Xenopus laevis]
Length = 384
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 168/388 (43%), Positives = 234/388 (60%), Gaps = 27/388 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ DA+ K EDF KT G VT++ L + L ++ Y ELFVD SRG
Sbjct: 6 RLRQFDAYPKTLEDFRVKTCGGAVVTVISGLIMLILFFSELQYYLTKEIYPELFVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD D KP+ E K +
Sbjct: 66 DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADKHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
++ E+ + DPN+C SCYGAETE CCN+C++V+EAYR K WA D+
Sbjct: 126 KLE------EHVVLDPKTLDPNRCESCYGAETEDFSCCNSCDDVREAYRRKGWAFKTPDS 179
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCK E ++K++ EGCQIYG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 IEQCKREGFSQKMQEQKNEGCQIYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 239
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH I+HLSFG +D PLDGT A + + MF Y++KI+PT+Y ++DG L
Sbjct: 240 NINMTHEIKHLSFG---RDYPGLVNPLDGTSIVAMQSSMMFQYFVKIVPTVYVKVDGEVL 296
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 297 RTNQFSVTRHEKMTNGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGV 356
Query: 349 YITFMLVDALLHSCVKKIS-KVEIGGKT 375
+ L+DAL++ + I K+E+G T
Sbjct: 357 FTVASLIDALIYHSTRAIQKKIELGKAT 384
>gi|327271489|ref|XP_003220520.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Anolis carolinensis]
Length = 383
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 167/387 (43%), Positives = 239/387 (61%), Gaps = 26/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK DAF K EDF KT G VT++ L + L ++ Y EL+VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD DGK + P+ E
Sbjct: 66 DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVT-PEAERHEL 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K+++ + + DP++C SCYGAE++ KCCNTC++V+EAYR + WA DTI
Sbjct: 125 GKEEETIFDPNSL-----DPDRCESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E ++K++ EGC++YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
N TH I+HLSFG +D PLDGTV A++ + MF Y++K++PTIY ++DG
Sbjct: 240 INMTHIIKHLSFG---RDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVR 296
Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
K+ GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVF 356
Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
L+D+L++ + I K+E+G T
Sbjct: 357 TVAGLIDSLIYHSARVIQKKIELGKTT 383
>gi|405966014|gb|EKC31342.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Crassostrea gigas]
Length = 397
Score = 323 bits (828), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 174/398 (43%), Positives = 233/398 (58%), Gaps = 31/398 (7%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F ERL+ DA+ K EDF KT G VT++ L + L ++ Y ELFVD+
Sbjct: 6 FYERLRQFDAYPKTLEDFRVKTFGGALVTVISSLLMVILFISELNYYLTKDVQPELFVDT 65
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ--EPQK 120
+RG KL I++DI P + C YL++DA+D SGEQ L V+H+++K+RL+ DG+ I+ EP+K
Sbjct: 66 TRGQKLRINIDIDFPKVPCAYLSIDAMDVSGEQQLDVDHHLFKQRLNADGEKIKDTEPEK 125
Query: 121 E------VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
E + K K E T+ DP++C SCYGAET KCCNTC +V+EAYR
Sbjct: 126 EGTMYEPIFELGDKSKDAVE---AVTKKLDPDRCESCYGAETGDLKCCNTCEDVREAYRK 182
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
K WA + I QC E T K+K EGCQ+YGYLEVN+V G+FH APG S+ +HVH
Sbjct: 183 KGWAFNSPEGIEQCNREGWTAKMKAQQKEGCQVYGYLEVNKVQGNFHFAPGKSFQQHHVH 242
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
VHD+Q + FN +H IRHLSFG QD PLD T +E+ +MF YY+K++PT
Sbjct: 243 VHDLQAFGGQKFNLSHAIRHLSFG---QDYPGIINPLDQTSQISEDEQTMFQYYVKVVPT 299
Query: 295 IYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
Y + G L G GD G+PG+FF YELSP+MVK TEK +S H
Sbjct: 300 TYVDVKGKTLYTNQYSVNKHSKTVGNGMGDSGLPGVFFIYELSPMMVKYTEKQRSFMHFL 359
Query: 339 TKIMCNISGTYITFMLVDALL-HSCVKKISKVEIGGKT 375
T + I G + L+D+++ HS K+E+G T
Sbjct: 360 TGVCAIIGGIFTVAGLIDSMIYHSSRALQKKIELGKAT 397
>gi|41055991|ref|NP_957309.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform 2 [Danio rerio]
gi|82210123|sp|Q803I2.1|ERGI3_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|28278376|gb|AAH44474.1| ERGIC and golgi 3 [Danio rerio]
gi|182890166|gb|AAI64701.1| Ergic3 protein [Danio rerio]
Length = 383
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 166/384 (43%), Positives = 230/384 (59%), Gaps = 26/384 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y ELFVD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELFVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG+P+ A
Sbjct: 66 DKLRINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGQPV------TTEA 119
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K E G DP++C SCYGAET+ KCCNTC++V+EAYR + WA DTI
Sbjct: 120 EKHDLGKEEEGVFDPSTLDPDRCESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
N TH I+HLSFG +D PLD T A + + M+ Y++KI+PTIY + DG
Sbjct: 240 INMTHFIKHLSFG---KDYPGIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGDGEVVK 296
Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
K+ GD G+PG+F YELSP+MVK TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIIGGVF 356
Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
L+D+L++ + I K+E+G
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIELG 380
>gi|47575764|ref|NP_001001226.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Xenopus (Silurana) tropicalis]
gi|82185697|sp|Q6NVS2.1|ERGI3_XENTR RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|45708932|gb|AAH67932.1| ERGIC and golgi 3 [Xenopus (Silurana) tropicalis]
Length = 384
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 163/387 (42%), Positives = 231/387 (59%), Gaps = 25/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ DA+ K EDF KT G VT++ L + L ++ Y ELFVD SRG
Sbjct: 6 RLRQFDAYPKTLEDFRVKTCGGALVTVISGLIMLILFFSELQYYLTKEIYPELFVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD D KP+
Sbjct: 66 DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADRHELG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
++ V + + DPN+C SCYGAET+ CCNTC++V+EAYR + WA D+I
Sbjct: 126 KSEEHVVFDPKSL-----DPNRCESCYGAETDDFSCCNTCDDVREAYRRRGWAFKTPDSI 180
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 181 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 240
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH IRHLSFG +D PLDG+ A + + MF Y++KI+PT+Y ++DG L
Sbjct: 241 INMTHEIRHLSFG---RDYPGLVNPLDGSSVAAMQSSMMFQYFVKIVPTVYVKVDGEVLR 297
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 298 TNQFSVTRHEKMTNGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVF 357
Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
L+D+L++ + I K+E+G T
Sbjct: 358 TVAGLIDSLVYYSTRAIQKKIELGKAT 384
>gi|348521804|ref|XP_003448416.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Oreochromis niloticus]
Length = 384
Score = 320 bits (821), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 166/385 (43%), Positives = 233/385 (60%), Gaps = 27/385 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ + + L ++ Y EL+VD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
KL I++DI+ P + C YL++DA+D +GEQ L VEHN++K+RLD + KP+ QE +K +
Sbjct: 66 DKLKINIDIIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+V + DP++C SCYGAETE KCCNTC++V+EAYR + WA DT
Sbjct: 126 KADDGEVFDPSTL------DPDRCESCYGAETEDLKCCNTCDDVREAYRRRGWAFKSADT 179
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCK E T+K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 239
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH I+HLSFG +D PLDGT A + + M+ Y++KI+PTIY + DG +
Sbjct: 240 NINMTHLIKHLSFG---KDYPGLVNPLDGTDVTAPQASMMYQYFVKIVPTIYMKTDGEVV 296
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK TEK +S H T + I G
Sbjct: 297 KTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGV 356
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+L++ + I K+E+G
Sbjct: 357 FTVAGLIDSLIYHSARVIQKKIELG 381
>gi|148225661|ref|NP_001087591.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Xenopus laevis]
gi|82181499|sp|Q66KH2.1|ERGI3_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|51513379|gb|AAH80394.1| MGC83277 protein [Xenopus laevis]
Length = 389
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 165/392 (42%), Positives = 232/392 (59%), Gaps = 30/392 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ DA+ K EDF KT G VT++ L + L ++ Y ELFVD SRG
Sbjct: 6 RLRQFDAYPKTLEDFRVKTCGGAVVTVISGLIMLILFFSELQYYLTKEVYPELFVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLDLD KP+
Sbjct: 66 DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDLDKKPVTSEADRHELG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+++V + T DPN+C SCYGAET+ CCN+C++V+EAYR K WA D+I
Sbjct: 126 KSEEQVVFDPKTL-----DPNRCESCYGAETDDFSCCNSCDDVREAYRRKGWAFKTPDSI 180
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 181 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 240
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH I+HLSFG +D PLDGT A + + MF Y++KI+PT+Y ++D
Sbjct: 241 FGLDNINMTHEIKHLSFG---KDYPGLVNPLDGTSIVAMQSSMMFQYFVKIVPTVYVKVD 297
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK TEK +S H T +
Sbjct: 298 GEVLRTNQFSVTRHEKMTNGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAI 357
Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
I G + L+D+L++ + I K+E+G T
Sbjct: 358 IGGVFTVAGLIDSLIYYSTRAIQKKIELGKAT 389
>gi|327271491|ref|XP_003220521.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Anolis carolinensis]
Length = 388
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 167/392 (42%), Positives = 239/392 (60%), Gaps = 31/392 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK DAF K EDF KT G VT++ L + L ++ Y EL+VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD DGK + P+ E
Sbjct: 66 DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVT-PEAERHEL 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K+++ + + DP++C SCYGAE++ KCCNTC++V+EAYR + WA DTI
Sbjct: 125 GKEEETIFDPNSL-----DPDRCESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QCK E ++K++ EGC++YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH I+HLSFG +D PLDGTV A++ + MF Y++K++PTIY ++D
Sbjct: 240 FGLDNINMTHIIKHLSFG---RDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVD 296
Query: 301 G-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G K+ GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVVRTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
I G + L+D+L++ + I K+E+G T
Sbjct: 357 IGGVFTVAGLIDSLIYHSARVIQKKIELGKTT 388
>gi|390359988|ref|XP_792057.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Strongylocentrotus purpuratus]
Length = 400
Score = 317 bits (812), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 169/392 (43%), Positives = 234/392 (59%), Gaps = 28/392 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ DA+ K EDF KT G AVTI+ + + L ++ Y EL+VD++RG
Sbjct: 9 RLREFDAYPKTLEDFRVKTFGGAAVTIISSIIMITLFISELNFYLTKEVIPELYVDATRG 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+++IV P + C YL++DA+D SGEQ L V+HNIYKRR+D G PI EP+KE +
Sbjct: 69 EKLKINMEIVFPKMPCAYLSIDAMDISGEQQLDVDHNIYKRRIDKTGTPISEPEKEELGK 128
Query: 126 VKKKKVTTENGTTTT------ELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ ++ E + E+ DPN+C SCYGAET KCCN C V+EAYR K WA
Sbjct: 129 KEDQEKKEEEDSEQEDEKKKMEVLDPNRCESCYGAETPGLKCCNDCEGVQEAYRRKGWAF 188
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+ +I QCK E +EK+++ EGC++YGYLEVN+V+G+FH APG S+ +HVHVHD+Q
Sbjct: 189 SDPTSIEQCKREGFSEKMQSQKEEGCELYGYLEVNKVAGNFHFAPGKSFQQHHVHVHDLQ 248
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
A FN THH++ LSFG++ PLD +G+SMF Y++KI+PT Y +L
Sbjct: 249 AIAGAKFNMTHHVKTLSFGMEYPG---MENPLDNMKTIDVKGSSMFQYFVKIVPTTYTKL 305
Query: 300 DGS------------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
D S G+ G+PG+F YELSPLMVK TEK +S H T +
Sbjct: 306 DKSITRTNQYSVTKHEKQVTTSFSTGEHGLPGVFVLYELSPLMVKFTEKHRSFMHFLTGV 365
Query: 342 MCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + L+D+L++ K I K+++G
Sbjct: 366 CAIIGGVFTVAGLIDSLIYHSAKAIQKKIDLG 397
>gi|259155256|ref|NP_001158869.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Salmo salar]
gi|223647782|gb|ACN10649.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Salmo salar]
Length = 388
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 167/390 (42%), Positives = 230/390 (58%), Gaps = 33/390 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y ELFVD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELFVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P V
Sbjct: 66 DKLKININVIFPNMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGNP-------VTTE 118
Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+K + E G + DP +C SCYGAETE KCCNTC++V+EAYR + WA DT
Sbjct: 119 AEKHDLGQEEGEIFDPSKLDPERCESCYGAETEDLKCCNTCDDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QCK E ++K++ EGCQIYG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCQIYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH I+HLSFG +D PLDGT A + + M+ Y++KI+PTIY +
Sbjct: 239 SFGLDNINMTHLIKHLSFG---RDYPGIVNPLDGTDVAAPQASMMYQYFVKIVPTIYVKW 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG + GD G+PG+F YELSP+MVK TEK +S H T +
Sbjct: 296 DGEVVKTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
+ G + L+D+L++ K I K+E+G
Sbjct: 356 IVGGVFTVAGLIDSLIYHSAKAIQKKIELG 385
>gi|387015776|gb|AFJ50007.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3-like
[Crotalus adamanteus]
Length = 372
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 164/387 (42%), Positives = 228/387 (58%), Gaps = 37/387 (9%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK DAF K EDF KT G VT++ L + +L ++ Y EL+VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGAFVTVISGLIMFFLFFSELQYYLTKEIHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI P + C YL++DA+D +GEQ L VEHN++K+RLD D +E N+
Sbjct: 66 DKLRINIDIAFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDELGKEEELFFNPNS 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ DP +C SCYGAE+E KCCN C++V+EAYR + WA DTI
Sbjct: 126 L-----------------DPERCESCYGAESEDIKCCNNCDDVREAYRRRGWAFKNPDTI 168
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +EK++ EGC++YG+LEVN+V+G+FH APG S+ +HVHVHD+Q Y
Sbjct: 169 EQCKREGFSEKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSYGLDN 228
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH IRHLSFG +D PLDGT+ A + + MF Y++K++PT+Y ++DG +
Sbjct: 229 INITHFIRHLSFG---KDYPGLVNPLDGTIVTAHQASMMFQYFVKVVPTVYMKVDGEMVR 285
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 286 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVF 345
Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
L+D+L++ + I K+E+G T
Sbjct: 346 TVAGLIDSLIYHSARAIQKKIELGKTT 372
>gi|417399979|gb|JAA46966.1| Putative copii vesicle protein [Desmodus rotundus]
Length = 383
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 167/387 (43%), Positives = 228/387 (58%), Gaps = 25/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ +
Sbjct: 66 DKLKINIDVFFPRMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ KV N DP +C SCYGAETE KCCNTC +V+EAYR + WA DTI
Sbjct: 126 KAEMKVFDPNSL------DPERCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y +LDG L
Sbjct: 240 INMTHYIRHLSFG---EDYPGIVNPLDHTNVTALQASMMFQYFVKVVPTVYMKLDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKTV 376
L+D+L++ + I K GKTV
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKTV 383
>gi|74315943|ref|NP_001028277.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform 1 [Danio rerio]
gi|72679324|gb|AAI00126.1| ERGIC and golgi 3 [Danio rerio]
Length = 388
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 166/389 (42%), Positives = 230/389 (59%), Gaps = 31/389 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y ELFVD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELFVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG+P+ A
Sbjct: 66 DKLRINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGQPV------TTEA 119
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K E G DP++C SCYGAET+ KCCNTC++V+EAYR + WA DTI
Sbjct: 120 EKHDLGKEEEGVFDPSTLDPDRCESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH I+HLSFG +D PLD T A + + M+ Y++KI+PTIY + D
Sbjct: 240 FGLDNINMTHFIKHLSFG---KDYPGIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGD 296
Query: 301 G-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G K+ GD G+PG+F YELSP+MVK TEK +S H T +
Sbjct: 297 GEVVKTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + L+D+L++ + I K+E+G
Sbjct: 357 IGGVFTVAGLIDSLIYHSARAIQKKIELG 385
>gi|224077228|ref|XP_002191084.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Taeniopygia guttata]
Length = 383
Score = 315 bits (807), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 160/387 (41%), Positives = 232/387 (59%), Gaps = 26/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK DAF K EDF KT G VT V L + L ++ Y EL+VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTAVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+LD++ P + C YL++DA+D +G+Q L VEHN++K+RLD G + +
Sbjct: 66 DKLKINLDVIFPHMPCAYLSIDAMDVAGDQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+++KV N D ++C SCYGAE+E +CCNTC++V+EAYR + WA D+I
Sbjct: 126 KEEEKVFDPNSL------DADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDSI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
N TH+I+HLSFG +D PLDGT A++ + MF Y++K++PT+Y ++DG
Sbjct: 240 INMTHYIKHLSFG---RDYPGIVNPLDGTAVTAQQASMMFQYFVKVVPTVYRKVDGEVVR 296
Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
K+ GD G+PG+F YELSP+MVK+TEK +S H T + + G +
Sbjct: 297 TNQFSVTQHEKIANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFVTGVCAIVGGIF 356
Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
+D+L++ + I K+E+G T
Sbjct: 357 TVAGFIDSLIYHSARAIQKKIELGKTT 383
>gi|431894341|gb|ELK04141.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pteropus alecto]
Length = 383
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 167/387 (43%), Positives = 232/387 (59%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E T+K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y +LDG L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKLDGEVL 295
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP++VK+TEK +S H T + I G
Sbjct: 296 RTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMVVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|344279905|ref|XP_003411726.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Loxodonta africana]
Length = 386
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 166/387 (42%), Positives = 233/387 (60%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 9 KLKQFDAYPKTLEDFRIKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 69 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 128
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 129 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 181
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 182 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 241
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 242 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 298
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 299 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 358
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 359 FTVAGLIDSLIYHSARAIQKKIDLGKT 385
>gi|301762088|ref|XP_002916455.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Ailuropoda melanoleuca]
Length = 383
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 166/388 (42%), Positives = 233/388 (60%), Gaps = 27/388 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKTV 376
+ L+D+L++ + I K GKT+
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKTM 383
>gi|363741418|ref|XP_003642491.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Gallus gallus]
gi|363741445|ref|XP_003642499.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Gallus gallus]
Length = 383
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 161/387 (41%), Positives = 231/387 (59%), Gaps = 25/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK DAF K EDF KT G VT+V L + L ++ Y EL+VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD G + +
Sbjct: 66 DKLKINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+++KV N D ++C SCYGAE+E +CCNTC++V+EAYR + WA DTI
Sbjct: 126 KEEEKVFDPNSL------DADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
N TH+I+HLSFG +D PLDGT A++ + MF Y++K++PT+Y ++DG
Sbjct: 240 INMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVR 296
Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
K+ GD G+PG+F YELSP+MVK+TEK + H T + + G +
Sbjct: 297 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKTV 376
+D+L++ + I K GKT+
Sbjct: 357 TVAGFIDSLIYHSARAIQKKIELGKTI 383
>gi|348521802|ref|XP_003448415.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Oreochromis niloticus]
Length = 389
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 166/390 (42%), Positives = 233/390 (59%), Gaps = 32/390 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ + + L ++ Y EL+VD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
KL I++DI+ P + C YL++DA+D +GEQ L VEHN++K+RLD + KP+ QE +K +
Sbjct: 66 DKLKINIDIIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+V + DP++C SCYGAETE KCCNTC++V+EAYR + WA DT
Sbjct: 126 KADDGEVFDPSTL------DPDRCESCYGAETEDLKCCNTCDDVREAYRRRGWAFKSADT 179
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH-----DIQ 239
I QCK E T+K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVH D+Q
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 239
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH I+HLSFG +D PLDGT A + + M+ Y++KI+PTIY +
Sbjct: 240 SFGLDNINMTHLIKHLSFG---KDYPGLVNPLDGTDVTAPQASMMYQYFVKIVPTIYMKT 296
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG + GD G+PG+F YELSP+MVK TEK +S H T +
Sbjct: 297 DGEVVKTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCA 356
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + L+D+L++ + I K+E+G
Sbjct: 357 IIGGVFTVAGLIDSLIYHSARVIQKKIELG 386
>gi|410953936|ref|XP_003983624.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Felis catus]
Length = 383
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 166/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|327271493|ref|XP_003220522.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 3 [Anolis carolinensis]
Length = 394
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 166/398 (41%), Positives = 240/398 (60%), Gaps = 37/398 (9%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK DAF K EDF KT G VT++ L + L ++ Y EL+VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD DGK + P+ E
Sbjct: 66 DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVT-PEAERHEL 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K+++ + + DP++C SCYGAE++ KCCNTC++V+EAYR + WA DTI
Sbjct: 125 GKEEETIFDPNSL-----DPDRCESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E ++K++ EGC++YG+LEVN+V+G+FH APG S+ +HVHVH ++ + +
Sbjct: 180 EQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 246 F-----------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
F N TH I+HLSFG +D PLDGTV A++ + MF Y++K++PT
Sbjct: 240 FGLDNVSILGKINMTHIIKHLSFG---RDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPT 296
Query: 295 IYERLDG-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
IY ++DG K+ GD G+PG+F YELSP+MVK+TEK +S H
Sbjct: 297 IYMKVDGEVVRTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFL 356
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
T + I G + L+D+L++ + I K+E+G T
Sbjct: 357 TGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELGKTT 394
>gi|440797665|gb|ELR18746.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 383
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 154/388 (39%), Positives = 236/388 (60%), Gaps = 30/388 (7%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F ++LK DA+ K EDF +TV G AV+I+ L I++L ++ Y ELFVD+
Sbjct: 5 FFKKLKSFDAYPKTLEDFRVRTVSGAAVSIISGLIITWLFFSELSFYLSTDVQPELFVDT 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRG KL I++D+ P + C YL++DA+D SGE L VEHNI+K+RL DG+P+
Sbjct: 65 SRGEKLRINMDVTFPDLPCGYLSVDAMDVSGEHQLDVEHNIFKKRLAADGRPL------- 117
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
++K ++ + + +P +CGSCYG+E E +CCNTC EV+E+YR K WA
Sbjct: 118 --GIEKGELEAAATPSPGQELEPIECGSCYGSEQEPGQCCNTCAEVRESYRKKGWAFAHP 175
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
++I QC E +E L+ EGCQ+YG++ VN+V+G+FH APG S+ +H+HVHD+QP+
Sbjct: 176 ESIEQCAREGFSENLEKQKGEGCQVYGHILVNKVAGNFHFAPGKSFQAHHMHVHDLQPFR 235
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA--SMFNYYIKIIPTIYERLD 300
+++N +H I +SFG + PLDG + GA +M+ Y++KI+PTIYE LD
Sbjct: 236 MSSWNISHRINRISFGKEFPG---VINPLDGVEKTTDPGAGSAMYQYFVKIVPTIYESLD 292
Query: 301 GSKLG---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
G+ + G G+PG+F Y+LSP+MVK TE++KS H T + I
Sbjct: 293 GNVINTNQFSVTEHTRMLPPGDKSGLPGLFVMYDLSPIMVKFTERTKSFAHFLTGVCAII 352
Query: 346 SGTYITFMLVDALLHSCVKKIS-KVEIG 372
G + ++D+L+++ ++ + K+E+G
Sbjct: 353 GGVFTVAGIIDSLIYNSLRTLGKKMELG 380
>gi|284004911|ref|NP_001164802.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Oryctolagus cuniculus]
gi|217038333|gb|ACJ76626.1| serologically defined breast cancer antigen 84 isoform b
(predicted) [Oryctolagus cuniculus]
Length = 383
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 165/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT N + DP++C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFNPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|348564091|ref|XP_003467839.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cavia porcellus]
Length = 383
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 164/387 (42%), Positives = 235/387 (60%), Gaps = 26/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAESEDLKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
L+D+L++ + I K+E+G T
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIELGKTT 383
>gi|13384938|ref|NP_079792.1| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Mus
musculus]
gi|37999778|sp|Q9CQE7.1|ERGI3_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3; AltName: Full=Serologically defined breast
cancer antigen NY-BR-84 homolog
gi|12844094|dbj|BAB26233.1| unnamed protein product [Mus musculus]
gi|12851518|dbj|BAB29073.1| unnamed protein product [Mus musculus]
gi|26341008|dbj|BAC34166.1| unnamed protein product [Mus musculus]
gi|27882157|gb|AAH43720.1| ERGIC and golgi 3 [Mus musculus]
gi|148674217|gb|EDL06164.1| ERGIC and golgi 3, isoform CRA_d [Mus musculus]
Length = 383
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DPN+C SCYGAE+E KCCN+C +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPNSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|395830112|ref|XP_003788179.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Otolemur garnettii]
Length = 383
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT N + DP++C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFNPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|157820783|ref|NP_001100003.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Rattus norvegicus]
gi|149030853|gb|EDL85880.1| ERGIC and golgi 3 (predicted) [Rattus norvegicus]
Length = 383
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DPN+C SCYGAE+E KCCN+C +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|126291179|ref|XP_001371602.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Monodelphis domestica]
Length = 383
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 163/387 (42%), Positives = 228/387 (58%), Gaps = 26/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI+ P + C YL++DA+D +GEQ L VEHN+YK+RLD DG+P+ A
Sbjct: 66 DKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPV------TTEA 119
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + E DP +C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 120 ERHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+IR LSFG +D PLD T A + + MF Y++K++PT+Y ++ G L
Sbjct: 240 INMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 SNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
L+D+L++ + I K+E+G T
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIELGKTT 383
>gi|359322740|ref|XP_864582.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Canis lupus familiaris]
Length = 383
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 164/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ +P++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LNPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + + G
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTSVCAIVGGM 355
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|426241390|ref|XP_004014574.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Ovis aries]
Length = 383
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 164/387 (42%), Positives = 233/387 (60%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|194044515|ref|XP_001929457.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Sus scrofa]
gi|350594868|ref|XP_003483992.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Sus scrofa]
Length = 383
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 164/387 (42%), Positives = 233/387 (60%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEIKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 NINMTHYIQHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 296 RTNQFSVTRHEKVASGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|296199725|ref|XP_002747286.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Callithrix jacchus]
gi|403281165|ref|XP_003932068.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Saimiri boliviensis boliviensis]
Length = 383
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|156389237|ref|XP_001634898.1| predicted protein [Nematostella vectensis]
gi|156221986|gb|EDO42835.1| predicted protein [Nematostella vectensis]
Length = 386
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 163/383 (42%), Positives = 222/383 (57%), Gaps = 26/383 (6%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DA+ K EDF KT G AVT + + L ++ Y ELFVD++R
Sbjct: 10 LRRFDAYPKTLEDFRIKTFGGAAVTFISGFLMFILFVSELNYYLTTEVNPELFVDTTRAQ 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I+++IV P + C YL++DA+D SGEQ + V NI KRR+DLDGK I E NA
Sbjct: 70 KLRINVEIVFPKLPCVYLSIDAMDVSGEQQIDVSSNILKRRVDLDGKIIDE------NAE 123
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K + DPN+C SCYGAET +KCCNTC++V+EAYR K WAL +D +
Sbjct: 124 KGDLGDKSHEAKELLDLDPNRCESCYGAETPDKKCCNTCDDVREAYRRKGWALSNVDDVK 183
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC E +KL+ EGC++ GYLEVN+V+G+FH APG S+ +HVHVHD+QP+ S F
Sbjct: 184 QCMREGWKDKLQEQKNEGCEVTGYLEVNKVAGNFHFAPGKSFQQHHVHVHDLQPFGSTQF 243
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG- 305
N TH+I+HLSFG D + PLD T A E SM+ Y++KI+PT Y +L G L
Sbjct: 244 NLTHNIKHLSFG---HDYPGKTYPLDNTFVPAMEAGSMYQYFVKIVPTTYRKLSGEILHT 300
Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G+ G+PG+F YE SP+MV+ TE +S H T + + G +
Sbjct: 301 HQFSVTKHKRVIRQMSGEHGLPGVFVLYEFSPMMVQYTESRRSFMHFLTGVCAIVGGIFT 360
Query: 351 TFMLVDALL-HSCVKKISKVEIG 372
LVD+++ HS K+++G
Sbjct: 361 VAGLVDSMIYHSSRALQKKIDLG 383
>gi|354477966|ref|XP_003501188.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Cricetulus griseus]
gi|344246673|gb|EGW02777.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Cricetulus griseus]
Length = 383
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 162/386 (41%), Positives = 232/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +V + + DPN+C SCYGAE++ KCCN+C +V+EAYR + WA DTI
Sbjct: 124 LGKVEVAVFDPNSL----DPNRCESCYGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|410262554|gb|JAA19243.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 163/387 (42%), Positives = 234/387 (60%), Gaps = 26/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
L+D+L++ + I K+++G T
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKAT 383
>gi|7706278|ref|NP_057050.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Homo sapiens]
gi|332858219|ref|XP_003316930.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Pan troglodytes]
gi|397523795|ref|XP_003831904.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Pan paniscus]
gi|37999823|sp|Q9Y282.1|ERGI3_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3; AltName: Full=Serologically defined breast
cancer antigen NY-BR-84
gi|4689108|gb|AAD27763.1|AF077030_1 hypothetical 43.2 kDa protein [Homo sapiens]
gi|4929577|gb|AAD34049.1|AF151812_1 CGI-54 protein [Homo sapiens]
gi|7671663|emb|CAB89412.1| ERGIC and golgi 3 [Homo sapiens]
gi|14602515|gb|AAH09765.1| ERGIC and golgi 3 [Homo sapiens]
gi|15559308|gb|AAH14014.1| ERGIC and golgi 3 [Homo sapiens]
gi|119596605|gb|EAW76199.1| ERGIC and golgi 3, isoform CRA_a [Homo sapiens]
gi|124249802|gb|ABM92879.1| endoplasmic reticulum-localized protein ERp43 [Homo sapiens]
gi|312152490|gb|ADQ32757.1| ERGIC and golgi 3 [synthetic construct]
gi|380785591|gb|AFE64671.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|383419067|gb|AFH32747.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|384947602|gb|AFI37406.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|410342895|gb|JAA40394.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 164/386 (42%), Positives = 232/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|109092202|ref|XP_001098982.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Macaca mulatta]
Length = 383
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 164/386 (42%), Positives = 232/386 (60%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLK 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|410926566|ref|XP_003976749.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Takifugu rubripes]
Length = 384
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 165/385 (42%), Positives = 231/385 (60%), Gaps = 27/385 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ + + L ++ Y EL+VD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTWGGATVTIISGVLMLILFVSELQYYLTKEVHPELYVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++IV P + C YL++DA+D +GEQ L VEHN++K+RLD + +P+ E +K +
Sbjct: 66 DKLKININIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ V + +T DP +C SCYGAET+ KCCN+C++V+EAYR + WA DT
Sbjct: 126 G--EDDVPVFDPSTL----DPERCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADT 179
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCK E T+K++ EGCQ+YG LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGVLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 239
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH IRHLSFG QD PLD T A + + M+ Y++KI+PTIY + DG L
Sbjct: 240 NINMTHLIRHLSFG---QDYPGLINPLDDTNITAPQASMMYQYFVKIVPTIYVKTDGEVL 296
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK TEK +S H T + I G
Sbjct: 297 KTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGV 356
Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
+ L+D+L++ + I K+E+G
Sbjct: 357 FTVAGLIDSLIYHSARVIQKKIELG 381
>gi|164448602|ref|NP_001029525.2| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
taurus]
gi|75057944|sp|Q5EAE0.1|ERGI3_BOVIN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|59857621|gb|AAX08645.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|59857623|gb|AAX08646.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|59857741|gb|AAX08705.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|110665562|gb|ABG81427.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 383
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 163/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAE E KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|190402265|gb|ACE77675.1| ERGIC and golgi 3 (predicted) [Sorex araneus]
Length = 388
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 166/392 (42%), Positives = 233/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGVPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
++ K ++ DPN+C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KIEVKVFDPDS-------LDPNRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCQREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|95767625|gb|ABF57320.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 380
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 163/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 3 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 62
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 63 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 122
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAE E KCCN+C +V+EAYR + WA DT
Sbjct: 123 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 175
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 176 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 235
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 236 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 292
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 293 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 352
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 353 FTVAGLIDSLIYHSARAIQKKIDLGKT 379
>gi|410218732|gb|JAA06585.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 163/386 (42%), Positives = 231/386 (59%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++ +RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFNQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|197100234|ref|NP_001126130.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pongo abelii]
gi|75041559|sp|Q5R8G3.1|ERGI3_PONAB RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|55730450|emb|CAH91947.1| hypothetical protein [Pongo abelii]
Length = 383
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 163/386 (42%), Positives = 231/386 (59%), Gaps = 25/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+E YR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVRETYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382
>gi|184185558|gb|ACC68956.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Rhinolophus ferrumequinum]
Length = 388
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 167/392 (42%), Positives = 232/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y +L
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKL 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|344279907|ref|XP_003411727.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Loxodonta africana]
Length = 391
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 166/392 (42%), Positives = 233/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 9 KLKQFDAYPKTLEDFRIKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 69 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 128
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 129 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 181
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 182 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 241
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 242 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 298
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 299 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 358
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 359 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 390
>gi|432101449|gb|ELK29631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Myotis davidii]
Length = 391
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 167/395 (42%), Positives = 232/395 (58%), Gaps = 35/395 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEMKVFDPDS-------LDPHRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY--- 241
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 242 -----TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y
Sbjct: 239 NVCTRCCLQINMTHYIRHLSFG---EDYPGIVNPLDRTNVTALQASMMFQYFVKVVPTVY 295
Query: 297 ERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
+LDG L GD G+PG+F YELSP+MVK+TEK +S H T
Sbjct: 296 MKLDGQVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTG 355
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+ I G + L+D+L++ + I K GKT
Sbjct: 356 VCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 390
>gi|194751543|ref|XP_001958085.1| GF10736 [Drosophila ananassae]
gi|190625367|gb|EDV40891.1| GF10736 [Drosophila ananassae]
Length = 372
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 172/390 (44%), Positives = 236/390 (60%), Gaps = 39/390 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++ +Y Q + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLNYMQPTMNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + + C+Y++LDA+DSSG+ HL V+H+I+K RLDL G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLGCNYVSLDAMDSSGDTHLRVDHDIFKHRLDLKGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K T CGSCYGAE + CCNTC EV +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNSTHCCNTCEEVLDAYRLRKWNV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
++D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
+ +H I HLSFG K++ + PLDG + EE S MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGMHVEVEEKKSEMFNYYLKIVPTLYMR 279
Query: 299 LDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
K L + GMPGIFFSYELSPLMVK EK S GH T
Sbjct: 280 DSDGKPIYTNQFSVTRHRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCCS 339
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ LL++ ++ I K+E+G
Sbjct: 340 IIGGVFTVAGILAVLLNNSLEAIQRKLEVG 369
>gi|363741420|ref|XP_003642492.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Gallus gallus]
gi|363741447|ref|XP_003642500.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Gallus gallus]
Length = 388
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/392 (41%), Positives = 231/392 (58%), Gaps = 30/392 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK DAF K EDF KT G VT+V L + L ++ Y EL+VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD G + +
Sbjct: 66 DKLKINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+++KV N D ++C SCYGAE+E +CCNTC++V+EAYR + WA DTI
Sbjct: 126 KEEEKVFDPNSL------DADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+I+HLSFG +D PLDGT A++ + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVD 296
Query: 301 G-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G K+ GD G+PG+F YELSP+MVK+TEK + H T +
Sbjct: 297 GEVVRTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
+ G + +D+L++ + I K GKT+
Sbjct: 357 VGGIFTVAGFIDSLIYHSARAIQKKIELGKTI 388
>gi|229368723|gb|ACQ63006.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Dasypus novemcinctus]
Length = 388
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 166/392 (42%), Positives = 233/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|301762086|ref|XP_002916454.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Ailuropoda melanoleuca]
Length = 388
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 166/393 (42%), Positives = 233/393 (59%), Gaps = 32/393 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
I G + L+D+L++ + I K GKT+
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKTM 388
>gi|410953938|ref|XP_003983625.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Felis catus]
Length = 388
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 166/392 (42%), Positives = 232/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|281346059|gb|EFB21643.1| hypothetical protein PANDA_004535 [Ailuropoda melanoleuca]
Length = 387
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 166/392 (42%), Positives = 232/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|95767501|gb|ABF57305.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 376
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 162/385 (42%), Positives = 230/385 (59%), Gaps = 27/385 (7%)
Query: 8 KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
K DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG K
Sbjct: 1 KQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDK 60
Query: 68 LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNAV 126
L I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ + V
Sbjct: 61 LKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKV 120
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ K ++ DP++C SCYGAE E KCCN+C +V+EAYR + WA DTI
Sbjct: 121 EVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIE 173
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 174 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 233
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 234 NMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRT 290
Query: 307 ----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 291 NQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFT 350
Query: 351 TFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 351 VAGLIDSLIYHSARAIQKKIDLGKT 375
>gi|194872681|ref|XP_001973062.1| GG13555 [Drosophila erecta]
gi|190654845|gb|EDV52088.1| GG13555 [Drosophila erecta]
Length = 373
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 172/391 (43%), Positives = 237/391 (60%), Gaps = 40/391 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++V +Y Q + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K T CGSCYGAE CCNTC EV +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEEVLDAYRLRKWNV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
+ +H I HLSFG K++ + PLDG V AE + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVEVAETKSEMFNYYLKIVPTLYMR 279
Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
DG L + GMPGIFFSYELSPLMVK EK S GH T
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAEKRSSFGHFATNCC 339
Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ LL++ + + K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370
>gi|395830114|ref|XP_003788180.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Otolemur garnettii]
gi|197215642|gb|ACH53034.1| ERGIC and golgi 3 (predicted) [Otolemur garnettii]
Length = 388
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 164/391 (41%), Positives = 233/391 (59%), Gaps = 30/391 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT N + DP++C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFNPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|126291176|ref|XP_001371575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Monodelphis domestica]
Length = 388
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 163/392 (41%), Positives = 228/392 (58%), Gaps = 31/392 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI+ P + C YL++DA+D +GEQ L VEHN+YK+RLD DG+P+ A
Sbjct: 66 DKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPV------TTEA 119
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + E DP +C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 120 ERHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+IR LSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 240 FGLDNINMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVS 296
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVLRSNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
I G + L+D+L++ + I K+E+G T
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIELGKTT 388
>gi|195327731|ref|XP_002030571.1| GM24497 [Drosophila sechellia]
gi|195590409|ref|XP_002084938.1| GD12569 [Drosophila simulans]
gi|194119514|gb|EDW41557.1| GM24497 [Drosophila sechellia]
gi|194196947|gb|EDX10523.1| GD12569 [Drosophila simulans]
Length = 373
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 172/391 (43%), Positives = 237/391 (60%), Gaps = 40/391 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++V +Y Q + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K T CGSCYGAE CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLRKWTV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
+ +H I HLSFG K++ + PLDG V AE + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279
Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
DG L + GMPGIFFSYELSPLMVK EK S GH T
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCC 339
Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ LL++ + I K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370
>gi|359322742|ref|XP_851879.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Canis lupus familiaris]
Length = 388
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 164/392 (41%), Positives = 232/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ +P++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LNPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTSVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+ G + L+D+L++ + I K GKT
Sbjct: 356 IVGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|426241392|ref|XP_004014575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Ovis aries]
Length = 388
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 164/392 (41%), Positives = 233/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|194044517|ref|XP_001929458.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Sus scrofa]
gi|350594870|ref|XP_003483993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Sus scrofa]
Length = 388
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 164/392 (41%), Positives = 233/392 (59%), Gaps = 32/392 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEIKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIQHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 296 DGEVLRTNQFSVTRHEKVASGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|296199723|ref|XP_002747285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Callithrix jacchus]
gi|403281167|ref|XP_003932069.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Saimiri boliviensis boliviensis]
gi|166831592|gb|ABY90117.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Callithrix jacchus]
Length = 388
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 164/391 (41%), Positives = 233/391 (59%), Gaps = 30/391 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|195495133|ref|XP_002095138.1| GE19855 [Drosophila yakuba]
gi|194181239|gb|EDW94850.1| GE19855 [Drosophila yakuba]
Length = 373
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 171/391 (43%), Positives = 237/391 (60%), Gaps = 40/391 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++V +Y Q + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVINYMQPTLNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K T CGSCYGAE CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLRKWNV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
+ +H I HLSFG K++ + PLDG V AE + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279
Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
DG L + GMPGIFFSYELSPLMVK EK S GH T
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCC 339
Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ LL++ + + K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370
>gi|109092200|ref|XP_001098885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Macaca mulatta]
Length = 388
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 164/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVLKTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|38327615|ref|NP_938408.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform a [Homo sapiens]
gi|281182526|ref|NP_001162565.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Papio anubis]
gi|397523797|ref|XP_003831905.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Pan paniscus]
gi|410055053|ref|XP_003953764.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Pan troglodytes]
gi|57208593|emb|CAI42842.1| ERGIC and golgi 3 [Homo sapiens]
gi|164623746|gb|ABY64672.1| ERGIC and golgi 3, isoform 1 (predicted) [Papio anubis]
gi|380785589|gb|AFE64670.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform a [Macaca mulatta]
Length = 388
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 164/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|410926568|ref|XP_003976750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Takifugu rubripes]
Length = 389
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 165/390 (42%), Positives = 231/390 (59%), Gaps = 32/390 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ + + L ++ Y EL+VD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTWGGATVTIISGVLMLILFVSELQYYLTKEVHPELYVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++IV P + C YL++DA+D +GEQ L VEHN++K+RLD + +P+ E +K +
Sbjct: 66 DKLKININIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ V + +T DP +C SCYGAET+ KCCN+C++V+EAYR + WA DT
Sbjct: 126 G--EDDVPVFDPSTL----DPERCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADT 179
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
I QCK E T+K++ EGCQ+YG LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGVLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 239
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH IRHLSFG QD PLD T A + + M+ Y++KI+PTIY +
Sbjct: 240 SFGLDNINMTHLIRHLSFG---QDYPGLINPLDDTNITAPQASMMYQYFVKIVPTIYVKT 296
Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L GD G+PG+F YELSP+MVK TEK +S H T +
Sbjct: 297 DGEVLKTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCA 356
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + L+D+L++ + I K+E+G
Sbjct: 357 IIGGVFTVAGLIDSLIYHSARVIQKKIELG 386
>gi|22760064|dbj|BAC11054.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 164/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDDINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|354477968|ref|XP_003501189.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Cricetulus griseus]
Length = 388
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 162/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +V + + DPN+C SCYGAE++ KCCN+C +V+EAYR + WA DTI
Sbjct: 124 LGKVEVAVFDPNSL----DPNRCESCYGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356
Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387
>gi|195378906|ref|XP_002048222.1| GJ11466 [Drosophila virilis]
gi|194155380|gb|EDW70564.1| GJ11466 [Drosophila virilis]
Length = 372
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 167/390 (42%), Positives = 241/390 (61%), Gaps = 39/390 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS L+ ++ +Y + +EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLVFLEFLNYMKPMLSEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLEGQPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K +T CGSCYGAE CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNST--------------CGSCYGAEHNATHCCNTCEDVLDAYRVRKWNM 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
++D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
+T+ +H I HLSFG K++ + PLDG + +E S MFNYY+KI+PT+YER
Sbjct: 225 -FTNVKL--SHTINHLSFGEKIE--FAKTHPLDGLRVEVQESKSEMFNYYLKIVPTLYER 279
Query: 299 -LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L + GMPGIFFSYELSPLMVK E+ S GH T
Sbjct: 280 HSDGQPIYTNQFSVTRHRKDLTDRERGMPGIFFSYELSPLMVKYAERHVSFGHFATNCCS 339
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
+ G + ++ LL++ + + K+E+G
Sbjct: 340 IVGGVFTVAGILAVLLNNSWEALQRKLEVG 369
>gi|34849462|gb|AAH57130.1| Ergic3 protein [Mus musculus]
Length = 394
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 163/397 (41%), Positives = 234/397 (58%), Gaps = 36/397 (9%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DPN+C SCYGAE+E KCCN+C +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPNSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVH ++ + +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 246 F-----------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
F N TH+I+HLSFG +D PLD T A + + MF Y++K++PT
Sbjct: 240 FGLDNPSDCLQINMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPT 296
Query: 295 IYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+Y ++DG L GD G+PG+F YELSP+MVK+TEK +S H
Sbjct: 297 VYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFL 356
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
T + I G + L+D+L++ + I K GKT
Sbjct: 357 TGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 393
>gi|75077200|sp|Q4R8X1.1|ERGI3_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|67967936|dbj|BAE00450.1| unnamed protein product [Macaca fascicularis]
Length = 382
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 163/386 (42%), Positives = 231/386 (59%), Gaps = 26/386 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGTPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + DP++C SCYGAE E KCCNTC +V+EAYR ++ A DTI
Sbjct: 124 LGKVEVTVFGPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYR-RRGAFKNPDTI 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 238
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 295
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 296 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 355
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 356 TVAGLIDSLIYHSARAIQKKIDLGKT 381
>gi|195441336|ref|XP_002068468.1| GK20487 [Drosophila willistoni]
gi|194164553|gb|EDW79454.1| GK20487 [Drosophila willistoni]
Length = 372
Score = 303 bits (777), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 169/390 (43%), Positives = 236/390 (60%), Gaps = 39/390 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++ +Y + + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLNYMRPTLNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++R KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDL G+P++E P
Sbjct: 61 DTTRNHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLKGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K +T CGSCYGAE CCNTC +V +AY KKW++
Sbjct: 121 KEIVAVSPANKNST--------------CGSCYGAEHNATHCCNTCEDVLDAYHLKKWSV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
++D + QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 -QVDKLEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
+ +H I HLSFG K++ + PLDG EE S MFNYYIKI+PT+YER
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVNVEESKSEMFNYYIKIVPTLYER 279
Query: 299 -LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L + GMPGIFFSYELSPLMVK E+ S GH T
Sbjct: 280 NSDGQPIYTNQFSVTRYRKDLTDRERGMPGIFFSYELSPLMVKYAERHNSFGHFATNCCS 339
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ LL++ + I K+E+G
Sbjct: 340 IIGGVFTVAGILAVLLNNSWEAIQRKLEVG 369
>gi|21357439|ref|NP_648758.1| CG7011 [Drosophila melanogaster]
gi|7294304|gb|AAF49653.1| CG7011 [Drosophila melanogaster]
gi|16768234|gb|AAL28336.1| GH25868p [Drosophila melanogaster]
gi|220946650|gb|ACL85868.1| CG7011-PA [synthetic construct]
Length = 373
Score = 303 bits (776), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 170/391 (43%), Positives = 236/391 (60%), Gaps = 40/391 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++V +Y Q + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++R KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P
Sbjct: 61 DTTRDHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K T CGSCYGAE CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLRKWTV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
+ +H I HLSFG K++ + PLDG V AE + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279
Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
DG L + GMPGIFFSYELSPLMVK E+ S GH T
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAERHSSFGHFATNCC 339
Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ LL++ + I K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370
>gi|410953940|ref|XP_003983626.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Felis catus]
Length = 399
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 165/403 (40%), Positives = 233/403 (57%), Gaps = 43/403 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVH ++ +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 245 AF----------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
+F N TH+IRHLSFG +D PLD T A + + MF Y+
Sbjct: 239 SFGLDNRSRLRCWYCLQINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYF 295
Query: 289 IKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSK 332
+K++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK +
Sbjct: 296 VKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHR 355
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
S H T + I G + L+D+L++ + I K GKT
Sbjct: 356 SFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 398
>gi|334310895|ref|XP_003339551.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Monodelphis domestica]
Length = 396
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 162/400 (40%), Positives = 229/400 (57%), Gaps = 39/400 (9%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI+ P + C YL++DA+D +GEQ L VEHN+YK+RLD DG+P+ A
Sbjct: 66 DKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPV------TTEA 119
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + E DP +C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 120 ERHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVH ++ + +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239
Query: 246 F-------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
F N TH+IR LSFG +D PLD T A + + MF Y++K++
Sbjct: 240 FGLDNVVLCWYLQINMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKVV 296
Query: 293 PTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
PT+Y ++ G L GD G+PG+F YELSP+MVK+TEK +S H
Sbjct: 297 PTVYMKVSGEVLRSNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTH 356
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
T + I G + L+D+L++ + I K+E+G T
Sbjct: 357 FLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGKTT 396
>gi|125978263|ref|XP_001353164.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
gi|54641917|gb|EAL30666.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
Length = 372
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 171/390 (43%), Positives = 234/390 (60%), Gaps = 39/390 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++ Y Q + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLSYMQPALNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+I+K RLDL G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDIFKHRLDLKGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K T CGSCYGAE CCNTC +V +AYR KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLHKWNV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
++D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
+ +H I HLSFG K++ + PLDG V AE + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279
Query: 299 L-DGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L + GMPGIFFSYELSPLMVK EK S GH T
Sbjct: 280 QSDGQPIYTNQFSVTRYRKDLTDRERGMPGIFFSYELSPLMVKYAEKHNSFGHFATNCCS 339
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ LL++ + I K+++G
Sbjct: 340 IIGGVFTVAGILAVLLNNSWEAIQRKLDVG 369
>gi|57208594|emb|CAI42843.1| ERGIC and golgi 3 [Homo sapiens]
Length = 396
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 164/401 (40%), Positives = 232/401 (57%), Gaps = 40/401 (9%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 4 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 64 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 121
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 122 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 177
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH----------- 234
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVH
Sbjct: 178 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHGCVCRLKMIAR 237
Query: 235 ----VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIK 290
VHD+Q + N TH+I+HLSFG +D PLD T A + + MF Y++K
Sbjct: 238 SLACVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVK 294
Query: 291 IIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSL 334
++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK +S
Sbjct: 295 VVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSF 354
Query: 335 GHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
H T + I G + L+D+L++ + I K GKT
Sbjct: 355 THFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 395
>gi|335304738|ref|XP_003360010.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Sus scrofa]
gi|350594872|ref|XP_003134465.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Sus scrofa]
Length = 398
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 163/402 (40%), Positives = 234/402 (58%), Gaps = 42/402 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEIKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVH ++ +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 245 AF---------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
+F N TH+I+HLSFG +D PLD T A + + MF Y++
Sbjct: 239 SFGLDNVSTGHRCCLQINMTHYIQHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFV 295
Query: 290 KIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKS 333
K++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK +S
Sbjct: 296 KVVPTVYMKVDGEVLRTNQFSVTRHEKVASGLMGDQGLPGVFVLYELSPMMVKLTEKHRS 355
Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
H T + I G + L+D+L++ + I K GKT
Sbjct: 356 FTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 397
>gi|195126511|ref|XP_002007714.1| GI12235 [Drosophila mojavensis]
gi|193919323|gb|EDW18190.1| GI12235 [Drosophila mojavensis]
Length = 372
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 171/390 (43%), Positives = 241/390 (61%), Gaps = 39/390 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++ +Y + + TEELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISSSIISLLIFLECLNYMRPTLTEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDLDG P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLDGNPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K +T CGSCYGAE + CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNST--------------CGSCYGAEHNSTHCCNTCEDVLDAYRIRKWNM 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
++D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
+T+ +H I HLSFG K++ + PLDG EE S MFNYY+KI+PT+YER
Sbjct: 225 -FTNVKL--SHTINHLSFGEKIE--FAKTHPLDGLRVDVEESKSEMFNYYLKIVPTLYER 279
Query: 299 LDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
K L + GMPGIFFSYELSPLMVK E+ S GH T
Sbjct: 280 HSDGKPIYTNQFSVTRHRKDLTDRERGMPGIFFSYELSPLMVKYAERHVSFGHFATNCCS 339
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
I G + ++ +L++ ++ I K+E+G
Sbjct: 340 IIGGVFTVAGILAVVLNNSLEAIQRKLEVG 369
>gi|389612123|dbj|BAM19583.1| ptx1 protein [Papilio xuthus]
Length = 285
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 151/293 (51%), Positives = 187/293 (63%), Gaps = 25/293 (8%)
Query: 99 VEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET 158
++HNI+KRRLDLDG PI+EP+KE + K T T T CGSCYGA
Sbjct: 1 MDHNIHKRRLDLDGNPIEEPKKEEIAISSTVKQNTSELATVT-------CGSCYGAAFND 53
Query: 159 RKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSG 218
+CCNTC +VKEAYR ++WALP+L TIVQCK++ S EK EGCQIYGY+EVNRV G
Sbjct: 54 SQCCNTCEDVKEAYRIRRWALPDLATIVQCKDDESLEKANLALKEGCQIYGYMEVNRVGG 113
Query: 219 SFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
SFHIAPG S++INHVHVHD+QPY+S+AFNTTH I+HLSFG ++ + PLDG A
Sbjct: 114 SFHIAPGKSFTINHVHVHDVQPYSSSAFNTTHXIQHLSFGSDIKSAN--TAPLDGVKGIA 171
Query: 279 EEGASMFNYYIKIIPTIYERLDGSKLG----------------GGDGGMPGIFFSYELSP 322
+EGA MF YYIKI PT+Y +LD + L + GMPG FFSYELSP
Sbjct: 172 QEGAVMFQYYIKIGPTMYVKLDKTVLHTNQFSVTRHQKSVSNINSESGMPGAFFSYELSP 231
Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
LMVK TEK +S+GH T I I G + ++D LL+ + + GK
Sbjct: 232 LMVKYTEKERSIGHFATNICAIIGGVFTVAGILDTLLYHSLNAFHNKIVLGKA 284
>gi|335774962|gb|AEH58414.1| endoplasmic reticulum-golgi intermediat compartment protein 3-like
protein [Equus caballus]
Length = 354
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 210/344 (61%), Gaps = 27/344 (7%)
Query: 49 YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
Y EL+VD SRG KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RL
Sbjct: 20 YLTTEVHPELYVDKSRGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRL 79
Query: 109 DLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE 167
D DG P+ E ++ + V+ K ++ DP++C SCYGAETE KCCNTC +
Sbjct: 80 DKDGIPVSSEAERHELGKVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCED 132
Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
V+EAYR + WA DTI QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S
Sbjct: 133 VREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKS 192
Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
+ +HVHVHD+Q + N TH+IRHLSFG +D PLD T A + + MF Y
Sbjct: 193 FQQSHVHVHDLQSFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQY 249
Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
++K++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK
Sbjct: 250 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKH 309
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+S H T + I G + L+D+L++ + I K GKT
Sbjct: 310 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 353
>gi|302790744|ref|XP_002977139.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
gi|302820940|ref|XP_002992135.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
gi|300140061|gb|EFJ06790.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
gi|300155115|gb|EFJ21748.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
Length = 386
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 158/390 (40%), Positives = 223/390 (57%), Gaps = 25/390 (6%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M ++L+ LDA+ K EDFH +T+ GG +T+V +F++ L ++ + TT EL V
Sbjct: 1 MQMLKKLQQLDAYPKINEDFHSRTLSGGVITVVSSIFMAILFITELKLFLLPGTTSELLV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG L I+ DI P ++C ++LDA+D SGEQHL V+HNI+K+RLD GK +Q P +
Sbjct: 61 DTSRGETLQINFDITFPALACSVISLDAMDVSGEQHLDVKHNIFKKRLDPSGKVVQPPVQ 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
E + K K ++G E CGSC+GAE +CCN+C EV+EAYR + WA+
Sbjct: 121 EDIGGPKIDKPLQKHGGRLEHNE--TYCGSCFGAEQSDDECCNSCEEVREAYRKRGWAIH 178
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
D I QCK E K+K EGC IYG LEVN+V+G+FH APG S+S HVHVHD+Q
Sbjct: 179 NADLIDQCKREGWLTKIKEEEGEGCNIYGSLEVNKVAGNFHFAPGKSFSQQHVHVHDVQS 238
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
FN +H+I LSFG + PLD + ++M+ Y+IK++PT Y +
Sbjct: 239 LHKEKFNVSHYINELSFGARFPG---VVNPLDKEKRIQKFPSAMYQYFIKVVPTAYTDMT 295
Query: 301 GSKL---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
G K+ G +PG+FF YELSP+ V TE+ S H T + I
Sbjct: 296 GHKIVTNQFSVTDHFKAVEGLNGRSLPGVFFFYELSPIKVLFTERKTSFLHFLTNVCAII 355
Query: 346 SGTYITFMLVDALL---HSCVKKISKVEIG 372
G + ++D+ + H +KK K+EIG
Sbjct: 356 GGVFTVSGIIDSFIYHGHRAIKK--KMEIG 383
>gi|195021391|ref|XP_001985385.1| GH17030 [Drosophila grimshawi]
gi|193898867|gb|EDV97733.1| GH17030 [Drosophila grimshawi]
Length = 372
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 170/390 (43%), Positives = 241/390 (61%), Gaps = 39/390 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS L+ ++ +Y + + TEELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISSSIISLLVLLEFLNYMKPTMTEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+++K RLDL G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLQGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K +T CGSCYGAE + CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNST--------------CGSCYGAEHNSTHCCNTCEDVLDAYRIRKWNM 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
++D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
+T+ +H I HLSFG K++ + PLDG EE S MFNYY+KI+PT+YER
Sbjct: 225 -FTNVKL--SHTINHLSFGEKIE--FAKTHPLDGIRVDVEESKSEMFNYYLKIVPTLYER 279
Query: 299 -LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
DG L + GMPGIFFSYELSPLMVK E+ S GH T
Sbjct: 280 HSDGEPIYTNQFSVTRHRKDLTDRERGMPGIFFSYELSPLMVKYAERHNSFGHFATNCCS 339
Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
+ G + ++ LL++ + I K+E+G
Sbjct: 340 IVGGVFTVAGILAVLLNNSWEAIQRKLEVG 369
>gi|351702542|gb|EHB05461.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Heterocephalus glaber]
Length = 378
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 161/386 (41%), Positives = 228/386 (59%), Gaps = 30/386 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + E DP++C SCYGAE+E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFD----PESLDPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVH +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH-----GWCCLQ 234
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 235 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 291
Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
GD G+PG+F YELSP+MVK+TEK +S H T + I G +
Sbjct: 292 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 351
Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
L+D+L++ + I K GKT
Sbjct: 352 TVAGLIDSLIYHSARAIQKKIDLGKT 377
>gi|326434226|gb|EGD79796.1| intermediate compartment protein 3 [Salpingoeca sp. ATCC 50818]
Length = 396
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 155/389 (39%), Positives = 226/389 (58%), Gaps = 23/389 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF KT G A++IV L I L ++ Y ELFVD+SR
Sbjct: 9 KLRNLDAYPKTLEDFRVKTFSGAAISIVAILLIVVLFTSELVYYLSTEVEPELFVDTSRD 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
K+ I++D+ ++C +L LD +D SGE L VEH+I+K+RL G PI E +EV +
Sbjct: 69 EKMRINVDVTFHKMACAFLHLDIMDVSGENELDVEHDIFKQRLTETGTPIYEEPEEVDDL 128
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ E DPN+C SCYGAE+E KCCNTC V+EAYR K WAL ++ I
Sbjct: 129 GDESDSAVGALKMMKEGLDPNRCESCYGAESEQNKCCNTCEAVREAYRRKGWALTDIQGI 188
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E TEKLK EGC+IYG+LEVN+V+G+FHIAPG S+ + +H HD+ + A
Sbjct: 189 EQCEREGWTEKLKAQAKEGCRIYGHLEVNKVAGNFHIAPGKSFQQHSIHFHDLNSFGREA 248
Query: 246 ---FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE-GASMFNYYIKIIPTIYERLDG 301
FN +H I HLSFGI+ PLDG A++ GA+M+ YY+KI+PT Y + G
Sbjct: 249 LGKFNMSHTINHLSFGIEYPG---VVNPLDGHSETADKLGATMYQYYVKIVPTRYRKARG 305
Query: 302 SKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
+L G G+PG+F +E+SP++V+++E++ S H T ++ I
Sbjct: 306 QELNTNQYSVTMHQRHIDHKAGQTGLPGMFVMFEISPILVQLSERTHSFFHFLTGVLAII 365
Query: 346 SGTYITFMLVDALLHSCVKKISKVEIGGK 374
G + ++D+ ++ ++ + K + GK
Sbjct: 366 GGIFSVAGMIDSFVYHGLRSLKKKQELGK 394
>gi|291231388|ref|XP_002735646.1| PREDICTED: serologically defined breast cancer antigen 84-like,
partial [Saccoglossus kowalevskii]
Length = 358
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/366 (40%), Positives = 216/366 (59%), Gaps = 29/366 (7%)
Query: 31 TIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVD 90
TI+ + + L ++ Y T EL+VD++RG K+ I+LDI PT+ C YL++DA+D
Sbjct: 1 TIISGILMFILFISELNYYLTKEVTPELYVDTTRGEKMRINLDITFPTLPCGYLSIDAMD 60
Query: 91 SSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGS 150
+GEQ L V+HNI K R+D +GKP+ P+KE + + E DP++C S
Sbjct: 61 VAGEQQLDVDHNIMKSRIDKNGKPVATPEKEDIG-----DKSEEAKDFDVNKLDPDRCES 115
Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
CYGAE++ KCCNTC +V+EAYR K WA D I QC E ++KLK+ EGCQ+YG+
Sbjct: 116 CYGAESKDLKCCNTCEDVREAYRRKGWAFNNADGIAQCSREGWSDKLKSQSGEGCQVYGH 175
Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKP 270
LEVN+V+G+FH APG S+ +HVHVHD+Q ++ FN +H I HLSFG K P
Sbjct: 176 LEVNKVAGNFHFAPGKSFQQHHVHVHDLQAFSGEKFNLSHRINHLSFGHKYPG---MENP 232
Query: 271 LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL--------------------GGGDGG 310
LD + +++ + M+ Y++KI+PT Y +L+G+ G+ G
Sbjct: 233 LDNSKVTSQKASIMYQYFVKIVPTTYTKLNGATTRSNQYSVTKHEKVVSTSLASAAGEHG 292
Query: 311 MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKV 369
+PG+F YE +PLMVK TEK +S H T + I G + L+D++++ K I K+
Sbjct: 293 LPGVFILYEFAPLMVKYTEKHRSFMHFMTGVCAIIGGVFTVAGLIDSMIYHSSKAIKKKI 352
Query: 370 EIGGKT 375
++G T
Sbjct: 353 DLGKAT 358
>gi|168004517|ref|XP_001754958.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694062|gb|EDQ80412.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 158/390 (40%), Positives = 225/390 (57%), Gaps = 26/390 (6%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +LK LDA K EDF+ +T+ GG +T+V +F+ L + Y T +L V
Sbjct: 1 MAIFNKLKQLDAHPKISEDFYSRTLSGGVITLVSSIFMFLLFVTEFRIYLSAQTQNQLVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG L I+LDI P ++C ++LDA+D SGE HL V HNIYK+RLD+ GK + P+
Sbjct: 61 DTSRGETLQINLDITFPALACSVVSLDAMDISGELHLDVRHNIYKKRLDVHGKAVDAPKP 120
Query: 121 EVVNAVKKKKVTTENGTTTTELED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ +NA K +K ++G LED CGSC+GAE+ +CCN+C EV+EAYR K WAL
Sbjct: 121 DAINAPKVQKPLQKHGG---RLEDHETYCGSCFGAESSDDQCCNSCEEVREAYRKKGWAL 177
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
D I QC E E++K EGC IYG LEVN+V+G+F IAPG S+ + +H+ D+
Sbjct: 178 TNTDLIDQCHREGFIERIKEEAGEGCNIYGKLEVNKVAGNFQIAPGKSFQQSAMHLLDLM 237
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ + +FN +H I LSFG PLD + ++ MF Y+IK++PT+Y +
Sbjct: 238 GFVTDSFNVSHTINELSFGAYFPG---AVNPLDKVTSIQKDQNGMFQYFIKVVPTVYTDI 294
Query: 300 DGSKLG-----------GGDGG---MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
G K+ GD G +PG+FF Y+L+P+ VK TE+ S H T + I
Sbjct: 295 KGRKISTNQFSVMEHYTAGDHGPRVIPGVFFFYDLTPIKVKFTEERPSFLHFLTNVCAII 354
Query: 346 SGTYITFMLVDALL---HSCVKKISKVEIG 372
G Y +VD+ + H +KK K+E+G
Sbjct: 355 GGIYTIAGIVDSFIYHGHRAIKK--KMELG 382
>gi|426391505|ref|XP_004062113.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Gorilla gorilla gorilla]
gi|7959731|gb|AAF71038.1|AF116721_14 PRO0989 [Homo sapiens]
Length = 346
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 210/343 (61%), Gaps = 25/343 (7%)
Query: 49 YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
Y EL+VD SRG KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RL
Sbjct: 12 YLTTEVHPELYVDKSRGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRL 71
Query: 109 DLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV 168
D DG P+ + + + K +VT + + DP++C SCYGAE E KCCNTC +V
Sbjct: 72 DKDGIPVSSEAER--HELGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDV 125
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
+EAYR + WA DTI QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+
Sbjct: 126 REAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSF 185
Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
+HVHVHD+Q + N TH+I+HLSFG +D PLD T A + + MF Y+
Sbjct: 186 QQSHVHVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYF 242
Query: 289 IKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSK 332
+K++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK +
Sbjct: 243 VKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHR 302
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
S H T + I G + L+D+L++ + I K GKT
Sbjct: 303 SFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 345
>gi|168024878|ref|XP_001764962.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683771|gb|EDQ70178.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 153/389 (39%), Positives = 225/389 (57%), Gaps = 24/389 (6%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +LK LDA+ K EDF+ +T+ GG +T+V +F+ L ++ Y T +L V
Sbjct: 1 MAVFNKLKQLDAYPKISEDFYSRTLSGGVITLVSTVFMFVLFVTEISLYLSAQTQNQLVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG L I+LDI P ++C ++LDA+D SGEQHL+V HNI+K+RLD+ GK + P+
Sbjct: 61 DTSRGETLQINLDITFPALACSMVSLDAMDISGEQHLNVRHNIFKKRLDVHGKVVNAPKP 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ +NA K +K ++G E CGSC+GAE+ +CCN C EV+EAYR K WAL
Sbjct: 121 DAINAPKVQKPLQKHGGRLEHNE--TYCGSCFGAESSDDECCNNCEEVREAYRKKGWALT 178
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
D I QC E E++K EGC IYG LEVN+V+G+FH APG S+ + +H+ D+
Sbjct: 179 NADLIDQCHREGFIERVKEEAGEGCNIYGKLEVNKVAGNFHFAPGKSFQQSAMHLLDLMG 238
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ + +FN +H I LSFG PLD ++ M+ Y+IK++PT+Y +
Sbjct: 239 FITDSFNVSHTINELSFGAHFPG---AVNPLDKVTNIQKDLNGMYQYFIKVVPTVYTDIK 295
Query: 301 GSKLG-----------GGDGG---MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
G K+ GD G +PG+FF Y+LSP+ VK +E+ S H T + +
Sbjct: 296 GRKISTNQFSVTEHYTAGDHGPRFVPGVFFFYDLSPIKVKFSEERPSFLHFLTNVCAIVG 355
Query: 347 GTYITFMLVDALL---HSCVKKISKVEIG 372
G Y ++D+ + H +KK K+E+G
Sbjct: 356 GVYSIAGIIDSFVYHGHRAIKK--KMELG 382
>gi|355563183|gb|EHH19745.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
mulatta]
gi|355784539|gb|EHH65390.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
fascicularis]
Length = 401
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 162/404 (40%), Positives = 230/404 (56%), Gaps = 43/404 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH------------- 232
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +H
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHGTYLTGCVCRLKM 239
Query: 233 -----VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
VHD+Q + N TH+I+HLSFG +D PLD T A + + MF Y
Sbjct: 240 IARSLACVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQY 296
Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
++K++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK
Sbjct: 297 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKH 356
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+S H T + I G + L+D+L++ + I K GKT
Sbjct: 357 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 400
>gi|61555014|gb|AAX46646.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
Length = 346
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 210/344 (61%), Gaps = 27/344 (7%)
Query: 49 YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
Y EL+VD SRG KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RL
Sbjct: 12 YLTTEVHPELYVDKSRGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRL 71
Query: 109 DLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE 167
D DG P+ E ++ + V+ K ++ DP++C SCYGAE E KCCN+C +
Sbjct: 72 DKDGFPVSSEAERHELGKVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCED 124
Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
V+EAYR + WA DTI QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S
Sbjct: 125 VREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKS 184
Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
+ +HVHVHD+Q + N TH+IRHLSFG +D PLD T A + + MF Y
Sbjct: 185 FQQSHVHVHDLQSFGLDNINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQY 241
Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
++K++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK
Sbjct: 242 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKH 301
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+S H T + I G + L+D+L++ + I K GKT
Sbjct: 302 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 345
>gi|440902508|gb|ELR53293.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
grunniens mutus]
Length = 395
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 160/399 (40%), Positives = 229/399 (57%), Gaps = 39/399 (9%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAE E KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH---------- 234
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVH
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHGCREEVRVTG 238
Query: 235 --VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
+ Q + N TH+IRHLSFG +D PLD T A + + MF Y++K++
Sbjct: 239 ARCSEAQGWCCLQINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVV 295
Query: 293 PTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK +S H
Sbjct: 296 PTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTH 355
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
T + I G + L+D+L++ + I K GKT
Sbjct: 356 FLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 394
>gi|449265747|gb|EMC76893.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3,
partial [Columba livia]
Length = 330
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 209/341 (61%), Gaps = 35/341 (10%)
Query: 57 ELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
EL+VD SRG KL I+LD++ P + C YL++DA+D +GEQ L VEHN++K+RLD G
Sbjct: 4 ELYVDKSRGDKLKINLDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAG---- 59
Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPN-----KCGSCYGAETETRKCCNTCNEVKEA 171
N V + E G ++ DPN +C SCYGAE+E +CCNTC++V+EA
Sbjct: 60 -------NRVTPEAERHELGKEEEKVFDPNSLDADRCESCYGAESEDIRCCNTCDDVREA 112
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
YR + WA DTI QCK E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +
Sbjct: 113 YRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQS 172
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
HVHVHD+Q + N TH+I+HLSFG +D PLDGT A++ + MF Y++K+
Sbjct: 173 HVHVHDLQSFGLDNINMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKV 229
Query: 292 IPTIYERLDG-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLG 335
+PT+Y ++DG K+ GD G+PG+F YELSP+MVK+TEK +S
Sbjct: 230 VPTVYMKVDGEVVRTNQFSVTRHEKIANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFT 289
Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
H T + + G + +D+L++ + I K GKT+
Sbjct: 290 HFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELGKTI 330
>gi|168019656|ref|XP_001762360.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686438|gb|EDQ72827.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 380
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 158/389 (40%), Positives = 228/389 (58%), Gaps = 29/389 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F +LK LDA+ K EDF+ +T+ GG +T+V +F++ L + Y T +L V
Sbjct: 1 MSFFNKLKHLDAYPKISEDFYSRTLSGGLITLVSSVFMTLLFITEFRIYLSAQTQNQLVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG L I+LDI ++C ++LDA+D SGEQHL+V HNI+K+RLD+ GK I P+
Sbjct: 61 DTSRGETLQINLDITFSALACSVVSLDAMDISGEQHLNVRHNIFKKRLDVHGKAIDAPKP 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ +NA K ++ ++G E CGSC+GA + +CCN+C EV+EAYR K WAL
Sbjct: 121 DAINAPKVQRPLQKHGGRLEHNE--TYCGSCFGAASSDDECCNSCEEVREAYRKKGWALI 178
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+D I QC E E++K EGC IYG LEVN+V+G+FHIAPG + + +H+ D+
Sbjct: 179 NIDIIDQCHREGFIERVKEEAGEGCNIYGKLEVNKVAGNFHIAPGKLFQQSAMHLLDLLG 238
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
S +FN +H + LSFG R PLD + ++ M+ Y+IK++PT+Y +
Sbjct: 239 IRSDSFNVSHIVNELSFGAHFPG---RVNPLDKITSIQKDQNGMYQYFIKVVPTVYTDIR 295
Query: 301 GSKLG-----------GGDGG---MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
GS++ GD G +PG+FF Y+LSP+ VK TEK S H T + C I
Sbjct: 296 GSEIATNQFSVTEHYTAGDHGPRVVPGVFFFYDLSPIKVKFTEKRPSFLHFLTTV-CAIV 354
Query: 347 GTYITFMLVDALL---HSCVKKISKVEIG 372
G I +D+ + H VKK K+E+G
Sbjct: 355 GASI----IDSFIYHGHRAVKK--KMELG 377
>gi|196008679|ref|XP_002114205.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
gi|190583224|gb|EDV23295.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
Length = 369
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 149/383 (38%), Positives = 222/383 (57%), Gaps = 43/383 (11%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K EDF +T G +TIV + + L ++ Y V T ELFVD+SRG
Sbjct: 10 LRRYDAFPKTLEDFRIRTFGGATITIVSAVIMLLLFVSEMNYYLSVEVTSELFVDTSRGE 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ I++++ P ++C L++D +D +G Q L ++ N+ KRR+D +GKP +AV
Sbjct: 70 KIKIYMNVTFPKMACAILSVDTMDVAGMQQLDIKQNLMKRRIDENGKPTG-------DAV 122
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K K KCGSCYGAE KCCN+C +V+EAYR K WAL + I
Sbjct: 123 QKNK---------------TKCGSCYGAENAEMKCCNSCEDVREAYRKKGWALTSPEGIE 167
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNR-VSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E + LK EGC ++GYLEVN+ V+G+FH APG S+ + VHVHD+Q + S
Sbjct: 168 QCQEEGWAQMLKEQEKEGCNVFGYLEVNKVVAGNFHFAPGKSFQQHRVHVHDLQSFGSRK 227
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS--- 302
FNT+H I LSFG ++ PLDG +++ ++M+ Y+IK++PT+Y++L G
Sbjct: 228 FNTSHTIHKLSFG---EEFPGIINPLDGHRMSSDQDSAMYQYFIKVVPTVYKKLKGEEVK 284
Query: 303 -------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
KL G+ G+PG+F SYELSP++++ E+ KS H T + I G +
Sbjct: 285 SNQYSVTKHLKYIKLSMGEQGLPGVFISYELSPMIIRYAERRKSFAHFLTGVCAIIGGVF 344
Query: 350 ITFMLVDALLHSCVKKISKVEIG 372
L+DA+++ K + K+E+G
Sbjct: 345 TVASLIDAMVYHSAKML-KIELG 366
>gi|168014180|ref|XP_001759631.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689170|gb|EDQ75543.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 382
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 147/379 (38%), Positives = 216/379 (56%), Gaps = 20/379 (5%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
++LK LDA+ K EDF+ +T+ GG +TI+ F+ L ++ Y +L VD+ R
Sbjct: 3 QKLKSLDAYPKINEDFYSRTLSGGIITIISATFMVLLFFSELKLYLAAQVANDLVVDTER 62
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G + I+LD+ P ++C ++LDA+D SGE HL V+HNI+K+RLD++GK I+ ++E +N
Sbjct: 63 GGTIQINLDVTFPALACSVVSLDAMDISGEAHLDVKHNIFKKRLDVNGKVIEPARQESIN 122
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K K ++G E CGSC+GAETE CCN C EV+EAYR K WAL D
Sbjct: 123 QPKLDKPLQKHGGRLEHNE--TYCGSCFGAETEEDHCCNNCEEVREAYRKKGWALNNPDL 180
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCK E +K+K+ EGC +YG LE N+V+G+FH APG S+ ++HVHD+ +
Sbjct: 181 IDQCKREGFLQKIKDEDGEGCNVYGTLEANKVAGNFHFAPGKSFQQANMHVHDLMAFGKD 240
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
+FN +H I +SFG++ PLD M+ Y+IK++PT+Y G K+
Sbjct: 241 SFNVSHKINEISFGVRYPG---AVNPLDKLERIQTTTHGMYQYFIKVVPTVYTDTRGRKI 297
Query: 305 G---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
G D +PG+FF Y+LSP+ VK TEK S H T + + G +
Sbjct: 298 STNQFAVTDHFKGVGPGEDHALPGVFFFYDLSPIKVKFTEKRMSFFHFLTNVCAIVGGVF 357
Query: 350 ITFMLVDALLHSCVKKISK 368
++DA ++ K+I K
Sbjct: 358 SVSGIIDAFVYHGQKQIKK 376
>gi|330790779|ref|XP_003283473.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
gi|325086583|gb|EGC39970.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
Length = 383
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 156/389 (40%), Positives = 215/389 (55%), Gaps = 35/389 (8%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ +LK DA+ K +DF KT G V+IV +FI +L V YF ELFV
Sbjct: 1 MLMVSQLKKFDAYPKTVDDFRVKTFTGAIVSIVGGIFILWLFFSQVTLYFSTDIHHELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI--QEP 118
D++RG KL I++DI + C YL+LDA+D SGE V HNI+K+RL G+PI Q P
Sbjct: 61 DTTRGEKLKINMDITFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKKRLSSTGQPIIEQPP 120
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAYRYKK 176
+E + KK V EN D CGSCYGAE R CCNTC EV+ AY K
Sbjct: 121 IRE--EEINKKIVKNEN--------DVQGCGSCYGAEDPARGIPCCNTCEEVRNAYSKKG 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W L + T+ QC E T+ + EGCQ+YG++ VN+V+G+FH APG S+ +H+HVH
Sbjct: 171 WGL-DPSTVSQCLREGFTKNIVEQNGEGCQVYGFILVNKVAGNFHFAPGKSFQQHHMHVH 229
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
D+QP+ FN +H I L+ G + + PLD G MF Y+IKI+PTIY
Sbjct: 230 DLQPFKDGQFNMSHTINKLAVGNEFPG---IKNPLDEVTKTEVAGVGMFQYFIKIVPTIY 286
Query: 297 ERLDGSKL-----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
E L+G+++ G G+PG+FF Y+LSP+M+K++EK KS T
Sbjct: 287 EGLNGNRIATNQYSVTEHYRLLAKKGEEPTGLPGLFFMYDLSPIMMKVSEKGKSFASFLT 346
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISK 368
+ I G + F + D+ ++ K + K
Sbjct: 347 NVCAIIGGVFTVFGIFDSFIYYSTKNLKK 375
>gi|395510083|ref|XP_003759313.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3, partial [Sarcophilus harrisii]
Length = 335
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 31/341 (9%)
Query: 57 ELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
EL+VD SRG KL I++DI P + C YL++DA+D +GEQ L VEHN+YK+RLD DG P+
Sbjct: 4 ELYVDKSRGDKLKINIDIFFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGHPVT 63
Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKK 176
+ +++KV + DP +C SCYGAE+E KCCNTC +V+EAYR +
Sbjct: 64 TEAERHELGKEEEKVFDPSSL------DPERCESCYGAESEDSKCCNTCEDVREAYRRRG 117
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV- 235
WA DTI QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV
Sbjct: 118 WAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVH 177
Query: 236 ----HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
HD+Q + N TH+IR LSFG +D PLD T A + + MF Y++K+
Sbjct: 178 AVEIHDLQSFGLDNINMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKV 234
Query: 292 IPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLG 335
+PT+Y +++G L GD G+PG+F YELSP+MVK+TEK +S
Sbjct: 235 VPTVYMKVNGEVLRSNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFT 294
Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
H T + I G + L+D+L++ + I K+E+G T
Sbjct: 295 HFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGKTT 335
>gi|355686517|gb|AER98082.1| ERGIC and golgi 3 [Mustela putorius furo]
Length = 304
Score = 284 bits (727), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 140/310 (45%), Positives = 196/310 (63%), Gaps = 27/310 (8%)
Query: 57 ELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
EL+VD SRG KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+
Sbjct: 4 ELYVDKSRGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVS 63
Query: 117 -EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
E ++ + V+ K ++ DP++C SCYGAETE KCCNTC +V+EAYR +
Sbjct: 64 SEAERHELGKVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRR 116
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA DTI QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV
Sbjct: 117 GWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHV 176
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
HD+Q + N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+
Sbjct: 177 HDLQSFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTV 233
Query: 296 YERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
Y ++DG L GD G+PG+F YELSP+MVK+TEK +S H T
Sbjct: 234 YMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLT 293
Query: 340 KIMCNISGTY 349
+ I G +
Sbjct: 294 GVCAIIGGMF 303
>gi|66801671|ref|XP_629760.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium discoideum AX4]
gi|74851212|sp|Q54DW2.1|ERGI3_DICDI RecName: Full=Probable endoplasmic reticulum-Golgi intermediate
compartment protein 3
gi|60463164|gb|EAL61357.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium discoideum AX4]
Length = 383
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 153/389 (39%), Positives = 215/389 (55%), Gaps = 30/389 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K +DF KT G V+I+ +FI +L V YF ELFVD++RG
Sbjct: 5 QLKKFDAYPKTVDDFRVKTYTGAIVSIIGGVFILWLFFSQVTLYFSTDIHHELFVDTTRG 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI + C YL+LDA+D SGE V HNI+K+RL G+PI E
Sbjct: 65 EKLKINMDITFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKKRLSPTGQPIIEAPPIREEE 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAYRYKKWALPELD 183
+ KK+ +N D CGSCYGAE ++ CCNTC EV+ AY K W L +
Sbjct: 125 INKKESVKDN-------NDVVGCGSCYGAEDPSKGIGCCNTCEEVRVAYSKKGWGL-DPS 176
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
I QC E T+ L EGCQ+YG++ VN+V+G+FH APG S+ +H+HVHD+QP+
Sbjct: 177 GIPQCIREGFTKNLVEQNGEGCQVYGFILVNKVAGNFHFAPGKSFQQHHMHVHDLQPFKD 236
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
+FN +H I LSFG D + PLD G MF Y++K++PTIYE L+G++
Sbjct: 237 GSFNVSHTINRLSFG---NDFPGIKNPLDDVTKTEMVGVGMFQYFVKVVPTIYEGLNGNR 293
Query: 304 L-----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
+ G G+PG+FF Y+LSP+M+K++E+ KS T + I
Sbjct: 294 IATNQYSVTEHYRLLAKKGEEPSGLPGLFFMYDLSPIMMKVSERGKSFASFLTNVCAIIG 353
Query: 347 GTYITFMLVDALLHSCVKKISKVEIGGKT 375
G + F + D+ ++ K + K GKT
Sbjct: 354 GVFTVFGIFDSFIYYSTKNLQKKIDLGKT 382
>gi|302834369|ref|XP_002948747.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
nagariensis]
gi|300265938|gb|EFJ50127.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
nagariensis]
Length = 392
Score = 280 bits (716), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 153/389 (39%), Positives = 216/389 (55%), Gaps = 26/389 (6%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F +LK LDA+ K EDF KT+ GG +TIV + + L ++ Y + EL VD
Sbjct: 8 FLSKLKALDAYPKINEDFFTKTMSGGIITIVASVVMVLLFLSELRLYMTTQSVHELSVDV 67
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
RG K+ IH D+ P + C +L+LDA+D SGE HL ++H++YK+RL +G P++E +K
Sbjct: 68 GRGEKIQIHFDLTFPKVPCSWLSLDAMDISGELHLDLDHDVYKQRLSANGSPVKEVEKHN 127
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
V A KK V NGT + CGSCYGAE CCNTC+EV+ AYR K WAL +
Sbjct: 128 VEATKK--VVPVNGTENSTATP--VCGSCYGAEDRQGDCCNTCDEVRAAYRRKGWALANV 183
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D I QC ++ TE +K EGC ++G LEVN+V+G+FH APG SY +HVHDI P+
Sbjct: 184 DHIEQCAHDLYTESIKEQTGEGCHMWGMLEVNKVAGNFHFAPGRSYQQGSMHVHDIAPFG 243
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA--KAEEGASMFNYYIKIIPTIYERLD 300
A + H + LSFG + PLD A K+ M+ Y++K++PT Y +D
Sbjct: 244 DAVIDFRHTVNKLSFGAPYPG---MKNPLDNAKAGYKSAAATGMYQYFLKVVPTSYTGID 300
Query: 301 GSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
L GG +PG+FF Y+LSP+ V+I E S S T +
Sbjct: 301 NKTLATNQFSVTENFRESSQGGAGKTLPGVFFFYDLSPIKVRIVEHSSSFLSFLTSVCAI 360
Query: 345 ISGTYITFMLVDALLHSCVKKI-SKVEIG 372
+ G + +VDA +++ + I K+E+G
Sbjct: 361 VGGVFTVSGIVDAFIYTSTRLIRKKMELG 389
>gi|340373749|ref|XP_003385402.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Amphimedon queenslandica]
Length = 386
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 154/388 (39%), Positives = 221/388 (56%), Gaps = 32/388 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK LDA++K EDF KT G +T+V + I L ++ + +EL+VD+SRG
Sbjct: 7 RLKNLDAYSKTLEDFKIKTFSGATITLVSSIIILLLFLSELLYFLSTDVKQELYVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI+ C YL++D +D SGE L VEH +YK+RL LDG EV+N
Sbjct: 67 EKLQINVDIIFHRAPCLYLSIDVMDVSGEHQLDVEHTMYKQRLTLDG--------EVINE 118
Query: 126 VKKKKVTTENGTTTTELEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K V + T + NK CGSCYGAET CCNTC +V+EAYR K WA + +
Sbjct: 119 SPTKSVLARDETQDGKAGAANKTCGSCYGAETPELSCCNTCEQVREAYRKKGWAFSDPSS 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E T ++K EGC++YG ++V++V+G+FH APG S+ + VHVHD+QP+
Sbjct: 179 IEQCEKEGWTTQIKEQMNEGCRVYGLIDVSKVAGNFHFAPGKSFQQHSVHVHDLQPFGVK 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIYERLDG 301
FN +H + LSFG Q+ PLDG A + G M+ Y+IK++PT+Y RL+
Sbjct: 239 HFNMSHTVLKLSFG---QEYPGIINPLDGHKAFDVETTHGGIMYQYFIKVVPTLYRRLNN 295
Query: 302 SKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
+G G+ G+PG+FF Y++SP++V +TE SL H T + +
Sbjct: 296 ETMGTNQFAVTKHQRPVRSASGEHGLPGVFFIYDISPILVYLTEYRHSLTHFLTSVCAIV 355
Query: 346 SGTYITFMLVDALL-HSCVKKISKVEIG 372
G + ++D LL HS K+E+G
Sbjct: 356 GGVFTVAGMIDKLLYHSGRVLKKKMELG 383
>gi|320167013|gb|EFW43912.1| Ergic3 protein [Capsaspora owczarzaki ATCC 30864]
Length = 392
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 147/392 (37%), Positives = 224/392 (57%), Gaps = 30/392 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLK LDA+ K ED KT G V+IVC L ++ L ++ + T EL VD++R
Sbjct: 9 RLKQLDAYAKTTEDVRIKTYGGAIVSIVCALIMAALFVSELNYFLTTETHHELLVDTTRA 68
Query: 66 --SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
KL I++++ P + C Y+++D +D +GE L V H + K RL G+ ++EP V
Sbjct: 69 GEQKLRININVTFPRLPCAYMSIDVMDVAGEHQLDVLHTLVKTRLSASGEVVREPTP--V 126
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
A+ ++ + + +L D +KCG CYGA+TE R CCN+C EV+ AYR K W + + D
Sbjct: 127 EALGQQPPS--DAAERRDL-DNSKCGDCYGAQTEKRPCCNSCEEVQAAYREKGWGMMDPD 183
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
+I QC+ E +E++++ EGC++ G++ VN+V+G+FH APG S HVHVHD+Q + +
Sbjct: 184 SIEQCRQEGFSERMRSIANEGCKVQGFMYVNKVAGNFHFAPGKSSQHQHVHVHDLQQFKT 243
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE---GASMFNYYIKIIPTIYERLD 300
F+ TH I LSFG + + PLD E G++MF Y+IK++PT Y +L+
Sbjct: 244 TTFDMTHTIHLLSFGTEYPG---QVNPLDAVSKVPPENTPGSAMFQYFIKVVPTEYVKLN 300
Query: 301 GS----------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
G G+ G+PG+FF YE SP++VKITE+ KS H T +
Sbjct: 301 GETEQTSQFSATSHVKMINHAAGENGLPGVFFMYEPSPMLVKITERRKSFMHFLTGVCAI 360
Query: 345 ISGTYITFMLVDALLHSCVKKI-SKVEIGGKT 375
+ G + LVDA ++ + I K+E+G +T
Sbjct: 361 VGGVFTVAGLVDATIYHSYRSIKKKMELGKQT 392
>gi|56753075|gb|AAW24747.1| SJCHGC09363 protein [Schistosoma japonicum]
gi|226486460|emb|CAX74359.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
gi|226486464|emb|CAX74361.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
Length = 379
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 148/379 (39%), Positives = 206/379 (54%), Gaps = 33/379 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF KP +DF KT+ G V+I+ I L + + + +E+ VD +RG
Sbjct: 8 LRNFDAFAKPLKDFRIKTMSGAMVSIISSFIIGILFTSEFISFMRTQNKQEIIVDINRGE 67
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ I+LDI + I C +L LD +D++G Q L+V H +YK + + G P+ + VN
Sbjct: 68 KMSIYLDITINFIPCAFLRLDTMDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRHTVN-- 125
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ TTT DPN CGSCYGA++ TRKCCNTC EV+ AY +W
Sbjct: 126 ------DDSALTTTR--DPNYCGSCYGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFE 177
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC+NE +N EGC+I+G L VNRV G FHIAPG SY+ NH HVH I+ F
Sbjct: 178 QCRNENWDGMKRNIGNEGCRIHGSLTVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQF 237
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD------ 300
N +H I L FG + LDGT ++ + MFNYY+K++PT+Y +
Sbjct: 238 NVSHSITELRFGDAYPG---QINSLDGTKMTVDKPSQMFNYYLKLVPTMYTSVSNNESTL 294
Query: 301 ------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GS L G G+PG+FF+YE++PL+VKITE+ KS H T I G
Sbjct: 295 ITNQYSATWHSRGSPLSGDGQGLPGVFFNYEIAPLLVKITEERKSFVHFLTNTCAIIGGV 354
Query: 349 YITFMLVDALLH--SCVKK 365
+ L+DA ++ SCV +
Sbjct: 355 FTVASLLDAFIYQSSCVLR 373
>gi|226486462|emb|CAX74360.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
Length = 379
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 148/379 (39%), Positives = 206/379 (54%), Gaps = 33/379 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF KP +DF KT+ G V+I+ I L + + + +E+ VD +RG
Sbjct: 8 LRNFDAFAKPLKDFRIKTMSGAMVSIISSFIIGILFTSEFISFMRTQNKQEIIVDINRGE 67
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ I+LDI + I C +L LD +D++G Q L+V H +YK + + G P+ + VN
Sbjct: 68 KMSIYLDITINFIPCAFLRLDTMDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRHTVN-- 125
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ TTT DPN CGSCYGA++ TRKCCNTC EV+ AY +W
Sbjct: 126 ------DDSALTTTR--DPNYCGSCYGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFE 177
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC+NE +N EGC+I+G L VNRV G FHIAPG SY+ NH HVH I+ F
Sbjct: 178 QCRNENWDGMKRNIGNEGCRIHGSLTVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQF 237
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD------ 300
N +H I L FG + LDGT ++ + MFNYY+K++PT+Y +
Sbjct: 238 NVSHSITELRFGDAYPG---QINSLDGTKMTVDKPSQMFNYYLKLVPTMYTSVSNNESTL 294
Query: 301 ------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GS L G G+PG+FF+YE++PL+VKITE+ KS H T I G
Sbjct: 295 ITNQYSATWHSRGSPLSGDGQGLPGVFFNYEIAPLLVKITEERKSFVHFLTNTCAIIGGV 354
Query: 349 YITFMLVDALLH--SCVKK 365
+ L+DA ++ SCV +
Sbjct: 355 FTVASLLDAFIYQSSCVLR 373
>gi|357489473|ref|XP_003615024.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355516359|gb|AES97982.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 386
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 142/383 (37%), Positives = 215/383 (56%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +TIV + + L ++ Y +T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGLITIVSSILMLLLFFSELRLYLHAATETKLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P ++C ++LDA+D SGEQHL V H+I K+R+D G I+ Q + +
Sbjct: 67 ETLRINFDVTFPALACSIVSLDAMDISGEQHLDVRHDIIKKRIDSHGNVIETRQDGIGSP 126
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + G + CGSCYGAE +CCN+C EV+EAYR K WAL D+I
Sbjct: 127 NIEKPLQRHGGRLE---HNETYCGSCYGAEASDEECCNSCEEVREAYRKKGWALSSPDSI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E E++K EGC +YG+LEVN+V+G+FH APG S+ + VHVHD+ + +
Sbjct: 184 DQCKREGFLERIKEEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKES 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +HHI ++FG PLD E + M+ Y+IK++PT+Y + G+ +
Sbjct: 244 FNLSHHINRIAFGDYFPG---VVNPLDRVHWTQETPSGMYQYFIKVVPTMYTDVSGNTIQ 300
Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHFRTADVGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGIFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ K I K+E+G
Sbjct: 361 VSGILDSFIYHGQKAIKKKMELG 383
>gi|224082148|ref|XP_002306582.1| predicted protein [Populus trichocarpa]
gi|222856031|gb|EEE93578.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 148/383 (38%), Positives = 215/383 (56%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLASSVVMFLLFFSELRLYLHAVTETKLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL V+H+I K+RLD G I E +++ + A
Sbjct: 67 ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDFHGNVI-EARQDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K +G E CGSCYGAE CCN+C +V+EAYR K WA+ D +
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCYGAEASDEDCCNSCEDVREAYRKKGWAVTNPDLM 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +K+K+ EGC IYG+LEVN+V+G+FH APG S+ + VHVHD+ + +
Sbjct: 184 DQCKREGFLQKIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN TH I L+FG PLDG E + M+ Y+IK++PT+Y + G +
Sbjct: 244 FNITHKINRLTFGEYFPG---VVNPLDGVQWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
Query: 306 -----------GGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G D G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHFRGTDIGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D ++ K I K+EIG
Sbjct: 361 VSGILDTFIYHGQKAIKKKMEIG 383
>gi|225459342|ref|XP_002285801.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|302141938|emb|CBI19141.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 143/383 (37%), Positives = 214/383 (55%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ +F+ L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLASSIFMLLLFISELRLYLHAVTETKLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL V H+I K+R+D G I E +++ + +
Sbjct: 67 ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVRHDIIKKRIDAHGSVI-EARQDGIGS 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K ++G E CGSCYGAE CCN C EV+EAYR K WA+ D I
Sbjct: 126 PKIEKPLQKHGGRLEHNE--TYCGSCYGAEASDDDCCNNCEEVREAYRKKGWAMSNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC IYG+LEVN+V+G+FH APG S+ +++HVHD+ + +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNIHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +H I L+FG PLDG + M+ Y+IK++PT+Y + G +
Sbjct: 244 FNISHKINRLAFGDYFPG---VVNPLDGVQWIQATPSGMYQYFIKVVPTVYTHVSGHTIS 300
Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 TNQFSVTEHFRNAELGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ K I K+EIG
Sbjct: 361 VSGILDSFIYHSQKAIKKKIEIG 383
>gi|255545672|ref|XP_002513896.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223546982|gb|EEF48479.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 386
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/383 (38%), Positives = 214/383 (55%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFISELRLYIHAVTETKLAVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL V+H+I K+RLD G I E +++ + A
Sbjct: 67 ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVI-EARQDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K + +G E CGSCYGAE CCN+C +V+EAYR K WAL D I
Sbjct: 126 PKIENPLQRHGGRLEHNE--TYCGSCYGAEASDEDCCNSCEDVREAYRKKGWALSNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC IYG+LEVN+V+G+FH APG S+ ++VHVHD+ + +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
FN +H I L+FG PLDG E + M+ Y+IK++PT+Y + G
Sbjct: 244 FNISHKINRLAFGDYFPG---VVNPLDGVHWTQETPSGMYQYFIKVVPTVYTDVSGYTIQ 300
Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHFRSAEAGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ K I K+EIG
Sbjct: 361 VSGILDSFIYHGQKAIKKKMEIG 383
>gi|356552872|ref|XP_003544786.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 386
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 145/383 (37%), Positives = 216/383 (56%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SR
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFFSELRLYLHAVTETKLVVDTSRA 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL V+H+I K+RLD G I E ++E + A
Sbjct: 67 ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVI-ETRQEGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K +G E CGSCYGAE CCN+C +V+EAYR K WAL D I
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCYGAEESDDDCCNSCEDVREAYRKKGWALSNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC +YG+LEVN+V+G+FH APG S+ + VHVHD+ + +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +HHI L+FG PLD E + M+ Y+IK++PT+Y + G +
Sbjct: 244 FNLSHHINRLAFGEYFPG---VVNPLDNVHWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
Query: 306 G-----------GDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
GD G +PG+FF Y+LSP+ V TE++ S H T + + G +
Sbjct: 301 SNQFSVTEHFRTGDVGRLQSLPGVFFFYDLSPIKVTFTEENVSFLHFLTNVCAIVGGIFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ + I K+E+G
Sbjct: 361 VSGILDSFIYHGQRAIKKKMELG 383
>gi|167535515|ref|XP_001749431.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772059|gb|EDQ85716.1| predicted protein [Monosiga brevicollis MX1]
Length = 394
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/396 (37%), Positives = 221/396 (55%), Gaps = 26/396 (6%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + LK DA+ K +DF KT G AV+I+ + + L ++ + EELFV
Sbjct: 1 MAIFDNLKRFDAYPKTLDDFRVKTFSGAAVSIIAIIIMVILFSSELVYFLSTDVHEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D++R KL I+LDI P + C YL+LD +D SGE +++H+++++RLD G I Q+
Sbjct: 61 DTARNEKLRINLDITFPKMPCVYLSLDVMDISGENEQNIDHDVFRQRLDASGNKIYNGQE 120
Query: 121 EV--VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
E+ + V + +L DPN+C SCYGAE +CCNTC +V+EAYR K WA
Sbjct: 121 EIDELGESHADNVADKALDGLKDL-DPNRCESCYGAEDTEGQCCNTCAQVQEAYRKKGWA 179
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
I QC+ E ++ EGCQ+YG+LEVN+V+G+FHIAPG S+ +++H+HD+
Sbjct: 180 FRSGQGIAQCEREGYDAMMEAQEREGCQLYGHLEVNKVAGNFHIAPGRSFEQHNMHIHDM 239
Query: 239 QPYTS---AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE-GASMFNYYIKIIPT 294
Q + A FN TH I HLSFGI D R LDG V E GA M+ Y++K++PT
Sbjct: 240 QSFGREKLAKFNLTHVINHLSFGIDYPD---RVNSLDGHVEVPNEYGAIMYQYFLKVVPT 296
Query: 295 IYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
Y L +++ G G+PG+FF Y++SP+ +++T+ S+S H
Sbjct: 297 RYRFLSQTEIDTNQYSVTMHQREIRPDQGTSGLPGLFFMYDISPMKIQLTQSSRSFFHFL 356
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
T + I G Y ++D L+ ++ + + GK
Sbjct: 357 TGLCAIIGGVYTVAGMIDGFLYHGIRTLKAKQNMGK 392
>gi|281211641|gb|EFA85803.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Polysphondylium pallidum PN500]
Length = 388
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 156/398 (39%), Positives = 218/398 (54%), Gaps = 45/398 (11%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
++LK DA+ K +DF KT G V+IV +FI +L + Y T ELFVD++R
Sbjct: 3 QKLKSFDAYPKTVDDFRVKTYAGAIVSIVSSIFIIWLFLSQISIYMTTETHHELFVDTNR 62
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
KL I++D+V + C YL+LDA+D SGE V HNI+KRRL G+ I + K N
Sbjct: 63 AEKLKINIDVVFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKRRLSPTGEFIPDAPKREDN 122
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAYRYKKWALPEL 182
K KV EN D +CGSC GAE ++ CCNTC EV+ AY+ W
Sbjct: 123 VNIKPKV-NEN--------DRPECGSCMGAENPSKGINCCNTCEEVRVAYQKMGWGFDPS 173
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
DT QC E T+ + EGCQ+YG+L VN+V+G+FH APG S+ +H+HVHD+Q +
Sbjct: 174 DT-PQCVREGFTKNVVEQNGEGCQVYGFLLVNKVAGNFHFAPGKSFQQHHMHVHDLQSF- 231
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE----------GASMFNYYIKII 292
FN +H I LSFG D + PLDG V+K E G+ MF YY+KI+
Sbjct: 232 KGQFNLSHTISRLSFG---NDFPGIKNPLDG-VSKTEANQYQYHNLVVGSGMFQYYVKIV 287
Query: 293 PTIYERLDG-----------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
PTIYE L+G +K G G+PG+FF Y+LSP+M+K+ E+SKS
Sbjct: 288 PTIYEGLNGNLINTNQYSVTEHYRLLAKKGEEMTGLPGLFFMYDLSPIMMKVVERSKSFA 347
Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
T + + G + + D+ ++ K + K+++G
Sbjct: 348 SFITSVCAIVGGVFTVAGIFDSFIYQTTKSLKRKIDLG 385
>gi|332248939|ref|XP_003273622.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Nomascus leucogenys]
Length = 380
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 156/394 (39%), Positives = 219/394 (55%), Gaps = 44/394 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEG---CQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HD 237
QC L+ T E C + +V+G+FH APG S+ +HVHV HD
Sbjct: 180 EQC----PARGLQRTQPENERECSL-------QVAGNFHFAPGKSFQQSHVHVHAVEIHD 228
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
+Q + N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y
Sbjct: 229 LQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYM 285
Query: 298 RLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
++DG L GD G+PG+F YELSP+MVK+TEK +S H T +
Sbjct: 286 KVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 345
Query: 342 MCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
I G + L+D+L++ + I K GKT
Sbjct: 346 CAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 379
>gi|297846654|ref|XP_002891208.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
lyrata]
gi|297337050|gb|EFH67467.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
lyrata]
Length = 386
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/383 (37%), Positives = 216/383 (56%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T++ + + L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLIVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ DI P ++C L++DA+D SGE HL V+H+I KRRLD +G I E +++ + A
Sbjct: 67 ETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K ++G E CGSCYGAE E CCN+C +V+EAYR K W + D I
Sbjct: 126 TKIEKPLQKHGGRLEHNE--TYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC IYG+LEVN+V+G+FH APG S+ + VHVHD+ + +
Sbjct: 184 DQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
FN +H I L++G PLD + +M+ Y+IK++PT+Y + G
Sbjct: 244 FNISHKINRLTYGDYFPG---VVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300
Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++DA ++ K I K+EIG
Sbjct: 361 VSGIIDAFIYHGQKAIKKKMEIG 383
>gi|356548103|ref|XP_003542443.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 386
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 143/383 (37%), Positives = 215/383 (56%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SR
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFYSELRLYLHAVTETKLVVDTSRA 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQ L V+H+I K+RLD G I E ++E + A
Sbjct: 67 ETLRINFDVTFPALPCSILSLDAMDISGEQRLDVKHDIIKKRLDSRGNVI-ETRQEGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K +G E CGSCYG+E CCN+C +V+EAYR K WAL D I
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCYGSEVSDDDCCNSCEDVREAYRKKGWALSNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC +YG+LEVN+V+G+FH APG S+ + VHVHD+ + +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +HHI L+FG PLD E + M+ Y+IK++PT+Y + G +
Sbjct: 244 FNLSHHINRLTFGEYFPG---VVNPLDNVHWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
Query: 306 G-----------GDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
GD G +PG+FF Y+LSP+ V TE++ S H T + + G +
Sbjct: 301 SNQFSVTEHFRTGDMGRLQSLPGVFFFYDLSPIKVTFTEENVSFLHFLTNVCAIVGGIFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ + I K+E+G
Sbjct: 361 VSGILDSFIYHGQRAIKKKMELG 383
>gi|38347102|emb|CAE02574.2| OSJNBa0006M15.17 [Oryza sativa Japonica Group]
gi|116309990|emb|CAH67017.1| H0523F07.5 [Oryza sativa Indica Group]
gi|218194960|gb|EEC77387.1| hypothetical protein OsI_16129 [Oryza sativa Indica Group]
Length = 386
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 139/383 (36%), Positives = 212/383 (55%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T L VD+SRG
Sbjct: 7 KLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRLYLHAVTETTLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C ++LDA+D SG++HL V+H+I+K+R+D+ G I Q + V
Sbjct: 67 ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQ-DAVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K ++ +G E CGSCYGAE +CCN+C +V+EAYR K W + D I
Sbjct: 126 MKVEQPLQRHGGRLEHNE--TYCGSCYGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E + +K+ EGC IYG+LEVN+V+G+FH APG S+ +VHVHD+ P+ +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
FN +H I LSFG + PLDG M+ Y+IK++PT+Y ++
Sbjct: 244 FNVSHKINKLSFGQRFPG---VVNPLDGAQWMQHSSYGMYQYFIKVVPTVYTDINEHIIL 300
Query: 301 ----------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHFRSSESGRIQAVPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ + I K+EIG
Sbjct: 361 VSGIIDSFVYHGQRAIKKKMEIG 383
>gi|238478737|ref|NP_001154394.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|12324714|gb|AAG52317.1|AC021666_6 unknown protein; 24499-21911 [Arabidopsis thaliana]
gi|27808598|gb|AAO24579.1| At1g36050 [Arabidopsis thaliana]
gi|110736190|dbj|BAF00066.1| hypothetical protein [Arabidopsis thaliana]
gi|332193720|gb|AEE31841.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 386
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 143/383 (37%), Positives = 215/383 (56%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T++ + + L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLIVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ DI P ++C L++DA+D SGE HL V+H+I KRRLD +G I E +++ + A
Sbjct: 67 ETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K + ++G E CGSCYGAE E CCN+C +V+EAYR K W + D I
Sbjct: 126 TKIENPLQKHGGRLGHNE--TYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC IYG+LEVN+V+G+FH APG S+ + VHVHD+ + +
Sbjct: 184 DQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
FN +H I L++G PLD + +M+ Y+IK++PT+Y + G
Sbjct: 244 FNISHKINRLTYGDYFPG---VVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300
Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++DA ++ K I K+EIG
Sbjct: 361 VSGIIDAFIYHGQKAIKKKMEIG 383
>gi|296481082|tpg|DAA23197.1| TPA: endoplasmic reticulum-Golgi intermediate compartment protein 3
[Bos taurus]
Length = 306
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 133/300 (44%), Positives = 190/300 (63%), Gaps = 11/300 (3%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAE E KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295
>gi|194374867|dbj|BAG62548.1| unnamed protein product [Homo sapiens]
Length = 321
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 133/296 (44%), Positives = 189/296 (63%), Gaps = 9/296 (3%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSGAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDG 292
>gi|240254210|ref|NP_564467.5| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|332193719|gb|AEE31840.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 489
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/378 (37%), Positives = 211/378 (55%), Gaps = 21/378 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T++ + + L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLIVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ DI P ++C L++DA+D SGE HL V+H+I KRRLD +G I E +++ + A
Sbjct: 67 ETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K + ++G E CGSCYGAE E CCN+C +V+EAYR K W + D I
Sbjct: 126 TKIENPLQKHGGRLGHNE--TYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC IYG+LEVN+V+G+FH APG S+ + VHVHD+ + +
Sbjct: 184 DQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
FN +H I L++G PLD + +M+ Y+IK++PT+Y + G
Sbjct: 244 FNISHKINRLTYGDYFPG---VVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300
Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKISK 368
++DA ++ K I K
Sbjct: 361 VSGIIDAFIYHGQKAIKK 378
>gi|226494692|ref|NP_001148795.1| LOC100282412 [Zea mays]
gi|194696974|gb|ACF82571.1| unknown [Zea mays]
gi|195622210|gb|ACG32935.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|414586929|tpg|DAA37500.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 386
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 138/383 (36%), Positives = 212/383 (55%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+V + L ++ Y T L VD+SRG
Sbjct: 7 KLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C ++LDA+D SG++HL V+H+++K+R+D G I Q +VV
Sbjct: 67 ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DVVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + +G E CGSCYGA+ +CCNTC +V+EAYR K W + D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLL 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E + +K+ EGC IYG++EVN+V+G+FH APG S+ ++VHVHD+ P+ +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
FN +H I LSFG PLDG M+ Y+IK++PT+Y ++
Sbjct: 244 FNVSHKINRLSFGEYFPG---VVNPLDGANWVQHSSYGMYQYFIKVVPTVYTDINEHIIL 300
Query: 301 ------GSKLGGGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G+ G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ + I K+EIG
Sbjct: 361 VSGIIDSFVYHSQRAIKKKMEIG 383
>gi|449465886|ref|XP_004150658.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
gi|449518819|ref|XP_004166433.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 386
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/383 (37%), Positives = 214/383 (55%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLSSSILMLLLFISELRLYLHAVTETKLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL V+H+I K+RLD G I E + + + A
Sbjct: 67 ETLRINFDVTFPALPCSLLSLDAMDISGEQHLDVKHDIIKKRLDSHGNAI-EARPDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K +G E CGSC+GAE+ CCN+C EV+EAYR K WAL D I
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCFGAESADDDCCNSCEEVREAYRKKGWALSNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +++K+ EGC IYG+LEVN+V+G+FH APG S+ ++VHVHD+ + +
Sbjct: 184 DQCKREGFLQRIKDEDGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +H I L+FG PLD K E ++ + Y+IK++PT+Y + G +
Sbjct: 244 FNISHKINRLAFGEYFPG---VVNPLDSVQWKQETPSATYQYFIKVVPTVYNSVSGYTIQ 300
Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G +P +FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHVRTAEVGRLQSLPAVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ K I K+EIG
Sbjct: 361 VSGILDSFIYHGQKVIKKKMEIG 383
>gi|441638772|ref|XP_004090166.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Nomascus leucogenys]
Length = 393
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 152/404 (37%), Positives = 219/404 (54%), Gaps = 51/404 (12%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC ++ + C + +V+G+FH APG S+ +HVHVH ++ + +
Sbjct: 180 EQCPAR-GLQRTQPENERECSL-------QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 231
Query: 246 F------------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
F N TH+I+HLSFG +D PLD T A + + MF Y
Sbjct: 232 FGLDNVQLWMSSGWCCLQINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQY 288
Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
++K++PT+Y ++DG L GD G+PG+F YELSP+MVK+TEK
Sbjct: 289 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKH 348
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+S H T + I G + L+D+L++ + I K GKT
Sbjct: 349 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 392
>gi|198425065|ref|XP_002127888.1| PREDICTED: similar to ERGIC and golgi 3 [Ciona intestinalis]
Length = 385
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/386 (37%), Positives = 208/386 (53%), Gaps = 28/386 (7%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
FS ++K DA+ K EDF KT+ G VT++ + L ++ Y ELFVD
Sbjct: 9 FSSKVKDFDAYPKTLEDFRIKTISGATVTLISGTIMLLLFLSELKYYLTTEVNSELFVDM 68
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRG+KL I++++ P + C++L+LD +D SG++ + V+H + K+ L+ DG + E E
Sbjct: 69 SRGNKLSINMNVTFPLVPCEFLSLDMIDVSGQRDIDVQHTLVKQPLNSDGSWVAE-AAEK 127
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
V+ V K V TE + CGSC+GAET+ CCNTC+++KEAYR K WA P
Sbjct: 128 VDLVGTKPV-----LNATEPPPADYCGSCFGAETKDMTCCNTCSDIKEAYRRKGWAFPRD 182
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
+I C E + K GC ++G+LEVNRV+G+FHI+PG SY + H+HVHD+
Sbjct: 183 GSITPCIGE---DDDKEPVGSGCYLHGHLEVNRVAGNFHISPGKSYEVGHMHVHDMARMG 239
Query: 243 S-AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
N +H HLSFG + PLD A E + F YY+KI+PT YE+L G
Sbjct: 240 KYKESNVSHVFNHLSFGSTYPG---QVHPLDNLEVIASESSVAFQYYVKIVPTTYEKLSG 296
Query: 302 SKLGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+PG+F SYELSP+MV+ E+ +S H T + I G
Sbjct: 297 DTFHTNQFSVTRHQKRNKDSRESLPGMFVSYELSPMMVRYVERRRSFVHFLTSVCAIIGG 356
Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
+ L D+ ++ K + K+E+G
Sbjct: 357 IFTVAGLFDSFIYHGSKALQKKIELG 382
>gi|242076030|ref|XP_002447951.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
gi|241939134|gb|EES12279.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
Length = 386
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 211/383 (55%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T L VD+SRG
Sbjct: 7 KLRSLDAYPKVNEDFYSRTLSGGVITLASSVIMLLLFVSELRLYLHAVTETTLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C ++LDA+D SG++HL V+H+++K+R+D G I Q + V
Sbjct: 67 ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DAVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + +G E CGSCYGA+ +CCN+C +V+EAYR K W + D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDGQCCNSCEDVREAYRKKGWGVSNPDLL 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E + +K+ EGC IYG++EVN+V+G+FH APG S+ ++VHVHD+ P+ +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
FN +H I LSFG PLDG M+ Y+IK++PT+Y ++
Sbjct: 244 FNVSHKINRLSFGEYFPG---VVNPLDGASWVQHSSYGMYQYFIKVVPTVYTDINEHIIL 300
Query: 301 ------GSKLGGGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G+ G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ + I K+EIG
Sbjct: 361 VSGIIDSFVYHSQRAIKKKMEIG 383
>gi|357163897|ref|XP_003579883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 386
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 134/383 (34%), Positives = 211/383 (55%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + L ++ Y T L VD+SRG
Sbjct: 7 KLRNLDAYPKVNEDFYSRTLSGGVITLASSFVMLLLFVSELRLYLHAVTETTLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+ DI P + C +++D +D SG++HL V+H+++K+R+D +G I Q + V
Sbjct: 67 EKLRINFDITFPALQCSIISIDVMDISGQEHLDVKHDVFKQRIDANGNVIATKQ-DAVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K +K +G E CGSCYGAE +CCN+C +V+EAYR K W + D+I
Sbjct: 126 MKVEKPLQMHGGRLEHNE--TYCGSCYGAEEPGEQCCNSCEDVREAYRKKGWGVSNPDSI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E + +K+ EGC IYG++E+N+V+G+FH APG S+ ++VHVHD+ P+ +
Sbjct: 184 DQCKREGFLQTIKDEEGEGCNIYGFVEINKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +H I LSFG PLDG M+ Y++K++PT+Y ++ +
Sbjct: 244 FNVSHKINKLSFGEPFPG---VVNPLDGAHWFQHSPYGMYQYFVKVVPTVYSHINEQIIL 300
Query: 306 GGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 301 SNQFSVTEHARSSESVRMQALPGVFFFYDLSPIKVTFTERHVSFLHFLTNVCAIVGGVFT 360
Query: 351 TFMLVDALLHSCVKKIS-KVEIG 372
++D+ ++ + I+ K EIG
Sbjct: 361 VSGIIDSFVYHGQRAITKKREIG 383
>gi|443732120|gb|ELU16969.1| hypothetical protein CAPTEDRAFT_192533 [Capitella teleta]
Length = 304
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 186/303 (61%), Gaps = 29/303 (9%)
Query: 49 YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
Y ELFVD++RG KL I++D+ PT+ C +L LDA+D SGEQ + V H+I+K+RL
Sbjct: 12 YLTTEVHPELFVDTARGQKLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRL 71
Query: 109 DLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE 167
DLDG ++ EP KE + + N ++ L SCYGAE+E KCCNTCNE
Sbjct: 72 DLDGIEVKAEPSKEG----QSSESCALNHALSSFLFSRF---SCYGAESEAHKCCNTCNE 124
Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
V+EAYR K WA + I QC E +L+ EGC+IYG+LEVN+V+G+FH+APG S
Sbjct: 125 VREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRS 184
Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFN 286
+S +H H+HD+Q FN +H I+HLSFG D + PLD + E+ MF+
Sbjct: 185 FSQHHAHIHDMQALQGMKFNMSHRIQHLSFG---DDYPGQVNPLDASEQVTEQADFVMFS 241
Query: 287 YYIKIIPTIYERLDGS--------------KLGG---GDGGMPGIFFSYELSPLMVKITE 329
YY+K++PT Y R +G K+GG G+ G+PG+F +YELSP+MVK TE
Sbjct: 242 YYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMVKYTE 301
Query: 330 KSK 332
K++
Sbjct: 302 KNR 304
>gi|224032113|gb|ACN35132.1| unknown [Zea mays]
gi|414586931|tpg|DAA37502.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 391
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 138/388 (35%), Positives = 212/388 (54%), Gaps = 27/388 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+V + L ++ Y T L VD+SRG
Sbjct: 7 KLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C ++LDA+D SG++HL V+H+++K+R+D G I Q +VV
Sbjct: 67 ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DVVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + +G E CGSCYGA+ +CCNTC +V+EAYR K W + D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLL 183
Query: 186 VQ-----CKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
Q CK E + +K+ EGC IYG++EVN+V+G+FH APG S+ ++VHVHD+ P
Sbjct: 184 DQVEPSDCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLP 243
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ +FN +H I LSFG PLDG M+ Y+IK++PT+Y ++
Sbjct: 244 FQKDSFNVSHKINRLSFGEYFPG---VVNPLDGANWVQHSSYGMYQYFIKVVPTVYTDIN 300
Query: 301 -----------GSKLGGGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
G+ G +PG+FF Y+LSP+ V TE+ S H T + +
Sbjct: 301 EHIILSNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIV 360
Query: 346 SGTYITFMLVDALLHSCVKKI-SKVEIG 372
G + ++D+ ++ + I K+EIG
Sbjct: 361 GGVFTVSGIIDSFVYHSQRAIKKKMEIG 388
>gi|328868763|gb|EGG17141.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium fasciculatum]
Length = 335
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 200/345 (57%), Gaps = 37/345 (10%)
Query: 54 TTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
T ELFVD++RG KL I++D+V + C +L+LDA+D SG+ V HNI+K+RL G
Sbjct: 5 THHELFVDTTRGEKLRINMDVVFHHLPCAFLSLDAMDVSGDHQFDVAHNIFKKRLSPTGM 64
Query: 114 PIQE--PQKE-VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEV 168
PI + PQ+E +N K+ EN D CGSCYGAE +R CC+TC EV
Sbjct: 65 PIADASPQREDTIN--KRVPAGNEN--------DKVDCGSCYGAEDPSRGISCCSTCEEV 114
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
+ AY+ K W++ E I QC E T+ + EGCQ+YG++ VN+V+G+FH APG S+
Sbjct: 115 RTAYQKKGWSIQEYSGIAQCVREGFTKNIVEQNGEGCQVYGFINVNKVAGNFHFAPGKSF 174
Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
+H+HVHD+Q + +FN +H I LSFG D + PLDG G+ MF YY
Sbjct: 175 QQHHMHVHDLQAF-KGSFNLSHSINRLSFG---NDFPGIKNPLDGVTKTEMVGSGMFQYY 230
Query: 289 IKIIPTIYERLDGSKLGGGD-----------------GGMPGIFFSYELSPLMVKITEKS 331
IK++PT+YE L+G+++ G+PG+FF Y+LSP+M+K++E+
Sbjct: 231 IKVVPTLYEGLNGNRISTNQFSVTEHYRLLAKKDEEPSGLPGLFFMYDLSPIMMKVSEQG 290
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIGGKT 375
KS T + + G + ++D++++ K + K+++G T
Sbjct: 291 KSFASFLTSVCAIVGGVFTVAGILDSMIYKTTKNLKKKIDLGKNT 335
>gi|326510689|dbj|BAJ87561.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514988|dbj|BAJ99855.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326533080|dbj|BAJ93512.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 383
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 140/382 (36%), Positives = 213/382 (55%), Gaps = 23/382 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LKGLDA+ K EDF+++T+ GG VT++ + L + YF +T +L VD+SRG
Sbjct: 7 KLKGLDAYPKVNEDFYKRTLSGGVVTLLSAFVMLLLFVSETKSYFYSATETKLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+L ++ DI P+I C L++D D SGEQH + H+I K+RLD G I E +KE +
Sbjct: 67 ERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLDSHGNVI-ESRKEGIGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K ++G + E+ CG+CYGAE +CCN+C EV+EAY+ K WAL D I
Sbjct: 126 TKIEKPLQKHGGRLGKGEE--YCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E E++K EGC ++G+L+V++V+G+FH APG Y ++V + ++
Sbjct: 184 DQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGKGYYESNVDMPELS--AEGG 241
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN TH I LSFG + PLDG + Y+IK++PTIY + G K+
Sbjct: 242 FNITHKINKLSFGTEFPG---AVNPLDGAQWTQPASDGTYQYFIKVVPTIYNDIRGRKID 298
Query: 306 GG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
DG + PG+FF Y+ SP+ V TE+++S H T + + G +
Sbjct: 299 SNQFSVTEHFRDGNVQPRPQPGVFFFYDFSPIKVIFTEENRSFLHYLTNLCAIVGGIFTV 358
Query: 352 FMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ K + K+EIG
Sbjct: 359 AGIIDSFIYHGQKALKKKMEIG 380
>gi|449684240|ref|XP_002157414.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Hydra magnipapillata]
Length = 311
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 183/317 (57%), Gaps = 23/317 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M S RLK DA+ K EDF KT G +T + + + L + Y ELFV
Sbjct: 1 MDISTRLKQFDAYPKTLEDFRVKTYGGALITGISSIIMFALFLSEFNYYLTTEVHPELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++R KL I++D+ P I C YL++DA+D SGEQ +EHNI+K+R D G PI +
Sbjct: 61 DTTRHQKLRINIDVYFPNIGCAYLSIDAMDVSGEQQTDLEHNIFKKRYDEKGNPIDTVEK 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE + ++ V N T L+D KC SCYGAET CCNTC +V+ AYR K W
Sbjct: 121 KEELGDKSEEAVKVLNST----LDDKPKCESCYGAETTDHPCCNTCEDVRVAYRKKGWGF 176
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV---- 235
+ D+I QCK E+ + + EGCQIYGY+EV++V+G+FHIAPG S+ H+HV
Sbjct: 177 HDPDSIEQCKREHWKDTFQQQSNEGCQIYGYIEVSKVAGNFHIAPGKSFQQQHIHVQTIR 236
Query: 236 -----------HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM 284
HD+QP+ + FN +H+I LSFG + PLDGT AE G+ M
Sbjct: 237 FGKDGTISLNMHDLQPFGAKQFNVSHNIWSLSFGEPIPG---VENPLDGTNVSAEAGSLM 293
Query: 285 FNYYIKIIPTIYERLDG 301
+ Y++KI+PT+Y++L G
Sbjct: 294 YQYFVKIVPTVYKKLSG 310
>gi|444729170|gb|ELW69597.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Tupaia chinensis]
Length = 393
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 148/420 (35%), Positives = 218/420 (51%), Gaps = 83/420 (19%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVV- 123
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSTEAERHELG 125
Query: 124 ---------NAVKKKKVTTENGTTTTELE-----------------------DPNKCGSC 151
N++ + + G + +++ DP++C SC
Sbjct: 126 KIEVKVFDPNSLDPDRCESCYGAESEDIKPCLEAADLELGKIEVKVFDPNSLDPDRCESC 185
Query: 152 YGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYL 211
YGAE+E KCCNTC +V+EAYR + WA DTI QC+ E ++K++ EGCQ+YG+L
Sbjct: 186 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 245
Query: 212 EVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPL 271
EVN++ N TH+I+HLSFG +D PL
Sbjct: 246 EVNKI------------------------------NMTHYIQHLSFG---EDYPGIVNPL 272
Query: 272 DGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPGIF 315
D T A + + MF Y++K++PT+Y ++DG L GD G+PG+F
Sbjct: 273 DHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVF 332
Query: 316 FSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
YELSP+MVK+TEK +S H T + I G + L+D+L++ + I K GKT
Sbjct: 333 VLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 392
>gi|242088319|ref|XP_002439992.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
gi|241945277|gb|EES18422.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
Length = 384
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 140/386 (36%), Positives = 215/386 (55%), Gaps = 22/386 (5%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
F +RLK LDA+ K EDF+++T+ GG VT+V + + L + YF +T +L VD
Sbjct: 3 AFLQRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSATETKLVVD 62
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+SRG +L ++ DI P+I C L++D +D SGEQH + H+I KRRLD G I E +KE
Sbjct: 63 TSRGERLRVNFDITFPSIPCTLLSVDTMDISGEQHHDIRHDIEKRRLDSHGNVI-EARKE 121
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ K ++ ++G + E CG+CYGAE +CCN+C EV+EAY+ K WAL
Sbjct: 122 GIGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D I QC E E++K EGC ++G+L+V++V+G+FH APG + +++ V ++
Sbjct: 180 PDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-V 238
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
FN TH I LSFG + PLDG + Y+IK++PTIY + G
Sbjct: 239 LEGGFNITHKINKLSFGTEFPG---VVNPLDGAQWIQPASDGTYQYFIKVVPTIYTDIRG 295
Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ DG + PG+FF Y+ SP+ V TE+++SL H T + + G
Sbjct: 296 HNIHSNQFSVTEHFRDGNILPKPQPGVFFFYDFSPIKVIFTEENRSLLHYLTNLCAIVGG 355
Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
+ ++D+ ++ K + K+E+G
Sbjct: 356 VFTVSGIIDSFIYHGQKALKKKMELG 381
>gi|256078219|ref|XP_002575394.1| serologically defined breast cancer antigen ny-br-84-related
[Schistosoma mansoni]
gi|353230384|emb|CCD76555.1| serologically defined breast cancer antigen ny-br-84-related
[Schistosoma mansoni]
Length = 338
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 133/344 (38%), Positives = 190/344 (55%), Gaps = 31/344 (9%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF KP +DF KT+ G V+I+ L I L ++ + +E+ VD +RG
Sbjct: 8 LQNFDAFAKPLKDFRIKTLSGALVSIISSLIIGILFTSELLSFTHTQNKQEIIVDVNRGE 67
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ I++DI + I C +L+LD +D++G Q L+V H +YK + +DG P+ + + VN
Sbjct: 68 KMSIYMDITLNFIPCRFLSLDTMDTTGAQQLNVMHEVYKTSVSVDGTPVSDSVRHAVN-- 125
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + T DPN CGSCYGAE+ +RKCCNTC EV+ AY +W +
Sbjct: 126 --------DASALTTTRDPNYCGSCYGAESPSRKCCNTCEEVQMAYNEMRWIFVNISAFE 177
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC+ E E + EGC+I+G L VNRV G+FHIAPG SY+ NH H H Q F
Sbjct: 178 QCRKENWNEIKQKIGNEGCRIHGNLTVNRVGGAFHIAPGHSYTENHAHFHSFQSLGPVQF 237
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL------- 299
N +H I L FG + + PLDGT + + M YY+K++PT+Y L
Sbjct: 238 NVSHSIGELRFG---ESYPGQVNPLDGTKLAVQTHSQMVIYYLKLVPTMYISLRRNESTV 294
Query: 300 -----------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSK 332
G+ L G G+PG+FF+YE++PL+VKITE+ K
Sbjct: 295 ITNQYSATWHSKGTPLTGDGQGLPGVFFNYEIAPLLVKITEEKK 338
>gi|150036309|emb|CAO03349.1| ERGIC and golgi 3 [Homo sapiens]
Length = 325
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 136/332 (40%), Positives = 194/332 (58%), Gaps = 37/332 (11%)
Query: 12 AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIH 71
A+ K EDF KT G VTIV L + L ++ Y EL+VD SRG KL I+
Sbjct: 1 AYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDKLKIN 60
Query: 72 LDIVVPTISCDY------------LALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
+D++ P + C + L++DA+D +GEQ L VEHN++K+RLD DG P+
Sbjct: 61 IDVLFPHMPCAWSQYLSLIFLLPDLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEA 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ + + K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA
Sbjct: 121 ER--HELGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAF 174
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
DTI QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q
Sbjct: 175 KNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQ 234
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+ N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++
Sbjct: 235 SFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKV 291
Query: 300 DGSKLGG----------------GDGGMPGIF 315
DG L GD G+PG+F
Sbjct: 292 DGEVLRTNQFSVTRHEKVANGLLGDQGLPGVF 323
>gi|326931697|ref|XP_003211962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Meleagris gallopavo]
Length = 411
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 131/326 (40%), Positives = 190/326 (58%), Gaps = 40/326 (12%)
Query: 77 PTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENG 136
P + L++DA+D +GEQ L VEHN++K+RLD G N V + E G
Sbjct: 100 PHLLVSDLSIDAMDVAGEQQLDVEHNLFKQRLDKAG-----------NRVTPEAERHELG 148
Query: 137 TTTTELEDPN-----KCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNE 191
++ DPN +C SCYGAE+E +CCNTC++V+EAYR + WA DTI QCK E
Sbjct: 149 KEEEKVFDPNSLDADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKRE 208
Query: 192 YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQPYTSAAF 246
++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHV HD+Q +
Sbjct: 209 GFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNI 268
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG----- 301
N TH+I+HLSFG +D PLDGT A++ + MF Y++K++PT+Y ++DG
Sbjct: 269 NMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRT 325
Query: 302 --------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
K+ GD G+PG+F YELSP+MVK+TEK + H T + + G +
Sbjct: 326 NQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFT 385
Query: 351 TFMLVDALLHSCVKKISKVEIGGKTV 376
+D+L++ + I K GKT+
Sbjct: 386 VAGFIDSLIYHSARAIQKKIELGKTI 411
>gi|225448309|ref|XP_002264644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|296085664|emb|CBI29463.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/387 (35%), Positives = 208/387 (53%), Gaps = 28/387 (7%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+RL+ LDA+ K EDF+ +T GG +T++ + + +L ++ Y T +L VD+SR
Sbjct: 6 QRLRNLDAYPKINEDFYSRTFSGGLITLISSIVMLFLFFSELRLYLHTVTETKLVVDTSR 65
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G L I+ D+ P + C L LDA+D SGEQH ++H+I K+R+D G + Q +
Sbjct: 66 GGTLRINFDVTFPAVPCSVLTLDAMDISGEQHHDIKHDIVKKRIDAHGNVVAVRQDGIGG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
+K + G LE K CGSCYGAE CCN+C+EV+EAYR K W + D
Sbjct: 126 PQIEKPLQRHGG----RLEHNEKYCGSCYGAEVTDDDCCNSCDEVREAYRKKGWGMTNPD 181
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
I QCK E +K+K EGC +YG+LEVN+V+G+FH +PG + +++HV+D+ +
Sbjct: 182 LIDQCKREGFVQKVKEEEGEGCNVYGFLEVNKVAGNFHFSPGKGFYQSNIHVNDLLAISK 241
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG-- 301
+N +H I L+FG PLDG + M+ Y+IK++PTIY + G
Sbjct: 242 DGYNISHRINKLAFGDHFPG---VVNPLDGAQWFQDAPDGMYQYFIKVVPTIYTDIRGHT 298
Query: 302 -------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
S G +PG++F Y+LSP+ V E+ S H T I + G
Sbjct: 299 IQSNQFSVTEHFRSAEPGRPHSLPGVYFFYDLSPIKVTSKEEHSSFLHFMTNICAIVGGI 358
Query: 349 YITFMLVDALL---HSCVKKISKVEIG 372
+ ++D+ + H +KK K+E+G
Sbjct: 359 FTVSGIIDSFVYHGHRAIKK--KMELG 383
>gi|357133202|ref|XP_003568216.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 384
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 139/385 (36%), Positives = 213/385 (55%), Gaps = 22/385 (5%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F ++LKGLDA+ K EDF+++T+ GG VT+V + + L + Y +T +L VD+
Sbjct: 4 FLQKLKGLDAYPKVNEDFYKRTLSGGVVTLVSAVVMLLLFISETSSYLNSATETKLVVDT 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRG +L ++ DI P+I C L++D D SGEQH + H+I K+RL+ G I E +KE
Sbjct: 64 SRGERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLNSHGNVI-ESRKEG 122
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+ K ++ ++G + E CG+CYGAE +CCN+C+EV+EAY+ K WAL
Sbjct: 123 IGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCDEVREAYKKKGWALTNP 180
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D I QC E E++K EGC ++G+L+V++V+G+FH APG + ++V V ++
Sbjct: 181 DLIDQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGRGFYESNVDVPELSSL- 239
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
FN TH I LSFG + PLDG + Y+IK++PT Y G
Sbjct: 240 EGGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTNYTDTRGR 296
Query: 303 KLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
K+ DG + PG+FF Y+ SP+ V TE++KS H T + + G
Sbjct: 297 KIDSNQFSVTEHFRDGNVHPRPQPGVFFFYDFSPIKVIFTEENKSFLHYLTNLCAIVGGI 356
Query: 349 YITFMLVDALLHSCVKKI-SKVEIG 372
+ ++D+ ++ K + K+EIG
Sbjct: 357 FTVSGIIDSFIYHGQKALKKKMEIG 381
>gi|449449715|ref|XP_004142610.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 385
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/385 (35%), Positives = 215/385 (55%), Gaps = 24/385 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+++ LDA+ K EDF+ +T+ GG +TI + + L ++ Y +T +L VD+SRG
Sbjct: 7 KIRKLDAYPKISEDFYNRTLSGGFITIASSIIMFLLFFSELRLYVHTATETKLIVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+L A+D SGEQHL V+H+I K+R+D G I + + + + +
Sbjct: 67 EHLRINFDVTFPALPCSVLSLHAMDISGEQHLDVKHDIVKKRIDYQGNVI-DSRPDGIGS 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ ++ ++G + E CGSCYGA E CCN+C +V+EAY K WAL D I
Sbjct: 126 TEIERPLQKHGGRLKQNE--TYCGSCYGASGE--DCCNSCQDVREAYHRKGWALSHPDLI 181
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD-IQPYTSA 244
QCK E +++KN EGC IYG+LEVN+V+G+FH APG + +++ +H+ + +
Sbjct: 182 DQCKREGFFQRVKNEEGEGCNIYGFLEVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWD 241
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
AFN +H I L+FG D PLDG + MF Y+IK++PT+Y+ ++G
Sbjct: 242 AFNISHRINRLTFG---DDFPGVVNPLDGVQWNQGTLSGMFQYFIKVVPTVYKAVNGKAI 298
Query: 302 --------SKLGGGDG----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
L G DG + G+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 299 KSNQFSVTQHLRGIDGESFQALHGVFFFYDLSPIKVTFTEEHISFFHFLTNVCAIVGGVF 358
Query: 350 ITFMLVDALLHSCVKKISKVEIGGK 374
++D++++ K I K GK
Sbjct: 359 TISGILDSIIYHGQKAIKKKMALGK 383
>gi|357112459|ref|XP_003558026.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 387
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 139/392 (35%), Positives = 216/392 (55%), Gaps = 28/392 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L+ LDA+ K EDF+ +T+ GG +TI L I L ++ Y +T +L V
Sbjct: 1 MDLWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFFSEIRLYLYSATESKLTV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG +L I+ D+ P + C +A+D +D SGEQH + H+I+K+R+D G I E +K
Sbjct: 61 DTSRGERLHINFDVTFPALPCSLVAIDTMDVSGEQHYDIRHDIFKKRIDHLGNVI-ESRK 119
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ V + K ++ +G E CGSCYG+E +CCN+C EV++AYR K WAL
Sbjct: 120 DGVGSPKIERPLQNHGGRLDHNE--AYCGSCYGSEESDDQCCNSCEEVRDAYRKKGWALT 177
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+++I QCK E ++LK+ EGC I+G+++VN+V+G+FH APG + + D+
Sbjct: 178 NVESIDQCKREGFVQRLKDEQGEGCNIHGFVDVNKVAGNFHFAPGKHLDQSFNFLQDMLN 237
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG---ASMFNYYIKIIPTIYE 297
+ +N +H I LSFG + PLDG K E+ M+ Y++K++PTIY
Sbjct: 238 FQPENYNISHKINKLSFGKEFPG---VVNPLDGVEWKQEQATGLTGMYQYFVKVVPTIYT 294
Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ G K+ + PG++F YE SP+ V TE++ SL H T I
Sbjct: 295 DIRGRKIHSNQFSVTEHFREAIGFPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354
Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
+ G + ++D+ + H +KK K+EIG
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMEIG 384
>gi|428183328|gb|EKX52186.1| hypothetical protein GUITHDRAFT_65491 [Guillardia theta CCMP2712]
Length = 425
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 139/386 (36%), Positives = 210/386 (54%), Gaps = 39/386 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ D + K +DF +T+ G V+I+ +L + LI ++ Y + T EL VD+SRG
Sbjct: 39 RLREFDIYPKTIQDFQVRTLAGAVVSILGFLIMFVLILGEINLYLTIQTDHELSVDTSRG 98
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+ +I + C ++LD +D SGEQH+ V H +YK+RLD+DG I + +N
Sbjct: 99 EKLQINFNITFHAMPCTIISLDTMDISGEQHIDVHHEVYKQRLDVDGNVILLLSRACLN- 157
Query: 126 VKKKKVTTENGTTTT-----ELEDP---NKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
VT +G TT + P +CGSCYGAE +CCNTC+ V+EAYR + W
Sbjct: 158 -----VTNGSGDFTTLRAHAGFDAPLTGGECGSCYGAEESPDECCNTCDSVREAYRRRGW 212
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYL-------EVNRVSGSFHIAPGLSYSI 230
A D IVQCK E K++ EGC++ G L +VN+V+G+FH +PG S+S
Sbjct: 213 AFVNSDGIVQCKTEGFLLKMQEERHEGCRVVGTLQARLTREQVNKVAGNFHFSPGKSFSQ 272
Query: 231 N-HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
VH D+ +N +H I HLSFG K R PLDG V E ++M+ Y++
Sbjct: 273 QVGVHFQDLLVLRKTDYNVSHAINHLSFGRKYPG---RVNPLDGVVRICEFRSAMYQYFV 329
Query: 290 KIIPTIYERLDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
K++PT Y+ +G+ +L G G+PG+FF Y+LSP+ + E++ S
Sbjct: 330 KVVPTQYQYRNGTILSTNQFSTTENTRQLEGFTRGLPGVFFFYDLSPIKATLAERNNSFL 389
Query: 336 HLWTKIMCNISGTYITFMLVDALLHS 361
H T + I G + ++D+ +++
Sbjct: 390 HFLTGLCAIIGGVFTVMGIIDSTIYT 415
>gi|326506194|dbj|BAJ86415.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 363
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 135/365 (36%), Positives = 203/365 (55%), Gaps = 22/365 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LKGLDA+ K EDF+++T+ GG VT++ + L + YF +T +L VD+SRG
Sbjct: 7 KLKGLDAYPKVNEDFYKRTLSGGVVTLLSAFVMLLLFVSETKSYFYSATETKLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+L ++ DI P+I C L++D D SGEQH + H+I K+RLD G I E +KE +
Sbjct: 67 ERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLDSHGNVI-ESRKEGIGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K ++G + E+ CG+CYGAE +CCN+C EV+EAY+ K WAL D I
Sbjct: 126 TKIEKPLQKHGGRLGKGEE--YCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E E++K EGC ++G+L+V++V+G+FH APG Y ++V + ++
Sbjct: 184 DQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGKGYYESNVDMPELS--AEGG 241
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN TH I LSFG + PLDG + Y+IK++PTIY + G K+
Sbjct: 242 FNITHKINKLSFGTEFPG---AVNPLDGAQWTQPASDGTYQYFIKVVPTIYNDIRGRKID 298
Query: 306 GG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
DG + PG+FF Y+ SP+ V TE+++S H T + + G +
Sbjct: 299 SNQFSVTEHFRDGNVQPRPQPGVFFFYDFSPIKVIFTEENRSFLHYLTNLCAIVGGIFTV 358
Query: 352 FMLVD 356
++D
Sbjct: 359 AGIID 363
>gi|449510462|ref|XP_004163672.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 3-like [Cucumis
sativus]
Length = 385
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 138/385 (35%), Positives = 214/385 (55%), Gaps = 24/385 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+++ LDA+ K EDF+ +T+ GG +TI + + L ++ Y +T +L VD+SRG
Sbjct: 7 KIRKLDAYPKISEDFYNRTLSGGFITIASSIIMFLLFFSELRLYVHTATETKLIVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+L A+D SGEQHL V+H+I K+R+D G I + + + + +
Sbjct: 67 EHLRINFDVTFPALPCSVLSLHAMDISGEQHLDVKHDIVKKRIDYQGNVI-DSRPDGIGS 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ ++ ++G + E CGSCYGA E CCN+C +V+EAY K WAL D I
Sbjct: 126 TEIERPLQKHGGRLKQNE--TYCGSCYGASGE--DCCNSCQDVREAYHRKGWALSHPDLI 181
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD-IQPYTSA 244
QCK E +++KN EGC IYG+LEVN+V+G+FH APG + +++ +H+ + +
Sbjct: 182 DQCKREGFFQRVKNEEGEGCNIYGFLEVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWD 241
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
AFN +H I L+FG D PLDG + MF Y+IK++PT+Y+ ++G
Sbjct: 242 AFNISHRINRLTFG---DDFPGVVNPLDGVQWNQGTLSGMFQYFIKVVPTVYKAVNGKAI 298
Query: 302 --------SKLGGGDG----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
L G DG + G FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 299 KSNQFSVTQHLRGIDGESFQALHGXFFFYDLSPIKVTFTEEHISFFHFLTNVCAIVGGVF 358
Query: 350 ITFMLVDALLHSCVKKISKVEIGGK 374
++D++++ K I K GK
Sbjct: 359 TISGILDSIIYHGQKAIKKKMALGK 383
>gi|356512071|ref|XP_003524744.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 431
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 137/384 (35%), Positives = 208/384 (54%), Gaps = 26/384 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG VT+V + +L ++ Y T +L VD+SRG
Sbjct: 54 KLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFFSELSLYLYTVTESKLLVDTSRG 113
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL + HNI K+R+D +G I+E +K+ + A
Sbjct: 114 DTLHINFDVTFPAVRCSILSLDAMDISGEQHLDIRHNIVKKRIDANGNVIEE-RKDGIGA 172
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K ++ ++G D CGSC+GAE CCN+C EV+EAYR K WA+ +D I
Sbjct: 173 PKIERPLQKHGGRLGH--DEKYCGSCFGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLI 230
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E +++K+ EGC + G LEVN+V+G+FH A G S+ + + + D+
Sbjct: 231 DQCQREGYVQRVKDEEGEGCNLQGSLEVNKVAGNFHFATGKSFLQSAIFLADLLALQDNH 290
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--------- 296
+N +H I LSFG PLDG M+ Y+IK++PTIY
Sbjct: 291 YNISHRINKLSFGHHFPG---LVNPLDGVKWVQGPAHGMYQYFIKVVPTIYTDIRGRVIH 347
Query: 297 -------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
E S+LG +PG+FF Y++SP+ V E+ H T I I G +
Sbjct: 348 SNQYSVTEHFKSSELG---VAVPGVFFFYDISPIKVNFKEEHIPFLHFLTNICAIIGGVF 404
Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
++D+ ++ + I K+E+G
Sbjct: 405 TVAGIIDSSIYYGQRTIKRKMELG 428
>gi|212721670|ref|NP_001132255.1| uncharacterized protein LOC100193691 [Zea mays]
gi|194693892|gb|ACF81030.1| unknown [Zea mays]
gi|223949235|gb|ACN28701.1| unknown [Zea mays]
gi|413949703|gb|AFW82352.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 384
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/386 (35%), Positives = 212/386 (54%), Gaps = 22/386 (5%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
F RLK LDA+ K EDF+++T+ GG VT+V + + L + YF ST +L VD
Sbjct: 3 AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+SRG +L ++ DI P+I C L++D D SGEQH + H+I KRRL+ G I E +KE
Sbjct: 63 TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ K ++ ++G + E CG+CYGAE +CCN+C EV+EAY+ K WAL
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D I QC E +++K EGC + G+L+V++V+G+FH APG + +++ V ++
Sbjct: 180 PDLIDQCAREDFIDRVKTQQDEGCNVLGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-L 238
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
FN +H I LSFG + PLDG + Y+IK++PTIY + G
Sbjct: 239 LEGGFNISHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRG 295
Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ DG + PG+FF Y+ SP+ V TE+++SL H T + + G
Sbjct: 296 RGIHSNQFSVTEHFRDGNVRPKSQPGVFFFYDFSPIKVIFTEENRSLLHYLTNLCAIVGG 355
Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
+ ++D+ ++ K + K+E+G
Sbjct: 356 VFTVSGIIDSFIYHGQKALKKKMELG 381
>gi|195162746|ref|XP_002022215.1| GL25735 [Drosophila persimilis]
gi|194104176|gb|EDW26219.1| GL25735 [Drosophila persimilis]
Length = 313
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/293 (45%), Positives = 184/293 (62%), Gaps = 23/293 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F++ L+ LDA+ + +DF +TV G AVTI+ IS LI ++ Y Q + EELFV
Sbjct: 1 MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLSYMQPALNEELFV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
D++RG KL I+LD+ + ++C+Y++LDA+DSSG+ HL V+H+I+K RLDL G+P++E P
Sbjct: 61 DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDIFKHRLDLKGEPLKETPI 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
KE+V K T CGSCYGAE CCNTC +V +AYR KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLHKWNV 166
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
++D I QCK +Y ++ F EGC+I G+LEVNR++GSFH APG S+SI H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKI 291
+ +H I HLSFG K++ + PLDG V AE MFN+Y+KI
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKTEMFNHYLKI 272
>gi|108707873|gb|ABF95668.1| Serologically defined breast cancer antigen NY-BR-84, putative,
expressed [Oryza sativa Japonica Group]
Length = 387
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/392 (35%), Positives = 217/392 (55%), Gaps = 28/392 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L+ LDA+ K EDF+ +T+ GG +TI L I L ++ Y +T +L V
Sbjct: 1 MDLWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFLSEIRLYLYSATDSKLTV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG +L I+ D+ P + C +A+D +D SGEQH + H+I K+R+D G I E +K
Sbjct: 61 DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDNLGNVI-ESRK 119
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ V A K ++ ++G E CGSCYG+E +CCN+C +V++AYR K WAL
Sbjct: 120 DGVGAPKIERPLQKHGGRLDHNE--VYCGSCYGSEESDDQCCNSCEDVRDAYRKKGWALT 177
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
++ I QCK E ++LK+ EGC I+G++ VN+V+G+FH APG S + + D+
Sbjct: 178 NIEEIDQCKREGFVQRLKDEQGEGCSIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLN 237
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
+ +N +H I LSFG++ PLDG + + G + M+ Y++K++PTIY
Sbjct: 238 FQQENYNISHKINKLSFGVEFPG---VVNPLDGVEWIQEHTNGLTGMYQYFVKVVPTIYT 294
Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ G K+ + PG++F YE SP+ V TE++ SL H T I
Sbjct: 295 DIRGRKINSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354
Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
+ G + ++D+ + H +KK K+EIG
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMEIG 384
>gi|291000812|ref|XP_002682973.1| predicted protein [Naegleria gruberi]
gi|284096601|gb|EFC50229.1| predicted protein [Naegleria gruberi]
Length = 416
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/408 (36%), Positives = 216/408 (52%), Gaps = 49/408 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK D + K +DF KT+ GG ++I+ L I L+ + Y QV ++L+VD+ +
Sbjct: 3 LKSFDFYPKTQDDFRVKTLGGGLISIISLLVILILVLGEFYLYLQVERFDQLYVDTQQER 62
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRLDLDGKPIQEPQKEVVN- 124
K+PI+++I P +SCD L LD +D SGE H+H++ H +YK RL LDGKPI E Q E V+
Sbjct: 63 KIPIYINITFPAVSCDALNLDVMDVSGEHHVHLDYHTVYKMRLTLDGKPIIEQQAEQVSD 122
Query: 125 -------------AVKKKKVTTE-----NGTTTTELEDPNKCGSCYGAETETRKCCNTCN 166
AVK V +++DP CGSCYG+ + +CCNTC+
Sbjct: 123 DKPTLDILKPPPGAVKHDLVNNAELDKIRAERAKKVKDPKYCGSCYGSNRDANQCCNTCD 182
Query: 167 EVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGL 226
+V+E+YR WA + I QC E K+K + EGC ++GY VN+V+G+FH APG
Sbjct: 183 DVRESYRRVGWAFSPNEDIEQCYEEILERKMKYSKQEGCNLHGYFLVNKVAGNFHFAPGK 242
Query: 227 SYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV----------A 276
S+ H+HD Y FNT+H I +L FG K+ PLDGT
Sbjct: 243 SFVRAQQHMHDYTNYEVDHFNTSHIINYLGFGEKIPG---LINPLDGTSKIIGYNAETGQ 299
Query: 277 KAEEGASMFNYYIKIIPTIYERLDGS----------------KLGGGDGGMPGIFFSYEL 320
+ E +++F Y++K++PTIYE+ S K +PG+FF Y+L
Sbjct: 300 RVEGESALFQYFVKVVPTIYEKYGSSNSIITNQYSVTQHSRPKNRLHPNVVPGVFFIYDL 359
Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
SP+MV ITE KS T + I G + L+D +++ KK+++
Sbjct: 360 SPIMVHITENKKSFVQFLTSLCAIIGGVFTVSALLDRVIYGVEKKMNR 407
>gi|313231322|emb|CBY08437.1| unnamed protein product [Oikopleura dioica]
Length = 386
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 142/390 (36%), Positives = 214/390 (54%), Gaps = 25/390 (6%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F +++ DA+TKP EDF E+TV G +TI C L L ++ Y EL V
Sbjct: 1 MGFLSQIRRFDAYTKPVEDFRERTVTGAVITICCSLLCMLLFFSELNYYLTTEVVSELRV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D++RG KL ++LD+ V + C+Y ++DA+D +G++ EH ++K R+ DG+ + +K
Sbjct: 61 DNTRGGKLVMNLDLTVAGLPCNYFSIDAMDLTGDR-ADAEHQLFKVRMK-DGQEVALSEK 118
Query: 121 -EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
E +NA K E T ++D +C SCYGAETE + CCN+C EV++AYR K WA
Sbjct: 119 VEEINAEKLHDEKQEEEETGLAVKD--ECQSCYGAETEEQPCCNSCEEVQQAYRNKGWAF 176
Query: 180 P-ELDTIVQCKNEY--STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
QC NE+ E+L+ T E C+++G+LEVNRVSGS I+PG + ++ VH
Sbjct: 177 DHSAQQFSQCVNEHFDLNEELQKTEGESCRVHGHLEVNRVSGSLQISPGKTLVLDGSVVH 236
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
DI+ +F+T+H I HLSFG + PLD T +AE ++Y K+IPT +
Sbjct: 237 DIRGMKHMSFDTSHTIHHLSFGEVFPGQE---NPLDNTEHEAESMNMAWHYNFKVIPTEF 293
Query: 297 ERLDGSK--------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
+LDGS+ L +PGI F +E++P+ V E +S H T +
Sbjct: 294 RKLDGSRTATNQFSVTRHEKALSQMSSRLPGINFHFEIAPIAVIKMETRRSAVHFATSVC 353
Query: 343 CNISGTYITFMLVDALLHSCVKKISKVEIG 372
I G + ++D+ +H K + K E+G
Sbjct: 354 AIIGGVWTISSILDSFIHKTNKLLIKTELG 383
>gi|224066933|ref|XP_002302286.1| predicted protein [Populus trichocarpa]
gi|222844012|gb|EEE81559.1| predicted protein [Populus trichocarpa]
Length = 377
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 138/386 (35%), Positives = 207/386 (53%), Gaps = 37/386 (9%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ DA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SRG
Sbjct: 7 KLRNFDAYPKINEDFYSRTLSGGVITLASSIVMFLLFFSELRLYLHAVTETKLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL V+H+I K+RLD G I+ Q
Sbjct: 67 ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVIESRQ------ 120
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETET---RKCCNTCNEVKEAYRYKKWALPEL 182
+G ++E P + ET CCN+C EV+EAY+ K WA+
Sbjct: 121 ---------DGIGAPKIEKPLQRHGGRLEHNETYCDEDCCNSCEEVREAYQKKGWAVTNP 171
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D + QCK E +++K+ EGC IYG+LEVN+V+G+FH APG S+ + VHVHD+ +
Sbjct: 172 DLMDQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQ 231
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
+FNT+H I L+FG PLDG E + M+ Y+IK++PT+Y + G
Sbjct: 232 KDSFNTSHKINRLAFGEYFPG---VVNPLDGVQWTQETPSGMYQYFIKVVPTVYTDVSGH 288
Query: 303 KLG-----------GGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ G D G +PG+FF Y+LSP+ V TE+ S H T + + G
Sbjct: 289 TIQSNQFSVTEHFRGADIGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 348
Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
+ ++D+ ++ K I K+EIG
Sbjct: 349 VFTVSGILDSFIYHGQKAIKKKMEIG 374
>gi|443734710|gb|ELU18591.1| hypothetical protein CAPTEDRAFT_139954 [Capitella teleta]
Length = 285
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 172/276 (62%), Gaps = 11/276 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DA+ K +EDF KT G AVTIV + + L + Y ELFVD++RG
Sbjct: 10 LRQFDAYPKTFEDFRVKTYGGAAVTIVSGILMFVLFVSEFNYYLITEVHPELFVDTARGQ 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNA 125
KL I++D+ PT+ C +L LDA+D SGEQ + V H+I+K+RLDLDG ++ EP KE +
Sbjct: 70 KLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGD 129
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K K +N L+D ++C SCYGAE+E KCCNTCNEV+EAYR K WA + I
Sbjct: 130 -KSKDFAVKN-----PLKD-DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNI 182
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +L+ EGC+IYG+LEVN+V+G+FH+APG S+S +H H+HD+Q
Sbjct: 183 EQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMK 242
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG 281
FN +H I+HLSFG D + PLD + E+G
Sbjct: 243 FNMSHRIQHLSFG---DDYPGQVNPLDASEQVTEQG 275
>gi|115464597|ref|NP_001055898.1| Os05g0490200 [Oryza sativa Japonica Group]
gi|50080302|gb|AAT69636.1| unknown protein [Oryza sativa Japonica Group]
gi|113579449|dbj|BAF17812.1| Os05g0490200 [Oryza sativa Japonica Group]
gi|218197014|gb|EEC79441.1| hypothetical protein OsI_20422 [Oryza sativa Indica Group]
gi|222632053|gb|EEE64185.1| hypothetical protein OsJ_19017 [Oryza sativa Japonica Group]
Length = 384
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 139/385 (36%), Positives = 217/385 (56%), Gaps = 22/385 (5%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F ++LKGLDA+ K EDF+++T+ GG VT+V + + L + YF +T +L VD+
Sbjct: 4 FLQKLKGLDAYPKVNEDFYKRTLSGGVVTVVASVVMLLLFVSETRSYFYSATETKLVVDT 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRG +L ++ D+ P++ C L++D +D SGEQH + H+I KRRLD G I E +KE
Sbjct: 64 SRGERLRVNFDVTFPSVPCTLLSVDTMDISGEQHHDIRHDIEKRRLDAHGNVI-EARKEG 122
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+ K + ++G ++ E+ CG+CYGAE +CCN+C EV+EAY+ K WAL
Sbjct: 123 IGGAKIESPLQKHGGRLSKGEE--YCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D I QC E E++K EGC ++G+L+V++V+G+ H APG + ++++V ++
Sbjct: 181 DLIDQCTREDFVERVKTQQGEGCNVHGFLDVSKVAGNLHFAPGKGFYESNINVPELSA-L 239
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
FN TH I LSFG + PLDG + Y+IK++PTIY L G
Sbjct: 240 EHGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDLRGR 296
Query: 303 KLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
K+ DG + PG+FF Y+ SP+ V TE++ SL H T + + G
Sbjct: 297 KIHSNQFSVTEHFRDGNIRPKPQPGVFFFYDFSPIKVIFTEENSSLLHYLTNLCAIVGGV 356
Query: 349 YITFMLVDALLHSCVKKI-SKVEIG 372
+ ++D+ ++ K + K+E+G
Sbjct: 357 FTVSGIIDSFIYHGQKALKKKMELG 381
>gi|226494401|ref|NP_001141198.1| uncharacterized protein LOC100273285 [Zea mays]
gi|194703210|gb|ACF85689.1| unknown [Zea mays]
gi|238011828|gb|ACR36949.1| unknown [Zea mays]
gi|413945823|gb|AFW78472.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 384
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/386 (35%), Positives = 212/386 (54%), Gaps = 22/386 (5%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
F +RLK LDA+ K EDF+++T+ GG VT+V + + L + YF +T +L VD
Sbjct: 3 AFLQRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSATETKLVVD 62
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+SRG +L ++ DI +I C L++D +D SGEQH + H+I K RLD G I E +K
Sbjct: 63 TSRGERLRVNFDITFLSIPCTLLSVDTMDISGEQHQDIRHDIEKIRLDAHGNVI-EARKV 121
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ K ++ ++G + E CG+CYGAE +CCN+C EV+EAY+ K WAL
Sbjct: 122 SIGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D I QC E E++K EGC ++G+L+V++V+G+FH APG + +++ V ++
Sbjct: 180 PDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-L 238
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
FN TH I LSFG + PLDG + Y+IK++PTIY + G
Sbjct: 239 LEGGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRG 295
Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ DG + PG+FF Y+ SP+ V TE+S+SL H T + + G
Sbjct: 296 HNIHSNQFSVTEHFRDGNVRPKPQPGVFFFYDFSPIKVIFTEESRSLLHYLTNLCAIVGG 355
Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
+ ++D+ ++ K + K+E+G
Sbjct: 356 VFTVSGIIDSFIYHGQKALKKKMELG 381
>gi|443734706|gb|ELU18587.1| hypothetical protein CAPTEDRAFT_139951 [Capitella teleta]
Length = 285
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 171/276 (61%), Gaps = 11/276 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DA+ K EDF KT G AVTIV + + L + Y ELFVD++RG
Sbjct: 10 LRQFDAYPKTLEDFRVKTYGGAAVTIVSGILMFVLFVSEFNYYLTTEVHPELFVDTARGQ 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNA 125
KL I++D+ PT+ C +L LDA+D SGEQ + V H+I+K+RLDLDG ++ EP KE +
Sbjct: 70 KLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGD 129
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K K +N L+D ++C SCYGAE+E KCCNTCNEV+EAYR K WA + I
Sbjct: 130 -KSKDFAVKN-----PLKD-DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNI 182
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +L+ EGC+IYG+LEVN+V+G+FH+APG S+S +H H+HD+Q
Sbjct: 183 EQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMK 242
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG 281
FN +H I+HLSFG D + PLD + E+G
Sbjct: 243 FNMSHRIQHLSFG---DDYPGQVNPLDASEQVTEQG 275
>gi|297850670|ref|XP_002893216.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
lyrata]
gi|297339058|gb|EFH69475.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
lyrata]
Length = 386
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 131/383 (34%), Positives = 205/383 (53%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ LDA+ K EDF+ +T+ GG +T+V + L ++ Y T +L VD+SRG
Sbjct: 7 RLRNLDAYPKINEDFYRRTLSGGVITLVSSFVMLILFFSELQLYIHPVTETQLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+ D+ P + C ++LD++D SGE+HL V H+I KRRLD G I+ Q + +
Sbjct: 67 EKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHT 126
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + G + CGSC+GAE CCN+C EV+EAYR K WAL + ++I
Sbjct: 127 KIEKPLQKHGGRLE---HNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDPESI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +K+K+ EGC ++G+LEVN+V+G+FH PG S+ + HD+ +
Sbjct: 184 DQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGN 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL- 304
+N +H + L+FG PLDG + + ++ Y+IK++P+IY + + +
Sbjct: 244 YNISHTVNRLAFGDFFPG---VVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300
Query: 305 --------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G PG+FF Y+LSP+ V E+ H T + + G +
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKVIFEEQHVEFLHFLTNVCAIVGGIFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
+VD+ ++ + I K+EIG
Sbjct: 361 VSGIVDSFIYHGQRAIKKKMEIG 383
>gi|18395087|ref|NP_564162.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|9454530|gb|AAF87853.1|AC073942_7 Contains similarity to a PR00989 protein from Homo sapiens
gi|7959731. EST gb|AI995648 comes from this gene
[Arabidopsis thaliana]
gi|13878151|gb|AAK44153.1|AF370338_1 unknown protein [Arabidopsis thaliana]
gi|21281042|gb|AAM44956.1| unknown protein [Arabidopsis thaliana]
gi|21553754|gb|AAM62847.1| unknown [Arabidopsis thaliana]
gi|332192089|gb|AEE30210.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 386
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/383 (33%), Positives = 205/383 (53%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SRG
Sbjct: 7 RLRNLDAYPKINEDFYRRTLSGGVITLASSIVMLILFFSELQLYIHPVTETQLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+ D+ P + C ++LD++D SGE+HL V H+I KRRLD G I+ Q + +
Sbjct: 67 EKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHT 126
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + G + CGSC+GAE CCN+C EV+EAYR K WAL + ++I
Sbjct: 127 KIEKPLQKHGGRLE---HNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDPESI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +K+K+ EGC ++G+LEVN+V+G+FH PG S+ + HD+ +
Sbjct: 184 DQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGN 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL- 304
+N +H + L+FG PLDG + + ++ Y+IK++P+IY + + +
Sbjct: 244 YNISHKVNRLAFGDFFPG---VVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300
Query: 305 --------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G PG+FF Y+LSP+ V E+ H T + + G +
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKVIFEEQHVEFLHFLTNVCAIVGGIFT 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
+VD+ ++ + I K+EIG
Sbjct: 361 VSGIVDSFIYHGQRAIKKKMEIG 383
>gi|324511490|gb|ADY44781.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Ascaris suum]
Length = 382
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 138/390 (35%), Positives = 204/390 (52%), Gaps = 31/390 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M RL+ LDA+TKP +DF KT GGAVT++ L I L + + E+LFV
Sbjct: 1 MSLLARLRDLDAYTKPLDDFRVKTFTGGAVTLLSTLVIVVLFVSETISFLSTDVVEQLFV 60
Query: 61 DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
DS S +L ++ D+ + C + +D +D SG+ V+ ++YK+RLD G I
Sbjct: 61 DSTSADQRLDVNFDVTFTKLPCAMVTVDVMDVSGDNQDDVQDDVYKQRLDQQGNNITG-- 118
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
A + V T ++L KCGSCYGA + +CCNTC +VKEAY + W +
Sbjct: 119 ----QAAVRLGVNVNTSTPASQLTTEPKCGSCYGA---SDRCCNTCEDVKEAYSARGWQM 171
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+++++ QCK++ + + EGC++YG ++V +V+G+FHIAPG H HD+
Sbjct: 172 LDIESVEQCKSDAWVRTINDFKGEGCRVYGKVQVAKVAGNFHIAPGDPLRSLRSHFHDLH 231
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS--MFNYYIKIIPTIYE 297
A F+T H I HLSFG + PLDG + +S MF YY+K++PT+YE
Sbjct: 232 SIAPAKFDTAHIINHLSFGTPFPG---KNYPLDGKSFGTNKDSSGIMFQYYMKVVPTMYE 288
Query: 298 RLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
LD S +G G G+PG F YE SPLMVK E+ + L +
Sbjct: 289 FLDSSNNIFSHQFSVTTHQKDIGMGASGLPGFFVQYEFSPLMVKYEERRQPLSTFLVSLC 348
Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEI 371
I G + L+D+L++ + I KVE+
Sbjct: 349 AIIGGVFTVASLIDSLIYHSSRAIQHKVEM 378
>gi|217071774|gb|ACJ84247.1| unknown [Medicago truncatula]
Length = 384
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 141/384 (36%), Positives = 205/384 (53%), Gaps = 26/384 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG VT+V + +L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFISELRLYLYTVTESKLLVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LD +D SGE+H + HNI K+R+D +GK I E +KE + A
Sbjct: 67 ETLNINFDVTFPAVRCSILSLDTMDISGERHHDILHNIMKQRIDANGKVI-EARKEGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K ++ ++G D CGSC+GAE CCN C EV+EAYR K WAL +D I
Sbjct: 126 PKIERPLQKHGGRLEH--DEKYCGSCFGAEESDDHCCNNCEEVREAYRKKGWALTNIDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E +K+K+ EGC I+G LEVN+V+G+FH A G S+ + + + D+
Sbjct: 184 DQCQREGFVQKVKDEEGEGCNIHGSLEVNKVAGNFHFATGQSFLQSAIFLTDLLALQDNH 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--------- 296
+N +H I LSFG PLDG M Y+IK++PT+Y
Sbjct: 244 YNISHQINKLSFG---HHYPGLVNPLDGIKWVQGNDHGMCQYFIKVVPTVYTDIRGRVIH 300
Query: 297 -------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
E S+LG +PG+FF Y++SP+ V E+ H T I I G +
Sbjct: 301 SNQYSVTEHFKSSELG---AAVPGVFFFYDISPIKVNFKEEHIPFLHFLTNICAIIGGIF 357
Query: 350 ITFMLVDALLHSCVKKI-SKVEIG 372
+VD+ ++ K I K+EIG
Sbjct: 358 TIAGIVDSSIYYGQKTIKKKMEIG 381
>gi|363806898|ref|NP_001242045.1| uncharacterized protein LOC100781612 [Glycine max]
gi|255644390|gb|ACU22700.1| unknown [Glycine max]
Length = 384
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/384 (35%), Positives = 206/384 (53%), Gaps = 26/384 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG VT+V + +L ++ T +L VD+SRG
Sbjct: 7 KLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFFSELSLCLYTVTESKLLVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C L+LDA+D SGEQHL + HNI K+R+D +G I+E +K+ + A
Sbjct: 67 DTLHINFDVTFPAVRCSILSLDAMDISGEQHLDIRHNIVKKRIDANGNVIEE-RKDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K ++G D CGSC+GAE CCN+C EV+EAYR K WA+ +D I
Sbjct: 126 PKIEKPLQKHGGRLGH--DEKYCGSCFGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E +++K+ EGC + G LEVN+V+G+FH A G S+ + + + D+
Sbjct: 184 DQCQREGYVQRVKDEEGEGCNLQGSLEVNKVAGNFHFATGKSFLQSAIFLADVLALQDNH 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--------- 296
+N +H I LSFG PLDG M+ Y+IK++PTIY
Sbjct: 244 YNISHRINKLSFGHHFPG---LVNPLDGVRWVQGPTHGMYQYFIKVVPTIYTDIRGRVIH 300
Query: 297 -------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
E S+LG +PG+FF Y++SP+ V E+ H T I I G
Sbjct: 301 SNQYSVTEHFKSSELG---VAVPGVFFFYDISPIKVNFKEEHTPFLHFLTNICAIIGGVL 357
Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
++D+ ++ + I K+E+G
Sbjct: 358 AVAGIIDSSIYYGQRTIKRKMELG 381
>gi|159470839|ref|XP_001693564.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283067|gb|EDP08818.1| predicted protein [Chlamydomonas reinhardtii]
Length = 388
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 145/393 (36%), Positives = 202/393 (51%), Gaps = 36/393 (9%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F +LK LDA+ K EDF KT+ GG +TIV + + L ++ + S+ EL VD
Sbjct: 6 FLGKLKALDAYPKINEDFFTKTMSGGIITIVSSVVMVLLFLSELRLFLTTSSAHELSVDV 65
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK--RRLDLDGKPIQEPQK 120
RG K+ IH D+ P + C +L+LDA+D SGE HL + +Y RR + E +
Sbjct: 66 GRGEKIKIHFDVTFPKVPCAWLSLDAMDISGELHLDLVVELYTLWRR---GAAGLTEGKG 122
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ + + N T N CGSCYGAE + CCNTC+EV+ AYR K WAL
Sbjct: 123 GGIGVLSVSVSRSRNATALA-----NGCGSCYGAEDKQGDCCNTCDEVRAAYRRKGWALS 177
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+D I QC ++ TE +K EGC I +EVN+V+G+FH APG SY +HVHDI P
Sbjct: 178 NVDHIEQCAHDLYTEAIKEQAGEGCHI--GVEVNKVAGNFHFAPGRSYQQGSMHVHDIAP 235
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-----TVAKAEEGASMFNYYIKIIPTI 295
+ A + H I LSFG + + PLDG A A MF Y++K++PT
Sbjct: 236 FGDAVIDFRHVIHKLSFG---EPYPGMKNPLDGAKAGQAAAAAAAATGMFQYFLKVVPTS 292
Query: 296 YERLDGSKL---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
Y L L GG +PG+FF Y+LSP+ VKI E S T
Sbjct: 293 YTDLSNKTLSTNQFSVTENFREAQGGAGRTLPGVFFFYDLSPIKVKIVEHGSSFLSFLTS 352
Query: 341 IMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
+ + G + +VDA +++ + I K+E+G
Sbjct: 353 VCAIVGGVFTVSGIVDAFVYTGTRMIKKKMELG 385
>gi|449438787|ref|XP_004137169.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 386
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 142/383 (37%), Positives = 208/383 (54%), Gaps = 22/383 (5%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T GG +T+ F+ +L ++ Y T +L VD+SRG
Sbjct: 7 KLRNLDAYPKINEDFYRRTFSGGLITLASSFFMLFLFFSELRMYLHAKTETQLVVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+L I+ D+ P I C L+LDA+D SGEQHL + HNI K+R+D G I E + + + A
Sbjct: 67 GELHINFDLSFPAIPCSILSLDAIDISGEQHLDIRHNIIKKRIDHLGTVI-EARPDGIGA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K +K ++G E CGSC+GAE CCN+C EV+EAYR K WA+ D I
Sbjct: 126 PKIEKPLQKHGGRLEHNE--TYCGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E +K+K+ EGC I G LEVN+V+GSFH PG S+ + + + ++
Sbjct: 184 DQCQREDFIQKVKDEEGEGCNIEGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSD 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
+N +H I L+FG D PLDG + E M Y++K++PTIY+ + G +
Sbjct: 244 YNVSHRINRLAFG---NHYDGLVNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVH 300
Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G +PG+FF Y+LSP+ V TE+ H T I I G +
Sbjct: 301 SNQYSVTEHFKSVEFGSSQSIPGVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFS 360
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++DA ++ +K+ KVEIG
Sbjct: 361 VAGIIDAFIYHGQRKMKKKVEIG 383
>gi|413949704|gb|AFW82353.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 398
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/360 (37%), Positives = 199/360 (55%), Gaps = 22/360 (6%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
F RLK LDA+ K EDF+++T+ GG VT+V + + L + YF ST +L VD
Sbjct: 3 AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+SRG +L ++ DI P+I C L++D D SGEQH + H+I KRRL+ G I E +KE
Sbjct: 63 TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ K ++ ++G + E CG+CYGAE +CCN+C EV+EAY+ K WAL
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D I QC E +++K EGC + G+L+V++V+G+FH APG + +++ V ++
Sbjct: 180 PDLIDQCAREDFIDRVKTQQDEGCNVLGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-L 238
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
FN +H I LSFG + PLDG + Y+IK++PTIY + G
Sbjct: 239 LEGGFNISHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRG 295
Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ DG + PG+FF Y+ SP+ V TE+++SL H T +C I G
Sbjct: 296 RGIHSNQFSVTEHFRDGNVRPKSQPGVFFFYDFSPIKVIFTEENRSLLHYLTN-LCAIVG 354
>gi|358054679|dbj|GAA99605.1| hypothetical protein E5Q_06306 [Mixia osmundae IAM 14324]
Length = 424
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 137/415 (33%), Positives = 204/415 (49%), Gaps = 53/415 (12%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F KGLDAF K ED KT +GG +T+V + I+ L ++ DY +V + VD
Sbjct: 7 FGGAFKGLDAFGKTLEDVKIKTGFGGILTLVSFTLIAALTLMEFVDYRRVHLHPSIVVDK 66
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRG KL +HL+I P + C L++D +D SGE + H+I K RLD G +Q +
Sbjct: 67 SRGEKLVVHLNITFPRVPCYLLSVDIMDISGEHQNDIHHDILKNRLDKSGALVQATRDST 126
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+ ++ V + +P CGSCYG CCNTC+EV+E+Y + W+
Sbjct: 127 LKGELERAVGVK--------REPGYCGSCYGGAPGDSGCCNTCDEVRESYVRRGWSFVNP 178
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D I QC E +EK+K EGC + G ++VN+V G+FH++PG S+ N HVHD+ PY
Sbjct: 179 DGIDQCVREGFSEKIKEQSEEGCNVAGQVKVNKVIGNFHLSPGKSFQSNMHHVHDLVPYL 238
Query: 243 SAA--FNTTHHIRHLSFGIKLQDDDER-----------RKPLDGTVAKAEEGASMFNYYI 289
+A + H I SF + D R PL G A E+ MF Y++
Sbjct: 239 AAGQQHDFGHIINRFSFAAEGDDGFNRETARLKQSLNIEDPLTGVRAHTEQSNYMFQYFV 298
Query: 290 KIIPTIYERLDGSKLGG-----------------------------GDGGMPGIFFSYEL 320
K++ T ++ LDG L G G+PG+FF+YE+
Sbjct: 299 KVVSTKFKTLDGRTLSSHQYSVTQYERDLSKGNKPGKDEDGHQTSHGYAGVPGLFFNYEI 358
Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
SP++V E+ +S H T + G L+D L++S ++++ GGK+
Sbjct: 359 SPMLVVHREERQSFAHFITSTCAIVGGILTVAGLIDTLVYSSQ---TRLQAGGKS 410
>gi|268581953|ref|XP_002645960.1| C. briggsae CBR-ERV-46 protein [Caenorhabditis briggsae]
Length = 380
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 129/383 (33%), Positives = 198/383 (51%), Gaps = 26/383 (6%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DA+ KP +DF KT+ GG VT++ + I LI ++ + + E LFVDS+
Sbjct: 7 LKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVLETRQFLSTAVLEHLFVDSTTSD 66
Query: 67 -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
++ I DI + C+++ +D +D S E ++ +IY+ RLD DG+ + E +++
Sbjct: 67 ERVHIEFDITFNKLPCNFITVDVMDVSSEAQENINDDIYRLRLDADGRNVSESAQKI--E 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + K E TEL KCGSCYGA + CCNTC +VK AY K W + ++ +
Sbjct: 125 INQNKTIGE----PTELVQEVKCGSCYGAVADG-ICCNTCEDVKNAYAVKGWQV-NIEEV 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCKN+ ++ EGC++YG ++V +V+G+FH+APG + HVHD+
Sbjct: 179 EQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVK 238
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
F+ +H + H+SFG + PLDG V G M+ YY+K++PT Y+ LDG
Sbjct: 239 FDASHTVNHISFGKSFPG---KNYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQ 295
Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
LG G+PG F YE SPLMV+ E +SL + + G +
Sbjct: 296 SHQFSVTTHKKDLGFRQAGLPGFFLQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAM 355
Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
LVD ++ + + GGK
Sbjct: 356 AQLVDITIYHTSRYMKSRIAGGK 378
>gi|341884797|gb|EGT40732.1| CBN-ERV-46 protein [Caenorhabditis brenneri]
Length = 379
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/383 (33%), Positives = 195/383 (50%), Gaps = 27/383 (7%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DA+ KP +DF KT+ GG VT++ + I LI ++ + E LFVDS+
Sbjct: 7 LKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVMETRQFLSTDVLEHLFVDSTTSD 66
Query: 67 -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
++ I DI + C+++ +D +D S E ++ +IY+ RLD DGK + E +++
Sbjct: 67 ERVHIEFDITFNKLPCNFITVDVMDVSSEAQDNINDDIYRLRLDADGKNVSETAQKI--- 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
++ TEL KCGSCYGA + CCNTC +VK AY K W + ++ +
Sbjct: 124 ----EINQNKTVDATELIQEVKCGSCYGAAADG-ICCNTCEDVKNAYAIKGWQV-NIEEV 177
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCKN+ ++ EGC++YG ++V +V+G+FH+APG + HVHD+
Sbjct: 178 EQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQSMRSHVHDLHNLDPVK 237
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
F+ +H + H+SFG + PLDG V G M+ YY+K++PT Y+ LDG
Sbjct: 238 FDASHTVNHISFGKSFPG---KNYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQ 294
Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
LG G+PG F YE SPLMV+ E +SL + + G +
Sbjct: 295 SHQFSVTTHKKDLGFRQSGLPGFFLQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAM 354
Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
LVD ++ + + GGK
Sbjct: 355 AQLVDITIYHSSRYMKNRIAGGK 377
>gi|224073341|ref|XP_002304080.1| predicted protein [Populus trichocarpa]
gi|222841512|gb|EEE79059.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 204/384 (53%), Gaps = 22/384 (5%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
++++ LDA+ K EDF+ +T+ GG +T++ + I +L ++ Y T +L VD+SR
Sbjct: 6 QKVRNLDAYPKINEDFYSRTLSGGLITLISSVLILFLFFSELSLYLHKVTETKLLVDTSR 65
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G L I+ D+ P I C L++DA+D SGEQHL + H+I K+R++ G I E ++E +
Sbjct: 66 GQSLRINFDVTFPAIRCSLLSVDAIDISGEQHLDIRHDISKKRINAHGDVI-EVRQEGIG 124
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
A K + +G E+ CGSC+G E CCNTC EV+EAYR K WA+ +D
Sbjct: 125 APKIDRPLQSHGGRLGHNEE--YCGSCFGGEMSHDDCCNTCEEVREAYRRKGWAMTNMDL 182
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QCK E + +K+ EGC I G LEVNRV+GSFH AP S+ +++ + D+
Sbjct: 183 IDQCKREGFIQMIKDEEGEGCNINGSLEVNRVAGSFHFAPWKSFHLSNFLIQDLLDLQKD 242
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
++N +H I L+FG PL G + + ++IK++PTIY + G +
Sbjct: 243 SYNISHRINRLAFGDYFPG---VVNPLAGIQLMHDTPNGVQQFFIKVVPTIYTDIRGRTV 299
Query: 305 GGGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
+PG++F Y+ SP+ V E+ S H T I I G +
Sbjct: 300 HSNQYSATEHFKKSELTPLDSLPGVYFFYDFSPIKVIFKEEHISFLHFMTSICAIIGGIF 359
Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
++D+ ++ + I+ KV IG
Sbjct: 360 TIAGIIDSFIYYGQRAITKKVGIG 383
>gi|326497521|dbj|BAK05850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 391
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 123/341 (36%), Positives = 188/341 (55%), Gaps = 21/341 (6%)
Query: 49 YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
Y T L VD+SRG KL I+ DI P + C +++D +D SG++HL V+H+++K+R+
Sbjct: 55 YLHAVTETTLRVDTSRGEKLRINFDITFPALQCSIISVDVMDISGQEHLDVKHDVFKQRI 114
Query: 109 DLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV 168
D G I Q + V +K +K +G E CGSCYGA+ +CCN+C +V
Sbjct: 115 DAHGNVIATKQ-DAVGGMKVEKPLQHHGGRLEHNE--TYCGSCYGAQESPEQCCNSCEDV 171
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
+EAYR K W + D+I QCK+E + +K+ EGC IYG+LE+N+V+G+FH APG S+
Sbjct: 172 REAYRKKGWGVSNPDSIDQCKSEGFLQTIKDEEGEGCNIYGFLEINKVAGNFHFAPGKSF 231
Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
++VHVHD+ P+ +FN +H I LSFG PLDG M Y+
Sbjct: 232 QQSNVHVHDLLPFQKDSFNLSHKINKLSFGEPFPG---VINPLDGAQWIQHSSYGMAQYF 288
Query: 289 IKIIPTIYERLDGSKL-----------GGGDGG----MPGIFFSYELSPLMVKITEKSKS 333
+K++PT+Y ++ + GD G +PG+FF Y+LSP+ V TE+ S
Sbjct: 289 VKVVPTVYSHINEQIILSNQFSVTEHSRSGDSGRVQALPGVFFFYDLSPIKVTFTERHVS 348
Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
H T + + G + ++D+ ++ + I+K GK
Sbjct: 349 FLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAITKKRELGK 389
>gi|308483051|ref|XP_003103728.1| CRE-ERV-46 protein [Caenorhabditis remanei]
gi|308259746|gb|EFP03699.1| CRE-ERV-46 protein [Caenorhabditis remanei]
Length = 380
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 128/383 (33%), Positives = 197/383 (51%), Gaps = 26/383 (6%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DA+ KP +DF KT+ GG VT++ + I LI ++ + E LFVDS+
Sbjct: 7 LKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVLETKQFLSTDVLEHLFVDSTTSD 66
Query: 67 -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
++ I DI + C+++ +D +D S E ++ +IY+ RLD DG+ I E +++
Sbjct: 67 ERVHIEFDITFNKLPCNFITVDVMDVSSEAQDNINDDIYRLRLDADGRNISESAQKI--E 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + K + TEL KCGSCYGA + CCNTC +VK AY K W + ++ +
Sbjct: 125 INQNKTIAD----PTELTQEVKCGSCYGAAADG-ICCNTCEDVKSAYAIKGWQV-NIEEV 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCKN+ ++ EGC++YG ++V +V+G+FH+APG + HVHD+
Sbjct: 179 EQCKNDKWVKEFTEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVK 238
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
F+ +H + HL+FG + PLDG V G M+ YY+K++PT Y+ LDG
Sbjct: 239 FDASHTVNHLTFGKSFPG---KHYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQ 295
Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
LG G+PG F YE SPLMV+ E +SL + + G +
Sbjct: 296 SHQFSVTTHKKDLGFRQSGLPGFFVQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAM 355
Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
L+D ++ + + GGK
Sbjct: 356 AQLIDITIYQTHRYMKNRIAGGK 378
>gi|224059030|ref|XP_002299683.1| predicted protein [Populus trichocarpa]
gi|222846941|gb|EEE84488.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 203/384 (52%), Gaps = 22/384 (5%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
++L+ LDA+ K EDF+ +T+ GG +T++ + + +L + Y T +L VD++R
Sbjct: 6 QKLRNLDAYPKINEDFYSRTLSGGLITLISSIIMLFLFFSEFSLYLHAVTETKLLVDTTR 65
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G L I+ DI P I C L++DA+D SGEQH + H+I K+R++ G I E +++ +
Sbjct: 66 GQTLRINFDITFPAIRCSLLSVDAIDISGEQHHDIRHDITKKRINAHGDVI-EVRQDGIG 124
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
A K K ++G E+ CGSC+GAE CCN+C+EV+EAYR K WAL +D
Sbjct: 125 APKIDKPLQKHGGRLEHNEE--YCGSCFGAEMSDDHCCNSCDEVREAYRKKGWALTNMDL 182
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
I QC E + +K+ EGC I G LEVNRV+G+FH PG S+ ++ + D+
Sbjct: 183 IDQCIREGFVQMIKDEEGEGCNINGSLEVNRVAGNFHFVPGKSFHQSNFQLLDLLDMQKE 242
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
++N +H I L+FG PLDG + ++IK++PTIY + G +
Sbjct: 243 SYNISHRINRLAFGDYFPG---VVNPLDGIQLMHGTQNGVQQFFIKVVPTIYTDIRGRTV 299
Query: 305 GGGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
+PG++F Y+ SP+ V E+ S H T I I G +
Sbjct: 300 HSNQYSVTEHFTKSELMRLDSLPGVYFIYDFSPIKVTFKEEHTSFLHFMTSICAIIGGIF 359
Query: 350 ITFMLVDALLHSCVKKI-SKVEIG 372
+VD+ ++ + I K+EIG
Sbjct: 360 TIAGIVDSFIYHGRRAIKKKMEIG 383
>gi|393907059|gb|EFO23462.2| hypothetical protein LOAG_05019 [Loa loa]
Length = 378
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 135/390 (34%), Positives = 203/390 (52%), Gaps = 42/390 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M ERLK DA+TKP +DF +T GGAVT+V I ++ + + V E+L+V
Sbjct: 1 MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60
Query: 61 DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
DS+ ++ ++ DI P + C + +D +D SG+ + ++YK L LDGK +
Sbjct: 61 DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIRDDVYKISL-LDGKEGNGVR 119
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKK 176
+EV N T+T P CGSCYGA+ CCNTC EVKEAY K
Sbjct: 120 QEV------------NINTSTASSVPASQVLCGSCYGAK---EGCCNTCEEVKEAYMRKG 164
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W L ++T+ QCK++ +K+ EGC++YG ++V +V+G+FHIAPG + H H
Sbjct: 165 WELINIETVEQCKSDLWVKKMSEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLRAHRSHFH 224
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV---AKAEEGASMFNYYIKIIP 293
D+ + + F+T+H + H SFG + PLDG A+ +G M+ Y++K++P
Sbjct: 225 DLHSLSPSKFDTSHTVNHFSFGNSFPG---KVYPLDGKFFGSARNSDGI-MYQYHLKLVP 280
Query: 294 TIYERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
T Y LD ++ + G G+PG F YE SPLMVK E+ +SL
Sbjct: 281 TSYVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFVQYEFSPLMVKYEERQQSLSTFL 340
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISK 368
I I G + L+DA ++ + IS+
Sbjct: 341 VSICAIIGGIFTVASLIDAFIYRSGRIISQ 370
>gi|384252531|gb|EIE26007.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 386
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 138/392 (35%), Positives = 210/392 (53%), Gaps = 40/392 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK LDA+ K EDF ++T+ GG +TI + + L ++ + +++TT EL VD++RG
Sbjct: 7 KLKNLDAYPKVNEDFFQRTLSGGIITIGSSIIMLCLFLSELSLFMKITTTNELSVDTTRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+L I+ D+ P + C++++LD +D SGE HL V+H++YKRRLD +G I + +
Sbjct: 67 DQLSINFDMTFPALPCEWISLDLMDISGEMHLDVDHDVYKRRLDSNGVVIPD-------S 119
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
++K +V E T + +CGSCYGA + +CCN C EV+ AYR K W + I
Sbjct: 120 IEKHQVGPELDDTLLHKANETECGSCYGAAPD-EECCNNCEEVRAAYRRKGWGFTDPQQI 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E EKL+ EGC ++G L VN+V+G+FH APG S+ +HVHD+ P+
Sbjct: 179 SQCAKEGFVEKLRAQEGEGCHMWGSLAVNKVAGNFHFAPGKSFQQGPMHVHDLVPFQGVT 238
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDG------TVAKAEEGASMFNYYIKIIPTIY--- 296
F+ +H I LSFG + PLD + + Y++K++PTIY
Sbjct: 239 FDLSHRIDKLSFG---HEYPGMTNPLDRVNLPKFNTRNPQGLPGAYQYFLKVVPTIYVNS 295
Query: 297 -------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
E GS+ +PG+FF Y+LSP+ VK E S H T +
Sbjct: 296 HNHTINSNQYSVTEHFKGSQ--DFQAQLPGVFFYYDLSPIKVKYHETRMSFLHFLTSVCA 353
Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
+ G + +VDA + H +KK KV++G
Sbjct: 354 IVGGIFTVAGIVDAFIYHGHQAIKK--KVDLG 383
>gi|17568835|ref|NP_510575.1| Protein ERV-46 [Caenorhabditis elegans]
gi|3878494|emb|CAB01889.1| Protein ERV-46 [Caenorhabditis elegans]
Length = 380
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 127/383 (33%), Positives = 199/383 (51%), Gaps = 26/383 (6%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DA+ KP +DF KT+ GG VT++ + I LI ++ + E LFVDS+
Sbjct: 7 LKHFDAYRKPMDDFRVKTLSGGLVTLIATIAIVLLIVLETKQFLSTEVLEHLFVDSTTSD 66
Query: 67 -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
++ I DI + C+++ +D +D S E ++ +IY+ RLD +G+ I E +++
Sbjct: 67 ERVHIEFDITFTKLPCNFITVDVMDVSSEAQENINDDIYRLRLDPEGRNISESAQKI--E 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + K + E TT++ KCGSCYGA + CCNTC++VK AY K W + ++ +
Sbjct: 125 INQNKTSVE----TTDVIQEVKCGSCYGAAADG-ICCNTCDDVKSAYAVKGWQV-NIEEV 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCKN+ ++ EGC++YG ++V +V+G+FH+APG + HVHD+
Sbjct: 179 EQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVK 238
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
F+ +H + H+SFG + PLDG V G M+ YY+K++PT Y+ LDG
Sbjct: 239 FDASHTVNHVSFGKSFPG---KNYPLDGKVNTDNRGGIMYQYYVKVVPTRYDYLDGRVDQ 295
Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
LG G+PG F YE SPLMV+ E +S + + G +
Sbjct: 296 SHQFSVTTHKKDLGFRQSGLPGFFLQYEFSPLMVQYEEFRQSFASFLVSLCAIVGGVFAM 355
Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
LVD ++ + + GGK
Sbjct: 356 AQLVDITIYHSSRYMKSRIAGGK 378
>gi|226498912|ref|NP_001150650.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|194699894|gb|ACF84031.1| unknown [Zea mays]
gi|195640862|gb|ACG39899.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
Length = 387
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 139/392 (35%), Positives = 215/392 (54%), Gaps = 28/392 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L+ LDA+ K EDF+ +T+ GG +TI+ L I L ++ Y +T +L V
Sbjct: 1 MELWSKLRNLDAYPKVNEDFYSRTLSGGLITILSSLAILLLFFSEIRLYLYSATESKLTV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG +L I+ D+ P + C +A+D +D SGEQH + H+I K+R+D G I E +K
Sbjct: 61 DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDHLGNVI-ESRK 119
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ V A K ++ ++G E CGSCYGAE +CCN+C EV++AYR K WA+
Sbjct: 120 DGVGAPKIERPLQKHGGRLDHNE--VYCGSCYGAEESDDQCCNSCEEVRDAYRKKGWAVN 177
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
++ I QCK E ++LK+ EGC I+G++ VN+V+G+FH APG S + + D+
Sbjct: 178 NVELIDQCKREGYVQRLKDEQGEGCTIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLN 237
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
+N +H I LSFG + PLDG + G + M+ Y++K++PTIY
Sbjct: 238 LQPETYNISHKINKLSFGEEFPG---VVNPLDGVEWIQDNSNGLTGMYQYFVKVVPTIYT 294
Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ G K+ + PG++F YE SP+ V TE++ SL H T I
Sbjct: 295 DIRGRKIHSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354
Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
+ G + ++D+ + H +KK K+E+G
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMELG 384
>gi|393212588|gb|EJC98088.1| endoplasmic reticulum-derived transport vesicle ERV46 [Fomitiporia
mediterranea MF3/22]
Length = 421
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 201/412 (48%), Gaps = 56/412 (13%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LKG+DAF K ED KT G +TI+ I ++ DY +V+ + VD SRG
Sbjct: 9 LKGIDAFGKTMEDVKVKTKTGAFLTILSAAIILAFTTIEFLDYRRVNLETSIVVDRSRGE 68
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L + +++ P + C L+LD +D SGE + HNI K RLD +G + +A
Sbjct: 69 RLTVRMNVTFPKVPCYLLSLDVMDISGEAQRDISHNIVKARLDANGAVVPNSH----SAE 124
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ K+ N T N CGSCYG CCNTC EV++AY K W+ D+I
Sbjct: 125 LRNKLDVMNDQT-----QDNYCGSCYGGVAPEGGCCNTCEEVRQAYVNKGWSFSNPDSIE 179
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC E+ +EKL TEGC I G L VN+V G+ H++PG S+ N++++H++ PY
Sbjct: 180 QCVREHWSEKLHEQSTEGCNISGRLRVNKVIGNIHLSPGRSFQTNYMNIHELVPYLKEDK 239
Query: 247 NTTHHIRHLSFGIKLQDDDE---RRK---------------PLDGTVAKAEEGASMFNYY 288
N H H+ + + DDE R+K PLDG V KA MF Y+
Sbjct: 240 NR-HDFGHIVHELSFEGDDEYNFRKKERSKGIKKKLGIEANPLDGAVGKAASLQYMFQYF 298
Query: 289 IKIIPTIYERLDGSKLG---------------GGDG-------------GMPGIFFSYEL 320
+K++ T +E +DG + G G GMPG+F +YE+
Sbjct: 299 VKVVSTKFELMDGQTVKTHQYSATHFERDLTTGAIGQTKEGVHIAHTNVGMPGVFINYEI 358
Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
SPL+V +E +S H T I G +VD+++ + +++ K +G
Sbjct: 359 SPLLVVHSETRQSFAHFLTSTCAIIGGVLTIATIVDSVVFATGRRLKKSGVG 410
>gi|328770814|gb|EGF80855.1| hypothetical protein BATDEDRAFT_19389 [Batrachochytrium
dendrobatidis JAM81]
Length = 409
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 137/395 (34%), Positives = 203/395 (51%), Gaps = 39/395 (9%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DA+ KP +DF +T+ G VT+V L I +L + D++Q L VD R
Sbjct: 28 LKKYDAYAKPLDDFRIRTISGALVTVVSTLVILFLTFSEFTDWYQKEMLPSLEVDKGRKE 87
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ I+L++ + C L++D +D SGE ++ H+++K R+D G + E QK++ N
Sbjct: 88 KMNINLNVTFYHMPCYLLSVDVMDVSGEHQNNLPHSMHKVRIDQLGN-LLEKQKKLGN-- 144
Query: 127 KKKKVTTENGTTTTELED----PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
T + E+ D P CGSCYG KCCNTC +V+EAY W+ +
Sbjct: 145 ------TNSSGVKKEIRDMALDPKYCGSCYGGVAPESKCCNTCEQVQEAYERSGWSFTDP 198
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D+I QC E +++++ E C IYG++EVN+V G+ H APG S+ N +HVHD+ Y
Sbjct: 199 DSIEQCVREGWSKRMETQINEACNIYGHIEVNKVQGNIHFAPGHSFQQNALHVHDLHDYN 258
Query: 243 S--AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ +FN H I LSFG + PLD + YYIK++ T L+
Sbjct: 259 APNGSFNFKHTIHELSFG----ESSSFVNPLDTVTKTPPTKYFSYQYYIKVVGTDISYLN 314
Query: 301 GSKL------------------GGGDGGMPG-IFFSYELSPLMVKITEKSKSLGHLWTKI 341
GS+L G GMPG +FF++E+SP++VK E K H T +
Sbjct: 315 GSQLTTNQFSVTEHEQDVTPLFGALPIGMPGKLFFNFEISPMLVKFKEFRKPFTHFLTDL 374
Query: 342 MCNISGTYITFMLVDALLHSCVKKI-SKVEIGGKT 375
I G + ++DALL + + I +KVEIG T
Sbjct: 375 CAIIGGVFTVAGMIDALLFATQRSIQAKVEIGKNT 409
>gi|331241265|ref|XP_003333281.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312271|gb|EFP88862.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 421
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 137/419 (32%), Positives = 200/419 (47%), Gaps = 51/419 (12%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F KGLD F+K ED KT +GG +T+ I LI V+ DY Q+ + VD
Sbjct: 7 FGGYFKGLDGFSKTMEDVKVKTGFGGMLTMASAALIFTLILVEFRDYRQIHVQPSILVDK 66
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRG KL +H++I P + C L++D +D SGE V H++ K RL LDG P+
Sbjct: 67 SRGEKLLVHMNITFPRVPCYLLSVDVMDISGEHQNDVAHDLAKTRLGLDGVPLS------ 120
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
N +K + E T + CGSCYG E CCN+C EV+E+Y + W+
Sbjct: 121 TNTTQKLQGELE---TIIASRAKDYCGSCYGGEPGPSGCCNSCEEVRESYVRRGWSFNNP 177
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D I QC E+ +E++K EGC I G L+VN+V G+FH++PG S+ + VHVHD+ PY
Sbjct: 178 DGIEQCVQEHWSERIKEQSKEGCNINGVLKVNKVIGNFHLSPGRSFQTHQVHVHDLVPYL 237
Query: 243 --SAAFNTTHHIRHLSF-----------GIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
S + H I + +F ++L+ PLDG A E MF Y++
Sbjct: 238 QDSNLHDFGHVIHNFAFMDANQPTETAHTLRLKKTLGIVNPLDGVKAHTEASNYMFQYFL 297
Query: 290 KIIPTIYERLDGS-------------------------KLGG----GDGGMPGIFFSYEL 320
K++ T ++ LDG +LG G G+PG+FF+YE+
Sbjct: 298 KVVGTQFQLLDGQVAKTHQYSVTQYERDLDNSDKSDADELGHLTSHGHSGVPGVFFNYEI 357
Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTKR 379
SP+ V E +S H T + G L+D+ ++ ++ G R
Sbjct: 358 SPMQVVHQEYRQSFAHFATSTCAIVGGVLTVAGLLDSFVYGAQNRMKGGSSNGAASHSR 416
>gi|242035905|ref|XP_002465347.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
gi|241919201|gb|EER92345.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
Length = 387
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 138/392 (35%), Positives = 214/392 (54%), Gaps = 28/392 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L+ LDA+ K EDF+ +T+ GG +TI+ L I L ++ Y +T +L V
Sbjct: 1 MELWSKLRNLDAYPKVNEDFYSRTLSGGLITILSSLAILLLFFSEIRLYLYSATESKLTV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG +L I+ D+ P + C +A+D +D SGEQH + H+I K+R+D G I E +K
Sbjct: 61 DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDITKKRIDHLGNVI-ESRK 119
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ V A K ++ ++G E CGSCYGAE +CCN+C EV++ YR K WA+
Sbjct: 120 DRVGAPKIERPLQKHGGRLDHNE--VYCGSCYGAEETDDQCCNSCEEVRDVYRKKGWAIN 177
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
++ I QCK E ++LK+ EGC I+G++ VN+V+G+FH APG S + + D+
Sbjct: 178 NVELIDQCKREGYVQRLKDETGEGCTIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLN 237
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
+N +H I LSFG + PLDG + G + M+ Y++K++PTIY
Sbjct: 238 IQPETYNISHKINKLSFGEEFPG---VVNPLDGVEWIQDNSNGLTGMYQYFVKVVPTIYT 294
Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ G K+ + PG++F YE SP+ V TE++ SL H T I
Sbjct: 295 DIRGRKIYSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354
Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
+ G + ++D+ + H +KK K+E+G
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMELG 384
>gi|79318328|ref|NP_001031077.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|332192090|gb|AEE30211.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 338
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 184/335 (54%), Gaps = 21/335 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T +L VD+SRG
Sbjct: 7 RLRNLDAYPKINEDFYRRTLSGGVITLASSIVMLILFFSELQLYIHPVTETQLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I+ D+ P + C ++LD++D SGE+HL V H+I KRRLD G I+ Q + +
Sbjct: 67 EKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHT 126
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + G + CGSC+GAE CCN+C EV+EAYR K WAL + ++I
Sbjct: 127 KIEKPLQKHGGRLE---HNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDPESI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E +K+K+ EGC ++G+LEVN+V+G+FH PG S+ + HD+ +
Sbjct: 184 DQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGN 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL- 304
+N +H + L+FG PLDG + + ++ Y+IK++P+IY + + +
Sbjct: 244 YNISHKVNRLAFGDFFPG---VVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300
Query: 305 --------------GGGDGGMPGIFFSYELSPLMV 325
G PG+FF Y+LSP+ V
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKV 335
>gi|358334909|dbj|GAA53334.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Clonorchis sinensis]
Length = 323
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 169/305 (55%), Gaps = 36/305 (11%)
Query: 84 LALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELE 143
L LD +DS+GEQ + V IYK R+D G PI +++ N K + VT +
Sbjct: 23 LNLDTMDSTGEQKIDVSQQIYKTRIDSTGSPISATRRDDGNPSKGQVVT----------K 72
Query: 144 DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTE 203
DP+ CGSCYGAE+ETRKCCNTC E++ AY+ + W + L QC+ E + L N +E
Sbjct: 73 DPDYCGSCYGAESETRKCCNTCKEIQLAYQERHWVVKNLSVFEQCREEQWDDTLANLGSE 132
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+I G L+VN+V+GSFHI PG SY+ + VHVH++Q + N +H I L+FG
Sbjct: 133 GCRIQGSLQVNKVAGSFHITPGNSYASDQVHVHNLQGFDGQKLNMSHKIDKLAFGNMYPG 192
Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD---------------------GS 302
+ PLDGT E A M YY+K++PT+Y + GS
Sbjct: 193 ---QTNPLDGTTMNVVEPAQMVTYYMKLVPTMYVSYNTTTRSLSTVHTNQYSVTWHSKGS 249
Query: 303 KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH-- 360
L G+PG+FF+YELSPL+VKI+ + KS H T I G + L+DA ++
Sbjct: 250 PLTSDSSGIPGLFFNYELSPLLVKISYEHKSFLHFLTNTCAIIGGVFTVASLLDAFIYQS 309
Query: 361 SCVKK 365
+CV +
Sbjct: 310 TCVVR 314
>gi|395324643|gb|EJF57079.1| endoplasmic reticulum-derived transport vesicle ERV46 [Dichomitus
squalens LYAD-421 SS1]
Length = 423
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 135/412 (32%), Positives = 199/412 (48%), Gaps = 56/412 (13%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F LKG+DAF K ED KT G +TI+ I ++ DY +V + VD
Sbjct: 5 FLNALKGVDAFGKTMEDVKVKTRTGALLTIIAAAIILSFTTIEFFDYRRVFVDTSIVVDR 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KE 121
SRG KL ++++I P + C L+LD +D SGE + HNI K RLD GKP+ E
Sbjct: 65 SRGEKLTVNMNITFPRVPCYLLSLDVMDISGETQSDITHNILKTRLDEKGKPVSHSLIAE 124
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ N + K ++G CGSCYG CCNTC EV++AY + W+
Sbjct: 125 LQNDLDKLNEQRQSGY----------CGSCYGGIEPEGGCCNTCEEVRQAYVNRGWSFNR 174
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D+I QC E ++KLK EGC I G + VN+V G+ H++PG S+ + +++++ PY
Sbjct: 175 PDSIEQCVKEGWSDKLKEQAHEGCNIAGRVRVNKVVGNIHLSPGRSFRTSAHNLYELVPY 234
Query: 242 TSAAFNT---THHIRHLSF---------GIKLQDDDERR-----KPLDGTVAKAEEGASM 284
N TH I H +F KL + + R PLDGT + + M
Sbjct: 235 LRTDGNRHDFTHQIHHFAFEGDDEYDPRNAKLGKELKNRLGIDANPLDGTQGRTIKQQYM 294
Query: 285 FNYYIKIIPTIYERLDGSKLGG----------------------------GDGGMPGIFF 316
F Y++K++ T ++ +DG K+G G+GG+PG FF
Sbjct: 295 FQYFLKVVSTQFQTIDGKKVGTHQYSATHFERDLDKGPSEDSPAGLHVAHGNGGIPGAFF 354
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+YE+SPL+++ E +S H T + G L+D+LL + K K
Sbjct: 355 NYEISPLLIRHVETRQSFAHFLTSTCAIVGGVLTVASLIDSLLFATRKAFKK 406
>gi|312075860|ref|XP_003140604.1| hypothetical protein LOAG_05019 [Loa loa]
Length = 365
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/388 (32%), Positives = 198/388 (51%), Gaps = 51/388 (13%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M ERLK DA+TKP +DF +T GGAVT+V I ++ + + V E+L+V
Sbjct: 1 MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60
Query: 61 DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-P 118
DS+ ++ ++ DI P + C + +D +D SG+ + ++YK +++++ P
Sbjct: 61 DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIRDDVYKIKVNINTSTASSVP 120
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
+V+ CGSCYGA+ CCNTC EVKEAY K W
Sbjct: 121 ASQVL------------------------CGSCYGAK---EGCCNTCEEVKEAYMRKGWE 153
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
L ++T+ QCK++ +K+ EGC++YG ++V +V+G+FHIAPG + H HD+
Sbjct: 154 LINIETVEQCKSDLWVKKMSEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLRAHRSHFHDL 213
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV---AKAEEGASMFNYYIKIIPTI 295
+ + F+T+H + H SFG + PLDG A+ +G M+ Y++K++PT
Sbjct: 214 HSLSPSKFDTSHTVNHFSFGNSFPG---KVYPLDGKFFGSARNSDGI-MYQYHLKLVPTS 269
Query: 296 YERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
Y LD ++ + G G+PG F YE SPLMVK E+ +SL
Sbjct: 270 YVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFVQYEFSPLMVKYEERQQSLSTFLVS 329
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISK 368
I I G + L+DA ++ + IS+
Sbjct: 330 ICAIIGGIFTVASLIDAFIYRSGRIISQ 357
>gi|402083890|gb|EJT78908.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 444
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 142/448 (31%), Positives = 205/448 (45%), Gaps = 86/448 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG +TIV + + YL + DY +++ EL V
Sbjct: 1 MAPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLALGEWSDYRRIAIHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL +PQ
Sbjct: 61 DKSRGDRMEIHLNITFPRMPCELLTLDVMDVSGEQQHGVQHGVVKVRL--------QPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K ++ + DP CG CYGA + CC+TC+EV+EAY
Sbjct: 113 EGGGVIDVKALSLHADEDSATHLDPKYCGPCYGAPAPSNAAKAGCCSTCDEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC E+ E+L EGCQI G L VN+V G+FH+APG S+S ++HVH
Sbjct: 173 WAFGRGENVEQCLREHYAERLDEQRQEGCQIAGSLRVNKVIGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPYTSAAFNTTHHIRH----LSFGIKLQDDDERR----------------KPLDGTVA 276
D++ Y + H H LSFG +L + ++R PLDGT
Sbjct: 233 DLKNYWDTPVDGGHSFSHVVHSLSFGPQLPLEVQKRLDRGRSLPWADHSHQLNPLDGTSQ 292
Query: 277 KAEEGASMFNYYIKIIPTIYERL---------------------------DGS------- 302
+ + F Y++KI+PT Y L DG+
Sbjct: 293 ETADPNFSFMYFLKIVPTSYLPLGWEGRRAKIATGNHDKDSWVGTYGYSPDGAVETHQYS 352
Query: 303 ------KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
L GGD GG+PG+FFSY++SP+ ++ E+ K+ T +
Sbjct: 353 VTSHKRSLAGGDDAAEGHQERLHSKGGIPGVFFSYDISPMKVINREERPKTFAGFLTGLC 412
Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
+ GT VD + ++ K+
Sbjct: 413 AILGGTLTVAAAVDRTFYEGATRLKKMR 440
>gi|164655211|ref|XP_001728736.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
gi|159102620|gb|EDP41522.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
Length = 427
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/417 (31%), Positives = 201/417 (48%), Gaps = 60/417 (14%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
F +L+GLDAF + +D +T G +T+ L I LI + DY +V T+ L VD
Sbjct: 5 AFFGQLRGLDAFGRMSDDVRIRTNVGALLTLTSALMILVLIVSEFLDYRRVQTSPRLEVD 64
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
SRG +L + ++ P I C L+LD VD GE + V H++ +RRLD GKP+ E E
Sbjct: 65 LSRGERLAVQFNVTFPRIPCYLLSLDVVDVVGETQMDVHHDVERRRLDETGKPVSE---E 121
Query: 122 VVNAVKK--KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
V+ ++ K+V E G P+ CG CYGA+ CCN+C+ V+EAY W+
Sbjct: 122 VIRELESEAKRVIAERG--------PDYCGDCYGADPPEGGCCNSCDAVREAYMLHNWSF 173
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
D I QC E+ +E ++ EGC I G + VN+V G+ H PG ++ N +H HD+
Sbjct: 174 TSPDDIEQCAQEHWSEHVREQNHEGCNIAGEVRVNKVVGNLHFIPGRTFHRNDIHTHDLV 233
Query: 240 PYTSAAFNTTHHIRH----LSFGIKLQDDDER----------------RKPLDGTVAKAE 279
PY + HH H SFG++ + ER + L+G AK
Sbjct: 234 PYLHGTGDDVHHFGHKIHRFSFGMEDEFAIERTSRGRRQGPLKNRMGIKNALEGRSAKTL 293
Query: 280 EGASMFNYYIKIIPTIYERLDGSKLG------------------GGD---------GGMP 312
MF Y++K++P +L+G ++ GG G+P
Sbjct: 294 SSNYMFQYFLKVVPVEVHKLNGHEMSTYQYSATSYERNLEDFDRGGQMSGHIVRMIEGIP 353
Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
G++F+YE+SPL V TE S+ HL + + I G L+D ++ + + V
Sbjct: 354 GVYFNYEISPLRVIQTEWHHSIWHLVSNLFALIGGIVTVAGLIDGAIYRSRRTFNIV 410
>gi|345569114|gb|EGX51983.1| hypothetical protein AOL_s00043g717 [Arthrobotrys oligospora ATCC
24927]
Length = 397
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 152/407 (37%), Positives = 201/407 (49%), Gaps = 51/407 (12%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + RL LDAFTK ED +T GG VTI L I L+ + DY +VS EL V
Sbjct: 1 MGRASRLMRLDAFTKTVEDARIRTSSGGIVTIFSVLVIFCLVIGEWNDYRKVSVISELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D +RG ++ IHL+I P I C+ L LD +D SG+ V H I K RLD G I+
Sbjct: 61 DKTRGEQMEIHLNITFPHIPCELLTLDVMDVSGDLQPSVSHGIGKHRLDKSGGIIESKFL 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E+ K DP+ CG CYGA ++ CC TC++V+EAY K
Sbjct: 121 ELHPEHPKHL-------------DPSYCGECYGAVAPDTSKKAGCCQTCDDVREAYAAKG 167
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E E LK EGC+I G+L VN+V G+FHIAPG S+S +HVH
Sbjct: 168 WAFGDGTGVHQCEEEGYKEMLKEQAGEGCRIDGHLWVNKVVGNFHIAPGKSFSNAQMHVH 227
Query: 237 DIQPYTSAAF--NTTHHIRHLSFGIKLQDD-----DERRKPLDGTVAKAEEGASMFNYYI 289
D+ Y + TH I LSFG L D ++ PLD T K + + Y++
Sbjct: 228 DLANYLQGDVHHDFTHTINALSFGPPLPTDLLHENHHQQNPLDATSKKTSDRNYNYLYFL 287
Query: 290 KIIPTIYERLDG----------------SKLGGGD----------GGMPGIFFSYELSPL 323
KI+ T YE LD S GG D GG+PGIFFSY++SP+
Sbjct: 288 KIVSTSYEHLDHGYTIHTHQYSVTSHERSLEGGKDDVHPGTVHARGGIPGIFFSYDISPM 347
Query: 324 MVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
V E ++KS T I I GT +D L+ ++I K+
Sbjct: 348 KVVNREIRTKSFSGFLTSICAIIGGTLTVAAALDRGLYEGARRIGKL 394
>gi|347842451|emb|CCD57023.1| similar to endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Botryotinia fuckeliana]
Length = 439
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 143/443 (32%), Positives = 207/443 (46%), Gaps = 81/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTI L + YL + DY +++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWADYRRITVHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IHL+I P I C+ L LD +D SGEQ + V H + K RL PQ+
Sbjct: 61 DKGRGEKMEIHLNITFPKIPCELLTLDVMDVSGEQQVGVMHGVKKVRLG--------PQE 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + K + N + DPN CG+CYGA + CCNTC+EV+EAY
Sbjct: 113 EGGKVIDIKALDLHNAEDSATHLDPNYCGACYGATPPPNAQKPGCCNTCDEVREAYASVS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ E+L + EGC+I G L VN+V G+FHIAPG S++ ++HVH
Sbjct: 173 WAFGRGENVEQCEREHYGERLDSQRKEGCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVH 232
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDD--------------DERRKPLDGTVAKA 278
D+ + +HHI L FG +L ++ + PLD T
Sbjct: 233 DLNNFFDTPVPGGHVFSHHIHSLRFGPELPEEVFKKLGSDSIIPWTNHHLNPLDNTEQIT 292
Query: 279 EEGASMFNYYIKIIPTIYERL------------------------DGS------------ 302
E A F Y++K++ T Y L DGS
Sbjct: 293 HEAAYNFMYFVKVVSTSYLPLGWETNYNSRPHDASVDIGTYGHSEDGSIETHQYSVTSHR 352
Query: 303 -KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISG 347
L GGD GG+PG+FFSY++SP+ ++ E++K+L T + + G
Sbjct: 353 RSLNGGDDSAEGHKEKLHARGGIPGVFFSYDISPMKVINKEERTKTLAGFLTGLCAIVGG 412
Query: 348 TYITFMLVDALLHSCVKKISKVE 370
T VD ++ ++ K++
Sbjct: 413 TLTVAAAVDRGVYEGATRLRKMQ 435
>gi|328858670|gb|EGG07782.1| hypothetical protein MELLADRAFT_105603 [Melampsora larici-populina
98AG31]
Length = 422
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 136/402 (33%), Positives = 197/402 (49%), Gaps = 51/402 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
KGLD F K ED +T +GG +T+ + I L+ V+ DY + + VD SRG
Sbjct: 11 FKGLDGFGKTMEDVKIRTGFGGFLTLASAILIVTLVLVEFVDYRTLHLNPSIVVDKSRGE 70
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL + ++I P + C L++D +D SGE V H++ K RL+ DG +V+A
Sbjct: 71 KLIVDMNITFPRVPCYLLSVDLMDISGEHQNDVNHDMTKTRLNPDGT--------LVSAS 122
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K + E T P CGSCYG CCNTC EV+E+Y + W+ D I
Sbjct: 123 VSKGLKGEL-DTIAATRAPGYCGSCYGGTPPESGCCNTCEEVRESYVRRGWSFSNPDGIE 181
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY--TSA 244
QC E+ ++K+K EGC + G ++VN+V G+FH++PG S+ N +HVHD+ PY T
Sbjct: 182 QCVQEHWSDKIKEQEKEGCNMNGQVKVNKVIGNFHMSPGRSFQTNAMHVHDLVPYLQTGN 241
Query: 245 AFNTTHHIRHLSFGIKLQ--DDDERRK---------PLDGTVAKAEEGASMFNYYIKIIP 293
+ + H I +F + Q DDDE R+ PLDG A EE MF Y++K++
Sbjct: 242 SHDFGHIIHKFAFLAEHQSPDDDETRRIKTSLGIVNPLDGIKAHTEESNYMFQYFLKVVG 301
Query: 294 TIYERLDG-------------------SKLGGGD----------GGMPGIFFSYELSPLM 324
T + LD S GG D G+PG+FF+YE+SP+
Sbjct: 302 TEFHLLDQRVVKTHQYSVTQYERDLTKSSRGGTDELGHQTSHGYAGVPGLFFNYEISPMQ 361
Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
V E +S H T I G L+D+ ++ +I
Sbjct: 362 VIHKEYRQSFAHFATSTCAIIGGVLTVAGLIDSAVYGARNRI 403
>gi|171696240|ref|XP_001913044.1| hypothetical protein [Podospora anserina S mat+]
gi|170948362|emb|CAP60526.1| unnamed protein product [Podospora anserina S mat+]
Length = 437
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 146/440 (33%), Positives = 201/440 (45%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV + + +L + DY ++ EL V
Sbjct: 1 MAAKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLIVVFFLAWGEWQDYRRIEIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGEQ V+H + K RL P
Sbjct: 61 DKGRGERMEIHLNVSFPRVPCELLTLDVMDVSGEQQHGVQHGVVKTRL--------RPLS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E ++ K + DPN CG CYGA + CC TC+EVKEAY +
Sbjct: 113 EGGGVIEAKALALHARDEEAAHLDPNYCGPCYGAAPPVHAQKPNCCQTCDEVKEAYAAQA 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E+ EKL EGC+I G + VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGRGEGIEQCEREHYAEKLDEQRNEGCRIEGNVRVNKVIGNFHIAPGKSFSNGNMHVH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR--------------KPLDGTVAKAEE 280
D++ Y T TH I HL FG +L D ++ PLD T + ++
Sbjct: 233 DLKNYWDTPVKHTFTHEIHHLRFGPQLPDGLAKKLGKNKALPWTNHHVNPLDNTHQETDD 292
Query: 281 GASMFNYYIKIIPTIY------------------------ERLDGS-------------K 303
F Y+IKI+PT Y + DGS
Sbjct: 293 VNYNFMYFIKIVPTSYLPLGWEKTWQGFKDQHHKELGSFGQSADGSLETHQYSVTSHRRS 352
Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
L GGD GG+PG+FFSY++SP+ ++ E+ KS + + GT
Sbjct: 353 LSGGDDGSEGHKERLHAKGGIPGVFFSYDISPMKVINREERPKSFLGFLAGLCAIVGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD L K+ K+
Sbjct: 413 TVAAAVDRALFEGGMKLKKL 432
>gi|429853391|gb|ELA28466.1| copii-coated vesicle membrane protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 437
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 144/427 (33%), Positives = 200/427 (46%), Gaps = 79/427 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV + + +L + DY ++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIVSLIVVLWLAWGEWVDYRRIEIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V H + K RL PQK
Sbjct: 61 DQGRGERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------RPQK 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + K ++ + E DPN CG CYGA + CCNTC EV+EAY
Sbjct: 113 EGGGVIDVKALSLHSSDEAAEHLDPNYCGPCYGAPAPPNAQKAGCCNTCEEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC E+ EKL+ EGC+I G L VN+V G+FH+APG S+S ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAEKLEEQRREGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D++ Y A + TH I L FG +L D ++ PLD T +
Sbjct: 233 DLKNYWETPDDAQHDFTHVIHTLRFGPQLPDTITKKMTKRAYAWTNHHGNPLDSTHQETN 292
Query: 280 EGASMFNYYIKIIPTIYERL-----------------------DGS-------------K 303
+ F Y++KI+PT Y L DGS
Sbjct: 293 DPNYNFMYFVKIVPTSYLALNWQKSASIQDEESSGLGLLGHLSDGSVETHQYSVTSHKRS 352
Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
L GGD GG+PG+FFSY++SP+ ++ E++K+ T + I GT
Sbjct: 353 LAGGDDSAEGHQERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTL 412
Query: 350 ITFMLVD 356
VD
Sbjct: 413 TVAAAVD 419
>gi|71021625|ref|XP_761043.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
gi|46100607|gb|EAK85840.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
Length = 435
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 128/413 (30%), Positives = 206/413 (49%), Gaps = 62/413 (15%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+G+DAF+K +D +T G +T++ L I+ L + DY V L VD SRG
Sbjct: 9 QLRGIDAFSKTMDDVRIRTNAGALITLISALLIAVLTIGEFIDYRTVHVKPALEVDRSRG 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL ++++I P + C L+LD +D SGE ++H++ + R++ DGK I++ +K +
Sbjct: 69 EKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDVERTRINHDGKIIEQGKKSLKGD 128
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ T + + CG CYG + KCCNTC+EV+EAY K W+ + D +
Sbjct: 129 AARIANT----------KGKDYCGDCYGGQPPASKCCNTCDEVREAYVRKGWSFADPDHV 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +EK+K EGC+I G L VN+V GSFH++PG ++ N +H+HD+ PY S
Sbjct: 179 DQCVAEGWSEKIKEQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGT 238
Query: 246 FNTTHHIRHL----SFGIK-----LQDDDER--------RKPLDGTVAKAEEGASMFNYY 288
+ H H+ SFG + L ER + PL+G A+ ++ MF Y+
Sbjct: 239 GSEHHDFGHIIHEFSFGSEQEYHGLTSAKERAVKAKLGVKDPLEGVRAQTQQSQFMFQYF 298
Query: 289 IKIIPTIYERLDGSKL-----------------------------------GGGDGGMPG 313
+K++ T + L G L G G+PG
Sbjct: 299 VKVVSTEFRPLSGETLKTQQYSVTTYERDLSPGANAAALAGLSNEGSGAHISHGFAGVPG 358
Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+FF+YE+SPL +E +SL H T + G ++D+L+++ +++
Sbjct: 359 VFFNYEISPLKTIHSEYRQSLSHFLTSTCAIVGGILTVAGILDSLVYNSRRRL 411
>gi|310800359|gb|EFQ35252.1| hypothetical protein GLRG_10396 [Glomerella graminicola M1.001]
Length = 437
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 142/427 (33%), Positives = 201/427 (47%), Gaps = 79/427 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV + + +L + DY ++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLIVVFWLAWGEWADYRRIEIHSELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGEQ V H + K RL P+K
Sbjct: 61 DKGRGERMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------RPRK 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET----ETRKCCNTCNEVKEAYRYKK 176
E + K + + + E DPN CG CYGA+ + CCNTC+EV+EAY
Sbjct: 113 EGGGVIDIKALDLHSRDDSAEHLDPNYCGPCYGAQAPPNAQKPGCCNTCDEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC E+ E+L+ EGC+I G L VNRV G+FH+APG S+S ++HVH
Sbjct: 173 WAFGKGEGVEQCTREHYAERLEEQRQEGCRIEGNLRVNRVVGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D++ Y A + TH I L FG +L D ++ PLD T
Sbjct: 233 DLKNYWDTPADAQHDFTHTIHSLRFGPQLPDQVTKKMGKRAYAWTNHHGNPLDNTHQDTN 292
Query: 280 EGASMFNYYIKIIPTIYERL-----------------------DGS-------------K 303
+ F Y++KI+PT Y L DGS
Sbjct: 293 DPNYNFMYFVKIVPTSYLALNWQKSTAYQDDDSSSLGLLGQGNDGSVETHQYSVTSHKRS 352
Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
L GGD GG+PG+FFSY++SP+ ++ E++K+ T + I GT
Sbjct: 353 LAGGDDAAEGHQERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTL 412
Query: 350 ITFMLVD 356
VD
Sbjct: 413 TVAAAVD 419
>gi|389632999|ref|XP_003714152.1| hypothetical protein MGG_01245 [Magnaporthe oryzae 70-15]
gi|351646485|gb|EHA54345.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae 70-15]
Length = 439
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 145/443 (32%), Positives = 201/443 (45%), Gaps = 81/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG +TIV + + YL + DY ++ EL V
Sbjct: 1 MAPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLAWGEWADYRRIDIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL PQ
Sbjct: 61 DKSRGDRMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVIKVRL--------RPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K + DPN CG CYGA CCNTC+EV+EAY
Sbjct: 113 EGGGVIDAKTLALHAEDEAATHLDPNYCGGCYGAPAPANAKKAGCCNTCDEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC E+ E+L EGCQI G L VN+V G+FH+APG S+S ++HVH
Sbjct: 173 WAFGRGENVEQCTREHYAERLDEQRHEGCQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGI--------KLQDDDERR-------KPLDGTVAK 277
D++ Y + +H I L FG KL + D+ PLDG +
Sbjct: 233 DLKNYWDTPVEGGHSFSHTIHSLRFGPQLPPSALEKLGNKDKNMPWTNHHINPLDGVIQT 292
Query: 278 AEEGASMFNYYIKIIPTIYERL-----------------------DGS------------ 302
+ + Y++KI+PT Y L DGS
Sbjct: 293 TVDPNFNYMYFVKIVPTSYLPLGWEKRTHLATMHDHGVGTYGYSGDGSVETHQYSVTSHK 352
Query: 303 -KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
L GGD GG+PG+FFSY++SP+ V E ++K+ T + + G
Sbjct: 353 RSLAGGDDGEDGHKERMHSRGGIPGVFFSYDISPMKVINREVRTKTFAGFLTGLCAILGG 412
Query: 348 TYITFMLVDALLHSCVKKISKVE 370
T +D + V +I K++
Sbjct: 413 TLTVAAAIDRMTFEGVTRIKKMQ 435
>gi|367019108|ref|XP_003658839.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
42464]
gi|347006106|gb|AEO53594.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
42464]
Length = 436
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 144/440 (32%), Positives = 204/440 (46%), Gaps = 78/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV + + +L + DY ++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLIVVFFLALGEWSDYRRIVVHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P I C+ L LD +D SGEQ V+H I K RL P
Sbjct: 61 DKGRGERMEIHLNITFPRIPCELLTLDVMDVSGEQQHGVQHGITKTRL--------RPLS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K++ + DPN CG CYGA CCNTC+EV++AY
Sbjct: 113 EGGGDIDSKEIVLHSRDEAAVHLDPNYCGECYGAPPPNNAKKPGCCNTCDEVRDAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + IVQC+ E+ +EKL EGC+I G L VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGRGEGIVQCEREHYSEKLDAQRNEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNMHVH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEG 281
D++ Y + TH I HL FG +L + ++ PLD T + ++
Sbjct: 233 DLKNYWDSPTKHTFTHTIHHLRFGPQLPESLTQKLGTKNLPWTNHHVNPLDDTHQQTDDV 292
Query: 282 ASMFNYYIKIIPTIYERL------------------------DGS-------------KL 304
+ Y++KI+PT Y L DGS L
Sbjct: 293 NYNYMYFLKIVPTSYLPLGWEKTWAGFRERHSAELGSFGTSPDGSVETHQYSVTSHKRSL 352
Query: 305 GGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYI 350
GG+ GG+PG+FFSY++SP+ ++ E++KS + + GT
Sbjct: 353 AGGNDAAEGHQERQHARGGIPGVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGGTLT 412
Query: 351 TFMLVDALLHSCVKKISKVE 370
+D L ++ K+
Sbjct: 413 VAAAIDRALFEGTVRLKKLR 432
>gi|412994036|emb|CCO14547.1| predicted protein [Bathycoccus prasinos]
Length = 436
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 134/420 (31%), Positives = 208/420 (49%), Gaps = 57/420 (13%)
Query: 7 LKGLDAFTKPYE-DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
L+ DAF K + DF+ ++ GG +T+V ++ L+ + Y + +L+VD+ RG
Sbjct: 17 LRKFDAFPKFVDVDFYSRSFGGGIITVVTYIVAVSLLLAETKLYLKTHVKHDLYVDNGRG 76
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEP------ 118
+ I++D+ P +SC L LD +D SGE HL V +H + K R D G + +
Sbjct: 77 ETMRINVDVFFPNLSCGSLGLDVMDVSGETHLDVVDHEMRKIRYDRYGVKLADALNDEHG 136
Query: 119 QKEVVNAVKKKKVTTENGTTTTE--------------LEDPNK--CGSCYGAET------ 156
++EVVN TE ++ + +ED CGSCYGA+
Sbjct: 137 KEEVVNEKAFDSNETETASSLRKNKTKKTAKELIPRYMEDGKTKYCGSCYGADVSGANRG 196
Query: 157 ETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRV 216
++CC TC EV+EAY WA ++ QCK E +E L N EGC+ G+L+VN+V
Sbjct: 197 REQRCCQTCEEVREAYIEVGWAFTGASSMEQCKREGFSEVLGNVHEEGCEFKGFLDVNKV 256
Query: 217 SGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT-- 274
G+FHIAPG S+ HVHD+ P+ FN +H +RHLSFG + + PLDGT
Sbjct: 257 QGNFHIAPGKSFQQGEQHVHDLSPFPDGKFNFSHEVRHLSFG---EGYPGKVDPLDGTKR 313
Query: 275 VAKAEEGASMFNYYIKIIPTIYERL---------------------DGSKLGGGDGGMPG 313
K ++ Y+ +I+PT Y L D + + GG +PG
Sbjct: 314 TLKLPAETGVYQYFFRIVPTTYTYLNPFKKDISTNQYSVVDHFKPVDAASIQGGSSDLPG 373
Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
+FF Y+LSP+ V I E S+ ++ ++ G + +VD +++ I K+++G
Sbjct: 374 VFFFYDLSPIKVDIAEYRTSVWKFLAEVCASVGGVFAVSGIVDKVVYKGSLAIKKKIQLG 433
>gi|340923948|gb|EGS18851.1| hypothetical protein CTHT_0054620 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 436
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 144/439 (32%), Positives = 205/439 (46%), Gaps = 78/439 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV + + +L + +Y ++ EL V
Sbjct: 1 MAGKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLIVVLFLSWSEWREYRRIVVHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL P +
Sbjct: 61 DKGRGERMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVTKTRL--------RPWE 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + KK++ + + DPN CGSCYGA CC TC+EV+EAY
Sbjct: 113 EGGGDIDKKELALHSIEESATHLDPNYCGSCYGANPPPNAVKPGCCQTCDEVREAYAQAA 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E+ E+L EGC+I G L VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGRGENIEQCQREHYAERLDQQRREGCRIEGGLRVNKVVGNFHIAPGKSFSNGNMHVH 232
Query: 237 DIQPYTSAAFNT--THHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEG 281
D++ Y + TH I HL FG +L + ++ PLD T + +E
Sbjct: 233 DLKNYWESPVRHTFTHIIHHLRFGPQLPESLHQKLGNKALPWSNHHVNPLDNTHQETDEV 292
Query: 282 ASMFNYYIKIIPTIYERL------------------------DGS-------------KL 304
+ Y+IKI+PT Y L DGS L
Sbjct: 293 NFSYMYFIKIVPTSYLPLGWEKTWDQFREQHHAELGSFGTSADGSVETHQYSVTSHRRSL 352
Query: 305 GGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYI 350
GGD GG+PG+FFSY++SP+ ++ E++KS + + GT
Sbjct: 353 SGGDDAAEGHSERLHSKGGIPGVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGGTLT 412
Query: 351 TFMLVDALLHSCVKKISKV 369
+D L ++ K+
Sbjct: 413 VAAAIDRALFEGTVRLKKL 431
>gi|380489161|emb|CCF36889.1| hypothetical protein CH063_08353 [Colletotrichum higginsianum]
Length = 437
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 141/427 (33%), Positives = 201/427 (47%), Gaps = 79/427 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV + + +L + DY ++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIVSLIVVFWLAWGEWVDYRKIEIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGEQ V H + K RL QK
Sbjct: 61 DKGRGERMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGVIHGVNKVRL--------RSQK 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET----ETRKCCNTCNEVKEAYRYKK 176
E + K + + T E DPN CG+CYGA+ + CCNTC EV+EAY
Sbjct: 113 EGGGVIDMKALDLHSREATAEHLDPNYCGACYGAQAPANAQKAGCCNTCEEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC E+ E+L+ EGC++ G L VN+V G+FH+APG S+S ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAERLEEQRQEGCRLEGNLRVNKVVGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D++ Y A + TH I L FG +L D ++ PLD T +
Sbjct: 233 DLKNYWDTPDDAQHDFTHTIHSLRFGPQLPDQVTKKMGKRAYAWTNHHGNPLDNTHQETT 292
Query: 280 EGASMFNYYIKIIPTIYERL-----------------------DGS-------------K 303
+ F Y++KI+PT Y L DGS
Sbjct: 293 DPNYNFMYFVKIVPTSYLALNWQKSSSYQDEENSGLGLLGQGNDGSVETHQYSVTSHKRS 352
Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
L GGD GG+PG+FFSY++SP+ ++ E++K+ T + I GT
Sbjct: 353 LAGGDDAAEGHKERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTL 412
Query: 350 ITFMLVD 356
VD
Sbjct: 413 TVAAAVD 419
>gi|453082617|gb|EMF10664.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 432
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 151/440 (34%), Positives = 200/440 (45%), Gaps = 84/440 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI + I YL+ + DY + EL V
Sbjct: 1 MPAKSRFTKLDAFTKTVEDARIRTSTGGIVTITSLILILYLVWGEWTDYRRTVVHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IH++I P + C+ L LD +D SGE V H + K RLD +GK I
Sbjct: 61 DKGRGEKMEIHMNISFPRVPCELLTLDVMDVSGEVQSGVMHGVNKVRLDANGKEI----- 115
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA---ETETRK-CCNTCNEVKEAYRYKK 176
K+ T N DP+ CG CYGA ET T+ CCN C EV+EAY
Sbjct: 116 -------GKEALTVNSEEQVPHLDPDYCGDCYGAPAPETATKAGCCNNCAEVREAYAGVS 168
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC E+ E L EGC+I G + VN+V G+FH APG S+S ++HVH
Sbjct: 169 WSFGRGEGVEQCTREHYAEHLDEQRKEGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVH 228
Query: 237 DIQPYTSAA---FNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAEE 280
D++ Y + + TH I HL FG +L DD + PLD T +E
Sbjct: 229 DLENYFQSGEVQHSFTHKIHHLRFGPELPDDVVKAVGKKGMAWSNHHLNPLDDTEQVTDE 288
Query: 281 GASMFNYYIKIIPTIYERL--DGS------------------------------------ 302
A F Y++K++ T Y L DGS
Sbjct: 289 VAYNFMYFVKVVSTAYLPLGWDGSGSLLDIPHELIALGGYGKGEQGSIETHQYSVTSHKR 348
Query: 303 KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGT 348
L GGD GG+PG+FFSY++SP+ V E ++KS + I GT
Sbjct: 349 SLTGGDAKAEGHEERLHAKGGIPGVFFSYDISPMKVINREARAKSFSGFLVGVCAVIGGT 408
Query: 349 YITFMLVDALLHSCVKKISK 368
VD LL+ K+ K
Sbjct: 409 LTVAAAVDRLLYEGGSKLRK 428
>gi|222628979|gb|EEE61111.1| hypothetical protein OsJ_15023 [Oryza sativa Japonica Group]
Length = 369
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 123/383 (32%), Positives = 192/383 (50%), Gaps = 39/383 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ + G
Sbjct: 7 KLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRHTLTYTF----------G 56
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L + D+ P + C ++LDA+D SG++HL V+H+I+K+R+D+ G I Q V
Sbjct: 57 MILKMQFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAV--- 113
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
NG + N +CCN+C +V+EAYR K W + D I
Sbjct: 114 -------GGNGPYSGMAAGLNTMRPIVALVMSDEQCCNSCEDVREAYRKKGWGVSNPDLI 166
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E + +K+ EGC IYG+LEVN+V+G+FH APG S+ +VHVHD+ P+ +
Sbjct: 167 DQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDS 226
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
FN +H I LSFG + PLDG M+ Y+IK++PT+Y ++
Sbjct: 227 FNVSHKINKLSFGQRFPG---VVNPLDGAQWMQHSSYGMYQYFIKVVPTVYTDINEHIIL 283
Query: 301 ----------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S G +PG+FF Y+LSP+ V TE+ S H T + + G +
Sbjct: 284 SNQFSVTEHFRSSESGRIQAVPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 343
Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
++D+ ++ + I K+EIG
Sbjct: 344 VSGIIDSFVYHGQRAIKKKMEIG 366
>gi|336465550|gb|EGO53790.1| hypothetical protein NEUTE1DRAFT_151014 [Neurospora tetrasperma
FGSC 2508]
gi|350295150|gb|EGZ76127.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
2509]
Length = 444
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 148/448 (33%), Positives = 203/448 (45%), Gaps = 86/448 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV L + +L + DY +V EL V
Sbjct: 1 MAGKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL PQ
Sbjct: 61 DKGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRL--------RPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + K ++ + DP+ CG CYGA + CC+TC EV+EAY
Sbjct: 113 EGGGEIDAKILSLHAADESATHLDPSYCGPCYGAPAPYNAKKPGCCSTCEEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + T+ QC+ E+ TE+L EGC+I G L VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGDGATMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232
Query: 237 DIQPYTSAAFNTTHHIRH----LSFGIKLQDDDERR---------------KPLDGTVAK 277
D+ + S H H L FG +L DD R+ PLD T +
Sbjct: 233 DLAQWWSTPVPGGHSFSHIIHSLRFGPQLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQE 292
Query: 278 AEEGASMFNYYIKIIPTIYERL----------------------------DGS------- 302
++ F Y++KI+PT Y L DGS
Sbjct: 293 TDDPNYNFMYFVKIVPTSYLPLGWEKQAAQNKATWEQDHSVGLGAYGYGSDGSMETHQYS 352
Query: 303 ------KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
L GGD GG+PG+FFSY++SP+ +V E++KS +
Sbjct: 353 VTSHKRSLTGGDDSKEGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFLGFLAGLC 412
Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
+ GT VD L ++ K+
Sbjct: 413 AVVGGTLTVAAAVDRGLFEGTVRLKKLR 440
>gi|170089933|ref|XP_001876189.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
gi|164649449|gb|EDR13691.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
Length = 421
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/416 (31%), Positives = 200/416 (48%), Gaps = 64/416 (15%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F LKG+DAF K ED KT G +TI+ I V+ DY V+ + VD
Sbjct: 5 FFANLKGVDAFGKTTEDVKVKTRTGALLTIISAAIILAFSFVEFIDYRAVNIDTSIVVDK 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-E 121
SRG KL ++L++ P + C L+LD +D SGE + HN+ K RLD GK + E
Sbjct: 65 SRGEKLTVNLNVTFPRVPCYLLSLDIMDISGELQRDISHNVMKVRLDTHGKEVPNSHSAE 124
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ N + K + + N CGSC+G CCNTC +V+ AY + W+
Sbjct: 125 LRNDLDK----------MNDAKRENYCGSCFGGLEPEGGCCNTCEDVRLAYVNRGWSFSN 174
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
+ I QCKNE +KLK EGC I G + VN+V G+ H++PG S+ N +++++ PY
Sbjct: 175 PEAIEQCKNEGWADKLKEQADEGCNISGRIRVNKVIGNIHLSPGRSFQTNARNLYELVPY 234
Query: 242 TSAAFNT---THHIRHLSFGIKLQDDDE------------RRK------PLDGTVAKAEE 280
N +H I HL+F + DDE R++ PLDG +A+ +
Sbjct: 235 LRDDGNRHDFSHTIHHLAF----EGDDEYDYWKAAAGSAMRQRMGLTENPLDGAIARTAK 290
Query: 281 GASMFNYYIKIIPTIYERLDGSKL-----------------------GG-----GDGGMP 312
MF Y++K++ T + LDG K+ GG G G+P
Sbjct: 291 AQYMFQYFLKVVSTQFRTLDGRKVNTHQYSTTQFERDLTEGAAGETAGGIHVQHGVSGLP 350
Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
G FF++E+SP++V E +S H T I G ++D++L + +++ K
Sbjct: 351 GAFFNFEISPILVVHAETRQSFAHFLTSTCAIIGGVLTVASIIDSILFATNRRLKK 406
>gi|402218655|gb|EJT98731.1| ER to Golgi transport-related protein [Dacryopinax sp. DJM-731 SS1]
Length = 455
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 137/440 (31%), Positives = 203/440 (46%), Gaps = 84/440 (19%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
VFS+ LKGLDAF K ED KT G +T++ I + ++ DY ++ + VD
Sbjct: 6 VFSQ-LKGLDAFGKTMEDVKVKTRTGALLTLISACIIVFFTLMEFVDYRRIHLATSVVVD 64
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE--PQ 119
SRG KL ++++I P + C L+LD +D SGE+ V HN+ + RL G PI + P+
Sbjct: 65 RSRGEKLLVNMNITFPRVPCYLLSLDVMDISGERQHDVTHNMQRVRLSPQGIPIPDVLPE 124
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ N ++K E + +CGSCYG + CCNTC +V+EAY + W+
Sbjct: 125 SGLSNEIEK----------VIEAREGGECGSCYGGDPPASGCCNTCEDVREAYMRRGWSF 174
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+ I QC NE TEK+K+ EGC I G + VN+V G+FH +PG S+ N +HVHD+
Sbjct: 175 SSPEDIKQCVNEGWTEKVKSQSEEGCNISGRVRVNKVIGNFHFSPGKSFQTNAMHVHDLV 234
Query: 240 PYTSAA--FNTTHHIRHLSFGIKLQDDDE--------------RRKPLDGTVA------- 276
PY A + H I + F + E + PLDG A
Sbjct: 235 PYLKDANRHDFGHEIHYFGFESDGEQQAEVGRLSKSIKTKLGIDKNPLDGLRAHVRSLSR 294
Query: 277 -------------------KAEEGASMFNYYIKIIPTIYERLDGS--------------K 303
+ E+ MF Y++K++ T YE L G+
Sbjct: 295 RETRRVPGMSSNRRSYRPEQTEKSNYMFQYFLKVVSTKYEMLRGTVVNSHQYSVTSYERD 354
Query: 304 LGGGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L GD G+PG FF++E+SP++V E +S H T + G
Sbjct: 355 LSQGDKAQRDEHGTMTSHGVSGIPGAFFNFEISPMVVVHQETRQSFAHFLTSTCAIVGGV 414
Query: 349 YITFMLVDALLHSCVKKISK 368
+ D++L S +K+ K
Sbjct: 415 LTVAAIFDSMLFSAERKLKK 434
>gi|367052857|ref|XP_003656807.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
gi|347004072|gb|AEO70471.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
Length = 436
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 150/440 (34%), Positives = 204/440 (46%), Gaps = 78/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG +TIV + + +L + DY +V EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSIIVVLFLAWGEWADYRRVVVHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL P
Sbjct: 61 DKGRGERMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVTKTRL--------RPLS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K + DP+ CG CYGA+ T CCNTC+EVKEAY +
Sbjct: 113 EGGGDIDSKALALHAADEAAIHLDPSYCGPCYGAKPPTTAKKPGCCNTCDEVKEAYAQQA 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA D I QC+ E+ E+L EGC+I G L VN+V G+FHIAPG S+S +VHVH
Sbjct: 173 WAFGRGDGIEQCEREHYGERLDEQRREGCRIEGGLRVNKVVGNFHIAPGRSFSNGNVHVH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEG 281
D++ Y T TH I HL FG +L D ++ PLDGT + ++
Sbjct: 233 DLKNYWDTPTKHTFTHIIHHLRFGPQLPDSLHKKLGTKHLPWTNHHLNPLDGTSQETDDV 292
Query: 282 ASMFNYYIKIIPTIYERL------------------------DGS-------------KL 304
+ Y+IKI+PT Y L DGS L
Sbjct: 293 NFNYMYFIKIVPTSYLPLGWEKTWAGFREEHQAELGSFGTSADGSVETHQYSVTSHKRSL 352
Query: 305 GGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYI 350
GGD GG+PG+FFSY++SP+ ++ E+SK+ + + GT
Sbjct: 353 AGGDDAAEGHRERLHAKGGIPGVFFSYDISPMKVINREERSKTFLGFIAGLCAIVGGTLT 412
Query: 351 TFMLVDALLHSCVKKISKVE 370
VD L ++ K+
Sbjct: 413 VAAAVDRALFEGTVRLKKLR 432
>gi|85115136|ref|XP_964815.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
gi|28926610|gb|EAA35579.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
Length = 444
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 148/448 (33%), Positives = 202/448 (45%), Gaps = 86/448 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV L + +L + DY +V EL V
Sbjct: 1 MAGKWRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL PQ
Sbjct: 61 DKGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRL--------RPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + K ++ + DP+ CG CYGA + CC+TC EV+EAY
Sbjct: 113 EGGGEIDAKVLSLHAADESATHLDPSYCGPCYGAPAPYNAKKPGCCSTCEEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + T+ QC+ E+ TE+L EGC+I G L VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGDGATMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232
Query: 237 DIQPYTSAAFNTTHHIRH----LSFGIKLQDDDERR---------------KPLDGTVAK 277
D+ + S H H L FG +L DD R+ PLD T +
Sbjct: 233 DLAQWWSTPVPGGHSFSHIIHSLRFGPQLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQE 292
Query: 278 AEEGASMFNYYIKIIPTIYERL----------------------------DGS------- 302
+ F Y++KI+PT Y L DGS
Sbjct: 293 TNDPNYNFMYFVKIVPTSYLPLGWEKQAAQNKAAWEQDHSVGLGAYGYGSDGSMETHQYS 352
Query: 303 ------KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
L GGD GG+PG+FFSY++SP+ +V E++KS +
Sbjct: 353 VTSHKRSLTGGDDSKEGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFLGFLAGLC 412
Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
+ GT VD L ++ K+
Sbjct: 413 AVVGGTLTVAAAVDRGLFEGTVRLKKLR 440
>gi|388856238|emb|CCF50047.1| uncharacterized protein [Ustilago hordei]
Length = 435
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 128/413 (30%), Positives = 203/413 (49%), Gaps = 62/413 (15%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+G+DAF+K +D +T G +T++ L I L + DY V L VD SRG
Sbjct: 9 QLRGIDAFSKTMDDVRIRTNAGALITLISALLILVLTIGEYVDYRTVHLKPALEVDRSRG 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL ++++I P + C L+LD +D SGE ++H+I + R+ DGK +
Sbjct: 69 EKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRISQDGKV----------S 118
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
++ K + + + CG CYG + CCNTC+EV+EAY K W+ + D +
Sbjct: 119 IQGTKSLKGDAARIANTKGKDYCGDCYGGQPPASGCCNTCDEVREAYVRKGWSFSDPDHV 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +EK+K EGC+I G L VN+V GSFH++PG ++ N +H+HD+ PY S +
Sbjct: 179 EQCVAEGWSEKIKEQNKEGCRISGKLHVNKVVGSFHLSPGRAFQRNSMHIHDLVPYLSGS 238
Query: 246 FNTTHHIRHL----SFGIK-----LQDDDER--------RKPLDGTVAKAEEGASMFNYY 288
H H+ SFG + L ER + PL+G A+ +E MF Y+
Sbjct: 239 GAEHHDFGHIIHEFSFGSEQEYHGLTTAKERAVKDKLGVKDPLEGVRARTKESQYMFQYF 298
Query: 289 IKIIP------------------TIYER-----------------LDGSKLGGGDGGMPG 313
+K++ T YER G+++ G G+PG
Sbjct: 299 LKVVSTEFRPLAGETLKTQQYSVTTYERDLSPGANAAALAGLSNEGSGARISHGFAGVPG 358
Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+FF+YE+SPL +E +SL H T + G ++D+L+++ +++
Sbjct: 359 VFFNYEISPLKTIHSEYRQSLSHFLTSTCAIVGGILTVAGILDSLIYNSGRRL 411
>gi|402590490|gb|EJW84420.1| hypothetical protein WUBG_04668 [Wuchereria bancrofti]
Length = 341
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 187/352 (53%), Gaps = 39/352 (11%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M ERLK DA+TKP +DF +T GGAVT+V I ++ + + V E+L+V
Sbjct: 1 MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60
Query: 61 DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
DS+ ++ ++ DI P + C + +D +D SG+ ++ ++YK L L+GK +
Sbjct: 61 DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIKDDVYKISL-LNGKEGNGIR 119
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKK 176
+ V N TTT P CGSCYGA+ CCNTC EVKEAY K
Sbjct: 120 QGV------------NINTTTVSSAPASQILCGSCYGAKD---GCCNTCEEVKEAYIKKG 164
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W L ++T+ QCK++ +K+ EGC++YG ++V +V+G+FHIAPG + H H
Sbjct: 165 WELVNIETVEQCKSDLWVKKMNEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLKAHRSHFH 224
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT-VAKAEEGASMFNYYIKIIPTI 295
D+ + + F+T+H + HLSFG + PLDG A++ M+ Y++K++PT
Sbjct: 225 DLHSLSPSKFDTSHTVNHLSFGNSFPG---KVYPLDGKFFGSAKDSGIMYQYHLKLVPTS 281
Query: 296 YERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSK 332
Y LD ++ + G G+PG F YE SPLMVK E+ +
Sbjct: 282 YVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFIQYEFSPLMVKYEERRQ 333
>gi|170586880|ref|XP_001898207.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
putative [Brugia malayi]
gi|158594602|gb|EDP33186.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
putative [Brugia malayi]
Length = 341
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 187/352 (53%), Gaps = 39/352 (11%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M ERLK DA+TKP +DF +T GGAVT+V I ++ + + V E+L+V
Sbjct: 1 MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60
Query: 61 DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
DS+ ++ ++ DI P + C + +D +D SG+ ++ ++YK L L+GK +
Sbjct: 61 DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIKDDVYKISL-LNGKEGNGIR 119
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKK 176
+ V N TTT P CGSCYGA+ CCNTC EVKEAY K
Sbjct: 120 QGV------------NINTTTVSSVPASQILCGSCYGAKD---GCCNTCEEVKEAYIKKG 164
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W L ++T+ QCK++ +K+ EGC++YG ++V +V+G+FHIAPG + H H
Sbjct: 165 WELVNIETVEQCKSDLWVKKMNEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLKAHRSHFH 224
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT-VAKAEEGASMFNYYIKIIPTI 295
D+ + + F+T+H + HLSFG + PLDG A++ M+ Y++K++PT
Sbjct: 225 DLHSLSPSKFDTSHTVNHLSFGNSFPG---KVYPLDGKFFGSAKDSGIMYQYHLKLVPTS 281
Query: 296 YERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSK 332
Y LD ++ + G G+PG F YE SPLMVK E+ +
Sbjct: 282 YVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFIQYEFSPLMVKYEERRQ 333
>gi|242803029|ref|XP_002484091.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
stipitatus ATCC 10500]
gi|218717436|gb|EED16857.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
stipitatus ATCC 10500]
Length = 440
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 148/442 (33%), Positives = 205/442 (46%), Gaps = 82/442 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VT+V + I +L+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARVRTTSGGIVTLVSLVVILWLVWGEWADYRRVVVLPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IHL++ P + C+ L LD +D SGEQ + V H + K RL P+ E K
Sbjct: 61 DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQMGVVHGLNKVRL----SPVAEGGK 116
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
V V K ++ +N +P CG C GA T CCNTC EV+EAY K
Sbjct: 117 --VIDVAKLELHAQNEVAVHL--NPEYCGQCGGAPPPPNTNKPGCCNTCEEVREAYALKS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + I QC+ E EK+ EGC+I G + VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGKGENIEQCQREGYAEKINAQRREGCRIEGDIRVNKVIGNFHIAPGRSFSTGNMHVH 232
Query: 237 DIQPYTSAAFN------TTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEG 281
D+ Y + +H I L FG +L D+ RR PLD T +E
Sbjct: 233 DLDTYMDRELSDNEKHTMSHIIHQLRFGPQLSDELSRRWQWTDHHHTNPLDDTQQFTDEP 292
Query: 282 ASMFNYYIKIIPTIY----------ERLDG------------------------------ 301
A +NYYIK++ T Y ++L G
Sbjct: 293 AYNYNYYIKVVSTSYLPLGWDSSQSDQLHGDDQSTPLGLHGAVHGAAGSLETHQYSVTSH 352
Query: 302 --SKLGGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
S GG D GG+PG+FF+Y++SP+ V E + K+ T + I
Sbjct: 353 KRSLHGGNDAAEGHKERVHAEGGIPGVFFNYDISPMKVVNREVRPKTFTGFLTGVCAVIG 412
Query: 347 GTYITFMLVDALLHSCVKKISK 368
GT VD L+ +++ K
Sbjct: 413 GTLTVAAAVDRFLYEGSRRMRK 434
>gi|452842116|gb|EME44052.1| hypothetical protein DOTSEDRAFT_71753 [Dothistroma septosporum
NZE10]
Length = 436
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 142/441 (32%), Positives = 203/441 (46%), Gaps = 82/441 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VT+ L I YL+ + DY +++ EL V
Sbjct: 1 MPAKSRFTKLDAFTKTVEDARIRTTSGGIVTVTSLLLILYLVWGEWADYRRITVHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IH+++ P + C+ L LD +D SGE V H + K RL P+
Sbjct: 61 DKGRGEKMEIHMNVSFPRVPCELLTLDVMDVSGEVQTGVMHGVNKVRL--------RPEA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E ++KK + L DP+ CG CYGA + CCNTC EV+EAY
Sbjct: 113 EGGGEIEKKALDLGVEEAAQHL-DPDYCGECYGAPAPSNAAKPGCCNTCAEVREAYAGVS 171
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC+ E+ +E L EGC+I G + VN+V G+FH APG S+S ++HVH
Sbjct: 172 WSFGRGENVEQCEREHYSEHLDAQRKEGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVH 231
Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAE 279
D++ + ++ TH I L FG +L DD + PLDGT E
Sbjct: 232 DLENFFNSPEGIQHTFTHKIHSLRFGPQLPDDVVNKVGKRGIAWSEHHLNPLDGTSQVTE 291
Query: 280 EGASMFNYYIKIIPTIYERL----DGS--------------------------------- 302
E + F Y++K++ T Y L GS
Sbjct: 292 EKSYNFMYFVKVVSTAYLPLAWKPSGSLLDLPHELVELGGYGKGEGGSIETHQYSVTSHK 351
Query: 303 -KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
L GGD GG+PG+FFSY++SP+ V E ++K+ T + I G
Sbjct: 352 RSLQGGDANEEGHKERLHARGGIPGVFFSYDISPMKVVNREARTKTFTGFLTGVAAVIGG 411
Query: 348 TYITFMLVDALLHSCVKKISK 368
T VD L++ +++ K
Sbjct: 412 TLTVAAAVDRLMYEGGQRVRK 432
>gi|440473660|gb|ELQ42442.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae Y34]
gi|440486294|gb|ELQ66175.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae P131]
Length = 444
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 145/448 (32%), Positives = 201/448 (44%), Gaps = 86/448 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG +TIV + + YL + DY ++ EL V
Sbjct: 1 MAPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLAWGEWADYRRIDIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL PQ
Sbjct: 61 DKSRGDRMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVIKVRL--------RPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K + DPN CG CYGA CCNTC+EV+EAY
Sbjct: 113 EGGGVIDAKTLALHAEDEAATHLDPNYCGGCYGAPAPANAKKAGCCNTCDEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC E+ E+L EGCQI G L VN+V G+FH+APG S+S ++HVH
Sbjct: 173 WAFGRGENVEQCTREHYAERLDEQRHEGCQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGI--------KLQDDDERR-------KPLDGTVAK 277
D++ Y + +H I L FG KL + D+ PLDG +
Sbjct: 233 DLKNYWDTPVEGGHSFSHTIHSLRFGPQLPPSALEKLGNKDKNMPWTNHHINPLDGVIQT 292
Query: 278 AEEGASMFNYYIKIIPTIYERL-----------------------DGS------------ 302
+ + Y++KI+PT Y L DGS
Sbjct: 293 TVDPNFNYMYFVKIVPTSYLPLGWEKRTHLATMHDHGVGTYGYSGDGSVETHQYSVTSHK 352
Query: 303 -KLGGGD-------------GGMPGIFFSY-----ELSPLMVKITE-KSKSLGHLWTKIM 342
L GGD GG+PG+FFSY ++SP+ V E ++K+ T +
Sbjct: 353 RSLAGGDDGEDGHKERMHSRGGIPGVFFSYPFCPQDISPMKVINREVRTKTFAGFLTGLC 412
Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
+ GT +D + V +I K++
Sbjct: 413 AILGGTLTVAAAIDRMTFEGVTRIKKMQ 440
>gi|302923326|ref|XP_003053651.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734592|gb|EEU47938.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 437
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 145/443 (32%), Positives = 205/443 (46%), Gaps = 83/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV + + +L + +Y +V EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLIVVIFLAWGEWSEYRRVEIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V H + K RL +PQ
Sbjct: 61 DRGRGERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------QPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRK--CCNTCNEVKEAYRYKK 176
+ + K ++ + DP+ CG CYGA+ RK CC TC+EV+EAY
Sbjct: 113 KGGADIDSKSLSLHDDAAAHL--DPSYCGGCYGAQPPANARKAGCCQTCDEVREAYAQAS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ EKL EGC+I G L VN+V G+FH APG S+S ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYAEKLDAQREEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230
Query: 237 DIQPYTSA----AFNTTHHIRHLSFGIKLQDDDERR------------KPLDGTVAKAEE 280
D++ Y A A + TH I L FG +L D+ R+ PLDGT ++
Sbjct: 231 DLKNYWDAPKGKAHDFTHIIHSLRFGPQLPDEVARKVGKGTPWTNHHQNPLDGTRQDIKD 290
Query: 281 GASMFNYYIKIIPTIYERL--------------------------DGS------------ 302
F Y++KI+PT Y L DGS
Sbjct: 291 PNFNFMYFVKIVPTSYLPLGWDSKGLKIAGLLQDDTSLGAYGYAEDGSVETHQYSVTSHK 350
Query: 303 -KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISG 347
L GG+ GG+PG+FFSY++SP+ +V EK K+ + + G
Sbjct: 351 RSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKGKTFSGFLAGLCAIVGG 410
Query: 348 TYITFMLVDALLHSCVKKISKVE 370
T VD L ++ K+
Sbjct: 411 TLTVAAAVDRGLFEGAARLKKMR 433
>gi|148674215|gb|EDL06162.1| ERGIC and golgi 3, isoform CRA_b [Mus musculus]
Length = 269
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/259 (43%), Positives = 161/259 (62%), Gaps = 17/259 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 17 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 76
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 77 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 134
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DPN+C SCYGAE+E KCCN+C +V+EAYR + WA DTI
Sbjct: 135 LGKVEVTVFDPNSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 190
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVH ++ + +
Sbjct: 191 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 250
Query: 246 F-----------NTTHHIR 253
F N TH+I+
Sbjct: 251 FGLDNPSDCLQINMTHYIK 269
>gi|320592791|gb|EFX05200.1| copii-coated vesicle membrane protein [Grosmannia clavigera kw1407]
Length = 440
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 144/446 (32%), Positives = 205/446 (45%), Gaps = 86/446 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV + + YL + DY +V EL V
Sbjct: 1 MAAKSRFTRLDAFTKTVEDARIRTTSGGVVTIVSLIVVLYLAWGEWLDYRRVIIRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P I C+ L LD +D SGEQ V+H + RL EPQ
Sbjct: 61 DKGRGERMEIHLNITFPRIPCELLTLDVMDVSGEQQHGVQHGVRMVRL--------EPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
+ ++ K + + + L DP CG CYGA CCNTC+EV+EAY
Sbjct: 113 RGGSEIEVKTLDL-HADAASHL-DPEYCGPCYGATPPQHAIKTGCCNTCDEVREAYASSS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC+ E+ E++ EGC+I G L VN+V G+FHIAPG S+S ++HVH
Sbjct: 171 WAFGKGENVEQCQREHYAERIDEQRHEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNMHVH 230
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR----------------KPLDGTVA 276
D++ Y T + TH + L FG +L + ++ PLDG +
Sbjct: 231 DLKNYWDMPTPNLHSFTHTVHSLRFGPQLPESLQKTLAGGGAKGQPWTNHHINPLDGVMQ 290
Query: 277 KAEEGASMFNYYIKIIPTIYERL-------------------------DGS--------- 302
+ + + Y+IKI+PT Y L DGS
Sbjct: 291 QTSDPNFNYMYFIKIVPTSYLALGWEKTFRGFVDDHDSADVGSYGLLADGSVETHQYSVT 350
Query: 303 ----KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCN 344
L GGD GG+PG+FFSY++SP+ +V E++K+ +
Sbjct: 351 SHKRSLQGGDDAAEGHQERLHARGGIPGVFFSYDISPMKVVNREERAKTFAGFLAGLCAI 410
Query: 345 ISGTYITFMLVDALLHSCVKKISKVE 370
I GT VD + ++ K+
Sbjct: 411 IGGTLTVAAAVDRTVFEGTIRLKKMR 436
>gi|414586930|tpg|DAA37501.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 268
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 104/253 (41%), Positives = 155/253 (61%), Gaps = 3/253 (1%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+V + L ++ Y T L VD+SRG
Sbjct: 7 KLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C ++LDA+D SG++HL V+H+++K+R+D G I Q +VV
Sbjct: 67 ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DVVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K + +G E CGSCYGA+ +CCNTC +V+EAYR K W + D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLL 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E + +K+ EGC IYG++EVN+V+G+FH APG S+ ++VHVHD+ P+ +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243
Query: 246 FNTTHHIRHLSFG 258
FN +H I LSFG
Sbjct: 244 FNVSHKINRLSFG 256
>gi|390603136|gb|EIN12528.1| endoplasmic reticulum-derived transport vesicle ERV46 [Punctularia
strigosozonata HHB-11173 SS5]
Length = 419
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 131/419 (31%), Positives = 196/419 (46%), Gaps = 58/419 (13%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LKGLDAF K ED KT G +T + I ++ DY +V+ + VD SRG
Sbjct: 10 LKGLDAFGKTMEDVKVKTRTGAFLTFLSAAIILTFTMIEFVDYRRVNMDTSIVVDKSRGE 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNA 125
KL + +++ P + C L+LD +D SGEQ + HNI K RLD GK I Q+ E+ +
Sbjct: 70 KLTVRMNVTFPRVPCYLLSLDVMDISGEQQRDISHNILKTRLDSTGKLIPGSQRSELESE 129
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
++ +G CGSCYGAE CCN+C+ V++AY + W+ D+I
Sbjct: 130 FDRQNKPMPDGY----------CGSCYGAEPSEGACCNSCDAVRQAYVNRGWSFGNPDSI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +EKLK+ +EGC I G + VN+V G+ H++PG S+ ++++ PY
Sbjct: 180 EQCVKENWSEKLKDQASEGCNIAGRVRVNKVIGNIHLSPGRSFQSQGRSMYELVPYLRED 239
Query: 246 FNTTHHIRHLSFGIKLQDDDE------------RRK------PLDGTVAKAEEGASMFNY 287
N H H + DDE R K PLDG V + + MF Y
Sbjct: 240 GN-RHDFSHTIHEFAFEGDDEYLPDKYKVSKEMRAKMGLEAGPLDGAVGRTIKAQYMFQY 298
Query: 288 YIKIIPTIYERLDGSKL----------------GGGDG------------GMPGIFFSYE 319
++K++ T + LDG + G D G+PG FF++E
Sbjct: 299 FLKVVSTQFRTLDGQTVNSHQYSATHFERDLDKGSEDNTAEGVHISHTTYGVPGAFFNFE 358
Query: 320 LSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
+SP+++ +E +S H T + G +VD++L + K + K G K
Sbjct: 359 ISPILIVHSETRQSFAHFLTSTCAIVGGVLTIASIVDSVLFATTKALKKGASGSAASGK 417
>gi|134054958|emb|CAK36967.1| unnamed protein product [Aspergillus niger]
Length = 406
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 136/417 (32%), Positives = 194/417 (46%), Gaps = 62/417 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +TI L I +L+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWADYRRVVVMPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG K+ IHL++ P + C+ L LD +D SGEQ V H I K RL
Sbjct: 61 DKSRGEKMEIHLNVTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRL------------ 108
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
+A + +V + DP+ CG CYGA CCNTC+EV+EAY ++
Sbjct: 109 --TSAAEGGRVIDVKALELAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQ 166
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC+ E E++ EGC++ G L VN+V G+FHIAPG S++ ++HVH
Sbjct: 167 WAFGKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVH 226
Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
D+ + A TH I L FG +L D D PLDGT + E
Sbjct: 227 DLANFFDADLPDAEKHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDGTKQETNEP 286
Query: 282 ASMFNYYIKIIPTIYERLDGS---------------KLGGGDG-------------GMPG 313
+ Y++K++ T Y L L GGD G+PG
Sbjct: 287 GYNYMYFVKVVSTSYLPLGWDPLIETHQYSVTSHKRSLMGGDASDEGHKERLHAANGIPG 346
Query: 314 IFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
+F +Y++SP+ V E + K+ T + I GT +D L+ V ++ K+
Sbjct: 347 VFVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGVSRMKKL 403
>gi|378732932|gb|EHY59391.1| hypothetical protein HMPREF1120_07381 [Exophiala dermatitidis
NIH/UT8656]
Length = 437
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 144/445 (32%), Positives = 201/445 (45%), Gaps = 81/445 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV L + YLI + DY ++ EL V
Sbjct: 1 MPAKTRFTRLDAFTKTVEDARIRTTSGGIVTIVSILVVIYLILGEWADYRRIVVQPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IHL+I P I C+ L LD +D SGEQ V H + K RL + +
Sbjct: 61 DKGRGEKMEIHLNITFPRIPCELLTLDVMDVSGEQQSGVVHGVNKVRLTSVAEGSRVIDT 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
+ + ++ +V++ DP+ CGSCY A CCNTC+EV+EAY
Sbjct: 121 QALQLHQQAEVSSH--------LDPDYCGSCYSAPAPPNAKKPGCCNTCDEVREAYAANS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E +L EGC+I G + VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGRGEGVEQCEREGYGARLDEQRHEGCRIEGVIRVNKVVGNFHIAPGRSFSNGNMHVH 232
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDER---------RKPLDGTVAKAEEGAS 283
D+ + TH I L FG +L D + + PLDG + +E
Sbjct: 233 DLNNFFDTPIEGGHTFTHEIHSLRFGPQLSDQEAKWTGADHHLNANPLDGLRQETDEPGY 292
Query: 284 MFNYYIKIIPTIYERLD--------------------------GSK-------------- 303
F Y+IK++ T Y L GS+
Sbjct: 293 NFMYFIKVVSTSYLPLGWDEDKSIQQHSSLSDLIPLGMHGKGAGSQGSIETHQYSVTSHK 352
Query: 304 --LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
L GG+ GG+PG+FFSY++SP+ V E + KS + T + I G
Sbjct: 353 RSLAGGNDAAEGHKERLHAHGGIPGVFFSYDISPMKVINREVRPKSFANFLTGVCAVIGG 412
Query: 348 TYITFMLVDALLHSCVKKISKVEIG 372
T +D L+ ++ KV G
Sbjct: 413 TLTVAAAIDRGLYEGATRLKKVHQG 437
>gi|440301578|gb|ELP93964.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba invadens IP1]
Length = 363
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 126/378 (33%), Positives = 188/378 (49%), Gaps = 30/378 (7%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DA+ K ED K +GG +TIVC + I L+ + Y Q T +L VD R
Sbjct: 1 MKRFDAYGKVPEDLQVKHGFGGIMTIVCGILIGILVLTEFRYYLQREVTPQLIVDRERDE 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ +H DI P SC ++D + SGE + +E NI K RL+ +G P+ E +
Sbjct: 61 KIKVHFDITFPFSSCPITSVDVLTKSGESMIDIEKNITKTRLNKNGVPLTESEL------ 114
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K T + + D C SCYGAET +RKCC TC++V EAY+ + W L + TI
Sbjct: 115 ---KATQQKLNANIKTVDQKTCRSCYGAETPSRKCCYTCDDVIEAYKERGWNL-NIRTIA 170
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC N E K T EGC++ G L +N++ G+FHIAPG S + H H+I+
Sbjct: 171 QCDNSEKLEMAKLTLEEGCRVEGNLLLNKIGGNFHIAPGTSDNTWTGHHHNIEWTGRTKI 230
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL-- 304
+ TH LSFG E K G+ A+ MF Y++ +IP ++G+K
Sbjct: 231 DLTHTWNDLSFG-------EGSKTYSGSKKDAKMNG-MFQYFLTLIPKKNNFINGTKFVY 282
Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
G G PG+F Y++SP+++++ E + H + I G + F L+
Sbjct: 283 DFVINEQTRSGQGEGEPGVFVYYDVSPMLLEVNEFNHGFLHFLIGVCAIIGGVFTVFQLI 342
Query: 356 DALLHSCVKKIS-KVEIG 372
DA + + + K+E+G
Sbjct: 343 DAFVFDSIHTLQKKIELG 360
>gi|296417040|ref|XP_002838173.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634087|emb|CAZ82364.1| unnamed protein product [Tuber melanosporum]
Length = 399
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 141/410 (34%), Positives = 203/410 (49%), Gaps = 49/410 (11%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M RL LDAFTK ED +T GG VT+V + L+ + +Y ++ EL V
Sbjct: 1 MGRGSRLTRLDAFTKTVEDARVRTTSGGIVTLVSLFVVFVLVVGEFREYRRIQVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D +RG +LPI L+I P I C+ L LD +D SGEQ + H I+ RL P E +
Sbjct: 61 DKTRGEQLPISLNITFPHIPCELLTLDVMDVSGEQQSSITHGIHLTRL----TPFPESKP 116
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRKCCNTCNEVKEAYRYKKWA 178
++ + T + DP CG CYGA + + CC TC +V+EAY WA
Sbjct: 117 VSTTSLNVHEDTASH-------LDPAYCGKCYGAPGPEKDKGCCQTCEDVREAYASIGWA 169
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ + + QC+ E+ E+L EGC I G+L VN+V G+FHIAPG S+S +HVHD+
Sbjct: 170 FGKGEGVEQCEREHYAERLDEMREEGCNIAGHLSVNKVIGNFHIAPGKSFSSAQMHVHDL 229
Query: 239 QPY--TSAAFNTTHHIRHLSFGIKLQDDDE-RRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
Y ++ TH I HLSFG L + + +R PLD + +E + F Y+IK++ T
Sbjct: 230 NQYFASTKEHTFTHTIHHLSFGPDLPANVKVQRNPLDDSRQVTQERSFNFMYFIKVVSTS 289
Query: 296 YERLDGSK----------------------LGGGD----------GGMPGIFFSYELSPL 323
Y L S+ +GG D GG+PG+FFSY++SP+
Sbjct: 290 YLPLGTSENSYIPGAIETHQYSVTSHKRSLMGGADKEHASTIHARGGIPGVFFSYDISPM 349
Query: 324 MVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
V E ++KS T + I GT +D L+ ++ K+ G
Sbjct: 350 KVINREVRAKSFAGFLTGVCAVIGGTLTVAAAIDRGLYEGGMRVKKLHQG 399
>gi|443894052|dbj|GAC71402.1| hypothetical protein PANT_3d00017 [Pseudozyma antarctica T-34]
Length = 461
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/407 (31%), Positives = 195/407 (47%), Gaps = 62/407 (15%)
Query: 14 TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLD 73
+K +D +T G +T+V L I L + DY V L VD SRG KL +++D
Sbjct: 44 SKTMDDVRIRTNAGALITMVSALLIVVLTIGEFVDYRTVHLKPSLEVDRSRGEKLTVNMD 103
Query: 74 IVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTT 133
I P + C L+LD +D SGE ++H+I + R+ DGKPI + +K + + T
Sbjct: 104 ITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRVTHDGKPITQGKKNLKGDAARIAAT- 162
Query: 134 ENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYS 193
+ + CG CYG + CCNTC+EV+EAY K W+ + D + QC E
Sbjct: 163 ---------KGKDYCGDCYGGQPPASGCCNTCDEVREAYVRKGWSFADPDHVDQCVAEGW 213
Query: 194 TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIR 253
++K+K EGC+I G L VN+V GSFH++PG ++ N VH+HD+ PY S H
Sbjct: 214 SDKIKEQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSVHIHDLVPYLSGTGAEHHDFG 273
Query: 254 HL----SFGIKLQ-----DDDER--------RKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
H+ SFG + Q ER + PL+G A+ ++ MF Y++K++ T +
Sbjct: 274 HIIHDFSFGSEQQYHGLTTAKEREVKQKLGVKDPLEGVRAQTQQSQFMFQYFLKVVSTEF 333
Query: 297 ERLDGSKL-----------------------------------GGGDGGMPGIFFSYELS 321
L G L G G+PG+FF+YE+S
Sbjct: 334 RPLSGDTLKTQQYSVTTYERDLSPGANAAAMAGMSNEGSGAHISHGFAGVPGVFFNYEIS 393
Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
PL +E +SL H T + G +VD+L+++ +++ +
Sbjct: 394 PLKTIHSEHRQSLSHFLTSTCAIVGGILTVAGIVDSLVYNSRRRLRR 440
>gi|255941116|ref|XP_002561327.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211585950|emb|CAP93687.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 412
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 136/417 (32%), Positives = 197/417 (47%), Gaps = 56/417 (13%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +TI L + +L+ + DY ++ EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARIRTNSGGVITIASLLIVMWLVWGEWADYRRIVVQPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IHL++ P + C+ L LD +D SGEQ + V H + K RL P
Sbjct: 61 DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLS--------PHN 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKK 176
E + + + + + + P+ CG C GA CC TC EV+EAY K+
Sbjct: 113 EGGKVIDVQALDLHSSSEAAKHLAPDYCGECGGATPPANVIKPGCCTTCEEVREAYAEKQ 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QCK E EKL EGC+I G L+VN+V G+FHIAPG S++ ++HVH
Sbjct: 173 WAFGDGSNIEQCKREGYAEKLAEQRREGCRIEGVLKVNKVVGNFHIAPGRSFTTGNMHVH 232
Query: 237 DIQPYT-----SAAFNTTHHIRH-LSFGIKLQ---------DDDERRKPLDGTVAKAEEG 281
D+ Y A +T H+ H L FG +L D PLD T + +E
Sbjct: 233 DLDAYVVPNAGPAEQHTMSHLVHELRFGPQLPTELAGRWGWTDHHHTNPLDDTKQETDEP 292
Query: 282 ASMFNYYIKIIPTIYERLDGS---------------KLGGGD-------------GGMPG 313
A F Y++K++ T Y L L GG+ GG+PG
Sbjct: 293 AYNFMYFVKVVSTSYLPLGWDPHIEAHQYSVTSHKRPLSGGNDAAEGHKERVHAGGGIPG 352
Query: 314 IFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
+FF+Y++SP+ V E + K+ + T + I GT +D L+ ++ K+
Sbjct: 353 VFFNYDISPMKVINREARPKTFTNFLTGVCAIIGGTLTVAAALDRGLYEGAMRVKKL 409
>gi|452980033|gb|EME79795.1| hypothetical protein MYCFIDRAFT_64499 [Pseudocercospora fijiensis
CIRAD86]
Length = 436
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 145/443 (32%), Positives = 202/443 (45%), Gaps = 86/443 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI L I YL + DY ++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVEDARVRTSTGGIVTIASLLLILYLTWGEWADYRKIIIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--LDGKPIQEP 118
D RG ++ IHL++ P + C+ L LD +D SGE V H I K RL DG + E
Sbjct: 61 DKGRGERMEIHLNVSFPRVPCELLTLDVMDVSGEVQTGVLHGINKVRLSSVADGSKVIEK 120
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRY 174
QK ++A EN P+ CG CYGA + CCNTC EV++AY
Sbjct: 121 QKLDLDAA-------ENSVHLA----PDYCGECYGAPAPDNAKKAGCCNTCAEVRDAYAS 169
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
W+ + + QC+ E+ +E+L EGC+I G L VN+V G+FH APG S+S ++H
Sbjct: 170 VSWSFGRGENVEQCEREHYSEQLDAQRKEGCRIEGALRVNKVVGNFHFAPGKSFSNGNLH 229
Query: 235 VHDIQPYTSAA---FNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKA 278
VHD+ Y ++ + THHI L FG L D ++R PLD T +
Sbjct: 230 VHDLDNYFNSGEVEHSFTHHIHRLRFGPPLPHDFDKRVGKKGMAWSNHHLNPLDDTHQET 289
Query: 279 EEGASMFNYYIKIIPTIYERLDGSK----------------------------------- 303
++ A F Y++K++ T Y L K
Sbjct: 290 DDSAFNFMYFVKVVSTAYLPLGWEKTNSFSRSLPHELIDLGDYGHGEQGSIETHQYSVTS 349
Query: 304 ----LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
L GGD GG+PG+FFSY++SP+ V E ++KS + I
Sbjct: 350 HKRSLQGGDAKDEGHKERVHARGGIPGVFFSYDISPMKVINRETRAKSFSGFLVGVCAVI 409
Query: 346 SGTYITFMLVDALLHSCVKKISK 368
GT VD +L+ +++ K
Sbjct: 410 GGTLTVAAAVDRMLYEGEQRVRK 432
>gi|406866287|gb|EKD19327.1| copii-coated vesicle membrane protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 453
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 148/458 (32%), Positives = 204/458 (44%), Gaps = 97/458 (21%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTI L + YL + DY +++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWTDYRRIAVHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IHL+I P I C+ L LD +D SGEQ V H + K RL P+
Sbjct: 61 DKGRGEKMEIHLNISFPRIPCELLTLDVMDVSGEQQTGVMHGVKKVRLG--------PEA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + + + T L DP+ CG CYGA + CCNTC EV+EAY
Sbjct: 113 EGGKEISIESLDLHGDDQATHL-DPDYCGGCYGATAPPNAKKAGCCNTCEEVREAYASVS 171
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ EKL EGC+I G + VN+V G+FHIAPG S+S ++HVH
Sbjct: 172 WAFGRGENVEQCEREHYGEKLDAQRKEGCRIEGGIRVNKVVGNFHIAPGRSFSNGNMHVH 231
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D+ Y THHI L FG +L + ++ PLD T A
Sbjct: 232 DLNNYFDTPVPGGHVFTHHIHSLRFGPQLPESVTKKLGNKALPWTNHHINPLDDTRQVAP 291
Query: 280 EGASMFNYYIKIIPTIYERL------------------------DGS------------- 302
E A F Y++K++PT Y L DGS
Sbjct: 292 ETAYNFMYFVKVVPTSYLPLGWDNSVTSEQRIDHVDIGSYGHLDDGSVETHQFSVTSHKR 351
Query: 303 KLGGGD-------------GGMPGIFFSY----------------ELSPL-MVKITEKSK 332
L GGD GG+PG+FFSY ++SP+ ++ E++K
Sbjct: 352 SLSGGDDGAEGHKEKLHSRGGIPGVFFSYVSSHFYPQKISTNKTQDISPMKVINREERAK 411
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
SL T + I GT VD ++ ++ K++
Sbjct: 412 SLAGFLTGLCAIIGGTLTVAAAVDRGVYEGTTRLKKMQ 449
>gi|61555552|gb|AAX46728.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
Length = 283
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 153/237 (64%), Gaps = 8/237 (3%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAE E KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSF 235
>gi|6598578|gb|AAF18633.1|AC006228_4 F5J5.4 [Arabidopsis thaliana]
Length = 440
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/387 (33%), Positives = 197/387 (50%), Gaps = 52/387 (13%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDV-CDYFQVSTTEELFVDSSR 64
+L+ LDA+ K EDF+ +T+ GG +T++ + + L ++ S +E + +
Sbjct: 7 KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRTSLSSYSHRDEAYSRYFK 66
Query: 65 GSKL--PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
G + + DI P ++C L++DA+D SGE HL V+H+I KRRLD +G I E +++
Sbjct: 67 GRDVTHQRNFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDG 125
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET-------------------------- 156
+ A K + ++G E CGSCYGAE
Sbjct: 126 IGATKIENPLQKHGGRLGHNE--TYCGSCYGAEAVIVLSLYLTLWSMVSQLSSEVCFFPV 183
Query: 157 -ETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNR 215
E CCN+C +V+EAYR K W + D I QCK E +++K+ EGC IYG+LEVN+
Sbjct: 184 QEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFLEVNK 243
Query: 216 VSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV 275
V+G+FH APG S+ + VHVHD+ + +FN +H I L++G PLD
Sbjct: 244 VAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPG---VVNPLDKVE 300
Query: 276 AKAEEGASMFNYYIKIIPTIYERLDG---------------SKLGGGDGGMPGIFFSYEL 320
+ +M+ Y+IK++PT+Y + G S G +PG+FF Y+L
Sbjct: 301 WSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYDL 360
Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISG 347
SP+ V TE+ S H T + C I G
Sbjct: 361 SPIKVTFTEEHISFLHFLTNV-CAIVG 386
>gi|409042254|gb|EKM51738.1| hypothetical protein PHACADRAFT_150385 [Phanerochaete carnosa
HHB-10118-sp]
Length = 422
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/416 (31%), Positives = 191/416 (45%), Gaps = 62/416 (14%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LKGLDAF K ED KT G +TI+ I + ++ DY +V+ + VD SRG
Sbjct: 9 LKGLDAFGKTMEDVKVKTRTGAFLTILSAAIILAITTMEFFDYRRVNVDTSIEVDKSRGE 68
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV--- 123
KL + ++ P + C L+LD +D SGE + HN+ K RL+ G P+ P ++V
Sbjct: 69 KLIVSFNVTFPRVPCYLLSLDVMDISGETQTDIVHNVIKTRLNEQGNPV--PANKIVELR 126
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
N + K ++G CGSCYG CCNTC +V++AY + W+ D
Sbjct: 127 NDIDKLNEQRQDGY----------CGSCYGGVEPAGGCCNTCEDVRQAYVNRGWSFTAPD 176
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
+I QC E +KL++ EGC G L VN+V G+ H++PG S+ +++DI PY
Sbjct: 177 SIEQCAQEGWADKLRDQANEGCNAAGKLRVNKVVGNIHLSPGRSFRSGSHNIYDIVPYLK 236
Query: 244 AAFNTTHHIRHLSFGIKLQDDDE-------------RR-----KPLDGTVAKAEEGASMF 285
N H H DDE RR PLDGT K + A MF
Sbjct: 237 EDGNR-HDFSHTVHAFAFAGDDEFNFQKADHGNSLKRRLGIADGPLDGTTQKTSKQAYMF 295
Query: 286 NYYIKIIPTIYERLDGSKLGG----------------------------GDGGMPGIFFS 317
Y++K++ T + LDG + G G+PG FF+
Sbjct: 296 QYFLKVVSTQFITLDGKSIKTHQHSATHFERDLSKGIAENSQQGMHVMHGMTGIPGAFFN 355
Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
YE+SP++V E +S H T + G L+D++L + KK+ K G
Sbjct: 356 YEISPILVVHRETRQSFAHFLTSTCAVVGGVLTVASLIDSMLFATSKKLKKSGTSG 411
>gi|346979363|gb|EGY22815.1| ER-derived vesicles protein ERV46 [Verticillium dahliae VdLs.17]
Length = 435
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 138/439 (31%), Positives = 199/439 (45%), Gaps = 77/439 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG +TIV + +L + DY +++ EL V
Sbjct: 1 MAGKSRFTRLDAFTKTVDEARIRTSSGGIITIVSLFIVFWLAWGEWADYRRITLHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IHL++ P + C+ L LD +D SGEQ + I K RL QK
Sbjct: 61 DKGRGEKMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGIVSGISKVRL--------RSQK 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----RKCCNTCNEVKEAYRYKK 176
+ + K ++ P+ CG CYGA+ + CCNTC EV+EAY
Sbjct: 113 DGGGVIDTKALSLHAADEAATHLAPDYCGDCYGAKAPANAVKQGCCNTCEEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC E+ E+L EGC+I G L VN+V G+FH+APG S+S ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAERLDEQRAEGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPYTSA--AFNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAEEG 281
D++ Y + TH I L FG +L + + PLDGT +
Sbjct: 233 DLKNYWDGDITHDFTHQIHALRFGPQLPESITKNLGNKATPWTNHHLNPLDGTSQITTDP 292
Query: 282 ASMFNYYIKIIPTIYERL-----------DGSKLG------------------------- 305
+ F Y++KI+PT Y L DG LG
Sbjct: 293 SFNFMYFVKIVPTSYLPLGWDSKRSPQDHDGGLLGSFGQGSDGSIETHQYSVTSHKRSLS 352
Query: 306 GGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYIT 351
GGD GG+PG+FFSY++SP+ ++ E+SKS T + I GT
Sbjct: 353 GGDDSAEGHAERLHTRGGIPGVFFSYDISPMKVINREERSKSFTGFLTGLCAVIGGTLTV 412
Query: 352 FMLVDALLHSCVKKISKVE 370
VD + ++ K+
Sbjct: 413 AAAVDRGMFEGSLRLKKIR 431
>gi|396471326|ref|XP_003838845.1| similar to endoplasmic reticulum-golgi intermediate compartment
protein 3 [Leptosphaeria maculans JN3]
gi|312215414|emb|CBX95366.1| similar to endoplasmic reticulum-golgi intermediate compartment
protein 3 [Leptosphaeria maculans JN3]
Length = 439
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 146/454 (32%), Positives = 204/454 (44%), Gaps = 103/454 (22%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VT+V + I +L + DY +V+ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVEDARVRTTSGGIVTLVSLVVIFWLTWGEWADYRRVTVRPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--------LDG 112
D RG ++ I L+I P + C+ L LD +D SGE + + H I K RL +D
Sbjct: 61 DKGRGERMEISLNITFPRMPCELLTLDVMDVSGELQMGITHGINKVRLSPEVDGSKVIDA 120
Query: 113 KPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEV 168
KP+ Q E + DP+ CG+CYGA T CCNTC+EV
Sbjct: 121 KPLDLHQDEASHL------------------DPSYCGNCYGAPPPTNAIKHGCCNTCDEV 162
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
++AY W+ + + QC+ E+ E L EGC++ G ++VN+V G+FHIAPG S+
Sbjct: 163 RDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGCRLEGSIKVNKVVGNFHIAPGKSF 222
Query: 229 SINHVHVHDIQPY--TSAAFNTTHHIRHLSFGIKL-----QDDDERR------------- 268
S ++HVHD++ Y A TH I HL FG +L QD ++
Sbjct: 223 SNGNLHVHDLENYFRDEYAHTFTHKIHHLRFGPQLSQAVVQDMAKKHMATGPGGWTNHHV 282
Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERL------DGSKLGGGD-------------- 308
PLD T + +E A + Y+IK++ T Y L DGS GG D
Sbjct: 283 NPLDHTEQRTDEKAFNYMYFIKVVSTAYLPLGWEKSADGSSSGGYDDLLGTTIHSVNKGS 342
Query: 309 --------------------------------GGMPGIFFSYELSPLMVKITE-KSKSLG 335
GG+PG+FFSY++SP+ V E + K+
Sbjct: 343 IETHQYSVTSHKRSLQGGSDEKEGHKERIHARGGIPGVFFSYDISPMKVINREMREKTFS 402
Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
+ I GT VD L+ V KI K+
Sbjct: 403 GFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKKI 436
>gi|299743758|ref|XP_002910702.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
okayama7#130]
gi|298405804|gb|EFI27208.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
okayama7#130]
Length = 416
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/410 (32%), Positives = 193/410 (47%), Gaps = 57/410 (13%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F LKG+DAF K ED KT G +T++ I + ++ DY +V + VD
Sbjct: 5 FFSTLKGIDAFGKTTEDVKVKTRTGAFLTLLSAAIILAITTMEFFDYRKVFIDTSIVVDR 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRG KL ++L++ P + C L+LD +D SGE + HN+ K RLD GK +
Sbjct: 65 SRGEKLTVNLNVTFPKVPCYLLSLDIMDISGEVQRDISHNVLKVRLDRSGKEVPGSHTAD 124
Query: 123 VNA-VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
++A V+K T + G CGSCYG CCNTC +V+ AY + W+
Sbjct: 125 LSADVEKLSHTKKEGY----------CGSCYGGLEPESGCCNTCEDVRMAYVNRGWSFTN 174
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D I QC+NE +KL++ EGC I G + VN+V G+ H++PG S+ N +++++ PY
Sbjct: 175 PDAIEQCRNEGWADKLRDQADEGCNISGRIRVNKVIGNIHMSPGRSFQSNSRNIYELVPY 234
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDE------------RRK------PLDGTVAKAEEGAS 283
N H H+ + DDE RR+ PLDG A+ +
Sbjct: 235 LRDDQN-RHDFSHIIHHFGFEGDDEYDYWKAEAGQKMRRRMGLTENPLDGIEARTWKSQY 293
Query: 284 MFNYYIKIIPTIYERLDGS--------------KLGGG----DG---------GMPGIFF 316
MF Y++K++ T + LDG LG G DG G+PG FF
Sbjct: 294 MFQYFLKVVSTRFRTLDGQTVNTHQYSTTSFERDLGEGMNQDDGGIRVQHGVSGLPGAFF 353
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+YE+SP+ V E +S H T I G LVD+ L K I
Sbjct: 354 NYEISPIQVVHAESRQSFAHFLTSTCAVIGGVLTVAALVDSALFVTAKAI 403
>gi|67524561|ref|XP_660342.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
gi|40743850|gb|EAA63036.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
gi|259486349|tpe|CBF84116.1| TPA: COPII-coated vesicle membrane protein Erv46, putative
(AFU_orthologue; AFUA_1G05120) [Aspergillus nidulans
FGSC A4]
Length = 437
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 144/446 (32%), Positives = 208/446 (46%), Gaps = 89/446 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ++ +T GG +TI L I +L + DY +V+ EL V
Sbjct: 1 MAAKSRFTRLDAFAKTVDEARIRTTSGGIITIASLLIIIWLTWGEWVDYRRVAVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--LDGKPIQEP 118
D SRG K+ IHL+I P + C+ LD +D SGEQ + V H + K RL +G + +
Sbjct: 61 DKSRGEKMEIHLNITFPRLPCELTTLDVMDVSGEQQVGVAHGVNKVRLAPAAEGGRVLDV 120
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRY 174
Q ++A + K + DP+ CG C GA CC+TC+EV+EAY
Sbjct: 121 QALQLHAEEAKHL------------DPDYCGECGGAPPPPNAIKPGCCSTCDEVREAYAQ 168
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
K+W + I QC+ E+ +E++ EGC++ G + VN+V G+FHIAPG S+S N+VH
Sbjct: 169 KQWGFGKGTNIEQCEREHYSERIDAQRREGCRLEGVIRVNKVVGNFHIAPGRSFSSNNVH 228
Query: 235 VHDIQPY-----TSAAFNTTHHIRH-LSFGIKLQD---------DDERRKPLDGTVAKAE 279
+HDI Y + A +T HI H L FG +L D D PLD T +A
Sbjct: 229 IHDIANYEERGLSPAEQHTMSHIIHSLRFGPQLPDELSDRWQWTDHHHTNPLDSTSQEAP 288
Query: 280 EGASMFNYYIKIIPTIYERLD--------------------------GSK---------- 303
E A F Y+IK++ T Y L GS+
Sbjct: 289 EPAYSFMYFIKVVSTSYLPLGWDPLYSASLHAAADTNTPLGAQGLSAGSQGSIETHQYSV 348
Query: 304 ------LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMC 343
L GGD GG+PG+FF+Y++SP+ V E + K+ T +
Sbjct: 349 TSHKRSLRGGDASDEAHKERIHAAGGIPGVFFNYDISPMKVINREARPKTFTGFLTGVCA 408
Query: 344 NISGTYITFMLVDALLHSCVKKISKV 369
+ GT +D L+ V ++ K+
Sbjct: 409 IVGGTLTVAAAIDRTLYEGVSRVRKL 434
>gi|336265645|ref|XP_003347593.1| hypothetical protein SMAC_04901 [Sordaria macrospora k-hell]
gi|380096460|emb|CCC06508.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 428
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 140/433 (32%), Positives = 199/433 (45%), Gaps = 72/433 (16%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV L + +L + DY +V EL V
Sbjct: 1 MAGKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V+H + K RL PQ
Sbjct: 61 DKGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRL--------RPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + K + + DP+ CG CYGA + CC+TC E++EAY
Sbjct: 113 EGGGEIDAKVLALHAADESATHLDPSYCGPCYGAPAPYNAKKAGCCSTCEEIREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + T+ QC+ E+ TE+L EGC+I G L VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGDGSTMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDD----DERRKPLDGTVAKAEEGASMFNYYIKII 292
D+ + ++ +R L G + + + PLD T + ++ F Y++KI+
Sbjct: 233 DLAQWWNSPL-PDDLVRKLGGGKDGKRNTLWTNHHLNPLDNTRQETDDPNYNFMYFVKIV 291
Query: 293 PTIYERL----------------------------DGS-------------KLGGGD--- 308
PT Y L DGS L GGD
Sbjct: 292 PTSYLPLGWEKQAAQNKASWDQDHSVGLGVFGQGSDGSMETHQYSVTSHKRSLAGGDDAK 351
Query: 309 ----------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
GG+PG+FFSY++SP+ +V E++KS + + GT VD
Sbjct: 352 EGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFIGFLAGLCAVVGGTLTVAAAVDR 411
Query: 358 LLHSCVKKISKVE 370
L ++ K+
Sbjct: 412 GLFEGTVRLKKLR 424
>gi|302688477|ref|XP_003033918.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
gi|300107613|gb|EFI99015.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
Length = 415
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/412 (31%), Positives = 194/412 (47%), Gaps = 56/412 (13%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F LKG+DAF K ED KT G +T++ I ++ DY +V + VD
Sbjct: 5 FLSHLKGIDAFGKTAEDVKVKTRTGALLTLIAASIILAFTTLEFFDYRKVIIDTSVTVDQ 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KE 121
SRG +L + +++ P + C L++D D SG+ V HN+ K RLD DGK I+ E
Sbjct: 65 SRGERLTVRMNVTFPRVPCYLLSVDVTDISGDVQRDVSHNMLKTRLDKDGKAIRGAHTAE 124
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ N + K+ E + CGSCYG CCNTC EV+ AY + W+
Sbjct: 125 LRNEIDKQN----------EQRGADYCGSCYGGLPPASGCCNTCEEVRTAYVNRGWSFNN 174
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D+I QCKNE +KL+ EGC I G L +N+V+G+ H++PG S+ +V+++ PY
Sbjct: 175 PDSIEQCKNEGWADKLREQANEGCNIAGRLRINKVAGNIHLSPGRSFQTGGRNVYELVPY 234
Query: 242 TSAAFNT---THHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASM 284
N +H I LSF D+ +R+ PLDGTV + M
Sbjct: 235 LRDDGNRHDFSHTIHSLSFEGDDAYDNRKRETSKEMRQRMGLSSNPLDGTVRVTNKAQYM 294
Query: 285 FNYYIKIIPTIYERLDGSKL----------------GG------------GDGGMPGIFF 316
F Y++K++ T + L+G + GG G G+PG F
Sbjct: 295 FQYFVKVVSTKFRPLNGRTVNSHSYSVTHFERDLTDGGQAQTGQNVQVQHGVTGLPGAFI 354
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
++++SP+ + TE +S H T + G L+D++L + K + K
Sbjct: 355 NFDVSPIQLVHTEWRQSFAHFVTSTCAIVGGVLTVASLLDSVLFATSKALKK 406
>gi|115388503|ref|XP_001211757.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195841|gb|EAU37541.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 438
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 141/443 (31%), Positives = 200/443 (45%), Gaps = 82/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +TI L I +L+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARIRTTSGGIITIASLLIILWLVWGEWVDYRRVVVMPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG K+ IHL+I P + C+ L LD +D SGEQ + V H I K RL
Sbjct: 61 DKSRGEKMEIHLNITFPRLPCELLTLDVMDVSGEQQVGVAHGINKVRL--------ASPA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET---ETRKCCNTCNEVKEAYRYKKW 177
E + + + + + + DPN CG C G E ++CCNTC EV+EAY +W
Sbjct: 113 EGGHVLDVQALELHSEQEVAKHLDPNYCGECGGIPQQPGEPKRCCNTCEEVREAYAEHQW 172
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
A + + I QC+ E ++ EGC++ G L VN+V G+FHIAPG S+S ++HVHD
Sbjct: 173 AFGKGENIEQCEREGYAARIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFSSGNIHVHD 232
Query: 238 IQPY------TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGA 282
++ Y S THHI L FG +L D D PLD TV + + A
Sbjct: 233 LENYFELDQPASEKHTMTHHIHQLRFGPQLPDELSDRWQWTDHHHTNPLDDTVQETDLAA 292
Query: 283 SMFNYYIKIIPTIYERL-----------------------------DGS----------- 302
+ Y++K++ T Y L DGS
Sbjct: 293 FNYMYFVKVVSTAYLPLGWDPRVSSYIHSASSHNVPLGRHGIGYGHDGSIETHQYSVTSH 352
Query: 303 --KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
L GG+ G+PG+FF+Y++SP+ V E + K+ T + I
Sbjct: 353 KRPLMGGNAADEGHKERLHAAAGIPGVFFNYDISPMKVINREARPKTFTGFLTGVCAIIG 412
Query: 347 GTYITFMLVDALLHSCVKKISKV 369
GT +D L+ ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGAIRVKKL 435
>gi|218192721|gb|EEC75148.1| hypothetical protein OsI_11348 [Oryza sativa Indica Group]
gi|222624836|gb|EEE58968.1| hypothetical protein OsJ_10656 [Oryza sativa Japonica Group]
Length = 355
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 130/392 (33%), Positives = 201/392 (51%), Gaps = 60/392 (15%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L+ LDA+ K EDF+ +T+ GG +TI L I L ++ Y +T +L V
Sbjct: 1 MDLWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFLSEIRLYLYSATDSKLTV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+SRG +L I+ D+ P + C +A+D +D SGEQH + H+I K+R+D G I E +K
Sbjct: 61 DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDNLGNVI-ESRK 119
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ V A K ++ ++G E CGSCYG+E +CCN+C +V++AYR K WAL
Sbjct: 120 DGVGAPKIERPLQKHGGRLDHNE--VYCGSCYGSEESDDQCCNSCEDVRDAYRKKGWALT 177
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
++ I QCK E ++LK+ EGC I+G++ VN++S
Sbjct: 178 NIEEIDQCKREGFVQRLKDEQGEGCSIHGFVNVNKIS----------------------- 214
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
H I LSFG++ PLDG + + G + M+ Y++K++PTIY
Sbjct: 215 ---------HKINKLSFGVEFPG---VVNPLDGVEWIQEHTNGLTGMYQYFVKVVPTIYT 262
Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ G K+ + PG++F YE SP+ V TE++ SL H T I
Sbjct: 263 DIRGRKINSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 322
Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
+ G + ++D+ + H +KK K+EIG
Sbjct: 323 IVGGIFTVAGIIDSFVYHGHRAIKK--KMEIG 352
>gi|407929248|gb|EKG22082.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
Length = 442
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 145/453 (32%), Positives = 205/453 (45%), Gaps = 92/453 (20%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI + I +LI + ++ QV+ EL V
Sbjct: 1 MPAKSRFMRLDAFTKTVEDARVRTSTGGIVTITSIIMILWLIWGEWAEFRQVTVKPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG K+ IH++I P I C+ L LD +D SGE V H + K RL P E +
Sbjct: 61 DKSRGEKMEIHMNISFPRIPCELLTLDVMDVSGEIQTGVMHGVNKVRL----TPENEGSR 116
Query: 121 EV-VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYK 175
+ VNA+ + DP+ CG CYGA T CCNTC++V++AY
Sbjct: 117 PIEVNALNLHADEASH-------MDPDYCGECYGAPAPTTAKKPGCCNTCDDVRDAYAAI 169
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
W+ D + QC+ E+ EKL EGC++ G + VN+V G+FH APG S+S ++HV
Sbjct: 170 SWSFTRGDGVEQCEREHYGEKLDAQRREGCRVEGGIRVNKVIGNFHFAPGKSFSNGNMHV 229
Query: 236 HDIQPY--TSAAFNTTHHIRHLSFGIKLQDD-----------------DERRKPLDGTVA 276
HD++ Y A + TH + L FG +L DD + PLD T
Sbjct: 230 HDLENYFKDGAPHSFTHQVHSLRFGPQLPDDVIAKLEASGMSASSLWTNHHINPLDNTEQ 289
Query: 277 KAEEGASMFNYYIKIIPTIYERL------------------------------DGS---- 302
+ +E A F Y++K++ T Y L +GS
Sbjct: 290 RTDEKAFNFMYFVKVVSTAYLPLGWENKGSSSLSGLLPDADRAPLGSYGLASGEGSIETH 349
Query: 303 ---------KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWT 339
L GG+ GG+PG+FFSY++SP+ V E ++KS
Sbjct: 350 QYSVTSHKRSLAGGNDEKDGHKERLHARGGIPGVFFSYDISPMKVINRESRAKSFSGFLV 409
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ I GT +D L+ K+ K+ G
Sbjct: 410 GVCAVIGGTLTVAAAIDRALYEGSTKLKKLHQG 442
>gi|358378080|gb|EHK15763.1| hypothetical protein TRIVIDRAFT_86970 [Trichoderma virens Gv29-8]
Length = 420
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 138/425 (32%), Positives = 201/425 (47%), Gaps = 66/425 (15%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV L + +L + DY ++ EL V
Sbjct: 1 MAPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVFFLSWGEWTDYRRIVVHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGEQ V H I K RL + E +
Sbjct: 61 DKGRGERMDIHLNMTFPNMPCELLTLDVMDVSGEQQHGVAHGISKIRLRPAAQGGGEIES 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
+ + +K E P+ CG CYGA E CCNTC+EV+EAY
Sbjct: 121 NTLTQLHEK----------AEHLAPDYCGGCYGATAPANAEKPGCCNTCDEVREAYAQMS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ E+L EGC+I G L+VN+V G+FH+APG S+S ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYAERLDQQREEGCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR------------KPLDGTVAKAEE 280
D++ Y + TH I L FG +L D R PLD T + ++
Sbjct: 231 DLKTYWDFPEGKPHDFTHIIHSLRFGPQLPDTVIERMGGKNTWTNHHLNPLDATHQETKD 290
Query: 281 GASMFNYYIKIIPTIYERL---------DGS-------------KLGGGD---------- 308
+ Y++KI+PT Y L DGS L GGD
Sbjct: 291 PNFNYMYFVKIVPTSYLPLGWEKRTPGYDGSIETHQYSVTSHKRSLMGGDDSQEGHPERL 350
Query: 309 ---GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
G+PG+FFSY++SP+ ++ E++K+ + + + GT VD L
Sbjct: 351 HARNGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAAVDRGLFEGAS 410
Query: 365 KISKV 369
++ K+
Sbjct: 411 RLKKL 415
>gi|322708973|gb|EFZ00550.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Metarhizium anisopliae ARSEF 23]
Length = 429
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 142/434 (32%), Positives = 205/434 (47%), Gaps = 75/434 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTI+ + +L + +Y +V EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVDEARIRTTSGGVVTIISLFVVLFLSWGEWAEYRRVVVRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IHL++ P + C+ L LD +D SGEQ V H + RL +P E Q
Sbjct: 61 DKSRGERMQIHLNMTFPRMPCELLTLDVMDVSGEQQHGVSHGVKNVRL----RP--ESQG 114
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET--ETRK--CCNTCNEVKEAYRYKK 176
V +K KV + + DP+ CG CYGA RK CCNTC+EV+EAY +
Sbjct: 115 GGVIDIKSMKVHDD----PADHLDPSYCGECYGATAPPNARKAGCCNTCDEVREAYASQG 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC E+ E+L EGC++ G+LEVN+V G+FH+APG S+S ++HVH
Sbjct: 171 WAFGRGENVEQCTREHYAERLDEQREEGCRVEGHLEVNKVVGNFHLAPGRSFSNGNMHVH 230
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D++ Y + TH I L FG +L R PLDGT +
Sbjct: 231 DLKNYWETPNGKQHDFTHTIHQLRFGPQLPAAVSDRLGKGSMPWTNHHLNPLDGTRQEIG 290
Query: 280 EGASMFNYYIKIIPTIYERL-----------------DGS-------------KLGGGD- 308
+ A + Y++KI+PT Y L DGS L GG+
Sbjct: 291 DPAFNYMYFVKIVPTSYLPLGWEKRFKNAAGSTYGNADGSLETHQYSVTSHKRSLEGGND 350
Query: 309 ------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
GG+PG+FFSY++SP+ ++ E +K+ + + GT V
Sbjct: 351 AAEGHAERQHSQGGIPGVFFSYDISPMKVINREEPAKTFTGFLAGLCAIVGGTLTVAAAV 410
Query: 356 DALLHSCVKKISKV 369
D L ++ K+
Sbjct: 411 DRGLFEGAARLKKM 424
>gi|358391585|gb|EHK40989.1| ER-derived vesicle Erv46-like protein [Trichoderma atroviride IMI
206040]
Length = 422
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 136/428 (31%), Positives = 202/428 (47%), Gaps = 68/428 (15%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV L + +L + Y ++ EL V
Sbjct: 1 MAPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVLFLSWGEWSSYRRIVVHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V H I K RL +P
Sbjct: 61 DKGRGERMDIHLNITFPNMPCELLTLDVMDVSGEQQHGVAHGITKLRL--------QPPS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
++ + + E +P+ CG CYGA E CCNTC+EV+EAY
Sbjct: 113 RGGGVIESNSLAQLH--EKAEHLNPDYCGGCYGATAPANAEKPGCCNTCDEVREAYAQAS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ +E+L EGC+I G L+VN+V G+FH+APG S+S ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYSERLDQQREEGCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230
Query: 237 DIQ-----PYTSAAFNTTHHIRHLSFGIKLQDD------------DERRKPLDGTVAKAE 279
D++ P A + TH I L FG +L + + PLDG +
Sbjct: 231 DLKNYWDLPNGMKAHDFTHVIHSLRFGPQLPPEVIARMGRRTAWTNHHLNPLDGIHQETS 290
Query: 280 EGASMFNYYIKIIPTIYERL----------DGS-------------KLGGGD-------- 308
+ + Y++KI+PT Y L DGS L GGD
Sbjct: 291 DPNFNYMYFVKIVPTSYLPLGWEQKSASASDGSVETHQYSVTSHKRSLMGGDDAKEGHAE 350
Query: 309 -----GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
GG+PG+FFSY++SP+ ++ E++K+ + + + GT +D L
Sbjct: 351 RLHSKGGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAAIDRGLFEG 410
Query: 363 VKKISKVE 370
++ K+
Sbjct: 411 ATRLKKLR 418
>gi|342874382|gb|EGU76396.1| hypothetical protein FOXB_13074 [Fusarium oxysporum Fo5176]
Length = 439
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 141/445 (31%), Positives = 205/445 (46%), Gaps = 85/445 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV L + +L + +Y ++ EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVLFLSWGEWAEYRRIEIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V H + K RL +P
Sbjct: 61 DKGRGERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------QPAN 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRK--CCNTCNEVKEAYRYKK 176
+ + K + + + + DP+ CG CYGA+ RK CC TC+EV+EAY
Sbjct: 113 QGGAVIDIKSLALHD--ESADHLDPSYCGGCYGAQPPANARKAGCCQTCDEVREAYAQSS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ EKL EGC+I G L VN+V G+FH APG S+S ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYGEKLDAQREEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAE 279
D++ Y + + TH+I L FG +L D+ + + PLD T +
Sbjct: 231 DLKNYWDVPKGKSHDFTHYIHSLRFGPQLPDNIAKKVGTKSSLWTNHHQNPLDNTRQEIH 290
Query: 280 EGASMFNYYIKIIPTIYERL---------------------------DGS---------- 302
+ F Y++KI+PT Y L DGS
Sbjct: 291 DPNFNFMYFVKIVPTSYLPLGWDSKGIKIAGLLQDDNAGLGAYGYSEDGSVETHQYSVTS 350
Query: 303 ---KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNI 345
L GG+ GG+PG+FFSY++SP+ +V EK+K+ + +
Sbjct: 351 HKRSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIV 410
Query: 346 SGTYITFMLVDALLHSCVKKISKVE 370
GT VD L +I K+
Sbjct: 411 GGTLTVAAAVDRGLFEGAARIKKMR 435
>gi|116181584|ref|XP_001220641.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
gi|88185717|gb|EAQ93185.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
Length = 438
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 145/442 (32%), Positives = 196/442 (44%), Gaps = 80/442 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV + + +L + DY +V EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLVVVFFLAWGEWSDYRRVEVHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P I C+ L LD +D SGEQ V+H + K RL PQ
Sbjct: 61 DKGRGERMEIHLNITFPRIPCELLTLDVMDISGEQQHGVQHGVTKTRL--------RPQS 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K V DP+ CG CYGA+ CCNTC EVK+AY
Sbjct: 113 EGGGDIDTKAVALHARDEVATHLDPSYCGPCYGAQPPPNAKKPGCCNTCEEVKDAYAQAA 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E+ +EKL EGC+I G L VN+V G+FHIAPG S+S ++HVH
Sbjct: 173 WAFGRGEGIEQCEREHYSEKLDEQRNEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDD-----DERR---------KPLDGTV----- 275
D++ Y T +H I HL FG +L D+ D R+ PLD T
Sbjct: 233 DLKNYWDTPTKHTFSHQIHHLRFGPQLPDNLHKKLDARKNMRGRSTTFNPLDDTPPGDGT 292
Query: 276 --------------------AKAEEGASMFNYYIKIIPTIYERLDGS------------- 302
A + A + + + DGS
Sbjct: 293 TSTTTTCTSSRSCPHRTCRWAGRKTWAGFREEHHAELGSFGASADGSVETHQYSVTSHKR 352
Query: 303 KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGT 348
L GGD GG+PG+FFSY++SP+ ++ EK+KS + + GT
Sbjct: 353 SLAGGDDSAEGHQERLHARGGIPGVFFSYDISPMKVINREEKAKSFLGFIAGLCAIVGGT 412
Query: 349 YITFMLVDALLHSCVKKISKVE 370
+D L ++ K+
Sbjct: 413 LTVAAAIDRALFEGGVRLKKMR 434
>gi|449299159|gb|EMC95173.1| hypothetical protein BAUCODRAFT_529716 [Baudoinia compniacensis
UAMH 10762]
Length = 435
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 142/441 (32%), Positives = 201/441 (45%), Gaps = 83/441 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VT+ L I YL+ + DY +V+ EL V
Sbjct: 1 MPSKSRFTRLDAFTKTVEDARIRTTSGGIVTLASLLLILYLVWGEWADYRRVTVAPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IH++I P + C+ L LD +D SGE V H + K RL DG+ +
Sbjct: 61 DKGRGEKMEIHMNISFPRVPCELLTLDVMDVSGEVQTGVMHGVNKVRLGEDGREVGREAL 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E+ V++ + + DP CG CYGA CCNTC EV+EAY
Sbjct: 121 ELGKEVEE----------SMKHMDPEYCGECYGAPAPGNAIRAGCCNTCAEVREAYASVS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC+ E+ +E L EGC+I G + VN+V G+FH APG S+S ++HVH
Sbjct: 171 WSFGRGENVEQCEREHYSEHLDEQRREGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVH 230
Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D++ Y + +H I HL FG +L +D RR PLD T K +
Sbjct: 231 DLENYFAGGEGIDHTFSHTIHHLRFGPQLPEDVVRRIGRRGMAWSNHHLNPLDETEQKTD 290
Query: 280 EGASMFNYYIKIIPTIYERLDGSKLG---------------------------------- 305
E A + Y++K++ T Y L + G
Sbjct: 291 EKAYNYMYFVKVVSTAYLPLGWERTGSILDIPHELVELGGYGKGEAGSVETHQYSVTSHK 350
Query: 306 ----GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
GGD GG+PG+FFSY++SP+ V E +SKS + I G
Sbjct: 351 RSLAGGDGGEEGHKERLHARGGIPGVFFSYDISPMKVINREARSKSFSGFLVGVCAVIGG 410
Query: 348 TYITFMLVDALLHSCVKKISK 368
T +D L+ +++ K
Sbjct: 411 TLTVAAAIDRALYEGGQRVKK 431
>gi|119496763|ref|XP_001265155.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
fischeri NRRL 181]
gi|119413317|gb|EAW23258.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
fischeri NRRL 181]
Length = 438
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 138/443 (31%), Positives = 198/443 (44%), Gaps = 82/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +T+ + I YL+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARIRTTSGGIITLASLVVILYLVWGEWLDYRRVVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IH++I P + C+ L LD +D SGEQ + V H + K RL + +
Sbjct: 61 DKSRGERMEIHMNITFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDV 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKK 176
+ ++ K+++ DPN CG C GA+ + CCNTC+EV+EAY K
Sbjct: 121 QALDLHSKEEIAKH--------LDPNYCGDCGGADPLPGSMKEGCCNTCDEVREAYAAKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E ++ EGC++ G L VN+V G+FHIAPG S++ VH H
Sbjct: 173 WAFGKGSNIEQCEREGYAARIDAQRREGCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAH 232
Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
D+Q Y THHI L FG +L D D PLD T + +
Sbjct: 233 DLQNYLDLELPDNEKHTMTHHIHQLRFGPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDP 292
Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
A F Y++K++ T Y L D + LG
Sbjct: 293 AYNFVYFVKVVSTSYLPLGWDPLFSSAAHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSH 352
Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
GGD G+PG+FF+Y++SP+ V E + KS T + I
Sbjct: 353 KRSLRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKSFSGFLTGVCAIIG 412
Query: 347 GTYITFMLVDALLHSCVKKISKV 369
GT +D L+ ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGALRVKKL 435
>gi|212540034|ref|XP_002150172.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
marneffei ATCC 18224]
gi|210067471|gb|EEA21563.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
marneffei ATCC 18224]
Length = 440
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 144/444 (32%), Positives = 204/444 (45%), Gaps = 86/444 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VT+V + I +L+ + DY +V EL V
Sbjct: 1 MPPKSRFTRLDAFAKTVEDARVRTTSGGIVTLVSLVVILWLVWGEWADYRRVVVLPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--LDGKPIQEP 118
D SRG ++ IHL++ P + C+ L LD +D SGEQ + V H + K RL DG + +
Sbjct: 61 DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQMGVVHGLNKVRLSSVADGGRVID- 119
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRY 174
V K ++ ++N DP CG C GA + CCNTC EV+EAY
Sbjct: 120 -------VSKLELHSQNEVAIHL--DPEYCGECGGASPPENAKKPGCCNTCEEVREAYAL 170
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
K WA + + I QC+ E +++ EGC+I G + VN+V G+FHIAPG S+S ++H
Sbjct: 171 KSWAFGKGENIEQCQREGYADRIDAQRREGCRIEGDIRVNKVIGNFHIAPGRSFSSGNMH 230
Query: 235 VHDIQPYTSAAF------NTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAE 279
VHD+ Y +H I L FG +L D+ +R PLD T
Sbjct: 231 VHDLDTYLDRELADYEKHTMSHIIHQLRFGPQLSDEVSQRWQWTDHHHTNPLDSTQQLTN 290
Query: 280 EGASMFNYYIKIIPTIY----------ERLDG---------------------------- 301
E A +NYYIK++ T Y ++L G
Sbjct: 291 EPAYNYNYYIKVVSTSYLPLGWDSARSDQLHGDDQFTPLGLHGAAHGTAGSIETHQYSVT 350
Query: 302 ----SKLGGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCN 344
S GG D GG+PG+FF+Y++SP+ V E ++K+ T +
Sbjct: 351 SHKRSLHGGNDAAEGHQERIHAEGGIPGVFFNYDISPMKVVNREARAKTFTGFLTGVCAV 410
Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
I GT VD L+ ++I K
Sbjct: 411 IGGTLTVAAAVDRFLYEGSRRIRK 434
>gi|408400673|gb|EKJ79750.1| hypothetical protein FPSE_00030 [Fusarium pseudograminearum CS3096]
Length = 439
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 141/445 (31%), Positives = 203/445 (45%), Gaps = 85/445 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV L + +L + DY ++ EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLSWGEWADYRRIDIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L+LD +D SGEQ V H + K RL +P+
Sbjct: 61 DKGRGERMEIHLNITFPKMPCELLSLDVMDVSGEQQHGVMHGVNKVRL--------QPES 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
+ + K ++ + DP+ CG CYGA + CC TC+EV+EAY
Sbjct: 113 QGGAVIDTKSLSLHD--DAAHHLDPSYCGGCYGATPPANAQKAGCCQTCDEVREAYAQAS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ EKL +EGC+I G L VN+V G+FH APG S+S ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYGEKLDAQRSEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230
Query: 237 DIQPYTSAAFNTTH---HIRH-LSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D++ Y +H HI H L FG +L D R+ PLD T +
Sbjct: 231 DLKNYWDVPKGFSHDFTHIVHSLRFGPQLPDHIARKVGHKNTLWTNHHQNPLDDTRQETH 290
Query: 280 EGASMFNYYIKIIPTIYERL---------------------------DGS---------- 302
+ F Y++KI+PT Y L DGS
Sbjct: 291 DPNYNFMYFVKIVPTSYLPLGWDKKGIKIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTS 350
Query: 303 ---KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNI 345
L GG+ GG+PG+FFSY++SP+ +V EK+K+ + +
Sbjct: 351 HRRSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIV 410
Query: 346 SGTYITFMLVDALLHSCVKKISKVE 370
GT VD L ++ K+
Sbjct: 411 GGTLTVAAAVDRGLFEGAARLKKMR 435
>gi|46105482|ref|XP_380545.1| hypothetical protein FG00369.1 [Gibberella zeae PH-1]
Length = 444
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 141/445 (31%), Positives = 203/445 (45%), Gaps = 85/445 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV L + +L + DY ++ EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLSWGEWADYRRIDIHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L+LD +D SGEQ V H + K RL +P+
Sbjct: 61 DKGRGERMEIHLNITFPKMPCELLSLDVMDVSGEQQHGVMHGVNKVRL--------QPES 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
+ + K ++ + DP+ CG CYGA + CC TC+EV+EAY
Sbjct: 113 QGGAVIDTKSLSLHD--DAAHHLDPSYCGGCYGATPPANAQKAGCCQTCDEVREAYAQAS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ EKL +EGC+I G L VN+V G+FH APG S+S ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYGEKLDAQRSEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230
Query: 237 DIQPYTSAAFNTTH---HIRH-LSFGIKLQDDDERR-------------KPLDGTVAKAE 279
D++ Y +H HI H L FG +L D R+ PLD T +
Sbjct: 231 DLKNYWDVPKGFSHDFTHIVHSLRFGPQLPDHIARKVGHKNTLWTNHHQNPLDDTRQETH 290
Query: 280 EGASMFNYYIKIIPTIYERL---------------------------DGS---------- 302
+ F Y++KI+PT Y L DGS
Sbjct: 291 DPNYNFMYFVKIVPTSYLPLGWDKKGIKIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTS 350
Query: 303 ---KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNI 345
L GG+ GG+PG+FFSY++SP+ +V EK+K+ + +
Sbjct: 351 HRRSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIV 410
Query: 346 SGTYITFMLVDALLHSCVKKISKVE 370
GT VD L ++ K+
Sbjct: 411 GGTLTVAAAVDRGLFEGAARLKKMR 435
>gi|70990824|ref|XP_750261.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus fumigatus
Af293]
gi|66847893|gb|EAL88223.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
fumigatus Af293]
gi|159130735|gb|EDP55848.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
fumigatus A1163]
Length = 438
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 138/443 (31%), Positives = 199/443 (44%), Gaps = 82/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +T+ + I YL+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARIRTTSGGIITLASLVVILYLVWGEWLDYRRVVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IH++I P + C+ L LD +D SGEQ + V H + K RL + +
Sbjct: 61 DKSRGERMEIHMNITFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDV 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKK 176
+ ++ K+++ DPN CG C GA+ + CCNTC+EV+EAY K
Sbjct: 121 QALDLHSKEEIAKH--------LDPNYCGDCGGADPLPGSIKEGCCNTCDEVREAYAAKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E ++ EGC++ G L VN+V G+FHIAPG S++ VH H
Sbjct: 173 WAFGKGTNIEQCEREGYAARIDAQRREGCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAH 232
Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
D+Q Y + THHI L FG +L D D PLD T + +
Sbjct: 233 DLQNYLDSELPDNEKHTMTHHIHQLRFGPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDP 292
Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
A F Y++K++ T Y L D + LG
Sbjct: 293 AYNFVYFVKVVSTSYLPLGWDPLFSSAAHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSH 352
Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
GGD G+PG+FF+Y++SP+ V E + KS T + I
Sbjct: 353 KRSLRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKSFSGFLTGVCAIIG 412
Query: 347 GTYITFMLVDALLHSCVKKISKV 369
GT +D L+ ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGALRVKKL 435
>gi|340520521|gb|EGR50757.1| predicted protein [Trichoderma reesei QM6a]
Length = 430
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 138/436 (31%), Positives = 203/436 (46%), Gaps = 76/436 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV L + +L + DY ++ EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVVFLAWGEWTDYRRIVVHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGEQ V H I K RL E +
Sbjct: 61 DKGRGERMDIHLNMTFPNMPCELLTLDVMDVSGEQQHGVAHGITKIRLQPAALGGGEIES 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
+ ++ + +K E DPN CG CYGA + CCNTC+EV+EAY
Sbjct: 121 KSLSQLHEK----------AEHLDPNYCGGCYGAIAPSTAQKPGCCNTCDEVREAYALAS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ E+L EGC+I G L+VN+V G+FH+APG S+S ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYAERLDQQREEGCRIEGLLQVNKVIGNFHLAPGRSFSNGNMHVH 230
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQD------------DDERRKPLDGTVAKAEE 280
D++ Y + + TH I L FG +L D + PLD T ++
Sbjct: 231 DLKNYWDLPEGKSHDFTHIIHSLRFGPQLPDTVIERLGGKNTWSNHHLNPLDNTRQDTKD 290
Query: 281 GASMFNYYIKIIPTIYERL-------------------DGS-------------KLGGGD 308
+ Y++KI+PT Y L DGS L GGD
Sbjct: 291 PNFNYMYFVKIVPTSYLPLGWEKRKPSTTNGGVTTFYSDGSIETHQYSVTSHKRSLMGGD 350
Query: 309 -------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFML 354
G+PG+FFSY++SP+ ++ E++K+ + + + GT
Sbjct: 351 DAKEGHPERLHARNGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAA 410
Query: 355 VDALLHSCVKKISKVE 370
VD L ++ K+
Sbjct: 411 VDRGLFEGATRLKKLR 426
>gi|403417426|emb|CCM04126.1| predicted protein [Fibroporia radiculosa]
Length = 419
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 189/404 (46%), Gaps = 51/404 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+G+DAF K +D KT G +TI+ I ++ DY QV + VD SRG
Sbjct: 10 LRGVDAFGKTTDDVKVKTRTGAFLTILSAAIILAFTMMEFLDYRQVKIDTSVVVDKSRGE 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL + +++ P + C L+LD +D SGE + HNI K RL+ G P+Q K
Sbjct: 70 KLNVRMNVTFPRVPCYLLSLDVMDISGESQADITHNILKTRLNEKGIPLQSLAKSAELRN 129
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+ + G N CGSCYG + CCNTC++V++AY + W+ D+I
Sbjct: 130 DLDKINEQRG--------DNYCGSCYGGQAPPGGCCNTCDQVRQAYIDRGWSFTRPDSIE 181
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC NE +EKLK +EGC I G + VN+V G+ ++PG S+ +++D+ PY
Sbjct: 182 QCTNEGWSEKLKEQASEGCNIAGKVRVNKVIGNIQLSPGRSFRTAAQNMYDLVPYLKEDK 241
Query: 247 NTTHHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASMFNYYIKII 292
N H H + D E+ + PLD T K + MF Y++K++
Sbjct: 242 N-RHDFSHTIHQFAFESDQEKERHRARDFQKRVGIESPLDNTERKTSKQQYMFQYFLKVV 300
Query: 293 PTIYERLD--------------------GSKLGGGDG--------GMPGIFFSYELSPLM 324
T + LD G + +G G+PG+F +Y++SP++
Sbjct: 301 STHFAMLDNKVYKTHQYSATHFERDLTKGQQEDNKEGVHIAHTATGIPGVFINYDISPML 360
Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+ +E +S H T + G L+D++L + + + K
Sbjct: 361 ILHSETRQSFAHFLTSTCAIVGGVLTVASLIDSVLFATTRALKK 404
>gi|392591676|gb|EIW81003.1| ER-derived vesicles protein ERV46 [Coniophora puteana RWD-64-598
SS2]
Length = 419
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/417 (31%), Positives = 198/417 (47%), Gaps = 64/417 (15%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F LKG+DAF K ED KT G +T++ I ++ DY +V T + VD
Sbjct: 5 FLAGLKGIDAFGKTTEDVKVKTRTGAFLTLLSAAIILSFTLMEFVDYRRVYTDTSIVVDR 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-E 121
SRG KL + +++ P + C L++D +D SGE V HN+ K+RLD GK I + +
Sbjct: 65 SRGEKLSVRMNVTFPHVPCYLLSVDVMDISGETQRDVSHNVVKQRLDKTGKGIAGSRSGD 124
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALP 180
+ N + K EL P+ CGSCYG T T CCN+C EV++AY K W+
Sbjct: 125 LRNEIDK----------LAELRGPDYCGSCYGGYTSTDNGCCNSCEEVRQAYVNKGWSFG 174
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+ I QC E T+K+K+ EGC I G + VN+V G+ +I+PG S+ + +D P
Sbjct: 175 NPEGIEQCTQEGWTDKVKDQADEGCNISGRIRVNKVVGNINISPGRSFQTGSRNFYDFVP 234
Query: 241 Y---TSAAFNTTHHIRHLSFGIKLQDDD------------ERR-----KPLDGTVAKAEE 280
Y + TH+I L+F L DD+ ++R PLDG A +
Sbjct: 235 YLKEDGGQHDFTHYIDELTF---LADDEYNPNKMKHGKELKQRMGLDSNPLDGFKASTTK 291
Query: 281 GASMFNYYIKIIPTIYERLDGSK------------------LGGGD-----------GGM 311
M+ Y++K++ T + L+G +GGG+ GG
Sbjct: 292 KMFMYQYFLKVVSTQFRTLNGRTINTHQYSATHFERDLSRGMGGGENNQGVYVQHGAGGA 351
Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
PG +F++E+SP+ V E +S H T + G L+D+ L + + + K
Sbjct: 352 PGAYFNFEISPIQVVHAETRQSFAHFLTSTCAIVGGVLTVAALLDSFLFATSRALKK 408
>gi|330919615|ref|XP_003298687.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
gi|311327999|gb|EFQ93219.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
Length = 437
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 139/443 (31%), Positives = 199/443 (44%), Gaps = 85/443 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI L I +L + DY +V+ EL V
Sbjct: 1 MPAKSRFNKLDAFTKTVEDARVRTTSGGIVTIASLLVIFWLSWGEWADYRRVTVRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ I ++I P + C+ L LD +D SGE + V H I K RL P+
Sbjct: 61 DKSRGERMEIAMNISFPRMPCELLTLDVMDVSGELQMGVTHGINKVRLS--------PEA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
+ A++ K V T P+ CG CYGA + CCNTC+EV++AY
Sbjct: 113 DGSKAIEIKAVDLH--TDEASHLAPDYCGQCYGAPAPSNAKKPTCCNTCDEVRDAYASVS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC+ E+ E L EGC++ G ++VN+V G+FH APG S+S ++HVH
Sbjct: 171 WSFGRGEGVEQCEREHYAEHLDQQRQEGCRLEGNIKVNKVVGNFHFAPGKSFSNGNLHVH 230
Query: 237 DIQPYTSAAFNT--THHIRHLSFGIKLQD------------------DDERRKPLDGTVA 276
D++ Y + THHI L FG +L D + PLD T+
Sbjct: 231 DLENYFKDEYTHTFTHHIHQLRFGPQLSDVVVQNMQKKHQESGIGGWSNHHINPLDETMQ 290
Query: 277 KAEEGASMFNYYIKIIPTIYERLDGSKL-------------------------------- 304
+E A + Y+IK++ T+Y L K+
Sbjct: 291 HTDEKAYNYMYFIKVVTTVYLPLGWEKVFPHPSKFSDILGATIDESYKGSIETHQYSVTS 350
Query: 305 ------GGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
GG D GG+PG+FFSY++SP+ V E + K+ + I
Sbjct: 351 HKRSLQGGNDEKDGHKERIHARGGIPGVFFSYDISPMEVINREVREKTFSGFLVGLCAVI 410
Query: 346 SGTYITFMLVDALLHSCVKKISK 368
GT +D L+ V +I K
Sbjct: 411 GGTLTVAAAIDRALYEGVNRIKK 433
>gi|388581981|gb|EIM22287.1| endoplasmic reticulum-derived transport vesicle ERV46 [Wallemia
sebi CBS 633.66]
Length = 407
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 124/400 (31%), Positives = 193/400 (48%), Gaps = 53/400 (13%)
Query: 9 GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKL 68
G DAF K E+ +T +G +TI+C + IS+L + DY V + VD SR KL
Sbjct: 8 GFDAFAKTLEESRIRTNFGAYLTIICAILISFLTFNEFRDYRAVDFKPRIIVDQSRSEKL 67
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKK 128
++ ++ P + C L+LD +D SGEQ + H I + RL G E ++ +K
Sbjct: 68 QLNFNVTFPRVPCYLLSLDLMDVSGEQVRDLRHAIVRTRLSEKG--------ETIDGMKT 119
Query: 129 KKVTTENGTTTTELEDPNKCGSCYGA-ETETRKCCNTCNEVKEAYRYKKWALPELDTIVQ 187
++ E+ P +CGSCYG KCC TC++V+E+Y + W+ D + Q
Sbjct: 120 AGMSG----YLNEVAKPRECGSCYGGVPPNEEKCCYTCDDVRESYVKQGWSFVNPDGVKQ 175
Query: 188 CKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFN 247
C +E+ E++K +EGC + G ++VN+V G+FHI+PG S+ N H+HD+ PY A N
Sbjct: 176 CLDEHWAERVKEQSSEGCNVAGLVDVNKVVGNFHISPGRSFQSNAHHIHDLVPYLKNANN 235
Query: 248 ---TTHHIRHLSFGIKLQ--DDDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIY 296
H + H SF + D D ++ PL T A E MF Y++K++ T +
Sbjct: 236 HHDFGHILHHFSFKSSNEPADTDNLKEMLNINDPLSNTKAHTEVSNYMFQYFLKVVSTDF 295
Query: 297 ERLDGSKLGG-----------------------------GDGGMPGIFFSYELSPLMVKI 327
+ L+G KL G G PG+FF+Y++SPL V
Sbjct: 296 DFLNGEKLNSHQYSATAYERNLDEKGIYAQDGHGQTILHGVEGFPGVFFNYDISPLRVIY 355
Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
TE +S T + G ++DA + +K++
Sbjct: 356 TESRRSFASFLTSTCAIVGGVLTVASIIDAGVFGARQKLT 395
>gi|261188384|ref|XP_002620607.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis SLH14081]
gi|239593207|gb|EEQ75788.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis SLH14081]
gi|239609349|gb|EEQ86336.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis ER-3]
gi|327354450|gb|EGE83307.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis ATCC 18188]
Length = 435
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 135/440 (30%), Positives = 195/440 (44%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI + I +LI + +Y +V EL V
Sbjct: 1 MPPKSRFARLDAFTKTVEDARIRTRSGGVVTITALIIIFFLIWGEWSEYRRVVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V H + K RL P +
Sbjct: 61 DKGRGERMEIHLNVTFPNLPCELLTLDVMDISGEYQTEVVHGVNKLRL--------SPAE 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + + + T + DPN CGSCYGA + CCNTC+EV+EAY K+
Sbjct: 113 EGGQVLDITALQLHSKTDNAKDLDPNYCGSCYGAPAPPNAQKPGCCNTCDEVREAYAAKR 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC+ E + L EGC++ G + VN+V G+FHIAPG S++ ++H H
Sbjct: 173 WSFGRGENVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVIGNFHIAPGRSFTNGNMHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T N H I +L FG +L D+ RR PLD T F
Sbjct: 233 DLNNYYNTPIPHNVGHKIHYLRFGPQLPDEVSRRWKWTDHHHTNPLDNTEQHTTNPRLNF 292
Query: 286 NYYIKIIPTIYERL----------------------DGSKLGGG---------------- 307
Y++K++ T Y L G LG G
Sbjct: 293 AYFVKVVATSYLPLGWDDDWSSTVHSKVSNNVPLGKQGVSLGSGGSIETHQYSVTSHKRS 352
Query: 308 -----------------DGGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
GG+PG+F +Y++SP+ V E ++K+ T + I GT
Sbjct: 353 VDGGNDAEEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
+D L+ ++ K+
Sbjct: 413 TVAAAIDRALYEGSVRVKKL 432
>gi|407034208|gb|EKE37117.1| hypothetical protein ENU1_208770 [Entamoeba nuttalli P19]
Length = 361
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 191/378 (50%), Gaps = 32/378 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K D + K ED + +GG +TI+C + I L + Y Q +L VD R S
Sbjct: 1 MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+P+H DI P SC ++D + SGE + +E N+ K R+ DG + E + + + +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIDIEQNVTKIRIHHDGSLVTENEMKAIQS- 119
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+TE DP +C SCYGAET +KCC TC++VKEAY+ K W L +L+ +
Sbjct: 120 ----------KLSTETHDPKECRSCYGAETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVS 168
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC+N + K T EGC++ G +N++ G+FHIAPG S + H H+++
Sbjct: 169 QCQNHEKIQMAKLTKDEGCRLIGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQI 228
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL-- 304
+ +H LSFG E K T K + SMF YY+ IIP ++G+
Sbjct: 229 DLSHKWNELSFG-------ENSKKFT-TEKKDTQMNSMFQYYLTIIPIKNNFINGTSTFY 280
Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
G G PG+F Y++SP+++++TE + H I + G + TF L
Sbjct: 281 DYSIQENTRSGKGEGQPGVFVYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLF 340
Query: 356 DALLHSCVKKI-SKVEIG 372
DA++ + + KVE+G
Sbjct: 341 DAIVFESIHTLKKKVELG 358
>gi|343425773|emb|CBQ69306.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 435
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/419 (30%), Positives = 204/419 (48%), Gaps = 62/419 (14%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+G+DAF+K +D +T G +T+V L I L + DY V L VD SRG
Sbjct: 9 QLRGIDAFSKTMDDVRIRTNAGALITLVSVLLIVVLTIGEFVDYRTVHLKPALEVDRSRG 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL ++++I P + C L+LD +D SGE ++H+I + R+ DGK +++ +K +
Sbjct: 69 EKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRISHDGKVVEQGKKHLKGD 128
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ T + + CG CYG + CCNTC+EV+EAY + W+ + D +
Sbjct: 129 AARIANT----------KGKDYCGDCYGGQPPASGCCNTCDEVREAYVRRGWSFADPDHV 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E ++K+K EGC+I G L VN+V GSFH++PG ++ N +H+HD+ PY S
Sbjct: 179 DQCVAEGWSDKIKQQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGT 238
Query: 246 FNTTHHIRHL----SFGIK-----LQDDDER--------RKPLDGTVAKAEEGASMFNYY 288
H H+ SFG + L ER + PL G A+ ++ MF Y+
Sbjct: 239 GAEHHDFGHIIHEFSFGSEQEYHGLTTAKERAVKAKLGVKDPLAGVRAQTQQSQFMFQYF 298
Query: 289 IKIIP------------------TIYER-----------------LDGSKLGGGDGGMPG 313
+K++ T YER G+ + G G+PG
Sbjct: 299 VKVVATEFRPLAGETLKTQQYSVTTYERDLSPGASAAALAGMSNEGSGAHISHGFAGVPG 358
Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+FF+YE+SPL E +SL H T + G ++D+L+++ +++ + G
Sbjct: 359 VFFNYEISPLKTIHAEYRQSLAHFLTSTCAIVGGILTVAGILDSLVYNSRRRLGLRDAG 417
>gi|451849936|gb|EMD63239.1| hypothetical protein COCSADRAFT_38106 [Cochliobolus sativus ND90Pr]
Length = 437
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 143/444 (32%), Positives = 202/444 (45%), Gaps = 85/444 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV L I +L + DY +V+ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVEDARVRTTSGGIVTIVSLLVIFWLTWGEWADYRRVTVRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ I L+I P + C+ + LD +D SGE + V H I K RL P++
Sbjct: 61 DKGRGERMEIALNISFPRVPCELITLDVMDVSGELQMGVTHGINKVRLS--------PER 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E ++ K + + + L P+ CG C+GA CCNTC+EV++AY
Sbjct: 113 EGSKTIEIKALDL-HADEASHLA-PDYCGECFGAPPPANAKKPGCCNTCDEVRDAYASIS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC+ E+ E L EGC++ G + VN+V G+FHIAPG S+S ++HVH
Sbjct: 171 WSFGRGEGVEQCEREHYAEHLDEQRQEGCRLEGSIRVNKVVGNFHIAPGKSFSNGNMHVH 230
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD------DDERR------------KPLDGTVA 276
D++ Y A TH I L FG +L D D+ R PLD T
Sbjct: 231 DLENYFKDEYAHTFTHKIHQLRFGPQLSDVVIQGIQDKHRGSGPGSWSNHHINPLDNTEQ 290
Query: 277 KAEEGASMFNYYIKIIPTIY---------------ERLDGSKL----------------- 304
+E A F Y+IK++ T Y + L GS +
Sbjct: 291 HTDEKAFNFMYFIKVVSTAYLPLGWEDAAPRLTKHDELLGSTIDATHKGSIETHQYSVTS 350
Query: 305 ------GGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
GG D GG+PG+FFSY++SP+ V E + K+ + I
Sbjct: 351 HKRNLKGGNDEKDGHKERVHARGGIPGVFFSYDISPMKVINREVREKTFSGFLVGLCAVI 410
Query: 346 SGTYITFMLVDALLHSCVKKISKV 369
GT VD L+ V +I K+
Sbjct: 411 GGTLTVAAAVDRALYEGVNRIKKI 434
>gi|317025332|ref|XP_001388859.2| COPII-coated vesicle membrane protein Erv46 [Aspergillus niger CBS
513.88]
gi|350638031|gb|EHA26387.1| hypothetical protein ASPNIDRAFT_196625 [Aspergillus niger ATCC
1015]
Length = 438
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 138/443 (31%), Positives = 198/443 (44%), Gaps = 82/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +TI L I +L+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWADYRRVVVMPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG K+ IHL++ P + C+ L LD +D SGEQ V H I K RL
Sbjct: 61 DKSRGEKMEIHLNVTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRL--------TSAA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K + + + + DP+ CG CYGA CCNTC+EV+EAY ++
Sbjct: 113 EGGRVIDVKALELHSKDESAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQ 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC+ E E++ EGC++ G L VN+V G+FHIAPG S++ ++HVH
Sbjct: 173 WAFGKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVH 232
Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
D+ + A TH I L FG +L D D PLDGT + E
Sbjct: 233 DLANFFDADLPDAEKHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDGTKQETNEP 292
Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
+ Y++K++ T Y L D + LG
Sbjct: 293 GYNYMYFVKVVSTSYLPLGWDPLFSSSIHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSH 352
Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
GGD G+PG+F +Y++SP+ V E + K+ T + I
Sbjct: 353 KRSLMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTGVCAIIG 412
Query: 347 GTYITFMLVDALLHSCVKKISKV 369
GT +D L+ V ++ K+
Sbjct: 413 GTLTVAAALDRGLYEGVSRMKKL 435
>gi|12060847|gb|AAG48265.1|AF308298_1 serologically defined breast cancer antigen NY-BR-84, partial [Homo
sapiens]
Length = 239
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 147/229 (64%), Gaps = 6/229 (2%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 15 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 74
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ + +
Sbjct: 75 DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 132
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K +VT + + DP++C SCYGAE E KCCNTC +V+EAYR + WA DTI
Sbjct: 133 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 188
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVH
Sbjct: 189 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH 237
>gi|440636941|gb|ELR06860.1| hypothetical protein GMDG_08151 [Geomyces destructans 20631-21]
Length = 441
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 138/448 (30%), Positives = 203/448 (45%), Gaps = 89/448 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTI L + YL + DY ++ EL V
Sbjct: 1 MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIASLLIVIYLAFGEWADYRRIVVHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL---DLDGKPIQE 117
D SRG K+ I ++I P + C+ L LD +D SGE V+H + K RL D G
Sbjct: 61 DKSRGEKMEIWMNITFPYVPCELLTLDVMDVSGEMQTGVKHGVSKVRLNSPDAGG----- 115
Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYR 173
A+ K + + DP+ CG CYGA + CCNTC+EV++AY
Sbjct: 116 ------GAIDVKALDLHSTEEKAAHLDPSYCGQCYGATPPPNAQKAGCCNTCDEVRDAYA 169
Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
WA + + QC+ E+ +E+L EGC+I G + VN+V G+FHIAPG SYS ++
Sbjct: 170 SASWAFGRGENVEQCEREHYSERLDEQRKEGCRIEGGVRVNKVIGNFHIAPGRSYSNGNM 229
Query: 234 HVHDIQ-----PYTSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTV 275
HVHD+ P + H I H+ FG +L + ++ PLDGT
Sbjct: 230 HVHDLANYWDTPSLERGHSFAHTIHHVRFGPQLPEGLSKKFGGKNQPWTNHHLNPLDGTQ 289
Query: 276 AKAEEGASMFNYYIKIIPTIY--------------------------ERLDGS------- 302
+ A + Y++K++ T Y +DGS
Sbjct: 290 QHTRDPAFNYMYFVKVVSTSYLPLGWNSKSAAKTQISEENIGLGAYGHAVDGSVETHQYS 349
Query: 303 ------KLGGGDG-------------GMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
L GGD G+PG+FFSY++SP+ ++ E++K+L T +
Sbjct: 350 VTSHKRSLSGGDDGAEGHKERLHSRTGIPGVFFSYDISPMKVINREERTKTLSGFITGLC 409
Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
+ GT VD L+ V +I K++
Sbjct: 410 AIVGGTLTVAAAVDRGLYEGVSRIKKLQ 437
>gi|189203047|ref|XP_001937859.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984958|gb|EDU50446.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 437
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 143/445 (32%), Positives = 201/445 (45%), Gaps = 89/445 (20%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI L I +L + DY +V+ EL V
Sbjct: 1 MPVKSRFNKLDAFTKTVEDARVRTTSGGIVTIASLLVIFWLSWGEWADYRRVTVRPELMV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL--DLDGKPIQEP 118
D RG ++ I +++ P I C+ L LD +D SGE + V H I K RL + DG + E
Sbjct: 61 DKGRGERMEIAMNVSFPRIPCELLTLDVMDVSGELQMGVTHGINKVRLSPEADGSKVIET 120
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRY 174
+ ++A + + P+ CG CYGA T CCNTC+EV++AY
Sbjct: 121 KALDLHADEASHLA------------PDYCGQCYGAPPPTNAKKPNCCNTCDEVRDAYAS 168
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
W+ + + QC+ E+ E L EGC++ G ++VN+V G+FH APG S+S ++H
Sbjct: 169 ISWSFGRGEGVEQCEREHYAEHLDQQRQEGCRLEGSIKVNKVVGNFHFAPGKSFSNGNLH 228
Query: 235 VHDIQPY--TSAAFNTTHHIRHLSFGIKLQD---DDERRK---------------PLDGT 274
VHD++ Y A TH I L FG +L D D ++K PLD T
Sbjct: 229 VHDLENYFKDDYAHTFTHRIHQLRFGPQLSDVVVRDMQKKHLDSGHNGWSNHHVNPLDNT 288
Query: 275 VAKAEEGASMFNYYIKIIPTIY------------------------ERLDGS-------- 302
V +E A + Y+IK++ T Y E GS
Sbjct: 289 VQHTDEKAYNYMYFIKVVSTAYLPLGWEQEFPHPSKYSDILGTTIDESYKGSIETHQYSV 348
Query: 303 -----KLGGG-------------DGGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMC 343
L GG GG+PG+FFSY++SP+ V E + KS +
Sbjct: 349 TSHKRSLQGGTDEKDGHKERIHARGGIPGVFFSYDISPMKVVNREVREKSFSGFLVGLCA 408
Query: 344 NISGTYITFMLVDALLHSCVKKISK 368
I GT +D L+ V +I K
Sbjct: 409 VIGGTLTVAAAIDRALYEGVNRIKK 433
>gi|167376738|ref|XP_001734125.1| endoplasmic reticulum-golgi intermediate compartment protein
[Entamoeba dispar SAW760]
gi|165904489|gb|EDR29705.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba dispar SAW760]
Length = 361
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/378 (33%), Positives = 192/378 (50%), Gaps = 32/378 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K D + K ED + +GG +TI+C + I L + Y Q +L VD R S
Sbjct: 1 MKRFDTYGKLPEDLRTRHCFGGFLTIICVVIIIILSIAEFTFYLQREVVPQLLVDRDRSS 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+P+H DI P SC ++D + SGE + +E N+ K R+ DG + E + + + +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIDIEQNVTKIRIHHDGSLVTESEMKAIQS- 119
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+TE DP +C SCYGAET +KCC TC++VKEAY+ K W L +L+ +
Sbjct: 120 ----------KLSTETHDPKECRSCYGAETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVS 168
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC+N + + T EGC++ G +N++ G+FHIAPG S H H+++
Sbjct: 169 QCQNHEKIQMARLTKDEGCRVIGDFLLNKIGGNFHIAPGSSEQSWGRHSHNLEWTGKTQI 228
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP----------TIY 296
+ +H LSFG E K T K + SMF YY+ IIP T Y
Sbjct: 229 DLSHKWNELSFG-------EHSKKFT-TEKKDTQMNSMFQYYLTIIPIKNNFINGTSTFY 280
Query: 297 ERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
+ + G+G G PG+F Y++SP+++++TE + H I + G + TF L
Sbjct: 281 DYSIQENIRSGEGEGSPGVFVYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLF 340
Query: 356 DALLHSCVKKIS-KVEIG 372
DA++ + + KVE+G
Sbjct: 341 DAIVFESIHSLEKKVELG 358
>gi|389744843|gb|EIM86025.1| ER-derived vesicles protein ERV46 [Stereum hirsutum FP-91666 SS1]
Length = 419
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/411 (29%), Positives = 195/411 (47%), Gaps = 55/411 (13%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F LKG+DAF K ED KT G +T++ I ++ DY +V+ + VD
Sbjct: 5 FFGALKGVDAFGKTMEDVKVKTRTGAFLTLMAAAIILTFTTMEFFDYRRVTMDTSVEVDR 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
SRG KL + +++ P + C L+LD +D SGE + HNI K RL+ DG + +
Sbjct: 65 SRGEKLTVRMNVTFPRVPCYLLSLDVMDISGETQRDISHNIVKTRLNSDGTQVPNSANMQ 124
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ N + K ++G CGSCYG CCNTC++V+EAY + W+
Sbjct: 125 LRNELDKLNAQRQDGY----------CGSCYGGTPPEGGCCNTCDQVREAYVQRGWSFGN 174
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D+I QC E+ +EKL +EGC I G + VN+V G+ H++PG S+ + ++++ PY
Sbjct: 175 PDSIEQCVQEHWSEKLHEQSSEGCNISGRVRVNKVIGNIHLSPGKSFQNSASSIYELVPY 234
Query: 242 TSAAFNT---THHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASM 284
N +H + L+FG + D + K PLDG A+ + ++M
Sbjct: 235 LKDDKNRHDFSHIVHSLTFGADDEYDSRKTKIANEMKQRMGLDSNPLDGYHARTSQPSTM 294
Query: 285 FNYYIKIIPTIYERLDGSKLGG---------------------------GDGGMPGIFFS 317
F Y++K + T + +DG + G G+PG FF+
Sbjct: 295 FQYFLKAVSTQFRTIDGKVVNTHQYQVTHYNRDAGNPQDKTNQGVNVMHGITGVPGAFFN 354
Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
YE+SP+ V E +S H T + G ++D++L + +++ K
Sbjct: 355 YEISPIKVIHEETRQSFAHFLTSTCAIVGGVLTVTSILDSVLFAANQRLKK 405
>gi|384483831|gb|EIE76011.1| hypothetical protein RO3G_00715 [Rhizopus delemar RA 99-880]
Length = 408
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 121/388 (31%), Positives = 194/388 (50%), Gaps = 36/388 (9%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F +R + LDA+ K +DF +T GGAVTI+ L I L+ + Y E+ VD
Sbjct: 26 FIKRFRKLDAYAKTLDDFRVRTATGGAVTIISGLCILILVLFETVQYLTPIMKPEILVDG 85
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
KLPI DI P + C L+LD +D SGE + +H++YK RLD P EV
Sbjct: 86 GNMEKLPIKFDITFPHLPCYMLSLDIMDESGEHISNYDHDVYKERLD--------PNGEV 137
Query: 123 VNAVKKKKVTTENGTTTTE--LEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ A K ++ E + P+ CGSCYGA+ + +CCNTC E++ AY W +
Sbjct: 138 ITAEKSNDLSNSQAKNAREHSMNVPDDYCGSCYGAKG-SNECCNTCEEIQNAYSELGWNV 196
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+ D QC E EK+++ EGC+++G L VN++ G+FH + G ++ + H+HD+
Sbjct: 197 -DPDNFEQCIREGWKEKIESQSREGCRMHGTLLVNKIRGNFHFSAGKAFKQSGSHIHDMS 255
Query: 240 PY--TSAAFNTTHHIRHLSFGIKLQDDDERRK--------PLDGTVAKAEEGASMFNYYI 289
+ N H I+HL FG + +++++ PL+ + E A M+ Y++
Sbjct: 256 TFLHNDKNQNFMHTIQHLQFGNHDYNSEKQKRTKSRELIHPLENIKSGNSETAIMYQYFL 315
Query: 290 KIIPTIYERLDGSKLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGH 336
KI+PT + L+G ++ GG+PG+FF + SP+ + +E SL
Sbjct: 316 KIVPTEFNFLNGKRIRTFQYSVSKQDHIVSYLGGLPGVFFMLDHSPMRIIYSETKTSLAS 375
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVK 364
T + I G + ++D + +K
Sbjct: 376 YLTSLCAIIGGIFTVASVIDGSIQHMLK 403
>gi|400602673|gb|EJP70275.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Beauveria bassiana ARSEF 2860]
Length = 423
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 138/428 (32%), Positives = 201/428 (46%), Gaps = 69/428 (16%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV L + +L+ + DY ++ EL V
Sbjct: 1 MAAKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLVWGEWADYRTIAIRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V H ++K RL P+
Sbjct: 61 DQGRGERMDIHLNITFPRMPCELLTLDVMDVSGEQQHGVAHGVHKVRL--------RPEA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKK 176
E + + N E DP+ CG C GA + CCNTC E++EAY
Sbjct: 113 EGGGVIDVSSLDLHN--DAAEHLDPSYCGDCGGAPAPSNVKKAGCCNTCEEIREAYAQVS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + QC+ E+ E+L+ EGC+I G L+VN+V G+FH+APG S+S ++HVH
Sbjct: 171 WAFGDGKAFEQCEREHYAERLEEQRHEGCRIDGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230
Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQD-------------DDERRKPLDGTVAKAE 279
D++ Y + TH+I HL FG +L + + PLD T +
Sbjct: 231 DLKNYWETTDDKKHDFTHYIHHLRFGPQLPEAVVKKMGKGATPWTNHHANPLDNTKQLTD 290
Query: 280 EGASMFNYYIKIIPTIYERL-----------DGS-------------KLGGGD------- 308
+ F Y++KI+PT + L DGS L GGD
Sbjct: 291 DPNYNFMYFVKIVPTSFLPLGWEKMSRAMNTDGSVETHQYSVTSHKRSLTGGDDAAEGHA 350
Query: 309 ------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
GG+PG+FFSY++SP+ ++ E+ KS + + GT VD L
Sbjct: 351 ERLHSRGGIPGVFFSYDISPMKVINREEQGKSFLGFIAGLCAVVGGTLTVAAAVDRGLFE 410
Query: 362 CVKKISKV 369
++ K+
Sbjct: 411 GTTRLKKI 418
>gi|398398231|ref|XP_003852573.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
gi|339472454|gb|EGP87549.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
Length = 435
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 138/440 (31%), Positives = 198/440 (45%), Gaps = 80/440 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF+K ED +T GG VT+ L I +L + DY +++ E+ V
Sbjct: 1 MPVKSRFTKLDAFSKTVEDARIRTTSGGFVTVFSMLLIIWLAWGEWSDYRRITIQPEIIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D +RG K+ IHL++ P I C+ L LD +D SG+ V H I K RL +P+
Sbjct: 61 DKARGEKMEIHLNVTFPRIPCELLTLDVMDVSGDVQTGVLHGIVKTRL--------KPES 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K ++ + + CG CYGA CCNTC EV+EAY
Sbjct: 113 EGGGDIDKGRLQVNEVEEAAKHLARDYCGDCYGAPPPANAIKSGCCNTCAEVREAYASVS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC E+ +E L EGC++ G + VN+V G+FH APG S+S ++HVH
Sbjct: 173 WSFGRGENVEQCTREHYSEHLDEQRKEGCRVDGVIRVNKVVGNFHFAPGKSFSNGNMHVH 232
Query: 237 DIQPYTSAAFNTT--HHIRHLSFGIKLQD-------DDERR-------KPLDGTVAKAEE 280
D++ Y + + T H I HL FG L + D ER PLDG + E
Sbjct: 233 DLENYLTGGGDHTPSHIIHHLRFGPLLPESYKHRVRDTERHWSNNHHLSPLDGFRQETNE 292
Query: 281 GASMFNYYIKIIPTIYERLD------------------------GSK------------- 303
A + Y++K++PT Y L GS
Sbjct: 293 KAYNYMYFVKVVPTAYLPLGYENLPSVGDYPHEHAHVGEYGISHGSSIETHQYSVTSHKR 352
Query: 304 -LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGT 348
LGGGD GG+PG+FFSY++SP+ V E ++KS I + GT
Sbjct: 353 HLGGGDANDEGHKERLHARGGIPGVFFSYDISPMKVIDREVRAKSFSSFLVGICGVLGGT 412
Query: 349 YITFMLVDALLHSCVKKISK 368
VD + +++ K
Sbjct: 413 LTVAAAVDRIWFEGTQRVKK 432
>gi|58264656|ref|XP_569484.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|134109945|ref|XP_776358.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259032|gb|EAL21711.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57225716|gb|AAW42177.1| ER to Golgi transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 422
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/412 (30%), Positives = 191/412 (46%), Gaps = 63/412 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+G DAF K ED KT G +T + I + ++ DY ++ + VD SRG
Sbjct: 10 FQGFDAFGKTMEDVKIKTRTGALLTFISLSIILTSVMLEFIDYRRIHMEPSIIVDRSRGE 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNA 125
KL I DI P + C L+LD +D SGE EH + K R++ DG I + Q ++
Sbjct: 70 KLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRMNKDGNVISKVQGGQLKGD 129
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
V++ + +DPN CGSCYGA CCN+C EV++AY K W+ + + I
Sbjct: 130 VERANLN----------QDPNYCGSCYGALPPESGCCNSCEEVRQAYGRKGWSFSDPEGI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +K+K EGC+I G++ VN+V G+ H +PG S+ N + + ++ PY
Sbjct: 180 EQCVEEGWMDKMKEQNEEGCRIDGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR-- 237
Query: 246 FNTTHH-----IRHLSFGIKLQDDDE---------------RRKPLDGTVAKAEEGASMF 285
+ HH + FG + +E R PL G A E MF
Sbjct: 238 -DKNHHDFGHIVHKFRFGADMTKAEELTVLPKEQRWRDKLGLRDPLQGIKAHTEVSNYMF 296
Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDG-------------GMPGIFF 316
Y++K++ T + L G ++ G G G+PG+FF
Sbjct: 297 QYFLKVVSTNFISLSGEEISSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFF 356
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+YE+SP+ V TE+ +S H T + G LVD+L+ + K++ K
Sbjct: 357 NYEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLVDSLIFNSSKRLKK 408
>gi|67479189|ref|XP_654976.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472072|gb|EAL49587.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 361
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 192/378 (50%), Gaps = 32/378 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K D + K ED + +GG +TI+C + I L + Y Q +L VD R S
Sbjct: 1 MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+P+H DI P SC ++D + SGE + +E N+ K R+ DG + E + + + +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIGIEQNVTKIRIHHDGSLVTENEMKAIQS- 119
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ E DP +C SCYGAET +KCC TC++VKEAY+ + W L +L+ +
Sbjct: 120 ----------KLSIETPDPKECRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVS 168
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC+N + K T EGC++ G +N++ G+FHIAPG S + H H+++
Sbjct: 169 QCQNHEKIQMAKLTKDEGCRLIGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQI 228
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP----------TIY 296
+ +H LSFG E K T K + SMF YY+ IIP T Y
Sbjct: 229 DLSHKWNELSFG-------ENSKKFT-TEKKDTQMNSMFQYYLTIIPIKNNFINGTSTFY 280
Query: 297 ERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
+ + G+G G PG+F Y++SP+++++TE + H I + G + TF L
Sbjct: 281 DYSIQENIRSGEGEGQPGVFIYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLF 340
Query: 356 DALLHSCVKKI-SKVEIG 372
DA++ + + KVE+G
Sbjct: 341 DAIVFESIHTLKKKVELG 358
>gi|358372047|dbj|GAA88652.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus kawachii
IFO 4308]
Length = 438
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 138/443 (31%), Positives = 197/443 (44%), Gaps = 82/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +TI L I +L+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWVDYRRVVVMPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG K+ IHL+I P + C+ L LD +D SGEQ V H I K RL
Sbjct: 61 DKSRGEKMEIHLNITFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLT--------SAA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + K + + + + DP+ CG CYGA CCNTC+EV+EAY ++
Sbjct: 113 EGGRVIDVKALELHSKDESAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQ 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC+ E E++ EGC++ G L VN+V G+FHIAPG S++ ++HVH
Sbjct: 173 WAFGKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVH 232
Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
D+ + A TH I L FG +L D D PLD T + E
Sbjct: 233 DLATFFDAELPESERHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDNTKQETNEP 292
Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
+ Y++K++ T Y L D + LG
Sbjct: 293 GYNYMYFVKVVSTSYLPLGWDPLFSSSIHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSH 352
Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
GGD G+PG+F +Y++SP+ V E + K+ T + I
Sbjct: 353 KRSLMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTGVCAIIG 412
Query: 347 GTYITFMLVDALLHSCVKKISKV 369
GT +D L+ V ++ K+
Sbjct: 413 GTLTVAAALDRGLYEGVSRMKKL 435
>gi|213409826|ref|XP_002175683.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
yFS275]
gi|212003730|gb|EEB09390.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
yFS275]
Length = 394
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 124/405 (30%), Positives = 201/405 (49%), Gaps = 49/405 (12%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+F +L+ DAFTK ED KT GG ++I+ + + ++ ++ +Y ++ E+ V
Sbjct: 1 MLFRAQLRRFDAFTKTVEDAKIKTAGGGLISIISAVIVFVIVFLEWKNYQRIVVQPEIVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SR ++ I+ +I P + C Y+ +D +D SG+ V+H++ K RLD G I
Sbjct: 61 DPSRNERMEINFNITFPHVPCHYMGVDVMDISGDFQQDVQHSVTKTRLDKYGNIIAVIDS 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
++ +A + + + T CG CYGA ET CCN C V++AY K+
Sbjct: 121 DIGSATDESAMDKDGEVT---------CGDCYGAGDAAPPETPGCCNNCKAVRDAYARKQ 171
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA+ + D QC++E + + EGC I G+L VNRV+G+FH APG S+ H+H
Sbjct: 172 WAIGDYDAFQQCRDENYKAEHASQKGEGCNIAGHLFVNRVAGNFHFAPGRSFQTQQGHLH 231
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII-- 292
D++ Y A + TH I LSFG ++ E PLDG ++ + Y+IK +
Sbjct: 232 DLRGYEEEQEAHDMTHMIHQLSFGPPIKPSAEHTDPLDGHFKNTDDALHNYAYFIKCVAH 291
Query: 293 ---------PTI---------YERLDGSKLGGGD----------GGMPGIFFSYELSPLM 324
PTI +ER S GG + GG+PG+FF+ ++SP++
Sbjct: 292 KFVPLDPADPTINTNEFSVTQHER---SVTGGRENDNPSHLNRRGGIPGVFFNIDISPML 348
Query: 325 VKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
V + + + G + ++ + G LVD L++ K+ K
Sbjct: 349 VIQRQIRGNTFGGFISNVLSFLGGFITLTTLVDRGLYAAELKMKK 393
>gi|303322923|ref|XP_003071453.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
delta SOWgp]
gi|240111155|gb|EER29308.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
delta SOWgp]
gi|320033474|gb|EFW15422.1| COPII-coated vesicle membrane protein Erv46 [Coccidioides posadasii
str. Silveira]
Length = 435
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 139/440 (31%), Positives = 195/440 (44%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTIV + + L+ + DY +V EL V
Sbjct: 1 MPVKSRFTRLDAFAKTVEDARIRTRSGGVVTIVSLIVVILLVWGEWRDYRRVVVLPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C L LD +D SGEQ V H + K RL +
Sbjct: 61 DKGRGERMEIHLNITFPHLPCQLLTLDVMDVSGEQQSGVIHGVNKVRLSAASEGGHALDV 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E V+ KK + DP CGSCY + CCNTC+EV+EAY +
Sbjct: 121 ETVDLDKKDQAPLH--------LDPGYCGSCYDGIPPPNAKKPGCCNTCDEVREAYALRN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E K+ + EGC++ G L VN+V G+FH+APG S++ ++H H
Sbjct: 173 WAFGRGEGVEQCEQEGYGSKIDSQRNEGCRLEGILRVNKVVGNFHVAPGRSFTNGYMHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
D++ Y T +H I L FG +L D D PLD T E+ F
Sbjct: 233 DLKTYYETPVKHTMSHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKFNF 292
Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
Y++K++ T Y L G +LG
Sbjct: 293 MYFVKVVSTSYLPLGWDASLSSEVHSRLSSDAPLGKQGIQLGQYGSIETHQYSVTSHKRS 352
Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
GGD GG+PG+FF+Y++SP+ V E ++KSL T + I GT
Sbjct: 353 IEGGDDSAEGHKERVHTAGGIPGVFFNYDISPMKVINREARTKSLSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD L+ ++ K+
Sbjct: 413 TVAAAVDRALYEGSVRVKKL 432
>gi|169770949|ref|XP_001819944.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus oryzae
RIB40]
gi|238486566|ref|XP_002374521.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
flavus NRRL3357]
gi|83767803|dbj|BAE57942.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699400|gb|EED55739.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
flavus NRRL3357]
gi|391874294|gb|EIT83200.1| COPII vesicle protein [Aspergillus oryzae 3.042]
Length = 436
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 139/441 (31%), Positives = 198/441 (44%), Gaps = 80/441 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +TI L I +L+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARIRTTSGGIITIASLLAILWLVWGEWVDYRRVVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG K+ IHL++ P + C+ L LD +D SGEQ V H I K RL
Sbjct: 61 DKSRGEKMEIHLNMTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLS--------SPA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETE--TRKCCNTCNEVKEAYRYKKWA 178
E + + K + + + DPN CG C G ++CCNTC EV+EAY ++WA
Sbjct: 113 EGGHVIDVKALELHSEQEAAKHLDPNYCGDCGGVPQPGGEKRCCNTCEEVREAYAQQQWA 172
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ + I QC+ E ++L EGC++ G L VN+V G+FHIAPG S++ +VHVHD+
Sbjct: 173 FGKGENIEQCEREGYAQRLDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNVHVHDL 232
Query: 239 QPY------TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGAS 283
+ Y + TH I L FG +L D D PLD T + + A
Sbjct: 233 ENYFEGDLPDAEKHTMTHIIHQLRFGPQLPDELSDRWQWTDHHHTNPLDSTQQETSDPAY 292
Query: 284 MFNYYIKIIPTIYERL--------------DGSKLG------------------------ 305
F Y++K++ T Y L + S LG
Sbjct: 293 NFMYFVKVVSTSYLPLGWDPLFSSAVHSAYEDSPLGSHGIAYGSQSSIETHQYSVTSHKR 352
Query: 306 ---GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGT 348
GGD G+PG+FF+Y++SP+ V E + K+ T + I GT
Sbjct: 353 SLRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINKEARPKTFTGFLTGVCAIIGGT 412
Query: 349 YITFMLVDALLHSCVKKISKV 369
+D L+ ++ K+
Sbjct: 413 LTVAAALDRGLYEGALRVKKL 433
>gi|301106576|ref|XP_002902371.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262098991|gb|EEY57043.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 393
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 127/376 (33%), Positives = 192/376 (51%), Gaps = 40/376 (10%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK +D + K + +F +T +G V+IV +F++ L ++ Y+ V+T E + VDS+ G
Sbjct: 30 LKKVDVYPKMHREFKVQTEFGATVSIVAGIFMAILFLSELSTYWTVNTHEHMVVDSTLGE 89
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL ++LD+ ++C ++A+D +GE +++ + K RLD +G+ I
Sbjct: 90 KLQVNLDVSFLAVNCRDAHINAMDVAGELQVNMHQTVVKTRLDANGRSIS---------- 139
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALPELDTI 185
TT + T+L CGSCYG ++CCNTC EVKEA+ + +L E +
Sbjct: 140 -----TTADELAKTDLP-AGYCGSCYGTRHPAGKECCNTCEEVKEAFIHSDLSLEEAEQK 193
Query: 186 VQCKNE-YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
QC E TEKL EGC+ G + VNRV+G+FH+A G ++ VH +P
Sbjct: 194 EQCVRESIDTEKLAQD-GEGCRFTGKMFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEH 252
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
FN++H I LSFG + PLDG AE+ +F YYIKI+PTIY +D S +
Sbjct: 253 TFNSSHIIHSLSFGEPIPGAT---SPLDGVSKIAEQSGGVFQYYIKIVPTIYSDIDESAI 309
Query: 305 ----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
G +PG FF ++LSP MVK+ H TKI C I G
Sbjct: 310 HSYQFSVTQQSNYLNPRGQMTSLPGTFFVFDLSPFMVKVENDRVPFTHFLTKI-CAIVGG 368
Query: 349 YITFM-LVDALLHSCV 363
I+ VD+ +++ +
Sbjct: 369 VISIAGFVDSFMYNSL 384
>gi|452001785|gb|EMD94244.1| hypothetical protein COCHEDRAFT_1202021 [Cochliobolus
heterostrophus C5]
Length = 437
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 142/444 (31%), Positives = 200/444 (45%), Gaps = 85/444 (19%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV L I +L + DY +V+ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLLVIFWLTWGEWADYRRVTVRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ I L+I P + C+ + LD +D SGE + V H I K RL P+K
Sbjct: 61 DKGRGERMEIALNISFPRVPCELITLDVMDVSGELQMGVTHGINKVRLG--------PEK 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E ++ K + + + L P+ CG C+GA CCNTC+EV++AY
Sbjct: 113 EGSKTIEIKALDL-HADEASHLA-PDYCGECFGAPPPANAKKPGCCNTCDEVRDAYASIS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W+ + + QC+ E+ E L EGC++ G + VN+V G+FHIAPG S+S ++HVH
Sbjct: 171 WSFGRGEGVEQCEREHYAEHLDEQRQEGCRLEGSIRVNKVVGNFHIAPGKSFSNGNMHVH 230
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD------------------DDERRKPLDGTVA 276
D++ Y A TH I L FG +L D + PLD T
Sbjct: 231 DLENYFKDEYAHTFTHKIHQLRFGPQLSDVVIQGIQDKHKGSGPGSWSNHHINPLDNTEQ 290
Query: 277 KAEEGASMFNYYIKIIPTIY---------------ERLDGSKL----------------- 304
+E A F Y+IK++ T Y + L GS +
Sbjct: 291 HTDEKAFNFMYFIKVVSTAYLPLGWEDAAPRLTKHDELLGSTIDASHKGSIETHQYSVTS 350
Query: 305 ------GGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
GG D GG+PG+FFSY++SP+ V E + K+ + I
Sbjct: 351 HKRNLKGGNDEKDGHKERIHARGGIPGVFFSYDISPMKVINREVREKTFSGFLVGLCAVI 410
Query: 346 SGTYITFMLVDALLHSCVKKISKV 369
GT VD L+ V +I K+
Sbjct: 411 GGTLTVAAAVDRALYEGVNRIKKI 434
>gi|121702771|ref|XP_001269650.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
clavatus NRRL 1]
gi|119397793|gb|EAW08224.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
clavatus NRRL 1]
Length = 438
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 139/443 (31%), Positives = 196/443 (44%), Gaps = 82/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTI + I YL+ + DY +V EL V
Sbjct: 1 MPAKSRFTRLDAFAKTVEDARVRTTSGGIVTIASLIVILYLVWGEWVDYRRVVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IH++I P + C+ + LD +D SGEQ + V H + K RL
Sbjct: 61 DKSRGERMEIHMNITFPRLPCELVTLDVMDVSGEQQVGVAHGVNKVRL--------SSPA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + + + + + + DPN CG C GA+ CCNTC+EV+EAY K
Sbjct: 113 EGGHVLDIRSLDLHSKDEVAKHLDPNYCGDCGGADPLPGAIKPGCCNTCDEVREAYAAKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E T ++ EGC++ G L VN+V G+FHIAPG S++ ++HVH
Sbjct: 173 WAFGKGANIEQCEREGYTARIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTNGNIHVH 232
Query: 237 DIQPY------TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
D Q Y A H I L FG +L D D PLD T + +
Sbjct: 233 DTQAYFDLDLPDDAKHTMEHEIHQLRFGPQLPDELSARWQWTDHHHTNPLDNTHQETNDP 292
Query: 282 ASMFNYYIKIIPTIYERL----------------------------DGS----------- 302
A F Y++K++ T Y L GS
Sbjct: 293 AYNFVYFVKVVSTSYLPLGWDPLFSSALHSTYEKAPLGAHGIGYGASGSIETHQYSVTSH 352
Query: 303 --KLGGGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
L GGD G+PG+FF+Y++SP+ V E + K+L T + I
Sbjct: 353 KRSLRGGDAEDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKTLSSFLTGVCAIIG 412
Query: 347 GTYITFMLVDALLHSCVKKISKV 369
GT +D L+ ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGALRVKKL 435
>gi|297602842|ref|NP_001052965.2| Os04g0455900 [Oryza sativa Japonica Group]
gi|255675519|dbj|BAF14879.2| Os04g0455900 [Oryza sativa Japonica Group]
Length = 253
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 99/243 (40%), Positives = 148/243 (60%), Gaps = 3/243 (1%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ LDA+ K EDF+ +T+ GG +T+ + + L ++ Y T L VD+SRG
Sbjct: 7 KLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRLYLHAVTETTLRVDTSRG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I+ D+ P + C ++LDA+D SG++HL V+H+I+K+R+D+ G I Q + V
Sbjct: 67 ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQ-DAVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K ++ +G E CGSCYGAE +CCN+C +V+EAYR K W + D I
Sbjct: 126 MKVEQPLQRHGGRLEHNE--TYCGSCYGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QCK E + +K+ EGC IYG+LEVN+V+G+FH APG S+ +VHVHD+ P+ +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDS 243
Query: 246 FNT 248
FN
Sbjct: 244 FNV 246
>gi|51214107|emb|CAH17876.1| hypothetical protein (22C8.0001), conserved [Pneumocystis carinii]
Length = 388
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 132/398 (33%), Positives = 190/398 (47%), Gaps = 48/398 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF+K E+ KT+ GG +TI+ + I LI + DY Q+ EL +D SRG
Sbjct: 7 FRRFDAFSKTIENAQIKTINGGFITILSIIVIFVLIYFEWRDYRQIVILPELTIDRSRGE 66
Query: 67 KLPIHLDIVVPTISCD---YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
KL I+L++ P I C L+LD +D SGE V HN+ K RLD +G I +
Sbjct: 67 KLQINLNLTFPKIPCSRLLVLSLDVMDVSGELETDVSHNVVKNRLDSNGIFINSTSLNTL 126
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
N + K P+ CGSCYGA+ CCNTC +V +AY W +P+
Sbjct: 127 NFQQPAKT-----------RPPDYCGSCYGAK---EGCCNTCQQVIDAYASNNWPVPDTK 172
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT- 242
QCK +Y+ N F EGC G +EVN+V G+FH APG S I H+HDI Y
Sbjct: 173 AFEQCKEKYNN---LNEFDEGCNFVGRIEVNKVVGNFHFAPGHSSQIMRNHIHDIYDYMT 229
Query: 243 -SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
S+ + +H I LSFG +++ + PLD + + ++Y+IK + +E L
Sbjct: 230 DSSPHDFSHTINKLSFGPEVE-GRSLQNPLDNVKKETDNPTLRYSYFIKCVAYRFEYLSK 288
Query: 302 SKLG-------------GGDG------------GMPGIFFSYELSPLMVKITEKSKSLGH 336
L GD G+PG+FFSY++SP+ + E +
Sbjct: 289 PSLDTNKYSVTVHERSISGDSDPNYPTHISPKDGIPGVFFSYDISPIKIIERETRGNFST 348
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
T + ISG +VD +L+ ++I K GK
Sbjct: 349 FLTSTVIIISGVLTIAGIVDRILYETERQIEKKLREGK 386
>gi|393233667|gb|EJD41236.1| endoplasmic reticulum-derived transport vesicle ERV46 [Auricularia
delicata TFB-10046 SS5]
Length = 419
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/413 (29%), Positives = 185/413 (44%), Gaps = 57/413 (13%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
+FS LKG+DAF K +D KT G +T++ I ++ DY +++ + VD
Sbjct: 5 IFST-LKGVDAFGKTMDDVKVKTRTGALLTLISIAIIFTFTTIEFVDYRRINHDTSMVVD 63
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
SRG KL ++L++ P I C L+LD +D SGE+ V HNI K R+D + + I +
Sbjct: 64 KSRGEKLTVNLNVTFPKIPCYLLSLDVMDISGERQADVTHNILKTRIDANRQRIADQTTT 123
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ +KV G N CGSCYG CC TC V++AY + WA +
Sbjct: 124 YDLQNEAEKVVAARG--------ANYCGSCYGGLEPEGGCCQTCEAVRQAYINRGWAFSD 175
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D I QCK E EK++ EGC + G + VN+V GS + G S+ +N + +HD+ PY
Sbjct: 176 PDAIEQCKQEGWKEKIQAQMNEGCNVEGRVRVNKVVGSIQFSFGRSFQMNQMSLHDLVPY 235
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDE------------------RRKPLDGTVAKAEEGAS 283
H RH DDE PLDG E
Sbjct: 236 LRD--ENVHDWRHRVQHFYFSSDDEFNIYKAGISSSMKQRLGIAANPLDGNYGHTESTEY 293
Query: 284 MFNYYIKIIPTIYERL----------------------------DGSKLGGGDGGMPGIF 315
MF Y++K++ T + + DG + G G+PG+F
Sbjct: 294 MFQYFLKVVSTQFRTIGGEVINTHQYSATHFDRDLAEGVRGKTEDGVVVTHGVQGLPGVF 353
Query: 316 FSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
F++E+SP+ + +E +S H T + G +VD+LL + + + K
Sbjct: 354 FNFEISPMRIIHSETRQSFAHFITSTCAIVGGVLTIASIVDSLLFTTQQALKK 406
>gi|302414546|ref|XP_003005105.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
gi|261356174|gb|EEY18602.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
Length = 349
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 127/376 (33%), Positives = 182/376 (48%), Gaps = 39/376 (10%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG +TIV + +L + DY +++ EL V
Sbjct: 1 MAGKSRFTKLDAFTKTVDEARIRTSSGGIITIVSLFIVFWLAWGEWADYRRITLHPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG K+ IHL++ P + C+ L LD +D SGEQ + I K RL QK
Sbjct: 61 DKGRGEKMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGIVSGISKVRL--------RSQK 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----RKCCNTCNEVKEAYRYKK 176
+ + K ++ P+ CG CYGA+ + CCNTC EV+EAY
Sbjct: 113 DGGGVIDTKALSLHAADEAATHLAPDYCGDCYGAKAPANAVKQGCCNTCEEVREAYAQAS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + + QC E+ E+L EGC+I G L VN+V G+FH+APG S+S ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAERLDEQRAEGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVH 232
Query: 237 DIQPYTSAAF--NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
D++ Y A + TH I L F + DE + L G AE A
Sbjct: 233 DLKNYWDAEIIHDFTHQIHALRFVLS----DEPQAQLSGGDDSAEGHA------------ 276
Query: 295 IYERLDGSKLGGGDGGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFM 353
ERL GG+PG+FFSY++SP+ ++ E+SKS T + I GT
Sbjct: 277 --ERLHTR------GGIPGVFFSYDISPMKVINREERSKSFTGFLTGLCAVIGGTLTVAA 328
Query: 354 LVDALLHSCVKKISKV 369
VD + ++ K+
Sbjct: 329 AVDRGMFEGSLRLKKI 344
>gi|74267709|gb|AAI02327.1| ERGIC and golgi 3 [Bos taurus]
Length = 231
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/230 (44%), Positives = 147/230 (63%), Gaps = 8/230 (3%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAE E KCCN+C +V+EAYR + WA DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
I QC+ E ++K++ EGCQ+YG+LEVN+V+G+FH APG S+ +HVH
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH 228
>gi|258565913|ref|XP_002583701.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907402|gb|EEP81803.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 435
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 136/440 (30%), Positives = 198/440 (45%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTIV + + L+ + DY +V EL V
Sbjct: 1 MAAKSRFTRLDAFAKTVEDARIRTRSGGVVTIVALIAVILLVWGEWKDYRRVVVLSELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ + H I K RL +
Sbjct: 61 DKGRGERMEIHLNITFPHLPCELLTLDVMDVSGEQQSGLIHGIKKVRLGPASEGGHVLDA 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
+ ++ KK +V DP CGSCY + + CCNTC+EV+EAY +
Sbjct: 121 QTLDLHKKDEVAVH--------LDPEYCGSCYDGVPPPNAQKQGCCNTCDEVREAYASRG 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E ++ EGC++ G L VN+V G+FHIAPG S++ ++H H
Sbjct: 173 WAFGRGEGVAQCEREGYGARIDAQRHEGCRLEGILRVNKVIGNFHIAPGRSFTNGYMHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
D++ Y T H I L FG +L D D PLD T E+ F
Sbjct: 233 DLKIYHETPVKHTMAHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKYNF 292
Query: 286 NYYIKIIPT--------------IYERL--------DGSKLG------------------ 305
Y++K++ T ++ RL G +LG
Sbjct: 293 MYFVKVVSTSYLPLGWDASLSSEVHSRLASDAPLGKQGIQLGRHGSIETHQYSVTSHKRS 352
Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
GGD GG+PG+FF+Y++SP+ V E ++KS T + I GT
Sbjct: 353 VEGGDDSAEGHKERIHTAGGIPGVFFNYDISPMKVINREARTKSFSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
+D +L+ ++ K+
Sbjct: 413 TVAAAIDRMLYEGAVRVKKL 432
>gi|295672798|ref|XP_002796945.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226282317|gb|EEH37883.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 435
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 137/440 (31%), Positives = 192/440 (43%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV IS+LI + +Y ++ EL V
Sbjct: 1 MAPKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGE V H I K RL P+
Sbjct: 61 DKGRGERMEIHLNITFPHLPCELLTLDVMDVSGEMQSGVIHGISKVRL--------APES 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + + + T + DP+ CG CYGA CC+TC EV+EAY +
Sbjct: 113 EGGHVIDTTALVLHTQTDAAKHLDPDYCGPCYGAPPPPHATKPGCCSTCEEVREAYASQS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E ++ L EGC+I G L VN+V G+FHIAPG S+S ++H H
Sbjct: 173 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVIGNFHIAPGRSFSNGNLHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T H I L FG +L D+ R PLD T + F
Sbjct: 233 DLDTYYHTPVPHYMAHKIHQLRFGPQLPDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 292
Query: 286 NYYIKIIPTIYERLDGS------------------------------------------K 303
Y++K++ T Y L S
Sbjct: 293 MYFVKVVSTSYLPLGWSPEFSSSVHETTLRDTPLGKQGVHFGSSGSIETHQYSVTSHKRS 352
Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
+ GGD GG+PG+F +Y++SP+ V E ++K+ T + I GT
Sbjct: 353 IDGGDDAAEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD L+ ++ K+
Sbjct: 413 TVAAAVDRALYEGAVRVKKL 432
>gi|119189667|ref|XP_001245440.1| hypothetical protein CIMG_04881 [Coccidioides immitis RS]
gi|392868334|gb|EAS34105.2| COPII-coated vesicle membrane protein Erv46 [Coccidioides immitis
RS]
Length = 435
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 137/440 (31%), Positives = 195/440 (44%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTIV + + L+ + DY +V EL V
Sbjct: 1 MPVKSRFTRLDAFAKTVEDARIRTRSGGVVTIVSLIVVILLVWGEWKDYRRVVVLPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C L LD +D SGEQ V H + K RL +
Sbjct: 61 DKGRGERMEIHLNITFPHLPCQLLTLDVMDVSGEQQSGVIHGVNKVRLSAASEGGHALDV 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E ++ K+ + DP CGSCY + CCNTC+EV+EAY +
Sbjct: 121 ETLDLDKRDQAPLH--------LDPAYCGSCYDGIPPPNAKKPGCCNTCDEVREAYALRN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E K+ + EGC++ G L VN+V G+FH+APG S++ ++H H
Sbjct: 173 WAFGRGEGVEQCEQEGYGSKIDSQRNEGCRLEGILRVNKVVGNFHVAPGRSFTNGYMHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
D++ Y T +H I L FG +L D D PLD T E+ F
Sbjct: 233 DLKTYYETPVKHTMSHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKFNF 292
Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
Y++K++ T Y L G +LG
Sbjct: 293 MYFVKVVSTSYLPLGWDASLSSEVHSRLSSDAPLGKQGIQLGQYGSIETHQYSVTSHKRS 352
Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
GGD GG+PG+FF+Y++SP+ V E ++KSL T + I GT
Sbjct: 353 IEGGDDSAEGHKERVHTAGGIPGVFFNYDISPMKVINREARTKSLSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD L+ ++ K+
Sbjct: 413 TVAAAVDRALYEGSVRVKKL 432
>gi|425772976|gb|EKV11354.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
digitatum PHI26]
gi|425782132|gb|EKV20058.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
digitatum Pd1]
Length = 438
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 136/443 (30%), Positives = 200/443 (45%), Gaps = 82/443 (18%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG +TI L + +L+ + DY +V EL V
Sbjct: 1 MSAKSRFTRLDAFAKTVEDARIRTKSGGVITIASLLIVMWLVWGEWADYRRVVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D SRG ++ IHL++ P + C+ L LD +D SGEQ + V H + K RL P+
Sbjct: 61 DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLS--------PRN 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKK 176
E + + + + + + DP CG C GA CC TC EV++AY K+
Sbjct: 113 EGGKVIDVQALDLHSPSEAAKHLDPEYCGECGGATPPPNVIKPGCCTTCEEVRQAYAEKQ 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC E E+L EGC+I G L+VN+V G+FHIAPG S++ ++HVH
Sbjct: 173 WAFGDGSNIEQCTREGYAERLAEQRREGCRIEGVLKVNKVIGNFHIAPGRSFTTGNMHVH 232
Query: 237 DIQPYTS-----AAFNTTHHIRH-LSFGIKLQ---------DDDERRKPLDGTVAKAEEG 281
D+ Y A +T H+ H L FG +L D PLD T + +E
Sbjct: 233 DLDTYIDPNAGPAEQHTMSHLVHELRFGPQLPAELAGRWGWTDHHHTNPLDDTKQETDEP 292
Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
A F Y++K++ T Y L D + LG
Sbjct: 293 AYNFLYFVKVVSTSYLPLGWDPQFSTAIHNAYDKAPLGYHGLAYGTQGSIEAHQYSVTSH 352
Query: 306 -----GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
GG+ GG+PG+FF+Y++SP+ V E + K+ + T + I
Sbjct: 353 KRPLSGGNDAAEGHKERVHAGGGIPGVFFNYDISPMKVVNREARPKTFTNFLTGVCAIIG 412
Query: 347 GTYITFMLVDALLHSCVKKISKV 369
GT +D ++ ++ K+
Sbjct: 413 GTLTVAAALDRGVYEGAMRVKKL 435
>gi|348680250|gb|EGZ20066.1| CopII vesicle protein [Phytophthora sojae]
Length = 409
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 123/376 (32%), Positives = 195/376 (51%), Gaps = 32/376 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK +D + K + +F +T +G V+IV + ++ L ++ Y+ ++T E + VDSS G
Sbjct: 31 LKKVDVYPKMHREFKVQTEFGATVSIVAGIVMAILFLSELSAYWSLNTHEHMVVDSSLGE 90
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL ++LD+ ++C ++A+D +GE +++ + K RLD DG I P ++ +
Sbjct: 91 KLQVNLDVSFLAVNCRDAHINAMDVAGELQVNMHQTVVKTRLDADGNTIGRP----ISMI 146
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + T E CGSC+GA+ ++CCNTC +VKEA+ Y ++L + +
Sbjct: 147 TDEGAEEQAKTALPE----GYCGSCHGAQHPAGKECCNTCEDVKEAFIYSDFSLEDAEQK 202
Query: 186 VQCKNE-YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
QC E EKL EGC+ G + VNRV+G+FH+A G ++ VH +P
Sbjct: 203 EQCVREIMEAEKLAQD-GEGCRFTGKMFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEH 261
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD---- 300
+N++H I LSFG + PLDG AE+ +F YYIKI+PTIY +D
Sbjct: 262 TYNSSHIIHSLSFGEPMPG---VAGPLDGVSKIAEQSGGVFQYYIKIVPTIYSDIDENTI 318
Query: 301 ----------GSKLG--GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
G+ L G +PG FF ++LSP MVK+ H TK+ C I G
Sbjct: 319 HSYQFSVTQQGNYLNPRGQMTSLPGTFFVFDLSPFMVKVENDRMPFTHFLTKV-CAIVGG 377
Query: 349 YITFM-LVDALLHSCV 363
I+ VD+ +++ +
Sbjct: 378 VISIAGFVDSFMYNSL 393
>gi|426196003|gb|EKV45932.1| hypothetical protein AGABI2DRAFT_207344 [Agaricus bisporus var.
bisporus H97]
Length = 1000
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 121/399 (30%), Positives = 184/399 (46%), Gaps = 61/399 (15%)
Query: 18 EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
ED KT G +T + I ++ DY +V T + VD SRG KL ++L+I P
Sbjct: 602 EDVKVKTRTGAFLTFIAAAIILSFTTLEFLDYRRVYTDTSIVVDKSRGEKLTVNLNITFP 661
Query: 78 TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
+ C L+LD +D SGE + HNI K RL+ +G +V A ++ E
Sbjct: 662 RVPCFLLSLDVMDISGEVQRDISHNILKTRLENNGT--------IVPASYSAQLQNE-LD 712
Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
E++ CGSCYG CCNTC+EV++AY + W+ D I QCK E +EK+
Sbjct: 713 KMNEVQQSGYCGSCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKM 772
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT--SAAFNTTHHIRHL 255
K+ EGC + G L VN+V G+ H++PG S+ N +++++ PY + +H I H
Sbjct: 773 KDQADEGCNVSGRLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHF 832
Query: 256 SFGIKLQDDDE------------------RRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
+F + DDE PLDG + + MF Y++K++ T +
Sbjct: 833 AF----EGDDEYVYWKASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFR 888
Query: 298 RLDGS----------------------------KLGGGDGGMPGIFFSYELSPLMVKITE 329
LDG + G G+PG FF+YE+SP++V +
Sbjct: 889 TLDGKIVNTHQYSVTHFERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHAD 948
Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+S H T + G LVD+LL + + + K
Sbjct: 949 SRQSFAHFLTSTCAIVGGVLTVASLVDSLLFATTRALKK 987
>gi|50548631|ref|XP_501785.1| YALI0C13112p [Yarrowia lipolytica]
gi|49647652|emb|CAG82095.1| YALI0C13112p [Yarrowia lipolytica CLIB122]
Length = 401
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 134/400 (33%), Positives = 195/400 (48%), Gaps = 45/400 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF KP D KT GG VT++ L I L + Y ++ VD RG
Sbjct: 4 KLFRYDAFAKPTADATIKTASGGIVTLLAILLIVVLTISEYWAYTTPVMRSQMTVDRYRG 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+L IHL+I P + C + LD +DSSGE V+H++ K LD G + +
Sbjct: 64 DRLDIHLNITFPQLPCSLVTLDIIDSSGEVQQSVDHDMTKVTLDERGNILSSEALTLGEN 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K V T L+DPN CGSCYGAE+E +CCNTC +V+ AY K WA + +
Sbjct: 124 PDSKAVAKR-----TFLDDPNYCGSCYGAESEPDQCCNTCEQVRAAYATKGWAFTDGSGV 178
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS-- 243
QC+ E+LK + +GC I G V +V+G+FH APG+S + H+HD+ +
Sbjct: 179 EQCEVIGFKEQLKAQYNQGCNIAGKFTVQKVAGNFHFAPGVSSHRDEQHLHDLSHFKDPE 238
Query: 244 AAFNTTHHIRHLSFGIKLQ----DDDE----RRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
A F +H I LSFG ++ D D+ PL+ T + FNY+ K++ T
Sbjct: 239 APFTFSHIIHDLSFGEQVDVSGLDWDKGVAMETSPLENTPHHTDNKWFRFNYFTKVVSTR 298
Query: 296 YERLDGSKL---------------GGGD----------GGMPGIFFSYELSPL-MVKITE 329
+E LDG K+ GG D GG+PG+FFSY++SP+ +V E
Sbjct: 299 FEFLDGKKIETNQYAATAHERPLQGGRDEDHQNTRHMRGGLPGVFFSYDISPMRIVNKQE 358
Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
G +++ I G + V A+L + ++ +V
Sbjct: 359 YRSHFGAFVMQVVATIGGV----LTVAAVLDRGIYEVDQV 394
>gi|409079094|gb|EKM79456.1| hypothetical protein AGABI1DRAFT_120853 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1000
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 121/399 (30%), Positives = 184/399 (46%), Gaps = 61/399 (15%)
Query: 18 EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
ED KT G +T + I ++ DY +V T + VD SRG KL ++L+I P
Sbjct: 602 EDVKVKTRTGAFLTFIAAAIILSFTTLEFLDYRRVYTDTSIVVDKSRGEKLTVNLNITFP 661
Query: 78 TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
+ C L+LD +D SGE + HNI K RL+ +G +V A ++ E
Sbjct: 662 RVPCFLLSLDVMDISGEVQRDISHNILKTRLENNGT--------IVPASYSAQLQNE-LD 712
Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
E++ CGSCYG CCNTC+EV++AY + W+ D I QCK E +EK+
Sbjct: 713 KMNEVQQSGYCGSCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKM 772
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT--SAAFNTTHHIRHL 255
K+ EGC + G L VN+V G+ H++PG S+ N +++++ PY + +H I H
Sbjct: 773 KDQADEGCNVSGRLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHF 832
Query: 256 SFGIKLQDDDE------------------RRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
+F + DDE PLDG + + MF Y++K++ T +
Sbjct: 833 AF----EGDDEYVYWKASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFR 888
Query: 298 RLDGS----------------------------KLGGGDGGMPGIFFSYELSPLMVKITE 329
LDG + G G+PG FF+YE+SP++V +
Sbjct: 889 TLDGKIVNTHQYSVTHFERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHAD 948
Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+S H T + G LVD+LL + + + K
Sbjct: 949 SRQSFAHFLTSTCAIVGGVLTVASLVDSLLFATTRALKK 987
>gi|392566201|gb|EIW59377.1| endoplasmic reticulum-derived transport vesicle ERV46 [Trametes
versicolor FP-101664 SS1]
Length = 423
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 127/412 (30%), Positives = 193/412 (46%), Gaps = 56/412 (13%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F LKG+DAF K ED KT G +T++ I+ ++ DY +V+ + VD
Sbjct: 5 FLSALKGVDAFGKTMEDVKVKTRTGALLTLIAAAIITSFTTIEFFDYRRVNVDTSIVVDR 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KE 121
SRG KL +++++ P + C L+LD +D SGE + HNI K R+D G P+ E
Sbjct: 65 SRGEKLTVNMNVTFPRVPCYLLSLDVMDISGETQSDITHNILKTRMDERGFPVPTTVITE 124
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ N + K ++ E G E E CCNTC +V++AY + W+
Sbjct: 125 LQNDLDK---------INSQREGGYCGSCYGGVEPEG-GCCNTCEDVRQAYVNRGWSFNR 174
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D+I QC E +EKLK TEGC I G + VN+V G+ H++PG S+ + ++++ PY
Sbjct: 175 PDSIEQCVQEGWSEKLKEQATEGCNIAGRVRVNKVVGNIHLSPGRSFRTSSHSLYELVPY 234
Query: 242 TSAAFNT---THHIRHLSF---------GIKLQDDDERR-----KPLDGTVAKAEEGASM 284
N TH I HL+F KL + ++R PLDGT + + M
Sbjct: 235 LKTDGNRHDFTHTIHHLAFEGDDEWDLAKAKLGKELKQRLGIAANPLDGTTGRTIKQQYM 294
Query: 285 FNYYIKIIPTIYERLDGSKL----------------------------GGGDGGMPGIFF 316
F Y++K++ T + L G + G+GG+PG FF
Sbjct: 295 FQYFLKVVATQFRTLSGKTINTHQYSATHFERDLDKGSQENTPTGVHVAHGNGGIPGAFF 354
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+YE+SPL + E +S H T + G L+D+ L + K + K
Sbjct: 355 NYEISPLRIVHAETRQSFAHFLTSTCAIVGGVLTVASLIDSALFATRKALKK 406
>gi|346324387|gb|EGX93984.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Cordyceps militaris CM01]
Length = 423
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 137/429 (31%), Positives = 195/429 (45%), Gaps = 69/429 (16%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTIV + + +L + Y V EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLVVVLFLAWGEWASYRTVVIRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGEQ V H ++K RL P+
Sbjct: 61 DQGRGERMDIHLNITFPRMPCELLTLDVMDVSGEQQHGVAHGVHKVRL--------RPEG 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----RKCCNTCNEVKEAYRYKK 176
E + + N E DP+ CG C GA T CCNTC E++EAY
Sbjct: 113 EGGGVIDVSSLNLHN--DAAEHLDPSYCGDCGGAPAPTTVTKAGCCNTCEEIREAYAQVS 170
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + QC+ E+ E+L+ EGC+I G L+VN+V G+FH+APG S+S ++HVH
Sbjct: 171 WAFGDGKAFEQCEREHYAERLEEQRHEGCRIDGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230
Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQD-------------DDERRKPLDGTVAKAE 279
D++ Y + THHI HL FG +L + + PLD T
Sbjct: 231 DLKNYWETTDDKKHDFTHHIHHLRFGPQLPETVVQKLGKGATPWTNHHGNPLDSTKQLTN 290
Query: 280 EGASMFNYYIKIIPTIYERLDGSKLG------------------------GGD------- 308
+ F Y++KI+PT + L K+ GGD
Sbjct: 291 DPNFNFMYFVKIVPTSFLPLGWEKMARTMNVDASVETHQYSVTSHKRSLTGGDDSAEGHA 350
Query: 309 ------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
GG+PG+FFSY++SP+ ++ EK KS + + GT VD L
Sbjct: 351 ERLHSRGGIPGVFFSYDISPMKVINREEKGKSFLGFVAGLCAVVGGTLTVAAAVDRGLFE 410
Query: 362 CVKKISKVE 370
++ K+
Sbjct: 411 GTTRLKKIR 419
>gi|405123077|gb|AFR97842.1| COPII-coated vesicle component Erv46 [Cryptococcus neoformans var.
grubii H99]
Length = 422
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 186/403 (46%), Gaps = 63/403 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+G DAF K ED KT G +T + I + ++ DY ++ + VD SRG
Sbjct: 10 FQGFDAFGKTMEDVKIKTRTGALLTFISLSIILTSVMLEFIDYRRIHLEPSIIVDRSRGE 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KEVVNA 125
KL I DI P + C L+LD +D SGE EH + K R++ DG I + Q ++
Sbjct: 70 KLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRMNKDGNVISKVQGSQLKGD 129
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
V++ + +DPN CGSCYGA CCN+C EV++AY K W+ + + I
Sbjct: 130 VERANLN----------QDPNYCGSCYGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +K+K EGC+I G++ VN+V G+ H +PG S+ N + + ++ PY
Sbjct: 180 EQCVEEGWMDKMKEQNEEGCRIDGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR-- 237
Query: 246 FNTTHH-----IRHLSFGIKLQDDDE---------------RRKPLDGTVAKAEEGASMF 285
+ HH + FG + +E R PL G A E MF
Sbjct: 238 -DKNHHDFGHIVHKFRFGGDMTKAEELTVLPKEQRWRDKLGLRDPLQGMKAHTEVSNYMF 296
Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDG-------------GMPGIFF 316
Y++K++ T + L+G ++ G G G+PG+FF
Sbjct: 297 QYFLKVVSTNFISLNGEEIPSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFF 356
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
+YE+SP+ V TE+ +S H T + G LVD+ +
Sbjct: 357 NYEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLVDSFI 399
>gi|321253192|ref|XP_003192660.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
gi|317459129|gb|ADV20873.1| ER to Golgi transport-related protein, putative [Cryptococcus
gattii WM276]
Length = 435
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 191/412 (46%), Gaps = 63/412 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+G DAF K ED KT G +T + I + ++ DY ++ + VD SRG
Sbjct: 10 FQGFDAFGKTMEDVKVKTRTGALLTFISLSIILTSVMLEFIDYRRIHLEPSIIVDRSRGE 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNA 125
KL I DI P + C L+LD +D SGE EH + K R+D +GK I + Q ++
Sbjct: 70 KLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRIDKNGKIISKVQGGQLKGD 129
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+++ + +DPN CGSCYGA CCN+C EV++AY K W+ + + I
Sbjct: 130 LERANLN----------QDPNYCGSCYGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGI 179
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E +K+K EGC+I G++ VN+V G+ H +PG S+ N + + ++ PY
Sbjct: 180 EQCVEEGWMDKMKEQNEEGCRIGGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR-- 237
Query: 246 FNTTHH-----IRHLSFGIKLQDDDE---------------RRKPLDGTVAKAEEGASMF 285
+ HH + FG + +E + PL G E MF
Sbjct: 238 -DKNHHDFGHIVHKFRFGGDMTKAEELTVLPKEQRWRDKLGLKDPLQGIKVHTEVSNYMF 296
Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDG-------------GMPGIFF 316
Y++K++ T + L+G ++ G G G+PG+FF
Sbjct: 297 QYFLKVVSTNFISLNGEEIPSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFF 356
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+YE+SP+ V TE+ +S H T + G L+D+ + + K++ K
Sbjct: 357 NYEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLLDSFIFNSSKRLKK 408
>gi|296811622|ref|XP_002846149.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma otae CBS 113480]
gi|238843537|gb|EEQ33199.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma otae CBS 113480]
Length = 435
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 139/440 (31%), Positives = 199/440 (45%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTI L + YL+ + DY +V EL V
Sbjct: 1 MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVIQPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V+H + K RL + +
Sbjct: 61 DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGKVIDV 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
++ KK DPN CG+CYG A + +K CCNTC EV++AY K
Sbjct: 121 TALDLHKKDDSPAH--------LDPNYCGNCYGVPAPSTAKKPGCCNTCAEVRDAYAEKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC +E ++++ EGC+I G L VN+V+G+FHIAPG S + + H H
Sbjct: 173 WAFGRGEGVTQCMDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T TH I L FG +L ++ R PLD + + +E F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHRTDEVRYNF 292
Query: 286 NYYIKIIPTIYERLD--------------------------GSK---------------- 303
Y++K++ T Y L GS+
Sbjct: 293 LYFVKVVSTSYLPLGWDATWSSEVHSQAHKDIPLGNHGVYFGSQGSIETHQYSVTSHKRS 352
Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
L GGD GG+P + F+YE+SP+ V E + KSL +T + I GT
Sbjct: 353 LDGGDDSAEGHKERQYARGGIPSVMFNYEISPMKVINRETRPKSLSTFFTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD LL+ ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432
>gi|336369994|gb|EGN98335.1| hypothetical protein SERLA73DRAFT_109778 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382751|gb|EGO23901.1| hypothetical protein SERLADRAFT_450196 [Serpula lacrymans var.
lacrymans S7.9]
Length = 988
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 189/403 (46%), Gaps = 57/403 (14%)
Query: 18 EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
ED KT G +TI+ I ++ DY V+ + VD SRG KL + +++ P
Sbjct: 591 EDVKVKTRTGAFLTILSAAIILAFTAMEFFDYRTVNVDTSIIVDRSRGEKLSVRMNMTFP 650
Query: 78 TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNAVKKKKVTTENG 136
+ C L+LD +D SGEQ V HNI+K R+ +G P+ + E+ N + K NG
Sbjct: 651 RVPCYLLSLDIMDISGEQQRDVSHNIHKTRITPEGGPVPGARNGELRNEIDKLNDQRSNG 710
Query: 137 TTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEK 196
CGSCYG CCN+C +V++AY + W+ D I QC E +EK
Sbjct: 711 Y----------CGSCYGGVEPEGGCCNSCEDVRQAYVNRGWSFNNPDNIEQCVAEGWSEK 760
Query: 197 LKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNT---THHIR 253
LK+ EGC I G L VN+V G+ +++PG S+ + + +++ PY N +H I
Sbjct: 761 LKDQAEEGCNISGRLRVNKVIGNINVSPGRSFQSSSRNFYELVPYLREDNNRHDFSHVIH 820
Query: 254 HLSF---------GIKLQDDDERR-----KPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
SF KL D ++R PLDG AK + MF Y++K++ T + +
Sbjct: 821 EFSFMTDDEYNLHKAKLGKDMKQRMGIAENPLDGLNAKTNKAQYMFQYFLKVVSTQFRTI 880
Query: 300 DGSKL-------------------GGGDG----------GMPGIFFSYELSPLMVKITEK 330
DG + GG +G G+PG FF++E+SP++V +E
Sbjct: 881 DGKTINTHQYSATHFERDLSKGSQGGDNGEGVVTQHGVSGVPGAFFNFEISPILVVHSEG 940
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
+S H T + G L+D+ L + +++ K G
Sbjct: 941 RQSFAHFLTSTCAIVGGVLTVAALLDSFLFATGRRLKKGSSNG 983
>gi|19113757|ref|NP_592845.1| COPII-coated vesicle component Erv46 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|1351651|sp|Q09895.1|YAI8_SCHPO RecName: Full=Uncharacterized protein C24B11.08c
gi|1061296|emb|CAA91773.1| COPII-coated vesicle component Erv46 (predicted)
[Schizosaccharomyces pombe]
Length = 390
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 128/411 (31%), Positives = 196/411 (47%), Gaps = 58/411 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M F L+ DAF K ED KT GG +T+V L + +++ ++ +Y +V E+ V
Sbjct: 1 MQFRSPLRRFDAFQKTVEDARIKTASGGLITLVSGLIVIFIVLMEWINYRRVIAVHEIIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
+ S G ++ I+ +I P I C L +D +D SGE + H + K RL G+ I
Sbjct: 61 NPSHGDRMEINFNITFPRIPCQILTVDVLDVSGEFQRDIHHTVSKTRLSPSGEIISVDDL 120
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET----ETRKCCNTCNEVKEAYRYKK 176
++ N ++ +++G +CG CYGA +T CCNTC+ V++AY
Sbjct: 121 DIGN----QQSISDDGAA--------ECGDCYGAADFAPEDTPGCCNTCDAVRDAYGKAH 168
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
W + ++D QCK+E E + EGC + G L VNR++G+FHIAPG S + HVH
Sbjct: 169 WRIGDVDAFKQCKDENFKELYEAQKVEGCNLAGQLSVNRMAGNFHIAPGRSTQNGNQHVH 228
Query: 237 DIQPYTSA--AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
D + Y + + +H I HLSFG L PLDGTV K + Y+IK +
Sbjct: 229 DTRDYINELDLHDMSHSIHHLSFGPPLDASVHYSNPLDGTVKKVSTADYRYEYFIKCVSY 288
Query: 295 IYERLDGSKL-----------------GGGD----------GGMPGIFFSYELSPLMVKI 327
+ L S L GG + GG+PG++F +++SP+ V
Sbjct: 289 QFMPLSKSTLPIDTNKYAVTQHERSIRGGREEKVPTHVNFHGGIPGVWFQFDISPMRV-- 346
Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
+ ++ N G +++ +L ALL CV S V+ G V K
Sbjct: 347 ---------IERQVRGNTFGGFLSNVL--ALLGGCVTLASFVDRGYYEVQK 386
>gi|349804919|gb|AEQ17932.1| putative ergic and golgi 3 [Hymenochirus curtipes]
Length = 228
Score = 197 bits (500), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 143/237 (60%), Gaps = 26/237 (10%)
Query: 94 EQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCY 152
EQ L VEHN++K RLD D +P+ E ++ + ++ + DP++C SCY
Sbjct: 1 EQQLDVEHNLFKLRLDKDRQPVSSEAERHDLGKAEEPVIFDPKSL------DPDRCESCY 54
Query: 153 GAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLE 212
GAET+ +CCN+C++V+EAYR + WA D+I QCK E ++K++ EGC++YG+LE
Sbjct: 55 GAETDDFRCCNSCDDVREAYRRRGWAFKTPDSIEQCKREGFSQKMQEQKNEGCRVYGFLE 114
Query: 213 VNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD 272
VN+V+G+FH APG S+ +HVHVHD+Q + N TH I+HLSFG+ D PLD
Sbjct: 115 VNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIKHLSFGM---DYPGLVNPLD 171
Query: 273 GTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPG 313
GT A + + MF Y++KI+PT+Y ++DG L GD G+PG
Sbjct: 172 GTSVSAVQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKVTNGLIGDQGLPG 228
>gi|194224360|ref|XP_001916465.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Equus caballus]
Length = 342
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 128/387 (33%), Positives = 186/387 (48%), Gaps = 68/387 (17%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
KL I++D+ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ E ++ +
Sbjct: 66 DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V+ K ++ DP++C SCYGAETE K C
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKPPYFC------------------- 159
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+ +L + +++ V +HD+Q +
Sbjct: 160 ----------------------LQDHLHSSLAGKGLPWGRDQEEALHAVEIHDLQSFGLD 197
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N TH+IRHLSFG +D PLD T A + + MF Y++K++PT+Y ++DG L
Sbjct: 198 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 254
Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
GD G+PG+F YELSP+MVK+TEK +S H T + I G
Sbjct: 255 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 314
Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
+ L+D+L++ + I K GKT
Sbjct: 315 FTVAGLIDSLIYHSARAIQKKIDLGKT 341
>gi|145340712|ref|XP_001415464.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575687|gb|ABO93756.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 379
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 124/388 (31%), Positives = 190/388 (48%), Gaps = 36/388 (9%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD---SSRGSK 67
D F K +DF +T GGA+ + + L + + +T +L VD + K
Sbjct: 1 DLFPKISDDFARRTATGGAIATIGLALMVILFLQQTAELMRTTTAYDLRVDDGVAGATKK 60
Query: 68 LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHN-IYKRRLDLDGKPI-QEPQKEVVNA 125
+ I++D+ + + C ++LDA+D +GE L V + + R+D G+ I ++ VNA
Sbjct: 61 IVINVDLTLRAMHCAQVSLDAMDVTGETRLDVSRSEVRTTRVDARGRAIAMTSERTAVNA 120
Query: 126 VKKKKVTTENGTTTTELED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
TE G E + CG CYGA E CC+ C+ V+EAYR K WALP+L
Sbjct: 121 ------KTEAGEREREATGGRSACGDCYGA-AEAGTCCDDCDSVREAYRVKGWALPDLRR 173
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+ QC EY ++N EGC G+ EVN+V+G+FHIAPG SY+ HVHD+ P+
Sbjct: 174 VTQCTKEYDVVAMRNEHKEGCHFSGHFEVNKVAGNFHIAPGKSYNNLGQHVHDLSPFAGV 233
Query: 245 -AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG-ASMFNYYIKIIPTIYERLD-- 300
+FN +H I LSFG + PLDG ++ A ++ Y + ++P Y+ L
Sbjct: 234 ESFNFSHIIHKLSFGEEFPG---VVNPLDGVTRTMDDANAGVYQYRLSVVPARYKYLGFR 290
Query: 301 -----------GSKLGGGD----GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
G D G+PG+FF Y+LSPL V+ E+ + + I
Sbjct: 291 ARVVESNDYSVTDHFRGFDVTKNPGLPGLFFFYDLSPLRVEYEERRIGFFQYLSNVAAII 350
Query: 346 SGTYITFMLVDALLHSCVKKI-SKVEIG 372
G +VD L++ + + KV++G
Sbjct: 351 GGVSAVVNIVDGLVYRGQRALREKVDLG 378
>gi|307105810|gb|EFN54058.1| hypothetical protein CHLNCDRAFT_25376, partial [Chlorella
variabilis]
Length = 312
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 102/309 (33%), Positives = 166/309 (53%), Gaps = 26/309 (8%)
Query: 81 CDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTT 140
C +L++DA+D SGE L V+H++YKRRL DG P+ E +K N +
Sbjct: 1 CSWLSIDAMDISGEVQLEVDHDVYKRRLSPDGTPLDEGGCPRAGWLKP---VPGNDSEAD 57
Query: 141 ELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNT 200
+ P CGSCYG+E+ +CCNTC EV++AYR K WAL +++ + QC +E E++
Sbjct: 58 PTKAPGYCGSCYGSESRAGQCCNTCAEVRDAYRTKGWALLDVEKVEQCHHEGYKEEIDEQ 117
Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
EGC ++G L++N+V+G+FHIAPG SY ++H+HD+ P+ AF+ +H I L+FG +
Sbjct: 118 KGEGCHVWGELQINKVAGNFHIAPGRSYQQGNMHIHDLSPFAGQAFDFSHTIHKLAFGRE 177
Query: 261 LQDDDERRKPLDG---TVAKAEEGASMFNYYIKIIPTIYERLDGSKL------------- 304
R + L +V E ++ Y++K++PT Y L + +
Sbjct: 178 YP--GTRGQALSTFCLSVGTRRERMGLYQYFLKVVPTSYSDLRNNTIYTNQFSVTEHFRE 235
Query: 305 ----GGGDGGMPGIFFSYELSPLMVKITEKSK-SLGHLWTKIMCNISGTYITFMLVDALL 359
G G +PG+F Y+LSP+ + +++ S T + I G + ++DA +
Sbjct: 236 TASPTAGGGQLPGVFLFYDLSPIKASLEGRARLSFLSFLTSLCAIIGGVFTVSGIIDATV 295
Query: 360 HSCVKKISK 368
+ + I K
Sbjct: 296 YHGQQAIKK 304
>gi|226292523|gb|EEH47943.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb18]
Length = 435
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 133/440 (30%), Positives = 190/440 (43%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV IS+LI + +Y ++ EL V
Sbjct: 1 MAPKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL+I P + C+ L LD +D SGE + H I K RL P+
Sbjct: 61 DKGRGERMEIHLNITFPHLPCELLTLDVMDVSGEMQSGIIHGISKVRL--------APES 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + + + T + DP+ CG CYGA + EV+EAY +
Sbjct: 113 EGGHVIDTTALVLHTQTDAAKHLDPDYCGPCYGAPPPSHATKPGVALPAKEVREAYASQS 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E ++ L EGC+I G L VN+V G+FHIAPG S+S ++H H
Sbjct: 173 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
D+ Y T + +H I L FG +L D D PLD T + F
Sbjct: 233 DLDTYYHTPVPHHMSHKIHQLRFGPQLSDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 292
Query: 286 NYYIKIIPTIYERLDGS------------------------------------------K 303
Y++K++ T Y L S
Sbjct: 293 MYFVKVVSTSYLPLGWSPEFSSSVHETTLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRS 352
Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
+ GGD GG+PG+F +Y++SP+ V E ++K+ T + I GT
Sbjct: 353 IDGGDDAAEGHKERLHSHGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD L+ V ++ K+
Sbjct: 413 TVAAAVDRALYEGVARVKKL 432
>gi|254581328|ref|XP_002496649.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
gi|238939541|emb|CAR27716.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
Length = 404
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 129/414 (31%), Positives = 193/414 (46%), Gaps = 60/414 (14%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF+K ED +T GG + ++C L +L+ + ++ QV EL VD R
Sbjct: 7 LRTFDAFSKTEEDVRIRTRTGGIIALLCCLVTIFLLISEWLNFNQVVNRPELVVDKDRQL 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL + DI P++ CD L+LD +DS+GE L + E K RLD +G+ +
Sbjct: 67 KLELEADITFPSMPCDMLSLDIMDSAGEIQLDLLESGFTKTRLDQNGQSL---------G 117
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKK 176
KV+ E + + +D N CG+CYGA+ ++R CC TCN+V+ AY
Sbjct: 118 SSSLKVSDE----SYDPKDENYCGACYGAKDQSRNNEVPKEERVCCQTCNDVRRAYLEAN 173
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E +++ EGC++ G +NR+ G+ H APG+++ H H
Sbjct: 174 WAFFDGKNIEQCEREGYVDRVNEQLNEGCRVQGSALLNRIQGTLHFAPGVAFQNPKGHFH 233
Query: 237 DIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGASM--FNY 287
D+ Y + N H I HLSFG + + R PLDG A + M F+Y
Sbjct: 234 DLSLYEKTHNLNFNHIINHLSFGKPVTSNARGRGASVATAPLDGRQAFPDRDTHMHQFSY 293
Query: 288 YIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSP 322
+ KI+PT YE +D + GG D GG PG+F +E+SP
Sbjct: 294 FTKIVPTRYEYMDKMVVETAQFSATLHDRPLHGGADQDHPTTLHTKGGFPGLFVYFEMSP 353
Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
L V E+ W+ + N + + V +L K K G K+V
Sbjct: 354 LKVINREQH---AQTWSGFILNCITSIGGVLAVGTVLDKITYKAQKSIWGKKSV 404
>gi|154280410|ref|XP_001541018.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150412961|gb|EDN08348.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 435
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 132/440 (30%), Positives = 193/440 (43%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI I +LI + +Y ++ EL V
Sbjct: 1 MPPKSRFARLDAFTKTVEDARIRTRLGGVVTISALFVIFFLIWGEWSEYRRIVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V H + K RL ++E +
Sbjct: 61 DKGRGERMEIHLNVTFPNLPCELLTLDVMDISGEYQTGVIHGVNKVRL----SSVEEGGR 116
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
+ + T GT DP+ CG CYGA + CCNTC EV++AY K
Sbjct: 117 VLDITALQLHSQTNKGTDV----DPDYCGQCYGATPPSNAKKPGCCNTCEEVRDAYAAKG 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E + L EGC++ G + VN+V G+FHIAPG S++ ++H H
Sbjct: 173 WAFGRGENVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVVGNFHIAPGRSFTNGNLHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
D+ Y T N H + +L FG +L + D+ PLD T F
Sbjct: 233 DLDNYYHTPVQHNMGHRVHYLRFGPQLPEELSSRWKWTDNHHTNPLDNTEQHTTNPRFNF 292
Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
Y++K++ T Y L G G
Sbjct: 293 IYFVKVVSTSYLPLGWDPDASSSAHSKYSKNAPLGKQGLSFGSYGSIETHQYSVTSHKRS 352
Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
GGD GG+PG+F +Y++SP+ V E ++KS T + I GT
Sbjct: 353 VDGGDDSAEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKSFSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
+D +L+ ++ K+
Sbjct: 413 TVAAAIDRVLYEGAVRVKKL 432
>gi|348667280|gb|EGZ07106.1| hypothetical protein PHYSODRAFT_319656 [Phytophthora sojae]
Length = 398
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 114/361 (31%), Positives = 178/361 (49%), Gaps = 27/361 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RLKGLDA+ K E+F +T+ GG +++ + IS L+ ++ Y T +++ VD R
Sbjct: 11 RLKGLDAYPKTIEEFKVRTLQGGLFSLLAFACISLLLVSELSFYLATDTVDKMTVDGGRN 70
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ + I+ D+ P ++C +AL++ D +G +EHNI K LD G+ + E +V+
Sbjct: 71 TMVAINFDVEFPRMACSVVALESADMAGNVQHDIEHNIRKIPLDHTGQALAEGMHDVIGG 130
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
T N E + P CGSCY A E +CC+TC VK AY K W +P L TI
Sbjct: 131 A-----LTNNTELHGETDKP-ACGSCYSA-GEPGECCDTCESVKAAYARKSWMMPSLHTI 183
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ + L+ EGC+I G L V++V+G + AP + ++ D+ T
Sbjct: 184 AQCQEVEIEKVLRGEVNEGCRIQGSLVVSKVAGKLYFAPSKFFRSGYLSSKDLVDATFKV 243
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA--EEGASMFNYYIKIIPTIYERLDGSK 303
F+T+H IR LSFG D + PLD + E+ F Y++K++PT Y L S+
Sbjct: 244 FDTSHTIRSLSFGEAYPD---MKNPLDNRKKELPDEKTRGSFQYFLKVVPTEYTFLSASR 300
Query: 304 L---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ D G+P + FSY SP+M +I + T + + G
Sbjct: 301 IITNQFSATEHFRQLTPVSDKGLPMVTFSYTFSPIMFRIEQYRVGFLQFLTSVCAIVGGV 360
Query: 349 Y 349
+
Sbjct: 361 F 361
>gi|225562998|gb|EEH11277.1| COPII coated vesicle component Erv46 [Ajellomyces capsulatus
G186AR]
gi|240279818|gb|EER43323.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H143]
gi|325092948|gb|EGC46258.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H88]
Length = 435
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 132/440 (30%), Positives = 193/440 (43%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTI I +LI + +Y ++ EL V
Sbjct: 1 MPPKSRFARLDAFTKTVEDARIRTRSGGVVTISALFVIFFLIWGEWSEYRRIVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V H + K RL ++E +
Sbjct: 61 DKGRGERMEIHLNVTFPNLPCELLTLDVMDISGEYQTGVIHGVNKVRL----SSVEEGGR 116
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
+ + T GT DP+ CG CYGA + CCNTC EV++AY K
Sbjct: 117 VLDITALQLHSQTNKGTDV----DPDYCGQCYGATPPSNAKKPGCCNTCEEVRDAYAAKG 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E + L EGC++ G + VN+V G+FHIAPG S++ ++H H
Sbjct: 173 WAFGRGENVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVVGNFHIAPGRSFTNGNLHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
D+ Y T N H I +L FG +L + D+ PLD T F
Sbjct: 233 DLDNYYHTPVQHNMGHRIHYLRFGPQLPEQLSSRWKWTDNHHTNPLDNTEQHTTNPRFNF 292
Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
Y++K++ T Y L G G
Sbjct: 293 MYFVKVVSTSYLPLGWDPDASSSAHSQYSKNAPLGKQGLSFGSYGSIETHQYSVTSHKRS 352
Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
GGD GG+PG+F +Y++SP+ V E ++K+ T + I GT
Sbjct: 353 VDGGDDSAEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
+D +L+ ++ K+
Sbjct: 413 TVAAAIDRVLYEGAVRVKKL 432
>gi|406606433|emb|CCH42207.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Wickerhamomyces ciferrii]
Length = 405
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 199/404 (49%), Gaps = 53/404 (13%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF+K ED KT GG +T+ C L + LI + + +++ EL VD R
Sbjct: 7 LLSFDAFSKTVEDARVKTTSGGLITVTCILTLFSLIINEWRQFNEITIDPELVVDRDRNL 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I+LD+ P + CD ++LD +D SG+ L V + + + I+ +
Sbjct: 67 KLDINLDVTFPDLPCDIMSLDIMDVSGDLQLDVTNYGFTK--------IRLTETGEEIGE 118
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYRYKKW 177
++ K+ ++G ++ + CG CYGA+ E + CCN C+ V++AY W
Sbjct: 119 EEMKIGDDHGHADADIP-ADYCGPCYGAKNQDKNENKPQEEKVCCNDCDSVRKAYASVGW 177
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
A + + QC+ E +K+ + EGC++ G ++NR++G+ H APG SYS + HVHD
Sbjct: 178 AFFDGKNVEQCEREGYVKKINDRLGEGCRVKGTAKLNRINGNIHFAPGASYSAPNRHVHD 237
Query: 238 IQPY-TSAAFNTTHHIRHLSFGIKLQDD------DERRKPLDGTVAKAEEGASMFNYYIK 290
+ Y + FN H I H SFG + + PLDGT A +++Y++K
Sbjct: 238 LSLYGKNKDFNFRHVINHFSFGPDVNSKYTAETLELSSHPLDGTNAIQGSRDHLYSYFLK 297
Query: 291 IIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMV 325
++PT YE L+G+K+ GG D GG+PG+FF +E+SPL
Sbjct: 298 VVPTRYEYLNGTKVETNQFSSTYHDRPLTGGRDEDHPNTFHARGGIPGLFFHFEMSPL-- 355
Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
KI K ++ G W+ + N+ + V A++ V KV
Sbjct: 356 KIINK-ETYGTSWSGFLLNVISAIGGILTVGAVVDRTVFVADKV 398
>gi|149237735|ref|XP_001524744.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451341|gb|EDK45597.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 411
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 203/415 (48%), Gaps = 54/415 (13%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED KT GG +T++C L LI + DY + T EL VD
Sbjct: 7 KLISLDAFAKTVEDARIKTASGGIITLLCCLVALILIRNEYIDYTTIVTLPELVVDRDIN 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV------EHNIYKRRLDLDGKPIQEPQ 119
+L I++D+ P + CD + +D D +G+ L V ++ I KR + K ++E
Sbjct: 67 KQLEINMDMSFPNLPCDMINMDLFDETGDMKLDVINSGLEKYRIIKRG---NNKVVEELD 123
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKW 177
+ A+++++ E E E +CGSCYGA + +K CCN+C V+ AY +KKW
Sbjct: 124 DQP--ALRREQPLHEICKGLGENEQ-GECGSCYGALPQDKKEYCCNSCAAVRRAYAHKKW 180
Query: 178 ALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
+ + I QC+ E +KLK+ EGC++ G ++NRV+G+ APG+S + N HV
Sbjct: 181 QFFDGENIEQCEKEGYVQKLKDRINQNEGCRVKGSAKINRVAGTMDFAPGISTTSNGQHV 240
Query: 236 HDIQPYTS--AAFNTTHHIRHLSFG------IKLQDDDERRKPLDGTVAKAEEGASMFNY 287
HD+ YT FN H I HLSFG LQ+ D PLDG + M NY
Sbjct: 241 HDLSLYTKYPDKFNFDHVIHHLSFGKIPTAITNLQETDS-LSPLDGHSFLQHKRYHMNNY 299
Query: 288 YIKIIPTIYERLDGSK----------------LGGGD----------GGMPGIFFSYELS 321
Y+KI+ T +E LDG+K +GG D GG+P + F +++S
Sbjct: 300 YLKIVSTRFENLDGTKKVDTNQFSVITHDRPLVGGKDEDHQHTLHARGGVPSVAFHFDIS 359
Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
PL + E+ W+ + + + ++V ALL V + G K +
Sbjct: 360 PLKIINRER---YAKTWSGFVLGVVSSVAGVLMVGALLDRSVFAAQQAMKGKKDL 411
>gi|353237029|emb|CCA69011.1| related to ERV46-component of copii vesicles [Piriformospora indica
DSM 11827]
Length = 428
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 125/405 (30%), Positives = 188/405 (46%), Gaps = 50/405 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
KG+DAF + ED KT G +T++ FI+ ++ D+ +V + VD SRG
Sbjct: 9 FKGIDAFGRTSEDVKVKTRTGAFLTLISAFFIATFTFIEFMDFRRVGVDTAIVVDRSRGE 68
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL + +I P + C L LD D SG+ + H++ K RLD +P + +
Sbjct: 69 KLQVVFNITFPRVPCFLLNLDVTDISGDVVREITHHVVKTRLD---PAAHQPIPDGIYRT 125
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K ++ T T++ CGSCYG + CCNTC++V+ AY + WA D I
Sbjct: 126 DLKSDLSKQLTATSK----GYCGSCYGGQPPEGGCCNTCDDVRRAYTDRGWAFGNPDQID 181
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC +E TEK+ EGC I G + VN+V+G+ +PG S+ +N V+ + PY +
Sbjct: 182 QCVSENWTEKIMAMQREGCNIEGRVRVNKVTGNMQFSPGRSFVVNRPEVYALVPYLKDSN 241
Query: 247 N-TTHHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASMFNYYIKI 291
+ HHI L +D RR PL+ A E MF Y++K+
Sbjct: 242 HFFGHHIHSLEIYDYEEDTWTRRNLPEQIKERLGITKPPLEDVYAHTESADYMFQYFLKV 301
Query: 292 IPTIYERLDG-----------------SKLGGG---DG--------GMPGIFFSYELSPL 323
+ + Y+ LDG + + G DG G+PG+FF++E+SP+
Sbjct: 302 VKSSYKGLDGKAYSTHQYSTSSFERDLATMSHGKNEDGIEIVHERQGVPGVFFNFEISPM 361
Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
V E+ +S H T + I G LVDALL + I K
Sbjct: 362 EVIHIEQRQSWAHFITSMAAIIGGVLTVATLVDALLFNTQGLIKK 406
>gi|260950825|ref|XP_002619709.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
gi|238847281|gb|EEQ36745.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
Length = 415
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 132/420 (31%), Positives = 199/420 (47%), Gaps = 49/420 (11%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M RL LDAF K ED KT GG +T+VC L + +LI + DY V EL V
Sbjct: 1 MSSRPRLLSLDAFAKTVEDARVKTASGGVITLVCVLIVLFLIRNEYSDYMSVVVRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
+ +L I+LDI P + C ++LD +D +G+ HL VE R+ G+ I +
Sbjct: 61 NRDVNRQLDINLDITFPDVPCGVMSLDILDMTGDLHLDIVESGFEMFRVLPSGEEISDDL 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKW 177
+ A K + V T E+ CG CYGA +T+ ++CCNTC V+ AY ++W
Sbjct: 121 PLLSGAKKFEDVCGP--LTEDEISRGVPCGPCYGAVDQTDNKRCCNTCEAVRMAYAVQEW 178
Query: 178 ALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
+ I QC+ E EK+ + EGC+I G ++NR+SG+ H APG+ S N H
Sbjct: 179 GFFDGSNIEQCEREGYVEKMVSRINNNEGCRIKGSAKINRISGNLHFAPGVPLSRNGRHS 238
Query: 236 HDIQPYT--SAAFNTTHHIRHLSFG------IKLQDDDERRK----PLDGTVAKAEEGAS 283
HD+ +T S F+ H I H SFG +L D+ ++ PLDG ++
Sbjct: 239 HDLSLWTKYSNKFSIDHKINHFSFGEDPSASRRLASTDDSQEPSIHPLDGFHFDLKKKNH 298
Query: 284 MFNYYIKIIPTIYERLDGSK-----------------LGGGD----------GGMPGIFF 316
+ +YY+ ++ T +E LDG K +GG D GG+PG FF
Sbjct: 299 VASYYLSVVSTRFEFLDGKKEAVDTNQFSVITHDRPIVGGRDDDHQNTMHAQGGVPGAFF 358
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
+++SP+ + E+ W+ + + + + V A L V +V G K +
Sbjct: 359 HFDISPMKIISREE---YAKTWSGFILGVVSSIAGVLTVGAALDRSVWTAEQVLRGKKDM 415
>gi|156844136|ref|XP_001645132.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
70294]
gi|156115789|gb|EDO17274.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
70294]
Length = 405
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 129/419 (30%), Positives = 195/419 (46%), Gaps = 59/419 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L LDAF +P E+ +T GG +TI C L YL+ + + +V + +L V
Sbjct: 1 MSKKSKLSSLDAFARPDEEVRIRTKMGGIITISCILTTLYLLSWEWSKFREVISKPQLVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQ 119
D SKL ++LDI P + CD++ LD +D SG+ L V E+ K RLD DGK ++
Sbjct: 61 DRDHSSKLELNLDISFPNVPCDFINLDIMDDSGDLQLDVLEYGFTKTRLDPDGKVLETDD 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA---------ETETRKCCNTCNEVKE 170
++ ++G +T DPN CG CYG+ E R CC TC +V++
Sbjct: 121 FDMYK---------QDGAPST---DPNYCGPCYGSIDQSKNDEVEASERVCCQTCEDVRK 168
Query: 171 AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSI 230
AY WA + I QC+ E +K+ + EGC++ G +NR+ G+ H APG S+
Sbjct: 169 AYVKAGWAFYDGKGIEQCEQEGYVKKINSHLNEGCRVAGSASLNRIQGNIHFAPGKSFQT 228
Query: 231 NHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGAS 283
H HD Y + N H I H SFG ++ R PLDG E
Sbjct: 229 VRGHFHDQSLYERNPQLNFNHIIHHFSFGKEIPTKLASRHSKNIVNPLDGRSVAPERDTH 288
Query: 284 M--FNYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFF 316
+ F+YY KI+PT +E L+ + + GG D G+PG+FF
Sbjct: 289 LHQFSYYTKIVPTRFEYLNKAVVDTAQFSATYHDRPLRGGADDDHPNTFHFRSGIPGVFF 348
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
++ SP +K+ K G W+ N + + V ++L + K + +G K+
Sbjct: 349 FFDASP--IKVINKEYISGS-WSSFFLNCITSIGGVLAVGSMLDRLMYKAQRSFLGKKS 404
>gi|401839164|gb|EJT42494.1| ERV46-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 415
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 128/415 (30%), Positives = 191/415 (46%), Gaps = 66/415 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTKAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ CD + LD +D SGE L + + RLD +G+P+ +
Sbjct: 66 KLELNIDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMTRLDKEGRPVGD-------- 117
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKK 176
+ +V + +DPN CG CYGA +T+ CC C+ V+ AY
Sbjct: 118 AAELQVGGDGDGVAPVNDDPNYCGPCYGARDQTQNENLAQADKVCCQDCDAVRSAYLDAG 177
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+ E K+ EGC+I G ++NR+ G+ H APG + + H H
Sbjct: 178 WAFFDGKNIEQCEREGYVSKINEHLHEGCRIEGSAQINRIQGNIHFAPGRPFQNANGHFH 237
Query: 237 DIQPY-TSAAFNTTHHIRHLSFGI------KLQDDDERR-------KPLDGTVAKAEEG- 281
D+ Y + N H I HLSFG KL ++D+R PLDG E
Sbjct: 238 DVSLYEKTPDLNFNHMINHLSFGKPIESRNKLLENDDRHGGAVIATSPLDGRKVFPERTT 297
Query: 282 -ASMFNYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIF 315
+ +F+Y+ KI+PT YE LD + GG D GG+PG+F
Sbjct: 298 HSHLFSYFAKIVPTRYEYLDDVVIETAQFSATYHSRPLRGGRDQDHPNTFHARGGIPGLF 357
Query: 316 FSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
+E+SPL V E+ G W+ + N I G ++D L + + I
Sbjct: 358 VFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409
>gi|443925078|gb|ELU44001.1| ER-derived vesicles protein ERV46 [Rhizoctonia solani AG-1 IA]
Length = 383
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 122/390 (31%), Positives = 180/390 (46%), Gaps = 64/390 (16%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
+F KGLD F K ED KT G +T++ I ++ DY +V + VD
Sbjct: 5 LFGGAFKGLDGFGKTMEDVKVKTRTGAFLTMLSAAIILTFTIIEFIDYRRVVVDSSILVD 64
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEP--Q 119
SRG KL + ++I P + L+LD D SGE + HN+ K RLD +G+ IQ+
Sbjct: 65 RSRGEKLTVKMNITFPRVPL--LSLDVTDISGEIQQDLTHNMVKTRLDSNGQIIQDGFHN 122
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
E+ N V+K T + CGSCYG E CC TC V++AY + W+
Sbjct: 123 NELDNDVEK----------TMKARPQGYCGSCYGGEPPEGGCCQTCESVRQAYMNRGWSF 172
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+ D I QC E+ T K+ +EGC I G + VN+V+G+FH +PG S+ +N H D+
Sbjct: 173 GDPDAIEQCVAEHWTAKIHEQNSEGCHISGRVRVNKVTGNFHFSPGRSFVLNRGHFQDLV 232
Query: 240 PYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-------------- 283
PY + H++ F + + +DE R GT + + G S
Sbjct: 233 PYLKDGNHHDFGHYVHEFRFEGESEAEDEWRGTDRGTRWRKKVGISANPLDQVSAHVVDD 292
Query: 284 -----MFNYYIKIIPTIYERLDGS--------------KLGGGDG--------------- 309
MF Y++K++ T ++ LDG L GDG
Sbjct: 293 RASNYMFQYFMKVVSTEFKYLDGDIIRSHQYSVTSYERDLTHGDGAERDSHGTLTAHGVQ 352
Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
G+PG FF++E+SP+MV E ++ H T
Sbjct: 353 GLPGAFFNFEISPMMVVHRETRQTFAHFAT 382
>gi|151941348|gb|EDN59719.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
gi|190406692|gb|EDV09959.1| ER-Golgi transport vesicle protein [Saccharomyces cerevisiae
RM11-1a]
gi|207348028|gb|EDZ74008.1| YAL042Wp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256272276|gb|EEU07261.1| Erv46p [Saccharomyces cerevisiae JAY291]
gi|259144662|emb|CAY77603.1| Erv46p [Saccharomyces cerevisiae EC1118]
gi|323334778|gb|EGA76150.1| Erv46p [Saccharomyces cerevisiae AWRI796]
gi|323338873|gb|EGA80087.1| Erv46p [Saccharomyces cerevisiae Vin13]
gi|323349926|gb|EGA84136.1| Erv46p [Saccharomyces cerevisiae Lalvin QA23]
gi|365767200|gb|EHN08685.1| Erv46p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 415
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ CD + LD +D SGE L + + RL+ +G+P+ + + V
Sbjct: 66 KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125
Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
NG T + DPN CG CYGA+ +++ CC C+ V+ AY
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + I QC+ E K+ EGC+I G ++NR+ G+ H APG Y + H
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
HD Y ++ N H I HLSFG +Q +D+R PLDG +
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296
Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
F+Y+ KI+PT YE LD L GG GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGM 356
Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
F +E+SPL V E+ G W+ + N I G ++D L + + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409
>gi|6319274|ref|NP_009358.1| Erv46p [Saccharomyces cerevisiae S288c]
gi|1723191|sp|P39727.2|ERV46_YEAST RecName: Full=ER-derived vesicles protein ERV46
gi|1326054|gb|AAC04989.1| Yal042wp [Saccharomyces cerevisiae]
gi|285810158|tpg|DAA06944.1| TPA: Erv46p [Saccharomyces cerevisiae S288c]
gi|392301230|gb|EIW12318.1| Erv46p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 415
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ CD + LD +D SGE L + + RL+ +G+P+ + + V
Sbjct: 66 KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125
Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
NG T + DPN CG CYGA+ +++ CC C+ V+ AY
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + I QC+ E K+ EGC+I G ++NR+ G+ H APG Y + H
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
HD Y ++ N H I HLSFG +Q +D+R PLDG +
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296
Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
F+Y+ KI+PT YE LD L GG GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHVRGGIPGM 356
Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
F +E+SPL V E+ G W+ + N I G ++D L + + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409
>gi|349576209|dbj|GAA21381.1| K7_Erv46p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 415
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ CD + LD +D SGE L + + RL+ +G+P+ + + V
Sbjct: 66 KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125
Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
NG T + DPN CG CYGA+ +++ CC C+ V+ AY
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + I QC+ E K+ EGC+I G ++NR+ G+ H APG Y + H
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
HD Y ++ N H I HLSFG +Q +D+R PLDG +
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296
Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
F+Y+ KI+PT YE LD L GG GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGM 356
Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
F +E+SPL V E+ G W+ + N I G ++D L + + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409
>gi|320583549|gb|EFW97762.1| COPII-coated vesicle membrane protein Erv46, putative [Ogataea
parapolymorpha DL-1]
Length = 400
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 130/387 (33%), Positives = 185/387 (47%), Gaps = 52/387 (13%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
DAF+K +D KT GG +T+VC L L+ + DY ++ T EL VD R KL
Sbjct: 10 FDAFSKTVDDARIKTTSGGILTLVCILTTLLLLINEYTDYSRIVTRPELVVDRDRHKKLE 69
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNAVKK 128
I+LDI + CD L +D +D SG+ L + K RLD G I + +
Sbjct: 70 INLDISFQNMPCDLLTMDIMDQSGDMQLDLLSSGFSKIRLDRQGNEIGQ---------EN 120
Query: 129 KKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKKWAL 179
+V E T++ DP CGSCYGA ++R CCN+C VK+AY W
Sbjct: 121 MRVNQEFALTSS---DPTYCGSCYGAADQSRNDELPQDQKVCCNSCESVKQAYARNAWKF 177
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+ I QC+ E +++ EGC++ G E+ R+ G+ H APG S + N HVHD+
Sbjct: 178 YDGKDIEQCEKEGYVDRINARLDEGCRVRGTAEIARIGGNLHFAPGSSMNFNEKHVHDLS 237
Query: 240 PYT--SAAFNTTHHIRHLSFGIKLQD--DDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
Y S FN H I H SFG+ D + PLD T + +++Y++K++ T
Sbjct: 238 LYDMHSNKFNFDHTINHFSFGLDDHSVADYKTTHPLDATTHRDGRKYHVYSYFLKVVNTR 297
Query: 296 YERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKITEK 330
YE LDG K+ GG D GG+PG+FF +E+SPL + E+
Sbjct: 298 YEFLDGRKVETNQFSATQHDRPFRGGRDEDHPNTIHAQGGLPGVFFHFEISPLKIINREQ 357
Query: 331 -SKSLGHLWTKIMCNISGTYITFMLVD 356
+K+ ISG F L+D
Sbjct: 358 YNKTWSAFALGACAAISGVLTVFTLLD 384
>gi|323356370|gb|EGA88170.1| Erv46p [Saccharomyces cerevisiae VL3]
Length = 415
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ CD + LD +D SGE L + + RL+ +G+P+ + + V
Sbjct: 66 KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125
Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
NG T + DPN CG CYGA+ +++ CC C+ V+ AY
Sbjct: 126 ---------NGDGTXPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + I QC+ E K+ EGC+I G ++NR+ G+ H APG Y + H
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
HD Y ++ N H I HLSFG +Q +D+R PLDG +
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296
Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
F+Y+ KI+PT YE LD L GG GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGM 356
Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
F +E+SPL V E+ G W+ + N I G ++D L + + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409
>gi|365989554|ref|XP_003671607.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
gi|343770380|emb|CCD26364.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
Length = 438
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 135/405 (33%), Positives = 184/405 (45%), Gaps = 68/405 (16%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
S +L DAF K E+ +T GG +TI C L YL+ + + V T+ +L VD
Sbjct: 6 SAKLLSFDAFAKTEEEVRIRTNTGGIITISCILVTLYLLLNEWSQFNSVITSPQLVVDRD 65
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKP------IQ 116
R KL ++LDI P ISCD + LD +D SGE L ++ K RLD G P +
Sbjct: 66 RNLKLELNLDISFPNISCDLINLDIMDESGELQLDLLDSTFIKTRLDPQGNPLDNDNNVA 125
Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNE 167
+ ++V V E +DP+ CGSCYG++ +T CC TCN+
Sbjct: 126 DTDADLVIGVDDLTKNGEKRLKEILAKDPDYCGSCYGSQDQTENESKSKDQKICCQTCND 185
Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
V+++Y WA + I QC+NE K+ EGC+I G +NR+ G+ H APG S
Sbjct: 186 VRDSYLNAGWAFFDGAQIEQCENEGYVAKINKHLEEGCRIKGQALLNRIQGNIHFAPGKS 245
Query: 228 YS----INHVHVHDIQPYTSA-AFNTTHHIRHLSFGIK--------LQDDDERRK----P 270
YS H HD Y N H I HLSFG L+D +R+K P
Sbjct: 246 YSNYKAKGSTHRHDTSLYDKVKKMNFNHIIHHLSFGKSIDKVGKNDLKDYSDRKKFSINP 305
Query: 271 LDG---TVAKAEEGASMFNYYIKIIPTIYERLDGSKL------------------GGGD- 308
LD V F+YY KI+PT YE LD K+ GG D
Sbjct: 306 LDDRKVIVKDFNPAFHQFSYYTKIVPTRYEFLD-EKISSIETAQFSATYHSRPIQGGTDE 364
Query: 309 ---------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
GG+PG+FF +E+SP +K+ K W+ + N
Sbjct: 365 DHPTTFHSRGGIPGLFFFFEMSP--IKVINKEHHF-RTWSSFLLN 406
>gi|148674214|gb|EDL06161.1| ERGIC and golgi 3, isoform CRA_a [Mus musculus]
Length = 238
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 107/253 (42%), Positives = 150/253 (59%), Gaps = 18/253 (7%)
Query: 29 AVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDA 88
AVTIV L + L ++ Y EL+VD SRG KL I++D++ P + C YL++DA
Sbjct: 1 AVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDKLKINIDVLFPHMPCAYLSIDA 60
Query: 89 VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
+D +GEQ L VEHN++K+RLD DG P+ + + + K +VT + + DPN+C
Sbjct: 61 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HELGKVEVTVFDPNSL----DPNRC 114
Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIY 208
SCYGAE+E KCCN+C +V+EAYR + WA DTI QC+ E ++K++ EGCQ+Y
Sbjct: 115 ESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVY 174
Query: 209 GYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR 268
G+LEVN+V G S VHD+Q + N TH+I+HLSFG +D
Sbjct: 175 GFLEVNKVPGG---------SKARQLVHDLQSFGLDNINMTHYIKHLSFG---EDYPGIV 222
Query: 269 KPLDGTVAKAEEG 281
PLD T A +G
Sbjct: 223 NPLDHTNVTAPQG 235
>gi|325189930|emb|CCA24410.1| hypothetical protein BRAFLDRAFT_63528 [Albugo laibachii Nc14]
Length = 699
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 196/380 (51%), Gaps = 27/380 (7%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ +D K ++F KT+ GG ++++ I YL+ ++ Y V +++ VD SR
Sbjct: 322 KLRNVDFNPKTLDEFKVKTINGGILSLLSIGLIGYLLVSELIFYLSVDIVDKMLVDGSRN 381
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ I+ D+ P + C + L++ SSGE H ++H+++K+ +DL+GK + K +++
Sbjct: 382 RMVTINFDVEFPRMPCSIVTLESTGSSGEIHHDIQHSVHKQAIDLNGKILSAGMK--LDS 439
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ K T ++ T E +CGSCYGA + +CCNTC +V++AY ++W +P L TI
Sbjct: 440 IGKAW-TNQSDTVAEEKTVKVECGSCYGAGA-SGECCNTCEDVQQAYASRRWNIPSLHTI 497
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ + L +T EGC+IYG + V +V G AP + ++ +I T
Sbjct: 498 EQCQKSEIEKLLHSTVEEGCRIYGSIAVTKVHGKVLFAPAKALLSGYISTEEILDKTIKI 557
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDG---TVAKAEEGASMFNYYIKIIPTIYERLDGS 302
F+T+H I +L FG + E + PL+G + K G + Y+++++PT Y L+G
Sbjct: 558 FDTSHKINYLDFG---ERYPEMKSPLNGHNTILPKGTRGT--YQYFLQVVPTAYYYLNGG 612
Query: 303 KLGG---------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ G+ +P I F Y+ SP+M +I ++ + T + + G
Sbjct: 613 IIDTNQYSVTQHYQELTPLGEQQLPMITFQYKFSPIMFQIEQRRRGYLQFLTSLCAILGG 672
Query: 348 TYITFMLVDALLHSCVKKIS 367
+ VD++L + + S
Sbjct: 673 VFTMVGAVDSILFAYSNQFS 692
>gi|47214843|emb|CAF95749.1| unnamed protein product [Tetraodon nigroviridis]
Length = 299
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 100/231 (43%), Positives = 140/231 (60%), Gaps = 14/231 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ + + L ++ Y EL+VD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
KL I++DIV P + C YL++DA+D +GEQ L VEHN++K+RLD + KP+ E +K +
Sbjct: 66 DKLKINIDIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLKPVSTEAEKHELG 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ +V + DPN+C SCYGAET+ KCCN+C++V+EAYR + WA DT
Sbjct: 126 GAEDVEVFDPSTL------DPNRCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADT 179
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVS-------GSFHIAPGLSY 228
I QCK E T+K++ EGCQ+YG LEVN+VS G F + G +
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGVLEVNKVSLIAQEGGGKFSLCSGKKF 230
>gi|254569250|ref|XP_002491735.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv41p [Komagataella pastoris GS115]
gi|238031532|emb|CAY69455.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv41p [Komagataella pastoris GS115]
gi|328351763|emb|CCA38162.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Komagataella pastoris CBS 7435]
Length = 401
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 121/388 (31%), Positives = 187/388 (48%), Gaps = 44/388 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K +D KT GG +T++C + L+ + DY V EL VD
Sbjct: 5 KLLSLDAFAKTADDVKVKTTSGGVITLICLIVTLILVTNEYFDYQTVVIRPELVVDRDHA 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I L++ I C+ LA+D +D +G+ + + + +++ +DG +E + VN
Sbjct: 65 KKLDISLNVTFHHIPCELLAMDIMDITGDLQIDLLMSGFQKTRVVDGLA-KETTELRVNE 123
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETET---------RKCCNTCNEVKEAYRYKK 176
K+ EN T +P CGSCYGA + + CCNTC VK+AY
Sbjct: 124 YKQ-----ENNKLTNS-NNPYYCGSCYGALNQKDNENKPFDEKLCCNTCESVKKAYAKAG 177
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC+NE + + + EGCQ+ G ++NRVSG+ H APG S + H+H
Sbjct: 178 WAFYDGRNIEQCENEGYVQLVTSMVDEGCQVSGTAQINRVSGNLHFAPGSSLTSGSRHIH 237
Query: 237 DIQPYTS--AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
D+ + FN H + HLSFG + + + PLDG A +++Y++K++ T
Sbjct: 238 DLSLFEKYPDKFNFDHTVNHLSFGKTIDNQEMSTHPLDGYEAATGNKNHLYSYFLKVVAT 297
Query: 295 IYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKITE 329
YE + G K GG D GG+PG FF +E+SPL + E
Sbjct: 298 RYESMSGLKWDTNQFSATYHDRPLEGGRDSDHPNTLHASGGIPGAFFHFEISPLKIINRE 357
Query: 330 K-SKSLGHLWTKIMCNISGTYITFMLVD 356
+ SK+ + +++G ++D
Sbjct: 358 QYSKTRSAFALGVSASVAGVLTLGSVLD 385
>gi|255712984|ref|XP_002552774.1| KLTH0D01144p [Lachancea thermotolerans]
gi|238934154|emb|CAR22336.1| KLTH0D01144p [Lachancea thermotolerans CBS 6340]
Length = 402
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 127/407 (31%), Positives = 191/407 (46%), Gaps = 61/407 (14%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L +DAF K ED +T GG +T+ C + L+ + +V T +L VD R
Sbjct: 5 KLLSIDAFAKTEEDVRIRTRTGGLITLSCVVVTFLLLLSEWFHLKEVVTRPQLVVDRDRH 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQ----EPQK 120
KL +++DI P I C L +D +DS+GE L V + + K RLD G+ + +P K
Sbjct: 65 LKLDLNMDITFPHIPCYLLNMDIMDSAGEMQLEVLNKGWSKTRLDPSGQVLDTKQFKPGK 124
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET---------RKCCNTCNEVKEA 171
+VV+ ED N CG CYGA ++ R CC TC++V+EA
Sbjct: 125 DVVDYAP---------------EDENYCGPCYGARDQSKNDEVNVDERVCCQTCDDVREA 169
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
Y K+WA + I QC+ E E++ EGC+I G ++NR+ G+ H APG +
Sbjct: 170 YAEKQWAFFDGKNIEQCEREGYVEQVNEHIEEGCRIKGMAKLNRIGGNLHFAPGKGFHNI 229
Query: 232 HVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQD---DDERRKPLDGTVAKAE--EGASMF 285
H HD Y S + N H I HLSFG +++D PLDGT E F
Sbjct: 230 RGHFHDASLYQNSPSLNFNHIIHHLSFGKEVEDITGQGASTAPLDGTNVSPEFDTHKHQF 289
Query: 286 NYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYEL 320
+Y+ KI+PT YE L G + GG D GG P ++F +E+
Sbjct: 290 SYFAKIVPTRYEYLSGETVETTQFTTTYHSRPLKGGRDSDHPTTLHSQGGFPSVYFYFEM 349
Query: 321 SPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
SPL ++ + ++S W + +I G ++D + + + +
Sbjct: 350 SPLKVINKQQYAQSWSGFWLNCITSIGGVLAVGTVLDKITYKAQRSM 396
>gi|403215799|emb|CCK70297.1| hypothetical protein KNAG_0E00290 [Kazachstania naganishii CBS
8797]
Length = 408
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 190/411 (46%), Gaps = 53/411 (12%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ L +DAF++ +D +T G VT+ C + +L+ + + + + L +
Sbjct: 1 MMRRSTLLSMDAFSRAEDDVRVRTRAGAYVTLACLVTTVFLLLSEYRQWNTIVSRSSLVI 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE---HNIYKRRLDLDGKPIQE 117
D G KL + LD+ P + CD ++ D +D SG L V+ ++ K R+D G+P+
Sbjct: 61 DREHGLKLDLRLDVTFPHLPCDLVSFDVLDDSGVLLLDVDDENNHFTKTRIDQRGEPL-- 118
Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEV 168
+A + DP+ CGSCYG+ +TR CCNTC+ V
Sbjct: 119 ------DAAAAASFKLDAEAAQLPPTDPDYCGSCYGSRDQTRNDELDPANKVCCNTCSSV 172
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
+EAY WA + I QC+ E +K+ TEGC+I G + +NRV G+ H APG ++
Sbjct: 173 REAYLDAGWAFFDGKNIEQCEREGYVDKISQRITEGCRIKGGVRLNRVQGNIHFAPGDAF 232
Query: 229 SINHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERRK-------PLDG--TVAKA 278
H HD Y + + N H I HLSFG + + K PLDG + +
Sbjct: 233 RSARGHFHDTSMYDQTGSLNFDHIIHHLSFGPSVDNMQSLEKASNVAIAPLDGKQVLPRY 292
Query: 279 EEGASMFNYYIKIIPTIYERLDGS--------------KLGGG--------DGGMPGIFF 316
+ A + Y+ KI+PT +E GS +GGG GG PG++F
Sbjct: 293 DSHAYQYTYFTKIVPTRFEYFSGSVIETTQFSSTFSARPIGGGTTETATYTSGGTPGLYF 352
Query: 317 SYELSPLMVKITEKSK-SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+ E+SPL V E++K S + +I G +VD +L+ + +
Sbjct: 353 NIEMSPLKVIHKEQNKISWSGFLLNCITSIGGVLAVGTVVDKILYRAERTL 403
>gi|327296796|ref|XP_003233092.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
gi|326464398|gb|EGD89851.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
Length = 435
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 133/440 (30%), Positives = 190/440 (43%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTI L + YL+ + DY +V EL V
Sbjct: 1 MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V+H + K RL
Sbjct: 61 DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
E + ++ + DPN CG CYG A + +K CCNTC+EV++AY K
Sbjct: 113 EGGRVIDVTALSLHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC +E ++++ EGC+I G L VN+V+G+FHIAPG S + + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T TH I L FG +L ++ R PLD + K E F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNF 292
Query: 286 NYYIKIIPTIYERLDGSKLGGGDG----------GMPGIFF------------------- 316
Y++K++ T Y L + G G+FF
Sbjct: 293 LYFVKVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRS 352
Query: 317 --------------------------SYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
+YE+SP+ V E + KSL +T + I GT
Sbjct: 353 LDAEDASADGHKERQHARGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD LL+ ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432
>gi|302666755|ref|XP_003024974.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
gi|291189052|gb|EFE44363.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
Length = 435
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 133/440 (30%), Positives = 189/440 (42%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTI L + YL+ + DY +V EL V
Sbjct: 1 MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V+H + K RL
Sbjct: 61 DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
E + + + DPN CG CYG A + +K CCNTC+EV++AY K
Sbjct: 113 EGGRVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC +E ++++ EGC+I G L VN+V+G+FHIAPG S + + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T TH I L FG +L ++ R PLD + K E F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNF 292
Query: 286 NYYIKIIPTIYERLDGSKLGGGDG----------GMPGIFF------------------- 316
Y++K++ T Y L + G G+FF
Sbjct: 293 LYFVKVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRS 352
Query: 317 --------------------------SYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
+YE+SP+ V E + KSL +T + I GT
Sbjct: 353 LDAEDASADGHKERQHSRGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD LL+ ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432
>gi|302511557|ref|XP_003017730.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
gi|291181301|gb|EFE37085.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
Length = 435
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 133/440 (30%), Positives = 189/440 (42%), Gaps = 79/440 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTI L + YL+ + DY +V EL V
Sbjct: 1 MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V+H + K RL
Sbjct: 61 DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
E + + + DPN CG CYG A + +K CCNTC+EV++AY K
Sbjct: 113 EGGRVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC +E ++++ EGC+I G L VN+V+G+FHIAPG S + + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T TH I L FG +L ++ R PLD + K E F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNF 292
Query: 286 NYYIKIIPTIYERLDGSKLGGGDG----------GMPGIFF------------------- 316
Y++K++ T Y L + G G+FF
Sbjct: 293 LYFVKVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRS 352
Query: 317 --------------------------SYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
+YE+SP+ V E + KSL +T + I GT
Sbjct: 353 LDAEDASADGHKERQHARGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTL 412
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD LL+ ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432
>gi|323454843|gb|EGB10712.1| hypothetical protein AURANDRAFT_2571, partial [Aureococcus
anophagefferens]
Length = 380
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 181/373 (48%), Gaps = 32/373 (8%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L+ +D + K ++F +T+ GG ++ + L+ ++ VST + LFV+SS G
Sbjct: 7 KLRNMDMYPKTKDEFRVRTMQGGVSSLFAVVVAIILVRSELKHSLAVSTHDRLFVNSSHG 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV--V 123
L + ++ P +C+ LA+DA D SG+ V+ ++ K RLD +G+ + +K V
Sbjct: 67 DGLSVRFELEFPRANCELLAIDANDESGQPLEGVQQHVIKTRLDTNGRRVLVNRKAANSV 126
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
+ V + E+ E + CG CYGA+ + R CC TC++V+ AYR + W E
Sbjct: 127 HKVGDTATSEEHLAAPDEAKPEVACGDCYGAQDDERPCCATCDDVRSAYRKRGWTFHE-H 185
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-HDIQPYT 242
T+ QC E + L EGC I G LE+ VSG+FH+APG + + D+ T
Sbjct: 186 TVAQCAGELAEAALDLDSDEGCSIKGTLELPAVSGNFHVAPGRHLQTSGLFKGMDLVQLT 245
Query: 243 SAAFNTTHHIRHLSFGI---KLQDDDERRK----------PLDGTVAKAEEGASMFNYYI 289
FN +H ++ L FG L+ RK LDG +G M YY+
Sbjct: 246 FDKFNVSHTVKQLRFGPDERSLEPARASRKVVGPDVDLSSQLDGESRTLGDGYGMHQYYL 305
Query: 290 KIIPTIYERLDGSK--------------LGGGDG-GMPGIFFSYELSPLMVKITEKSKSL 334
K++PT+Y+ L G + G G G+PG+FF YE+SPL + E+
Sbjct: 306 KVVPTVYKNLGGKTRELWQYSVTEHVRHVAPGSGKGLPGVFFFYEVSPLCAEFVERRNGW 365
Query: 335 GHLWTKIMCNISG 347
L T + + G
Sbjct: 366 LALLTGLAAIVGG 378
>gi|315044047|ref|XP_003171399.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma gypseum CBS 118893]
gi|311343742|gb|EFR02945.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma gypseum CBS 118893]
Length = 435
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 154/311 (49%), Gaps = 23/311 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTI L + YL+ + DY +V EL V
Sbjct: 1 MAGKSRFTRLDAFAKTVEDARIRTRSGGIVTITALLVVLYLVWGEWKDYRRVVVQPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V+H + K RL
Sbjct: 61 DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
E + + + DPN CG CYG A + +K CCNTC EV++AY K
Sbjct: 113 EGGKVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSNAKKPGCCNTCEEVRDAYAEKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC +E ++++ EGC+I G L VN+V+G+FHIAPG S + + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T +H I L FG +L ++ R PLD + K +E F
Sbjct: 233 DLDNYYHTPVPHTMSHTIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSDHKTDEARYNF 292
Query: 286 NYYIKIIPTIY 296
Y++K++ T Y
Sbjct: 293 MYFVKVVSTSY 303
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 309 GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
GG+P + F+YE+SP+ V E + KSL +T + I GT VD LL+ ++
Sbjct: 371 GGIPSVIFNYEISPMKVINREARPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGGLRVK 430
Query: 368 KV 369
K+
Sbjct: 431 KL 432
>gi|326476034|gb|EGE00044.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
gi|326481270|gb|EGE05280.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Trichophyton equinum CBS 127.97]
Length = 435
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 154/311 (49%), Gaps = 23/311 (7%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAF K ED +T GG VTI L + YL+ + DY +V EL V
Sbjct: 1 MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG ++ IHL++ P + C+ L LD +D SGE V+H + K RL
Sbjct: 61 DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
E + + + DPN CG CYG A + +K CCNTC+EV++AY K
Sbjct: 113 EGGKVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC +E ++++ EGC+I G L VN+V+G+FHIAPG S + + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
D+ Y T +H I L FG +L ++ R PLD + K E F
Sbjct: 233 DLDNYYHTPVPHTMSHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEARYNF 292
Query: 286 NYYIKIIPTIY 296
Y++K++ T Y
Sbjct: 293 LYFVKVVSTSY 303
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 309 GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
GG+P + F+Y++SP+ V E + KSL +T + I GT VD LL+ ++
Sbjct: 371 GGIPSVMFNYDISPMKVINRESRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVK 430
Query: 368 KV 369
K+
Sbjct: 431 KL 432
>gi|344301666|gb|EGW31971.1| hypothetical protein SPAPADRAFT_50577 [Spathaspora passalidarum
NRRL Y-27907]
Length = 410
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 127/401 (31%), Positives = 195/401 (48%), Gaps = 44/401 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL LDAF K +D +T GG +T++C L LI + DY V T EL VD
Sbjct: 8 RLLSLDAFAKTVDDARIRTTSGGIITLLCVLITLVLIRNEYIDYTTVITRPELVVDRDIN 67
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQKEVVN 124
+L I+LDI + CD ++D +D +G+ L+ + K RL D I +
Sbjct: 68 KQLVINLDISFINLPCDMASIDLLDETGDMQLNIINAGFQKLRLIKDKGNIVREISDDTP 127
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWALPEL 182
A+ + +E E DP CGSCYGA + + + CCN C VK AY ++W+ +
Sbjct: 128 ALNLDRPLSEVVKGLPEGGDPKTCGSCYGALPQEKHQYCCNDCYSVKRAYAERRWSFFDG 187
Query: 183 DTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+ I QC+ E ++L+ EGC+I G ++NRVSG+ APG S++ + HVHD+
Sbjct: 188 ENIEQCEKEGYVKRLRQRINDNEGCRIKGSAKINRVSGTMDFAPGASFTSDGRHVHDVSL 247
Query: 241 YT--SAAFNTTHHIRHLSFGIKLQDDDERRK------PLDGTVAKAEEGASMFNYYIKII 292
Y FN H I HLSFG +D R + PLDG + + +YY+K++
Sbjct: 248 YGKYQDKFNFDHIINHLSFG----SNDAREEILNSVHPLDGYQFMLHKKHHVASYYLKVV 303
Query: 293 PTIYERLDGSK----------------LGGGD----------GGMPGIFFSYELSPLMVK 326
T +E LD SK GG D GG+PG+ F +++SPL +
Sbjct: 304 ATRFESLDQSKRLDTNQFSVITHDRPLTGGKDEDHEHTLHARGGIPGVEFHFDISPLKII 363
Query: 327 ITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
E+ +K+ ++ +I+G + L+D +++ + I
Sbjct: 364 NKEQYAKTWSGFVLGVISSIAGVLMVGTLIDRSVYATQQAI 404
>gi|387219467|gb|AFJ69442.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Nannochloropsis gaditana CCMP526]
Length = 432
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 187/389 (48%), Gaps = 43/389 (11%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEE-LFVDSSRG 65
L+ +D FTK +++ +T G ++ + W+ + L+C + + F S T+E L VD+S G
Sbjct: 34 LERMDVFTKFHDEDKIQTSRGASMALFSWVLVLVLLCSEAYEAFLTSRTKEHLVVDTSLG 93
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I LD+ ++C + +DA+D +G+ + VEHN+ K+RL G+ I P E
Sbjct: 94 DKLNITLDMTFHALTCADVHVDAMDVAGDNQMQVEHNMLKQRLSSQGERIGFPFLEDPTD 153
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
KK G + CGSC+ A T T CCN+C ++++AY + + ++ T
Sbjct: 154 FDSKKADALLGAAPWDY-----CGSCFQARTHTGACCNSCQDLEQAYLTQGLPMGKIKTT 208
Query: 186 V-QCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
QC + EGC + G++ VN+V+G+FHIA G S + H+H P
Sbjct: 209 APQCLPGFQAPAPSGPMQKGEGCNLKGFMSVNKVAGNFHIAFGDSVVKDGRHIHQFIPSE 268
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV--AKAEEGASMFNYYIKIIPTIYE--- 297
+ FN +H I+H+SFG + R PLDG V + G +F Y+IK+IPT Y+
Sbjct: 269 APFFNVSHTIQHVSFG---DEYPGRVNPLDGKVKYVSSTVGTGLFQYFIKVIPTHYKGRA 325
Query: 298 ------------------------RLDGSKLGGGD--GGMPGIFFSYELSPLMVKITEKS 331
RL G D +PG+FF Y+LSP V+++ S
Sbjct: 326 GEAIRTNRISVTERFKPLHKEGEARLTGDSHAHNDQTSVLPGVFFIYDLSPFNVEVSTVS 385
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLH 360
H K+ G + L+D + +
Sbjct: 386 VPFSHFLVKLCAIAGGVFSISRLLDNVFY 414
>gi|344230637|gb|EGV62522.1| hypothetical protein CANTEDRAFT_131007 [Candida tenuis ATCC 10573]
Length = 410
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 122/401 (30%), Positives = 193/401 (48%), Gaps = 42/401 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED KT GG +T+V + +LI + DY + T EL VD
Sbjct: 6 KLLSLDAFAKTVEDARVKTASGGIITLVSITIVLFLIRNEYLDYTSIITRPELVVDRDIN 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
KL I LDI P+I C + LD +D SG L + N +++ R+ G+ + +++
Sbjct: 66 QKLDITLDISFPSIPCSMINLDILDVSGNVELDILQNGFQKYRILSSGEEVLMKNAPLID 125
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWALPEL 182
+ + + G E + CG CYG+ + RK CCN C ++ AY K WA +
Sbjct: 126 STPLEVMA--KGLDKPEDAEHTPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDG 183
Query: 183 DTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+ I C++E + +++ EGC++ G ++NR+SG+ H APG S++ HVHD+
Sbjct: 184 ENIKPCEDEGYVKAIQSEIFNNEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSL 243
Query: 241 YTSAA--FNTTHHIRHLSFG----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
Y FN H I HLSFG D + PLDG +E +++Y++K++ T
Sbjct: 244 YNKFPDRFNFDHTINHLSFGKDPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVST 303
Query: 295 IYERL------------------DGSKLGGGD----------GGMPGIFFSYELSPLMVK 326
YE L D GG D GG+PG++F +++SPL +
Sbjct: 304 RYEYLQEKLKAPLETNQFSAIYHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKII 363
Query: 327 ITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
E+ SK+ ++ +I+G + L+D + + K I
Sbjct: 364 NKEQYSKTWSGFVLGVISSIAGVLMIGSLLDRSVWAAEKAI 404
>gi|344230638|gb|EGV62523.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
Length = 409
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 122/401 (30%), Positives = 193/401 (48%), Gaps = 42/401 (10%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED KT GG +T+V + +LI + DY + T EL VD
Sbjct: 5 KLLSLDAFAKTVEDARVKTASGGIITLVSITIVLFLIRNEYLDYTSIITRPELVVDRDIN 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
KL I LDI P+I C + LD +D SG L + N +++ R+ G+ + +++
Sbjct: 65 QKLDITLDISFPSIPCSMINLDILDVSGNVELDILQNGFQKYRILSSGEEVLMKNAPLID 124
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWALPEL 182
+ + + G E + CG CYG+ + RK CCN C ++ AY K WA +
Sbjct: 125 STPLEVMA--KGLDKPEDAEHTPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDG 182
Query: 183 DTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+ I C++E + +++ EGC++ G ++NR+SG+ H APG S++ HVHD+
Sbjct: 183 ENIKPCEDEGYVKAIQSEIFNNEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSL 242
Query: 241 YTSAA--FNTTHHIRHLSFG----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
Y FN H I HLSFG D + PLDG +E +++Y++K++ T
Sbjct: 243 YNKFPDRFNFDHTINHLSFGKDPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVST 302
Query: 295 IYERL------------------DGSKLGGGD----------GGMPGIFFSYELSPLMVK 326
YE L D GG D GG+PG++F +++SPL +
Sbjct: 303 RYEYLQEKLKAPLETNQFSAIYHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKII 362
Query: 327 ITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
E+ SK+ ++ +I+G + L+D + + K I
Sbjct: 363 NKEQYSKTWSGFVLGVISSIAGVLMIGSLLDRSVWAAEKAI 403
>gi|367017984|ref|XP_003683490.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
gi|359751154|emb|CCE94279.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
Length = 406
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 126/414 (30%), Positives = 189/414 (45%), Gaps = 62/414 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MV L DAF+K ED +T GG +++ C + +L+ + ++ QV T +L V
Sbjct: 1 MVQKSALLSFDAFSKTEEDVRIRTRSGGLISLSCVVLTIFLLISEWLNFNQVVTRPQLVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQ 119
D R KL +DI P++ C ++LD +D++GE L + E K R+D +GK I
Sbjct: 61 DRDRQLKLDFVVDITFPSMPCAMISLDIMDNAGELQLDIMEAGFTKTRIDSNGKEI---- 116
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE---------TETRKCCNTCNEVKE 170
+ ++ +D N CGSCYGA+ E R CC TC++V++
Sbjct: 117 -------STSSFDASDSSSDYVPDDENYCGSCYGAKDQDKNDELPKEERVCCQTCDDVRK 169
Query: 171 AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSI 230
AY +WA + I QC+ E E++ EGC++ G ++R+ G+ H APG +
Sbjct: 170 AYLEAEWAFYDGKNIEQCEREGYVERINQQLNEGCRVQGNALLSRIQGTIHFAPGRGFQN 229
Query: 231 NHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGAS 283
N H HD+ Y + N H I HLSFG + E R PLDG +
Sbjct: 230 NRGHFHDMSLYDNTPQLNFNHIIHHLSFGKPINSGAEDRGAATSTHPLDGRQVFPDRDTH 289
Query: 284 M--FNYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFF 316
+ F+Y+ KI+PT YE LD + GG D GG PG+F
Sbjct: 290 LHQFSYFAKIVPTRYEYLDDVVVETAQFSTTYHDRPLRGGVDDDHPNTLHSRGGSPGMFV 349
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
+E+SPL V E+ W+ + N I G ++D +L+ K I
Sbjct: 350 YFEMSPLKVINKEQH---AQTWSGFLLNCITSIGGVLAVGTVLDKVLYKAQKSI 400
>gi|224011116|ref|XP_002294515.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970010|gb|EED88349.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 454
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 187/385 (48%), Gaps = 21/385 (5%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVS--TTEELFVD 61
++ +K LD F K D+ +T GG T+V ++ + LI + + ++ + E + VD
Sbjct: 71 AKTVKKLDFFPKLERDYEVRTERGGQATLVGYVIMLVLILAEFWTWRGLNGESLEHIVVD 130
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+S G ++ ++L+I P + CD L LD +D +G+ L + ++K RL+LDG ++ K
Sbjct: 131 TSLGKRMRVNLNITFPNLHCDDLHLDVIDVAGDSQLDLSDTLFKHRLNLDGT-LRSKAKI 189
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
A K + ++ + CG CYGA+ + CCNTC++V E Y+ K+W
Sbjct: 190 ATEANIKADEDKKKQEALSKDIPADYCGPCYGADEKEGDCCNTCDDVMERYKKKRWNENA 249
Query: 182 LDTIV-QCKNE--YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ + QC E E + + EGC + G+ VNRV+G+FHIA G + H+H
Sbjct: 250 VQPLAEQCIREGKGKNEPKRMSNGEGCNLSGHFTVNRVAGNFHIAMGEGVDRDGRHIHQF 309
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDD--------DERRKPLDGTVAKAEEGASMFNYYIK 290
P FN +H + L F + D + + V + +F Y+IK
Sbjct: 310 LPEDRMNFNASHVVHELIFMDEEYGDMVIAGVPGETSMNSVSKVVTEDTGTTGLFQYFIK 369
Query: 291 IIPTIYERLDGSKL-------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++PT Y+ G L + +PG+FF YE+ P V++T+ HL +IM
Sbjct: 370 VVPTKYKGKSGGTLHEKVEHHDTQNAVLPGVFFVYEIYPFAVEVTKNKVPFMHLLIRIMA 429
Query: 344 NISGTYITFMLVDALLHSCVKKISK 368
+ G + +D+ L+S KK S+
Sbjct: 430 TVGGVFTIMGWIDSALYSREKKSSR 454
>gi|294657513|ref|XP_459821.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
gi|199432751|emb|CAG88060.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
Length = 402
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 125/411 (30%), Positives = 194/411 (47%), Gaps = 54/411 (13%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L +D F K ED KT GG +T+VC + +LI + DY + T EL VD
Sbjct: 6 KLISIDVFAKTVEDAKIKTASGGIITLVCIFIVMFLIRNEYKDYTSIITRPELVVDRDIN 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+KL I+LD+ P + CD L LD +D SG+ L + + +++ + ++E E+++
Sbjct: 66 TKLDINLDVSFPNMPCDVLTLDILDISGDLQLDILKSGFQKY-----RILKESNHEILDE 120
Query: 126 VKKKKVTTENGTTTTELED----PNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWAL 179
N + E+ KCG CYGA + CCN+C VK AY K WA
Sbjct: 121 AP----VLSNDLSLEEMAKGVGANGKCGPCYGALPQDNNEYCCNSCETVKLAYAEKMWAF 176
Query: 180 PELDTIVQCKNE----YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
+ I QC+NE TE++ N EGC++ G ++NR+SG+ H APG S + H+
Sbjct: 177 YDGKDIEQCENEGYVSRLTERINN--NEGCRVKGTAQINRISGNLHFAPGSSSTAPGRHI 234
Query: 236 HDIQPYT--SAAFNTTHHIRHLSFGIKLQDDDERRK--PLDGTVAKAEEGASMFNYYIKI 291
HD+ + FN H I H SFG D++ ++ PLD +E + +YY+K+
Sbjct: 235 HDLSLFEKYEDKFNFDHVINHFSFGSDPHDNNLQQSTHPLDNHQLVFDEKYHVASYYLKV 294
Query: 292 IPTIYERLDGS---------------KLGGGD-----------GGMPGIFFSYELSPLMV 325
+ T +E +D S L GG GG+PG+FF +E+SP+
Sbjct: 295 VATRFEFIDTSLPLDTNQFSVISHHRPLRGGKDEDHKHTLHARGGLPGVFFHFEISPM-- 352
Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
KI K + W+ + + + ++V +L V K G K +
Sbjct: 353 KIINKEQ-YAKTWSGFILGVISSVAGVLMVGTVLDRSVWAAEKAIKGKKDM 402
>gi|444321132|ref|XP_004181222.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
gi|387514266|emb|CCH61703.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
Length = 414
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 190/412 (46%), Gaps = 58/412 (14%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF K E+ +T GG +T+ C L YL+ + +Y++++ ++ VD R
Sbjct: 4 KLLSFDAFNKTDEEVRIRTRTGGIITLFCILTTLYLLQKEWIEYYKITNKPQVVVDRDRH 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
KL ++LDI P++SCD + LD VD SGE L V E K R+D +G + + + V
Sbjct: 64 LKLELNLDITFPSLSCDLIGLDIVDDSGETSLDVLESGFTKIRVDTNGNELDDGSQLDVG 123
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----------RKCCNTCNEVKEAYRY 174
T ++ +++ CG CYGA ++ + CC TC +V++AY
Sbjct: 124 -------TDRESLSSLDMDKAKYCGPCYGALDQSGNDNIDVASEKVCCQTCYDVRKAYTD 176
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
WA + I QC+ E +++ + EGC+I G +NR+ G+ H APG ++ H
Sbjct: 177 VGWAFFDGKDIEQCEREGYVDRINDHLHEGCRIVGSALLNRIQGNVHFAPGAAFETAKGH 236
Query: 235 VHDIQPY-TSAAFNTTHHIRHLSFGIKLQD----------DDERRKPLDGTVAKAEEGAS 283
HD Y + N H I HLSFG + RR+PLDG V E +
Sbjct: 237 FHDTSLYDKTEQLNFNHIINHLSFGKTGHELLTPKSSKSFSVSRRQPLDGRVMIPESRNT 296
Query: 284 ---MFNYYIKIIPTIYERLDGS---------------KLGG----------GDGGMPGIF 315
F+Y+ KI+PT +E L G GG G G+PG+F
Sbjct: 297 HFFQFSYFAKIVPTRFESLSGKVEEAAQYSVTFHSRPLQGGRDEDHPNTFHGRSGIPGLF 356
Query: 316 FSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
++++PL ++ I S++ L + I G ++D + + + I
Sbjct: 357 IYFQMAPLKVIDIEAHSQTFSGLLLNCITTIGGVLAVGTMMDKVFYKAQRSI 408
>gi|209876426|ref|XP_002139655.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555261|gb|EEA05306.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 395
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 190/389 (48%), Gaps = 52/389 (13%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ +K +D + K ++D+ K+ G ++I+ ++ + L + Y T E + VD +
Sbjct: 32 KSVKYIDIYGKVHDDYCAKSTSGSIMSILVYILVIILTIGEFLKYIGGETVEHIGVDDNM 91
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
KL I LDI P++ C +++D VD+ GE ++ N+ K +D+ G +QE
Sbjct: 92 NQKLDIRLDISFPSLRCSEISVDTVDNVGENQVNAHGNLLKIPIDIHGNEVQE------- 144
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
++ + +T+ KC SC+GAE+ KCCNTC +K A+RYK W+ ++ +
Sbjct: 145 -----EIMAQYNESTSM-----KCLSCFGAESIHYKCCNTCESLKSAFRYKGWSYLDIAS 194
Query: 185 IV-QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY-T 242
QC N T GC+++G L+VN+VSG+ H+A G + + HVH+
Sbjct: 195 KAPQCIN-----------TVGCRLHGSLQVNKVSGNIHVALGQATVRDGKHVHEFNMNDI 243
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDER-RKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
S FNT+H I L FG +D+ E PL+ T G SMF+YY+K++PT + +
Sbjct: 244 SRGFNTSHTIHELRFG---KDNIEFIGSPLENTKKIVTTGTSMFHYYLKLVPTQFIKSGY 300
Query: 302 SKL------------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
SK+ G G+PG+F Y+ P +++ S H T
Sbjct: 301 SKVLFSNQYTYTERQKDVLVKDGELSGLPGVFIVYDFQPFVIRKIHNSIPTTHFLTSFCA 360
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIG 372
I G Y LVD++L +K+ S + G
Sbjct: 361 IIGGIYSLMSLVDSILFWFIKRTSAILSG 389
>gi|401626934|gb|EJS44847.1| erv46p [Saccharomyces arboricola H-6]
Length = 415
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 126/416 (30%), Positives = 184/416 (44%), Gaps = 68/416 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTKAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ C+ + LD +D SGE L + + R+D DG P+ + + V
Sbjct: 66 KLELNMDVTFPSMPCELVNLDIMDDSGELQLDILDAGFTMTRVDKDGHPVGDATELHVGG 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAE---------TETRKCCNTCNEVKEAYRYKK 176
NG T +DPN CG CYGA E + CC C+ V+ AY K
Sbjct: 126 ---------NGEGATPNDDPNYCGQCYGARDQSNNENLAQEDKVCCQNCDSVRSAYLDKG 176
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYS-INHVHV 235
WA + I QC+ E K+ + EGC+I G ++NR+ G+ H APG + H
Sbjct: 177 WAFFDGKDIEQCEKEGYVNKINDHLHEGCRIEGSAQINRIQGNIHFAPGKPFQDTRGNHR 236
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDER-------------RKPLDGTVAKAEEG 281
HD Y + N H I LSFG +Q +R PLDG +
Sbjct: 237 HDTSLYDKTPDLNFNHIINRLSFGKPIQSHHKRLGNDKLHGGAVVSTSPLDGRQVFPDRP 296
Query: 282 ASM--FNYYIKIIPTIYERLDGS--------------KLGGG-----------DGGMPGI 314
F+Y+ KI+PT YE LD + LGGG GG+ G+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDSTVIETAQFSATYHSRPLGGGRDQDHPNTFHARGGISGL 356
Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
+ +E+SPL V E+ G W+ + N I G ++D L + + I
Sbjct: 357 YVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409
>gi|345319994|ref|XP_001507420.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Ornithorhynchus anatinus]
Length = 203
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 122/205 (59%), Gaps = 19/205 (9%)
Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
CCNTC +V+EAYR + WA DTI QCK E ++K++ EGCQ+YG+LEVN+V+G+F
Sbjct: 1 CCNTCEDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNF 60
Query: 221 HIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
H APG S+ +HVH + N TH+I HLSFG +D PLDGT A +
Sbjct: 61 HFAPGKSFQQSHVHGKERLRIHPRPINMTHYIEHLSFG---EDYPGIVNPLDGTDVSAPQ 117
Query: 281 GASMFNYYIKIIPTIYERLDG-------------SKLGG---GDGGMPGIFFSYELSPLM 324
+ MF Y++K++PT+Y + DG K+ GD G+PG+F YELSP+M
Sbjct: 118 ASMMFQYFVKVVPTVYVKADGEVVRTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMM 177
Query: 325 VKITEKSKSLGHLWTKIMCNISGTY 349
VK+TEK +S H T + I G +
Sbjct: 178 VKLTEKHRSFTHFLTGVCAIIGGVF 202
>gi|300123299|emb|CBK24572.2| unnamed protein product [Blastocystis hominis]
Length = 376
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 181/368 (49%), Gaps = 49/368 (13%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F+++L+ LD + K +D+ KT GG V++ I L ++ +Y +V+ T+ + +D+
Sbjct: 25 FTKKLEKLDIYPKIGDDYVIKTESGGFVSLFSGFIIIILFVSELTNYLKVNRTDVITIDN 84
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
+R KL I+ +I + I C +LD +D SG+Q + V I + LD + KP+ V
Sbjct: 85 TRNEKLQINFNISLYGIPCSEASLDIMDISGQQQMGVTSRIVQLDLDENHKPVNMALSSV 144
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+ +K + DP CGSC+GA + CCNTC++V AY + W
Sbjct: 145 L---YEKNI------------DP-ACGSCFGASL-SNVCCNTCDDVLSAYERRGW----- 182
Query: 183 DTIV------QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
DT QC+ K ++GC ++G LEVN+V+G+FHIA G + + + H+H
Sbjct: 183 DTWFVSKYSPQCRKNNDEVKKPRVNSQGCMMWGVLEVNKVAGNFHIAVGHAANRDSHHIH 242
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
P + FN THHI LSFG + + PLDG AE S NYY+K++PT+Y
Sbjct: 243 SFNPLMISKFNVTHHIEKLSFGEHIPG---IQNPLDGHDMVAESLTSQ-NYYLKVMPTVY 298
Query: 297 ERLDGSKLG-----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
+ + G +PGIFF Y+++P M +TE + H
Sbjct: 299 SNRTSTVVSNELSVNEVSRRVEMTPFGQITSLPGIFFIYDITPFMHVVTESRIAFAHFLV 358
Query: 340 KIMCNISG 347
++ I G
Sbjct: 359 RVCAVIGG 366
>gi|354544621|emb|CCE41346.1| hypothetical protein CPAR2_303350 [Candida parapsilosis]
Length = 412
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 193/403 (47%), Gaps = 45/403 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED KT GG +T++C +LI + DY V EL VD
Sbjct: 7 KLISLDAFAKTVEDARIKTASGGIITLLCIFVALFLIRNEYIDYTTVIARPELVVDRDIN 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDG---KPIQEPQKE 121
+L I+LDI + CD +++D D SG+ L + + K R+ G KP++ K+
Sbjct: 67 KQLDINLDISFLNLPCDLVSIDLFDESGDLKLDIINSQLEKFRIIKQGHSSKPVE--IKD 124
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWAL 179
A++++ + E + +CGSCYGA + +K CCNTC V+ AY W
Sbjct: 125 EQPALQREVPLEQIAPGLPEGQTEGECGSCYGAVPQDKKQYCCNTCAAVRRAYAEANWQF 184
Query: 180 PELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
+ + I QC+ E ++LK EGC++ G ++NR+SG+ APG S + + HVHD
Sbjct: 185 FDGENIAQCEQEGYVQRLKQRIGENEGCRVKGTAKINRISGTMDFAPGASMTKDGRHVHD 244
Query: 238 IQPYT--SAAFNTTHHIRHLSFG-----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIK 290
+ Y FN H I HLSFG KL D PLDG + NY++K
Sbjct: 245 LSLYQKYKDKFNFDHVINHLSFGNNPPASKLVDTGS-ITPLDGHKFLQHKKYHSINYFLK 303
Query: 291 IIPTIYERLDGS---------------KLGGGD-----------GGMPGIFFSYELSPL- 323
I+ T +E LDG L GG GG+PG+ F++++SPL
Sbjct: 304 IVATRFESLDGKHKFDTNQFSVITHDRPLAGGKDEDHQHTLHARGGVPGVAFNFDISPLK 363
Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
++ E +K+ ++ +I+G + L+D + + + I
Sbjct: 364 IINREEYAKTRSGFILGVVSSIAGVLMVGSLMDRSVFAAQQAI 406
>gi|366987855|ref|XP_003673694.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
gi|342299557|emb|CCC67313.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
Length = 425
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 124/396 (31%), Positives = 182/396 (45%), Gaps = 65/396 (16%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
S +L DAF K E+ +T GG +T+ C + YL+ + + V T+ +L VD
Sbjct: 8 SAKLLSFDAFAKTEEEVRVRTNTGGIITLSCIIVTLYLLLNEWSQFNSVITSPQLVVDRD 67
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQEPQKEV 122
R KL ++ D+ P+ISCD + LD +D SGE L + + + K R+D DG + EV
Sbjct: 68 RNLKLELNFDVTFPSISCDLINLDIMDDSGELQLDLLDSAFTKIRVDADGNELGSSTLEV 127
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYR 173
+V N DP+ CGSCYG++ E+R CC TCN+V+EAY
Sbjct: 128 GTDDLASEVQQRNN-------DPDYCGSCYGSKVQDENDKLPRESRVCCQTCNDVREAYL 180
Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY----- 228
W + I QC+ E K+ EGC++ G ++R+ G+ H APG SY
Sbjct: 181 NIGWGFFDGKGIEQCEKEGYVAKINEHLKEGCRVKGQTLLSRIQGNIHFAPGKSYTSYKR 240
Query: 229 SINHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERRK---------PLDG---TV 275
S + H HD Y ++ N H I HLSFG + DE+ + PLDG
Sbjct: 241 STSASHYHDTSLYDKTSNLNFNHKINHLSFGKPIDKLDEKVQDHSTEFSISPLDGREVIP 300
Query: 276 AKAEEGASMFNYYIKIIPTIYERLDGSK-----------------LGGGD---------- 308
+ +++YY KI+PT YE L+ + GG D
Sbjct: 301 TDIDTHYHVYSYYAKIVPTRYEFLNKKEKSIETAQFSTTFHSRPLRGGRDADHPTTMHSQ 360
Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
GG+PG+F +E+S VK+ K W+ + N
Sbjct: 361 GGIPGLFIYFEMSA--VKVINKEHHF-RSWSSFLLN 393
>gi|255732259|ref|XP_002551053.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
gi|240131339|gb|EER30899.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
Length = 414
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 126/414 (30%), Positives = 201/414 (48%), Gaps = 54/414 (13%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L DAF K ED KT GG +T++C L LI + DY + T EL V
Sbjct: 1 MSSRPKLLSFDAFAKTVEDARIKTASGGIITLICVLITLILIRNEYIDYTTIITRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
D +L I+LDI + CD +++D +D +G+Q L ++ + K RL ++ Q
Sbjct: 61 DRDINKQLDINLDISFINLPCDLISVDLLDVTGDQQLDIIDSGLKKVRL------LKNKQ 114
Query: 120 KEV-VNAVKKKKVTTENGTTTTEL-------EDPNK-CGSCYGAETETRK--CCNTCNEV 168
+V +N ++ K + + EL D N CG CYGA + +K CCN CN V
Sbjct: 115 GDVIINEIEDDKPALNSDVSLKELAKGLPEGSDQNAYCGPCYGALPQDKKQFCCNDCNTV 174
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGL 226
+ AY K+W + + I QC+ E ++L+ EGC+I G ++NRVSG+ APG
Sbjct: 175 RRAYAEKQWQFFDGENIEQCEKEGYVKRLRERINNNEGCRIKGSTKINRVSGTMDFAPGS 234
Query: 227 SYSINHVHVHDIQPYT--SAAFNTTHHIRHLSFG-IKLQDDDERR----KPLDGTVAKAE 279
S++ + H HD+ Y + FN H I HLSFG + + E PLD
Sbjct: 235 SFNHDGRHFHDLSLYKKYNDKFNFDHVINHLSFGEVPTNNGAEEMFDSIHPLDDYQFMLH 294
Query: 280 EGASMFNYYIKIIPTIYERLDGSK----------------LGGGD----------GGMPG 313
+ + +Y++K++ T YE LD SK +GG D GG+PG
Sbjct: 295 KKDHVVSYFLKVVATRYESLDYSKRVDTNQFSVITHDRPLIGGKDEDHQHTLHARGGIPG 354
Query: 314 IFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+ F++++SPL ++ + +K+ ++ +I+G + L+D + + + I
Sbjct: 355 VNFNFDISPLKIINRQQYAKTWSGFILGVVSSIAGVLMVGTLLDRSVFAAQQAI 408
>gi|407044387|gb|EKE42566.1| hypothetical protein ENU1_017250 [Entamoeba nuttalli P19]
Length = 354
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 175/371 (47%), Gaps = 41/371 (11%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ +K DA+ K + K GG ++IVC + + ++ ++ DYF + L VD S+
Sbjct: 2 QNIKRFDAYPKINSNNRVKHWIGGLLSIVCIITMIWMFSSELNDYFTIRKKPVLRVDESK 61
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
KLPI+ DI P +C + ++D +D++GE + + NI K RL+L + E +
Sbjct: 62 NKKLPINFDITFPHSACSFTSVDVLDTTGEVIIDISKNIKKERLNL----VNEDE----- 112
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
KKK T GT +C C E + KCC TC E+ E+Y+ +P+
Sbjct: 113 ISKKKFAKTVYGT---------ECPPC-NNEIDKDKCCFTCEELTESYQKLNKEVPKGSP 162
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+ KN + N EGC+I G + VNR SG+FHIAPG S + H+H + + S
Sbjct: 163 QCEIKNIHKMTTFYN--GEGCRISGTVFVNRASGNFHIAPGSSQQLTQEHIHSVD-WISG 219
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
N TH LSFG PLDG V SM+ Y+++++P Y LD
Sbjct: 220 GINLTHTWNFLSFGDSFPG---MINPLDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNKVI 276
Query: 302 -------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L + G+PG+F Y++S + V E+ S GHL T I I G
Sbjct: 277 NTNGYSVTEHYRPGSLKSPEQGIPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGV 336
Query: 349 YITFMLVDALL 359
+ F L+D +
Sbjct: 337 FALFSLLDYFI 347
>gi|449549110|gb|EMD40076.1| hypothetical protein CERSUDRAFT_132878 [Ceriporiopsis subvermispora
B]
Length = 1001
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 175/398 (43%), Gaps = 58/398 (14%)
Query: 18 EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
ED KT G +TI+ I ++ DY +V+ + VD SRG KL + +++ P
Sbjct: 598 EDVKVKTRTGALLTILSAAIILAFTTIEFFDYRRVNVDTSIQVDKSRGEKLTVKMNVTFP 657
Query: 78 TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVNAVKKKKVTTENG 136
+ C L+LD +D SGE + HNI K RL G P+ E+ N + K + G
Sbjct: 658 RVPCYLLSLDVMDISGETQTDISHNIIKTRLTEKGLPVPNAASSELRNDIDKLNEQRQGG 717
Query: 137 TTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEK 196
+ G C CN+C +V++AY + W+ + I QC +E +EK
Sbjct: 718 YCGSCYGGVEPAGGC----------CNSCEDVRQAYVNRGWSFNRPEGIEQCVDEGWSEK 767
Query: 197 LKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLS 256
LK+ EGC I G + VN+V G+ H++PG S+ +++D+ PY N H H
Sbjct: 768 LKDQANEGCNIAGRVRVNKVVGNIHLSPGRSFRSGSQNLYDLVPYLKDDGN-RHDFSHTI 826
Query: 257 FGIKLQDDDE------------RRK------PLDGTVAKAEEGASMFNYYIKIIPTIYER 298
+ DDE RR+ PLDG + + + MF Y++K++ T +
Sbjct: 827 HEFAFEGDDEYDILKAKSGKEMRRRMGIEGNPLDGAIGRTSKQQYMFQYFLKVVSTQFRT 886
Query: 299 LDG--------------SKLGGGDG--------------GMPGIFFSYELSPLMVKITEK 330
LDG L G G+PG FF+YE+SP+++ E
Sbjct: 887 LDGMSVNTNQYSATHFERDLTAGQQEKDQAGLHVAHTSVGIPGAFFNYEISPILISHAES 946
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+S H T + G L+D++L + + K
Sbjct: 947 RQSFAHFLTSTCAIVGGVLTVASLIDSVLFVAGRTLKK 984
>gi|363752862|ref|XP_003646647.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890283|gb|AET39830.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
DBVPG#7215]
Length = 399
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 123/364 (33%), Positives = 173/364 (47%), Gaps = 53/364 (14%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K ED +T GG +++ C + L+ + + V +L +D R
Sbjct: 6 LLSFDAFAKTEEDVRVRTKAGGIISLGCIVVTLLLLFNEWSQFNTVIQRPQLVLDRDRRL 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQEPQKEVVNA 125
K+ ++LD + C L LD +D+SGE L ++ + K RLD G PI+ + EV +
Sbjct: 66 KMDLNLDFEFSNMPCAMLNLDVMDTSGEVQLDLQDAGFTKTRLDHSGTPIRTEKLEVGS- 124
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYRYKK 176
K V +DPN CGSCYG+++ E + CC TC EV+EAY K
Sbjct: 125 --NKAVHLP--------DDPNYCGSCYGSKSQDNNDALPKEQKVCCQTCEEVREAYSEKG 174
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG-LSYSINHVHV 235
WA + I QC E EK+ + EGC++ G ++NR+ G+ H APG + S H
Sbjct: 175 WAFFDGQKIEQCIREGYVEKINSQLHEGCRVKGSAKLNRIQGNIHFAPGRTTNSGKRTHT 234
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG---TVAKAEEGASMFNYYIKI 291
HD+ Y T + N H I LSFG D PLDG + + S F+Y+ KI
Sbjct: 235 HDVSLYDTHSHLNFNHIIHKLSFGSDA--DGALSNPLDGHKNIIQGDDAHFSTFSYFTKI 292
Query: 292 IPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVK 326
+PT YE LDG KL GG D GG+ G+ +E+SPL V
Sbjct: 293 VPTRYEYLDGRKLETTQFSVTTHSRPLKGGKDDDHPNTIHHRGGIAGVTIFFEMSPLKVI 352
Query: 327 ITEK 330
+EK
Sbjct: 353 NSEK 356
>gi|323449476|gb|EGB05364.1| hypothetical protein AURANDRAFT_30967 [Aureococcus anophagefferens]
Length = 368
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 173/350 (49%), Gaps = 36/350 (10%)
Query: 29 AVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDA 88
++T+ W+ +C ++ + +V + + VD S G +L I L+I P ++C + LDA
Sbjct: 22 SLTVGHWVMALLFLC-ELLVFLRVEERDHVVVDRSMGQRLKIGLNITFPALTCAEVHLDA 80
Query: 89 VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
+D +G+ H ++E ++ K+RLD G PI P + A+ ++ E+G T C
Sbjct: 81 MDVAGDYHPYMEQHMTKQRLDGRGSPI--PHR----AIPERANEYEHGPEDTG----AGC 130
Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT-IVQCKNEYSTEKLKNTFT-EGCQ 206
SC+GAET + CCNTC+E+ AY K W+ E+ QC ++ + ++ EGC
Sbjct: 131 QSCFGAETAEQPCCNTCDELLRAYGNKGWSAQEIKKEAPQCVDDTRDDSIRAIKKGEGCN 190
Query: 207 IYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDE 266
+ G+LEVN+V+G+ H+A G S N VH P + FN +H I L+FG + D
Sbjct: 191 LAGWLEVNKVAGNVHVAMGESAIQNGRFVHQFDPTRAPEFNVSHVIHDLAFG---ETYDG 247
Query: 267 RRKPLDGT--VAKAEEGASMFNYYIKIIPTIYERL-DGSKL-----------------GG 306
PL GT + A G +F Y+IK++PTIY D + +
Sbjct: 248 MALPLSGTSRIVDAATGTGLFQYFIKLVPTIYRAAPDAAPVRTVRYSYTQRFRPLHNQPP 307
Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
+PGIF Y+ S MV++T SL H ++ + G VD
Sbjct: 308 PTAMLPGIFLVYDFSAFMVEVTRHRSSLAHFLVRVCAIVGGVSTVVAFVD 357
>gi|224000966|ref|XP_002290155.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973577|gb|EED91907.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 396
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 189/394 (47%), Gaps = 52/394 (13%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+++ + +D + +F +T+ G A+++ LF YLI + F + + + V
Sbjct: 5 YTDYFRSIDTHSPISSEFRIRTLSGAAISLFTLLFTLYLISSEYSYNFSTTFLDHVHVMP 64
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGE-QHLHVE--HNIYKRRLDLDGKPIQEPQ 119
L + DI P I C LA DA D +G+ Q H++ H I+K RL+ DGKPI
Sbjct: 65 QSPDGLEVEFDITFPHIPCALLASDANDPTGQSQSFHIDKKHRIWKHRLNKDGKPI---- 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+K GT T+ D +CGSCYGA E +CCNTC++VK AYR K+W +
Sbjct: 121 -------GRKSRFELGGTLTSSDHDEEECGSCYGAGGEG-ECCNTCDDVKRAYRTKQWHI 172
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP------------GLS 227
++ I QC + ++K+ EGC I+GY+ ++ G+ H AP GL
Sbjct: 173 TDMTKITQCAH---LVRVKDEDGEGCNIHGYVALSTGGGNLHFAPDRQWEKEGDKQNGLM 229
Query: 228 YSINHVHVHDIQPYTSAA---FNTTHHIRHLSFGIKL----QDDDERRKPLDGTVAKAEE 280
+++ I + A FN TH + LSFG + ++ LDG +
Sbjct: 230 IMGGFINLDSIVEMFNDAYEQFNVTHTVNKLSFGPYMPKHVKNSLNLTSQLDGATRTVTD 289
Query: 281 GASMFNYYIKIIPTIYERLDGSKL---------------GGGDGGMPGIFFSYELSPLMV 325
G MF +Y++I+PT+Y L+G+ + G + GMPG+FF YE+S L V
Sbjct: 290 GYGMFQFYLQIVPTVYRFLNGTTIETFQYSVTEHVRHVDPGSNRGMPGVFFFYEVSALHV 349
Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
+ E + H +T + + G + ++D L+
Sbjct: 350 EFEEYRRGWTHFFTGVCAAVGGAFTVMGMLDRLV 383
>gi|67479077|ref|XP_654920.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472012|gb|EAL49533.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449701866|gb|EMD42605.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 354
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 176/371 (47%), Gaps = 41/371 (11%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ +K DA+ K + K GG ++IVC + + ++ ++ DYF + L VD S+
Sbjct: 2 QNIKRFDAYPKINSNNRVKHWIGGLLSIVCIITMIWMFSSELNDYFTIRKKPVLRVDESK 61
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
KLPI+ DI P +C + ++D +D++GE + + NI K RL+L + E +
Sbjct: 62 NKKLPINFDITFPHSACSFSSVDVLDTTGEVIIDISKNIKKERLNL----VNEDE----- 112
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
KKK T GT +C C E++ KCC TC E+ E+Y+ +P+
Sbjct: 113 ISKKKFAKTVYGT---------ECPPC-NNESDKDKCCFTCEELTESYQKLNKEVPKGSP 162
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+ +N + N EGC+I G + VNR SG+FHIAPG S + H+H + + S
Sbjct: 163 QCEIRNIHKMTTFYN--GEGCRISGTVFVNRASGNFHIAPGSSQQLTQEHIHSVD-WISG 219
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
N TH LSFG P+DG V SM+ Y+++++P Y LD
Sbjct: 220 GINLTHTWNFLSFGDSFPG---MINPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNKVI 276
Query: 302 -------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L + G+PG+F Y++S + V E+ S GHL T I I G
Sbjct: 277 HTNGYSVTEHYRPGSLKSPEQGIPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGV 336
Query: 349 YITFMLVDALL 359
+ F L+D +
Sbjct: 337 FALFSLLDYFI 347
>gi|367007030|ref|XP_003688245.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
gi|357526553|emb|CCE65811.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
Length = 407
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 124/406 (30%), Positives = 188/406 (46%), Gaps = 60/406 (14%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF+K ED +T GG +T+ C L YL+ + + +V++ L VD R
Sbjct: 7 KLAKFDAFSKTDEDVRIRTRLGGIITLGCILTAIYLLGGEWAAFNEVTSVPRLVVDKDRS 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
L ++LDI P I CD + LD +D +G L + + +K+ RLD +GK ++ + ++ +
Sbjct: 67 IDLNMNLDISFPFIPCDIINLDIMDDAGGLQLDILDSGFKKTRLDPNGKQLEFREFDLKD 126
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA--------ETETRKCCNTCNEVKEAYRYKK 176
K++ +E G PN CGSCYGA E + CCNTC +V+ AY
Sbjct: 127 --NSKRIVSEKG--------PNYCGSCYGAIDQSHNDEEGAKKVCCNTCEDVRLAYVTAN 176
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + I QC++E +++ EGC++ G ++NRV G+ H APG + H+H
Sbjct: 177 WAFFDGKNIEQCEDEGYVKRINEHLNEGCRVTGKAKINRVKGNIHFAPGKPMQNSKGHLH 236
Query: 237 DIQPY-TSAAFNTTHHIRHLSFG------IKLQDDDERRKPLD--GTVAKAEEGASMFNY 287
D Y S N H I H SFG K + D PLD + F+Y
Sbjct: 237 DTSLYEKSPNMNFKHIIHHFSFGEPIDRKAKSKGADVLTNPLDDYDVQPNIDTHYHQFSY 296
Query: 288 YIKIIPTIYERL---------------DGSKLGGGD----------GGMPGIFFSYELSP 322
Y+K++PT YE L D GG D G+PG+FF +++S
Sbjct: 297 YMKVVPTRYEYLNRMVVETAQFSVTFHDRPLRGGKDEDHPNTIHARNGIPGVFFFFDISS 356
Query: 323 LMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVK 364
+ V E+ + W+ + N I G +VD L + K
Sbjct: 357 IKVINNEQ---ITQTWSGFILNCIITIGGVLAVGSMVDRLSYKAQK 399
>gi|3860008|gb|AAC72954.1| unknown [Homo sapiens]
Length = 198
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 84/187 (44%), Positives = 115/187 (61%), Gaps = 19/187 (10%)
Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
CYGAE E KCCNTC +V+EAYR + WA DTI QC+ E ++K++ EGCQ+YG+
Sbjct: 8 CYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGF 67
Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKP 270
LEVN+V+G+FH APG S+ +HVHVHD+Q + N TH+I+HLSFG +D P
Sbjct: 68 LEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNP 124
Query: 271 LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPGI 314
LD T A + + MF Y++K++PT+Y ++DG L GD G+PG+
Sbjct: 125 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGV 184
Query: 315 FFSYELS 321
F LS
Sbjct: 185 FAHLPLS 191
>gi|190347075|gb|EDK39286.2| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
6260]
Length = 404
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 120/404 (29%), Positives = 190/404 (47%), Gaps = 51/404 (12%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF K ED +T GG +T++C + + YLI + +Y + EL VD
Sbjct: 5 KLLSFDAFAKTVEDARVRTPAGGIITLICVIVVLYLIRNEYSEYTSIINRPELVVDRDIN 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
KL I+LDI P I CD L +D +D SG+ + + + +++ RL DG I++ + +
Sbjct: 65 KKLEINLDISFPDIPCDVLTMDILDVSGDLQVDLLSSGFEKFRLLKDGSEIRDESPVMSS 124
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA---ETETRKCCNTCNEVKEAYRYKKWALPE 181
A + ++ + CGSCYGA + + CCN C V+ AY K W +
Sbjct: 125 AGELEERARGRAPDGS-------CGSCYGALPQDENSDYCCNDCETVRLAYAQKAWGFFD 177
Query: 182 LDTIVQCKNEYSTEKLK---NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ I QC+ E +L N F EGC+I G ++NR+SG+ H APG S++ H HD+
Sbjct: 178 GENIEQCEREGYVARLNEKINNF-EGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDL 236
Query: 239 QPYT--SAAFNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFNYYIKII 292
+ F H I HLSFG + + + PLD + + +++YY+K++
Sbjct: 237 SLFNKYDDKFTFDHVINHLSFGSDPHNIQFFEKQSTHPLDKSSMILKSKDRLYSYYLKVV 296
Query: 293 PTIYERLDGS----------------KLGGGD-----------GGMPGIFFSYELSPLMV 325
T +E L + L GG GG+PG+FF +E+SP+
Sbjct: 297 ATRFEFLTPNTPALETNQFSVISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEISPM-- 354
Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
KI K + W+ + + + ++V ALL V +V
Sbjct: 355 KIINKEQ-YAKTWSGFVLGVISSIAGVLMVGALLDRSVWAAERV 397
>gi|440299607|gb|ELP92159.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba invadens IP1]
Length = 361
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 173/375 (46%), Gaps = 42/375 (11%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ +K DA+ K D + GG ++I+C L + ++ +V DY+ V L VD S+
Sbjct: 2 DTIKRFDAYPKLNYDVRVRYWLGGLLSILCLLTMGWMFYSEVQDYYTVQMRPTLRVDESK 61
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
KLPI+ DI P ISC + +D +D++GE + +E N+ K+RL+ E N
Sbjct: 62 SEKLPINFDITFPRISCSLMTIDVLDTTGEVSIDIESNVNKKRLN------PHSMTESSN 115
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
KV E D N KCC TC+E+KE+Y+ +P
Sbjct: 116 KATAHKVYGIECPACEESVDKN-------------KCCFTCDELKESYKKAGKEVPP--N 160
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
VQC+ + + EGC +YG + VNRVSG+FHIAPG+S H H + S
Sbjct: 161 AVQCQLKNIQKMALALDGEGCHMYGSVFVNRVSGNFHIAPGMSEQQGEGHRHSAEWIGS- 219
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD---- 300
N TH LSFG KP+D SM+ Y+++++P Y LD
Sbjct: 220 -LNLTHTWNSLSFGDNFPG---MIKPMDSIQKVDVTNNSMYQYFVQVVPMTYFGLDKKVV 275
Query: 301 ------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
L + G+PG+F YE+S + V TE++ S GHL T I + G
Sbjct: 276 KTNGYSVTEHYRSGNLKTMEQGVPGVFVLYEISSMEVLYTEETGSFGHLLTGICGIVGGI 335
Query: 349 YITFMLVDALLHSCV 363
+ F L+DA + V
Sbjct: 336 FTIFSLLDAFIFHTV 350
>gi|365982867|ref|XP_003668267.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
gi|343767033|emb|CCD23024.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
Length = 410
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 119/413 (28%), Positives = 182/413 (44%), Gaps = 58/413 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+V L LDAF+K ED +T G ++I C L L+ + Y Q+ T L V
Sbjct: 2 LVNKSTLLSLDAFSKTQEDVRIRTKTGAIISISCILVTVLLLLNEWIQYSQIVTRPTLVV 61
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV--EHNIYKRRLDLDGKPIQEP 118
D R KL ++LDI P++ CD L LD +D +G+ L + + K RLD G
Sbjct: 62 DRERNLKLDLNLDISFPSMPCDILNLDILDDAGDLQLDILNQGQFTKTRLDRMG------ 115
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----------ETETRKCCNTCNEV 168
N ++ K ++ D N CG CYG+ + + CC TC +V
Sbjct: 116 -----NVIEVSKFKIDDDVAEFPPNDENYCGPCYGSIDQSGNDKIESVKDKICCQTCEQV 170
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
+EAY WA + I QC+ E K+ EGC++ G + +NR+ G+ H APG ++
Sbjct: 171 REAYLKAGWAFFDGKNIEQCEREGYVTKINKHLNEGCRVKGNVLLNRIQGNIHFAPGKAF 230
Query: 229 SINHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEG 281
H HD Y TS N H I HLSFG ++ + R PLDG
Sbjct: 231 QNVKGHFHDSSLYETSPDLNFNHIIHHLSFGKTIEQLAQLRGATVATSPLDGQQISPSFD 290
Query: 282 ASM--FNYYIKIIPTIYERLD-------------------------GSKLGGGDGGMPGI 314
+ + ++Y++KI+PT YE LD + G+PG+
Sbjct: 291 SHLYRYSYFVKIVPTRYEYLDKMISETAQFSATFHQSLVTGERDPENPNIKYSRTGLPGL 350
Query: 315 FFSYELSPLMVKITEKS-KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
F +E+SPL + TE+ KS ++ + +I G ++D + + +
Sbjct: 351 FIYFEMSPLKIINTEQHFKSWSGVFLHCITSIGGILAVGTILDKFFYKAQRTV 403
>gi|50305633|ref|XP_452777.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641910|emb|CAH01628.1| KLLA0C12947p [Kluyveromyces lactis]
Length = 405
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 119/408 (29%), Positives = 185/408 (45%), Gaps = 62/408 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L +DAF K ED +T GG +T+ C + L+ + + + T +L VD R
Sbjct: 6 LLSIDAFGKTEEDVRVRTRTGGLITVSCIIITMLLLVSEWKQFSTIVTRPDLVVDRDRHL 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL ++LD+ P++ C+ L LD +D SGE +++ + K R+ +GK + + + +V +
Sbjct: 66 KLDLNLDVTFPSMPCNVLNLDILDDSGEFQINLLDSGFTKIRISPEGKELSKEKFQVGDK 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKK 176
K+ E CG CYGA +++ CC TC++V+ AY K
Sbjct: 126 SSKQSFNEEG-----------YCGPCYGALDQSKNDELPQDQKVCCQTCDDVRAAYGQKG 174
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E E + EGC++ G ++NR+ G+ H PG S H H
Sbjct: 175 WAFKDGKGVEQCEREGYVESINARIHEGCRVQGRAQLNRIQGTIHFGPGSSMRNIRGHFH 234
Query: 237 DIQPYTS-AAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGASM--FNY 287
D Y + N H I L+FG K +D D PLD + F+Y
Sbjct: 235 DTSLYDAYPHLNFNHIINTLTFGEKPKDGDSELIGSASISPLDSRQVFPDRDTHFHEFSY 294
Query: 288 YIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSP 322
+ KIIPT +E LDG K+ GG D GG+PG+FF++E+SP
Sbjct: 295 FCKIIPTRFEFLDGKKVETTQFSATYHDRPLRGGRDEDHPNTVHSKGGVPGVFFNFEMSP 354
Query: 323 LMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
L V E+ + W+ + N I G ++D + + K I
Sbjct: 355 LKVINKEQHAT---SWSGFLLNCITSIGGVLAVGTVIDKITYRAQKSI 399
>gi|323306137|gb|EGA59869.1| Erv46p [Saccharomyces cerevisiae FostersB]
Length = 349
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 106/321 (33%), Positives = 156/321 (48%), Gaps = 36/321 (11%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWXQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ CD + LD +D SGE L + + RL+ +G+P+ + + V
Sbjct: 66 KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125
Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
NG T + DPN CG CYGA+ +++ CC C+ V+ AY
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + I QC+ E K+ EGC+I G ++NR+ G+ H APG Y + H
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
HD Y ++ N H I HLSFG +Q +D+R PLDG +
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296
Query: 282 ASM--FNYYIKIIPTIYERLD 300
F+Y+ KI+PT YE LD
Sbjct: 297 THFHQFSYFAKIVPTRYEYLD 317
>gi|225680824|gb|EEH19108.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb03]
Length = 413
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 128/440 (29%), Positives = 180/440 (40%), Gaps = 101/440 (22%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ED +T GG VTIV IS+LI + +Y ++ EL V
Sbjct: 1 MAPKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D R D +D SGE V H I K RL P+
Sbjct: 61 DKGR----------------------DVMDVSGEMQSGVIHGISKVRL--------APES 90
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
E + + + T + DP+ CG CYGA + CC+TC EV+EAY +
Sbjct: 91 EGGHVIDTTALVLHTQTDAAKHLDPDYCGPCYGAPPPSHATKPGCCSTCEEVREAYASQS 150
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E ++ L EGC+I G L VN+V G+FHIAPG S+S ++H H
Sbjct: 151 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAH 210
Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
D+ Y T + +H I L FG +L D D PLD T + F
Sbjct: 211 DLDTYYHTPVPHHMSHKIHQLRFGPQLSDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 270
Query: 286 NYYIKIIPTIYERLDGS------------------------------------------K 303
Y++K++ T Y L S
Sbjct: 271 MYFVKVVSTSYLPLGWSPEFSSSVHETTLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRS 330
Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
+ GGD GG+PG+F +Y++SP+ V E ++K+ T + I GT
Sbjct: 331 IDGGDDAAEGHKERLHSHGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 390
Query: 350 ITFMLVDALLHSCVKKISKV 369
VD L+ ++ K+
Sbjct: 391 TVAAAVDRALYEGAARVKKL 410
>gi|150866674|ref|XP_001386342.2| hypothetical protein PICST_85013 [Scheffersomyces stipitis CBS
6054]
gi|149387930|gb|ABN68313.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 407
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 117/402 (29%), Positives = 194/402 (48%), Gaps = 48/402 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF K ED +T GG +T+ C + +LI + DY V T EL VD
Sbjct: 7 KLLTFDAFAKTVEDARIRTTSGGIITLFCIFVVMFLIRNEYSDYTSVITRPELVVDRDIN 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN- 124
L I+LD+ + CD L+LD +D +G+ L + + +++ + +++ ++E+++
Sbjct: 67 KPLDIYLDVSFHNLPCDLLSLDIMDEAGDLQLDILKSGFEKF-----RIVKDSEEEIIDR 121
Query: 125 ---AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWAL 179
+ E E ED +CGSCYGA + +K CCN C VK AY K W
Sbjct: 122 ESTPINADLSIEEMAKGLKEGED-GECGSCYGALPQDKKQYCCNDCETVKLAYAEKLWGF 180
Query: 180 PELDTIVQCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
+ + I QC+NE +++++ EGC+I G +NR+SG+ APG S++ + HVHD
Sbjct: 181 YDGENIEQCENEGYVQRVQSRINGKEGCRIKGNARINRISGTMDFAPGASFTSSGHHVHD 240
Query: 238 IQPYTS-AAFNTTHHIRHLSFGIKLQDDD----ERRKPLDGTVAKAEEGASMFNYYIKII 292
+ Y N H + L+FG + D+ E PLD + +F YY+K++
Sbjct: 241 LSLYDKHPHLNFDHIVNKLTFG-PIPDESVPTAESTHPLDNYGVALNDKNHVFTYYLKVV 299
Query: 293 PTIYERLDGSK-----------------LGGGD----------GGMPGIFFSYELSPLMV 325
T +E L+G+ GG D GG+PG+ F +++SPL +
Sbjct: 300 ATRFEFLNGASKALDANQFSVITHDRPISGGKDNDHQHTLHAKGGIPGVVFHFDISPLKI 359
Query: 326 KITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
E+ +KS ++ +++G I L+D +++ I
Sbjct: 360 INREQYAKSWSGFVLGVVSSVAGVLIVGSLLDRSVYAAESAI 401
>gi|448081831|ref|XP_004194985.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
gi|359376407|emb|CCE86989.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
Length = 405
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 127/395 (32%), Positives = 187/395 (47%), Gaps = 46/395 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED KT GG +T+VC L + LI + +Y V EL VD
Sbjct: 7 KLLSLDAFAKTVEDAKVKTASGGIITLVCVLVVLLLIRNEYSEYTSVVNRPELVVDRDVN 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
KL I++DI P + CD + LD +D SG+ V + K RL I +EV++
Sbjct: 67 RKLDINIDITFPNLPCDLVTLDILDVSGDTQADVLKSGFEKYRL------IPSSNEEVLD 120
Query: 125 --AVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWALP 180
V + ++ E+ E CGSCYGA + + CCN C V+ AY + WA
Sbjct: 121 NAPVLRNDLSLEDIARNPNKEGGGFCGSCYGALPQGDNEYCCNDCETVRLAYAERMWAFY 180
Query: 181 ELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ I QC+NE +L EGC+I G ++NRVSG+ H APG + + H+HD+
Sbjct: 181 DGANIEQCENEGYVTRLNQRIEQKEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDL 240
Query: 239 QPYTS--AAFNTTHHIRHLSFGIKLQDDDERRK---PLDGTVAKAEEGASMFNYYIKIIP 293
Y FN H I HLSFG+ +D + PLDG + + + +YY+K++
Sbjct: 241 SLYEKHFDKFNFDHVINHLSFGLDPVKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVA 300
Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
T +E L G + GG D GG+PG+FF +++SP+ KI
Sbjct: 301 TRFEFLSGLAMETNQFSAIPHHRPYRGGKDEDHRHTMHAKGGIPGVFFHFDISPM--KII 358
Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
K + W+ + + + + V A+L V
Sbjct: 359 NKEQ-YAKTWSGFVLGVVSSIAGVLTVGAVLDRSV 392
>gi|353242343|emb|CCA73995.1| related to ERV46-component of copii vesicles [Piriformospora indica
DSM 11827]
Length = 420
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 180/416 (43%), Gaps = 69/416 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
K +DAF K ED +T G +T + I L ++ DY V + + +R
Sbjct: 11 FKAIDAFGKTLEDVKIRTRTGAFLTFLSIGIICLLTLIEFIDYRTVYLDTNIEIMKARDE 70
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L ++++I P + C L+LDA D SGE V HNI K RLD +GKP P ++ ++ +
Sbjct: 71 RLTVNMNITFPRVPCFLLSLDATDVSGEHMREVSHNIVKVRLDSEGKPY--PNQDHISDL 128
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + + ++ P CGSCYG CCNTC +V+++Y + WA + I
Sbjct: 129 RNEI------SRVKDIGKPGYCGSCYGGLEPEGGCCNTCEDVRKSYLDRGWAFSAPEHIE 182
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
QC E TEK+K +GCQI G + + +V+ S + G S+ N H ++ PY
Sbjct: 183 QCVREGWTEKIKVQANDGCQISGRVRIKKVASSLIFSFGRSFQANSFHAQELVPYLKDGL 242
Query: 247 --NTTHHIRHLSFGIKLQDDDE---RRK---------------PLDGTVA-----KAEEG 281
+ HHI L F Q DDE RR PL+G + G
Sbjct: 243 IHDFGHHIETLQF----QSDDEYDPRRANEAARLKKHLGVPKDPLNGFNSHYAKYSGRRG 298
Query: 282 AS----MFNYYIKIIPTIYERLD----------------------------GSKLGGGDG 309
MF Y+IK++ +E LD G + G
Sbjct: 299 PDITTYMFQYFIKVVSADFETLDHEHVSSHLYSYSSHTRNVGEAYHLKNTEGIETTHGYD 358
Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
PG+F + ++SP+ V TEK K H T I G LVD+ L + + K
Sbjct: 359 AAPGLFINIDVSPMQVIHTEKRKPFAHFLTTFCAIIGGVLTVASLVDSALFNTINK 414
>gi|50294900|ref|XP_449861.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529175|emb|CAG62841.1| unnamed protein product [Candida glabrata]
Length = 415
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/413 (29%), Positives = 183/413 (44%), Gaps = 62/413 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K ED +T GG +T+ C + L+ + D+ V T EL +D R
Sbjct: 6 LLSFDAFAKTEEDVRIRTRSGGFITLGCLVVTLMLLLSEWRDFNSVVTRPELVIDRDRSL 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQEPQKEVVNA 125
+L ++LDI P++ C+ L LD +D SGE L + + + K RL +GK + ++ A
Sbjct: 66 RLDLNLDITFPSMPCELLTLDIMDDSGEVQLDIMNAGFEKTRLSKEGKVLGTADMKIGEA 125
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----------CCNTCNEVKEAYRYK 175
KK K N CG+CYGA + + CC TC++V++AY K
Sbjct: 126 AKKDK------EAQLAKLGANYCGNCYGARDQGKNNDDTPRDQWVCCQTCDDVRQAYFEK 179
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + I QC+ E +K+ + EGC++ G ++NR+ G+ H A G + H
Sbjct: 180 NWAFFDGKDIEQCEREGYVQKIADQLQEGCRVSGSAQLNRIDGNLHFAAGPGFQNIRGHF 239
Query: 236 HDIQPYTS-AAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASM- 284
HD Y N H I HLSFG ++ + + PLDG A
Sbjct: 240 HDDSLYIQHPNLNFNHIINHLSFGKAVEPTKKGKVMGIEKVTVNPLDGHSMFPPRDAHFL 299
Query: 285 -FNYYIKIIPTIYERLDGSKL----------------GGGD----------GGMPGIFFS 317
++YY KI+PT YE L+ + GG D GG P ++ +
Sbjct: 300 QYSYYAKIVPTRYEGLNKKNMVETAQFSSTFHIRPVGGGSDDDHPNTVHQRGGSPSMWIN 359
Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
+E+SPL V E+ G W+ + N I G ++D L+ + I
Sbjct: 360 FEMSPLKVINREEH---GQSWSGFVLNCITSIGGVLAVGTVLDKALYKAQRTI 409
>gi|448531492|ref|XP_003870264.1| Erv46 protein [Candida orthopsilosis Co 90-125]
gi|380354618|emb|CCG24134.1| Erv46 protein [Candida orthopsilosis]
Length = 411
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 124/404 (30%), Positives = 190/404 (47%), Gaps = 48/404 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED KT GG +T++C L +LI + DY V EL VD
Sbjct: 7 KLISLDAFAKTVEDARIKTASGGIITLICILVALFLIRNEYIDYTTVIARPELVVDRDIN 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDG---KP--IQEPQ 119
+L I+LDI + CD +++D D SG+ L + + K R+ G KP I++ Q
Sbjct: 67 KQLDINLDISFLNLPCDLVSIDLFDESGDLKLDIINSQLEKFRIIKSGHSSKPTEIKDDQ 126
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKW 177
+ + +++ TE E CGSCYGA + +K CCN+C V+ AY W
Sbjct: 127 PPLQREMPLEQIAPGLPDGQTEGE----CGSCYGAVPQDKKQYCCNSCAAVRRAYAEANW 182
Query: 178 ALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
+ + I QC+ E ++L+ EGC++ G ++NRV+G+ APG S + HV
Sbjct: 183 QFYDGENIAQCEEEGYVQRLRQRINDNEGCRVKGTTKINRVAGTMDFAPGASMT-KERHV 241
Query: 236 HDIQPYT--SAAFNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFNYYI 289
HD+ Y FN H I HLSFG D D PLDG + NY++
Sbjct: 242 HDLSLYMKYKDKFNFDHVINHLSFGNNPPDSQLVDTGSISPLDGHKFLQHKKLHSINYFL 301
Query: 290 KIIPTIYERLDGSK---------------LGGGD-----------GGMPGIFFSYELSPL 323
KI+ T +E L+G L GG G+PG+ F++++SPL
Sbjct: 302 KIVATRFESLEGKDKFDTNQFSAITHDRPLAGGKDDDHQHTLHARAGVPGVAFNFDISPL 361
Query: 324 -MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
++ E +K+ ++ +I+G + L+D + + + I
Sbjct: 362 KIINREEYAKTRSGFILGVVSSIAGVLMVGSLMDRSVFAAQQAI 405
>gi|325191973|emb|CCA26442.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 401
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 171/369 (46%), Gaps = 36/369 (9%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK D + K + +F +T G V+I+ + L ++ +Y V E + VDS+
Sbjct: 33 LKRFDVYPKLHTEFKVQTETGAIVSIITAVIALILFLAELREYMSVRMHEHMVVDSTISE 92
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI ++C L A+D +GE + + +I RLD G PI +++
Sbjct: 93 KLRINIDISYLALTCKESYLTAMDVTGELQMDLHRSIGMTRLDAKGNPIN-----TLDSA 147
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCY-GAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K+ E+ N CGSCY + CCNTC+EVKEA+ L + D
Sbjct: 148 KE------------EVLPANYCGSCYETVHPLGKTCCNTCDEVKEAFVANDLRLFDADQK 195
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E + E+ + EGC++ GY+ VNRV+G+FH+ G ++ +H P +
Sbjct: 196 EQCVREMTEEQRQAQAGEGCRLKGYMMVNRVAGNFHVGLGRTFHRKGKLIHQFLPGQESV 255
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS--- 302
FN + + LSFG + + LDGT ++ + Y++KI+PTIY + S
Sbjct: 256 FNASFLLHSLSFGTPYAN---VKNGLDGTQYITKKKGGVMKYFLKIVPTIYSDISSSVHS 312
Query: 303 ------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G G+PG +F +E SP MVKI + H +I + G
Sbjct: 313 YQYSHTKQEKYMNAMGQISGLPGAYFMFEFSPFMVKIDSEQIPFTHFVIRIFAILGGMIS 372
Query: 351 TFMLVDALL 359
VD+++
Sbjct: 373 IAGFVDSVI 381
>gi|312376736|gb|EFR23738.1| hypothetical protein AND_12338 [Anopheles darlingi]
Length = 265
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 89/198 (44%), Positives = 119/198 (60%), Gaps = 22/198 (11%)
Query: 83 YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTEL 142
+++LDA DS+GEQHLH+EH+IYKRRLDL+G I+EP+KE + K+ +TE T++ +
Sbjct: 30 HVSLDAQDSTGEQHLHIEHSIYKRRLDLEGNQIEEPKKEDIQVSTKRVSSTETPVTSSTI 89
Query: 143 EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFT 202
+ C V +AYR +KW P ++ QCKN F
Sbjct: 90 KP-------------------ACGNVIDAYRERKWN-PNVEDFEQCKNSNHGAIEGKAFN 129
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
EGC IYG +EVNRV G FHIAPG S+SI ++HVHD+QPY+S+ FNT+H I LSFG +
Sbjct: 130 EGCHIYGTMEVNRVEGRFHIAPGKSFSIQNIHVHDVQPYSSSRFNTSHRINTLSFGEQF- 188
Query: 263 DDDERRKPLDGTVAKAEE 280
D +PLDG A E
Sbjct: 189 -DFGTTQPLDGLNVVATE 205
>gi|401888400|gb|EJT52358.1| ER to golgi family transport-related protein [Trichosporon asahii
var. asahii CBS 2479]
gi|406696432|gb|EKC99721.1| ER to transport-related protein [Trichosporon asahii var. asahii
CBS 8904]
Length = 378
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 106/375 (28%), Positives = 163/375 (43%), Gaps = 65/375 (17%)
Query: 44 VDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNI 103
++ DY +V+ + VD SRG KL I LDI P + C L+LD +D SGE+ + H++
Sbjct: 2 IEFIDYRRVTLEPTIIVDRSRGEKLEIDLDITFPRVPCFLLSLDVMDISGERQNDITHDM 61
Query: 104 YKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCN 163
K RL G+ ++ V + + DPN CGSCYGA+ CCN
Sbjct: 62 AKHRLSASGEELE---------VTRSGQLKGEAERAAQNRDPNYCGSCYGAQAPESGCCN 112
Query: 164 TCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA 223
+C++V++AY W P TI QC E E + TEGC+I G ++VN+V G+
Sbjct: 113 SCDDVRKAYSESGWQFPNPSTIEQCVEENWAENMAQQNTEGCRIVGQVKVNKVVGNLQFT 172
Query: 224 PGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL---------------QDDDERR 268
G ++ H D+ PY N H H+ + + +DE R
Sbjct: 173 HGNVFTRGHT---DLLPYLRDG-NVHHDFGHIINKFRFTGEMPGQLYHRSQIQKKEDETR 228
Query: 269 K------PLDGTVAKAEEGAS--MFNYYIKIIPTIYERLDGSKLGGGD------------ 308
K PL G + AE S M+ Y++K++ T + L+G +
Sbjct: 229 KELGIHDPLQGVRSHAENDGSNIMYQYFVKVVSTAFVYLNGQNINTNQYSATEYERDLKH 288
Query: 309 -----------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+PG+F +YE+SP+ V TE +S H T + G
Sbjct: 289 GNLPTKDQHGHVTTHYTNAIPGVFINYEISPMKVVHTETRQSFAHFVTSTCAIVGGVLTV 348
Query: 352 FMLVDALLHSCVKKI 366
L+DA + + K++
Sbjct: 349 ASLIDAAIFNSRKRL 363
>gi|146416067|ref|XP_001484003.1| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
6260]
Length = 404
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 120/404 (29%), Positives = 189/404 (46%), Gaps = 51/404 (12%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF K ED +T GG +T++C + + YLI + +Y + EL VD
Sbjct: 5 KLLSFDAFAKTVEDARVRTPAGGIITLICVIVVLYLIRNEYLEYTSIINRPELVVDRDIN 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
KL I+LDI P I CD L +D +D SG+ + + + +++ RL DG I++ + +
Sbjct: 65 KKLEINLDISFPDIPCDVLTMDILDVSGDLQVDLLLSGFEKFRLLKDGLEIRDESPVMSS 124
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---CCNTCNEVKEAYRYKKWALPE 181
A + ++ G L CGSCYGA + CCN C V+ AY K W +
Sbjct: 125 AGELEE--RARGRAPDGL-----CGSCYGALPQDENLDYCCNDCETVRLAYAQKAWGFFD 177
Query: 182 LDTIVQCKNEYSTEKLK---NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ I QC+ E +L N F EGC+I G ++NR+SG+ H APG S++ H HD+
Sbjct: 178 GENIEQCEREGYVARLNEKINNF-EGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDL 236
Query: 239 QPYT--SAAFNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFNYYIKII 292
+ F H I HL FG+ + + + PLD + + +++YY+K++
Sbjct: 237 SLFNKYDDKFTFDHVINHLLFGLDPHNIQFFEKQLTHPLDKSSMILKSKDRLYSYYLKVV 296
Query: 293 PTIYERLDGS----------------KLGGGD-----------GGMPGIFFSYELSPLMV 325
T +E L + L GG GG+PG+FF +E+ P+
Sbjct: 297 ATRFEFLTPNTPALETNQFLVISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEILPM-- 354
Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
KI K + W+ + + + ++V ALL V +V
Sbjct: 355 KIINKEQ-YAKTWSGFVLGVISSIAGVLMVGALLDRSVWAAERV 397
>gi|219111025|ref|XP_002177264.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411799|gb|EEC51727.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 404
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 116/406 (28%), Positives = 193/406 (47%), Gaps = 57/406 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +RLK D + ++F TV G ++IV +F+ YL+ D FQV+ E++ V
Sbjct: 1 MDLKDRLKRFDTHSPVSKEFRVYTVQGAVLSIVTLVFVGYLVTADFFFNFQVTLQEKVHV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQ---HLHVEHNIYKRR---------- 107
++S S + + D+ +P + C L++DA D +G++ HL +H+++K R
Sbjct: 61 NASSPSGIELEFDVSLPDVPCSKLSIDANDPNGQKQSLHLDTDHHVWKHRITLLPNGHRQ 120
Query: 108 -------LDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK 160
L+L + E EV ++ + +N + TE+ CG CYGA E +
Sbjct: 121 LLGERSKLELGSTLLTEKDLEV--KAEELQNAKDNSESRTEM---TPCGDCYGAGEEG-E 174
Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
CC +C +VK AY+ + W+L + + QC+ E + + EGC ++G + ++ G+
Sbjct: 175 CCKSCEDVKRAYKRRGWSLRDTSGVSQCRRESGIAEAEG---EGCNVHGVVALSSGGGNL 231
Query: 221 HIAPGLSYSINH---VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK 277
HIAPG N +++ D + +N +H I L FG +D LDG
Sbjct: 232 HIAPGRDTEANFPGGMNIFDALLQSFHQWNVSHQIHKLRFG---KDYPAGVYQLDGETRT 288
Query: 278 AEEGASMFNYYIKIIPTIYERLDGSKLG---------------GGDGG------MPGIFF 316
+G M+ YY +++PT Y L+G+ + G + G MPGIFF
Sbjct: 289 ITDGYGMYQYYFQVVPTRYTFLNGTTIQTHQYSVTEHLRHVSPGSNRGYSLNSRMPGIFF 348
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM-LVDALLHS 361
YE+SPL V I E + + +C I G +T L+D ++ S
Sbjct: 349 FYEVSPLHVDIMEVYQKGWIAFLTSVCAIVGGVVTIAGLIDHVIFS 394
>gi|156838396|ref|XP_001642904.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
70294]
gi|156113483|gb|EDO15046.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 404
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 177/411 (43%), Gaps = 71/411 (17%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPI 70
D FTK ED +T GG +T+ C F + L+ + ++ V T L +D KL +
Sbjct: 10 DVFTKTEEDVRIRTRVGGIITLCCLSFTAILLFSEWINFNHVITKPNLVIDREHHLKLEL 69
Query: 71 HLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
++DI P I C L LD +D SG L + E K R+ DG+
Sbjct: 70 NIDITFPFIPCQLLNLDIMDDSGNVQLDITESGFTKTRIGSDGQ---------------- 113
Query: 130 KVTTENGTTTTEL-----EDPNKCGSCYGAETETRK----------CCNTCNEVKEAYRY 174
++ T N + +L +D N CGSCYGA +++ CC TC +VK AY
Sbjct: 114 QLGTTNFKVSEDLLEYSPKDKNYCGSCYGARDQSKNDEAESVDKKVCCQTCEDVKNAYSD 173
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
WA + I QC+ E EK+ + EGC+I G +NR+ G+ H APG ++ H
Sbjct: 174 AGWAFFDGKNIEQCEREGYVEKMNDQLNEGCRISGEALLNRIHGNIHFAPGKAFQNRGGH 233
Query: 235 VHDIQPYTS-AAFNTTHHIRHLSFGIKL------QDDDERRKPLDG--TVAKAEEGASMF 285
HD Y N H I HLSFG + +D PLDG + + F
Sbjct: 234 FHDTSFYNDHKNLNFKHMIEHLSFGRPVAQFKSNKDLVAMTSPLDGHQELPSIDAHNHQF 293
Query: 286 NYYIKIIPTIYERL-----------------------DGSKLGGGDGGMPGIFFSYELSP 322
Y+ KI+PT +E L D S G+PG+F YE+SP
Sbjct: 294 IYFAKIVPTRFEYLNKQAQETSQLVVTSHMKPIGDATDYSTTMNSRQGIPGLFIDYEISP 353
Query: 323 LMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKISKV 369
L V E+ + W+ + N I G + D ++H+ + +S +
Sbjct: 354 LKVINREQHAT---TWSGFLLNCITSIGGILAVGTVADKIVHATQRVVSHI 401
>gi|413945824|gb|AFW78473.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
partial [Zea mays]
Length = 284
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 98/287 (34%), Positives = 151/287 (52%), Gaps = 22/287 (7%)
Query: 101 HNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK 160
H+I K RLD G I E +K + K ++ ++G + E CG+CYGAE +
Sbjct: 2 HDIEKIRLDAHGNVI-EARKVSIGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQ 58
Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
CCN+C EV+EAY+ K WAL D I QC E E++K EGC ++G+L+V++V+G+F
Sbjct: 59 CCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNF 118
Query: 221 HIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
H APG + +++ V ++ FN TH I LSFG + PLDG
Sbjct: 119 HFAPGKGFYESNIDVPELS-LLEGGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPA 174
Query: 281 GASMFNYYIKIIPTIYERLDGSKLGGG---------DGGM-----PGIFFSYELSPLMVK 326
+ Y+IK++PTIY + G + DG + PG+FF Y+ SP+ V
Sbjct: 175 SDGTYQYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNVRPKPQPGVFFFYDFSPIKVI 234
Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
TE+S+SL H T + + G + ++D+ ++ K + K+E+G
Sbjct: 235 FTEESRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELG 281
>gi|219110527|ref|XP_002177015.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411550|gb|EEC51478.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 500
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 113/413 (27%), Positives = 188/413 (45%), Gaps = 66/413 (15%)
Query: 7 LKGLD-AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF--QVSTTEELFVDSS 63
+K LD F K ++ +T GG ++V +L I+ L + + T + + VD+S
Sbjct: 76 VKKLDFLFPKVDTEYTVQTDRGGLASLVAYLLIAVLALAETASWLSHNRDTVDHVRVDTS 135
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
G ++ ++L+I P+++CD L +D +D +G+ L++E + KR++D G+ Q E++
Sbjct: 136 LGQRMRVNLNITFPSLACDDLHVDVMDVAGDSQLNIEDTLTKRKMDRTGR---YGQAEIL 192
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP-EL 182
+ + ++ + + CG CYGA+ + CCN C+ + +AY+ K W L
Sbjct: 193 QSNQHEQEQSRKAKLRQDPLPDTYCGPCYGAQPDVDACCNNCDALLDAYKLKGWRTDLVL 252
Query: 183 DTIVQCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
T QC E +K EGC + G++ +NRV+G+FHIA G + H+H P
Sbjct: 253 YTAEQCIREGRDQKKLRPLIQGEGCNLSGFMSLNRVAGNFHIAMGEGLQRDGRHIHVFDP 312
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDER----RKPLDGT--VAKAEEGAS-MFNYYIKIIP 293
S +N +H I HLSFG ++Q + L+G + E G + +F Y+IK++P
Sbjct: 313 EDSEHYNASHVIHHLSFGPEIQGKTKSGNLDSSSLNGVTKMVTPEHGTTGLFQYFIKVVP 372
Query: 294 TIYERLDGSK----------------------------------------LGGG------ 307
T Y G + GGG
Sbjct: 373 TTYLGPGGRRDESGTFETNRYFYTERFRPLMKEYLPEEAVAEDPKQAAVHAGGGHRTHDH 432
Query: 308 ----DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
+ +PG+FF YE+ P V+I S L HL ++M I G + VD
Sbjct: 433 HHVRNSVLPGVFFLYEIYPFAVEIHPVSVPLTHLLIRLMATIGGVFTIVRWVD 485
>gi|322792513|gb|EFZ16471.1| hypothetical protein SINV_10123 [Solenopsis invicta]
Length = 141
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 75/121 (61%), Positives = 93/121 (76%), Gaps = 3/121 (2%)
Query: 160 KCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGS 219
+CCNTC +V EAYR KKWA P+ + QC+N+ S EKLK+ FT+GCQIYGY+EVNRV GS
Sbjct: 11 RCCNTCEDVWEAYRRKKWAPPDPADVKQCQNDKSMEKLKHAFTQGCQIYGYMEVNRVGGS 70
Query: 220 FHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAE 279
FHIAPG+S+S+NHVHVHD+QPYTS+ FN TH IRHLSFG+ + + P+D T A
Sbjct: 71 FHIAPGVSFSVNHVHVHDVQPYTSSHFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAM 127
Query: 280 E 280
E
Sbjct: 128 E 128
>gi|45188262|ref|NP_984485.1| ADR389Cp [Ashbya gossypii ATCC 10895]
gi|44983106|gb|AAS52309.1| ADR389Cp [Ashbya gossypii ATCC 10895]
Length = 392
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/408 (29%), Positives = 183/408 (44%), Gaps = 57/408 (13%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED +T GG +T+ C + L+ + ++V ++ +D R
Sbjct: 5 KLLSLDAFAKTEEDVRVRTRAGGLITLGCVVVTLLLLVSEWRRLWEVEKRPQVVLDRDRQ 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
KL + LDI + C+ L LD +D +GE L++ E K RLD G+ + + + V
Sbjct: 65 QKLELRLDITFSQMPCELLNLDIIDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRV-- 122
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETE---------TRKCCNTCNEVKEAYRYK 175
G T +D + CG CYGA + R CC TC EV+ AY
Sbjct: 123 -----------GETLPSTDDQDYCGPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEM 171
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + QCK E TE+L+ EGC++ G ++NRV G+ H APG S + H
Sbjct: 172 NWATFDGKGFEQCKREGYTERLQEQINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHA 230
Query: 236 HDIQPYTSAAFNTTHHIRH-LSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIP 293
HD Y + +H+ H LSFG ++ + PL+G + G S F+Y+ K++P
Sbjct: 231 HDDSFYKEHPHLSFNHVIHSLSFGPEIAGNP---GPLNGRAMEVPNGHSHFFSYFAKVVP 287
Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
YE L G+ GG D GGM G+ ++E+SPL V
Sbjct: 288 IRYETLAGTITESAEFSVTAHDRPVHGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQR 347
Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
E+ S WT + N + + V +L + +G KT+
Sbjct: 348 EQYAS---TWTAFVLNAITSIGGVLAVGTVLDRVTYHTQRTLMGKKTL 392
>gi|374107698|gb|AEY96606.1| FADR389Cp [Ashbya gossypii FDAG1]
Length = 392
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/408 (29%), Positives = 183/408 (44%), Gaps = 57/408 (13%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED +T GG +T+ C + L+ + ++V ++ +D R
Sbjct: 5 KLLSLDAFAKTEEDVRVRTRAGGLITLGCVVVTLLLLVSEWRRLWEVEKRPQVVLDRDRQ 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
KL + LDI + C+ L LD +D +GE L++ E K RLD G+ + + + V
Sbjct: 65 QKLELRLDITFSQMPCELLNLDIIDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRV-- 122
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETE---------TRKCCNTCNEVKEAYRYK 175
G T +D + CG CYGA + R CC TC EV+ AY
Sbjct: 123 -----------GETLPSTDDQDYCGPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEM 171
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + QCK E TE+L+ EGC++ G ++NRV G+ H APG S + H
Sbjct: 172 NWATFDGKGFEQCKREGYTERLQEQINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHA 230
Query: 236 HDIQPYTSAAFNTTHHIRH-LSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIP 293
HD Y + +H+ H LSFG ++ + PL+G + G S F+Y+ K++P
Sbjct: 231 HDDSFYKEHPHLSFNHVIHSLSFGPEIAGNP---GPLNGRAMEVPNGHSHFFSYFAKVVP 287
Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
YE L G+ GG D GGM G+ ++E+SPL V
Sbjct: 288 IRYETLAGTITESAEFSATAHDRPVHGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQR 347
Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
E+ S WT + N + + V +L + +G KT+
Sbjct: 348 EQYAS---TWTAFVLNAITSIGGVLAVGTVLDRVTYHTQRTLMGKKTL 392
>gi|156030895|ref|XP_001584773.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980]
gi|154700619|gb|EDO00358.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 381
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 120/390 (30%), Positives = 164/390 (42%), Gaps = 102/390 (26%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M R LDAFTK ++ +T GG VTI L + YL + DY +++ EL V
Sbjct: 1 MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWADYRRITVHPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D R D +D SGEQ + V H + K RL Q+
Sbjct: 61 DKGR----------------------DVMDVSGEQQVGVMHGVKKVRLSA--------QE 90
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
E + + N DPN CG CYGA + + CCNTC+EV+EAY
Sbjct: 91 EGGKVIDTTALDLHNADEAATHLDPNYCGPCYGATPPPNAKKQGCCNTCDEVREAYASVS 150
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
WA + + QC+ E+ E+L + EGC+I G L VN+V G+FHIAPG S++ ++HVH
Sbjct: 151 WAFGRGENVEQCEREHYGERLDSQRKEGCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVH 210
Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR--------------KPLDGTVAKA 278
D+ Y +HHI L FG +L ++ ++ PLD T
Sbjct: 211 DLNNYFDTPVPGGHVFSHHIHSLRFGPELPEEVTKKLGSDSIIPWTNHHLNPLDNTEQIT 270
Query: 279 EEGASMFNYYIKIIPTIYERL------------------------DGS------------ 302
E A F Y++K++ T Y L DGS
Sbjct: 271 HEAAYNFMYFVKVVSTSYLPLGWETTYNSPPHDASVDIGTYGHSEDGSIETHQYSVTSHR 330
Query: 303 -KLGGGD-------------GGMPGIFFSY 318
L GGD GG+PG+FFSY
Sbjct: 331 RSLNGGDDSAEGHKEKLHARGGIPGVFFSY 360
>gi|215704311|dbj|BAG93745.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 261
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/257 (36%), Positives = 140/257 (54%), Gaps = 21/257 (8%)
Query: 89 VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
+D SGEQH + H+I KRRLD G I E +KE + K + ++G ++ E+ C
Sbjct: 1 MDISGEQHHDIRHDIEKRRLDAHGNVI-EARKEGIGGAKIESPLQKHGGRLSKGEE--YC 57
Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIY 208
G+CYGAE +CCN+C EV+EAY+ K WAL D I QC E E++K EGC ++
Sbjct: 58 GTCYGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCTREDFVERVKTQQGEGCNVH 117
Query: 209 GYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR 268
G+L+V++V+G+ H APG + ++++V ++ FN TH I LSFG +
Sbjct: 118 GFLDVSKVAGNLHFAPGKGFYESNINVPELSA-LEHGFNITHKINKLSFGTEFPG---VV 173
Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGG---------DGGM-----PGI 314
PLDG + Y+IK++PTIY L G K+ DG + PG+
Sbjct: 174 NPLDGAQWTQPASDGTYQYFIKVVPTIYTDLRGRKIHSNQFSVTEHFRDGNIRPKPQPGV 233
Query: 315 FFSYELSPLMVKITEKS 331
FF Y+ SP+ V E++
Sbjct: 234 FFFYDFSPIKVVTMERN 250
>gi|407852879|gb|EKG06122.1| hypothetical protein TCSYLVIO_002790, partial [Trypanosoma cruzi]
Length = 472
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/411 (27%), Positives = 190/411 (46%), Gaps = 53/411 (12%)
Query: 7 LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
L LD F TK +D ++T GG +++ L I+ L+ +V +F E++VD
Sbjct: 70 LGQLDVFPKFDTKFEQDARQRTAVGGIFSLISLLIIAVLVIGEVRYFFSTVEQHEMYVDP 129
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
G + I ++I P + CD + DA+D+ G VE + K R+ + +
Sbjct: 130 DLGGTMEITVNITFPRVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEARPL 189
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
V+ +KKK+T + E E+ C SCYGAE E CC+TC +V+ AY ++W E
Sbjct: 190 VD--EKKKITKALDPSGAEKEN---CPSCYGAEPEPGACCHTCEDVRRAYSLRRWVFNED 244
Query: 183 D-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D ++ QC E + + EGC ++ +V RV+G+ H PG +++ H+HD +
Sbjct: 245 DISVEQCAEERLRKAATLSSQEGCNLFVNYKVARVTGNIHFVPGRMFNLMGQHLHDFRGK 304
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYYIKIIPTI 295
T N +H + L FG + + P+D G V EE F+Y++K++PT
Sbjct: 305 TVRQLNLSHIVHTLGFGERFPG---QVNPMDGLVNSRGAVDATEEVNGRFSYFVKVVPTQ 361
Query: 296 YERLDGSKLGGGD------------------------------GGMPGIFFSYELSPLMV 325
Y+ S LG G +PG+F +Y+LSP+ V
Sbjct: 362 YQ--SASVLGVGSVVESNQYSVTRHFTPSPSAELSAAAAESSPVVVPGVFITYDLSPIKV 419
Query: 326 KITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
+ EK S+ HL ++ G + LVD+++ V+++ + GK
Sbjct: 420 FVIEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGK 470
>gi|241955457|ref|XP_002420449.1| COPII-coated vesicle complex subunit, putative; ER-derived vesicle
protein, putative [Candida dubliniensis CD36]
gi|223643791|emb|CAX41527.1| COPII-coated vesicle complex subunit, putative [Candida
dubliniensis CD36]
Length = 414
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 128/414 (30%), Positives = 202/414 (48%), Gaps = 54/414 (13%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L DAF K ED KT GG +T++C L LI + DY + T EL V
Sbjct: 1 MSSRPKLLSFDAFAKTVEDARIKTTSGGIITLICILITLVLIRNEYVDYTTIITRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
D +L I+LDI + CD +++D +D +G+ L+ ++ + K RL ++ Q
Sbjct: 61 DRDINKQLDINLDISFINLPCDLISIDLLDVTGDLSLNIIDSGLKKIRL------LKNKQ 114
Query: 120 KEV-VNAVKKKKVTTENGTTTTEL-------EDPNK-CGSCYGAETETRK--CCNTCNEV 168
+V VN ++ + N T+L D N CGSCYGA + +K CCN CN V
Sbjct: 115 GDVIVNEIEDDEPAFNNDIELTDLAKGLPEGSDENAYCGSCYGALPQDKKQFCCNDCNTV 174
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGL 226
+ AY K W+ + + I QC+ E +L+ EGC+I G ++NRVSG+ APG
Sbjct: 175 RRAYAEKHWSFYDGENIEQCEKEGYVARLRERINNNEGCRIKGTTKINRVSGTMDFAPGA 234
Query: 227 SYSINHVHVHDIQPYT--SAAFNTTHHIRHLSFG---IKLQDDD--ERRKPLDGTVAKAE 279
S++ H HD+ YT FN H I HLSFG + Q D + PLD
Sbjct: 235 SFTREGRHFHDLSLYTKYEDKFNFDHIINHLSFGEMPVDGQADQLFDSIHPLDDHQFMLH 294
Query: 280 EGASMFNYYIKIIPTIYE------RLDGSKL----------GGGD----------GGMPG 313
+ A + +YY+K++ T +E R+D ++ GG D GG+PG
Sbjct: 295 KKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVITHDRPLRGGKDEDHQHTLHARGGIPG 354
Query: 314 IFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+ F++++SPL ++ + +K+ ++ +I+G + L+D + + + I
Sbjct: 355 VNFNFDISPLKIINRQQYAKTWSGFVLGVISSIAGVLMVGTLLDRSVFAAQQAI 408
>gi|68483709|ref|XP_714213.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
gi|68483794|ref|XP_714172.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
gi|46435713|gb|EAK95089.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
gi|46435761|gb|EAK95136.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
gi|238882494|gb|EEQ46132.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 414
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 127/414 (30%), Positives = 201/414 (48%), Gaps = 54/414 (13%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M +L DAF K ED KT GG +T++C L LI + DY + T EL V
Sbjct: 1 MSSRPKLLSFDAFAKTVEDARIKTTSGGIITLICILITLVLIRNEYVDYTTIITRPELVV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
D +L I+LDI + CD +++D +D +G+ L+ ++ + K RL ++ Q
Sbjct: 61 DRDINKQLDINLDISFINLPCDLISIDLLDVTGDLSLNIIDSGLKKIRL------LKNKQ 114
Query: 120 KEV-VNAVKKKKVTTENGTTTTEL-------EDPNK-CGSCYGAETETRK--CCNTCNEV 168
+V VN ++ + N ++L D N CGSCYGA + +K CCN CN V
Sbjct: 115 GDVIVNEIEDDEPAFNNDIELSDLAKGLPEGSDENAYCGSCYGALPQDKKQFCCNDCNTV 174
Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGL 226
+ AY K W+ + + I QC+ E +L+ EGC+I G ++NRVSG+ APG
Sbjct: 175 RRAYAEKHWSFYDGENIEQCEKEGYVGRLRERINNNEGCRIKGTTKINRVSGTMDFAPGA 234
Query: 227 SYSINHVHVHDIQPYTS--AAFNTTHHIRHLSFG---IKLQDDD--ERRKPLDGTVAKAE 279
S++ H HD+ YT FN H I HLSFG + Q D+ + PLD
Sbjct: 235 SFTREGRHFHDLSLYTKYPDKFNFDHIINHLSFGEMPVDGQADELFDSIHPLDDHQFMLH 294
Query: 280 EGASMFNYYIKIIPTIYERLDGSK----------------LGGGD----------GGMPG 313
+ A + +YY+K++ T +E LD +GG D GG+PG
Sbjct: 295 KKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVITHDRPLVGGKDEDHQHTLHARGGIPG 354
Query: 314 IFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+ F++++SPL ++ + +K+ ++ +I+G + L+D + + + I
Sbjct: 355 VNFNFDISPLKIINRQQYAKTWSGFVLGVISSIAGVLMVGTLLDRSVFAAQQAI 408
>gi|71407913|ref|XP_806393.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870127|gb|EAN84542.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 406
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/418 (27%), Positives = 188/418 (44%), Gaps = 67/418 (16%)
Query: 7 LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
L LD F TK +D ++T GG +++ L I+ L+ +V +F E++VD
Sbjct: 4 LGQLDVFPKFDTKFEQDARQRTAIGGIFSLLSLLIIAVLVIGEVRYFFSTVEQHEMYVDP 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL-------DGKPI 115
G + I ++I P + CD + DA+D+ G VE + K R+ + +P+
Sbjct: 64 DIGGTMEITVNITFPRVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEARPL 123
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
+ +K++ A+ EN C SCYGAE E CC+TC +V+ AY +
Sbjct: 124 VDEKKKITKALDPSGAEKEN------------CPSCYGAEPEPGACCHTCEDVRRAYSLR 171
Query: 176 KWALPELDTIV-QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
+W E D V QC E + + EGC ++ +V RV+G+ H PG +++ H
Sbjct: 172 RWVFNEDDVSVEQCAEERLRKAAILSSQEGCNLFVNYKVARVTGNIHFVPGRMFNLMGQH 231
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYY 288
+HD + T N +H + L FG + + P+D G V EE F+Y+
Sbjct: 232 LHDFRGKTVRQLNLSHIVHTLGFGERFPG---QVNPMDGLVNLRGAVDATEEVNGRFSYF 288
Query: 289 IKIIPTIYERLDGSKLGGGD------------------------------GGMPGIFFSY 318
+K++PT Y+ S LG G +PG+F +Y
Sbjct: 289 VKVVPTQYQ--SASILGVGSVVESNQYSVTHHFTPSPSAELSAAAAESSPVMVPGVFITY 346
Query: 319 ELSPLMVKITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
+LSP+ V + EK S+ HL ++ G + LVD+++ V+++ + GK
Sbjct: 347 DLSPIKVFVFEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGK 404
>gi|255578837|ref|XP_002530273.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223530205|gb|EEF32113.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 265
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/252 (37%), Positives = 133/252 (52%), Gaps = 21/252 (8%)
Query: 89 VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
+D GEQH ++HNI K+R++ G I E +KE + A K +K +G E C
Sbjct: 1 MDIMGEQHFDIKHNITKKRINAHGDVI-EVRKEGIGAPKIEKPLQRHGGRLEHNE--TYC 57
Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIY 208
GSCYGAE CCN+C+EV+EAYR K WAL +D I QCK E +K+K+ EGC IY
Sbjct: 58 GSCYGAEMSDDDCCNSCDEVREAYRKKGWALTGVDLIDQCKREGFIQKVKDEEGEGCNIY 117
Query: 209 GYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR 268
G LEVN+V+G+FH +PG + + D+ + ++N +H I L+FG
Sbjct: 118 GSLEVNKVAGNFHFSPGKGLHQSSFFIQDLLVFQGDSYNISHTINRLAFGDYFPG---VV 174
Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDGGM---------------PG 313
PLDG E M Y++K++PTIY + G + + PG
Sbjct: 175 NPLDGVPWVHETPNGMHQYFLKVVPTIYTDIRGRTVRSNQYSVTEHFKKSEFARLDSPPG 234
Query: 314 IFFSYELSPLMV 325
+FF Y+ SP+ V
Sbjct: 235 VFFFYDFSPIKV 246
>gi|322693278|gb|EFY85144.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Metarhizium acridum CQMa 102]
Length = 356
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 166/361 (45%), Gaps = 75/361 (20%)
Query: 74 IVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTT 133
+ P + C+ L LD +D SGEQ V H + RL +P E Q V +K KV
Sbjct: 1 MTFPRMPCELLTLDVMDVSGEQQHGVSHGVKNVRL----RP--ESQGGGVIDIKSMKVHD 54
Query: 134 ENGTTTTELEDPNKCGSCYGAET--ETRK--CCNTCNEVKEAYRYKKWALPELDTIVQCK 189
+ E DP+ CG CYGA RK CCNTC+EV+EAY + WA + + QC
Sbjct: 55 D----PAEHLDPSYCGECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGRGENVEQCT 110
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY----TSAA 245
E+ E+L EGC++ G+LEVN+V G+FH+APG S+S ++HVHD++ Y
Sbjct: 111 REHYAERLDEQREEGCRVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPNGKQ 170
Query: 246 FNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEGASMFNYYIKII 292
+ TH I L FG +L R PLDGT + + A + Y++KI+
Sbjct: 171 HDFTHTIHQLRFGPQLPAAVSDRLGKGSMPWTNHHINPLDGTRQETGDPAFNYMYFVKIV 230
Query: 293 PTIYERL-----------------DGS-------------KLGGGD-------------G 309
PT Y L DGS L GG+ G
Sbjct: 231 PTSYLPLGWEKRFKNAAGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGHAERQHSQG 290
Query: 310 GMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
G+PG+FFSY++SP+ ++ E +K+ + + GT VD L ++ K
Sbjct: 291 GIPGVFFSYDISPMKVINREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 350
Query: 369 V 369
+
Sbjct: 351 M 351
>gi|340053482|emb|CCC47775.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 404
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/404 (28%), Positives = 182/404 (45%), Gaps = 53/404 (13%)
Query: 7 LKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
L+ LD F K +D ++TV GG ++ C I+ L+ +V + E++VD
Sbjct: 4 LRCLDVFPKFDVRFEQDARQRTVVGGLLSFACMTAIAVLVVGEVRYFLSTVDQHEMYVDP 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQKE 121
G ++ I L++ P + CD + DA+DS GE V + K R+ D +PI E +
Sbjct: 64 HIGGEMHITLNVTFPRVPCDLMTADAIDSFGEYAKDVIRSTRKMRVHADTLQPISEARGL 123
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
V V+K++ +T + E C SCYGAE CCNTC++V+ A++ K W+ E
Sbjct: 124 V---VEKRQSSTNADSGGAE-----GCPSCYGAEKNPGDCCNTCDDVRNAFKDKGWSFNE 175
Query: 182 LDT-IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
D I QC E ++ EGC IY +RV G+ H PG + H+H ++
Sbjct: 176 DDIGIAQCAEERLRHAESSSSREGCNIYAKFSASRVKGNIHFVPGSMFDYYGQHMHVLKG 235
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYYIKIIPT 294
N +H I L FG + ++ PLD G V K+E F+Y+++++PT
Sbjct: 236 EIIRKMNLSHIIHQLDFGERFPG---QKNPLDGMVNSRGVVDKSESTNGRFSYFVQVVPT 292
Query: 295 IYERLD--------------------------GSKLGGGDGG--MPGIFFSYELSPL--M 324
Y+ + G D +PGIF Y++SP+
Sbjct: 293 QYQHVSIFGTGRLLETNQYSVTHYFTESWNATGRDKSANDAPSVVPGIFILYDISPIKTS 352
Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
VK T S+ HL ++ G + L+D+ L +++ K
Sbjct: 353 VKATHPYPSVVHLVLQLCAVGGGVFNVASLIDSFLFHGTRQVQK 396
>gi|407418919|gb|EKF38246.1| hypothetical protein MOQ_001547 [Trypanosoma cruzi marinkellei]
Length = 406
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/418 (26%), Positives = 188/418 (44%), Gaps = 67/418 (16%)
Query: 7 LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
L LD F TK +D ++T GG +++ I+ L+ +V +F E++VD
Sbjct: 4 LGQLDVFPKFDTKFEQDARQRTAVGGVFSLLSLFIIAVLVIGEVRYFFSTVEQHEMYVDP 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL-------DGKPI 115
G + I ++I P + CD + DA+D+ G VE + K R+ + +P+
Sbjct: 64 DLGGTMEITVNITFPHVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEARPL 123
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
+ +K++ A+ EN C SCYGAE E CC+TC++V+ AY +
Sbjct: 124 VDEKKKITKALDPNGAEKEN------------CPSCYGAEPEPGACCHTCDDVRRAYSLR 171
Query: 176 KWALPELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
+W E D ++ QC E + EGC ++ +V RV+G+ H PG +++ H
Sbjct: 172 RWVFNEDDISVEQCAGERLRKAAILISQEGCNLFVKYKVARVTGNIHFVPGRMFNLMGQH 231
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYY 288
+HD + T N +H + L FG + + P+D G V EE F+Y+
Sbjct: 232 LHDFRGKTVRQLNLSHIVHTLCFGERFPG---QVNPMDGLVNSRGAVDATEEVNGRFSYF 288
Query: 289 IKIIPTIYERLDGSKLGGGDGG------------------------------MPGIFFSY 318
+K++PT Y+ S LG G +PG+F +Y
Sbjct: 289 VKVVPTQYQA--ASILGVGSVVESNQYSVTHHFTASPSAELSTTTPESTPVIVPGVFITY 346
Query: 319 ELSPLMVKITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
+LSP+ V + EK S+ HL ++ G + LVD+++ V+++ + GK
Sbjct: 347 DLSPIKVFVMEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGK 404
>gi|261327856|emb|CBH10834.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 405
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 113/406 (27%), Positives = 191/406 (47%), Gaps = 60/406 (14%)
Query: 7 LKGLDAFTKPYEDF----HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
L LD F K E F ++T GG +++ L I++L+ +V +F E++VD
Sbjct: 4 LSRLDVFPKFDERFERDARQRTALGGVLSMASILIITFLVVGEVRYFFSSVEQHEMYVDP 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQKE 121
G + + ++I P + CD + DA+D+ GE +V + + R++ D P+ E +
Sbjct: 64 HIGGIMHMKVNITFPRVPCDLMTADAIDAFGEHVENVLTDTARVRVNPDTLVPLGEARPL 123
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ KK+ NG + KC SCYGAE+ CC+TC++V+ A+ ++W E
Sbjct: 124 MD---MKKQPADGNGA------EHGKCPSCYGAESNPGDCCHTCDDVRRAFAERQWEFHE 174
Query: 182 LD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
D +IVQC +E + TEGC ++ V RV+G+ H PG ++ H+H +
Sbjct: 175 DDASIVQCVHERLKMAAASASTEGCNLHASFSVPRVTGNIHFIPGRMFNFFGQHLHSFKG 234
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA------KAEEGASMFNYYIKIIPT 294
T N +H + L FG + + P+DG +E F+Y++K++PT
Sbjct: 235 ETIQKLNLSHIVHSLEFGERFPG---QSNPMDGMANVRGATDPSEPLIGRFSYFVKVVPT 291
Query: 295 IYERLDGSKLGGG---------------------DGG-----------MPGIFFSYELSP 322
+Y R++ S +GGG GG +PG+F SY+LSP
Sbjct: 292 VY-RIE-SLVGGGRVVESNQYSVTHHFTPSWETPKGGENNNAKHDPSVVPGVFISYDLSP 349
Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+ V + T S+ HL ++ G Y L+D+L ++++
Sbjct: 350 IRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHSIRRM 395
>gi|66363024|ref|XP_628478.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
and possible N region transmembrane [Cryptosporidium
parvum Iowa II]
gi|46229502|gb|EAK90320.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
and possible N region transmembrane [Cryptosporidium
parvum Iowa II]
Length = 397
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 180/377 (47%), Gaps = 57/377 (15%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
++K +D + K +ED+ K+ ++++ ++ + +L ++ YF+ + + VD++
Sbjct: 33 KVKKIDIYGKIHEDYCVKSTSRSIISLLVYIIVFFLTLNEIFKYFKGEMIDNIGVDNTIN 92
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+KL I LDI P + C+ +++D+VD GE + + + K +DL+G+ ++ + N
Sbjct: 93 NKLDIMLDITFPRLRCEEISVDSVDYVGENQVDSKEYMVKIPIDLNGQEVRNIKYNQQND 152
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA-LPELDT 184
+K +C SCYGAET CCN C+ +K AYR K W+ L +
Sbjct: 153 LKI------------------ECMSCYGAETNEFLCCNDCDSLKTAYRSKGWSYLDIVSK 194
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY-TS 243
QC EK+ GC+I G ++VN+VSG+ H+A G + N HVH+ S
Sbjct: 195 APQC-----IEKV------GCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVS 243
Query: 244 AAFNTTHHIRHLSFGIKLQDDDE---RRKPLDGTVAKAEEGASMFNYYIKIIPTIY---- 296
FNT+H I L FG D+ PL+ +G MF+YY+K+IPT Y
Sbjct: 244 RGFNTSHIIHELRFG-----SDKIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSGN 298
Query: 297 -------------ERLDGSKLGGGD-GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
ER + G+ G+PGIF Y+ P +++ K + HL T
Sbjct: 299 GEVNLYGNQYAFTERERDVHVQNGELSGLPGIFIVYDFQPFLLQKIYKRVPISHLITSFC 358
Query: 343 CNISGTYITFMLVDALL 359
+ G Y L+D +
Sbjct: 359 AIVGGIYSIMSLLDTFV 375
>gi|298706631|emb|CBJ29569.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 453
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 117/413 (28%), Positives = 187/413 (45%), Gaps = 65/413 (15%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK LD +++P +F TV+G VTIV + L ++ + T E LFV+S+
Sbjct: 28 KLKRLDIYSRPKREFQRATVHGAMVTIVLVGAVLVLTWRELVFSMKRETVENLFVNSTIN 87
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVN 124
+ + D+V I C +L+LDA D+ G + H++ + RLD G+ + + +K E+ N
Sbjct: 88 PTVNVTFDVVFARIPCGFLSLDAEDALGIPQEDLRHDVTRTRLDSIGRALDDGEKHEMGN 147
Query: 125 AVK-------KKKVTTENGTTTTELEDPNKCG------------------------SCYG 153
+K +K+ + +L+ ++ G +CYG
Sbjct: 148 TLKAVIAKEEEKQAEADASPGDEDLDSKSRAGDGGDGDVEQRALEDTATTGQEDECNCYG 207
Query: 154 AETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF------TEGCQI 207
A E +CC TC +V++AYR K W L + I C E + NT EGC++
Sbjct: 208 AGAEG-ECCRTCEDVRKAYRRKGWRLNPAE-IPACAGEALSANSANTMESPPVENEGCRL 265
Query: 208 YGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDD 265
G+LEV+R G+FH APG L N + D +FNTTH I L+FG +
Sbjct: 266 AGHLEVSRTEGNFHFAPGHRLHRHANELSFVDRIQVALESFNTTHTINTLTFGDQPPPGH 325
Query: 266 ERRKP------LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL--------------- 304
K L+G ++ +M Y+++++PT+Y RLD +
Sbjct: 326 ASPKHAVASTVLEGHQKTVQDTHAMHQYFLQLVPTVY-RLDNGETVHSNQYSATEHLKHV 384
Query: 305 -GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
G G+PG++F YE+SP+ + EK K T + G Y LV+
Sbjct: 385 HDGTSRGLPGVYFYYEVSPVQALVEEKRKGFLAFLTGACGVVGGVYTILGLVN 437
>gi|403215743|emb|CCK70242.1| hypothetical protein KNAG_0D05030 [Kazachstania naganishii CBS
8797]
Length = 422
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 123/421 (29%), Positives = 187/421 (44%), Gaps = 66/421 (15%)
Query: 9 GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKL 68
LDAF+K E+ +T GG ++++C + L+ + + V+T L +D L
Sbjct: 10 ALDAFSKTEEEARVRTSGGGLISLLCVVSAVVLLWREWAQFRAVTTDPMLVIDRDHELPL 69
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPI----QEPQKEVV 123
+ LDI P + C L LD +D SG L V + + K R+D++G + EP K
Sbjct: 70 KLTLDITFPAMPCALLGLDIMDESGNVQLDVLFDQFTKTRVDVNGNMVGGSASEPYKP-- 127
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYRY 174
N++ K+ G ++ D + CGSCYG++ E R CC TC++V +AY
Sbjct: 128 NSLSGKRA----GAKDLQM-DADYCGSCYGSKNQENNAELPPEQRICCQTCDDVHDAYLE 182
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV- 233
WA + I QC++E ++++ EGC + G +NR+ G+ H APG Y
Sbjct: 183 AGWAFFDGANIEQCESEGYVKRIQEQLHEGCNVKGTALLNRIQGNLHFAPGKPYQQLAAG 242
Query: 234 -------HVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDD-----DERRKPLDGTVAKAEE 280
H HD+ Y + N H I FG Q + +R PL+ TVA E
Sbjct: 243 MPGQGLGHYHDVSLYERNRHMNLNHVINEFRFGEDPQSEIVAQKIQRSAPLEDTVASLEN 302
Query: 281 GA-SMFNYYIKIIPTIYERLDGSK----------------LGG----------GDGGMPG 313
+FNYY ++PT YE L SK +GG G GG PG
Sbjct: 303 PHYYIFNYYTNVVPTRYEFLGASKPLDTAQYSATYHDRPIMGGRDADHPTTLHGRGGTPG 362
Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
++F+ E SPL + E+ W+ ++ N T + V + V K + IG
Sbjct: 363 VYFNLEFSPLKIINRERRP---QQWSTLLLNWITTIGGILAVGTVTDKVVYKAQR-SIGA 418
Query: 374 K 374
K
Sbjct: 419 K 419
>gi|67623967|ref|XP_668266.1| serologically defined breast cancer antigen 84 [Cryptosporidium
hominis TU502]
gi|54659454|gb|EAL38030.1| serologically defined breast cancer antigen 84 [Cryptosporidium
hominis]
Length = 397
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 180/378 (47%), Gaps = 59/378 (15%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
++K +D + K +ED+ K+ ++++ ++ + +L ++ YF+ + + VD++
Sbjct: 33 KVKKIDIYGKIHEDYCVKSTSRSIISLLVYIIVFFLTLNEIFKYFKGEMIDNIGVDNTIN 92
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+KL I LDI P + C+ +++D+VD GE + + + K +DL+G+ ++ + N
Sbjct: 93 NKLDIMLDITFPRLRCEEISVDSVDYVGENQVDSKEYMAKIPIDLNGQEVRNIKYNQQND 152
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA-LPELDT 184
+K +C SCYGAET CCN C+ +K AYR K W+ L +
Sbjct: 153 LKI------------------ECMSCYGAETNEFLCCNDCDSLKTAYRSKGWSYLDIVSK 194
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY-TS 243
QC EK+ GC+I G ++VN+VSG+ H+A G + N HVH+ S
Sbjct: 195 APQC-----IEKV------GCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVS 243
Query: 244 AAFNTTHHIRHLSFGIKLQDDDER----RKPLDGTVAKAEEGASMFNYYIKIIPTIY--- 296
FNT+H I L FG +R PL+ +G MF+YY+K+IPT Y
Sbjct: 244 RGFNTSHIIHELRFG------SDRIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSG 297
Query: 297 --------------ERLDGSKLGGGD-GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
ER + G+ G+PG+F Y+ P +++ K + HL T
Sbjct: 298 NGEVNLYGNQYAFTERERDVHVQNGELSGLPGVFIVYDFQPFLLQKIYKRVPISHLITSF 357
Query: 342 MCNISGTYITFMLVDALL 359
+ G Y L+D +
Sbjct: 358 CAIVGGIYSIMSLLDTFV 375
>gi|72388468|ref|XP_844658.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62360135|gb|AAX80555.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801191|gb|AAZ11099.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 405
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 112/406 (27%), Positives = 190/406 (46%), Gaps = 60/406 (14%)
Query: 7 LKGLDAFTKPYEDF----HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
L LD F K E F ++T GG +++ I++L+ +V +F E++VD
Sbjct: 4 LSRLDVFPKFDERFLRDARQRTALGGVLSMASIFIITFLVVGEVRYFFSSVEQHEMYVDP 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQKE 121
G + + ++I P + CD + DA+D+ GE +V + + R++ D P+ E +
Sbjct: 64 HIGGIMHMKVNITFPRVPCDLMTADAIDAFGEHVENVLTDTARVRVNPDTLVPLGEARPL 123
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ KK+ NG + KC SCYGAE+ CC+TC++V+ A+ ++W E
Sbjct: 124 MD---MKKQPADGNGA------EHGKCPSCYGAESNPGDCCHTCDDVRRAFAERQWEFHE 174
Query: 182 LD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
D +IVQC +E + TEGC ++ V RV+G+ H PG ++ H+H +
Sbjct: 175 DDASIVQCVHERLKMAAASASTEGCNLHASFSVPRVTGNIHFIPGRMFNFFGQHLHSFKG 234
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA------KAEEGASMFNYYIKIIPT 294
T N +H + L FG + + P+DG +E F+Y++K++PT
Sbjct: 235 ETIQKLNLSHIVHSLEFGERFPG---QSNPMDGMANVRGATDPSEPLIGRFSYFVKVVPT 291
Query: 295 IYERLDGSKLGGG---------------------DGG-----------MPGIFFSYELSP 322
+Y R++ S +GGG GG +PG+F SY+LSP
Sbjct: 292 VY-RIE-SLVGGGRVVESNQYSVTHHFTPSWETPKGGENNNAKHDPSVVPGVFISYDLSP 349
Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+ V + T S+ HL ++ G Y L+D+L ++++
Sbjct: 350 IRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHSIRRM 395
>gi|449528843|ref|XP_004171412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Cucumis sativus]
Length = 355
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 89/241 (36%), Positives = 128/241 (53%), Gaps = 19/241 (7%)
Query: 148 CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQI 207
CGSC+GAE CCN+C EV+EAYR K WA+ D I QC+ E +K+K+ EGC I
Sbjct: 115 CGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQDLIDQCQREDFIQKVKDEEGEGCNI 174
Query: 208 YGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDER 267
G LEVN+V+GSFH PG S+ + + + ++ +N +H I L+FG D
Sbjct: 175 EGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSDYNVSHRINRLAFG---NHYDGL 231
Query: 268 RKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG---------------GGDGGMP 312
PLDG + E M Y++K++PTIY+ + G + G +P
Sbjct: 232 VNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVHSNQYSVTEHFKSVEFGSSQSIP 291
Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEI 371
G+FF Y+LSP+ V TE+ H T I I G + ++DA ++ +K+ KVEI
Sbjct: 292 GVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFSVAGIIDAFIYHGQRKMKKKVEI 351
Query: 372 G 372
G
Sbjct: 352 G 352
>gi|330803630|ref|XP_003289807.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
gi|325080118|gb|EGC33688.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
Length = 388
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 178/369 (48%), Gaps = 47/369 (12%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEE--LFVD 61
E++K D + K +D +K+ +GG VT+VC L +YL+ ++ YF E L VD
Sbjct: 34 EKVKLFDFYPKVDDDVPRQKSTFGGVVTVVCLLITAYLLISEI--YFFTFPVREHSLKVD 91
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDS-SGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
+RG++LPI++DI P + C + +D VD G+ + I K RLD G P +
Sbjct: 92 VTRGNRLPINIDIHFPRLVCTDITIDVVDGIDGKPIKDAAYQIVKERLDSKGVPFAKGV- 150
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
A+ KK + T E K S + + KCCN+C++++E YR +
Sbjct: 151 ----ALAGKKGIFSSRCTECEFPKQKKGSSVFFRQ----KCCNSCDDLREYYRLNRIPQN 202
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH-VHVHDIQ 239
D QC E + EGC+IYG L+V ++ G FHI GLS +H H H +
Sbjct: 203 FADDAPQCLIERPIQD-----DEGCRIYGSLQVQKMKGDFHILAGLSADESHDGHAHHVH 257
Query: 240 PYTS------AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
T FN THHI SFG D D PL+G A+ A NYYI+++P
Sbjct: 258 RITKENIGRVTQFNITHHIHKFSFG---DDIDGLINPLEGFGIVAQSLAVQ-NYYIQVVP 313
Query: 294 TIYER-------------LDGSKLGGGDGG--MPGIFFSYELSPLMVKITEKSKSLGHLW 338
IY++ D + + G PGI+F Y++SPLM+++ + SK + L
Sbjct: 314 AIYKKNDYVLETNQYSYTYDYRNVNVFNLGRIFPGIYFKYDMSPLMIEVDQTSKPIVELI 373
Query: 339 TKIMCNISG 347
T I C I G
Sbjct: 374 TSI-CAIGG 381
>gi|298708525|emb|CBJ49158.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 467
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/316 (34%), Positives = 164/316 (51%), Gaps = 32/316 (10%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQV-STTEELFVDSSRG 65
+K LD + + ED +T G AVTI W+ + L +V Y +V + TE + VDSS G
Sbjct: 44 IKQLDVYARVDEDLQVRTEAGAAVTIGFWVLMVVLCVGEVQAYRKVQAPTERVVVDSSMG 103
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++D+ +I C + +DA+D +G+ + ++H ++K+RLD DG I E EV
Sbjct: 104 QKLRINIDMTFHSIPCLDVHVDAMDVAGDNQIDIDHGMWKQRLDPDGSAIGEAFMEVPGE 163
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DT 184
V ++ + ED CGSC+GA+ + CCN C +V +AY K W++ ++ T
Sbjct: 164 V-------DDDPAQSLPED--YCGSCFGAK---KGCCNMCRDVVDAYTAKGWSVQDIRRT 211
Query: 185 IVQC-KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
QC ++ + + N EGC + G++ VN+VSG+FH+A G HVH +
Sbjct: 212 AEQCIRDNHIETPIVN--GEGCNLSGFMSVNKVSGNFHVATGEGVMREGRHVHLYTLEQA 269
Query: 244 AAFNTTHHIRHLSF-----GIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-- 296
FNT+H I LSF G+K D + +D V G F YYIK++PT++
Sbjct: 270 VGFNTSHSINLLSFWEPYPGMKPNPLDRTSRIIDEDV-----GTGAFQYYIKLVPTMHSL 324
Query: 297 ---ERLDGSKLGGGDG 309
GS L G G
Sbjct: 325 SPQSEASGSPLPKGKG 340
>gi|157873507|ref|XP_001685262.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128333|emb|CAJ08503.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 467
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/412 (27%), Positives = 189/412 (45%), Gaps = 54/412 (13%)
Query: 5 ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+LK LD F K F + +TV GG +++V + I +L+ +V + V +E+FV
Sbjct: 63 RQLKRLDVFPKFDRKFEQDARHRTVSGGVLSVVAIVIIIWLLVGEVRYFLSVEEHQEMFV 122
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+ G + + ++I + CD + LDAVD G VE N K+R+D + +
Sbjct: 123 DTKVGGDMQVTVNITFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDAATGQVISAAR 182
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+V+ +KKV T+ + + C SCYGAE CC+TC +V++AY + W L
Sbjct: 183 AMVD---EKKVMTK--AIDADGAEKENCPSCYGAERNPGDCCHTCEDVRQAYARRGWKL- 236
Query: 181 ELD--TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
++D ++ QC + + EGC +Y +R +GS PG Y +HD+
Sbjct: 237 DIDEISVEQCAEDRINMAAAASGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDL 296
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKI 291
T+ + +H + L FG ++ PLDGT A G + F+Y++K+
Sbjct: 297 MGSTTRKLDLSHTVHTLEFGDPFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKL 353
Query: 292 IPTIYERLD----------------------------GSKLGGGDGGMPGIFFSYELSPL 323
+PT Y+R S+ +PG+F +Y+LSP+
Sbjct: 354 VPTTYQRYSLITGLQDVVESNQYSATHHFTPSEAAKAASQAPKKQEIVPGVFMTYDLSPV 413
Query: 324 MVKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIG 372
+ + E+ SL H + +C + G +T LVD+L +KI K+ G
Sbjct: 414 RILVQERHPYPSLAHFVLQ-LCAVCGGVLTVAGLVDSLCFHSARKIRKMCTG 464
>gi|72393511|ref|XP_847556.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175086|gb|AAX69235.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803586|gb|AAZ13490.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|261330829|emb|CBH13814.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 405
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 104/405 (25%), Positives = 182/405 (44%), Gaps = 54/405 (13%)
Query: 7 LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
L LD F T+ +D ++T GG +++ L I++L+ ++ + E++VD
Sbjct: 4 LSRLDVFPKFDTRFEQDARQRTALGGVLSMASILIITFLVVGEIRYFLSTVEQHEMYVDP 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
G + + ++I P + CD + DA+D+ GE +V + K R+D +P +
Sbjct: 64 HIGGIMHMKVNITFPRVPCDLMTADAIDAFGEYVENVVTDTAKVRVD---SSTLKPLGKA 120
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
V KK T T E C +CYGAE +CC+TC++V+ A+ ++W E
Sbjct: 121 RQLVDLKKQPTNGNETGNE-----NCPTCYGAEKNPGECCHTCDDVRRAFAERQWEFHED 175
Query: 183 D-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
D +I QC +E + EGC ++ V RV+G+ H PG ++ H+H +
Sbjct: 176 DVSIAQCAHERLKVAADSASAEGCNLHASFSVPRVTGNIHFVPGRMFNFFGQHLHSFKGE 235
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK------AEEGASMFNYYIKIIPTI 295
T N +H + L FG + + P+DG V +E F Y++K++PT+
Sbjct: 236 TIRKLNLSHIVHALEFGERFPGQN---NPMDGMVNARGVKDPSEPLIGRFTYFVKVVPTL 292
Query: 296 YERLDGSKLGG----------------------GDGG--------MPGIFFSYELSPLMV 325
Y+ + + G G+ +PG+F SY++SP+ V
Sbjct: 293 YQVVSMANTGNLVESNQYSVTHHFTPSWAAPKEGETDNPNSDPLVVPGVFISYDISPIRV 352
Query: 326 KITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+T S+ HL ++ G Y L+D+L +K++ +
Sbjct: 353 SVTRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHGIKRVQE 397
>gi|302823246|ref|XP_002993277.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
gi|302825185|ref|XP_002994225.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
gi|300137936|gb|EFJ04730.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
gi|300138947|gb|EFJ05698.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
Length = 333
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 176/380 (46%), Gaps = 69/380 (18%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
++K ++AF E +KTV G +TIV I L + Y + ++ VD++RG
Sbjct: 4 KMKNINAFAHADEHLTQKTVSGAILTIVGVSIILVLFAYEFKFYLSTNVVHQMSVDTTRG 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
LPIH++I P++ C L++DA+D SG+ + ++ NI+K RL DG + E ++
Sbjct: 64 QNLPIHINITFPSLPCQILSVDAIDMSGKHEVDLDTNIWKLRLHKDGHILGS---EYLSD 120
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ +K+ +N T + + E R NE+ +A +
Sbjct: 121 LVEKEHAHDNLT------------GIFHSHEELRSAVKVVNEINKALQDG---------- 158
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSA 244
EGC+++G L+V RV+G+FHI+ G+S I H +
Sbjct: 159 -----------------EGCRVFGVLDVERVAGNFHISMHGMSLQIFH---------SVK 192
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N +H I LSFG K PLD TV + A F Y+IKI+PT Y L+G KL
Sbjct: 193 EVNVSHIINDLSFGPKYPG---IHNPLDRTVRILRDTAGTFKYFIKIVPTEYRYLNGGKL 249
Query: 305 GGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
D P ++F Y+LSP+ V I E+ +S GHL T+ + GT+
Sbjct: 250 PTNQFSVGEYYLAARDDDISWPAVYFLYDLSPITVLIKEERRSFGHLLTRFCAIVGGTFS 309
Query: 351 TFMLVDALLHSCVKKISKVE 370
++D ++ V+ I++ +
Sbjct: 310 LTGMLDRWIYRLVESITRAK 329
>gi|448086324|ref|XP_004196073.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
gi|359377495|emb|CCE85878.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
Length = 405
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 129/401 (32%), Positives = 191/401 (47%), Gaps = 46/401 (11%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L LDAF K ED KT GG +T+VC L + LI + +Y V EL VD
Sbjct: 7 KLLSLDAFAKTVEDAKVKTASGGIITLVCVLVVLLLIRNEYSEYTSVVNRPELVVDRDVN 66
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
KL I++DI P + CD + LD +D SG+ V + K RL I +EV++
Sbjct: 67 RKLDINIDITFPYLPCDLVTLDILDVSGDTQADVLKSGFEKYRL------IPSSNEEVLD 120
Query: 125 --AVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWALP 180
V + ++ E+ E CGSCYGA + + CCN C V+ AY + WA
Sbjct: 121 NAPVLRNDLSLEDIARNPNKEGGGYCGSCYGALPQGDNEFCCNDCETVRVAYAERMWAFY 180
Query: 181 ELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ I QC+NE +L EGC+I G ++NRVSG+ H APG + + H+HD+
Sbjct: 181 DGANIEQCENEGYVTRLNQRIEQKEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDL 240
Query: 239 QPYTS--AAFNTTHHIRHLSFGIKLQDDDERRK---PLDGTVAKAEEGASMFNYYIKIIP 293
Y F+ H I HLSFG+ +D + PLDG + + + +YY+K++
Sbjct: 241 SLYEKHFDKFSFDHVINHLSFGLDPAKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVA 300
Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
T +E L+GS + GG D GG+PG+FF +++SP+ KI
Sbjct: 301 TRFEFLNGSSMETNQFSAIPHHRPYRGGKDEDHRHTMHAKGGIPGVFFHFDISPM--KII 358
Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
K + W+ + + + + V A+L V KV
Sbjct: 359 NKEQ-YAKTWSGFVLGVISSIAGVLTVGAVLDRSVWAAEKV 398
>gi|146095510|ref|XP_001467598.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020411|ref|XP_003863369.1| hypothetical protein, conserved [Leishmania donovani]
gi|134071963|emb|CAM70660.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501601|emb|CBZ36681.1| hypothetical protein, conserved [Leishmania donovani]
Length = 467
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/412 (27%), Positives = 193/412 (46%), Gaps = 54/412 (13%)
Query: 5 ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+LK LD F K F + +TV GG +++V + I +L+ +V + V +E+FV
Sbjct: 63 RQLKRLDVFPKFDRKFEQDARHRTVSGGVLSVVAIVVIIWLLVGEVRYFLSVEEHQEMFV 122
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+ G + + +++ + CD + LDAVD G VE N K+R+D + +
Sbjct: 123 DTKVGGDMQVTVNVTFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDAATGQVISAAR 182
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+V+ +KKV T+ + + C SCYGAE CC+TC +V++AY + W L
Sbjct: 183 AMVD---EKKVMTK--AIDADGAEKENCPSCYGAERNPGDCCHTCEDVRQAYARRGWKL- 236
Query: 181 ELD--TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
++D ++ QC + + EGC +Y +R +GS PG Y +HD+
Sbjct: 237 DIDEISVEQCAEDRIKMAAAASGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDL 296
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKI 291
T+ + +H + L FG ++ PLDGT A G + F+Y++K+
Sbjct: 297 MGSTTRKLDLSHTVHTLEFGDPFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKL 353
Query: 292 IPTIYER---LDG-------------------------SKLGGGDGGMPGIFFSYELSPL 323
+PT Y+R + G S+ +PG+F +Y+LSP+
Sbjct: 354 VPTTYQRYSLITGLQDAVESNQYSATHHFTPSEAAKAVSQTPKKQEIVPGVFMTYDLSPV 413
Query: 324 MVKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIG 372
+ + E+ SL H + +C + G +T + LVD++ V+KI K+ G
Sbjct: 414 RILVQERHPYPSLVHFVLQ-LCAVCGGVLTVVGLVDSMCFHSVRKIRKMCTG 464
>gi|291001965|ref|XP_002683549.1| predicted protein [Naegleria gruberi]
gi|284097178|gb|EFC50805.1| predicted protein [Naegleria gruberi]
Length = 391
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/404 (28%), Positives = 184/404 (45%), Gaps = 65/404 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ D ++K +KT GG V+I+ + I +L+ + Y ++ + L VD
Sbjct: 9 IRSFDLYSKTDSIATKKTSLGGVVSILALIIIIFLVGSALIRYLSINRRDTLSVDIQVED 68
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
++ I +I P + C L +D+VD+SG+ + V H+I+K +D G+ + +
Sbjct: 69 RVVIFFNISFPDLKCYDLHVDSVDASGDAAIDVAHHIHKVPVDSSGR---------ITHL 119
Query: 127 KKKKVTTENGTTTTELE-DPNK-------CGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
+ K T+ GT + + DP K CG+CY E +CCNTC +V E Y+
Sbjct: 120 ESPKHKTKLGTEMPQDKYDPTKDPHSIMYCGTCY-VEQRRGECCNTCQDVMEVYKRNGLP 178
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV----H 234
P ++ + QC + S GC IYG L+V +V+G+FH PG S+S + H
Sbjct: 179 APRVEDVEQCLFDASKNH------PGCNIYGTLDVQKVNGNFHFLPGRSFSQEYETRVHH 232
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD---GTVAKAEEG------ASMF 285
+H+ P +N+TH I LSFG+++ PLD G + K EE ++F
Sbjct: 233 IHEFNPILVDRYNSTHIIHSLSFGLRIP---HVTYPLDETVGIIPKIEESDAQAPKTALF 289
Query: 286 NYYIKIIPTIY---------------------ERLDGSKLGGGDGGMPGIFFSYELSPLM 324
Y+IK +PT Y D SK+ +PG+FF Y P+
Sbjct: 290 KYFIKAVPTTYIGSSYFSSTINTYQFSFTKHVMPFDSSKM----MMLPGVFFVYNFEPIR 345
Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+ E H +M +G ++ +DALL V K+ K
Sbjct: 346 ITYEENGMPFTHFIVDLMAVCAGIFVVLNYIDALLEGVVHKLRK 389
>gi|449705731|gb|EMD45722.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 272
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 146/286 (51%), Gaps = 32/286 (11%)
Query: 99 VEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET 158
+E N+ K R+ DG + E + + + + + E DP +C SCYGAET
Sbjct: 4 IEQNVTKIRIHHDGSLVTENEMKAIQS-----------KLSIETPDPKECRSCYGAETPE 52
Query: 159 RKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSG 218
+KCC TC++VKEAY+ + W L +L+ + QC+N + K T EGC++ G +N++ G
Sbjct: 53 KKCCFTCDDVKEAYKKRGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRLIGDFLLNKIGG 111
Query: 219 SFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
+FHIAPG S + H H+++ + +H LSFG E K T K
Sbjct: 112 NFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFG-------ENSKKFT-TEKKD 163
Query: 279 EEGASMFNYYIKIIP----------TIYERLDGSKLGGGDG-GMPGIFFSYELSPLMVKI 327
+ SMF YY+ IIP T Y+ + G+G G PG+F Y++SP+++++
Sbjct: 164 TQMNSMFQYYLTIIPIKNNFINGTSTFYDYSIQENIRSGEGEGQPGVFIYYDVSPMVLEV 223
Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
TE + H I + G + TF L DA++ + + KVE+G
Sbjct: 224 TESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELG 269
>gi|385302035|gb|EIF46185.1| erv46p [Dekkera bruxellensis AWRI1499]
Length = 266
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 93/262 (35%), Positives = 133/262 (50%), Gaps = 20/262 (7%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
DAF K ++ KT GG +T++C I L+ + DY + EL VD L
Sbjct: 10 FDAFAKTLDEAKVKTTSGGILTLICSFTIFILLINEYRDYRTLIMRPELVVDRDHDKTLG 69
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNAVKK 128
++LDI P + CD L++D +D +G+ + E N + RLD DGK I + VN K+
Sbjct: 70 LNLDITFPNMPCDLLSMDIMDLTGDVQADILEGNFLRTRLDRDGKEIATDEPFKVN--KE 127
Query: 129 KKVTTENGTTTTELEDPNKCGSCYGA--------ETETRK--CCNTCNEVKEAYRYKKWA 178
V +E T ED CGSCYGA E++ K CCN+C VK AY W
Sbjct: 128 DXVKSELST-----EDSQYCGSCYGAIDQSGNEKESDPTKWVCCNSCEAVKLAYSKAAWK 182
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+ + I QC+ E +++ EGC++ G ++NR+ G+ H APG S ++N HVHD+
Sbjct: 183 FYDGEGIEQCEKEGYVDRINKRLDEGCRVKGTAQLNRIGGNLHFAPGSSITMNDRHVHDL 242
Query: 239 QPYT--SAAFNTTHHIRHLSFG 258
+ FN H I H SFG
Sbjct: 243 SLFDKHQDKFNFDHVINHFSFG 264
>gi|389602486|ref|XP_001567299.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505471|emb|CAM42729.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 541
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 114/413 (27%), Positives = 187/413 (45%), Gaps = 52/413 (12%)
Query: 5 ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+LK LD F K F + +TV GG ++V + I +L+ +V + + E+FV
Sbjct: 137 RQLKRLDVFPKFDRKFEQDARHRTVSGGIFSVVAIVVILWLLVGEVRYFLSIEEHHEMFV 196
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+ G + + +++ + CD + LDAVD G VE N K+R+D + +
Sbjct: 197 DTEVGGDMRVTVNVTFNHVPCDLITLDAVDVFGVFANDVEDNTVKQRIDAATGQVISAAR 256
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
VV+ +KK +T E E+ C SCYGAE CC+TC +V++AY K W L
Sbjct: 257 AVVD--EKKVITKAIDADGVEKEN---CPSCYGAERSPGDCCHTCEDVRQAYAQKGWRLN 311
Query: 181 ELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
D ++ QC + EGC +Y +R +GS PG Y + +HD+
Sbjct: 312 VDDISVEQCAEDRIKMATAAFGKEGCNLYATFAASRATGSLQFIPGRMYQMLGRRMHDLM 371
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKII 292
+ + +H + L FG + ++ PLDGT A G + F+Y++K+I
Sbjct: 372 GSAARKLDLSHTVHTLEFGERFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKVI 428
Query: 293 PTIYERLD------------------------GSKLGGGDGGM----PGIFFSYELSPLM 324
PT Y+R +K M PG+F +Y+LSP+
Sbjct: 429 PTTYQRYSLITGLQDTVESNQYTATHHFTPSAATKAASQTPTMQEIVPGVFMTYDLSPVR 488
Query: 325 VKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIGGK 374
+ E+ S+ H + +C + G +T + LVD++ V+K+ K+ G +
Sbjct: 489 ILAQERHPYPSVIHFVLQ-LCAVCGGVLTVVGLVDSMCFHSVRKVRKMCTGKQ 540
>gi|342183032|emb|CCC92512.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 401
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/408 (26%), Positives = 181/408 (44%), Gaps = 60/408 (14%)
Query: 5 ERLKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+R LD F K +D ++T GG ++I + I+ LI +V + E++V
Sbjct: 2 KRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVTIALLIIGEVRYFLTTVEQHEMYV 61
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQ 119
D G + + ++I P + CD + DA+D+ GE + + K R+D D P+ E
Sbjct: 62 DPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDSDTLAPLGE-A 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ +VN KK D + C SCYGAE CC+TC++V+ A+ ++W
Sbjct: 121 RPLVNMNKKAT------------SDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEF 168
Query: 180 PELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
E D +I+QC E EGC ++ V RV+G+ H PG ++ H+H
Sbjct: 169 HEDDVSIMQCAKERLQMAASTASREGCNLHSSFSVPRVTGNIHFVPGRMFNFFGQHLHSF 228
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV------AKAEEGASMFNYYIKII 292
+ T N +H I L FG + ++ PLDG V +E+ F Y++K++
Sbjct: 229 KGETIQRLNLSHIIHTLEFGERFPG---QKNPLDGMVNTRGVENPSEDLIGRFAYFVKVV 285
Query: 293 PTIYE---------------------------RLDGSKLGGGDGG---MPGIFFSYELSP 322
PT+Y+ D + D +PG+F SY++SP
Sbjct: 286 PTLYQVRTLMSSGRVVESNQYSVTHHFTASWDAADQNNQTNRDANPRVVPGVFVSYDISP 345
Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+ V + T S+ HL ++ G Y L+D++ ++++ +
Sbjct: 346 IRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLIDSMFFHSIRRVQE 393
>gi|342183042|emb|CCC92522.1| unnamed protein product [Trypanosoma congolense IL3000]
gi|343474271|emb|CCD14057.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 401
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/408 (26%), Positives = 181/408 (44%), Gaps = 60/408 (14%)
Query: 5 ERLKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+R LD F K +D ++T GG ++I + I+ LI +V + E++V
Sbjct: 2 KRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVTIALLIIGEVRYFLTTVEQHEMYV 61
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQ 119
D G + + ++I P + CD + DA+D+ GE + + K R+D D P+ E
Sbjct: 62 DPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDSDTLAPLGE-A 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ +VN KK D + C SCYGAE CC+TC++V+ A+ ++W
Sbjct: 121 RPLVNMNKKAT------------SDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEF 168
Query: 180 PELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
E D +I+QC E EGC ++ V RV+G+ H PG ++ H+H
Sbjct: 169 HEDDVSIMQCAKERLQMAASTASREGCNLHSSFSVPRVTGNIHFVPGRMFNFFGQHLHSF 228
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV------AKAEEGASMFNYYIKII 292
+ T N +H I L FG + ++ PLDG V +E+ F Y++K++
Sbjct: 229 KGETIQRLNLSHIIHTLEFGERFPG---QKNPLDGMVNTRGVENPSEDLIGRFAYFVKVV 285
Query: 293 PTIYE---------------------------RLDGSKLGGGDGG---MPGIFFSYELSP 322
PT+Y+ D + D +PG+F SY++SP
Sbjct: 286 PTLYQVKTLMSSGRVVESNQYSVTHHFTASWDAADQNNQTNRDANPRVVPGVFVSYDISP 345
Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+ V + T S+ HL ++ G Y L+D++ ++++ +
Sbjct: 346 IRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLIDSMFFHSIRRVQE 393
>gi|384501765|gb|EIE92256.1| hypothetical protein RO3G_17063 [Rhizopus delemar RA 99-880]
Length = 291
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/266 (33%), Positives = 132/266 (49%), Gaps = 26/266 (9%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ D + K ++F KT G +V +S L+ + + L VD SR
Sbjct: 11 LRQFDGYAKTLDEFRIKTTSGASV-------LSELMTYNTSVW-----KPSLVVDKSRKE 58
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+PI +I P + C L++D +D SGEQ ++ K RLD G ++ +
Sbjct: 59 KMPIDFNITFPNMPCHMLSIDIMDESGEQSSGYSQDVTKIRLDTLGN--------IIESG 110
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAET-ETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K+ LE+ +CGSCYGA+ CC++C +V+EAY + W L I
Sbjct: 111 HTVKLGDHTNDAKKALEEAPECGSCYGAKPLREDGCCHSCQDVREAYVKQGWGLVNTKEI 170
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC E KL+N EGC ++G+L VN+V G+FH APG ++ +HVHD+Q YT A
Sbjct: 171 EQCIREGWLAKLENQSNEGCNVHGHLLVNKVRGNFHFAPGGAFQAGSMHVHDLQEYTQGA 230
Query: 246 -----FNTTHHIRHLSFGIKLQDDDE 266
F+ +H I L FG +D +E
Sbjct: 231 PNGHSFDMSHRIHKLKFGPDTKDQNE 256
>gi|444314203|ref|XP_004177759.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
gi|387510798|emb|CCH58240.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
Length = 406
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/418 (27%), Positives = 170/418 (40%), Gaps = 67/418 (16%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
S L LDAF++ ED +T G +T+ C L+ + + + T EL +D
Sbjct: 3 SSTLLSLDAFSRTEEDVRVRTKTGALITLGCMGITFLLLLNEWLRFGIIETRPELVIDRE 62
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEV 122
R KL + LD+ P + CD + LD +D +GE L + K RLD G
Sbjct: 63 RHLKLDLDLDVTFPNMPCDLINLDLMDDAGEIQLDILSSGFTKTRLDSRG---------- 112
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----------CCNTCNEVKEAY 172
N + + +D CG CYGA ++ CC TC +V++AY
Sbjct: 113 -NELGTFDFDLSKDISEYPPDDDKYCGPCYGALDQSNNKDDMPMDEKVCCQTCADVRQAY 171
Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
WA + I QC+ E +++ + EGC+I G +NR+ G+ H APGL++
Sbjct: 172 LNAGWAFFDGKDIEQCEREGYVQRINDHLNEGCRIQGNARLNRIHGNVHFAPGLAFQNRR 231
Query: 233 VHVHDIQPY---TSAAFNTTHHIRHLSF------GIKLQDDDERRKPLDG--TVAKAEEG 281
H HD Y T FN H I HLSF GI + PLDG + +
Sbjct: 232 GHYHDTSLYDKKTELTFN--HIINHLSFGKHVKPGIGSKFSAASVSPLDGHQMILNDDPH 289
Query: 282 ASMFNYYIKIIPTIYERLDGSKLGGGD-------------------------GGMPGIFF 316
F Y+ KI+PT YE LD + G PG++
Sbjct: 290 NVQFIYFAKIVPTRYEYLDKDVIETAQFSTTTHSKALNNLADDKTTPKPSRRSGTPGLYI 349
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKISKVE 370
+YE+SPL V E+ W + N I G ++D + + + I +
Sbjct: 350 NYEMSPLKVINREQHV---QTWVSFILNCLTSIGGVLAVGTVIDKIFYRAQRTIQSTK 404
>gi|366996541|ref|XP_003678033.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
gi|342303904|emb|CCC71687.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
Length = 409
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 120/408 (29%), Positives = 185/408 (45%), Gaps = 57/408 (13%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L +DAF+K ED +T G +TI C + L+ + Y + + L +D R
Sbjct: 8 LLSIDAFSKTQEDVRIRTKSGAIITICCIVITLILLLNEYIQYTHIVSRPTLVIDRERNL 67
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV--EHNIYKRRLDLDGKPIQEPQKEVVN 124
KL ++LDI P+I CD L LD +D SGE L + E + K R+D +G N
Sbjct: 68 KLELNLDITFPSIPCDLLNLDILDDSGELQLDLLQEGSFTKTRVDSNG-----------N 116
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
A+ K ++ +D N CGSCYGA ++ CC C +V+ AY
Sbjct: 117 ALDSMKFKLDDEVGEYPPQDDNYCGSCYGALDQSNNDNLPKDEKVCCQDCEQVRNAYLTA 176
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
WA + I QC+ E ++ + EGC++ G + +NR+ G+ H APG ++ H
Sbjct: 177 GWAFFDGKKIEQCEREGYVARINSHLNEGCRVKGDVLLNRIHGNIHFAPGRAFQNTKGHF 236
Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGASM--FN 286
HD Y + + N H I HLSFG ++ E R PLDG + + ++
Sbjct: 237 HDTSLYEQTLSLNFNHIINHLSFGKSVEQLAEVRGASVSTSPLDGQQVSPSFDSHLYRYS 296
Query: 287 YYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGIFFSYELS 321
Y+ KI+PT YE LDG S + G G+PG+F +E+S
Sbjct: 297 YFTKIVPTRYEWLDGVVAETAQFSATFHESPVNGAMDPEHPHIRHSRTGLPGVFIYFEMS 356
Query: 322 PLMVKITEKS-KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
PL V E+ KS ++ + ++ G ++D + + + I K
Sbjct: 357 PLKVINQEQHFKSWSGVFLHGITSMGGILAVGTVLDKIFYRAQRTIQK 404
>gi|401426616|ref|XP_003877792.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494038|emb|CBZ29334.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 406
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 115/412 (27%), Positives = 189/412 (45%), Gaps = 54/412 (13%)
Query: 5 ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+LK LD F K F + +TV GG ++V + I +L+ +V + V +E+FV
Sbjct: 2 RQLKHLDVFPKFDRKFEQDARHRTVSGGVFSVVAVVVIIWLLVGEVRYFLSVEEHQEMFV 61
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D+ G + + +++ + CD + LDAVD G VE N K+R+D + +
Sbjct: 62 DTKVGGDMQVTVNVTFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDTATGQVISAAR 121
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+V+ +KK VT E E+ C SCYGAE CC+TC +V++AY + W L
Sbjct: 122 AIVD--EKKVVTKAIDADGAEKEN---CPSCYGAERHPGDCCHTCEDVRQAYVRRGWKL- 175
Query: 181 ELD--TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
++D ++ QC + EGC +Y +R +GS PG Y +HD+
Sbjct: 176 DIDEISVEQCAEDRIKMATAAFGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDL 235
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKI 291
+ + +H + L FG ++ PLDGT A G + F+Y++K+
Sbjct: 236 MGSATRKLDLSHTVHTLEFGDPFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKL 292
Query: 292 IPTIYER---------------------LDGSKLGGGDGG-------MPGIFFSYELSPL 323
+PT Y+R S+ + +PG+F +Y+LSP+
Sbjct: 293 VPTTYQRYSLITGLQDTVESNQYSATHHFTPSEAAKAESQAPKKQEIVPGVFMTYDLSPV 352
Query: 324 MVKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIG 372
+ + E+ SL H ++ C + G +T + LVD+L V+KI K+ G
Sbjct: 353 RILVQERHPYPSLAHFVLQV-CAVCGGVLTVVGLVDSLCFHSVRKIRKMCTG 403
>gi|328875761|gb|EGG24125.1| DUF1692 family protein [Dictyostelium fasciculatum]
Length = 1172
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/394 (28%), Positives = 177/394 (44%), Gaps = 61/394 (15%)
Query: 5 ERLKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
E+LK D + K E H+ K++YGG T++C + +L+ ++ Y L VD S
Sbjct: 808 EKLKLFDFYPKLDESVHQTKSIYGGIATVICIIVTVFLLTSELYYYTFPIRDHSLRVDVS 867
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
RG+++ I+ D+ P++ C + +++VD +DGKPI++ ++V
Sbjct: 868 RGNRMNINFDVHFPSLICSDIIVESVDG------------------VDGKPIKDAAHQIV 909
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAE-------TETRKCCNTCNEVKEAYRYKK 176
K+ G+ L SC E E RKCCN+C +++ YR K
Sbjct: 910 -----KERLNRRGSPLERLHARAGLFSCTKCELPPKYQLLEKRKCCNSCEDLRTFYRTNK 964
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV--- 233
D QC T T EGC+++G L V ++ G HI G + +H
Sbjct: 965 VPQHLADESPQC-----TIGKPVTEDEGCRVFGILSVQKMKGDIHIIAGRPHEESHDGHS 1019
Query: 234 -HVHDIQPYTSA---AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
HVH + P + FN +HHI SFG QD + PL+G G + YY+
Sbjct: 1020 HHVHKLTPEIAQRIHKFNISHHIHKFSFG---QDVEGLINPLEGFGIVVPMGLGLQTYYL 1076
Query: 290 KIIPTIYER-------------LDGSKLGGGDGG--MPGIFFSYELSPLMVKITEKSKSL 334
+++PTIY++ + + + G PGI+F Y+LSPLM+++ + SK
Sbjct: 1077 QVVPTIYKQNNYILETNQYSYTREYKSINYNNLGYLFPGIYFKYDLSPLMIEVDQSSKPF 1136
Query: 335 GHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
L T I G Y+ F L + V KI K
Sbjct: 1137 SELITSICAIGGGMYVAFGLFYHVTARIVGKIKK 1170
>gi|367004394|ref|XP_003686930.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
gi|357525232|emb|CCE64496.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
Length = 439
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 125/439 (28%), Positives = 198/439 (45%), Gaps = 84/439 (19%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQ---VSTTEELFVD 61
+ L D FTK ED +T GG +T++C + +++L+ + ++FQ V + EL +D
Sbjct: 8 DNLLAYDVFTKVEEDIRIRTRTGGLITLIC-IGVTFLLLI--SEWFQFKKVISKPELVID 64
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-----HNIYKRRLDLDGKPIQ 116
SKL +++D+ P I CD L LD +D SG L ++ N K RL+ G
Sbjct: 65 RDYQSKLELNIDVTFPYIPCDLLNLDILDDSGNVQLDIDLEEASSNFVKTRLNNRG---- 120
Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----------CCNTCN 166
EV+ KK K+T + G E + N CGSCYG++ +T+ CCN+C
Sbjct: 121 ----EVIGKAKKFKITDDLGEYAPE-DKENYCGSCYGSKDQTKNEDIEKITDKVCCNSCE 175
Query: 167 EVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGL 226
+V++AY WA + I QC+ E + + +EGC++ G +N++ G+ H APG
Sbjct: 176 DVRQAYSEAGWAFFDGKNIEQCEREGYVKTINERLSEGCRVKGEALLNKIHGNLHFAPGK 235
Query: 227 SYSINHVHVHDIQPYTS-AAFNTTHHIRHLSFG--------IKLQD---DDERRK--PLD 272
++ H HD + N H I HLSFG QD D R + P+D
Sbjct: 236 AFQNRRGHFHDTSLFNQHKNLNFQHVINHLSFGKPIRQLVTSNFQDTMSDSLRAQTAPID 295
Query: 273 GTVAKAEEGAS--------------MFNYYIKIIPTIYERLDGS--------------KL 304
G A ++ F YY +II T +E L G K+
Sbjct: 296 GHQAFIQDNTGDSDSASTTIAAHDYQFIYYAEIISTRFEYLKGDLEETSQLTVTSHYKKI 355
Query: 305 GGGDG-----------GMPGIFFSYELSPLMVKITEK-SKSLGHLWTKIMCNISGTYITF 352
G +G G+PG++ +E+SPL V E+ S S K + +I G
Sbjct: 356 GYQNGQDYMQGMQSRSGIPGLYIDFEVSPLKVINKEQYSTSWSGYLLKTITSIGGILAVG 415
Query: 353 MLVDALLHSCVKKISKVEI 371
++D ++++ + + I
Sbjct: 416 TVIDKVVYATQTALKQASI 434
>gi|430811512|emb|CCJ31046.1| unnamed protein product [Pneumocystis jirovecii]
Length = 264
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/277 (35%), Positives = 139/277 (50%), Gaps = 23/277 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF+K ED KT GG +TI+ + I L+ + DY +V EL +D +R
Sbjct: 7 FRRFDAFSKTIEDAQIKTTNGGLITIISIIIIFILVSFEWHDYRRVVVLPELTIDRTRSE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I+L++ P I C L+LD +D SGE V HN+ K RLD +G I +N
Sbjct: 67 KLQINLNLTFPKIPCSILSLDIMDVSGELQTDVSHNVVKNRLDKNGIFINSTSINTLNFQ 126
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ KV + CGSCYGA+ CCNTC +V AY W +P T
Sbjct: 127 QPIKVLPS-----------DYCGSCYGAK---EGCCNTCEDVINAYIANNWPIPNKRTFE 172
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG-LSYSINHVHVHDIQPYTSAA 245
QCK+ + + EGC G +EVN+V G+FH APG S +I HVHDI Y + +
Sbjct: 173 QCKDSNNMDGPD----EGCNFVGRIEVNKVIGNFHFAPGHSSQTITGGHVHDIYDYLTDS 228
Query: 246 F--NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
+ +H I LSFG +++ + PLD ++
Sbjct: 229 LPHDFSHMINKLSFGPEIE--GSLQNPLDNVKKDTDD 263
>gi|410078101|ref|XP_003956632.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
gi|372463216|emb|CCF57497.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
Length = 414
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 122/413 (29%), Positives = 184/413 (44%), Gaps = 71/413 (17%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQ--VSTTEELFVD 61
S L +DAF++ +D +T G +TI C + ++ ++ ++ FQ +ST L VD
Sbjct: 3 SSTLLSIDAFSRAQDDIRIRTKSGAIITISC-IAVTVILLINQWLQFQYSISTITNLVVD 61
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE---HNIYKRRLDLD-GKPIQE 117
R KL + DI + C+ + +D +D + ++ + K R+D GKPI
Sbjct: 62 RERNLKLNLDFDITFTNLPCNLINIDILDDASFLQSIIDPDSSSFTKIRIDRSSGKPISS 121
Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET-----------ETRKCCNTCN 166
+ N +K T +D N CG CYGA+ E R CC TC+
Sbjct: 122 SE---FNLNEK--------TYEYPPDDENYCGPCYGAKDQSINDKEGIKKEDRVCCQTCS 170
Query: 167 EVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY-LEVNRVSGSFHIAPG 225
+VK +Y WA + I QC+ E EK+ + EGCQI G + +NRV+G+ H APG
Sbjct: 171 DVKNSYLDAGWAFFDGKNIEQCEREGYIEKINSQLNEGCQIKGSNVLINRVNGNLHFAPG 230
Query: 226 LSYSINHVHVHDIQPYT-SAAFNTTHHIRHLSFGIKLQDDDE------RRKPLDGT--VA 276
+Y + H HD Y N H I H SFG D D PLDGT +
Sbjct: 231 EAYHNPNGHYHDTSFYDLKPQLNFNHIINHFSFGNGAVDRDATHDTTLMNSPLDGTQVLP 290
Query: 277 KAEEGASMFNYYIKIIPTIYERLDGSKL---------------GGGD----------GGM 311
+ + A F Y+ KI+ T YE L+ L GG D GG+
Sbjct: 291 EYDSHAYAFTYFNKIVSTRYEYLERDPLETVQFTSMFHDRQINGGNDIHDEKIKHARGGI 350
Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLH 360
PG+F +++SP+ KI K + + W+ + N I G ++D + +
Sbjct: 351 PGLFIYFDISPM--KIINKEQHTVN-WSTFVLNCITSIGGILAVGTVIDKIFY 400
>gi|413949705|gb|AFW82354.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
partial [Zea mays]
Length = 202
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 111/186 (59%), Gaps = 3/186 (1%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
F RLK LDA+ K EDF+++T+ GG VT+V + + L + YF ST +L VD
Sbjct: 3 AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+SRG +L ++ DI P+I C L++D D SGEQH + H+I KRRL+ G I E +KE
Sbjct: 63 TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ K ++ ++G + E CG+CYGAE +CCN+C EV+EAY+ K WAL
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179
Query: 182 LDTIVQ 187
D I Q
Sbjct: 180 PDLIDQ 185
>gi|123389547|ref|XP_001299739.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121880652|gb|EAX86809.1| hypothetical protein TVAG_100310 [Trichomonas vaginalis G3]
Length = 351
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 167/367 (45%), Gaps = 53/367 (14%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELF-VDSSRGS- 66
LD F K + GA + + + +C+ ++ Y + + E+L V RG+
Sbjct: 3 LDFFPKFIDSAMTHKTACGAFNSILMIACALALCISEIYAYAKPALHEQLVSVSDLRGAL 62
Query: 67 -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+L I + V ++ C L LD D G + + +YK R+D +G PI PQ ++
Sbjct: 63 DQLSISFNFTV-SVPCVLLHLDVFDMMGSGNRPDQKTLYKVRVDQNGNPI--PQTQIA-- 117
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
ED CG CYGAE+ RKCC TC +V AY+ K W + L +
Sbjct: 118 -----------------ED---CGPCYGAESSQRKCCQTCEDVVAAYQEKGWGIGNLSSW 157
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
QC+ E K E CQ YG L VN + G FH+APG++ HVHD P
Sbjct: 158 AQCRAEGVMFDGK----ERCQAYGNLHVNAIEGGFHLAPGINVFSRFGHVHDFSPLVD-T 212
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGT-VAKAEEGASMFNYYIKIIPTIYE------- 297
N TH I H+SFG + + PLD T V + + G + Y +K +PT+ E
Sbjct: 213 LNLTHEIEHISFGAPID-----KSPLDNTRVVQKKPGQIHYRYNLKAVPTVKEVNGKVHR 267
Query: 298 ----RLDGSKLGGGDGGM--PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
++ +++ G PGIFF Y +P+ + T ++ L +++ G+++
Sbjct: 268 FFRFTVNYAEIPVTARGRYGPGIFFVYSFAPVAITSTYDRPNITVLLARLISIFGGSFML 327
Query: 352 FMLVDAL 358
L+D+
Sbjct: 328 ARLIDSF 334
>gi|397564627|gb|EJK44287.1| hypothetical protein THAOC_37187 [Thalassiosira oceanica]
Length = 506
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 152/315 (48%), Gaps = 26/315 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVS--TTEELFVDSSR 64
++ LD F K D +T GG +T ++ + LI + + ++ + E + VD+S
Sbjct: 58 VRKLDFFNKIEVDHIVRTERGGQLTAAGYVIMLILILAEYLTWSGMNGESIEHVVVDTSL 117
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G ++ ++L+I P++ C+ L L+ +D +G+ L V ++K+RLDLDG P P ++
Sbjct: 118 GKRMKVNLNITFPSLHCEDLHLNIIDVAGDSQLEVSDKMFKQRLDLDGTP--RPLAKISA 175
Query: 125 AVKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
K + + E P+ CG CYGA+ + CCNTC++V E Y+ K+W +
Sbjct: 176 EANAKALEDKKRREVVEKSVGPDYCGPCYGAQENAQDCCNTCDDVIERYKKKRWNDNAVQ 235
Query: 184 TIV-QCKNEYS---TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
+ QC E +E + EGC + G+ VNRV+G+FHIA G + H+H
Sbjct: 236 PLAEQCIREGRAGVSEPKRMAGGEGCNLSGHFTVNRVAGNFHIAMGEGVERDGRHIHQFL 295
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDE--------------RRKPLDGTV---AKAEEGA 282
P F H I LSF D E + ++G+V +
Sbjct: 296 PEDRVNFIANHVIHELSFLDDEYGDIEGEGFLNLMSKAGVNGERSMNGSVKTVTEETGTT 355
Query: 283 SMFNYYIKIIPTIYE 297
+F Y+IK++PT Y+
Sbjct: 356 GLFQYFIKVVPTKYK 370
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 311 MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
+PG+FF YE+ P MV+++ HLW +IM + G + +D LH+ K+
Sbjct: 448 LPGVFFVYEIYPFMVEVSRNRVPFMHLWIRIMATVGGVFTMMSWIDGALHARDKR 502
>gi|297830940|ref|XP_002883352.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
lyrata]
gi|297329192|gb|EFH59611.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 176/391 (45%), Gaps = 66/391 (16%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + L+ +DAF + + +KT G V+IV L ++ L ++ Y T ++ V
Sbjct: 1 MGVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH+++ P++ CD L++DA+D SG+ + ++ NI+K RL+ G I +
Sbjct: 61 DLKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIG--TE 118
Query: 121 EVVNAVKK---------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
+ + V+K K E TE E N G AET +K
Sbjct: 119 YISDLVEKGHEHGHSPHKHDGKEEHKNETETEALNILGFDQAAETMIKKV---------- 168
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
K AL + EGC++YG L+V RV+G+FHI+ + +N
Sbjct: 169 ----KQALAD--------------------GEGCRVYGVLDVQRVAGNFHIS---VHGLN 201
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
++V + S N +H I LSFG K PLD T + + F YYIKI
Sbjct: 202 -IYVAQMIFGGSKNVNVSHMIHDLSFGPKYPG---IHNPLDDTNRILHDTSGTFKYYIKI 257
Query: 292 IPTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
+PT Y L L D P ++F Y+LSP+ V I E+ +S HL
Sbjct: 258 VPTEYRYLSKDVLSTNQYSVTEYYTPMTEFDRTWPAVYFLYDLSPITVTIKEERRSFLHL 317
Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISK 368
T++ + GT+ ++D + ++ +K
Sbjct: 318 ITRLCAVLGGTFALTGMLDRWMFRLIESFNK 348
>gi|30686584|ref|NP_188868.2| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|13877821|gb|AAK43988.1|AF370173_1 unknown protein [Arabidopsis thaliana]
gi|51969000|dbj|BAD43192.1| unknown protein [Arabidopsis thaliana]
gi|51970108|dbj|BAD43746.1| unknown protein [Arabidopsis thaliana]
gi|51970556|dbj|BAD43970.1| unknown protein [Arabidopsis thaliana]
gi|51970734|dbj|BAD44059.1| unknown protein [Arabidopsis thaliana]
gi|62319967|dbj|BAD94071.1| hypothetical protein [Arabidopsis thaliana]
gi|332643097|gb|AEE76618.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 354
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 176/391 (45%), Gaps = 66/391 (16%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + L+ +DAF + + +KT G V+IV L ++ L ++ Y T ++ V
Sbjct: 1 MGVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH+++ P++ CD L++DA+D SG+ + ++ NI+K RL+ G I +
Sbjct: 61 DLKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIG--TE 118
Query: 121 EVVNAVKK---------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
+ + V+K K E TE E N G AET +K
Sbjct: 119 YISDLVEKGHEHGHSPHKHDGKEEHKNETETEALNILGFDQAAETMIKKV---------- 168
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
K AL + EGC++YG L+V RV+G+FHI+ + +N
Sbjct: 169 ----KQALAD--------------------GEGCRVYGVLDVQRVAGNFHIS---VHGLN 201
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
++V + S N +H I LSFG K PLD T + + F YYIKI
Sbjct: 202 -IYVAQMIFGGSKNVNVSHMIHDLSFGPKYPG---IHNPLDDTNRILHDTSGTFKYYIKI 257
Query: 292 IPTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
+PT Y L L D P ++F Y+LSP+ V I E+ +S HL
Sbjct: 258 VPTEYRYLSKDVLSTNQYSVTEYFTPMTEFDRTWPAVYFLYDLSPITVTIKEERRSFLHL 317
Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISK 368
T++ + GT+ ++D + ++ +K
Sbjct: 318 ITRLCAVLGGTFALTGMLDRWMFRFIESFNK 348
>gi|168004249|ref|XP_001754824.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693928|gb|EDQ80278.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 170/378 (44%), Gaps = 72/378 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF + E +KT G AV+ + + L ++ Y + T E+ VD RG
Sbjct: 9 IKNLDAFPRAEEHLLQKTSSGAAVSAIGLFIMGVLFFHELRFYLETVTVHEMSVDVKRGE 68
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KLPIH+++ P + C+ L+LDA+D SG+ + ++ NI+K R+ DG + E VN +
Sbjct: 69 KLPIHINMTFPALPCEVLSLDAIDMSGKHEVDLDTNIWKLRIHRDGYVLG---SEFVNDL 125
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + E + +D +K G + + NEVK+A +D
Sbjct: 126 VEGEHRKEE--PKADKKDEHKDG-----DHRKKDPQKVINEVKKA----------IDD-- 166
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
EGCQI+G L+V RV+G+FHI+ +H + Y ++
Sbjct: 167 ---------------GEGCQIFGVLDVERVAGNFHIS-----------MHGLSLYVASKI 200
Query: 247 -------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
N +H I LSFG PLDG+ + + F Y++KI+PT Y L
Sbjct: 201 FEAGYEVNVSHVIHDLSFGPTYPG---HHNPLDGSERILHDTSGTFKYFLKIVPTEYHYL 257
Query: 300 DG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
G + D P ++F Y+LSP++V I E ++ GH T++ +
Sbjct: 258 HGEVMPTNQFSVTEYYQRTKPSDRSYPAVYFVYDLSPIVVTIREHRRNFGHFITRLCAVL 317
Query: 346 SGTYITFMLVDALLHSCV 363
GT+ ++D + +
Sbjct: 318 GGTFAVTGMLDRWMSRII 335
>gi|225446891|ref|XP_002284045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|296086333|emb|CBI31774.3| unnamed protein product [Vitis vinifera]
Length = 351
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 179/384 (46%), Gaps = 57/384 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + +K L AF + E +KT G V+I+ + ++ L ++ Y T ++ V
Sbjct: 1 MGVKQFIKSLHAFPRAEEHLLQKTQSGAVVSIIGLVIMATLFLHELRYYLTTYTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH+++ P++ CD L++DA+D SG+ + ++ NI+K RL+ DG I +
Sbjct: 61 DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNRDGFIIG--TE 118
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ + V+K+ ++ D ++ + + + N +VK+A
Sbjct: 119 YLSDLVEKEHADHKHDHNKDHHGDSDQKLHAHSFDQDAE---NMVKKVKQA--------- 166
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
L N EGC++YG L+V RV+G+FHI S++ +++ Q
Sbjct: 167 ----------------LANG--EGCRVYGVLDVQRVAGNFHI------SVHGLNIFVAQM 202
Query: 241 YTSAAF--NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
A N +H I LSFG K PLDGTV + F YYIKI+PT Y
Sbjct: 203 IFDGAIHVNVSHIIHDLSFGPKYPG---LHNPLDGTVRILRGASGTFKYYIKIVPTEYRY 259
Query: 299 LDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
+ S + D P ++F Y+LSP+ V I E+ +S H T++
Sbjct: 260 ISKEVLPTNQFSVMEYFSPMNEFDRTWPAVYFLYDLSPVTVTIKEERRSFLHFITRLCAV 319
Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
+ GT+ ++D ++ ++ ++K
Sbjct: 320 LGGTFALTGMLDRWMYRFLEMLTK 343
>gi|255637400|gb|ACU19028.1| unknown [Glycine max]
Length = 347
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 179/384 (46%), Gaps = 58/384 (15%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + +K LDAF + + +KT G V+++ + ++ L ++ Y T ++ V
Sbjct: 1 MGMKQVIKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH+++ P++ CD L++DA+D SG+ + ++ NI+K RL+ G I
Sbjct: 61 DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIG---T 117
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
E V+ + +K+ T D NK + +K L
Sbjct: 118 EYVSDLVEKEHTHHK-------HDDNK---------------------NHEHSEQKIHLQ 149
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
LD + + E LKN EGC++YG L+V RV+G+FHI S++ ++++ Q
Sbjct: 150 NLDESTENIIKKVKEALKNG--EGCRVYGVLDVQRVAGNFHI------SVHGLNIYVAQM 201
Query: 241 YTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
A N +H I LSFG K PLD T + + F YYIK++PT Y
Sbjct: 202 IFDGAKNVNVSHFIHDLSFGPKYPG---LHNPLDDTTRILHDTSGTFKYYIKVVPTEYRY 258
Query: 299 LDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
+ S + D P ++F Y+LSP+ V I E+ +S H T++
Sbjct: 259 ISKEVLPTNQFSVSEYYSPINQFDRTWPAVYFLYDLSPITVTIKEERRSFFHFITRLCAV 318
Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
+ GT+ ++D ++ ++ ++K
Sbjct: 319 LGGTFAVTGMLDRWMYRLLETLTK 342
>gi|123449396|ref|XP_001313417.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121895300|gb|EAY00488.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 361
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 170/370 (45%), Gaps = 43/370 (11%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELFVDSSRG 65
++ D F K ++ T+ GG ++++ +F + ++C +V Y T + LFVD+ R
Sbjct: 3 IRKFDVFPKLANEYRIGTISGGILSLIS-VFAAIVLCFYEVAAYLNAPTRQFLFVDTRRP 61
Query: 66 S-------------KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH-NIYKRRLDLD 111
+ +L + + + P C + LD +DS + + +E+ N RLD
Sbjct: 62 TGPDGVTIDQNSQPRLDVKVSVTFPKAPCFLIHLDVIDSVTQLAMPLENINSKFMRLDSQ 121
Query: 112 GKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
GKPI+ ++T TT E KCGSCY A+ R CC +C EV +A
Sbjct: 122 GKPIE-----------ALDLSTLVNTTVQE-----KCGSCYNAKDPKRICCRSCQEVFDA 165
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
YR + P L I QCK EK+ EGC++ + RV+ HIAPG S++
Sbjct: 166 YRDAAFKPPVLTEIEQCKP--VAEKVAKMEGEGCKVDASFKALRVASEMHIAPGYSWNSE 223
Query: 232 HVHVHDIQPYTS--AAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYY 288
HVHD+ +T A+ N TH I +LSF K D PL+ + E GA Y
Sbjct: 224 GWHVHDLSLFTKEFASLNLTHTIHYLSFSEKEGD-----YPLNNLNNVQTENGAWRVVYT 278
Query: 289 IKIIPTIYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
I+ Y ++ G+FF Y++SP+ S+ + HL T+I+ + G
Sbjct: 279 ADILEGNYSA-SKYQMYNPKSFASGLFFKYDVSPISAVTYTDSEPVFHLLTRILTVLGGV 337
Query: 349 YITFMLVDAL 358
L+DA+
Sbjct: 338 LGLCRLIDAI 347
>gi|356547537|ref|XP_003542168.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
compartment protein 3-like [Glycine max]
Length = 351
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 175/384 (45%), Gaps = 54/384 (14%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + +K LDAF + + +KT G V+++ + ++ L ++ Y T ++ V
Sbjct: 1 MGMKQVIKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHKMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH+++ P++ CD L++DA+D SG+ + ++ NI+K RL+ G I +
Sbjct: 61 DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIG--TE 118
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ + V+K+ E+ + N +VKEA
Sbjct: 119 YISDLVEKEHTNQEHDDNKDHDHHHEHSEQKIHLQNLDESTENIIKKVKEA--------- 169
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
LKN EGC++YG L+V RV+G+FHI S++ ++++ Q
Sbjct: 170 ----------------LKN--GEGCRVYGVLDVQRVAGNFHI------SVHGLNIYVAQM 205
Query: 241 YTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
A N +H I LSFG K PLD T + + F YYIK++PT Y
Sbjct: 206 IFDGAKNVNVSHFIHDLSFGPKYPG---LHNPLDDTTRILHDTSGTFKYYIKVVPTEYRY 262
Query: 299 LDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
+ S + D P ++F Y+LSP+ V I E+ +S H T++
Sbjct: 263 ISKEVLPTNQFSVSEYYSPINQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAV 322
Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
+ GT+ ++D ++ ++ ++K
Sbjct: 323 LGGTFAVTGMLDRWMYRLLEALTK 346
>gi|356575088|ref|XP_003555674.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 347
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 176/389 (45%), Gaps = 68/389 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + +K LDAF + + +KT G V+++ + ++ L ++ Y T ++ V
Sbjct: 1 MGMKQVIKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH+++ P++ CD L++DA+D SG+ + ++ NI+K RL+ G I
Sbjct: 61 DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIG---T 117
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
E ++ + +K+ T D NK + +K L
Sbjct: 118 EYISDLVEKEHTHHK-------HDDNK---------------------NHEHSEQKIHLQ 149
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
LD + + E LKN EGC++YG L+V RV+G+FHI+ VH +
Sbjct: 150 NLDESTENIIKKVKEALKNG--EGCRVYGVLDVQRVAGNFHIS-----------VHGLNI 196
Query: 241 YTSAAF-------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
Y + N +H I LSFG K PLD T + + F YYIK++P
Sbjct: 197 YVAQMIFDGAKNVNVSHFIHDLSFGPKYPG---LHNPLDDTTRILHDTSGTFKYYIKVVP 253
Query: 294 TIYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
T Y + S + D P ++F Y+LSP+ V I E+ +S H T
Sbjct: 254 TEYRYISKEVLPTNQFSVSEYYSPINQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFIT 313
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISK 368
++ + GT+ ++D ++ ++ ++K
Sbjct: 314 RLCAVLGGTFAVTGMLDRWMYRLLETLTK 342
>gi|11036454|dbj|BAB17274.1| unnamed protein product [Arabidopsis thaliana]
Length = 333
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 168/372 (45%), Gaps = 66/372 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + L+ +DAF + + +KT G V+IV L ++ L ++ Y T ++ V
Sbjct: 1 MGVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH+++ P++ CD L++DA+D SG+ + ++ NI+K RL+ G I +
Sbjct: 61 DLKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIG--TE 118
Query: 121 EVVNAVKK---------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
+ + V+K K E TE E N G AET +K
Sbjct: 119 YISDLVEKGHEHGHSPHKHDGKEEHKNETETEALNILGFDQAAETMIKKV---------- 168
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
K AL + EGC++YG L+V RV+G+FHI+ + +N
Sbjct: 169 ----KQALAD--------------------GEGCRVYGVLDVQRVAGNFHIS---VHGLN 201
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
++V + S N +H I LSFG K PLD T + + F YYIKI
Sbjct: 202 -IYVAQMIFGGSKNVNVSHMIHDLSFGPKYPG---IHNPLDDTNRILHDTSGTFKYYIKI 257
Query: 292 IPTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
+PT Y L L D P ++F Y+LSP+ V I E+ +S HL
Sbjct: 258 VPTEYRYLSKDVLSTNQYSVTEYFTPMTEFDRTWPAVYFLYDLSPITVTIKEERRSFLHL 317
Query: 338 WTKIMCNISGTY 349
T++ + GT+
Sbjct: 318 ITRLCAVLGGTF 329
>gi|224137484|ref|XP_002322569.1| predicted protein [Populus trichocarpa]
gi|222867199|gb|EEF04330.1| predicted protein [Populus trichocarpa]
Length = 351
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 181/393 (46%), Gaps = 67/393 (17%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + +K LDAF + E +KT G V+++ + ++ L ++ Y T ++ V
Sbjct: 1 MGVKQAIKSLDAFPRAEEHLLQKTQSGALVSVIGLVIMATLFYHELAYYLTTYTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH++I P++ CD L++DA+D SG+ + ++ NI+K RL+ G
Sbjct: 61 DLQRGEILPIHVNITFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHG---HITGT 117
Query: 121 EVVNAVKKKKVTTENGTTTTEL-----EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
E ++ + +K+ N + E+ + G AET +K VK+A
Sbjct: 118 EYLSDLVEKEHEAHNHDHDKDHHKDSHEEQHTHGFDDAAETMIKK-------VKQA---- 166
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
L N EGC++YG L+V RV+G+FHI S++ +++
Sbjct: 167 ---------------------LANG--EGCRVYGVLDVQRVAGNFHI------SVHGLNI 197
Query: 236 HDIQPYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
Q A N +H I LSFG K PLDGT E + +F YYIKI+P
Sbjct: 198 FVAQMIFDGAKHVNVSHIIHDLSFGPKYPG---IHNPLDGTARILRETSGIFKYYIKIVP 254
Query: 294 TIYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
T Y + S + D P ++F Y+LSP+ V I E+ +S H T
Sbjct: 255 TEYRYISKDVLPTNQFSVTEYFSPITDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFIT 314
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
++ + GT+ ++D ++ ++ ++K G
Sbjct: 315 RLCAILGGTFALTGMLDRWMYRLLEALTKPNRG 347
>gi|66813156|ref|XP_640757.1| DUF1692 family protein [Dictyostelium discoideum AX4]
gi|60468793|gb|EAL66793.1| DUF1692 family protein [Dictyostelium discoideum AX4]
Length = 421
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 172/380 (45%), Gaps = 46/380 (12%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
E++K D + K +D K+ +GG T++C L +YL+ ++ Y L VD +
Sbjct: 52 EKVKLFDFYPKVNDDVPRHKSTFGGVATMICILITTYLLVSEIYFYTFPIREHSLKVDIT 111
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDS-SGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
RG++LPI++DI P + C + +D VD G + I K+RLD G+P +
Sbjct: 112 RGNRLPINIDIHFPRLVCTDITIDVVDGIDGNPIKDAAYQIVKQRLDSYGEPFAQGV--- 168
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
A+ KK T E + S + + KCCN+C ++++ YR +
Sbjct: 169 --ALAGKKGIFSRSCTECEFPKSKRVSSVFYKQ----KCCNSCEDLRQYYRLNRIPQNLA 222
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ--- 239
D QC E + EGC+IYG L V ++ G FHI G +H
Sbjct: 223 DDSPQCLIERPVQD-----DEGCRIYGSLSVQKMKGDFHILAGTGIDQSHDGHVHHAHHI 277
Query: 240 PYTSAA----FNTTHHIRHLSFGIKLQDDDERRKPLD--GTVAKAEEGASMFNYYIKIIP 293
P + FN THHI SFG +D + PL+ G VA++ ++ YY++++P
Sbjct: 278 PRENIGRIKHFNITHHIHKFSFG---EDIEGLINPLEDFGIVAQS---LAVQTYYLQVVP 331
Query: 294 TIYERLDGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
IY++ D + PGI+F Y+LSPLM+++ + SK L L
Sbjct: 332 AIYKKNDFVLETNQYSYTYDYRIVNMFNLGQLFPGIYFKYDLSPLMIEVDQTSKPLVELI 391
Query: 339 TKIMCNISGTYITFMLVDAL 358
T I G Y+ LV L
Sbjct: 392 TSICAIGGGMYVVLGLVVRL 411
>gi|123438593|ref|XP_001310077.1| MGC83277 protein [Trichomonas vaginalis G3]
gi|121891831|gb|EAX97147.1| MGC83277 protein, putative [Trichomonas vaginalis G3]
Length = 355
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/322 (30%), Positives = 153/322 (47%), Gaps = 39/322 (12%)
Query: 51 QVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL 110
Q+ +D+ K+ I+ DI++ I C YL +D +D+ E E ++ R D
Sbjct: 45 QLPFINNRIIDTEHLPKMDINFDIMMKHIPCSYLHVDVIDNIKESDESYEGHVRMERFDE 104
Query: 111 DGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKE 170
G PI KK +N + T +DP CG+CYG ++ CCNTC EV++
Sbjct: 105 KGNPIL------------KKSYPKNSSVT---KDPGYCGNCYGQKS---GCCNTCKEVRK 146
Query: 171 AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSI 230
A++ P + I QC +E E+L E C+++G L V+R G+FH+APG SY+I
Sbjct: 147 AFKANNRPPPPIIHIQQCVDEGYKEELIAMKGEACRVHGTLTVHRAPGTFHVAPGESYNI 206
Query: 231 N--HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNY 287
N H H ++ N +H I H S G+ + PLDG T + + G Y
Sbjct: 207 NGEHDHYYEDLGINIDEMNFSHTINHFSIGMPTANS---YYPLDGHTEIQQKTGRMKMIY 263
Query: 288 YIKIIPTIYERLDGSKL-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
+++ +P LDG G PG+FFSY++S L+ ++ ++ SL
Sbjct: 264 FLRAVPI---NLDGRVFSFGASSYQNYRGSNSTKYPGVFFSYDVS-LIGIVSSQNSSLMD 319
Query: 337 LWTKIMCNISGTYITFMLVDAL 358
L T++M + G + +D L
Sbjct: 320 LVTELMSILGGVFAIATFLDML 341
>gi|169603005|ref|XP_001794924.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
gi|111067148|gb|EAT88268.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
Length = 351
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 156/343 (45%), Gaps = 88/343 (25%)
Query: 115 IQEPQKEVVNAVKKKKVTTE-NGTT---TTELE---------DPNKCGSCYGAETETRK- 160
+++ Q V + + K +++ E G+T TT L+ P+ CG CYGA + T
Sbjct: 6 LEQLQMGVTHGINKVRLSPEIEGSTVLSTTALDLHKDEAQHLAPDYCGECYGAPSPTNAI 65
Query: 161 ---CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVS 217
CCNTC+EV++AY W+ + + QC+ E+ E L EGC++ G + VN+V
Sbjct: 66 KAGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDQQRQEGCRLEGSIRVNKVV 125
Query: 218 GSFHIAPGLSYSINHVHVHDIQPYTSAAFNT--THHIRHLSFGIKLQD---DDERRK--- 269
G+FHIAPG S+S ++HVHD++ Y ++ TH I HL FG +L + D ++K
Sbjct: 126 GNFHIAPGKSFSTGNMHVHDLENYFKDEYSHTFTHKIHHLRFGPQLSNAVIADMQKKHQN 185
Query: 270 ------------PLDGTVAKAEEGASMFNYYIKIIPTIY---------------ERLDGS 302
PLD T + E A F Y++K++ T Y + L GS
Sbjct: 186 TGPGGWTSHHINPLDNTEQQTSEKAYNFMYFVKVVSTAYLPLGWEKEAPRLTKHDELLGS 245
Query: 303 KL-----------------------GGGD------------GGMPGIFFSYELSPLMVKI 327
+ GG D GG+PG+FFSY++SP+ V
Sbjct: 246 TIEGNYKGSIETHQYSVTSHKRSLAGGNDEKEGHKERIHAKGGIPGVFFSYDISPMKVIN 305
Query: 328 TE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
E + K+ + I GT VD L+ V KI K+
Sbjct: 306 REVRDKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKKI 348
>gi|322792514|gb|EFZ16472.1| hypothetical protein SINV_10246 [Solenopsis invicta]
Length = 153
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 76/161 (47%), Positives = 101/161 (62%), Gaps = 14/161 (8%)
Query: 5 ERLKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+ L+ LD K E D +T G VTI+ + + L +V Y S +EELFVD+
Sbjct: 2 QMLRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFLSEVNYYLTPSMSEELFVDT 61
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
SRGSKL I+LDI+VP +SCD+ A+D++GEQHLH+EHNI+KRRLDL+GKPI++PQ+
Sbjct: 62 SRGSKLRINLDIIVPAVSCDH----AMDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRTN 117
Query: 123 VNAVKKKKVTTENGT---TTTELEDPNKCGSCYGAETETRK 160
+ K TTE +TTE CG CYGA T+T K
Sbjct: 118 ITDAKAVSKTTEKAVEIGSTTE-----TCGDCYGAATDTMK 153
>gi|449016424|dbj|BAM79826.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 499
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 116/448 (25%), Positives = 186/448 (41%), Gaps = 95/448 (21%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+++ L+ LD + K ED ++V GG + + ++ I L+ + + Q + VD+
Sbjct: 38 WNDWLRKLDVYPKTVEDVRLRSVTGGIIALFSYICIGILVVSEFLRWLQPQLHSNVLVDA 97
Query: 63 SR---GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
+ + L I + + CD +LDA+ ++G Q + + KR LD G+P+ P+
Sbjct: 98 RSILDTEPITVDLGIDLLAVGCDEFSLDALTANGAQLPNSVVELRKRPLDASGQPVIFPR 157
Query: 120 ---------KEVVNAVKKKKVTTENGTTTTELEDP------------------------- 145
E + TE+ T +LE
Sbjct: 158 GAFGRSRLRNERGGVAPAPQALTEDPPNTQQLEGRVSQEVRAQLKQYREEAIAFRDRLAA 217
Query: 146 -NK-----CGSCYGAETETRK-----------CCNTCNEVKEAYRYKKWALPE-LDTIVQ 187
NK CGSCYGA +T + CCNTC+E++ Y + WA + L T Q
Sbjct: 218 LNKTGVAYCGSCYGAVPQTDQVGEANQITSGVCCNTCDEIRVLYEERNWAFDQVLRTAEQ 277
Query: 188 C-KNEYST--EKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYS---INHVHVHDIQPY 241
C + Y T + + GC++ L++ RV+G+FH APG ++ +HVH D Q
Sbjct: 278 CAEKRYLTLLHEAGRVQSGGCRVSARLQLPRVAGNFHFAPGKGHTHRMGHHVHSVDDQ-L 336
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG------ASMFNYYIKIIPTI 295
+N +H IRHL FG ++ PLDG + E+ +M YY K+IPT
Sbjct: 337 LHRTYNFSHRIRHLRFGPLF---PHQQNPLDGAMRILEQPPPGSPFGNMVLYYCKLIPTT 393
Query: 296 YER-------LDGSKLGGGD----------------GGMPGIFFSYELSPLMVKITE-KS 331
Y R L + D G +PGIFF YE PL + E +
Sbjct: 394 YRRDRQRGDALRSMEYAAADLTQSSEQDRVGITHSTGALPGIFFFYEPQPLQIAYFEGRM 453
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALL 359
L H ++ + G + ++D +
Sbjct: 454 YGLLHFIVQLCAIVGGVFTVSSMIDRFV 481
>gi|224086657|ref|XP_002307923.1| predicted protein [Populus trichocarpa]
gi|222853899|gb|EEE91446.1| predicted protein [Populus trichocarpa]
Length = 351
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/388 (28%), Positives = 179/388 (46%), Gaps = 65/388 (16%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + +K LDAF + E +KT G V+I+ + ++ L ++ Y T ++ V
Sbjct: 1 MGMKQAIKKLDAFPRAEEHLLQKTQSGALVSIIGLVTMATLFYHELAYYLTTYTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D +RG LPIH++I P++ CD L++DA+D SG+ + ++ +I+K RL+ G +
Sbjct: 61 DLTRGETLPIHINITFPSLPCDVLSVDAIDMSGKHEVDLDTSIWKLRLNSYGHITG--TE 118
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG----AETETRKCCNTCNEVKEAYRYKK 176
+ + V+K+ + ED + +G AET +K VK+A
Sbjct: 119 YLSDLVEKEHEAHNHDHNKDHHEDSHAKQHTHGFDDAAETMVKK-------VKQA----- 166
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
L N EGC++YG L+V RV+G+FHI S++ +++
Sbjct: 167 --------------------LANG--EGCRVYGVLDVQRVAGNFHI------SVHGLNIF 198
Query: 237 DIQPYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
Q A N +H I LSFG K PLDGT E + F YYIKI+PT
Sbjct: 199 VAQMIFDGAKHVNVSHIIHDLSFGPKYPG---IHNPLDGTTRILHETSGTFKYYIKIVPT 255
Query: 295 IYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
Y + S + D P ++F Y+LSP+ V I E+ +S H T+
Sbjct: 256 EYRYISKEVLPTNQFSVTEYFSPMTDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITR 315
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISK 368
+ + GT+ ++D + ++ ++K
Sbjct: 316 LCAVLGGTFALTGMLDRWMCRLLEALTK 343
>gi|118482697|gb|ABK93267.1| unknown [Populus trichocarpa]
Length = 366
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 105/399 (26%), Positives = 178/399 (44%), Gaps = 64/399 (16%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + +K LDAF + E +KT G V+++ + ++ L ++ Y T ++ V
Sbjct: 1 MGVKQAIKSLDAFPRAEEHLLQKTQSGALVSVIGLVIMATLFYHELAYYLTTYTVHQMSV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D RG LPIH++I P++ CD L++DA+D SG+ + ++ NI+K+ L
Sbjct: 61 DLQRGEILPIHVNITFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKKLL------------ 108
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR------- 173
G T +E + +G T T + + EA+
Sbjct: 109 --------------FGMLLTRIEFLQLRLNSHGHITGTEYLSDLVEKEHEAHNHDHDKDH 154
Query: 174 ----YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYS 229
+++ D + + + L N EGC++YG L+V RV+G+FHI S
Sbjct: 155 HKDSHEEQHTHGFDDAAETMIKKVKQALANG--EGCRVYGVLDVQRVAGNFHI------S 206
Query: 230 INHVHVHDIQPYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
++ +++ Q A N +H I LSFG K PLDGT E + +F Y
Sbjct: 207 VHGLNIFVAQMIFDGAKHVNVSHIIHDLSFGPKYPG---IHNPLDGTARILRETSGIFKY 263
Query: 288 YIKIIPTIYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
YIKI+PT Y + S + D P ++F Y+LSP+ V I E+ +S
Sbjct: 264 YIKIVPTEYRYISKDVLPTNQFSVTEYFSPITDFDRTWPAVYFLYDLSPITVTIKEERRS 323
Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
H T++ + GT+ ++D ++ ++ ++K G
Sbjct: 324 FLHFITRLCAILGGTFALTGMLDRWMYRLLEALTKPNRG 362
>gi|410083920|ref|XP_003959537.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
gi|372466129|emb|CCF60402.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
Length = 417
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 172/397 (43%), Gaps = 74/397 (18%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF K ED +T GG ++I C + +L+ + + ++ T +L VD
Sbjct: 5 KLLVFDAFNKTEEDVRVRTNTGGLISIGCVVLTCFLLLREWYQFNEIITRPKLVVDRDHD 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
+L ++ DI P+ISCD L LD +D +G+ L + E + K R+D +G + + N
Sbjct: 65 LELDLNFDITFPSISCDLLTLDILDDAGDLQLDLLESGLTKTRVDSNGVSLTTESFNIGN 124
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
K+ ++ CGSCYGA + + CC TC +V +AY
Sbjct: 125 EALIKRDFPQD-----------YCGSCYGALDQGKNDELNANEKVCCQTCEDVHDAYLNI 173
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY------S 229
WA + I QC+ E +++ EGC++ G +NRV G+ H APG SY +
Sbjct: 174 GWAFYDGKNIEQCETEGYVDRINEHLNEGCRVQGSARLNRVQGNIHFAPGKSYQDYSRRN 233
Query: 230 INHVHVHDIQPYT---SAAFNTTHHIRHLSFGIKLQDDDERR----------KPLDGTVA 276
H HD Y S +FN H I H SFG +++ PLDG
Sbjct: 234 SFATHFHDTSLYDKTHSLSFN--HIIHHFSFGKPIENSYVNNHNEGLSKISTNPLDGRKV 291
Query: 277 KAEEGASM--FNYYIKIIPTIYERLDGSK-----------------LGGGD--------- 308
+ + ++Y+ +I+PT YE L+ GG D
Sbjct: 292 FPDRDSHFIQYSYFAEIVPTRYEYLNNKSDPVETTQFSATFHSRPLRGGRDEDHPTTLHQ 351
Query: 309 -GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
GG+PG+F +E SPL V E+ W+ + N
Sbjct: 352 RGGIPGLFIYFETSPLKVINKEQ---YSQAWSTFLLN 385
>gi|242059085|ref|XP_002458688.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
gi|241930663|gb|EES03808.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
Length = 350
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 171/382 (44%), Gaps = 68/382 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK L+AF E +KT G VTI+ L + L ++ Y T ++ VD RG
Sbjct: 7 LKSLNAFPHAEEHLLKKTYSGAVVTILGLLVMITLFVHELQFYLTTYTVHQMSVDLKRGE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
LPIH+++ P++ C+ L++DA+D SG+ + + NI+K RLD G I + + + V
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG--TEYLSDLV 124
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K + E D E +K T NE E + ++
Sbjct: 125 EKGHGAHHDHDHGQEHHD------------EQKKPEQTFNE-------------EAEKMI 159
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
+ + L N EGC++YG L+V RV+G+FHI+ VH + +
Sbjct: 160 KSVKQ----ALGNG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLNIFVAEKI 202
Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
S+ N +H I LSFG K PLD T + + F YYIK++PT Y+ L
Sbjct: 203 FEGSSHVNVSHVIHELSFGPKYPGI---HNPLDETSRILHDTSGTFKYYIKVVPTEYKYL 259
Query: 300 DGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
L D P ++F Y+LSP+ V I E+ ++ H T++ +
Sbjct: 260 SKKVLPTNQFSVTEYFLPIRPSDRAWPAVYFLYDLSPITVTIKEERRNFLHFITRLCAVL 319
Query: 346 SGTYITFMLVDALLHSCVKKIS 367
GT+ ++D ++ ++ ++
Sbjct: 320 GGTFAMTGMLDRWMYRLIESVT 341
>gi|357112836|ref|XP_003558212.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
compartment protein 3-like [Brachypodium distachyon]
Length = 349
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 171/382 (44%), Gaps = 69/382 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK +AF + +KT G VTI + + L ++ Y T ++ VD RG
Sbjct: 7 LKNFNAFPHAEDHLLKKTYSGAIVTIFGLIIMFTLFVHELKFYLTTYTMHQMSVDLKRGE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
LPIH+++ P++ C+ L++DA+D SG+ + + NI+K RLD G I E ++ +
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGTIIG---TEYLSDL 123
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K+ + E D E +K +T NE + D +V
Sbjct: 124 VEKEHGAHHHDNGHEHHD------------EEKKPEHTFNE-------------DADKMV 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
+ + L+N EGC++YG L+V RV+G+FHI+ VH + Y
Sbjct: 159 KSVRQ----ALENG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLNIYVAEKI 201
Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
S+ N +H I LSFG K PLD T + + F YYIK++PT Y L
Sbjct: 202 FEGSSHVNVSHVIHELSFGPKYPGI---HNPLDDTTRILHDASGTFKYYIKVVPTEYRYL 258
Query: 300 DGSKLG--------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
L D P ++F Y+LSP+ V I E+ ++ H T++ +
Sbjct: 259 SKQVLPTNQFSVTEYFVPIRPADRSWPAVYFLYDLSPITVTIKEERRNFLHFITRLCAVL 318
Query: 346 SGTYITFMLVDALLHSCVKKIS 367
GT+ ++D ++ ++ +S
Sbjct: 319 GGTFAMTGMLDRWMYRIIESVS 340
>gi|115455745|ref|NP_001051473.1| Os03g0784400 [Oryza sativa Japonica Group]
gi|14718311|gb|AAK72889.1|AC091123_8 unknown protein [Oryza sativa Japonica Group]
gi|108711422|gb|ABF99217.1| Serologically defined breast cancer antigen NY-BR-84, putative,
expressed [Oryza sativa Japonica Group]
gi|113549944|dbj|BAF13387.1| Os03g0784400 [Oryza sativa Japonica Group]
gi|215737170|dbj|BAG96099.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625918|gb|EEE60050.1| hypothetical protein OsJ_12848 [Oryza sativa Japonica Group]
Length = 350
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 168/384 (43%), Gaps = 70/384 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK +AF + +KT G VTI + + L ++ Y T ++ VD RG
Sbjct: 7 LKNFNAFPHAEDHLLKKTYSGAIVTIFGLIIMVTLFAHELKFYLTTYTVHQMSVDLKRGE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
LPIH+++ P++ C+ L++DA+D SG+ + + NI+K RLD G I E +N +
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG---TEYLNDL 123
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K+ T N E ED K K +T NE E + ++
Sbjct: 124 VEKEHGTHNHDHDHEHEDEQK------------KQEHTFNEDAEKM---------VKSVK 162
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
Q EGC++YG L+V RV+G+FHI+ VH + +
Sbjct: 163 QAMEN----------GEGCRVYGVLDVQRVAGNFHIS-----------VHGLNIFVAEKI 201
Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
S+ N +H I LSFG K PLD T + + F YYIKI+PT Y L
Sbjct: 202 FDGSSHVNVSHIIHDLSFGPKYPGI---HNPLDETTRILHDTSGTFKYYIKIVPTEYRYL 258
Query: 300 DGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
K P ++F Y+LSP+ V I E+ ++ H T++
Sbjct: 259 SKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYDLSPITVTIKEERRNFLHFLTRLCAV 318
Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
+ GT+ ++D ++ ++ ++K
Sbjct: 319 LGGTFAMTGMLDRWMYRLIESVTK 342
>gi|326490247|dbj|BAJ84787.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493774|dbj|BAJ85349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 348
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 169/376 (44%), Gaps = 58/376 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK +AF + +KT G VTI+ + + L ++ Y T ++ VD RG
Sbjct: 7 LKNFNAFPHAEDHLLKKTYSGAIVTILGLIVMVTLFAHELTFYLTTYTMHQMSVDLKRGE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
LPIH+++ P++ C+ L++DA+D SG+ + + NI+K RLD G+ I
Sbjct: 67 TLPIHINVSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGQIIG---------- 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
TE + E E ++ +T NE + D +V
Sbjct: 117 ------TEYLSDLVEKEHGTHDHDHGHGHDVQKQPEHTFNE-------------DADKMV 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
+ + KL EGC++YG L+V RV+G+FHI+ GL+ + + + D S+
Sbjct: 158 K------SVKLAMENGEGCRVYGALDVQRVAGNFHISVHGLNIFVAN-QIFD----GSSH 206
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N +H I LSFG + PLD T + + F YYIK++PT Y L L
Sbjct: 207 VNVSHVIHRLSFGPEYPGI---HNPLDDTSRILHDTSGTFKYYIKVVPTEYRYLSKGVLP 263
Query: 306 GG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
D P ++F Y+LSP+ V I E+ ++ H T++ + GT+
Sbjct: 264 TNQFSVTEYFVPIRPTDRSWPAVYFLYDLSPITVTIREERRNFLHFITRLCAVLGGTFAM 323
Query: 352 FMLVDALLHSCVKKIS 367
++D ++ ++ IS
Sbjct: 324 TGMLDRWMYRIIESIS 339
>gi|194708090|gb|ACF88129.1| unknown [Zea mays]
gi|195607866|gb|ACG25763.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|195619788|gb|ACG31724.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|413952088|gb|AFW84737.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 350
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 171/382 (44%), Gaps = 68/382 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK L+AF E +KT G VTI L + L ++ Y T ++ VD RG
Sbjct: 7 LKSLNAFPHAEEHLLKKTYSGAVVTIFGLLIMITLFVHELQFYLTTYTVHQMSVDLKRGE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
LPIH+++ P++ C+ L++DA+D SG+ + + NI+K RLD G I
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG---------- 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T L D + G +GA + + +E K ++++ E + ++
Sbjct: 117 ------------TEYLSDLVEKG--HGAHHDHDHDHDHHDEQK---KHEQTFNEEAEKMI 159
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
+ + L N EGC++YG L+V RV+G+FHI+ VH + +
Sbjct: 160 KSVKQ----ALGNG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLNIFVAEKI 202
Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
S N +H I LSFG K PLD T + + F YYIK++PT Y+ L
Sbjct: 203 FEGSNHVNVSHVIHELSFGPKYPGI---HNPLDETSRILHDTSGTFKYYIKVVPTEYKYL 259
Query: 300 DGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
L D P ++F Y+LSP+ V I E+ ++ H T++ +
Sbjct: 260 SKKVLPTNQFSVTEYFLPIRPTDRAWPAVYFLYDLSPITVTIKEERRNFLHFVTRLCAVL 319
Query: 346 SGTYITFMLVDALLHSCVKKIS 367
GT+ ++D ++ +K ++
Sbjct: 320 GGTFAMTGMLDRWMYQLIKTVT 341
>gi|218193856|gb|EEC76283.1| hypothetical protein OsI_13786 [Oryza sativa Indica Group]
Length = 350
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 167/384 (43%), Gaps = 70/384 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK +AF + KT G VTI + + L ++ Y T ++ VD RG
Sbjct: 7 LKNFNAFPHAEDHLLPKTYSGAIVTIFGLIIMVTLFAHELKFYLTTYTVHQMSVDLKRGE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
LPIH+++ P++ C+ L++DA+D SG+ + + NI+K RLD G I E +N +
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG---TEYLNDL 123
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K+ T N E ED K K +T NE E + ++
Sbjct: 124 VEKEHGTHNHDHDHEHEDEQK------------KQEHTFNEDAEKM---------VKSVK 162
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
Q EGC++YG L+V RV+G+FHI+ VH + +
Sbjct: 163 QAMEN----------GEGCRVYGVLDVQRVAGNFHIS-----------VHGLNIFVAEKI 201
Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
S+ N +H I LSFG K PLD T + + F YYIKI+PT Y L
Sbjct: 202 FDGSSHVNVSHIIHDLSFGPKYPGI---HNPLDETTRILHDTSGTFKYYIKIVPTEYRYL 258
Query: 300 DGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
K P ++F Y+LSP+ V I E+ ++ H T++
Sbjct: 259 SKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYDLSPITVTIKEERRNFLHFLTRLCAV 318
Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
+ GT+ ++D ++ ++ ++K
Sbjct: 319 LGGTFAMTGMLDRWMYRLIESVTK 342
>gi|412992535|emb|CCO18515.1| predicted protein [Bathycoccus prasinos]
Length = 428
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 178/398 (44%), Gaps = 74/398 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---- 62
+ LDA+ K ED+ + G A+T++C+L L + + EL VD+
Sbjct: 32 IASLDAYPKVKEDYARGSTLGAAITLICFLACLCLFFSEYRTHLVSKIESELDVDTMGVN 91
Query: 63 ---SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPI--- 115
S +L +++D+ +++C+ + LD++D++GE H V + +I KRRLD DGKPI
Sbjct: 92 KFESNAERLHVYVDVTFHSLACELITLDSLDAAGEVHHDVHDGHITKRRLDRDGKPIPRR 151
Query: 116 ------------QEPQKE--VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKC 161
++P K + V++K+ E E E + + + + RK
Sbjct: 152 DSSAKDDVAVTREKPNKHKHIEKLVREKEKEEEGKKNEGEQEQEQQEQNHEQHDEKRRKL 211
Query: 162 CNTCNEVKEAYRYKKWALPELDTIV--QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGS 219
NT A +++ ++ Q N E KN EGC++ GYLEVNRV GS
Sbjct: 212 QNT------ALAGFGGGFFDINALIHEQFPNGLE-EAFKNKNKEGCEVMGYLEVNRVPGS 264
Query: 220 FHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG------IKLQDDDERRKPLDG 273
F I+PG S I H IQ + N +H I L+FG + L D + R P +
Sbjct: 265 FSISPGKSLQIGMSH---IQLNVVSHLNMSHTINRLAFGEAFPGALNLLDKNTRYLPPN- 320
Query: 274 TVAKAEEGASMFNYYIKIIPTIYERLDGSKL-------------------GGGDGGMP-G 313
++ Y++K++PT + RL + L G G G P G
Sbjct: 321 ---------AVHQYFLKVVPTSFARLKDTTLATNQYSVTESSSSAKQSFFGMGSSGKPSG 371
Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
I+F YELSP+ + E+ S G + C+I G T
Sbjct: 372 IYFHYELSPIRIDFKERRNSFGEFMLSV-CSIIGGVAT 408
>gi|194689880|gb|ACF79024.1| unknown [Zea mays]
gi|413949702|gb|AFW82351.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 176
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/174 (40%), Positives = 101/174 (58%), Gaps = 3/174 (1%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
F RLK LDA+ K EDF+++T+ GG VT+V + + L + YF ST +L VD
Sbjct: 3 AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+SRG +L ++ DI P+I C L++D D SGEQH + H+I KRRL+ G I E +KE
Sbjct: 63 TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
+ K ++ ++G + E CG+CYGAE +CCN+C E + R K
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEESGKHIRRK 173
>gi|340055752|emb|CCC50073.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 404
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 105/399 (26%), Positives = 175/399 (43%), Gaps = 59/399 (14%)
Query: 6 RLKGLDAFTK--PY--EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
R++ D F++ P E E+T GG ++ + L ++ I +++ Y V E++VD
Sbjct: 3 RIRRFDMFSRFDPALEEAGRERTTCGGLLSFLFILLVALFIKIELYRYLSVVELREMYVD 62
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
G + I ++I P I CD +A+D + GE +I K R+ P Q+P
Sbjct: 63 PHVGGDMHITINITFPHIHCDLMAVDVIGPFGEYMTGAVRSITKVRV-----PTQDPA-- 115
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
V + ++ +T L NK C SCYGAE CCN+C++V A+R W
Sbjct: 116 ---PVSEALPQSDRSVSTAALPVSNKMGGCVSCYGAEESPGDCCNSCDDVHAAFRRNGWE 172
Query: 179 LPELDT-IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
+ E D + QC + +EGC I+ V ++ G+ H PG + ++
Sbjct: 173 IDENDIKLSQCTEGQLHNVGPVSPSEGCNIHSKFSVRKIKGNIHFVPGRRLNHRGQPMYV 232
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS-----MFNYYIKI 291
++ N +H L FG + + PL+G A+ AS F+YY+++
Sbjct: 233 VRREAIKKMNLSHVFHSLEFGERFPG---QVNPLNGIANARGVRNASEVVSGRFSYYVQV 289
Query: 292 IPTIYERLD--GSKL-------------------------GGGDGGM-PGIFFSYELSPL 323
+PT Y+ + GS++ G D + G+F Y++SP+
Sbjct: 290 LPTEYQFVPALGSRVRLETNQYSVKQHFTESWYTTDRRYPGWSDPTLVAGVFIVYDVSPV 349
Query: 324 --MVKITEKSKSLGHLWTKIMCNISGTYITFM-LVDALL 359
+V T SL HL + MC + G T ++D+LL
Sbjct: 350 KTLVMRTSPYPSLIHLLLR-MCAVGGGAFTVASMIDSLL 387
>gi|407859749|gb|EKG07137.1| hypothetical protein TCSYLVIO_001725 [Trypanosoma cruzi]
Length = 393
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 166/387 (42%), Gaps = 54/387 (13%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
+++ +D F KP ED+ Y GA V++V + I L+ +VC Y + + T EL VD
Sbjct: 22 KKVAAVDLFPKPKEDYSRSQTYHGALVSLVTVVVIGLLVFWEVCSYIFGRDAYTTELSVD 81
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+S +++ +LDI P + C ++LD +D +G +L+V NI+K +D G
Sbjct: 82 TSLSTEVEFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNIFKTPVDAQGN-------- 133
Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETET------RKCCNTCNEVKEA 171
+ ++ E G+ + +D P CG C+ +E + +CCNTCN+V A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPNSPQFCGRCFISEHQLSMMDNKNRCCNTCNDVLNA 192
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
Y + P+ + + QC E S GC G L V + G AP
Sbjct: 193 YDQQGLPRPQKNEVEQCIYELS------LINPGCNYKGTLIVKKFGGRLVFAP--KRVPG 244
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
+ D+ F+++H I LS G + RR PL+G A+ + Y+
Sbjct: 245 GFLIKDVM-----QFDSSHIINKLSIGDERVTRFSRRGVQHPLNGHEFVAQRRFTEIRYF 299
Query: 289 IKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSK 332
+K++PT+Y S G G P + ++ P+ V +
Sbjct: 300 LKVVPTMYFSGKNSASFNATYEYSVQWSHRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRS 359
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALL 359
S H ++ + G ++ L+D L+
Sbjct: 360 SFPHFIVQLCGIVGGLFVVLGLIDGLV 386
>gi|308808274|ref|XP_003081447.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
gi|116059910|emb|CAL55969.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
Length = 406
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/378 (28%), Positives = 157/378 (41%), Gaps = 85/378 (22%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD- 61
S LK LDA K ED+ ++ G +T+VC L + Y EL V+
Sbjct: 30 MSALLKSLDANPKLKEDYARQSTSGVIITLVCGALCLLLFLGEFFAYRTTKVVSELRVNP 89
Query: 62 ------SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKP 114
+ +L I +DI +++C+ + LD D +GEQH V + +I KRR+D DGKP
Sbjct: 90 MGVHSVTPNAERLKIDIDITFHSMACNLITLDTSDKAGEQHYDVHDGHIEKRRVDKDGKP 149
Query: 115 I------QEPQK--EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCN 166
I ++P K E+V A LE N+ S G ET +K
Sbjct: 150 IDATFTSEKPNKHKEMVQA----------------LEKMNQTDSVVGNETALQKQ----- 188
Query: 167 EVKEAYRYK---------KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVS 217
A+R+ K A PE +N + +N EGC++ GYLEVNRV
Sbjct: 189 --DRAHRFAGVFGFESMLKEAFPE-----GIENAF-----RNEAREGCEVKGYLEVNRVP 236
Query: 218 GSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK 277
G I+PG + + + + N TH I LSFG + PLDGT
Sbjct: 237 GRISISPG---RVVMMGMQQFKLNVHTDLNLTHTIHRLSFGERFPG---LVSPLDGTHRS 290
Query: 278 AEEGASMFNYYIKIIPTIYERLDGS-------------------KLGGGDGGM-PGIFFS 317
A Y++ ++ T ++ L G LGG G PG+FF+
Sbjct: 291 LPPNAVQ-QYFLNVVATTFQPLRGDARISTHQYSVTETFTTSQRSLGGSSNGRDPGVFFT 349
Query: 318 YELSPLMVKITEKSKSLG 335
YE+ P+ V E + G
Sbjct: 350 YEIEPIRVDFKETRTTFG 367
>gi|449445069|ref|XP_004140296.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 388
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 114/425 (26%), Positives = 186/425 (43%), Gaps = 100/425 (23%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKT---VYGGAVTIVCW---------------------- 35
M + +K LDAF + E +KT +G I CW
Sbjct: 1 MGLKQTIKSLDAFPRAEEHLLQKTQTGAFGNMRGICCWISHNGHTISARTEILSLHIYCS 60
Query: 36 ---------LFISYL--ICVD----VCDYFQVSTTE--ELFVDSSRGSKLPIHLDIVVPT 78
LF YL I +D + D+ + + VD RG LPIH+++ P+
Sbjct: 61 SVGKQQMWPLFFLYLRIIPLDWGEGMSDFGDPVLWKGFHMSVDLKRGETLPIHINMTFPS 120
Query: 79 ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTT 138
+ CD L++DA+D SG+ + ++ NI+K RL+ G+ I + + + V+K+ V ++
Sbjct: 121 LPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGQIIG--TEYLSDLVEKEHVDHKHDHD 178
Query: 139 TTELED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
+ +D P+ G AE N +VK+A L E
Sbjct: 179 HDKEKDHPHIHGFDQAAE-------NLVKKVKQA-------LEE---------------- 208
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
+GC++YG L+V RV+G+FHI+ + +N + V + S N +H I LSF
Sbjct: 209 ----AQGCRVYGVLDVQRVAGNFHIS---VHGLN-IFVAQMIFGGSKHVNVSHMIHDLSF 260
Query: 258 GIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--------------SK 303
G K PLDGTV + + F YYIKI+PT Y+ + S
Sbjct: 261 GPKYPGI---HNPLDGTVRILRDTSGTFKYYIKIVPTEYKYISKAVLPTNQFSVTEYFSP 317
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
+ D P ++F Y+LSP+ V I E+ +S H T++ + GT+ ++D + +
Sbjct: 318 MTDSDRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMFRFL 377
Query: 364 KKISK 368
+ ++K
Sbjct: 378 EALTK 382
>gi|145476255|ref|XP_001424150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391213|emb|CAK56752.1| unnamed protein product [Paramecium tetraurelia]
Length = 339
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 164/389 (42%), Gaps = 82/389 (21%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ LD + K D E T G ++++ + I L ++ Y +V + E+FVD +RG
Sbjct: 8 RLRKLDIYRKLPADLTEPTTAGALISVISTIVIVILFITELQAYIEVDNSSEMFVDINRG 67
Query: 66 S-KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
++ ++LDI CD L+LD D G ++VE + K+R
Sbjct: 68 GEQIRVNLDIEFHKFPCDILSLDVQDIMGSHVVNVEGRLIKKR----------------- 110
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+K KV +E + E G E + + +++A++ K
Sbjct: 111 -IKNGKVISEEVHSNHE-----------GHEHHNQPSIDFA-RIEQAFKEK--------- 148
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
EGCQI GY+ VN+V G+FH++ I H Q
Sbjct: 149 ------------------EGCQIAGYIIVNKVPGNFHVSAHAFGGILH---QVFQRSQIQ 187
Query: 245 AFNTTHHIRHLSFG-------IKLQDDDERRKPLDGT--VAKAEEGASM-FNYYIKIIPT 294
+ +H I H+SFG IK Q PLD T VA+ + G M F YYI ++PT
Sbjct: 188 TLDLSHTINHISFGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPT 247
Query: 295 IYERLDGS-----KLGGGDG-----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
Y + G+ + +P +F Y+LSP+ VK + +S H +I
Sbjct: 248 TYVDVSGNEYYVHQFTANSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAI 307
Query: 345 ISGTYITFMLVDALLH-SCVKKISKVEIG 372
+ G + +VD ++H S V + K E+G
Sbjct: 308 LGGVFTIASIVDGMIHKSVVALLKKYEMG 336
>gi|71409973|ref|XP_807304.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871276|gb|EAN85453.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 393
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 164/389 (42%), Gaps = 58/389 (14%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
+++ +D F KP ED+ Y GA V++V + I L+ +V Y + + T EL VD
Sbjct: 22 KKVAAVDLFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVYSYIFGRDAYTTELSVD 81
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+S ++ +LDI P + C ++LD +D +G +L+V NI+K +D G
Sbjct: 82 TSLSKEVEFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNIFKTPVDAQGN-------- 133
Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETE------TRKCCNTCNEVKEA 171
+ ++ E G+ + +D P CG C+ +E + +CCNTCN+V A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPNSPQFCGRCFISEHQLSMSENKNRCCNTCNDVLNA 192
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
Y + P+ + + QC + S GC G L V + G AP
Sbjct: 193 YDQQGLPRPQKNEVEQCIYDLS------RINPGCNYKGTLIVKKFGGRLVFAP--KRVPG 244
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
+ D+ F+++H I LS G + RR PL+G + + Y+
Sbjct: 245 GFLIRDVM-----QFDSSHIINKLSIGDERVTRFSRRGVQHPLNGHEFDTQRRFTEIRYF 299
Query: 289 IKIIPTIYERLDGSKLGG------------------GDGGMPGIFFSYELSPLMVKITEK 330
+K++PT+Y L G G G P + ++ P+ V +
Sbjct: 300 LKVVPTMY--LSGKNSASFNATYEYSVQWSHRLTPIGFGHFPSVSLGFDFHPMQVNNYFR 357
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALL 359
S H ++ + G ++ L+D L+
Sbjct: 358 RSSFPHFLVQLCGIVGGLFVVLGLIDGLV 386
>gi|123472317|ref|XP_001319353.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121902134|gb|EAY07130.1| hypothetical protein TVAG_342940 [Trichomonas vaginalis G3]
Length = 358
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 165/386 (42%), Gaps = 63/386 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR-- 64
LK D F K +ED KT + G VT+VC +SYL+ + ++L VD ++
Sbjct: 3 LKDFDFFPKVFEDHSRKTDFSGTVTVVCLAIMSYLLVFQTLGFIASPPKQKLVVDQAKLP 62
Query: 65 -----------GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
KL I++DI P++ C + +D E + +R+ DGK
Sbjct: 63 VNEDNVLDWPFVPKLQIYIDIEFPSLPCPVIDFQVLDRFEEIQSDSFSKVKLKRIGPDGK 122
Query: 114 PIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR 173
+K KK E P CGSCYGA + CCNTC +VK A++
Sbjct: 123 -----------IIKNKKT-----------EKPEVCGSCYGAAS---GCCNTCKDVKNAFK 157
Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
K P L TI QC++ + + E C +YG + V G+ + G SY
Sbjct: 158 KKGRVPPSLSTIRQCRD--AVIDYNHIRNESCHVYGTVIVPPTHGTIVMNSGDSYGAQMN 215
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN--YYIKI 291
+ FN TH I + G ++D PL G + K ++ + Y+I+
Sbjct: 216 TTTSSLGISIDDFNFTHKINDIYIG----ENDLGDHPLKG-IKKVQKEVGRYKGLYFIR- 269
Query: 292 IPTIYERLDGSKL------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
T+ E+ ++ G G PG++F+Y++SP++V + ++ ++ +
Sbjct: 270 --TLREQKGSLQVYRATSSHYDRYREGTTGKFPGLYFNYDVSPIIV-MYKRDTTVLNFVI 326
Query: 340 KIMCNISGTYITFMLVDALLHSCVKK 365
++M + G Y L+D L +K+
Sbjct: 327 ELMAILGGIYSLGSLLDHLSLITIKR 352
>gi|123451578|ref|XP_001313964.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121895945|gb|EAY01112.1| hypothetical protein TVAG_442240 [Trichomonas vaginalis G3]
Length = 375
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 174/387 (44%), Gaps = 39/387 (10%)
Query: 7 LKGLDAFTKPYE-DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
LK D F K + D KT G ++++ +S L ++ + E++ VDSSR
Sbjct: 4 LKKFDIFPKYTDPDVKVKTNGGAILSLIAMTLMSILFLHELYRFIFPRIYEDIAVDSSRV 63
Query: 66 S---KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S + I+ +I + + C L + A D+ G ++I ++R+D +G I +
Sbjct: 64 SLARTMNINFNISI-QVPCGKLFISAYDAEGNAQSTDVNDIKQQRIDENGFAI-----DS 117
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
VN ++ K+ + + CG CYGA + KCCN+C +V A++ K W + +
Sbjct: 118 VNWIRLKRAAKSKKQKKEQPQ--QYCGKCYGALPQG-KCCNSCEDVINAFKAKGWGIDGI 174
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
D QC +E + K E C +YG + V +SG + A Y + H DI
Sbjct: 175 DRWQQCIDEGYADLGK----ESCNVYGDINVAHISGFLYFALE-DYKVGDKHPKDIS-RL 228
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYERLDG 301
S +N TH I +L FG ++ + PLDG TV + E G +NY ++++PT + G
Sbjct: 229 SHKYNLTHTINYLEFGPRVSHEP---GPLDGLTVLQEEPGLMQYNYDLEVVPTKWFSSRG 285
Query: 302 SKLG---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
+ + G+PGIF +Y L+P+ + E S L T + +
Sbjct: 286 FPVSTYKFHPMITQKNFTEKVNRGVPGIFLNYNLAPISLVQYEVISSPWKLITSVCAIVG 345
Query: 347 GTYITFMLVDALLHSCVKKI-SKVEIG 372
G + L D + + I K +IG
Sbjct: 346 GCFTCVSLADQIFFRTLSSIEGKRQIG 372
>gi|428171090|gb|EKX40010.1| hypothetical protein GUITHDRAFT_154283 [Guillardia theta CCMP2712]
Length = 331
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 166/385 (43%), Gaps = 83/385 (21%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E LK D F K +D E +V GG V++V F+ L+ + + + +T E+ VD+ R
Sbjct: 13 EWLKNFDVFPKTVDDAKEASVSGGTVSVVVLFFMFLLLFTETSIFLKTNTKFEMEVDTMR 72
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G L I+ DI P + C L+LD++D SGE L ++V+
Sbjct: 73 GGMLQINFDISFPGLPCSVLSLDSMDVSGEHEL-----------------------DIVH 109
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
V K+ + ++ N G + + + + + +KE
Sbjct: 110 DVYKRAMDSKG----------NALGPVISEKVKLARDALSISHIKEQLERH--------- 150
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
EGC IYG L +VSG+FH LS HV A
Sbjct: 151 ------------------EGCNIYGTLNAQKVSGNFH----LSLHAQDFHVLAQVFPDRA 188
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
NT+H + HLSFG +D + PLDG + ++G+ F YYIKI+PT + LDG+ +
Sbjct: 189 TVNTSHIVNHLSFG---RDYPGLKNPLDGEMKVLDQGSGTFEYYIKIVPTKFHHLDGTII 245
Query: 305 GGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
G P ++F Y++SP+MV++ + +S H T++ G Y+
Sbjct: 246 DTNQYSVTDHFRKLQDGFPAVYFIYDISPIMVRVKQWKQSFSHYATQLCAITGGMYV--- 302
Query: 354 LVDALLHSCVKKI-SKVEIGGKTVT 377
V LH+ K + +K IG K+ +
Sbjct: 303 -VTGQLHALSKFLWTKYYIGRKSFS 326
>gi|221114903|ref|XP_002155889.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Hydra magnipapillata]
Length = 399
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 165/386 (42%), Gaps = 69/386 (17%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
S+ K LDAF K E + E + GG V+I+ +LFIS L+ + Y T + VD
Sbjct: 15 SKGFKDLDAFPKIPESYQETSASGGTVSILVFLFISMLVISEFIYYSGSILTYKYEVDKE 74
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPIQEP 118
+K I++DI V + CD + D +D SG ++LH+ + +
Sbjct: 75 ADNKFRINIDITV-AMECDDIGADVLDLSGGNVDTGENLHLTPAHFS---------MSSN 124
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
QK+ +A + + + E Y + + + +V Y
Sbjct: 125 QKQWWDAFRSARKSDEG----------------YRSINKVTQIDMIFGDVMPTY------ 162
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
+P+ + ++E+ ++ +GC+IYG +EVN+V+G+FHI G S H H
Sbjct: 163 MPD-----EIESEFEGKEF-----DGCRIYGNIEVNKVAGNFHITAGKSIPHPRGHAHLS 212
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
+ +N +H I LSFG + PLDG + M+ YYI I+PT +
Sbjct: 213 ALVSELNYNFSHRIDMLSFG---EPHPGIINPLDGDLMITTTPYHMYQYYIAIVPTTIQT 269
Query: 299 LDGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
L + L G G+PGIFF Y+ + + V + E+ +S ++
Sbjct: 270 LKNTIKTNQYSVTQRSRQLNLNSGSQGVPGIFFKYDFNAISVSVNEERRSFNEFLIRLCG 329
Query: 344 NISGTYITFMLVDALLHSCVKKISKV 369
I G + T +LHS + ++ +
Sbjct: 330 IIGGVFAT----SGMLHSAIGALADI 351
>gi|229594330|ref|XP_001024169.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila]
gi|225566928|gb|EAS03924.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila
SB210]
Length = 348
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 166/397 (41%), Gaps = 90/397 (22%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK D + K D E T+ G V+IV L + L + Y V E+FVD ++G
Sbjct: 9 KLKSFDMYRKLPSDLTEPTLSGAIVSIVSTLIMLILFISEFNGYLSVEENSEMFVDVAQG 68
Query: 66 -SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
K+ ++LDI P CD +LD D G ++VE ++ K RL G +++ ++
Sbjct: 69 GQKIRVNLDIDFPQFPCDIFSLDVQDIMGSHSVNVEGDLVKTRLSSTGTYLEKIKQNTGG 128
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+G + +LE VK+A+ +
Sbjct: 129 DHGHGGHGHGHGDVSLDLE-----------------------RVKKAFNDR--------- 156
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP-YTS 243
EGC+I G++ VN+V G+FHI+ +H + + +Q +
Sbjct: 157 ------------------EGCKISGFMLVNKVPGNFHIS-------SHAYGNYLQRIFQD 191
Query: 244 AAFNT---THHIRHLSFGIKLQDDDERR----------KPLDGTVAKAEEGASMF----N 286
A NT +H I HLSFG +++D R +PLD T E
Sbjct: 192 ARINTLDLSHVINHLSFG---EENDLNRIKKTFQQGILQPLDHTKKIKPENLRTVGVTHQ 248
Query: 287 YYIKIIPTIYERLDGSK-----LGGGDGGM-----PGIFFSYELSPLMVKITEKSKSLGH 336
YYI ++PT Y+ L K M P +FF Y+LSP+ V+ ++ +S H
Sbjct: 249 YYINVVPTTYKDLSNRKYHVYQFVANSNEMTTQHLPAVFFRYDLSPVTVQFSQTRESFLH 308
Query: 337 LWTKIMCNISGTYITFMLVDALLH-SCVKKISKVEIG 372
++ I G + ++D+++H S V + K E+G
Sbjct: 309 FLVQVCAIIGGVFTVAGIIDSIVHRSVVHILKKAEMG 345
>gi|340058906|emb|CCC53277.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 394
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 97/398 (24%), Positives = 162/398 (40%), Gaps = 62/398 (15%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVT----------IVCWLFISYLICVDVCDYFQV 52
F ++ + D F KP ED+ GA+ +V W ++Y+ D
Sbjct: 21 FLKKFEAFDFFPKPKEDYRRSQTTVGALVSVVTLALILLLVLWEGVAYIYGRD------- 73
Query: 53 STTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
+ EL VD+S ++ ++DI P C+ L LD D++G +V N++K LD G
Sbjct: 74 AYRTELAVDTSLTKEVVFNIDISFPQERCNELFLDVFDATGSTRFNVTMNVHKTPLDASG 133
Query: 113 KPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSC-------YGAETETRKCCNTC 165
K + ++ T + P CG C Y + ET C NTC
Sbjct: 134 KSVFVGERHF-----HTDYTVPQYNAKFDPTSPKFCGKCFVGRKYSYLQQPET-PCRNTC 187
Query: 166 NEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG 225
+V E + +K A P T+ QC E S E GC G L++ + SG+ AP
Sbjct: 188 EQVMEEFERRKLAKPSKSTVEQCIGELSEE------NPGCNYRGSLKLKKASGTLIFAPK 241
Query: 226 LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRK---PLDGTVAKAEEGA 282
+ ++ ++D+ FN +H I LS G L +R PL+ +
Sbjct: 242 MFENV--FRINDLM-----QFNASHVINKLSIGDDLVRRFSKRGVYFPLNNQRFVTTKQF 294
Query: 283 SMFNYYIKIIPTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVK 326
+ Y++KI+PT Y + D ++ G G +P + FS++ S + V
Sbjct: 295 AQVRYFMKIVPTTYISDNTANPVASTYEYSVQWDHRQVPLGSGEIPSVVFSFDFSSMQVN 354
Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
+ S H + + G ++ +VD L+ ++
Sbjct: 355 NYFQRPSFCHFIVSLCGIVGGLFVVLGMVDGLVARVLR 392
>gi|407424942|gb|EKF39210.1| hypothetical protein MOQ_000571 [Trypanosoma cruzi marinkellei]
Length = 393
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 97/389 (24%), Positives = 162/389 (41%), Gaps = 58/389 (14%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
+++ +D F KP ED+ Y GA V++V + I L+ +V Y + + T EL VD
Sbjct: 22 KKVAAVDFFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVYSYIVGRDAYTTELSVD 81
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+S +++ +LDI P I C ++LD +D +G +L+V NI+K +D G
Sbjct: 82 TSLSTEVEFNLDITFPRIRCHDVSLDILDVTGTVNLNVTRNIFKTPVDAQGN-------- 133
Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETET------RKCCNTCNEVKEA 171
+ ++ E G+ + +D P CG C+ E + +CCNTC++V A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPNSPQFCGRCFINEHQVSVKENKNRCCNTCDDVLNA 192
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
Y + P + QC + S GC G L V + G AP
Sbjct: 193 YDQQGLPRPRKSEVEQCIYDLS------RINPGCNYKGTLIVKKFGGRLVFAP--KRVSG 244
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
+ D+ F+++H I LS G + RR PL+G + + Y+
Sbjct: 245 GFLIKDVM-----QFDSSHVINKLSIGDERVTRFSRRGVQHPLNGHKFDTQRRITEIRYF 299
Query: 289 IKIIPTIYERLDGSKLGG------------------GDGGMPGIFFSYELSPLMVKITEK 330
+KI+PT+Y L G G G P + ++ P+ V +
Sbjct: 300 LKIVPTMY--LSGKNSAPFNATYEYSVQWSQRLTPIGFGHFPSVSLGFDFHPMQVNNYFR 357
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALL 359
S H ++ + G ++ L+D L+
Sbjct: 358 RSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386
>gi|145351005|ref|XP_001419879.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580112|gb|ABO98172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 373
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 154/374 (41%), Gaps = 54/374 (14%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---- 62
LK LDA K ED+ ++ G T+VC L + Y EL V+
Sbjct: 5 LKALDANPKLKEDYVSESTSGVITTLVCAALCLILFFGEFFSYKTTKIVSELRVNPLGVH 64
Query: 63 ---SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEP 118
+L I +DI +++C+ + LD D +GE+H V + +I KRR+D GK
Sbjct: 65 QTVPNAERLKIDVDITFHSLACNLITLDTSDKAGEEHYDVHDGHIEKRRIDKHGK----- 119
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
V++A + ++ L+ N+ S + A++ + + +
Sbjct: 120 ---VIDAAFTSEKPNKHKEIEQALQKMNETDSAHAADSHAMEHVQPFGGMFGLQSLLQEV 176
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
PE +N EGC++ GYLEVNRV G F I+PG S + + +
Sbjct: 177 FPE----------GVEHAFRNENQEGCEVKGYLEVNRVPGRFSISPGRSLMMG---MQMV 223
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
+ A N TH I LSFG + PLDGT A Y++ ++ T +E
Sbjct: 224 KLNVQTALNLTHTIHRLSFG---ESFPGLVSPLDGTHRSLPPNAVQ-QYFLNVVSTTFEP 279
Query: 299 LDGSK--------------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
L +K +G +G PG+ F+YE+SP+ V E S G
Sbjct: 280 LGENKIISTHQYSVTETFTSSQRSIMGTSNGRDPGVIFTYEISPIRVDFKETRTSFGAFV 339
Query: 339 TKIMCNISGTYITF 352
I C++ G +T
Sbjct: 340 LGI-CSVIGGVVTM 352
>gi|291232448|ref|XP_002736170.1| PREDICTED: MGC81917 protein-like [Saccoglossus kowalevskii]
Length = 395
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 155/383 (40%), Gaps = 84/383 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E++ E T GG V+I+ + I+ L+ ++ Y + + E VD+ S
Sbjct: 13 VKELDAFPKIPENYQETTATGGTVSILTFSLIAILVISEIQYYSETTMKYEYEVDTDLTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSG-----------EQHLHVEHNIYKRRLDLDGKPI 115
KL +++DI V + CDY+ D +D +G EQ +H E RR K +
Sbjct: 73 KLRLNIDITV-AMKCDYIGADVLDMTGDTVSASFGSLKEQAVHFE---LSRRQKQWQKKL 128
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
Q + + N + + + G + P + GA
Sbjct: 129 QAVRSALANEHAIQDLLFKVGFDGSPTSMPEREDKPAGAPN------------------- 169
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
C+I+G + +N+V+G+FHI G S H
Sbjct: 170 ----------------------------SCRIHGSMSLNKVAGNFHITLGKSIPHPRGHA 201
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT- 294
H + + +N +H I H SFG+ PLDG +E A M+ Y+I+I+PT
Sbjct: 202 HLAAFISQSQYNFSHRIDHFSFGVPTPGI---VNPLDGDQRVTQENARMYQYFIQIVPTR 258
Query: 295 --------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
+ ER G G+ GIFF Y+LS + VK+TE+ + +
Sbjct: 259 VNTRRASADTHQYAVTERDRVISHSSGSHGVAGIFFKYDLSSVSVKVTEEYQPYWQFLVR 318
Query: 341 IMCNISGTYITFMLVDALLHSCV 363
+ I G + T +LHS +
Sbjct: 319 LCGIIGGVFAT----SGMLHSLI 337
>gi|71755761|ref|XP_828795.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834181|gb|EAN79683.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 391
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/390 (24%), Positives = 156/390 (40%), Gaps = 49/390 (12%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
++ +D FTKP ED+ +T G ++I+ + L +V Y + EL VD
Sbjct: 23 RKVAAVDLFTKPKEDYCRSQTRAGAIISIITVFAVGLLASWEVMSYTLGWNAYKTELSVD 82
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+S + ++DI C L LD D SG ++V N+ K +D+ G
Sbjct: 83 TSPEKNITFNIDITFMQEPCHDLFLDVSDVSGTFSINVTENLLKTPVDVGGN-------- 134
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNK---CGSCY---GAETETRKCCNTCNEVKEAYRYK 175
+ ++ T T DPN CG C+ A + CCNTC EV + K
Sbjct: 135 LAYLGTRRFFTDPRSPLYTRRNDPNSPDFCGRCFTGNKAIAGGKNCCNTCEEVMAEHDRK 194
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
P + + QC E S E GC G L V +VSG P + N + +
Sbjct: 195 GLPRPNKNVVEQCIGELSLEN------PGCNYRGALNVRKVSGVIFFTPKVIK--NTIKM 246
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMF---NYYIKII 292
D+ F+ +H I S G + RR L+ + G+ F YY+ I+
Sbjct: 247 EDL-----LKFDASHVINKFSIGDESVRRHSRRGVLNPLEKQRFNGSGRFMKVRYYLNIV 301
Query: 293 PTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
PT Y + ++ G GG P + FS++ P+ V K + + H
Sbjct: 302 PTTYGSGASSGLHPPTYEYSANWNSREVAIGYGGFPSVEFSFDFFPMQVNNNFKREPIYH 361
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKI 366
++ I G ++ LVD+++ + +
Sbjct: 362 FLVQLCGIIGGLFVVLGLVDSVVARLTRLV 391
>gi|198422133|ref|XP_002131157.1| PREDICTED: similar to ptx1 [Ciona intestinalis]
Length = 391
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 97/395 (24%), Positives = 162/395 (41%), Gaps = 73/395 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E E + GG +T++ I++L+ ++ YF V+ + VD S
Sbjct: 15 VKSLDAFPKVPELCIETSTRGGTITLITTAVITFLVLSEIIYYFNVTFRYDYQVDVDFDS 74
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ ++ DI V T C + D +D +G+ + E+ +Y+ Q++ + +
Sbjct: 75 KVWLNFDITVAT-PCTLIGADVLDVTGQATV-FENEVYEELTFFRQSNTAAAQRKALLRM 132
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K++ +T ENG +E+ L +
Sbjct: 133 KEELLTPENGKKMSEIT--------------------------------------LQSNF 154
Query: 187 QCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+ KL N + C+ YG L +N+V+G+FHI G + H H ++
Sbjct: 155 NPNLMFKNRKLDNVGIKMDACRFYGNLPLNKVAGNFHIVAGKPIQMFGGHAHLSMMFSPI 214
Query: 245 AFNTTHHIRHLSFG------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---- 294
+N +H I H SFG I D DER V +E + +F YY+ ++ T
Sbjct: 215 PYNFSHRIDHFSFGNMKTGFINALDGDER-------VTSSE--SYIFQYYLDVVSTKINS 265
Query: 295 -----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ E+ G G PG+FF Y SPL V ITE+ L ++
Sbjct: 266 RRITTDTFQFSVSEQSRALDHASGSHGQPGVFFKYNFSPLSVMITEQKMPFYRLLVRLCS 325
Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
+ G + T +++ALL C+ +K K +T
Sbjct: 326 IVGGIFATSHVLNALL-GCLPGFTKQSESSKLITN 359
>gi|261334705|emb|CBH17699.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 391
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 155/390 (39%), Gaps = 49/390 (12%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
++ +D FTKP ED+ +T G ++I+ + L +V Y + EL VD
Sbjct: 23 RKVAAVDLFTKPKEDYCRSQTRAGAIISIITVFAVGLLASWEVMSYTLGWNAYKTELSVD 82
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+S + ++DI C L LD D SG ++V N+ K +D+ G
Sbjct: 83 TSPEKNITFNIDITFMQEPCHDLFLDVSDVSGTFSINVTENLLKTPVDVGGN-------- 134
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETET---RKCCNTCNEVKEAYRYK 175
+ ++ T T DPN CG C+ + CCNTC EV + K
Sbjct: 135 LAYLGTRRFFTDPRSPLYTRRNDPNSPDFCGRCFTGNKAIAGGKNCCNTCEEVMAEHDRK 194
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
P + + QC E S E GC G L V +VSG P + N + +
Sbjct: 195 GLPRPNKNVVEQCIGELSLEN------PGCNYRGALNVRKVSGVIFFTPKVIK--NTIKM 246
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMF---NYYIKII 292
D+ F+ +H I S G + RR L+ + G+ F YY+ I+
Sbjct: 247 EDL-----LKFDASHVINKFSIGDESVRRHSRRGVLNPLEKQRFNGSGRFMKVRYYLNIV 301
Query: 293 PTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
PT Y + ++ G GG P + FS++ P+ V K + + H
Sbjct: 302 PTTYGSGASSGLHPPTYEYSANWNSREVAIGYGGFPSVEFSFDFFPMQVNNNFKREPIYH 361
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKI 366
++ + G ++ LVD+++ + +
Sbjct: 362 FLVQLCGIVGGLFVVLGLVDSVVARLTRLV 391
>gi|449704125|gb|EMD44426.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 185
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 91/173 (52%), Gaps = 11/173 (6%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K D + K ED + +GG +TI+C + I L + Y Q +L VD R S
Sbjct: 1 MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+P+H DI P SC ++D + SGE + +E N+ K R+ DG + E + + + +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIGIEQNVTKIRIHHDGSLVTENEMKAIQS- 119
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ E DP +C SCYGAET +KCC TC++VKEAY+ + W L
Sbjct: 120 ----------KLSIETPDPKECRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL 162
>gi|212275606|ref|NP_001131002.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
gi|194690678|gb|ACF79423.1| unknown [Zea mays]
gi|413952089|gb|AFW84738.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 293
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/329 (27%), Positives = 148/329 (44%), Gaps = 68/329 (20%)
Query: 60 VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
VD RG LPIH+++ P++ C+ L++DA+D SG+ + + NI+K RLD G I
Sbjct: 3 VDLKRGETLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG--- 59
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
T L D + G +GA + + +E K ++++
Sbjct: 60 -------------------TEYLSDLVEKG--HGAHHDHDHDHDHHDEQK---KHEQTFN 95
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
E + +++ + L N EGC++YG L+V RV+G+FHI+ VH +
Sbjct: 96 EEAEKMIKS----VKQALGNG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLN 138
Query: 240 PYT-------SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
+ S N +H I LSFG K PLD T + + F YYIK++
Sbjct: 139 IFVAEKIFEGSNHVNVSHVIHELSFGPKYPGI---HNPLDETSRILHDTSGTFKYYIKVV 195
Query: 293 PTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
PT Y+ L L D P ++F Y+LSP+ V I E+ ++ H
Sbjct: 196 PTEYKYLSKKVLPTNQFSVTEYFLPIRPTDRAWPAVYFLYDLSPITVTIKEERRNFLHFV 255
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKIS 367
T++ + GT+ ++D ++ +K ++
Sbjct: 256 TRLCAVLGGTFAMTGMLDRWMYQLIKTVT 284
>gi|348505737|ref|XP_003440417.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oreochromis niloticus]
Length = 374
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 155/371 (41%), Gaps = 66/371 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E T GG V+++ + F++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVESTASGGTVSLIAFTFMAVLAFLEFFVYRHTWMKYEYEVDRDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + + DG Q E VN
Sbjct: 73 KLRINVDITV-AMRCQYIGADVLD------------LAETMVASDGL-----QYEPVN-- 112
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETR-KCCNTCNEV--KEAYRYKKWALPELD 183
EL + + R + + +V K A + ALP
Sbjct: 113 -------------FELPPQQRIWHMTLLHIQERLRVEHALQDVIFKAAIKGAPPALPP-- 157
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
++E ST L C+I+G+L VN+V+G+FHI G S H H
Sbjct: 158 -----RSEDSTASLS-----ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVAH 207
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
++N +H I HLSFG L PLDGT A + MF Y+I I+PT
Sbjct: 208 DSYNFSHRIDHLSFGEPLPGII---SPLDGTEKIATDSNHMFQYFITIVPTKLNTYKVSA 264
Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ ER G G+ GIF Y++S LMVK+TE+ L ++ I G
Sbjct: 265 ETHQYSVTERERVINHAAGSHGVSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIIGGI 324
Query: 349 YITFMLVDALL 359
+ T ++ L+
Sbjct: 325 FSTTGMIHGLV 335
>gi|403371798|gb|EJY85783.1| hypothetical protein OXYTRI_16231 [Oxytricha trifallax]
Length = 333
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/394 (24%), Positives = 157/394 (39%), Gaps = 98/394 (24%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LD F + +D E T G +T +C+ + L +V Y V T ++ VD S
Sbjct: 7 LARLDIFKRVPKDLTEPTFCGALLTSICFFVLVGLSLSEVARYLNVETKTDMLVDISHSD 66
Query: 67 -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI P C+ L+LD D G H+++E + K+R+
Sbjct: 67 DKLEINIDITFPRFPCEILSLDVQDVMGTHHVNIEGGLVKQRI----------------- 109
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
T NG E S + + + T +EVK
Sbjct: 110 -------TANGEVILEY-------SAHTKQDRSHVASQTRDEVKAQ-------------- 141
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
EGC IYG + +NRV G+FHI+ +++ N + + +Q
Sbjct: 142 -----------------EGCHIYGNILINRVPGNFHIS---THAFNDILMGLMQ--EGHH 179
Query: 246 FNTTHHIRHLSFGIKLQDDDERRK--------PLDGTVAKAEEGASMF------NYYIKI 291
F+ ++ I H+SFG + D RRK PLDG A F N+Y+
Sbjct: 180 FDFSYKIDHISFGKRNNFDMIRRKFRDHQLISPLDGKSETAPRDNKNFPKSLEGNFYLIA 239
Query: 292 IPTIYERLDG-------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+P+ ++ + G + G G+ + F+YELSP+ V ++ +S+
Sbjct: 240 VPSYFKDVSGGVYQVYQLTANDHTNFGTGNNILK---FNYELSPITVGFSQDRESIALFL 296
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
I I G + ++DA++H + K IG
Sbjct: 297 VHICAIIGGVFTAVSIIDAIIHKSFSLLFKKRIG 330
>gi|390337315|ref|XP_792272.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Strongylocentrotus purpuratus]
gi|390337317|ref|XP_003724529.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Strongylocentrotus purpuratus]
Length = 388
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 156/368 (42%), Gaps = 58/368 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K ED+ + T GG V+IV ++ I+ L+ + Y VD+ +
Sbjct: 13 VKELDAFPKIPEDYVKTTSTGGTVSIVTFIVIAGLVISEFMYYLDSRMKYGYDVDTDFNT 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + CDY+ D +DS+G+ + GK +EP
Sbjct: 73 KLQINIDITV-AMKCDYIGADVLDSAGDSAM----------FKFSGKLKEEP-------- 113
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T E + S + RK + + +++ ++ +
Sbjct: 114 -------------TSFEMTPQQRSWHKTLQTVRKALSEEHAIQDLLFQTGFSSKPTN--- 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
Q + S +KL + C+++G L N+V+G+FH+ G S H H +
Sbjct: 158 QPQRVDSGKKL-----DACRLHGSLTTNKVAGNFHVTIGKSIPHPRGHAHLALMIDPNNY 212
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I H S+G + PLDG + E ++ Y+I+I+PT
Sbjct: 213 NFSHRIDHFSYGTPVPG---IVNPLDGDLKVTNESLQIYQYFIQIVPTKVKTRAAKAHTH 269
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G G+ GIFF YELS L++ + E L ++ + G + T
Sbjct: 270 QYAVTERERVINHGAGSHGVTGIFFKYELSSLVISVEEVYDPFWKLLVRLCGIVGGVFAT 329
Query: 352 FMLVDALL 359
++++L+
Sbjct: 330 SGIINSLM 337
>gi|340372649|ref|XP_003384856.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Amphimedon queenslandica]
Length = 347
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 147/371 (39%), Gaps = 66/371 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K ED+ + T GG +IV I +LI ++ + E VD+ S
Sbjct: 10 VKEFDAFPKVSEDYIKPTTRGGLFSIVSITIILFLIVSELSYFKDSEILYEYMVDTDMTS 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L + DI V + C++L D VD++G + L QE KE
Sbjct: 70 TLKLRFDITV-AMPCEFLGADVVDAAGS----------SKSLQ------QEVHKE----- 107
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T EL K E R E R + + D+
Sbjct: 108 ----------PTIFELNKEQKAWLAAKQEVIRRH---------EGLRLLRDVM--FDSHP 146
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSA 244
Q + + C+++G+++VN+VSG+FHI G + + H H+ P +
Sbjct: 147 QQYIPFPEHPQHSAPLTSCRVHGHIQVNKVSGNFHITAGQAVPHPQGHAHLSAFVP--TN 204
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
N +H I FG+ + PL+GT A E +F YYI+I+PT + GS L
Sbjct: 205 MINFSHRIDSFGFGVSTPGMVD---PLEGTYVIARESNRLFQYYIQIVPTTLQMRGGSDL 261
Query: 305 ----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
G G+PG+FF YE+ LMV + E + L ++ + G
Sbjct: 262 HTNQYSVTERNRAISHKAGSHGLPGLFFKYEIYSLMVLMKEVDRPLSLFLVRLCAIVGGV 321
Query: 349 YITFMLVDALL 359
+ T ++ L
Sbjct: 322 FATLGMISQFL 332
>gi|270003406|gb|EEZ99853.1| hypothetical protein TcasGA2_TC002635 [Tribolium castaneum]
Length = 380
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 161/378 (42%), Gaps = 70/378 (18%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
++K +D F K E F EK+ GG ++ ++ I++L+ +++ Y + D+
Sbjct: 18 KIKKIDIFPKIEETFKEKSSVGGTFSVFSFILITWLVFLEINYYLDSKFIFKFSPDTDFD 77
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
+KL I++DI V + C L D +DS+ + N YK LD + + + ++
Sbjct: 78 AKLKINVDITV-AMPCSNLGADILDSTNQ-------NAYKFGSLDEEDTWFEMAPNQQIH 129
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL--PEL 182
KK+ + V+E Y K L
Sbjct: 130 FHNKKQFNSY---------------------------------VREEYHALKDVLWKSRF 156
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
T+ + + E ST N + C+I+G L +N+VSG+FHI G S ++ H+H +
Sbjct: 157 STMFRHRPERST--YPNRPHDACRIHGSLILNKVSGNFHITAGKSLNLPRGHIHISAFMS 214
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
+N +H I SFG PL+G G ++FNY+I+++PT
Sbjct: 215 ERDYNFSHRIDTFSFG---DSSPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN 271
Query: 295 ----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
+ +D K G GMPGIFF Y++S L V ++++ LG ++
Sbjct: 272 VNTYQYSVKELNRPIDHDK---GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSI 328
Query: 345 ISGTYITFMLVDALLHSC 362
I G ++ V++ + C
Sbjct: 329 IGGIFVCSGFVNSFVQFC 346
>gi|189235693|ref|XP_966630.2| PREDICTED: similar to AGAP005044-PA [Tribolium castaneum]
Length = 373
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 161/378 (42%), Gaps = 70/378 (18%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
++K +D F K E F EK+ GG ++ ++ I++L+ +++ Y + D+
Sbjct: 11 KIKKIDIFPKIEETFKEKSSVGGTFSVFSFILITWLVFLEINYYLDSKFIFKFSPDTDFD 70
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
+KL I++DI V + C L D +DS+ + N YK LD + + + ++
Sbjct: 71 AKLKINVDITV-AMPCSNLGADILDSTNQ-------NAYKFGSLDEEDTWFEMAPNQQIH 122
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL--PEL 182
KK+ + V+E Y K L
Sbjct: 123 FHNKKQFNSY---------------------------------VREEYHALKDVLWKSRF 149
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
T+ + + E ST N + C+I+G L +N+VSG+FHI G S ++ H+H +
Sbjct: 150 STMFRHRPERST--YPNRPHDACRIHGSLILNKVSGNFHITAGKSLNLPRGHIHISAFMS 207
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
+N +H I SFG PL+G G ++FNY+I+++PT
Sbjct: 208 ERDYNFSHRIDTFSFG---DSSPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN 264
Query: 295 ----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
+ +D K G GMPGIFF Y++S L V ++++ LG ++
Sbjct: 265 VNTYQYSVKELNRPIDHDK---GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSI 321
Query: 345 ISGTYITFMLVDALLHSC 362
I G ++ V++ + C
Sbjct: 322 IGGIFVCSGFVNSFVQFC 339
>gi|343476464|emb|CCD12449.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 224
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 108/230 (46%), Gaps = 19/230 (8%)
Query: 5 ERLKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
+R LD F K +D ++T GG ++I + I+ LI +V + E++V
Sbjct: 2 KRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVAIALLIIGEVRYFLTTVEQHEMYV 61
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQ 119
D G + + ++I P + CD + DA+D+ GE + + K R+D D P+ E
Sbjct: 62 DPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDSDTLAPLGE-A 120
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ +VN KK D + C SCYGAE CC+TC++V+ A+ ++W
Sbjct: 121 RPLVNMNKKAT------------SDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEF 168
Query: 180 PELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
E D +I+QC E EGC ++ V RV+ + H PG +
Sbjct: 169 HEDDVSIMQCAKERLQMAASTASREGCNLHSSFRVPRVTENIHFVPGRMF 218
>gi|303290895|ref|XP_003064734.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453760|gb|EEH51068.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 363
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 164/386 (42%), Gaps = 64/386 (16%)
Query: 7 LKGLDAFT--KPYEDFHEKT-VYGGAVTIVCWLFISYLICVDVCDYFQVSTT---EELFV 60
L+ +D ++ K EDF + + + GG +T C L L V +YF T L V
Sbjct: 5 LRRMDVYSSSKVIEDFRQSSSMSGGIITCACALLCFVLF---VNEYFYHRTPVVKSSLTV 61
Query: 61 D--------SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGE--QHLHVEHNIYKRRLDL 110
D S+ ++L + +DI + CD + +D +D +GE +H H + KRRLD
Sbjct: 62 DATGLDAKTSANSNRLHVEIDITFHQLPCDIINMDTMDQAGEAFHDVHSGH-LKKRRLDS 120
Query: 111 DGKPIQEPQK-EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVK 169
DGKP++ K E NA K+ + E+ ++ K +
Sbjct: 121 DGKPLEGVFKHEKANAHKEIREDIESHALALSGDEEYKTSE------------------E 162
Query: 170 EAYRYKKWALPELDTIVQCKNEYSTEK-LKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
+ + + L ++ + EK KN EGC++ GYLEVNRV GSF ++PG S
Sbjct: 163 DLMPEEGLTMFNLKQLLDKQFPGGIEKAFKNEAREGCEVIGYLEVNRVPGSFSVSPGKSI 222
Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
+ HV Q + N +H I +FG PLDG A+ + + Y+
Sbjct: 223 RLGMEHV---QLNVQSRLNMSHTINRFAFGKSFPG---FVSPLDGN-ARDLDPNYVHQYF 275
Query: 289 IKIIPTIYERLDGSKLGGGD----------------GGMP-GIFFSYELSPLMVKITEKS 331
+KI+PT + L G L G P G++F+Y+LSPL V E
Sbjct: 276 LKIVPTSFTPLRGEYLQSNQYSVTEASAPAKALNVVGSKPSGVYFNYDLSPLRVDYVESR 335
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDA 357
S+ T + + G LV A
Sbjct: 336 NSMTEFITSVCAIVGGVASMSGLVQA 361
>gi|443716796|gb|ELU08142.1| hypothetical protein CAPTEDRAFT_19918 [Capitella teleta]
Length = 403
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 154/370 (41%), Gaps = 63/370 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ LDAF K E + E + GG+++I+ + + LI ++ Y + VD
Sbjct: 12 VRELDAFPKVPEGYQECSASGGSISILVLVLSAILIISEIRYYTATEFKYDYEVDKHFEG 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C + D +D +G Q++ + + + + P Q + ++A+
Sbjct: 72 KLSINIDITV-AMKCHQVGADVLDITG-QNVASFGKLTEEEVHFELSPNQRKHLKSMSAI 129
Query: 127 KKKKVTTENGTTTTELEDPNKC--GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
N E +K S +G Y P D
Sbjct: 130 --------NEYIRNEYHSIHKFLWRSGFGG-------------------YLAQMPPREDH 162
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN-HVHVHDIQPYTS 243
KN GC+ YG L+VN+V+G+FHI G S +N H H
Sbjct: 163 PQTPKN-------------GCRFYGTLDVNKVAGNFHITAGKSVPLNIGGHAHMAMMVKE 209
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
+ +N TH I H SFG K+ R PLDG + M+ Y+I+++PT
Sbjct: 210 SDYNFTHRIEHFSFGDKVSG---RINPLDGEEKNTNDNYHMYQYFIQVVPTHVKTLFTDI 266
Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ E+ G G G+PGIF Y+L+P+MVK+ E K L ++ I G
Sbjct: 267 NTYQFSVTEQNRTISHGKGSHGIPGIFVKYDLAPMMVKVIESHKPFSQLLIRLCGIIGGL 326
Query: 349 YITFMLVDAL 358
+ T ++ +
Sbjct: 327 FATSGMLHGM 336
>gi|307188057|gb|EFN72889.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Camponotus floridanus]
Length = 386
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 153/370 (41%), Gaps = 58/370 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + +KT GG +I L I+YLI + + + D+ +
Sbjct: 12 VKELDAFPKVPELYVDKTAVGGTFSIFTMLIIAYLIIAETSYFLDSRLQFKFEPDTEIDA 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C + D +DS+ + + +
Sbjct: 72 KLQINIDITV-AMPCGRIGADVLDSTNQNMISYD-------------------------- 104
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T E T EL + A E K N+ ++E Y L + + I
Sbjct: 105 -----TLEEEDTWWELTQEQR------AHFEALKHMNSY--LREEYHAIHELLWKSNQIT 151
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ T C+I+G L VN+V+G+FHI G S S+ H+H T +
Sbjct: 152 LYSEMPMRSHKPDYATNACRIHGSLVVNKVAGNFHITAGKSLSLPRGHIHISAYMTDQDY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL- 304
N TH I SFG PL+G A+ ++ Y+++++PT I L SK
Sbjct: 212 NFTHRINRFSFG---GPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLSTSKTY 268
Query: 305 -------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
G G+PGIFF Y++S L +K+T++ ++ K+ + G ++T
Sbjct: 269 QYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVT 328
Query: 352 FMLVDALLHS 361
LV ++ S
Sbjct: 329 SGLVKNIVQS 338
>gi|119596606|gb|EAW76200.1| ERGIC and golgi 3, isoform CRA_b [Homo sapiens]
Length = 239
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 87/158 (55%), Gaps = 20/158 (12%)
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
+HD+Q + N TH+I+HLSFG +D PLD T A + + MF Y++K++PT
Sbjct: 85 IHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPT 141
Query: 295 IYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+Y ++DG L GD G+PG+F YELSP+MVK+TEK +S H
Sbjct: 142 VYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFL 201
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
T + I G + L+D+L++ + I K+++G T
Sbjct: 202 TGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKTT 239
>gi|326911226|ref|XP_003201962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Meleagris gallopavo]
Length = 377
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 160/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + I++L ++ Y E VD S
Sbjct: 13 MKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + IY+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMRCQYVGADVLDLAETMVASADGLIYEPVV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S E + C+I+G+L VN+V+G+FHI G + H H + ++
Sbjct: 159 --REDNSLES-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAETH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y++S LMV +TE+ ++ I G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH + +++V
Sbjct: 329 ----TGILHGFGRFVAEV 342
>gi|169731514|gb|ACA64886.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
(predicted) [Callicebus moloch]
Length = 237
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 84/156 (53%), Gaps = 19/156 (12%)
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
HD+Q + N TH+I+HLSFG +D PLD T A + + MF Y++K++PT+
Sbjct: 84 HDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTV 140
Query: 296 YERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
Y ++DG L GD G+PG+F YELSP+MVK+TEK +S H T
Sbjct: 141 YMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLT 200
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+ I G + L+D+L++ + I K GKT
Sbjct: 201 GVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 236
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 55/113 (48%), Gaps = 3/113 (2%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEP 118
KL I++D++ P + C A + S G ++++ H I D I P
Sbjct: 66 DKLKINIDVLFPHMPC---AFHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNP 115
>gi|313661438|ref|NP_001186332.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Gallus gallus]
Length = 377
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 160/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + I++L ++ Y E VD S
Sbjct: 13 MKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + IY+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMRCQYVGADVLDLAETMVASADGLIYEPVV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S E + C+I+G+L VN+V+G+FHI G + H H + ++
Sbjct: 159 --REDNSLES-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELIPG---IINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAETH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y++S LMV +TE+ ++ I G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH + +++V
Sbjct: 329 ----TGILHGFGRFVAEV 342
>gi|384253563|gb|EIE27037.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 327
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 154/380 (40%), Gaps = 89/380 (23%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
A+ + ++T +G VT++ + L ++ +Y + + + VD+SR +
Sbjct: 9 FSAYARAESHLVQRTYFGAIVTVLGVILAIVLFANELREYTTPFSIQTMSVDTSRAHYIR 68
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEH----NIYKRRLDLDGKPIQEPQKEVVNA 125
++ + P++ C L+LDA D SGE+ H I+K RL+ G+ I
Sbjct: 69 MNFNFTYPSMPCQVLSLDATDMSGEKSGDSGHAANGEIHKVRLNEAGEKI---------- 118
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
E P + G G + + N+ +A+
Sbjct: 119 ------------GLGEYIPPRRWGFMMGKPRQ--QEVMEVNQAMDAH------------- 151
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT--- 242
EGC I+G+L++ RV+G+F ++ VHV D T
Sbjct: 152 -----------------EGCNIFGWLDLQRVAGNFRVS---------VHVEDFFALTRLQ 185
Query: 243 --SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ N++H I +SFG + PLDG ++ + F Y++K++PT Y+
Sbjct: 186 ADTTGINSSHIIHRVSFGPTFPG---QVNPLDGAERILDKESGTFKYFLKVVPTEYQWSA 242
Query: 301 GSK--------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
G++ + G+ MP ++FSY++SP+ V I+E KS HL + +
Sbjct: 243 GTRTTTNQYSVTEYDTVVHKGEMQMPSVWFSYDISPISVTISEIRKSFAHLLVRFCAVVG 302
Query: 347 GTYITFMLVDALLHSCVKKI 366
G + + D +H V I
Sbjct: 303 GVFAVTGMFDRWVHRIVTAI 322
>gi|156553212|ref|XP_001600226.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Nasonia vitripennis]
Length = 391
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 162/377 (42%), Gaps = 72/377 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAFTK ED+ +++ GG ++ + I YLI + + + D S
Sbjct: 11 VKELDAFTKIPEDYRKQSAVGGTFSLASFCIIVYLIYAETSYFLDSRLQFKFEPDVEYDS 70
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L +++DI V T CD + D +DS+ + +
Sbjct: 71 QLQMNIDITVAT-PCDRIGADILDSTNQNLM----------------------------- 100
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T+EN LED + + + R + +R + AL EL +
Sbjct: 101 -----TSEN----FHLED-----TWWDLTPDQRAHFEALKHMNYYFREEYHALHEL---L 143
Query: 187 QCKNE--YSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
N+ +S E K + + C+IYG L+VN+V+G+FH+ G S + H H
Sbjct: 144 WKSNQLTFSNEMPKRDYIPSYPSNACRIYGSLDVNKVAGNFHVTSGKSVILPRGHFHFTS 203
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYER 298
++S A+N TH I SFG + PL+G + +F Y+I+++ T I
Sbjct: 204 FHSSTAYNFTHRINRFSFG---KPSPGIIHPLEGDEKITTDNMMLFQYFIEVVSTDINML 260
Query: 299 LDGSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
+ SK G G+PGIFF Y+ S L +K++++ S+G K+
Sbjct: 261 MHKSKTYQYSVKDHQRPINHAKGSHGIPGIFFKYDTSALKIKVSQERDSIGQFLVKLCAT 320
Query: 345 ISGTYITFMLVDALLHS 361
+ ++T ++++++ +
Sbjct: 321 VGCIFVTNGILNSIVQN 337
>gi|332020071|gb|EGI60517.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Acromyrmex echinatior]
Length = 390
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 155/371 (41%), Gaps = 60/371 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + +KT GG +I L I YL+ + Y D+ +
Sbjct: 12 VKELDAFPKVPEVYVDKTAVGGTFSIFTVLIIMYLVIAETSYYLDSRLQFTFEPDTDIDA 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++D+ V + C + D +DS+ QH+ +D D ++ E+
Sbjct: 72 KLQINIDVTV-AMPCGRIGADVLDSTN-QHM----------IDFDSLTEEDTWWEL---- 115
Query: 127 KKKKVTTENGTTTTELEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
T E T L+ N Y A E N E +P
Sbjct: 116 -----TQEQRTHFEALKHMNSYLREEYHAIHELLWKSNQVTLYSE--------MP----- 157
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
K Y + N C+++G L +N+V+G+FHI G S S+ H H+H T
Sbjct: 158 ---KRSYVPDYAPN----ACRVHGSLNINKVAGNFHITAGKSLSVPHGHIHISAFMTDRD 210
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL 304
+N TH I SFG PL+G A+ ++ Y+++++PT I L SK
Sbjct: 211 YNFTHRINKFSFG---GPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLTTSKT 267
Query: 305 --------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
G G+PGIFF Y++S L +K+T++ ++ K+ + G ++
Sbjct: 268 YQYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFV 327
Query: 351 TFMLVDALLHS 361
T LV ++ S
Sbjct: 328 TSGLVKNVVQS 338
>gi|320170541|gb|EFW47440.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 408
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 164/381 (43%), Gaps = 32/381 (8%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E +K LD F K + E + GG VT+VC + I +L+ ++ +YF VD
Sbjct: 17 EFVKQLDIFPKVASTYKETSSSGGTVTLVCLVLIVFLVGAELGEYFNQQAAFSYGVDPVV 76
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSG-EQHLHV-EHNIYKRRLDLDGKPIQEPQKEV 122
L + DIVV + CD L D + ++G +H H H+ QEP+ +
Sbjct: 77 DGSLKLTYDIVV-AMPCDLLGADVLQATGTSKHGHDHSHDDAAPVKPAPPPSPQEPRNRL 135
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
N +++ + T ++G ++ K + R+ E ++ + +L
Sbjct: 136 FNVMRQSRDTGDDGRDDHGHDEMRKEPVVFALSAAQREWLA---ENRKPLTREHLSLS-- 190
Query: 183 DTIVQCKNEYSTEKLKNTFTEG----CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
T + K + + + EG C+++G + ++++G+FHI G + + H H
Sbjct: 191 GTTRKAKKNFQAMPRELSSQEGTPDACRLHGSVSADKIAGNFHIIAGAAVEVPGGHAHMG 250
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
Q A N TH I HLSFG ++ + PLDG + Y+I+++PT+Y R
Sbjct: 251 QMIPQHALNFTHRINHLSFGEEMPGME---FPLDGDEWITTSHTMAYQYFIQVVPTVYTR 307
Query: 299 L--DGSKLGGGD-----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
D +L G +PG+FF Y+ P++V + S HL ++ I
Sbjct: 308 HANDPEQLRSGQFSVTRHESPNSNRLPGLFFKYDTFPILVTVQYSPYSFWHLLIRLSGII 367
Query: 346 SGTYITFMLVDALLHSCVKKI 366
G + T +H V+ +
Sbjct: 368 GGVFAT----SGFIHQVVRFV 384
>gi|123499008|ref|XP_001327531.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121910461|gb|EAY15308.1| hypothetical protein TVAG_394520 [Trichomonas vaginalis G3]
Length = 357
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 157/374 (41%), Gaps = 52/374 (13%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD----- 61
L+ D F K + T +GG ++I L ++ Y + VD
Sbjct: 3 LRKFDVFPKLDRQYRVSTSFGGILSIASITVTIILFFSEIHTYLNPPIRQRFIVDNTKPM 62
Query: 62 -----SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH--NIYKRRLDLDGKP 114
SS KL ++LDI P + C L +D VD + L +E N + R LD GK
Sbjct: 63 GISGKSSNQRKLSVNLDIEFPNVPCYLLHIDVVDPISQLDLPMESISNNFAR-LDKTGKN 121
Query: 115 IQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
I + E K + +N T+ SCY A K C TC +V +A++
Sbjct: 122 IGDFHPE-------KFLEPDNAKTS-------DSTSCYAANNT--KVCKTCKDVVQAHKN 165
Query: 175 KKWALPELDTIVQCKNEYST-EKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
++ P L TI QC + + +++K+ EGC++ + R++ FH+APG +Y
Sbjct: 166 QELLPPPLSTIAQCASTAAIIQEMKD---EGCKLTSAFQTVRLASEFHVAPGYNYLYKGW 222
Query: 234 HVHD--IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIK 290
H H+ I S N TH IR F + + + PLD T + +G+ Y
Sbjct: 223 HSHNTTILGSESKDLNLTHIIRSFRF-----NRVDGKFPLDNVTSIQTGKGSWRVVYSAD 277
Query: 291 IIPTI-----YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
I+ YE +D K G++F Y ++P+ ++ HL T+++ I
Sbjct: 278 IMDNTYTANKYELMDPPKFSS------GVYFRYAINPVSAIDYYDTEPFLHLCTRLLTVI 331
Query: 346 SGTYITFMLVDALL 359
F L+D+ L
Sbjct: 332 GAVLAAFRLLDSFL 345
>gi|449278843|gb|EMC86582.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Columba livia]
Length = 377
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 159/379 (41%), Gaps = 65/379 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + I++L ++ Y E VD S
Sbjct: 13 MKELDAFPKVPESYVETSATGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + IY+ + + P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMRCQYVGADVLDLAETMVASADALIYEPVV-FELSPQQKEWQRMLQVI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPPREDNS 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
+Q + C+I+G+L VN+V+G+FHI G + H H + +
Sbjct: 164 LQSP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHES 210
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
+N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 YNFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET 267
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y++S LMV +TE+ ++ I G +
Sbjct: 268 HQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFS 327
Query: 351 TFMLVDALLHSCVKKISKV 369
T +LH + +++V
Sbjct: 328 T----TGILHGFGRFVAEV 342
>gi|302853436|ref|XP_002958233.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
nagariensis]
gi|300256421|gb|EFJ40687.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
nagariensis]
Length = 337
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 91/396 (22%), Positives = 162/396 (40%), Gaps = 93/396 (23%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L L A+ KP ++TV+G VT+ L + L ++ +++ ++ VD +R
Sbjct: 4 KLSSLSAYVKPEAHLVQQTVHGALVTLCGILLAAMLFVHELGSFYRQHRVTQMSVDLARR 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSG--------EQHLHVEHNIYKRRLDLDGKPI-- 115
+ L I++D+ P I C L++D +D +G H+H I+K RLD GKPI
Sbjct: 64 NALTINIDLTFPAIPCAVLSIDVLDIAGTAENDASYAHHMH----IHKLRLDGAGKPIGK 119
Query: 116 ---QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
PQ + + +++ + N ++EA
Sbjct: 120 AEYHTPQSQQIMDTGAEQLVSVN--------------------------------IQEAM 147
Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
++ +V + E EGC +YG ++V RV+G H +S++
Sbjct: 148 QH----------LVDMEEEAEHH-------EGCHVYGTMDVKRVAGRLH------FSVHQ 184
Query: 233 VHVHDIQPYTSAAF------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN 286
V + P A N +H I+HL FG + PLDG V + F
Sbjct: 185 NMVFQMLPQLLGAHRIPKVANISHTIKHLGFGPHYPG---QLNPLDGYVRMVKGPPQSFK 241
Query: 287 YYIKIIPTIYERLDGSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSL 334
Y++K++PT Y G G +P + Y+LSP+++ I E+ SL
Sbjct: 242 YFLKVVPTEYYNRLGRVTETHQYSVTEYTQPLEPGYVPTLDVHYDLSPIVMTINERPPSL 301
Query: 335 GHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
H ++ + G + + D + V+ ++K++
Sbjct: 302 LHFVVRLCAVVGGAFAITRMTDRWVDWFVRLVTKLK 337
>gi|123425245|ref|XP_001306773.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121888365|gb|EAX93843.1| hypothetical protein TVAG_177510 [Trichomonas vaginalis G3]
Length = 353
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 155/360 (43%), Gaps = 59/360 (16%)
Query: 7 LKGLDAFTKPYED-FHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-- 63
++ D + K +D F+ +TV GG VTI+ +LF+ + + + +V + V S
Sbjct: 1 MRKFDIYPKVQDDSFNIRTVSGGVVTIITFLFMIIVAIKEGSSFHRVEIKQHAVVQSQYI 60
Query: 64 -RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
+++ I +DI V C L L+ +D+SG + +I ++RLD+ KP+++ +
Sbjct: 61 KESNEIEIFMDITV-AYPCHMLQLNVIDASGNPQPNARQDISRQRLDVHFKPLEQ----L 115
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
++ K V CG+C GA KCC TC ++ ++R + +P L
Sbjct: 116 ISDSDPKSVF-------------QTCGNCLGANVS--KCCLTCTDIANSFRQMEEFIPNL 160
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG------LSYSINHVHVH 236
+ QC + + K E C+I L + G I G ++Y + H
Sbjct: 161 QNVEQCNRDKKAIEDK----ETCRIVAKLNTHFTKGKLTIMAGGIVPTPVNYKFDLSHFG 216
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTI 295
D N TH I L FG +D + + PLD T + ++ M+NY I ++PTI
Sbjct: 217 D-------NVNLTHTIHTLRFG---RDFEGLKNPLDNYTNNQLKKSQFMYNYKIDLVPTI 266
Query: 296 Y----ERLDGSKLGGGDGGM----------PGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
++ + PGI F ++ +P+ + + +SL T++
Sbjct: 267 TNDVENQIPAHQYSASSSSKEITKMITKKHPGITFDFDTAPVAARFIVEKQSLSSFLTQL 326
>gi|148674216|gb|EDL06163.1| ERGIC and golgi 3, isoform CRA_c [Mus musculus]
Length = 261
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 91/172 (52%), Gaps = 26/172 (15%)
Query: 227 SYSINHVHVHDIQ------PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
S S+ + +HD+Q P N TH+I+HLSFG +D PLD T A +
Sbjct: 93 SNSLMCMVIHDLQSFGLDNPSDCLQINMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQ 149
Query: 281 GASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLM 324
+ MF Y++K++PT+Y ++DG L GD G+PG+F YELSP+M
Sbjct: 150 ASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMM 209
Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
VK+TEK +S H T + I G + L+D+L++ + I K+++G T
Sbjct: 210 VKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKTT 261
>gi|340709072|ref|XP_003393139.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Bombus terrestris]
Length = 392
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 92/374 (24%), Positives = 154/374 (41%), Gaps = 66/374 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LD F K E + +KT GG +I I+YLI + Y + D+ +
Sbjct: 12 VKELDGFPKVPELYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTDIDA 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V ++C ++ D +DS+ + +
Sbjct: 72 KLKINIDITV-AMTCSRISADVLDSTNQNMI----------------------------- 101
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL---D 183
G + E ED + + E R +V R + A+ EL
Sbjct: 102 ---------GHESLEQED-----TWWELTQEQRSHFEALKDVNSYLREEYHAIHELLWKS 147
Query: 184 TIVQCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
V +E + ++ C+I+G L VN+V+G+FHI G S S H+H + T
Sbjct: 148 NQVTLYSEMPKRTHQPSYPPNSCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIHILTFMT 207
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDG 301
+N TH I SFG PL+G A+ ++ Y+++++PT I L
Sbjct: 208 DKDYNFTHRINKFSFG---GPSPGIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLST 264
Query: 302 SKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
SK G G PGIFF Y++S L +K+T++ ++ K+ + G
Sbjct: 265 SKTYQYSVKDHQRPIDHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGG 324
Query: 348 TYITFMLVDALLHS 361
++T +V +++ S
Sbjct: 325 IFVTSGMVKSIVQS 338
>gi|383865060|ref|XP_003707993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Megachile rotundata]
Length = 392
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 150/373 (40%), Gaps = 66/373 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LD F K E + +KT GG +I I+YLI + Y + +D+ +
Sbjct: 12 VKELDGFPKVPEPYVDKTAVGGTFSIFTICIIAYLIIAETSYYLDSRLQFKFELDTDIDA 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C + D +DS+ + +
Sbjct: 72 KLKINIDITV-AMPCGRIGADVLDSTNQNMV----------------------------- 101
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL---D 183
G + E ED + + E R + R + A+ EL
Sbjct: 102 ---------GHESLEEED-----TWWELTQEQRSHFEALKHMNSYLREEYHAIHELLWKS 147
Query: 184 TIVQCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
V +E + ++ C+I+G L VN+VSG+FHI G S SI H+H
Sbjct: 148 NQVTLHSEMPKRSHQPSYPPNACRIHGSLNVNKVSGNFHITAGKSLSIPRGHIHISAFMI 207
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDG 301
+N TH I SFG PL+G A+ ++ Y+++++PT I L
Sbjct: 208 DRDYNFTHRINKFSFG---GPSPGVVHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLST 264
Query: 302 SKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
SK G G+PGIFF Y++S L +K+T++ ++ K+ + G
Sbjct: 265 SKTYQYSVKDYQRPIDHQKGSHGVPGIFFKYDMSALKIKVTQQRDTVSQFLVKLCATVGG 324
Query: 348 TYITFMLVDALLH 360
++T LV ++
Sbjct: 325 IFVTSGLVKNIVQ 337
>gi|431908425|gb|ELK12022.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Pteropus alecto]
Length = 377
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPVI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSSSTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ E S++ + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REEDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT AE+ MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAEDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|224093106|ref|XP_002193654.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Taeniopygia guttata]
Length = 377
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 151/361 (41%), Gaps = 61/361 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + I++L ++ Y E VD S
Sbjct: 13 MKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFMVYRDTWMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + IY+ + + P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMRCQYVGADVLDLAETMVASADGLIYEP-VPFELTPQQKELQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPPREDNS 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
+Q + C+I+G+L VN+V+G+FHI G + H H + +
Sbjct: 164 LQSP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHES 210
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
+N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 YNFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET 267
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y++S LMV +TE+ ++ I G +
Sbjct: 268 HQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFS 327
Query: 351 T 351
T
Sbjct: 328 T 328
>gi|350419069|ref|XP_003492060.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Bombus impatiens]
Length = 392
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 152/374 (40%), Gaps = 66/374 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LD F K E + +KT GG +I I+YLI + Y + D+ +
Sbjct: 12 VKELDGFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTDIDA 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V ++C ++ D +DS+ + +
Sbjct: 72 KLKINIDITV-AMTCSRISADVLDSTNQNMI----------------------------- 101
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL---D 183
G + E ED + + E R V R + A+ EL
Sbjct: 102 ---------GHESLEQED-----TWWELTQEQRSHFEALKNVNSYLREEYHAIHELLWKS 147
Query: 184 TIVQCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
V +E + ++ C+I+G L VN+V+G+FHI G S S H+H + T
Sbjct: 148 NQVTLYSEMPKRTHQPSYPPNSCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIHILTFMT 207
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDG 301
+N TH I SFG PL+G A+ ++ Y+++++PT I L
Sbjct: 208 DKDYNFTHRINKFSFG---GPSPGIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLST 264
Query: 302 SKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
SK G G PGIFF Y++S L +K+T++ ++ K+ + G
Sbjct: 265 SKTYQYSVKDHQRPIDHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGG 324
Query: 348 TYITFMLVDALLHS 361
++T ++ ++ S
Sbjct: 325 IFVTSGMIKNIVQS 338
>gi|156402826|ref|XP_001639791.1| predicted protein [Nematostella vectensis]
gi|156226921|gb|EDO47728.1| predicted protein [Nematostella vectensis]
Length = 413
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 159/385 (41%), Gaps = 86/385 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K E++ + T GG+V++V +LFI L+ + Y T VD+ S
Sbjct: 13 IKEFDAFPKIPENYQQTTASGGSVSLVSFLFIFVLVISEFWYYRATETKFSYEVDTDADS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++D+ + + C+ + D +D SG
Sbjct: 73 KLQINVDLTI-AMKCEDIDADVLDLSG--------------------------------- 98
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTC----NEVKEAYRYKKWALPEL 182
+T +L D K + T ++ T + E YR +L E+
Sbjct: 99 -----------STMQLGDSIKLEPTFFKLTPEQEMWLTMFRDFHFFYEGYR----SLGEM 143
Query: 183 DTI------VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS--YSINHVH 234
D K E S + + C++YG +VN+V+G+FHI G S + H H
Sbjct: 144 DEFNGDIPTYMPKREESKDAANTKEHDACRVYGSFKVNKVAGNFHITSGKSIHHPRGHAH 203
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
+ + P S N +H I LSFG ++ PLDG + E+ M+ YYI+++PT
Sbjct: 204 LSSMVPVES--LNFSHRIDMLSFGKRVPG---IVHPLDGEMQITEKRRMMYQYYIQVVPT 258
Query: 295 IYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+ L+ ++ G G+ G+FF Y++S +MV++ + S+
Sbjct: 259 SIKSLNSEEIKTNQYSMTQRIREISHDSGSHGIAGLFFKYDMSSIMVRVKHQHHSMVGFL 318
Query: 339 TKIMCNISGTYITFMLVDALLHSCV 363
++ + G + T +LH +
Sbjct: 319 VRLCGIVGGIFAT----SGMLHDFI 339
>gi|307206941|gb|EFN84785.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Harpegnathos saltator]
Length = 396
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 153/375 (40%), Gaps = 68/375 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + +KT GG +I FI+YLI + + + D+ +
Sbjct: 12 VKELDAFPKVPELYVDKTAVGGTFSIFTVCFIAYLIIAETSYFLDSRLQFKFETDTDIDA 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C + D +DS +E N++
Sbjct: 72 KLQINIDITV-AMPCGRIGADVLDS-------MEENVF---------------------- 101
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
G + E ED + + E R + R + A+ EL
Sbjct: 102 ---------GYDSLEQED-----TWWELTPEQRAHFEALKHMNSYLREEYHAIHELLWKS 147
Query: 187 QCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
YS E K ++ C+I+G L VN+V+G+FHI G S S+ H+H
Sbjct: 148 NQITLYS-EMPKRSYEPDYPPNACRIHGSLNVNKVAGNFHITTGKSLSVPRGHIHISAFM 206
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLD 300
T +N TH I SFG PL+G A+ ++ Y+++++PT I L
Sbjct: 207 TDRDYNFTHRINRFSFG---GPSPGIVHPLEGDEKIADYNMMLYQYFVEVVPTDIRTLLS 263
Query: 301 GSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
SK G G+PGIF Y +S L +K+T++ ++ K+ +
Sbjct: 264 TSKTYQYSVKDYQRPINHNEGSHGVPGIFIKYNMSALKIKVTQQRDTIFQFLVKLCATVG 323
Query: 347 GTYITFMLVDALLHS 361
G ++T L+ ++ S
Sbjct: 324 GIFVTSGLIKNIVQS 338
>gi|380787459|gb|AFE65605.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
gi|383418929|gb|AFH32678.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
gi|384941148|gb|AFI34179.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
Length = 377
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|62897157|dbj|BAD96519.1| CDA14 variant [Homo sapiens]
Length = 377
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 156/379 (41%), Gaps = 65/379 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
Q N C+I+G+L VN+V+G+FHI G + H H +
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 210
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
+N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT 267
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G +
Sbjct: 268 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 327
Query: 351 TFMLVDALLHSCVKKISKV 369
T +LH K I ++
Sbjct: 328 T----TGMLHGIGKFIVEI 342
>gi|332233018|ref|XP_003265701.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 isoform 1 [Nomascus leucogenys]
Length = 377
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|397517363|ref|XP_003828883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Pan paniscus]
gi|410259224|gb|JAA17578.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410298004|gb|JAA27602.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410334949|gb|JAA36421.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410334951|gb|JAA36422.1| ERGIC and golgi 2 [Pan troglodytes]
Length = 377
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|50959176|ref|NP_057654.2| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Homo sapiens]
gi|108935982|sp|Q96RQ1.2|ERGI2_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|22760017|dbj|BAC11037.1| unnamed protein product [Homo sapiens]
gi|38173702|gb|AAH00887.2| ERGIC and golgi 2 [Homo sapiens]
gi|78070782|gb|AAI07795.1| ERGIC and golgi 2 [Homo sapiens]
gi|119616998|gb|EAW96592.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
gi|119617000|gb|EAW96594.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
gi|167773797|gb|ABZ92333.1| ERGIC and golgi 2 [synthetic construct]
Length = 377
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 156/379 (41%), Gaps = 65/379 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
Q N C+I+G+L VN+V+G+FHI G + H H +
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 210
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
+N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT 267
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G +
Sbjct: 268 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 327
Query: 351 TFMLVDALLHSCVKKISKV 369
T +LH K I ++
Sbjct: 328 T----TGMLHGIGKFIVEI 342
>gi|75075986|sp|Q4R5C3.1|ERGI2_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|67970720|dbj|BAE01702.1| unnamed protein product [Macaca fascicularis]
Length = 377
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAV-FDLSPQQKEWQRMLQ-- 128
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T + L++ + K A++ ALP
Sbjct: 129 ----------LTQSRLQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|348562091|ref|XP_003466844.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Cavia porcellus]
Length = 377
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 156/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPQSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDVAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREANSSQSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|323310251|gb|EGA63441.1| Erv46p [Saccharomyces cerevisiae FostersO]
Length = 189
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 91/184 (49%), Gaps = 20/184 (10%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDAF K ED +T GG +T+ C L +L+ + + V T +L VD R +
Sbjct: 6 LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWXQFNSVVTRPQLVVDRDRHA 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL +++D+ P++ CD + LD +D SGE L + + RL+ +G+P+ + + V
Sbjct: 66 KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125
Query: 126 VKKKKVTTENGTTTTEL-EDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
NG T + DPN CG CYGA+ +++ CC C+ V+ AY
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176
Query: 176 KWAL 179
WA
Sbjct: 177 GWAF 180
>gi|71409118|ref|XP_806922.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870803|gb|EAN85071.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 310
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 134/302 (44%), Gaps = 38/302 (12%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
+++ +D F KP ED+ Y GA V++V + I L+ +VC Y + + T EL VD
Sbjct: 22 KKVAAVDLFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVCSYIFGRDAYTTELSVD 81
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+S +++ +LDI P + C ++LD +D +G +L+V N++K +D G
Sbjct: 82 TSLSTEVDFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNLFKTPVDAQGN-------- 133
Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETET------RKCCNTCNEVKEA 171
+ ++ E G+ + +D P CG C+ E + +CCNTCN+V A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPSSPQFCGRCFINEHQVSMMENKNRCCNTCNDVLNA 192
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
Y + P+ + + QC E S GC G L V + G AP
Sbjct: 193 YDQQGLPRPQKNEVEQCIYELS------RINPGCNYKGTLIVKKFGGRLVFAP--KRVPG 244
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
+ D+ F+++H I LS G + RR PL+G A+ + Y+
Sbjct: 245 GFLIRDV-----MRFDSSHIINKLSIGDEHVTRFSRRGVQHPLNGHEFDAQRRFTEIRYF 299
Query: 289 IK 290
+
Sbjct: 300 FE 301
>gi|15010925|gb|AAK77355.1|AF302767_1 PTX1 protein [Homo sapiens]
Length = 377
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 155/379 (40%), Gaps = 65/379 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L + Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMKFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
Q N C+I+G+L VN+V+G+FHI G + H H +
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 210
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
+N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISAYT 267
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G +
Sbjct: 268 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 327
Query: 351 TFMLVDALLHSCVKKISKV 369
T +LH K I ++
Sbjct: 328 T----TGMLHGIGKFIVEI 342
>gi|426225295|ref|XP_004006802.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Ovis aries]
Length = 377
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 157/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFAVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|123430864|ref|XP_001307985.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121889642|gb|EAX95055.1| hypothetical protein TVAG_428580 [Trichomonas vaginalis G3]
Length = 358
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 162/387 (41%), Gaps = 54/387 (13%)
Query: 7 LKGLDAFTK-PYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
L+ LD F K +++ T YG V+I+ + S LI +V Y +L V S
Sbjct: 5 LEFLDLFDKNTHDELKMTTKYGSVVSILLTVVSSILIITNVALYINPRIYRDLSVKPSVT 64
Query: 66 SK---LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S + I L I + + C +L +D +DS G Q ++++ + RRL+ G+ I
Sbjct: 65 SASETINISLTIKI-AMPCYFLHIDYMDSLGFQRSYIKNTVTFRRLNNLGRVI------- 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
G T L D C CY T +CCN+C +V+ + + +
Sbjct: 117 -------------GYTNDTLSD--VCEPCYNLSTNPDECCNSCLKVQLLSLMQNKPV-DF 160
Query: 183 DTIVQCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
C N EK N + +E C + G L VNR+ GSFHIAPG + ++HD+
Sbjct: 161 SKYRVCNNY---EKKPNVSLSEKCLVKGKLTVNRIPGSFHIAPGTNVP-QSAYLHDLSS- 215
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG--TVAKAEEGASMFNYYIKIIPTIYERL 299
+ TH I+ L FG + PLD + + + Y + I P I+ R
Sbjct: 216 MQMFHDMTHSIQRLRFGPHIP---RTSNPLDNFKSFQQIPTHDRTYFYNLLITPVIFYRD 272
Query: 300 DGSKLGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
L G + G PG+FF Y+ +P + ++ ++ + I
Sbjct: 273 GVEYLKGYEYTAFSEAIDTFQLFGISPGLFFQYQFTPYTIVVSANRQNFLQFISNTFGVI 332
Query: 346 SGTYITFMLVDALLHSCVKKISKVEIG 372
SG Y ++D L+ + + VEIG
Sbjct: 333 SGIYACLSILDKLIGEDIGS-NVVEIG 358
>gi|301783747|ref|XP_002927289.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Ailuropoda melanoleuca]
Length = 377
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMAILTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|355686514|gb|AER98081.1| ERGIC and golgi 2 [Mustela putorius furo]
Length = 365
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 150/356 (42%), Gaps = 60/356 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ ER G G+ GIF Y+LS LMV +TE+ + + +C I G
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVR-LCGIVG 323
>gi|327273481|ref|XP_003221509.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Anolis carolinensis]
Length = 377
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 89/362 (24%), Positives = 149/362 (41%), Gaps = 63/362 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 MKELDAFPKVPESYIETSASGGTVSLIAFTTMALLTIMEFTVYRDTWMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + E + + + + P+Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYIGADVLDLA-ETMVASADGLSYEPVIFELSPLQREWQRMLQII 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP +
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKTAFKSASTALPPRE--- 160
Query: 187 QCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
NT + C+I+G+L VN+V+G+FHI G + H H +
Sbjct: 161 -----------DNTLQPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHE 209
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------- 294
++N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 210 SYNFSHRIDHLSFGELIPG---IINPLDGTEKVASDHNQMFQYFITVVPTKLHTHKISAE 266
Query: 295 -----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
+ ER G G+ GIF Y++S LMV +TE+ ++ I G +
Sbjct: 267 THQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIF 326
Query: 350 IT 351
T
Sbjct: 327 ST 328
>gi|345441780|ref|NP_001230861.1| ERGIC and golgi 2 [Sus scrofa]
Length = 377
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASTDGLVYEPAI-FDLSPQQKEWQRMLQRI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|21312962|ref|NP_080444.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
isoform 1 [Mus musculus]
gi|81903633|sp|Q9CR89.1|ERGI2_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|12835992|dbj|BAB23451.1| unnamed protein product [Mus musculus]
gi|12843481|dbj|BAB25998.1| unnamed protein product [Mus musculus]
gi|12844310|dbj|BAB26318.1| unnamed protein product [Mus musculus]
gi|13905198|gb|AAH06895.1| ERGIC and golgi 2 [Mus musculus]
gi|17390417|gb|AAH18188.1| ERGIC and golgi 2 [Mus musculus]
gi|20072972|gb|AAH26558.1| ERGIC and golgi 2 [Mus musculus]
gi|26326029|dbj|BAC26758.1| unnamed protein product [Mus musculus]
gi|40353061|gb|AAH64749.1| ERGIC and golgi 2 [Mus musculus]
gi|74191314|dbj|BAE39481.1| unnamed protein product [Mus musculus]
gi|148678796|gb|EDL10743.1| ERGIC and golgi 2, isoform CRA_c [Mus musculus]
Length = 377
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 154/378 (40%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|149048933|gb|EDM01387.1| rCG29652, isoform CRA_c [Rattus norvegicus]
Length = 377
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 154/378 (40%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSSSTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|12841082|dbj|BAB25070.1| unnamed protein product [Mus musculus]
Length = 377
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 154/378 (40%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|410964074|ref|XP_003988581.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Felis catus]
Length = 377
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPVI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSDSTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|291392459|ref|XP_002712727.1| PREDICTED: PTX1 protein [Oryctolagus cuniculus]
gi|291416214|ref|XP_002724342.1| PREDICTED: PTX1 protein-like [Oryctolagus cuniculus]
Length = 377
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPHQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP +
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSPSTALPPRED-- 161
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ L++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 162 --------DSLQSP--DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I I+PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITIVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|7341109|gb|AAF61208.1|AF216751_1 CDA14 [Homo sapiens]
Length = 378
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 159/382 (41%), Gaps = 70/382 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDI-QPYT 242
Q N C+I+G+L VN+V+G+FHI G + + H H+ QP+
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCQPWN 210
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
F +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 LTIF--SHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKIS 265
Query: 295 -------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G
Sbjct: 266 ADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGG 325
Query: 348 TYITFMLVDALLHSCVKKISKV 369
+ T +LH K I ++
Sbjct: 326 IFST----TGMLHGIGKFIVEI 343
>gi|417399911|gb|JAA46936.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein 2 isoform 1 [Desmodus rotundus]
Length = 376
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 154/378 (40%), Gaps = 64/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + E + + + + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLA-ETMVASANGLVYEPVIFDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ T L++ + K A++ P D
Sbjct: 131 Q------------TRLQEEHSLQDVL---------------FKSAFKSSTALPPREDDSS 163
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
Q + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 164 QPP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 210
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 NFSHRIDHLSFGELVPG---IVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADTH 267
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 268 QFSVTERERVVNHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 327
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 328 ----TGMLHGIGKFIVEI 341
>gi|403269250|ref|XP_003926667.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Saimiri boliviensis boliviensis]
Length = 377
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LD F K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDVFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSYGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|9963759|gb|AAG09679.1|AF183410_1 cd002 protein [Homo sapiens]
Length = 387
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 159/381 (41%), Gaps = 68/381 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 22 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 82 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 140 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 172
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTS 243
Q N C+I+G+L VN+V+G+FHI G + + H H+ T
Sbjct: 173 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCS-TM 218
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
++N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 219 ESYNFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISA 275
Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G
Sbjct: 276 DTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGI 335
Query: 349 YITFMLVDALLHSCVKKISKV 369
+ T +LH K I ++
Sbjct: 336 FST----TGMLHGIGKFIVEI 352
>gi|57106442|ref|XP_534852.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 isoform 1 [Canis lupus familiaris]
Length = 377
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGEVVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|149713890|ref|XP_001502984.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Equus caballus]
Length = 377
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYDVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPVI-FDLSPQQKEWQRMLQVI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|12846043|dbj|BAB27008.1| unnamed protein product [Mus musculus]
Length = 377
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 153/378 (40%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I +
Sbjct: 329 ----TGMLHGIGKFIVDI 342
>gi|417399168|gb|JAA46612.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein 2 isoform 1 [Desmodus rotundus]
Length = 337
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 146/356 (41%), Gaps = 61/356 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + E + + + + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLA-ETMVASANGLVYEPVIFDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ T L++ + K A++ P D
Sbjct: 131 Q------------TRLQEEHSLQDVL---------------FKSAFKSSTALPPREDDSS 163
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
Q + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 164 QPP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 210
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 211 NFSHRIDHLSFGELVPG---IVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADTH 267
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ ER G G+ GIF Y+LS LMV +TE+ + + +C I G
Sbjct: 268 QFSVTERERVVNHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVR-LCGIVG 322
>gi|395537817|ref|XP_003770886.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Sarcophilus harrisii]
Length = 378
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 156/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPVSYVETSAIGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCHYVGADVLDLAETMVAPADGLVYEPVI-FDLSPQQREWQRMLQTI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S + + C+I+G+L VN+V+G+FHI G + H H + ++
Sbjct: 159 --REDNSLQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITVVPTKLNTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ ++ I G + T
Sbjct: 269 QFSVTERERAINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|344267803|ref|XP_003405755.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Loxodonta africana]
Length = 377
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 157/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A + ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAIKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|432954843|ref|XP_004085560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Oryzias latipes]
Length = 122
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 70/110 (63%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DA+ K EDF KT G VTI+ + + L ++ + EL+VD+SRG
Sbjct: 6 KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYFLTKEVHPELYVDTSRG 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
KL I++D++ P + C YL++DA+D +GEQ L VEHN++KRRLD D K +
Sbjct: 66 DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKRRLDKDLKAV 115
>gi|395839293|ref|XP_003792530.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Otolemur garnettii]
Length = 377
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 155/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKTASTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREDNPSQSPDACRISGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|300123494|emb|CBK24766.2| unnamed protein product [Blastocystis hominis]
Length = 235
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/222 (31%), Positives = 106/222 (47%), Gaps = 14/222 (6%)
Query: 90 DSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNAVKKKKVTTEN----GTTTTELED 144
D+ G +E+ I K LD++G PI + K +V V K+ EN ++D
Sbjct: 8 DALGNDRADIENEILKTNLDVNGNPIGKTDKSQVTVTVPTKEEVLENTKHDDDEIVVIDD 67
Query: 145 PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV-QCKNEYSTEKLKNTFTE 203
+CG C+GA+ E +CCNTC E+ AYR K W + + QC +K KN
Sbjct: 68 KKECGDCFGAK-EKSECCNTCEELIAAYRKKNWDVDRIKAQAPQCAGFNYLQKWKNGVER 126
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC++ G L + +V G I PG IN + + + + N TH I H S G +
Sbjct: 127 GCRLEGKLSITKVQGHVFIIPG---RINDLLSNSEIRQIANSLNVTHTIHHFSLGEAIP- 182
Query: 264 DDERRKP-LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
E++ P +D A + ASM+ Y++ IPT Y G +L
Sbjct: 183 --EQKNPFVDHRGVMAVDHASMYQYFVNAIPTTYINKSGKEL 222
>gi|126339088|ref|XP_001363644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Monodelphis domestica]
Length = 378
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 149/360 (41%), Gaps = 59/360 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPVSYVETSASGGTVSLIAFTTMALLTIMEFSVYRDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I+++I V + C Y+ D +D + + +Y+ + D P Q + ++ +
Sbjct: 73 KLRININITV-AMKCQYVGADVLDLAETMVAAADGLVYEPVI-FDLSPQQREWQRMLQTI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S + + C+I+G+L VN+V+G+FHI G + H H + ++
Sbjct: 159 --REDNSLQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIANDHNQMFQYFITVVPTKLNTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ ++ I G + T
Sbjct: 269 QFSVTERERAINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328
>gi|387015774|gb|AFJ50006.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2-like
[Crotalus adamanteus]
Length = 377
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 158/379 (41%), Gaps = 65/379 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKIPDSYIETSTSGGTVSLIAFTTMALLTIMEFMVYRDTWMKYEYEVDKDYTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C ++ D +D + + +Y+ + + P+Q + ++ +
Sbjct: 73 KLRINVDITV-AMKCQHIGADVLDLAETMVATADGLVYEPVI-FELSPLQREWQRILQNI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
+ + L++ + K A++ ALP D
Sbjct: 131 QSR------------LQEEHSLQDII---------------FKSAFKSASTALPPREDNP 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
VQ + C+I+G+L VN+V+G+FH+ G + H H + +
Sbjct: 164 VQS-------------ADACRIHGHLYVNKVAGNFHVTVGKAIPHPRGHAHLAALVSHES 210
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
+N +H I HLSFG + PLDGT A + MF Y++ ++PT
Sbjct: 211 YNFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFVTVVPTKLQTHKISAET 267
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y++S LMV +TE+ ++ + G +
Sbjct: 268 HQFAVTERERIINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIVGGIFS 327
Query: 351 TFMLVDALLHSCVKKISKV 369
T +LHS + I ++
Sbjct: 328 T----TGILHSIGRFIVEI 342
>gi|82074366|sp|Q5EHU7.1|ERGI2_GECJA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
Length = 377
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASTDGLVYEPAI-FDLSPQQKEWQRMLQRI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K ++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSTFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|115497448|ref|NP_001069031.1| endoplasmic reticulum-Golgi intermediate compartment protein 2 [Bos
taurus]
gi|113912114|gb|AAI22616.1| ERGIC and golgi 2 [Bos taurus]
gi|296487341|tpg|DAA29454.1| TPA: PTX1 protein [Bos taurus]
Length = 377
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 155/378 (41%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q + ++
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQREWQRMLQLF 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K ++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVV---------------FKSVFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S++ + C+I G+L VN+V+G+FHI G + H H ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I HLSFG + PLDGT A + MF Y+I I+PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITIVPTKLQTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G + T
Sbjct: 269 QFAVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|148224086|ref|NP_001087666.1| ERGIC and golgi 2 [Xenopus laevis]
gi|51950053|gb|AAH82468.1| MGC81917 protein [Xenopus laevis]
Length = 377
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 150/367 (40%), Gaps = 59/367 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + G V+++ + + L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASRGTVSLMAFSIMGILTIMEFLVYRDTRMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ +++DI V + C Y+ D +D + E + + + + D P Q + ++ +
Sbjct: 73 KIRLNIDITV-AMKCQYVGADVLDLA-ETMVTSAQGLAYQPVIFDLSPQQRQWQRMLQQI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A R +LP
Sbjct: 131 QGR------------LQEEHSLQDLL---------------FKSAMRTSVLSLP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E S + N C+I+G+L++N+V+G+FHI G + H H + ++
Sbjct: 158 --PREDSPMEQPN----ACRIHGHLDINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I H SFG L PLDGT AE+ M+ Y+I I+PT
Sbjct: 212 NFSHRIDHFSFGEPL---PAIINPLDGTEKIAEDSNQMYQYFITIVPTKLNTNKVYCDTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y++S LMV +TE L ++ I G + T
Sbjct: 269 QFSVTERERVINHATGSHGVSGIFMKYDISSLMVTVTEDHMPLWKFLVRLCGIIGGIFTT 328
Query: 352 FMLVDAL 358
++ L
Sbjct: 329 TGMIHGL 335
>gi|89272944|emb|CAJ82943.1| ptx1 [Xenopus (Silurana) tropicalis]
Length = 377
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 150/367 (40%), Gaps = 59/367 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + G V+++ + + L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASRGTVSLMAFSIMGILTIMEFLVYRNTRMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
K+ +++DI V + C Y+ D +D + + +Y+ + + P Q + ++ +
Sbjct: 73 KIRLNIDITV-AMKCQYVGADVLDLAETMVTSAQGLVYEPVI-FELSPQQRLWQRMLQQI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A R +LP
Sbjct: 131 QGR------------LQEEHSLQDLL---------------FKSAMRTSVMSLPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + TE C+I+G+LE+N+V+G+FHI G + H H + ++
Sbjct: 159 --REDSPTEP-----PNACRIHGHLEINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I H SFG L PLDGT AE+ M+ Y+I I+PT
Sbjct: 212 NFSHRIDHFSFGEPLPGI---VNPLDGTEKIAEDSNQMYQYFITIVPTKLHTNKVDCDTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y++S LMV +TE L ++ + G + T
Sbjct: 269 QFSVTERERVINHASGSHGVSGIFMKYDISSLMVMVTEDHMPLWKFLVRLCGIVGGIFTT 328
Query: 352 FMLVDAL 358
++ L
Sbjct: 329 TGMIHGL 335
>gi|408393109|gb|EKJ72376.1| hypothetical protein FPSE_07400 [Fusarium pseudograminearum CS3096]
Length = 376
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 159/362 (43%), Gaps = 61/362 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + ++T GG T+ + LI ++ +++ + + V++
Sbjct: 23 VAAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGELGRWWRGAESHNFEVEAGVSR 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
++ I+LDIVV +SCD + ++ D+SG++ LH + ++ + D K + + ++
Sbjct: 83 EMQINLDIVV-KMSCDDIHVNVQDASGDRIMAAKRLHTDKTLWGQWAD--NKGVHKLGRD 139
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ +V T G + ED +G E + + V + KWA
Sbjct: 140 -----DQGRVNTGQGYNDPKYED-----EGFGEE-------HVHDIVALGKKRAKWA--- 179
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
T + + + C+IYG L++N+V G FHI A G Y + H+
Sbjct: 180 -----------KTPRFRGN-ADSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL----- 222
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ FN +H I LS+G PLDGTV A+ F YY+ ++PT+Y
Sbjct: 223 -DHSKFNFSHIISELSYGPFYP---SLENPLDGTVNTADGNFHKFQYYLSVVPTVYSVNS 278
Query: 301 GSKLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
S L D +PGIFF Y++ P+++ + E + L+ KI+ ISG
Sbjct: 279 RSILTNQYAVTEQSKAVDDRYIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVL 338
Query: 350 IT 351
+
Sbjct: 339 VA 340
>gi|146079597|ref|XP_001463805.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398011570|ref|XP_003858980.1| hypothetical protein, conserved [Leishmania donovani]
gi|134067893|emb|CAM66174.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322497192|emb|CBZ32265.1| hypothetical protein, conserved [Leishmania donovani]
Length = 368
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 86/383 (22%), Positives = 155/383 (40%), Gaps = 63/383 (16%)
Query: 13 FTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG--SKLP 69
F KP ED+ E+T +G +++ F+ +L+ + Y + + V +G +P
Sbjct: 2 FPKPKEDYQREQTRWGALLSVFTVFFVIFLVLWEGAAYLRGRDAYDTDVSLDKGLSEDMP 61
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
+H D++ P + C+ L++D VD++G + ++K LDG EVV K
Sbjct: 62 VHFDVLFPFMPCNRLSIDVVDTTGMAKFNYTGRLHKLPTALDG--------EVVYKGSLK 113
Query: 130 KVTTENGTTTTELEDPNKCGSCY-----GAETETR-----KCCNTCNEVKEAYRYKKWAL 179
+ +N T E KC C G E R KCC+TC V + Y+ +
Sbjct: 114 DL--DNEMETREGRAGKKCRPCPPSAFDGVPAEVRSAAELKCCDTCESVLDLYKELGKGI 171
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSINHVHVH 236
P + I QC + GC + G L++ +V + P G YS+ V
Sbjct: 172 PGTEYIPQCLEQLYQR------ASGCTVMGSLDLKKVPVTVIFGPRRTGHFYSLKDV--- 222
Query: 237 DIQPYTSAAFNTTHHIRHLSFG---IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
+T+H IR L G ++ + +PL G + ++ S Y +K++P
Sbjct: 223 -------IRLDTSHFIRKLRIGDETVERFSKNGVAEPLSGHKSSSKT-YSETRYLVKVVP 274
Query: 294 TIYERLDGSK-----------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
T Y + + G G +P + F +E +P+ V + + H
Sbjct: 275 TTYRKTKTKNAKASTYEYSAQWSRRTIVVGFAGAVPAVLFEFEPAPIQVNNVFERQPFSH 334
Query: 337 LWTKIMCNISGTYITFMLVDALL 359
++ + G ++ +D ++
Sbjct: 335 FLVQLCGIVGGLFVVLGFIDNVV 357
>gi|398412138|ref|XP_003857398.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
gi|339477283|gb|EGP92374.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
Length = 407
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 160/388 (41%), Gaps = 86/388 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + ++T GG T+V + L +V ++ +TT V+ G
Sbjct: 23 IKAFDAFPKTKPSYTQQTSSGGVWTLVLIALSTVLAYTEVTRWWSGTTTHSFSVEQGVGH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I++D+VV + C+ + ++ D++G++ L V AV
Sbjct: 83 DLQINVDLVV-AMKCEDIHINVQDAAGDRVL------------------------VDKAV 117
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+ T L N GA + R + N + +A Y++ + + ++
Sbjct: 118 KEDP-------TLFRLWGENHGAHTLGASLKDRLEVD-GNRIVQA-EYEEEDVHDYLSLA 168
Query: 187 QC--KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
+ + +Y+ +N + C+IYG + N+V G FHI A G Y H+
Sbjct: 169 RGGKRYQYTPRTPRNEEADSCRIYGSMHSNKVQGDFHITARGHGYMAYSQHL------DH 222
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY------- 296
+AFN +HHI LSFG + PLD T A+ E F YY+ ++PTIY
Sbjct: 223 SAFNFSHHINELSFGPYYP---KLVNPLDSTYARTEAHFHKFQYYLSVVPTIYTVDVNAL 279
Query: 297 ERLDG--SKLGGGDGGM-------------------------------PGIFFSYELSPL 323
+R+D GD G+ PGIFF Y++ PL
Sbjct: 280 KRMDSKYETPSSGDDGLNQHPRRVTQHSVFTNQYAVTEQSHSVPENHVPGIFFKYDIEPL 339
Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ I E+ S+ L +I+ +SG +
Sbjct: 340 QLTIAEEWTSVPALLLRIVNVVSGLLVA 367
>gi|321479391|gb|EFX90347.1| hypothetical protein DAPPUDRAFT_309719 [Daphnia pulex]
Length = 369
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 81/378 (21%), Positives = 151/378 (39%), Gaps = 75/378 (19%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
LDAF K + + EKT GG ++++C I YL+ +V D+ D +++
Sbjct: 15 LDAFPKVPDTYKEKTTSGGTISLICIFIILYLVFSEVNDFIHSGVKFHFVPDDDLDTRMD 74
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
+++D+ V + C Y+ D +DS+G+ + H
Sbjct: 75 LNVDMTV-AMPCRYIGADVLDSTGQSVVSFGH---------------------------- 105
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
+T EN + + R + R K + +L +
Sbjct: 106 -LTEEN--------------TWFELSPRQRNHFEAAQRLNSILRDKPHGIQQLLWKSGYQ 150
Query: 190 NEY----STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN-HVHVHDIQPYTSA 244
N + S E + + ++ C+++G L++ +V+G+FHI G + H H
Sbjct: 151 NLFGEMPSREFVPSQPSDACRLHGTLQLTKVAGNFHITAGKVLPLPMRAHAHLSPMMDDE 210
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----- 299
FN +H I SFG +PL+G ++GA +F Y++ +PT E L
Sbjct: 211 RFNYSHRIDKFSFG----HSSTLIQPLEGDEVITDKGAMLFQYFVTAVPTEIESLVSASS 266
Query: 300 --------------DGSKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
+ S++ G G G+PGI+F Y+++PL V++ + L ++
Sbjct: 267 GIHGSMKTWQYSVRNQSRIIGHQKGSHGIPGIYFKYDVAPLRVRVVPDAPPLLRFVLRLC 326
Query: 343 CNISGTYITFMLVDALLH 360
+ G Y + +V ++
Sbjct: 327 AIVGGVYTSAGIVHKVIQ 344
>gi|303278158|ref|XP_003058372.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459532|gb|EEH56827.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 399
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 133/311 (42%), Gaps = 79/311 (25%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + R+K DA+ KP +KT GGAVT+ L ++++ ++ Y V
Sbjct: 1 MGLAARVKLFDAYHKPERHLTKKTAAGGAVTLSSLLLMAFVFVFELRSYLATERVTTTGV 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI----- 115
D +R L I++D+ ++ C L+LDA+D+SG+ V ++K R+D G+ I
Sbjct: 61 DVTRDEMLAINVDVTFTSLPCQTLSLDALDASGKHDQDVGGELHKTRVDRFGRAIATYES 120
Query: 116 -QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
+E VVN + TEL YG ETE K +E+K A
Sbjct: 121 HRENDDGVVNLI-------------TEL--------FYGFETEGHKAH--VDEIKTAL-- 155
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
+ EGC+++G L+V RV+G+FH++ VH
Sbjct: 156 -------------------------SAGEGCRVHGRLKVQRVAGNFHVS---------VH 181
Query: 235 VHDIQPYTSAAF------NTTHHIRHLSFGIKLQDDDERRKPLDG---TVAKAEEGASMF 285
D + A F N +H + LSFG ++ PL G T A E + +
Sbjct: 182 GEDARTL-RATFEHPRNVNMSHAVHRLSFGKSFPRKED---PLSGFTRTTRHANETGT-Y 236
Query: 286 NYYIKIIPTIY 296
Y++K++P Y
Sbjct: 237 KYFLKVVPVTY 247
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 32/55 (58%)
Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
G +P ++F Y+LSP+ V I++ KS GH + + + G Y L+D ++H +
Sbjct: 337 GSLPAVYFIYDLSPIAVTISDARKSFGHFLARTVAGVGGAYAIAGLIDRMIHHSL 391
>gi|12857352|dbj|BAB30984.1| unnamed protein product [Mus musculus]
Length = 377
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 153/378 (40%), Gaps = 63/378 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
N +H I H SFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHCSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVMPTKLHTYKISADTH 268
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ ER G G+ GIF Y+LS LMV +TE+ + ++ I G + T
Sbjct: 269 QFSVTERESIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328
Query: 352 FMLVDALLHSCVKKISKV 369
+LH K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342
>gi|449303002|gb|EMC99010.1| hypothetical protein BAUCODRAFT_120300 [Baudoinia compniacensis
UAMH 10762]
Length = 387
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 153/369 (41%), Gaps = 67/369 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ DAF K + +KT GG T+V +L +V ++ TT E V+ G
Sbjct: 23 VRAFDAFPKTKPSYTQKTNNGGIWTVVLVCASLWLAWTEVMRWWWGHTTHEFSVEQGVGH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I+LD+VV + CD L ++ D+SG++ L G+ +Q
Sbjct: 83 DLQINLDVVV-KMRCDDLHVNVQDASGDR-------------ILAGETLQRDATLWSQWG 128
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYG-----AETETRKCCNTCNEVKEAYRYKKWALPE 181
+K+ T T LE S YG AE + + K ++KK P
Sbjct: 129 ANRKLHTLGATRDERLEMTGY--SSYGDAREYAEDDVHDYLGAASSTK---KFKK--TPR 181
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
+ K+ + C+IYG + N+V G FHI + H ++ Q
Sbjct: 182 VP--------------KSKEADSCRIYGSMHGNKVQGDFHIT-----ARGHGYMEFGQHL 222
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY----- 296
++FN +HHI LSFG PLD T+A E F YY+ ++PTIY
Sbjct: 223 EHSSFNFSHHINELSFGPFYP---SLTNPLDNTLAATEFNFFKFQYYLSVVPTIYTTNAK 279
Query: 297 --ERLDGSKLGGG------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
++ S + + +PG+F Y++ P+++ I E+ S L+ +++
Sbjct: 280 ALRKITKSTVFTNQYAVTEQSRPVPENQVPGVFVKYDIEPILLMIAEERNSFPALFIRLV 339
Query: 343 CNISGTYIT 351
ISG +
Sbjct: 340 NVISGVLVA 348
>gi|123435131|ref|XP_001308935.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121890639|gb|EAX96005.1| hypothetical protein TVAG_369150 [Trichomonas vaginalis G3]
Length = 353
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 137/311 (44%), Gaps = 44/311 (14%)
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S S + I L I+V + C YL D DS G +V + + R D + I
Sbjct: 65 SGDSLVNISLGILV-DLPCYYLHFDLTDSLGFTQNYVNNTLRFYRYDFNYSLI------- 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
G T + D KC C+ + CCN C+ +KE Y+ PE
Sbjct: 117 -------------GLTNQTMVD--KCYPCFKVQFHNYTCCNGCDRLKENYKLNNLT-PEP 160
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH-VHVHD-IQP 240
+ QC+ + + +E C + G + VNRV GSFHIA G + +N H+H+ +
Sbjct: 161 EKWPQCQ---TNARPDINSSEKCLVKGKVSVNRVRGSFHIAAGRNIYLNDGSHIHELLDD 217
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM-FNYYIKIIPTIY--- 296
+ + AF +H I H+ FG ++ ++PL V +A+E ++ +Y + + P I+
Sbjct: 218 FPNLAF--SHAIEHIRFGPRII---TAKQPLQNLVMRAKENLTVTHDYSLLVTPVIFVAD 272
Query: 297 -ERLDGS-----KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ++ S L PGI+F Y+ +P ++IT S+S +G Y
Sbjct: 273 NQFIEKSFEYTVYLHPVQDKDPGIYFDYQFTPYTIQITWISRSFRGFLISTAGFTAGLYA 332
Query: 351 TFMLVDALLHS 361
++D L HS
Sbjct: 333 IASIIDQLFHS 343
>gi|46137745|ref|XP_390564.1| hypothetical protein FG10388.1 [Gibberella zeae PH-1]
Length = 376
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 159/362 (43%), Gaps = 61/362 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + ++T GG T+ + LI ++ +++ + + V++
Sbjct: 23 VAAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGELGRWWRGAESHNFEVEAGVSR 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
++ I+LDIVV ++CD + ++ D+SG++ LH + ++ + D K + + ++
Sbjct: 83 EMQINLDIVV-KMNCDDIHVNVQDASGDRIMAAKRLHTDKTLWGQWAD--NKGVHKLGRD 139
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ +V T G + ED +G E + + V + KWA
Sbjct: 140 -----DQGRVNTGQGYNDPKYED-----EGFGEE-------HVHDIVALGKKRAKWA--- 179
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
T + + + C+IYG L++N+V G FHI A G Y + H+
Sbjct: 180 -----------KTPRFRGN-ADSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL----- 222
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+ FN +H I LS+G PLDGTV A+ F YY+ ++PT+Y
Sbjct: 223 -DHSKFNFSHIISELSYGPFYP---SLENPLDGTVNTADGNFHKFQYYLSVVPTVYSVNS 278
Query: 301 GSKLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
S L D +PGIFF Y++ P+++ + E + L+ KI+ ISG
Sbjct: 279 RSILTNQYAVTEQSKAVDDRYIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVL 338
Query: 350 IT 351
+
Sbjct: 339 VA 340
>gi|71013590|ref|XP_758634.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
gi|46098292|gb|EAK83525.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
Length = 415
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 154/360 (42%), Gaps = 63/360 (17%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+++ DAF K + +++ GG +TI+ + + L+ ++ Y VDS
Sbjct: 13 KIRQFDAFPKTQSIYTQRSSKGGLLTIIATVTLLALLWTELSSYLYGERGYSFSVDSRLQ 72
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
S + I++D+ V + C YL +D D+ G++ LHV + + + DG
Sbjct: 73 STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DG------------- 113
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
TT E+ ++ + E +K N K YR K P
Sbjct: 114 ------------TTFEIGHADRLDALPMQEVSVQKTINQARR-KPVYRKK----PRNK-- 154
Query: 186 VQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
+ + + +K + +G C+IYG +EV RV+G+ HI ++ H ++ ++
Sbjct: 155 -KFSRQVAFQKTAHIVPDGPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SVEHTDH 207
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
N +H I SFG E +PLD +V E+ ++F Y++ +PT++ G K
Sbjct: 208 KLMNLSHVIHEFSFGPYF---PEISQPLDSSVETTEKHFTVFQYFVSAVPTLFIDARGRK 264
Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
L G+PGIF Y++ PL + I ++S SL ++ + G ++
Sbjct: 265 LHTHQYSVTDYTRQIEHGKGVPGIFIKYDIEPLQMTIRQRSTSLFQFLVRLAGVLGGVWV 324
>gi|449479952|ref|XP_004155757.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 266
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 139/301 (46%), Gaps = 58/301 (19%)
Query: 83 YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTEL 142
+L++DA+D SG+ + ++ NI+K RL+ G+ I + + + V+K+ V ++ +
Sbjct: 3 FLSVDAIDMSGKHEVDLDTNIWKLRLNSHGQIIG--TEYLSDLVEKEHVDHKHDHDHDKE 60
Query: 143 ED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF 201
+D P+ G AE +K K AL E
Sbjct: 61 KDHPHIHGFDQAAENLVKKV--------------KQALEE-------------------- 86
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
+GC++YG L+V RV+G+FHI+ + +N + V + S N +H I LSFG K
Sbjct: 87 AQGCRVYGVLDVQRVAGNFHIS---VHGLN-IFVAQMIFGGSKHVNVSHMIHDLSFGPKY 142
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--------------SKLGGG 307
PLDGTV + + F YYIKI+PT Y+ + S +
Sbjct: 143 PGI---HNPLDGTVRILRDTSGTFKYYIKIVPTEYKYISKAVLPTNQFSVTEYFSPMTDS 199
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
D P ++F Y+LSP+ V I E+ +S H T++ + GT+ ++D + ++ ++
Sbjct: 200 DRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMFRFLEALT 259
Query: 368 K 368
K
Sbjct: 260 K 260
>gi|242006215|ref|XP_002423949.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
gi|212507219|gb|EEB11211.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
Length = 349
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 164/376 (43%), Gaps = 70/376 (18%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ +K LDAF K E + GG ++I+ ++ + +++ ++ Y T + D
Sbjct: 14 KSVKVLDAFPKVDNSCRESSPVGGTLSIISYILMLWILYSEITYYTNSKITYKFLPDVDF 73
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLD-GKPIQEP 118
K+ I+LD+ V + C ++ D +DS+ + LH E+ + DL+ + I
Sbjct: 74 DQKVKIYLDMTV-AMPCSAVSADILDSTQQSVFNFGELHEENTWF----DLEPSQKINFD 128
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
Q + VNA+ ++ +EV E Y +K +
Sbjct: 129 QIKNVNALLRQDY----------------------------------HEVHE-YLWKSAS 153
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
++ V KN L N + C+IYG L +N+V+G+FHI+ G S + H+H
Sbjct: 154 PSFINVYVPRKN------LPNRPYDACRIYGELVLNKVAGNFHISAGKSLQLPRGHIHIA 207
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
+ FN +H + + SFG PL+G A + + Y+I+++PT +
Sbjct: 208 TFMSDKEFNFSHRLNYFSFG---DYSPGIVHPLEGDEKIATDAMMSYQYFIEVVPTEVKT 264
Query: 299 LDGSKL---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++L G G+PGIFF Y++S L V + ++ S + K+
Sbjct: 265 FLTNQLTYQYSVKDYQRPINHNTGSHGIPGIFFKYDMSALKVIVMQERDSPINFAVKLCA 324
Query: 344 NISGTYITFMLVDALL 359
+I G +IT LV+ ++
Sbjct: 325 SIGGIHITSGLVNNII 340
>gi|123408947|ref|XP_001303296.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884664|gb|EAX90366.1| hypothetical protein TVAG_036780 [Trichomonas vaginalis G3]
Length = 364
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/389 (24%), Positives = 157/389 (40%), Gaps = 67/389 (17%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
R LD F K + T GG ++++ I L +++ + + L V + R
Sbjct: 2 RFSKLDLFEKLDNNHRTGTTTGGILSLITIGLIISLFVIEIKSFLNPPLRQRLSVVNKRP 61
Query: 66 S-------------KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLD 111
+ K ++ DI P C L D +D+ + L NI R D
Sbjct: 62 TEADGVTITKESQEKTKVNFDIFFPNAPCYLLHFDLIDAVSQLDLFTYNQNITYTRFSSD 121
Query: 112 GKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRKCCNTCNEVK 169
GK I + KVT +CG C + + KCCNTC +V
Sbjct: 122 GKIIGDFDHSA--RFNTSKVT--------------ECGFCNATKGLKDKYKCCNTCQQVL 165
Query: 170 E-AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
E A ++ +P QC ++ ++LK EGC+I G E ++ FHI+PG Y
Sbjct: 166 EVAQVFRVVDIP------QCSDK--VKELKKMQNEGCRIKGNFETIKIKAEFHISPG--Y 215
Query: 229 SI---NHVHVHDIQPYTS--AAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGA 282
S+ + VH HD+ + + N ++ + H FG D+ LDG + + + G
Sbjct: 216 SVIDEDGVHAHDVSSFIDDVSELNLSYKLNHCRFG------DQNHSQLDGFSTIQKQIGY 269
Query: 283 SMFNYYIKI-----IPTIY-ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
Y I + T Y E++D L +PGI F Y+ + K L H
Sbjct: 270 FYAVYTIDVSENNDYSTAYMEQVDNGTL------VPGIVFKYDFGIITAKSFPDRPPLIH 323
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKK 365
L++ ++ G + F ++D L S +K+
Sbjct: 324 LFSNLVSMAGGVAMIFYILDYALFSSIKQ 352
>gi|401416963|ref|XP_003872975.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322489202|emb|CBZ24457.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 368
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/391 (21%), Positives = 153/391 (39%), Gaps = 71/391 (18%)
Query: 13 FTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG--SKLP 69
F KP ED+ E+T +G +++ + L+ + Y + + + RG +P
Sbjct: 2 FPKPKEDYQREQTRWGAVLSVATVSIVIILVLWEGAAYLRGRDAYDTDISLDRGLSEDMP 61
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
+H D+ P + C+ L++D VD++G + ++K LDG+ + K
Sbjct: 62 VHFDVFFPFMPCNRLSIDVVDTTGMAKFNYTGTLHKLPTALDGRVL----------YKGS 111
Query: 130 KVTTENGTTTTELEDPNKCGSCY-----GAETETR-----KCCNTCNEVKEAYRYKKWAL 179
+N T E + KC C G E R KCC+TC V + Y+ +
Sbjct: 112 LKDLDNAMETEEARNGTKCRPCPPSAFDGVAAEVRSAAVSKCCDTCESVLDLYKELGKGI 171
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSINHVHVH 236
P + + QC + + GC + G L++ +V + P G YS+ V
Sbjct: 172 PGTEYLPQCLEQLYQQ------ASGCNVVGSLDLKKVHVTVIFGPRRTGRFYSLKDV--- 222
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN-------YYI 289
+T+H IR L G D+ R +G VA+ G F+ Y +
Sbjct: 223 -------IRLDTSHSIRKLRIG----DEAVERFSKNG-VAEPLSGHKSFSKTYSETRYLV 270
Query: 290 KIIPTIYERLDGSK-----------------LGGGDGGMPGIFFSYELSPLMVKITEKSK 332
K++PT Y + + G G +P + F +E +P+ V + +
Sbjct: 271 KVVPTTYRKTKKRNAKASTYEYSAQWSKRTIVVGFAGAVPAVLFEFEPAPIQVNNVFERQ 330
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
H ++ + G ++ +D ++ V
Sbjct: 331 PFSHFVVQLCGIVGGLFVVLGFIDNVVDWAV 361
>gi|322791472|gb|EFZ15869.1| hypothetical protein SINV_02690 [Solenopsis invicta]
Length = 403
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 158/384 (41%), Gaps = 72/384 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGA--VTIV------------CWLFISYLICVDVCDYFQV 52
+K LDAF K E + +KT GG +T++ I+YLI + Y
Sbjct: 12 VKELDAFPKVPELYVDKTAVGGTCELTVINKIFSIIHISIFTIFIIAYLIIAETSYYLDS 71
Query: 53 STTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
+ D+ +KL I++D+ V + C + D +DS+ QH+ +D D
Sbjct: 72 RLQFKFEPDTEIDAKLQINIDVTV-AMPCGRIGADVLDSTN-QHM----------IDFD- 118
Query: 113 KPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
+ + T EL + A E K N+ ++E Y
Sbjct: 119 -------------------SLKEEDTWWELTAEQR------AHFEALKHMNSY--LREEY 151
Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
L + + ++ + C+++G L VN+V+G+FHI G S S+ H
Sbjct: 152 HAIHELLWKSNQVILYSEMPKRTSEPDYAPNACRVHGSLNVNKVAGNFHITAGKSLSVPH 211
Query: 233 VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
H+H T +N TH I SFG PL+G A+ ++ Y+++++
Sbjct: 212 GHIHISAFMTDRDYNFTHRINRFSFG---GPSPGIVHPLEGDEKIADNNMMLYQYFVEVV 268
Query: 293 PT-IYERLDGSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
PT I L SK G G+PGIFF Y++S L +K+T++ ++
Sbjct: 269 PTDIRTLLSTSKTYQYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQF 328
Query: 338 WTKIMCNISGTYITFMLVDALLHS 361
K+ + G ++T L+ ++ S
Sbjct: 329 LVKLCATVGGIFVTSGLIKNIVQS 352
>gi|395744111|ref|XP_003780425.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Pongo abelii]
Length = 387
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 155/379 (40%), Gaps = 64/379 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 22 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C + D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 82 KLRINIDITV-AMKCQCIGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 140 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 166
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 167 ------PREDDSSQSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 220
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA-EEGASMFNYYIKIIPT----------- 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 221 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDRKHQMFQYFITVVPTKLHTYKISADT 277
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G +
Sbjct: 278 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 337
Query: 351 TFMLVDALLHSCVKKISKV 369
T +LH K I ++
Sbjct: 338 T----TGMLHGIGKFIVEI 352
>gi|162852511|emb|CAO03348.2| ERGIC and golgi 3 [Homo sapiens]
Length = 118
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 69/109 (63%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DA+ K EDF KT G VTIV L + L ++ Y EL+VD SRG
Sbjct: 1 LKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGD 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+
Sbjct: 61 KLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPV 109
>gi|310800159|gb|EFQ35052.1| hypothetical protein GLRG_10196 [Glomerella graminicola M1.001]
Length = 377
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 149/369 (40%), Gaps = 74/369 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ + +L C +V +++ S T V+ G
Sbjct: 23 VSAFDAFPKAKPQYVTRTEGGGKWTVAMAVISFFLFCTEVGRWWRGSETHTFAVEKGVGH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD-----LDGKPIQEPQKE 121
++ I+LDIVV + CD L ++ D++G++ L ++ KR +D K I K+
Sbjct: 83 EMQINLDIVV-RMHCDDLHINVQDAAGDRIL--AGSMLKRDKTNWSQWVDSKGIHRLGKD 139
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
K KV T G E +G E + + V + KW
Sbjct: 140 -----SKGKVVTGAGWQEEE---------GFGEE-------HVHDIVSLGKKKAKWG--- 175
Query: 182 LDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
K + EG C+IYG L+VNRV G FHI A G Y H+
Sbjct: 176 --------------KTPRLWGEGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGAHL--- 218
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
AAFN +H I LSFG PLD TV A F YY+ ++PT+Y
Sbjct: 219 ---DHAAFNFSHIISELSFGPFYP---SLVNPLDRTVNLARINFHKFQYYLSVVPTVYTV 272
Query: 299 LDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
+ D +PGIFF Y++ P+++ + E L KI+
Sbjct: 273 GKSASSSNTIFTNQYAVTEQSKETDDHNIPGIFFKYDIEPILLSVEESRDGFLQLLMKIV 332
Query: 343 CNISGTYIT 351
+SG +
Sbjct: 333 NIVSGVLVA 341
>gi|388858415|emb|CCF48009.1| uncharacterized protein [Ustilago hordei]
Length = 415
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 157/360 (43%), Gaps = 63/360 (17%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+++ DAF K + +++ GG +TI+ + + +L+ ++ Y VDS
Sbjct: 13 KIRQFDAFPKTQSIYTQRSSKGGLLTIISTVTLLFLLWTELSSYLYGERAYSFAVDSQLS 72
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
S + I++D+ V + C YL +D D+ G++ LHV + + + DG + ++A
Sbjct: 73 STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DGTTFDIGHADRLDA 126
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ +++++ + T N+ ++ Y+K +
Sbjct: 127 MPREELSVQK----------------------------TINQARKKPLYRKKPKNK---- 154
Query: 186 VQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
+ + + K + +G C+IYG +EV RV+G+ HI ++ H ++ ++
Sbjct: 155 -KFSRQVAFHKTAHIVPDGPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SLEHTDH 207
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
N +H I SFG E +PLD +V ++ ++F Y+I +PT++ G K
Sbjct: 208 KLMNLSHVIHEFSFGPYFP---EISQPLDSSVETTDKHFTVFQYFISAVPTLFVDARGRK 264
Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
L G+PGIF Y++ P+ + I E+S + ++ + G ++
Sbjct: 265 LHTHQYSVTDYTRQIEHGKGVPGIFIKYDIEPIQMTIRERSSTFVQFLVRLAGVLGGVWV 324
>gi|258573091|ref|XP_002540727.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900993|gb|EEP75394.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 398
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 99/413 (23%), Positives = 167/413 (40%), Gaps = 94/413 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + + GG T+ +LF L+ ++ +++ + V+
Sbjct: 24 LRTFDAFPKTKPTYTTASRRGGQWTVFIFLFCGSLVFSELVSWYRGTENHHFSVEKGVSQ 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE--------HNIYKRRLDLDGKPIQEP 118
++ I+LD+VV + C+ L ++ D+ G+ L E + + R L+ K P
Sbjct: 84 EIQINLDMVV-HMPCEALRMNMQDAVGDFILAAELLHKDDTSWDAWNRELNYASKG-GSP 141
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
Q + +NA E+ T E E+ G G EV+ +++ K
Sbjct: 142 QYQTLNA--------EDDTRLAEQEEDQHVGHVLG-------------EVRRSWKRKFPK 180
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHD 237
P+L K K+ + C+IYG LE N+V G+FHI A GL Y D
Sbjct: 181 GPKL-------------KSKDAM-DSCRIYGSLEGNKVQGNFHITARGLGY-------WD 219
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
+ N TH I LSFG + PLD TVA ++ + YY+ ++PTIY
Sbjct: 220 PSGFHLEGLNFTHLITELSFGPRYS---TLLNPLDKTVAGTKDAFYKYQYYLSVVPTIYT 276
Query: 298 RL-----------DGSKLGGGD-----------------------GGMPGIFFSYELSPL 323
R D S + +PGIFF +++ P+
Sbjct: 277 RAGTVDPYNQELPDPSTITSRQRKNTIFTNQYAVTSQSHAIPQNVRAVPGIFFKFDIEPI 336
Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT----FMLVDALLHSCVKKISKVEIG 372
++ ++E+ SL L +++ +SG + F L L ++ + +G
Sbjct: 337 LLVVSEERGSLLALLVRLVNVVSGVLVAGGWVFQLATWALEVWGRRRKGMSLG 389
>gi|123361353|ref|XP_001295947.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121875215|gb|EAX83017.1| hypothetical protein TVAG_111750 [Trichomonas vaginalis G3]
Length = 338
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 130/313 (41%), Gaps = 53/313 (16%)
Query: 81 CDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTT 140
C+ L LD +DS G + L V + RR++ +K + KKK
Sbjct: 57 CEVLHLDILDSIGHKQLLVNDTLKWRRVN--------QEKGFMELYNKKK---------- 98
Query: 141 ELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK-KWALPELDTIVQCKNEYSTEKLKN 199
+C SCY + R CCN C ++KE Y K A PE QCK E K K
Sbjct: 99 ------QCHSCYDF-YDNRFCCNGCEKLKEIYHSNNKTATPE--NWTQCKPE---NKQKF 146
Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLS---YSINHVHVHDIQPYTSAAFNTTHHIRHLS 256
E C + G + VNRV GSFH+A G S Y H+ + D Y + F+ H I L
Sbjct: 147 DPNEKCHVKGKISVNRVPGSFHLAIGQSIEDYGHQHILLDD---YQTITFD--HDIIDLR 201
Query: 257 FGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDGG------ 310
FG + PL GT K+ Y + I P ++ DG + G
Sbjct: 202 FGANI---PMTSHPLRGTHIKSTGEPLATEYNLIITPIVF-YADGQYIEKGFEYVYFYSM 257
Query: 311 ----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+PGI+F Y +P + +T +S+S +SG Y F +V L +K
Sbjct: 258 TYHLVPGIYFYYSFTPYTIAVTWQSRSFRSFLISTGGLLSGIYAIFSMVSTFLEKSDQKK 317
Query: 367 SKVEIGGKTVTKR 379
KVE + V ++
Sbjct: 318 KKVETKAEAVAEK 330
>gi|322792517|gb|EFZ16475.1| hypothetical protein SINV_13267 [Solenopsis invicta]
Length = 110
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 64/109 (58%), Gaps = 17/109 (15%)
Query: 281 GASMFNYYIKIIPTIYERLDGS----------------KLGGGDGGMPGIFFSYELSPLM 324
GA MF +YIKI+PT Y R DGS L G+ GMPGIFFSYELSPLM
Sbjct: 1 GAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHAKQVSLLTGESGMPGIFFSYELSPLM 60
Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
VK TEK+KS GH T I G + L+D+LL+ V+ I K+E+G
Sbjct: 61 VKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQRKIELG 109
>gi|343427702|emb|CBQ71229.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 412
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 79/360 (21%), Positives = 157/360 (43%), Gaps = 63/360 (17%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+++ DAF K + +++ GG +TIV + + L+ ++ Y VD
Sbjct: 13 KIRQFDAFPKTQSIYTQRSSKGGILTIVSTVTLLALLWTELSSYLYGERGYSFAVDQQLQ 72
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
S + I++D+ V + C YL +D D+ G++ LHV + + + DG + + ++A
Sbjct: 73 STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DGTTFEIGHADRLDA 126
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ +++V+ + T N+ ++ Y+K +
Sbjct: 127 MPREEVSVQK----------------------------TINQARKKPLYRKKPKNK---- 154
Query: 186 VQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
+ + + K + +G C+IYG +EV RV+G+ HI ++ H ++ ++
Sbjct: 155 -KFSRQVAFHKTAHVVPDGPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SMEHTDH 207
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
N +H I SFG E +PLD +V ++ ++F Y++ +PT++ G K
Sbjct: 208 KLMNLSHVIHEFSFGPYFP---EISQPLDSSVETTDKHFTVFQYFVSAVPTLFVDARGRK 264
Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
L G+PGIF Y++ PL + I E+S +L ++ + G ++
Sbjct: 265 LHTHQYSVTDYTRQIEHGKGVPGIFIKYDIEPLQMTIRERSTTLLQFLVRLAGVLGGVWV 324
>gi|157865526|ref|XP_001681470.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68124767|emb|CAJ02321.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 365
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 85/390 (21%), Positives = 154/390 (39%), Gaps = 69/390 (17%)
Query: 13 FTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQV--STTEELFVDSSRGSKLP 69
F KP ED+ E+T +G +++ + L+ + Y + + + ++ +D +P
Sbjct: 2 FPKPKEDYQREQTRWGAVLSVSTVSIVILLVLWEGAAYLRGRDAYSTDVSLDKGLSEDMP 61
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
+H D++ P + C+ L++D VD++G + ++K LDG EV+ K
Sbjct: 62 VHFDVLFPFMPCNRLSIDVVDTTGMAKFNYTGRLHKLPTALDG--------EVLYKGSLK 113
Query: 130 KVTTENGTTTTELEDPNKCGSCY-----GAETETR-----KCCNTCNEVKEAYRYKKWAL 179
+ +N T E+ KC C G E R KCC+TC V Y+ +
Sbjct: 114 DL--DNEMETEEVRTGKKCRQCPPSAFDGVAAEVRSAAASKCCDTCESVLGLYKELGRGV 171
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSINHVHVH 236
P + I QC + GC + G L++ +V + P G YS+ V
Sbjct: 172 PGTEYIPQCLEQLYQR------ASGCAVMGSLDLKKVPVTVIFGPRRTGQFYSLKDV--- 222
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK------AEEGASMFNYYIK 290
+T+H IR L G D+ R +G + + + S Y +K
Sbjct: 223 -------IRLDTSHFIRKLRIG----DETVERFSKNGVAERLSGHKSSSKTYSETRYLVK 271
Query: 291 IIPTIYERLDGSK-----------------LGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
++PT Y + L G G +P + F +E +P+ V + +
Sbjct: 272 VVPTTYRKTKTKNAKASTYEYSAQWSRRTILVGFAGAVPAVLFEFEPAPIQVNNVFERQP 331
Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCV 363
H ++ + G ++ +D ++ V
Sbjct: 332 FSHFLVQLCGIVGGLFVVLGFIDNVVDWVV 361
>gi|340502903|gb|EGR29544.1| hypothetical protein IMG5_153610 [Ichthyophthirius multifiliis]
Length = 342
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 90/393 (22%), Positives = 170/393 (43%), Gaps = 88/393 (22%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK +D + K D E TV G ++I L + L + Y ++ T E+++D R
Sbjct: 9 KLKSIDMYRKLPTDLTESTVSGAMISIASSLIMLILFISEFNGYLSITETSEMYIDEKRY 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
K+ I++DI P + CD ++ L VE DL G
Sbjct: 69 DKIRINIDIDYPRLPCDVIS-----------LDVE--------DLKGT------------ 97
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
E T + + N+ +T+K ++ +E + +
Sbjct: 98 ---HSYQLEGNIQITRISNTNQY-------FDTQKYDDSHSENNQEF------------- 134
Query: 186 VQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
+E +LK+ F EGC+I G++ VN+ G+FH++ ++S + + +H I + +
Sbjct: 135 ----SEARLNRLKSAFLDQEGCKIQGHIFVNKAPGNFHVS---AHSFDRI-LHQIASHVN 186
Query: 244 -AAFNTTHHIRHLSFG-----IKLQDDDERR---KPLDGTVA-KAEEGASM---FNYYIK 290
+ + +H I H+SFG I+++ + + PLD T K E+ ++ + YYI
Sbjct: 187 ISTIDVSHIINHISFGDETDIIRIKRQFKSQGILDPLDRTRKIKTEDQKNISISYQYYIN 246
Query: 291 IIPTIYERLDGS-----KLGGGDG-----GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
++ T Y + + + +P FF Y+LSP++V+ ++ S H +
Sbjct: 247 VVHTTYVNIQKKEYSVYQFTANNNELLSDRLPACFFRYDLSPVIVRFSQSRMSFLHFIVQ 306
Query: 341 IMCNISGTYITFMLVDALLH-SCVKKISKVEIG 372
+ I G + ++D+++H S V + K E+G
Sbjct: 307 VCAIIGGVFTVAGIIDSIIHKSVVHILKKAEMG 339
>gi|209877186|ref|XP_002140035.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555641|gb|EEA05686.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 384
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/401 (21%), Positives = 171/401 (42%), Gaps = 68/401 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-RG 65
++ DAF+KP +F KT +GG +TI+ L + +L ++ Y +V+ +E+ VD + G
Sbjct: 1 MQRFDAFSKPIAEFRIKTAFGGYLTILSILTMLFLFYSELRYYLKVNRNDEITVDKTLAG 60
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ I + + P + C+ + L +++ D P+ ++
Sbjct: 61 GNVNIKMLVEFPKLPCEVVGLRILNTQ------------------DNTEFSHPKDSII-Y 101
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + E+ ++ CGSCY ++ CCNTC+EV +Y+ LP+
Sbjct: 102 IPINPLNEESNIGSS-------CGSCYNP-SKKNHCCNTCSEVIRSYQEDNIKLPQKINF 153
Query: 186 VQCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
QCK + E+L+ + GC+I + + +V G I+ + N + DI
Sbjct: 154 EQCKFD-PRERLEKAISAPLNISGCKIKVDINIPKVKGRIEISHKRWMNYNEMTNLDIS- 211
Query: 241 YTSAAFNTTHHIRHLSFGIKLQ------DDDERRKPLDGTVAKAEEGASMFNYYIKI--- 291
+ +N ++ +++L +G L ++ E + T K + + + ++ I
Sbjct: 212 -EAHLYNFSYIVKYLHYGDDLPGINNIWNNQEYIQTAKFTHNKESDNLFLEDAHLDIDMH 270
Query: 292 -IPTIYERLDGSK------------------LGGG----DGGMPGIFFSYELSPLMVKIT 328
IPT + ++ K L G + +PGI+ +Y+ +P +VKIT
Sbjct: 271 CIPTQFNSINSKKTKIGHQFSVRKQSKQVNVLNNGRFVPETSLPGIYINYDFTPFIVKIT 330
Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
E +S T+ I G + ++D + ++++
Sbjct: 331 ESRRSFLSFLTECCAIIGGIFAFSSMIDIFMFKLSSFLNRI 371
>gi|342878666|gb|EGU79974.1| hypothetical protein FOXB_09504 [Fusarium oxysporum Fo5176]
Length = 376
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 154/361 (42%), Gaps = 59/361 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + ++T GG T+ + LI ++ +++ + + V++
Sbjct: 23 VSAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLVLIWGELGRWWRGAESHNFEVEAGVSR 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLD----GKPIQEPQKEV 122
+L I++DIVV ++CD + ++ D+SG+ H + +RL D + +
Sbjct: 83 ELQINMDIVV-KMNCDDIHVNVQDASGD------HILAAKRLKADRTLWSQWVDNKGMHK 135
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+ + +V T +G ED +G E + + V + KWA
Sbjct: 136 LGRDSQGRVNTGSGYNELGYED-----EGFGEE-------HVHDIVALGKKRAKWA---- 179
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
T K + + C+IYG L++N+V G FHI A G Y N H+
Sbjct: 180 ----------KTPKFRGN-ADSCRIYGSLDLNKVQGDFHITARGHGYRGNGEHL------ 222
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
+ FN +H I LS+G PLDGTV A + F YY+ ++PT+Y
Sbjct: 223 DHSKFNFSHIISELSYGPFYP---SLVNPLDGTVNTAPDNFHKFQYYLSVVPTVYSVNSK 279
Query: 302 SKLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S L + +PGIFF Y++ P+++ + E + L K++ +SG +
Sbjct: 280 SILTNQYAVTEQSKAVDERYIPGIFFKYDIEPILLTVHESRDGIISLLVKVINIMSGVLV 339
Query: 351 T 351
Sbjct: 340 A 340
>gi|403330686|gb|EJY64240.1| hypothetical protein OXYTRI_24846 [Oxytricha trifallax]
Length = 345
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 160/398 (40%), Gaps = 94/398 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
SER+K D + +D E + G V++ + LI + Q T E+ +D
Sbjct: 9 LSERIKFFDFYKDLPQDLAEPSWSGATVSMFVMGLMVALIISQTYSFMQFQRTSEILIDV 68
Query: 63 SRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+ G SKL I+++I + C L+LD VD +G + V ++K LD DG +
Sbjct: 69 NSGNSKLNININITMHKAPCHVLSLDIVDVTGVHVMDVGGKLHKHSLDKDGFYL------ 122
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
G T E P E ++ + N++ YR
Sbjct: 123 --------------GHHDTMDEGP-----------EFKQASSDVNDI---YR-------- 146
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
DTI ++ EGC + G + +N+V G+FH++ ++S V V I
Sbjct: 147 -DTIKAMDDQ-----------EGCMVEGTVIINKVPGNFHLS---THSFGEV-VQKIY-M 189
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKP------------LDGTVAKAEE----GASMF 285
+ TH + HLSFG DD++ K +DGT + G +
Sbjct: 190 NGKKLDFTHTVNHLSFG-----DDKQMKSIQSKYNEKYTFDMDGTYVDQNQHLYQGQLLA 244
Query: 286 NYYIKI--------IPTIYERLDG-----SKLGGGDGGMPGIFFSYELSPLMVKITEKSK 332
NYY+ I Y+ L G SK G+P IFF YELSP+ ++ T K
Sbjct: 245 NYYLDINQVDYLDATGIFYKLLQGFKYKSSKSIMAQMGLPAIFFRYELSPVKLQYTMTYK 304
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
S + +I I G Y+ ++++ L + + S E
Sbjct: 305 SWSEFFIEISAIIGGMYVVAGIIESFLRNSLSIFSSDE 342
>gi|429862433|gb|ELA37083.1| copii-coated vesicle protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 375
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 147/364 (40%), Gaps = 64/364 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ + +L +V +++ S T V+ G
Sbjct: 23 VSAFDAFPKAKPQYVTRTSGGGKWTVAMAVISLFLFWTEVGRWWRGSETHTFAVEKGVGH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
++ I+LDIVV + CD L ++ D++G++ ++ A
Sbjct: 83 EMQINLDIVV-RMHCDDLHINVQDAAGDR--------------------------ILAAS 115
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K K+ + T ++ D NK G +T+ R + +E + + D +
Sbjct: 116 KLKR----DKTNWSQWVD-NKGIHRLGRDTKGRIVTGEGWQEEEGFGEEH----VHDIVA 166
Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
K K + EG C+IYG L+VNRV G FHI A G Y H+
Sbjct: 167 IGKKRAKWAKTPKLWGEGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGEHL------DH 220
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
AAFN +H I +SFG PLD TV A F YY+ ++PT+Y +
Sbjct: 221 AAFNFSHIISEMSFGPFYP---SLVNPLDRTVNAARINFHKFQYYLSVVPTVYTVGKSAS 277
Query: 304 LGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
D +PGIFF Y++ P+++ + E KI+ +SG
Sbjct: 278 TSNTIFTNQYAVTEQSKEVDDHNVPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSG 337
Query: 348 TYIT 351
+
Sbjct: 338 VLVA 341
>gi|169778245|ref|XP_001823588.1| COPII-coated vesicle protein (Erv41) [Aspergillus oryzae RIB40]
gi|83772325|dbj|BAE62455.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 390
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 74/376 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K D+ + GG T++ L S + +F+ S V+
Sbjct: 24 LKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVEKGVSH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LDIVV + CD L ++ D+SG++ L G+ +++
Sbjct: 84 DLQLNLDIVV-QMPCDALHVNIQDASGDR-------------ILAGELLKKDPTSWKLWT 129
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+ E T + E +P++ A+ E + EV+ R K P+L
Sbjct: 130 DKRNYDHEYQTLSRE--EPSRLE----AQEEDAHVRHVLGEVRHNPRRKFPKGPKLR--- 180
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ + C+IYG LE N+V G FHI A G Y H+ +
Sbjct: 181 -----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHST 223
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-----LD 300
FN +H I LSFG PLD T+A E + Y++ ++PTIY + LD
Sbjct: 224 FNFSHMITELSFGTHYP---TLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALD 280
Query: 301 -------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
G++L +PGIFF Y + P+++ I+E+ S
Sbjct: 281 STLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFL 340
Query: 336 HLWTKIMCNISGTYIT 351
L +++ +SG +T
Sbjct: 341 SLLIRLVNTVSGVMVT 356
>gi|328725267|ref|XP_003248406.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Acyrthosiphon pisum]
Length = 129
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 72/115 (62%), Gaps = 10/115 (8%)
Query: 7 LKGLDAFTKPYEDFHEKTV-YGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
LK DAF KP E KTV + + C+L +S + +Y + TEELF D+S+
Sbjct: 13 LKQFDAFAKPLEGVQMKTVCFFALFSNHCFLMVS-----NSVEY--LDNTEELFADTSQN 65
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
KL I+ DIVV ISCD+L +AV++SG +L V+HNIYK RL+L G+PI P+K
Sbjct: 66 KKLQINFDIVVLKISCDFL--NAVENSGVTNLQVDHNIYKWRLNLGGQPISNPEK 118
>gi|303313533|ref|XP_003066778.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
delta SOWgp]
gi|240106440|gb|EER24633.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
delta SOWgp]
gi|320036232|gb|EFW18171.1| COPII-coated vesicle protein [Coccidioides posadasii str. Silveira]
Length = 399
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/390 (22%), Positives = 157/390 (40%), Gaps = 93/390 (23%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + + GG T+ +LF L+ ++ + + V+
Sbjct: 24 LRTFDAFPKTKPTYTTASRRGGQWTVFTFLFCGILVLSELISWHGGTENHHFSVEKGVSE 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE--------HNIYKRRLDLDGKPIQEP 118
++ ++LD+VV + CD L ++ D++G+ L E + + R ++ GK
Sbjct: 84 EIQLNLDLVV-RMPCDSLRVNMQDAAGDFILAAELLHKTPTSWDAWNREMNFAGKG---- 138
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
+ + + ++ E+ E E+ G G EV+ +++ +
Sbjct: 139 -----GSRQYQTLSAEDNVRLAEQEEDQHVGHVLG-------------EVRRSWKRQFPP 180
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY--SINHVHV 235
P+L + + C+IYG LE N+V G+FHI A GL Y V+V
Sbjct: 181 GPKLK--------------RKDVVDSCRIYGSLEGNKVQGNFHITAKGLGYYDPTGMVNV 226
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
+D+ N TH I LSFG PLD TVA ++ + YY+ ++PTI
Sbjct: 227 NDM--------NFTHLITELSFGPHYP---TLLNPLDKTVAATKDKFYKYQYYLSVVPTI 275
Query: 296 YERL----------------------------------DGSKLGGGDGGMPGIFFSYELS 321
Y R + G +PGIFF +++
Sbjct: 276 YTRAGTVDPYSQRLPDPSTITPSQRKNTIFTNQYAVTSQSRTISQGPYSVPGIFFKFDIE 335
Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
P+++ ++E+ SL L +++ +SG +
Sbjct: 336 PILLVVSEERGSLLALLVRLVNVVSGVLVA 365
>gi|412991249|emb|CCO16094.1| predicted protein [Bathycoccus prasinos]
Length = 409
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/438 (22%), Positives = 157/438 (35%), Gaps = 126/438 (28%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LDA+ K + +T G V+++ + L ++ +Y +++ VD ++
Sbjct: 17 LSSLDAYKKIEDHLMVRTTSGAIVSLLGIALMCILGASEILNYITPPVVKQMAVDGTQNE 76
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ + +DI P + C L++DA D SG+ V ++K RL+ DGK + K
Sbjct: 77 LMTVRMDITFPRVPCSVLSVDAYDQSGKNDQDVRGELHKERLNKDGKSLGSYDK------ 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
VT E +L+ G + +K EVK A K
Sbjct: 131 AGGGVTDEEDALIQDLQQFFGG----GMKVVFQKRAEHSREVKHAVEKK----------- 175
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA- 245
EGC++YG + V RV G+FHI+ H ++ + A
Sbjct: 176 ----------------EGCRLYGRMHVQRVGGNFHISA-------HAEEYETLQHAFGAV 212
Query: 246 --FNTTHHIRHLSFG-------------IKLQDDDE------------------------ 266
N +H I HLSFG + DDE
Sbjct: 213 NKINISHTITHLSFGAGYPGLVNPLDGVARSGSDDEFHYDESSKDSRSSDRKNIEKEKEE 272
Query: 267 -----------RRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG---------- 305
R + +D T E G+ ++ Y++K++PT Y LG
Sbjct: 273 EEKRKKKEQVRRSRLMDLTW--DENGSGVYKYFLKLVPTFYRTHRSVFLGLFSWTKSVST 330
Query: 306 -------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
G +P ++F Y+ SP+ V I K + T+ +C + G F
Sbjct: 331 NQYSVTEYFRKTDAWSGSLPAVYFLYDFSPIAVTIDTKRPHFVYFLTR-LCAVCGGVFAF 389
Query: 353 M-----LVDALLHSCVKK 365
LVDALL KK
Sbjct: 390 AHMISNLVDALLTIITKK 407
>gi|405119686|gb|AFR94458.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Cryptococcus neoformans var. grubii H99]
Length = 431
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 90/384 (23%), Positives = 159/384 (41%), Gaps = 49/384 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + K+ GG +T V L I L+ D+ +Y + VDS
Sbjct: 32 IKSFDAFPKVESTYTIKSRRGGVLTAVVGLIIFLLVLNDLGEYLYGAPDYAFQVDSDIQK 91
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L +++D+ V + C YL +D D+ G++ LH+ ++ K DG + +
Sbjct: 92 DLQLNVDLTV-AMPCRYLTIDLRDAVGDR-LHLSNSFAK-----DGTHFNVGKATCIKNS 144
Query: 127 KKKKVTTENGT-TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + + + +++ PN+ S G +K + + + T
Sbjct: 145 RSTAIPSASEIISSSRRRTPNQQSSFSG--------------IKRLFGFSSSSSSNRRT- 189
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQPYTS 243
Q Y K C+IYG +EV +V+ + HI G Y S H H
Sbjct: 190 GQGHTAYRPTYDKVEDGPACRIYGSVEVKKVTANLHITTLGHGYMSFQHTDHH------- 242
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-----ER 298
N +H + SFG +PLD + E+ ++F Y+++++PT Y +
Sbjct: 243 -LMNLSHVVHEFSFGPFFP---AIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRK 298
Query: 299 LDGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
L S+ D G+PG+FF Y+L P+ V I E++ SL ++ + G +
Sbjct: 299 LITSQYAVTDYSRSFEHGKGVPGLFFKYDLEPMSVVIRERTTSLYQFLIRLAGVVGGVWT 358
Query: 351 TFMLVDALLHSCVKKISKVEIGGK 374
+ + +++SK +G K
Sbjct: 359 VAAFALRVFNRAQREVSKAVVGEK 382
>gi|321258600|ref|XP_003194021.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
gi|317460491|gb|ADV22234.1| ER to Golgi transport-related protein, putative [Cryptococcus
gattii WM276]
Length = 444
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/383 (22%), Positives = 154/383 (40%), Gaps = 46/383 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + K+ GG +T V L I L+ D+ +Y + VDS
Sbjct: 33 IKSFDAFPKVESTYMIKSKRGGVLTAVVGLIIFLLVLNDLGEYLYGAPDYAFQVDSDVQK 92
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L +++D+ V + C YL +D D+ G++ LH+ ++ K D +
Sbjct: 93 DLQLNVDLTV-AMPCRYLTIDLRDAVGDR-LHLSNSFVKDGTHFD--------------I 136
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K N ++TT P+ + T ++ + +K + +
Sbjct: 137 GKATSIKNNPSSTT----PSASEIISSSRRRTPNQQSSFSGIKRLFSSSPSSSSSNRRTA 192
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQPYTSA 244
Q Y K C+IYG ++V +V+ + HI G Y S H H
Sbjct: 193 QDHTAYRPTYDKVQDGPACRIYGSVQVKKVTANLHITTLGHGYMSFQHTDHH-------- 244
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-----ERL 299
N +H + SFG +PLD + + ++F Y+++++PT Y +L
Sbjct: 245 LMNLSHVVHEFSFGPFFP---AIAQPLDQSYEITLQPFTIFQYFLRVVPTTYIDASRRKL 301
Query: 300 DGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
S+ D G+PG+FF Y+L P+ V I E++ SL ++ + G +
Sbjct: 302 ITSQYAVTDYSRSFEHGKGVPGLFFKYDLEPMSVVIRERTTSLFQFLIRLAGVVGGVWTV 361
Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
+ + ++SK +G K
Sbjct: 362 AAFALRVFNRATMEVSKAVVGEK 384
>gi|391872305|gb|EIT81439.1| COPII vesicle protein [Aspergillus oryzae 3.042]
Length = 390
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 74/376 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K D+ + GG T++ L S + +F+ S V+
Sbjct: 24 LKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVEKGVSH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LDIVV + CD L ++ D+SG++ L G+ +++
Sbjct: 84 DLQLNLDIVV-QMPCDALHVNIQDASGDR-------------ILAGELLKKDPTSWKLWT 129
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+ E T + E +P++ A+ E + EV+ R K P+L
Sbjct: 130 DKRNYDHEYQTLSRE--EPSRLE----AQEEDAHVRHVLGEVRHNPRRKFPKGPKLR--- 180
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ + C+IYG LE N+V G FHI A G Y H+ +
Sbjct: 181 -----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHST 223
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-----LD 300
FN +H I LSFG PLD T+A E + Y++ ++PTIY + LD
Sbjct: 224 FNFSHMITELSFGPHYP---TLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALD 280
Query: 301 -------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
G++L +PGIFF Y + P+++ I+E+ S
Sbjct: 281 STLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFL 340
Query: 336 HLWTKIMCNISGTYIT 351
L +++ +SG +T
Sbjct: 341 SLLIRLVNTVSGVMVT 356
>gi|238495520|ref|XP_002378996.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
NRRL3357]
gi|220695646|gb|EED51989.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
NRRL3357]
Length = 390
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 74/376 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K D+ + GG T++ L S + +F+ S V+
Sbjct: 24 LKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVEKGVSH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LDIVV + CD L ++ D+SG++ L G+ +++
Sbjct: 84 DLQLNLDIVV-QMPCDALHVNIQDASGDR-------------ILAGELLKKDPTSWKLWT 129
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+ E T + E +P++ A+ E + EV+ R K P+L
Sbjct: 130 DKRNYDHEYQTLSRE--EPSRLE----AQEEDAHVRHVLGEVRHNPRRKFPKGPKLR--- 180
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ + C+IYG LE N+V G FHI A G Y H+ +
Sbjct: 181 -----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHST 223
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-----LD 300
FN +H I LSFG PLD T+A E + Y++ ++PTIY + LD
Sbjct: 224 FNFSHMITELSFGPHYP---TLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALD 280
Query: 301 -------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
G++L +PGIFF Y + P+++ I+E+ S
Sbjct: 281 STLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFL 340
Query: 336 HLWTKIMCNISGTYIT 351
L +++ +SG +T
Sbjct: 341 SLLIRLVNTVSGVMVT 356
>gi|443897407|dbj|GAC74748.1| CDK9 kinase-activating protein cyclin T [Pseudozyma antarctica
T-34]
Length = 414
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/349 (23%), Positives = 151/349 (43%), Gaps = 59/349 (16%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+++ DAF K + +++ GG +TI+ L + +L+ ++ Y VDS
Sbjct: 13 KIRQFDAFPKTQSIYTQRSSKGGVLTIISALALVFLLWTELSTYLYGERGYSFAVDSQLQ 72
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
S + I++D+ V + C YL +D D+ G++ LHV +K+ DG + ++A
Sbjct: 73 STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDTEFKK----DGTTFDIGHADRLDA 126
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ ++ + + G T ++ + Y + +K K A+ +P+
Sbjct: 127 LPQEAL--DVGKTISK----ARKKPLYRRKPRNKKFSRQVAFHKTAH-----LVPD---- 171
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
C+IYG +EV RV+G+ HI ++ H ++ ++
Sbjct: 172 ----------------GPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SMEHTDHKL 209
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N +H I SFG E +PLD +V ++ ++F Y++ IPT++ G +L
Sbjct: 210 MNLSHVIHEFSFGPYFP---EISQPLDSSVETTDKHFTVFQYFVSAIPTLFIDARGRRLH 266
Query: 306 GGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
G+PGIF Y++ PL + I E+S SL ++
Sbjct: 267 THQYSVTDYARPIEHGKGVPGIFIKYDIEPLQMTIRERSVSLVQFLVRL 315
>gi|119191516|ref|XP_001246364.1| hypothetical protein CIMG_00135 [Coccidioides immitis RS]
gi|392864406|gb|EAS34753.2| COPII-coated vesicle protein [Coccidioides immitis RS]
Length = 399
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/390 (22%), Positives = 157/390 (40%), Gaps = 93/390 (23%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + + GG T+ +LF L+ ++ + + V+
Sbjct: 24 LRTFDAFPKTKPTYTTASRRGGQWTVFIFLFCGMLVLSELISWHGGTENHHFSVEKGVSE 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE--------HNIYKRRLDLDGKPIQEP 118
++ ++LD+VV + CD L ++ D++G+ L E + + R ++ GK
Sbjct: 84 EIQLNLDLVV-RMPCDSLRVNMQDAAGDFILAAELLHKTPTSWDAWNREMNFAGKG---- 138
Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
+ + + ++ E+ E E+ G G EV+ +++ +
Sbjct: 139 -----GSRQYQTLSAEDDVRLAEQEEDQHVGHVLG-------------EVRRSWKRQFPP 180
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY--SINHVHV 235
P+L + + C+IYG LE N+V G+FHI A GL Y V+V
Sbjct: 181 GPKLK--------------RKDVVDSCRIYGSLEGNKVQGNFHITAKGLGYYDPTGMVNV 226
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
+D+ N TH I LSFG PLD TVA ++ + YY+ ++PTI
Sbjct: 227 NDM--------NFTHLITELSFGPHYP---TLLNPLDKTVAATKDKFYKYQYYLSVVPTI 275
Query: 296 YERL----------------------------------DGSKLGGGDGGMPGIFFSYELS 321
Y R + G +PGIFF +++
Sbjct: 276 YTRAGTVDPYSQRLPDPSTITVSQRKNTIFTNQYAVTSQSRTISQGPYSVPGIFFKFDIE 335
Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
P+++ ++E+ SL L +++ +SG +
Sbjct: 336 PILLVVSEERGSLLALLVRLVNVVSGVLVA 365
>gi|402885549|ref|XP_003906216.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Papio anubis]
Length = 364
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 150/379 (39%), Gaps = 78/379 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPT-ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
KL I++DI V C Y N+ D P Q+ + ++
Sbjct: 73 KLRINIDITVAMKCQCKY----------------TFNLLNPHAVFDLSPQQKEWQRMLQL 116
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
++ + L++ + K A++ ALP
Sbjct: 117 IQSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP---- 145
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
+ + S++ + C+I+G+L VN+V+G+FHI G + H H +
Sbjct: 146 ---REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 197
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
+N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 198 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT 254
Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ ER G G+ GIF Y+LS LMV +TE+ + ++ + G +
Sbjct: 255 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 314
Query: 351 TFMLVDALLHSCVKKISKV 369
T +LH K I ++
Sbjct: 315 T----TGMLHGIGKFIVEI 329
>gi|425765498|gb|EKV04175.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
digitatum PHI26]
gi|425783511|gb|EKV21358.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
digitatum Pd1]
Length = 396
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 159/385 (41%), Gaps = 85/385 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + T GG T++ L + ++ +++ + V+
Sbjct: 23 LKTFDAFPKTKASYTTPTRSGGQWTVLILLICTVFSWSELKTWWRGTENYHFSVEKGVSH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L ++LD+VV + CD L ++ D++G++ L G+ ++ + +
Sbjct: 83 ELQLNLDMVV-HMPCDQLRVNIQDAAGDR-------------ILAGELLKRDDTNWLLWM 128
Query: 127 KKKKVTTENGT---TTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALPEL 182
+K+ T +G T E+ ++ AE E + EV+ R K P +
Sbjct: 129 QKRNYETNDGAHEYQTLSHEESDRL-----AEQEADAHVGHVLGEVRHNPRRKFPKGPRM 183
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
+ + C+IYG LE N+V G FHI A G Y N H+
Sbjct: 184 R--------------RGVVPDACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------ 223
Query: 242 TSAAFNTTHHIRHLSFGIK---LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
+AFN +H I LSFG LQ+ PLD T+A+ EE F Y++ I+PT+Y R
Sbjct: 224 DHSAFNFSHMITELSFGPHYPTLQN------PLDKTIAETEEHYYKFQYFLSIVPTLYSR 277
Query: 299 ----LD----------------------------GSKLGGGDGGMPGIFFSYELSPLMVK 326
LD S + +PGIFF Y++ P+++
Sbjct: 278 GKSALDLYTRSPETLAARHGRNTVFTNQYAATSQSSAIPESPMVVPGIFFKYDIEPILLL 337
Query: 327 ITEKSKSLGHLWTKIMCNISGTYIT 351
++E+ L +++ +SG +T
Sbjct: 338 VSEERAGFLSLLIRVINTVSGVLVT 362
>gi|123437985|ref|XP_001309782.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121891523|gb|EAX96852.1| hypothetical protein TVAG_470170 [Trichomonas vaginalis G3]
Length = 344
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 125/310 (40%), Gaps = 58/310 (18%)
Query: 79 ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTT 138
+ C +++D D G +IYK RLD + PI P +V
Sbjct: 74 LPCILVSIDIYDVLGTLTDPNSKSIYKLRLDNNRNPI--PYSQV---------------- 115
Query: 139 TTELEDPNKCGSCYGAE-TETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
CGSCYG E E +CCNTC +V + L + T QC N EK
Sbjct: 116 ------SQNCGSCYGTEFAEGSRCCNTCEDVVSHHIKAGRPLTNVTTWQQCIN----EKY 165
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
T E CQI+G V+ + G I P S + +P+T N TH+I H++F
Sbjct: 166 DFTGKEKCQIFGNHHVSAIDGGIRILPRFS--------SNEEPFTK-LLNLTHYIDHITF 216
Query: 258 GIKLQDDDERRKPL-DGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG-----GDGGM 311
G +PL D + ++E G + Y +K +PT+ DGS G +
Sbjct: 217 GTSFGP-----QPLDDALIVQSEPGQFHYRYDLKAVPTVMHNQDGSITHGFQYAVDSAKI 271
Query: 312 P---------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
P GIFF+Y + + V ++ L +++ C G + L+D+ +
Sbjct: 272 PITDRTRLGEGIFFNYYFATVAVVGKPDRFTIYILISRLFCIFGGGFFLARLIDSFGYRI 331
Query: 363 VKKISKVEIG 372
K+ IG
Sbjct: 332 HTMEGKMRIG 341
>gi|326427137|gb|EGD72707.1| hypothetical protein PTSG_04435 [Salpingoeca sp. ATCC 50818]
Length = 357
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 90/397 (22%), Positives = 168/397 (42%), Gaps = 82/397 (20%)
Query: 3 FSERLKGLDAFTK--PYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
E++K LD F+K P + + G VT+V + L+ ++ +Y + + FV
Sbjct: 10 LQEQVKSLDVFSKVEPDTGITQSSTSGALVTLVTAAIVCVLVWSEISEYNTLKIKYDYFV 69
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPI 115
D+ + + +D+ V + CD++ D ++ SGE ++L +E ++ +
Sbjct: 70 DTDLRRDMNMTVDMTV-AMQCDHIGADYINLSGESTDGSKYLKLEPAHFE---------L 119
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
Q E + A K V +E G+ G ++ +R + E
Sbjct: 120 SPNQLEWLEAWAK--VKSEEGSR--------------GLDSLSRFLHGSMREPMPT---- 159
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS--YSINHV 233
A PE+D+ + C+++G L V +V+ +FHI G S +S H
Sbjct: 160 --AAPEIDS----------------EPDACRLHGVLPVAKVAANFHITAGKSVHHSRGHS 201
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
HV+ + P A N +H I SF ++ LDG + ++ +F Y+++++P
Sbjct: 202 HVNSMVP--PDAVNFSHRIDRFSF----SEEPRGAMALDGDLRTTDQPRQVFQYFLEVVP 255
Query: 294 TIYERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+ +RL + L G G+PGI+F +++ + V ++E+ L L
Sbjct: 256 STTQRLGQRQPFRSNQYSVTEQHRVLKEGARGIPGIYFKFDIESIGVSVSEEHPPLSRLL 315
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
+ +C I G + +LHS + I + G KT
Sbjct: 316 IR-LCGIVGGIVA---ASGMLHSFIGWIIRTVSGNKT 348
>gi|58261152|ref|XP_567986.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|134115843|ref|XP_773404.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256029|gb|EAL18757.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57230068|gb|AAW46469.1| ER to Golgi transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 431
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 156/383 (40%), Gaps = 47/383 (12%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + K+ GG +T + L I L+ D+ +Y + VDS
Sbjct: 32 IKRFDAFPKVESTYTIKSRRGGVLTALVGLIIFLLVLNDLGEYLYGAPDYAFQVDSEVQK 91
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L +++D+ V + C YL +D D+ G++ LH+ ++ K DG V
Sbjct: 92 DLQLNVDLTV-AMPCRYLTIDLRDAVGDR-LHLSNSFAK-----DGTHFN---------V 135
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N ++TT P+ + T ++ + +K + A T
Sbjct: 136 GTATFIKNNPSSTT----PSASEIISSSRRRTPNQQSSFSGIKRLFGLDSSASSNRRT-S 190
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQPYTSA 244
Q Y K C+IYG +EV +V+ + HI G Y S H H
Sbjct: 191 QGHTAYRPTYDKVQDGPACRIYGSVEVKKVTANLHITTLGHGYMSFQHTDHH-------- 242
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-----ERL 299
N +H + SFG +PLD + E+ ++F Y+++++PT Y +L
Sbjct: 243 LMNLSHVVHEFSFGPFFP---AIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRKL 299
Query: 300 DGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
S+ D G+PG+FF Y+L P+ V I E++ SL ++ + G +
Sbjct: 300 ITSQYAVTDYSRSFEHGKGVPGLFFKYDLEPMSVIIRERTTSLYQFLIRLAGVVGGVWTV 359
Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
+ + K +SK +G K
Sbjct: 360 AAFALRVFNRAQKHVSKAVMGEK 382
>gi|340914937|gb|EGS18278.1| hypothetical protein CTHT_0063020 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 388
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 149/368 (40%), Gaps = 66/368 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ L L ++ +++ + V+
Sbjct: 29 FQAFDAFPKTKSQYTTRTSGGGKWTVAMSLIALILFWAELSRWWRGTEEHTFAVEKGVAR 88
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL---DGKPIQEPQKEVV 123
L I+LDIVV + C L ++ D++G++ L E + + DGK + ++V
Sbjct: 89 TLDINLDIVV-RMRCADLHVNVQDAAGDRILAAERLTRDPTMWVQWVDGKGVHRLGRDV- 146
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
+ +V T G E +G E + + V + KWA
Sbjct: 147 ----QGRVVTGEGWVEDE---------GFGEE-------HVHDIVALGRKKAKWA----- 181
Query: 184 TIVQCKNEYSTEKL--KNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
T KL + + C+IYG LE+N+V G FHI A G Y + + Q
Sbjct: 182 ---------KTPKLPPRGGQADSCRIYGSLELNKVQGDFHITARGHGY----LEGGNAQH 228
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
+AFN +H I LSFG L PLD TV A F Y++ I+PT Y
Sbjct: 229 LDHSAFNFSHIISELSFGPFLP---SLSNPLDRTVNLASHHFHRFQYFLSIVPTTYSVGR 285
Query: 301 GSKLGG-----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++G + +PGIFF Y++ P+++ I E S+ K++
Sbjct: 286 PGEMGSQSIFTNQYAVTEQSHPVSERNIPGIFFKYDIEPILLNIVETRDSVFKFLVKVVN 345
Query: 344 NISGTYIT 351
+SG +
Sbjct: 346 IVSGVLVA 353
>gi|190346055|gb|EDK38054.2| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
6260]
Length = 407
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 99/398 (24%), Positives = 158/398 (39%), Gaps = 89/398 (22%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
FS R+K DAF K ++ GG TI+ +FI +++ V + + + VD
Sbjct: 64 AFSTRVKTFDAFPKLNSQHAVRSQRGGLSTIMTVVFILFVMWVQIGGFLGGYVDHQFVVD 123
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
S L I+LD+ V + C++L + +D + ++ L E L+ G P
Sbjct: 124 DQVRSDLRINLDMKV-AMPCEFLHTNVMDITDDRFLASE------VLNFQGSYFFVP--- 173
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ ++ TT+ T PE
Sbjct: 174 --DLIRMNDATTDYET------------------------------------------PE 189
Query: 182 LDTIVQCKNEYSTEKLKNTFTE---GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHD 237
L+ I+ Y ++ E C I+G + VN+VSG FHI A G+ Y + HV D
Sbjct: 190 LEEIMLEAGRYEFDREGYHEAESAPACHIFGSIPVNQVSGDFHITAKGMGYR-DRAHV-D 247
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
Q A N +H I SFG + + PLD T ++ + YY K++PT+YE
Sbjct: 248 PQ-----ALNFSHIIAEFSFG---EFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYE 299
Query: 298 RL----DGSKL-------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
R+ D ++ G G+PGIFF YE + + +++K +
Sbjct: 300 RMGLQVDTNQYSITESHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTSFVAR 359
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
+ I G +I V L +K+ K+ G K TK
Sbjct: 360 LATIIGGVFI----VAGYLFRLYEKLLKILFGKKYATK 393
>gi|302882273|ref|XP_003040047.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256720914|gb|EEU34334.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 376
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 153/360 (42%), Gaps = 57/360 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + ++T GG T+ + LI + +++ + + V++ G
Sbjct: 23 VSAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGEAARWWRGAESHNFEVEAGVGR 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD---LDGKPIQEPQKEVV 123
+L I+LDIVV + CD + ++ D+SG++ + + + + L +D K + + ++
Sbjct: 83 ELQINLDIVV-RMQCDDIHVNVQDASGDRIMAAKRLRHDKTLWSQWVDSKGMHKLGRD-- 139
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
+ +V T++G E E + + V + KWA
Sbjct: 140 ---SQGRVVTQSGWNDLG------------YEEEGFGEEHVHDIVALGRKKAKWA----- 179
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
T K+K + C++YG L +N+V G FHI A G Y N H+
Sbjct: 180 ---------KTPKVKGR-ADSCRVYGSLHLNKVQGDFHITARGHGYMGNGEHL------D 223
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
FN +H I LS+G PLDGTV A + F YY+ I+PT+Y S
Sbjct: 224 HKNFNFSHIISELSYGPFYP---SLVNPLDGTVNAASDNFHKFQYYLSIVPTVYSVGSRS 280
Query: 303 KLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
L + +PGIFF Y++ P+++ + E + KI+ +SG +
Sbjct: 281 ILTNQYAVTEQSKSVNEHYIPGIFFKYDIEPILLTVHESRDGILTFLVKIINIVSGVLVA 340
>gi|85101064|ref|XP_961083.1| hypothetical protein NCU04293 [Neurospora crassa OR74A]
gi|11611445|emb|CAC18610.1| conserved hypothetical protein [Neurospora crassa]
gi|28922621|gb|EAA31847.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 379
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 149/361 (41%), Gaps = 58/361 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ L L + +++ S + V+
Sbjct: 22 VSAFDAFPKSKPQYVTRTTAGGKWTVFVGLISFILFWSEASRWWRGSESHTFAVEKGVSH 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
L I+LDIVV + C + ++ D++G++ LH + +++ +D K I + ++
Sbjct: 82 ALDINLDIVV-KMKCQDIHINVQDAAGDRILAASRLHRDPTVWQHWVD--NKGIHKLGRD 138
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ KV T G + D +G E + + V R KWA
Sbjct: 139 A-----QGKVVTGEGYMQGQGHDEG-----FGEE-------HVHDIVSLGRRKAKWA--- 178
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
T +L + C+++G LE+N+V G FHI + H ++ Q
Sbjct: 179 -----------RTPRLWGATPDSCRVFGSLELNKVQGDFHIT-----AKGHGYMEFGQHL 222
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
+AFN +H I LSFG L PLD TV A F Y+I ++PT+Y
Sbjct: 223 DHSAFNFSHIISELSFGPFLP---SLVNPLDQTVNIASANFHKFQYFISVVPTVYSSSGK 279
Query: 302 SKLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S + + +PGIF Y++ P+++ I E+ S K++ ISG +
Sbjct: 280 SIVTNQYAVTEQSQEVTERIIPGIFVKYDIEPILLHIDEERDSFLVFIIKVVNVISGALV 339
Query: 351 T 351
Sbjct: 340 A 340
>gi|348529156|ref|XP_003452080.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oreochromis niloticus]
Length = 379
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 153/384 (39%), Gaps = 86/384 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y + E VD S
Sbjct: 13 VKELDAFPKVSESYVETSASGGTVSLLAFSAMALLAVLEFFVYRETWMKYEYSVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVD-------SSGEQH------LHVEHNIYKRRLDLDGK 113
KL I++DI V + C ++ D +D S+G Q+ L + +++R L L
Sbjct: 73 KLRINIDITV-AMKCQHVGADILDLAETMITSNGLQYEPVIFELTPQQRLWQRTLLLIQN 131
Query: 114 PIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR 173
++E + + V K + GA T
Sbjct: 132 RLRE--EHALQEVLYKTLLK-------------------GAPT----------------- 153
Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
ALP + + S E L C+I+G++ VN+V+G+ HI G
Sbjct: 154 ----ALPP-------REDASMEPLN-----ACRIHGHVYVNKVAGNLHITVGKPIHHPQG 197
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
H H + +N +H I HLSFG +L PLDGT MF Y+I ++P
Sbjct: 198 HAHIAAFVSHETYNFSHRIDHLSFGEELPGII---NPLDGTEKITYNNNQMFQYFITVVP 254
Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
T + ER G G+ GIF Y+ S LMV ++E+ L
Sbjct: 255 TKLNTYKISADTHQFSVTERERVINHAAGSHGVSGIFVKYDTSSLMVTVSEQHMPLWQFL 314
Query: 339 TKIMCNISGTYITFMLVDALLHSC 362
++ I G + T ++ L+ C
Sbjct: 315 VRLCGIIGGIFSTTGMLHGLVGFC 338
>gi|146421059|ref|XP_001486481.1| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
6260]
Length = 407
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/397 (24%), Positives = 158/397 (39%), Gaps = 89/397 (22%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
FS R+K DAF K ++ GG TI+ +FI +++ V + + + VD
Sbjct: 65 FSTRVKTFDAFPKLNSQHAVRSQRGGLSTIMTVVFILFVMWVQIGGFLGGYVDHQFVVDD 124
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I+LD+ V + C++L + +D + ++ L E L+ G P
Sbjct: 125 QVRSDLRINLDMKV-AMPCEFLHTNVMDITDDRFLASE------VLNFQGSYFFVP---- 173
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+ ++ TT+ T PEL
Sbjct: 174 -DLIRMNDATTDYET------------------------------------------PEL 190
Query: 183 DTIVQCKNEYSTEKLKNTFTE---GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
+ I+ Y ++ E C I+G + VN+VSG FHI A G+ Y + HV D
Sbjct: 191 EEIMLEAGRYEFDREGYHEAESAPACHIFGSIPVNQVSGDFHITAKGMGYR-DRAHV-DP 248
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
Q A N +H I SFG + + PLD T ++ + YY K++PT+YER
Sbjct: 249 Q-----ALNFSHIIAEFSFG---EFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYER 300
Query: 299 L----DGSKL-------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
+ D ++ G G+PGIFF YE + + +++K ++
Sbjct: 301 MGLQVDTNQYSITELHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTLFVARL 360
Query: 342 MCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
I G +I V L +K+ K+ G K TK
Sbjct: 361 ATIIGGVFI----VAGYLFRLYEKLLKILFGKKYATK 393
>gi|336472105|gb|EGO60265.1| hypothetical protein NEUTE1DRAFT_56465 [Neurospora tetrasperma FGSC
2508]
gi|350294686|gb|EGZ75771.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
2509]
Length = 379
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 149/361 (41%), Gaps = 58/361 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ L L + +++ S + V+
Sbjct: 22 VSAFDAFPKSKPQYVTRTTAGGKWTVFVALVSFILFWSEASRWWRGSESHTFAVEKGVSH 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
L I+LDIVV + C + ++ D++G++ LH + +++ +D K I + ++
Sbjct: 82 ALDINLDIVV-KMKCQDIHINVQDAAGDRILAASRLHRDPTVWQHWVD--NKGIHKLGRD 138
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ KV T G + D +G E + + V R KWA
Sbjct: 139 A-----QGKVVTGEGYMQGQGHDEG-----FGEE-------HVHDIVSLGRRKAKWA--- 178
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
T +L + C+++G LE+N+V G FHI + H ++ Q
Sbjct: 179 -----------RTPRLWGATPDSCRVFGSLELNKVQGDFHIT-----AKGHGYMEFGQHL 222
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
+AFN +H I LSFG L PLD TV A F Y+I ++PT+Y
Sbjct: 223 DHSAFNFSHIISELSFGPFLP---SLVNPLDQTVNIASANFHKFQYFISVVPTVYSSSGK 279
Query: 302 SKLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
S + + +PGIF Y++ P+++ I E+ S K++ ISG +
Sbjct: 280 SIVTNQYAVTEQSQEVTERIIPGIFVKYDIEPILLNIEEERDSFLVFIIKVVNVISGALV 339
Query: 351 T 351
Sbjct: 340 A 340
>gi|255944653|ref|XP_002563094.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587829|emb|CAP85889.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 396
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 155/382 (40%), Gaps = 79/382 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + T GG T++ + + + +++ + V+
Sbjct: 23 LKTFDAFPKTKAAYTTPTRSGGQWTVLILIICTIFSWSEFKTWWRGTENYHFSVEKGVSH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L ++LD+VV + CD L ++ D++G++ L E + KR +Q+ E + V
Sbjct: 83 ELQLNLDMVV-HMPCDQLRVNIQDAAGDRILAGE--LLKRDDTNWLLWMQKRNHETSDGV 139
Query: 127 KK-KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ + ++ E E E G G EV+ R K P L
Sbjct: 140 HEYQTLSHEEADRLAEQEADAHVGHVLG-------------EVRRNPRRKFEKGPRLR-- 184
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
+ + C+IYG LE N+V G FHI A G Y N H+ +
Sbjct: 185 ------------RGVVADACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------DHS 226
Query: 245 AFNTTHHIRHLSFGIK---LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
+F+ +H I LSFG LQ+ PLD T+A+ EE F Y++ ++PT+Y R G
Sbjct: 227 SFDFSHMITELSFGPHYPTLQN------PLDKTIAETEEHYYKFQYFLSVVPTLYSRGKG 280
Query: 302 --------------------------------SKLGGGDGGMPGIFFSYELSPLMVKITE 329
S + +PGIFF Y + P+++ ++E
Sbjct: 281 ALDAYTRSPDAAASRYGRDTVFTNQYAATSQSSAIPESPMVVPGIFFKYNIEPILLLVSE 340
Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
+ S L +++ ISG +T
Sbjct: 341 ERASFLSLLVRVINTISGVLVT 362
>gi|402085784|gb|EJT80682.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 379
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 149/365 (40%), Gaps = 61/365 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ + + L ++ +++ T V+ G
Sbjct: 21 VSAFDAFPKSKPQYVTRTAGGGKWTVAMLVISAVLTWSELARWWRGVETHTFAVEKGVGH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I+LD+VV + CD L ++ D++G++ L RL +D P Q N V
Sbjct: 81 SMQINLDVVV-HMKCDDLHVNVQDAAGDRILAAS------RLKMD--PTAWAQWVDGNGV 131
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K N T E + + +G E + + V + +W
Sbjct: 132 HKLGRDKHNRLITNEGFEHDGHDEGFGEE-------HVHDIVALGKKRARWG-------- 176
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQPYTSA 244
T +L + + C+++G L++N+V G FHI A G Y H+ HD
Sbjct: 177 ------KTPRLWGSTADSCRLFGSLDLNKVQGDFHITARGHGYMEFGEHLDHD------- 223
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
AFN TH I SFG + PLD T+ A F Y++ ++PT+Y + S
Sbjct: 224 AFNFTHIINEFSFG---EFYPSLVNPLDRTINGANTHFHKFQYFLSVVPTVYS-VKSSAG 279
Query: 305 GGG------------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
G G + +PGIFF Y++ P+++ I E + K++ +S
Sbjct: 280 GFGSTIFTNQYAVTEQNAEISERAIPGIFFKYDIEPVLLNIEESRDTFLLFLVKVVNILS 339
Query: 347 GTYIT 351
G +
Sbjct: 340 GAMVA 344
>gi|255563175|ref|XP_002522591.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223538182|gb|EEF39792.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 191
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 90/182 (49%), Gaps = 25/182 (13%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF--NTTHHIRHLSFGIK 260
EGC++YG L+V RV+G+FHI S++ +++ Q A N +H I LSFG K
Sbjct: 13 EGCRVYGVLDVQRVAGNFHI------SVHGLNIFVAQMIFDGAIHVNVSHIIHDLSFGPK 66
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--------------SKLGG 306
PLDGT + + F YYIKI+PT Y + S +
Sbjct: 67 FPG---LHNPLDGTARILHDASGTFKYYIKIVPTEYRYISKEVLPTNQFSVTEYFSPMSE 123
Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
D P ++F Y+LSP+ V I E+ +S H T++ + GT+ ++D ++ ++ +
Sbjct: 124 YDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGMLDRWMYRLLEAV 183
Query: 367 SK 368
+K
Sbjct: 184 TK 185
>gi|226479782|emb|CAX73187.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Schistosoma japonicum]
Length = 410
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/380 (22%), Positives = 158/380 (41%), Gaps = 58/380 (15%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
LD F K ++ + T GG +TI+ + IS+L+ + DY +D K+
Sbjct: 26 LDVFPKLPKECKKSTWGGGLLTILTFCCISWLLVNEFRDYLDPPVKYSYEIDKDISGKIK 85
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
+++DIVV + C +++D VD++G P+ +K
Sbjct: 86 VNIDIVVAS-PCHAISMDVVDTTGS-------------------PLFGEEK--------- 116
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
E +T +L P + A + + E A ++ W +
Sbjct: 117 ---IEYISTVFDLSPPARV-----AFKKRQYVAGALREKHHAIQHWLWKYASDTNVFTNF 168
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY-SINHVHVHDIQPYTSAA-FN 247
NE T+ + C+I G L V +V G+ HI G + ++H+H + P+ S N
Sbjct: 169 NEPDTQVSGGRNPDACRIVGTLFVKKVEGNIHILLGKPLEGLGNLHLH-VAPFLSKTNLN 227
Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL-- 304
+H I H SFG + + + PL+ + ++ F Y++ ++PT + + ++
Sbjct: 228 FSHRINHFSFGDLV---NGQIHPLEAIESITAVASTSFQYFVTMVPTKVVNQFHVTETYQ 284
Query: 305 ------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
G+PGIFF Y+ PL+VKIT + LG +T++ G + T
Sbjct: 285 YAATVQNRTIDHASDSHGIPGIFFIYDTFPLVVKITYDRELLGTFFTRLAALAGGIFATI 344
Query: 353 MLVDALLHSCVKKISKVEIG 372
+ + +L + + + + +G
Sbjct: 345 IYLREMLSNLPEILLRTRLG 364
>gi|448521200|ref|XP_003868450.1| Erv41 protein [Candida orthopsilosis Co 90-125]
gi|380352790|emb|CCG25546.1| Erv41 protein [Candida orthopsilosis]
Length = 352
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 144/371 (38%), Gaps = 86/371 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
FS+R+K DAF K ++ GG T++ + F ++ V++ Y + VD
Sbjct: 4 FSKRVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFFGLLILWVEIGGYIGGYVDRQFIVDD 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I+LD++V + C++L +AVD +G++ L E L+ +G P
Sbjct: 64 VLRSDLTINLDMIV-AMPCEFLHTNAVDIAGDRFLAGE------TLNFEGLKFFIPSGFS 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+N +PN P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129
Query: 183 DTIVQCKNEYSTEKLKNTFTEG---CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
D ++Q +L EG C I+G + VN+V G F I A GL Y D
Sbjct: 130 DEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGEFRITAKGLGYK-------DR 182
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
A N +H I+ S+G PLD T EE ++ Y+ K++PT+YE+
Sbjct: 183 SFVPVEALNFSHVIQEFSYGDFFP---FLNNPLDATGKVTEENLQIYLYHSKVVPTLYEK 239
Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
L D ++ + G+PGI+F+YE P+ + I EK K
Sbjct: 240 LGLEVDTTQYSLTENHHIVKVNPHSKKPQGIPGIYFAYEFEPIKLIIREKRIPFLQFIAK 299
Query: 341 IMCNISGTYIT 351
+ + G +
Sbjct: 300 LGTIVGGIIVA 310
>gi|449476586|ref|XP_004154778.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 140
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/126 (40%), Positives = 75/126 (59%), Gaps = 1/126 (0%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+L+ LDA+ K EDF+ +T GG +T+ F+ +L ++ Y T +L VD+SR
Sbjct: 6 NKLRNLDAYPKINEDFYRRTFSGGLITLASSFFMLFLFFSELRMYLHAKTETQLVVDTSR 65
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G +L I+ D+ P I C L+LDA+D SGEQHL + HNI K+R+D G I E + + +
Sbjct: 66 GGELHINFDLSFPAIPCSILSLDAIDISGEQHLDIRHNIIKKRIDHLGTVI-EARPDGIG 124
Query: 125 AVKKKK 130
A K K
Sbjct: 125 APKVSK 130
>gi|145479237|ref|XP_001425641.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392712|emb|CAK58243.1| unnamed protein product [Paramecium tetraurelia]
Length = 326
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 144/329 (43%), Gaps = 59/329 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQV---STTEELFVDSS 63
++ ++ FTK + +KT GG + +V + +LI ++ FQ+ ST + VD
Sbjct: 1 MQYINLFTKSKVE-TKKTTCGGILALVTIFSVGFLIIGEIIRSFQLEVLSTIDTTNVDE- 58
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
++ ++L+I V ++C L+LD D +G +E+ I+K R+ DG+ I + E V
Sbjct: 59 ---RIRVNLNITVHDMTCFALSLDQQDVTGTHLEDMEYTIHKLRIR-DGRFINKEYAENV 114
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
E + N+ CYGA+ + C TC +V AY + W LP +
Sbjct: 115 KLF-------EQSLYHWNWHNANEVNDCYGAQLFEGQKCITCQDVLLAYASRDWPLPRKE 167
Query: 184 TIVQCKNEYSTEK---------------------------LKNTFTEGCQIYGYLEVNRV 216
+I QCK Y + + T+ E CQI+G+ + R+
Sbjct: 168 SIQQCKYSYIQQNGRRVLFTEDFGEERRGQQYIDMNDLTAMAFTYGESCQIFGHFYIKRI 227
Query: 217 SGSFHIA-PGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR-----KP 270
G+FHI+ G +++ + DIQ +H I L F + Q R
Sbjct: 228 PGNFHISFHGKGQAVSLIS-QDIQ--------LSHTINWLEFTPQKQGPTFGRYFKTTNT 278
Query: 271 LDGTVAKAEEGASMFNYYIKIIPTIYERL 299
LDGT + ++ YY+K++ + YE L
Sbjct: 279 LDGTTHQLKQKEDT-QYYLKLVESHYETL 306
>gi|346970151|gb|EGY13603.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium dahliae VdLs.17]
Length = 373
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/388 (21%), Positives = 151/388 (38%), Gaps = 70/388 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + ++T GG T+ + L ++ +++ S + V+ G
Sbjct: 20 VSAFDAFPKSKPQYVQRTSGGGKWTVAMAVISVMLFWSELGRWWRGSESHTFAVEKGVGH 79
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LDIVV + C+ L ++ D+SG+ ++ A
Sbjct: 80 DLQVNLDIVV-KMRCEDLHVNVQDASGD---------------------------LILAA 111
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K + + ++ +K G ET + E + + D +
Sbjct: 112 TKLREEITSWHQWADMTGNHKLGRSPSGRIETNSGYHLDEGFGEEHVH--------DIVA 163
Query: 187 QCKNEYS---TEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
Q K T +L+ + C+I+G L++N+V G FHI A G Y H+
Sbjct: 164 QSKKRQKWARTPRLRGP-PDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL------D 216
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
+FN +H + LSFG + + PLD TV A F YY+ I+PT+Y +
Sbjct: 217 HTSFNFSHIVNELSFGAFYPNLE---NPLDRTVNLAPANFHKFQYYLSIVPTVYTVGRSA 273
Query: 303 KLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
GD +PG+F Y++ P+++ + E W K++ +S
Sbjct: 274 SKANTVYTNQFAVTEQSKEVGDHSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVINVLS 333
Query: 347 GTYIT----FMLVDALLHSCVKKISKVE 370
G + F L + + KK + +
Sbjct: 334 GVLVAGHWGFTLSEWFKENWAKKKERTQ 361
>gi|145349688|ref|XP_001419260.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579491|gb|ABO97553.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 310
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 80/368 (21%), Positives = 144/368 (39%), Gaps = 82/368 (22%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
+DAF + ++T G V++V + L V++ D+ + + VD +R + L
Sbjct: 1 VDAFARAAPHLTKRTRAGACVSVVGVVLACALALVEITDFLTPTRAKTHGVDDARNATLR 60
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
I +D+ P + C L +DA D SG+ + + K RLD G+ I
Sbjct: 61 IEIDVTFPRMPCQLLYVDAYDESGKHEVDARGLLLKTRLDASGRAIG------------- 107
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
E + G G ++ +EV+EA
Sbjct: 108 -------------EYESAGGVDLGGLVLFQRRPEHAHEVREA------------------ 136
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTT 249
EGC+++G LE RV+G+ + G ++D +P+ +
Sbjct: 137 ---------KADVEGCRLHGELEARRVAGTLRASTGPESYEFLKEIYD-EPW---EIDMR 183
Query: 250 HHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------------ 297
H ++ +FG + P++G V + E + ++ Y++K++PT Y
Sbjct: 184 HAVKTFTFGAEFPGAV---NPMNG-VRRMETKSGIYKYFMKVVPTTYSSTRALFGFIPWT 239
Query: 298 -RLDGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
R ++ + G +P +FF Y+LS + V IT SKS+ + TK + + G
Sbjct: 240 VRTRTNQYSVTEHFIETPHWGALPQLFFIYDLSAIAVNITVTSKSIVYFLTKTLATMGGI 299
Query: 349 YITFMLVD 356
+ VD
Sbjct: 300 FALTRTVD 307
>gi|225717192|gb|ACO14442.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Esox lucius]
Length = 379
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 144/375 (38%), Gaps = 68/375 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E T GG V+++ + ++ L + Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETTASGGTVSLIAFTAMALLAFFEFFVYRDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C ++ D +D +
Sbjct: 73 KLRINIDITV-AMKCQHVGADILDLA---------------------------------- 97
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK----KWALPEL 182
+ + T NG +P G + R N ++E + + K L
Sbjct: 98 --ETMITSNGIQY----EPVVFGLTPEQKLWHRTLLLIQNRLREEHSLQEVLYKSVLKGA 151
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
T + + ++E L C+I+G++ VN+V+G+FHI G H H +
Sbjct: 152 PTALPPREVATSEPLG-----ACRIHGHVYVNKVAGNFHITVGKPIHHPRGHAHIAAFVS 206
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
+N +H I H SFG ++ PLDGT MF Y+I ++PT
Sbjct: 207 HDTYNFSHRIDHFSFG---EEIPGIINPLDGTEKVTTNNNHMFLYFITVVPTKLHTSKVS 263
Query: 295 -------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ ER G G+ GIF Y+ S LMV ++E+ L ++ I G
Sbjct: 264 ADTHQFSVTERERVINHAAGSHGVSGIFMKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGG 323
Query: 348 TYITFMLVDALLHSC 362
+ T ++ + C
Sbjct: 324 IFSTTGMIHGFVGFC 338
>gi|380016475|ref|XP_003692209.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Apis florea]
Length = 392
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 149/378 (39%), Gaps = 74/378 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + +KT GG +I I+YLI + ++DS
Sbjct: 12 VKELDAFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAET----------SYYLDSRLQF 61
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLD---GKPIQEPQKEVV 123
K DI DA K ++++D P +V+
Sbjct: 62 KFETDTDI------------DA----------------KLKINIDITVAMPCGRIGADVL 93
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
++ + V G + E ED + + E R R + A+ EL
Sbjct: 94 DSTNQNMV----GHESLEQED-----TWWELTQEQRSHFEALKHTNSYLREEYHAIHELL 144
Query: 184 TIVQCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
YS E K T C+I+G L VN+V+G+FHI G S SI H+H
Sbjct: 145 WKSNQVTLYS-EMPKRTHQPIYAPNACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHIS 203
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE 297
T +N TH I SFG PL+G A+ ++ Y+++++PT I
Sbjct: 204 AFMTEKDYNFTHRINKFSFG---GPSPGIVHPLEGDEKIADNNMLLYQYFVEVVPTDIQT 260
Query: 298 RLDGSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
L SK G G PGIFF Y++S L +K+T++ ++ K+
Sbjct: 261 LLSTSKTYQYSVKDHQRPINHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCA 320
Query: 344 NISGTYITFMLVDALLHS 361
+ G ++T LV ++ S
Sbjct: 321 TVGGIFVTSGLVKNIVQS 338
>gi|326672443|ref|XP_003199668.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Danio rerio]
Length = 365
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 84/178 (47%), Gaps = 18/178 (10%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
C+I+G + VN+V+G+FHI G + H H +N +H I HLSFG D
Sbjct: 170 ACRIHGKIYVNKVAGNFHITLGKPIETHKGHAHYASFIKDEVYNFSHRIDHLSFG---ND 226
Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGGD 308
PLDG E ++F Y+I ++PT + ER G+
Sbjct: 227 VPGHINPLDGMEKTTLEQNTLFQYFITVVPTKLHTSNVSVDMHQFSVTERERVVSNEKGN 286
Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G+ GIFF Y+LSPLMV+++E+ L ++ + G + T L+ L+ S V I
Sbjct: 287 QGVSGIFFKYKLSPLMVRVSEEHMPLAAFLVRLCGIVGGIFSTSDLLHRLIGSFVDII 344
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 43/87 (49%), Gaps = 1/87 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + + +GG VT+ ++ ++ L + Y E VD S
Sbjct: 15 IKNLDAFPKVPESYVATSAFGGTVTLTVFILMALLTISEFFVYQDTWMKYEYEVDRDFTS 74
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSG 93
KL I +DI V + C+ L D +D +G
Sbjct: 75 KLKIKIDITV-AMKCERLGADVLDIAG 100
>gi|66500700|ref|XP_395190.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Apis mellifera]
Length = 389
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 54/173 (31%), Positives = 84/173 (48%), Gaps = 18/173 (10%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
C+I+G L VN+V+G+FHI G S SI H+H T +N TH I SFG
Sbjct: 169 ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDYNFTHRINKFSFG---GP 225
Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL--------------GGGD 308
PL+G A+ ++ Y+++++PT I L SK G
Sbjct: 226 SPGIVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLSTSKTYQYSVKDHQRPINHQKGS 285
Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
G PGIFF Y++S L +K+T++ ++ K+ + G ++T LV ++ S
Sbjct: 286 HGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGLVKNIVQS 338
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 1/88 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + +KT GG +I I+YLI + Y + D+ +
Sbjct: 12 VKELDAFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTDIDA 71
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGE 94
KL I++DI V + C + D +DS+ +
Sbjct: 72 KLKINIDITV-AMPCGRIGADVLDSTNQ 98
>gi|281206876|gb|EFA81060.1| DUF1692 family protein [Polysphondylium pallidum PN500]
Length = 344
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 62/231 (26%), Positives = 107/231 (46%), Gaps = 40/231 (17%)
Query: 22 EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISC 81
+KTVYGG +T +C +F +L+C ++ Y L VD +RG++L I++DI P++ C
Sbjct: 116 QKTVYGGVITAICMIFTMFLLCSELYYYTFPIRDHSLKVDVTRGNRLLINIDIHFPSLIC 175
Query: 82 DYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTE 141
+ ++++D +DG+PI++ ++V ++ NG
Sbjct: 176 SDINVESIDG------------------IDGRPIKDASYQIV-----RERLDRNGVVIDP 212
Query: 142 LEDPN---KCGSCYGAETE------TRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEY 192
P +C SC ++CCN C++++E YR K D QC
Sbjct: 213 SNPPPGFFECVSCRLPANSKYAVLYPQRCCNKCDDLREFYRTNKIPQHYADQSPQCM--I 270
Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV---HDIQP 240
S + ++ EGC+IYG L V ++ G HI G+ N + +D+ P
Sbjct: 271 SDPEAED---EGCRIYGTLWVQKMKGDIHILAGIRPGYNAPGIYFKYDLSP 318
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 24/36 (66%), Gaps = 1/36 (2%)
Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
PGI+F Y+LSPLM+++ + SK L T + C I G
Sbjct: 308 PGIYFKYDLSPLMIEVDQSSKPFVELVTSV-CAIGG 342
>gi|66360024|ref|XP_627190.1| ERV41 like membrane associated protein involved in vesicular
transport with a transmembrane region near the
C-terminus [Cryptosporidium parvum Iowa II]
gi|46228832|gb|EAK89702.1| ERV41 like membrane associated protein involved in vesicular
transport with a transmembrane region near the
C-terminus [Cryptosporidium parvum Iowa II]
Length = 403
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 86/405 (21%), Positives = 164/405 (40%), Gaps = 71/405 (17%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SR 64
++K DAF+KP +F KT +GG +TI+ + + L ++ Y ++ +E+ VD S
Sbjct: 15 KMKQFDAFSKPISEFRIKTAFGGYLTILSMIAMIILFYSELKYYLNITRKDEVTVDHLSS 74
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
+ + + + P + CD L + ++ + +++ DG
Sbjct: 75 NRNINLRMQLEFPKLPCDILGVRIINLQENKEIYLP----------DG------------ 112
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKKWALP 180
++ K+ G+ + + CG CY A + CCNTC ++ Y K LP
Sbjct: 113 GIEFVKI----GSNESNANSSSGCGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLP 168
Query: 181 ELDTIVQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
+ + QC + S +++ N +EGC+I + +V G I+ + + +
Sbjct: 169 HVISFKQCDYDKS-KRISNALSSNLNSEGCKIKVNGYIPKVKGKIEISH--KRWVKYKEM 225
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA----------SMF 285
D++ S FN ++ + +L FG +L R K + + E +
Sbjct: 226 TDLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYI 285
Query: 286 NYYIKIIPTIYERLDGS------------------KLGGG----DGGMPGIFFSYELSPL 323
++ + IPT Y ++ L G D +PGI +Y+ +P
Sbjct: 286 DFDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPF 345
Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+VKITE +S T+ I G + ++D + ++K
Sbjct: 346 LVKITESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 390
>gi|307105802|gb|EFN54050.1| hypothetical protein CHLNCDRAFT_136126 [Chlorella variabilis]
Length = 319
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 83/388 (21%), Positives = 147/388 (37%), Gaps = 103/388 (26%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTI--VCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
+L L AF+ E +T++G VTI VC + ++ V C V +++ VD+S
Sbjct: 7 KLSHLTAFSHAQEHLRVQTIHGAIVTIIGVCVALVLFISEVQQC--MVVKRVQDMRVDTS 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSG----EQHLHVEHN--IYKRRLDLDGKPIQE 117
R +L + ++ P + C+ L +DA D SG E + V N ++K +D+ G+ +
Sbjct: 65 RREELHVSFNVTFPALPCEALLMDAGDVSGKWQTESRMKVAKNGEVHKHSVDISGRWL-- 122
Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
++ + E ++P + GA + + CN
Sbjct: 123 ------------RLAEYTAPSEGEWDNPFEMNEI-GAALKRHEGCN-------------- 155
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA---PGLSYSINHVH 234
I+G+LEV RV+G+ H A L S+N
Sbjct: 156 -----------------------------IHGWLEVQRVAGNVHFAVRPEALFLSMNAEA 186
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
+ + P ++ N +H PL+G + Y++K++PT
Sbjct: 187 IMQLHP-DASKLNISH-----------------ANPLEGVAQIDRTATGIDKYFVKVVPT 228
Query: 295 IYERLDGSK--------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
+ L G K GG+ P ++ Y+ SP+MV I E L L +
Sbjct: 229 DFYTLWGRKTHTYQYSVTEYYHQFRGGEEQPPAVYLLYDASPIMVDIREMRPGLLRLLVR 288
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISK 368
+ + G + L D ++H V + +
Sbjct: 289 VCAVVGGAFALTGLFDKMVHRAVVAVKR 316
>gi|380492334|emb|CCF34678.1| hypothetical protein CH063_01185 [Colletotrichum higginsianum]
Length = 377
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 144/361 (39%), Gaps = 58/361 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ + +L +V +++ S T V+ G
Sbjct: 23 VSAFDAFPKAKPQYVTRTSGGGKWTVAMTVISVFLFWTEVGRWWRGSETHTFAVEKGIGH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
++ I+LDIVV + CD L ++ D++G++ L ++ KR + + +
Sbjct: 83 EMQINLDIVV-RMHCDDLHINVQDAAGDRIL--AGSMLKRDKTNWSQWVDSKGIHRLGRD 139
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K K+ T G E +G E + + V + KW
Sbjct: 140 SKGKIVTGAGWQEEE---------GFGEE-------HVHDIVSLGKKKAKWG-------- 175
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
T +L + C++YG L+VNRV G FHI + H ++ + AAF
Sbjct: 176 ------KTPRLWGD-GDSCRVYGNLDVNRVQGDFHIT-----ARGHGYMEFGEHLDHAAF 223
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
N +H + LSFG PLD TV A F YY+ I+PT+Y +
Sbjct: 224 NFSHIVSELSFGPFYP---SLVNPLDRTVNLARINFHKFQYYLSIVPTVYTVGKSASSSN 280
Query: 307 ----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
D +PGIFF Y++ P+++ + E KI+ +SG +
Sbjct: 281 TIFTNQYAVTEQSKETDDHNIPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVLV 340
Query: 351 T 351
Sbjct: 341 A 341
>gi|256052432|ref|XP_002569774.1| ptx1 protein [Schistosoma mansoni]
gi|353229921|emb|CCD76092.1| putative ptx1 protein [Schistosoma mansoni]
Length = 460
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 168/387 (43%), Gaps = 58/387 (14%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
S+ + LD F K + + T GG VTI+ + IS+L+ ++ Y +D S
Sbjct: 68 SQIVNELDVFPKLPRECKKSTWSGGLVTILTFGCISWLLIMEFRSYLDPPVNYSYELDKS 127
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
K+ +++DIVV + C +++D VD+SG L E NI + P
Sbjct: 128 TTGKVKVNIDIVVAS-PCHAVSMDVVDTSGSS-LSDEENIQYLPTSFELTPSARA----- 180
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
A K ++ Y AET K + + + +K + +
Sbjct: 181 -AFKYRQ---------------------YIAETLRAK-----HHTIQHWLWKYTSGTNVF 213
Query: 184 TIVQCKNEYSTEKLKNTF-TEGCQIYGYLEVNRVSGSFHIAPGLSYS-INHVHVHDIQPY 241
TI + + EK+ + ++ C+I G L V +V G+ HI G + ++H+H + P+
Sbjct: 214 TIFEVP--VADEKVSDDRNSDACRIVGTLFVKKVGGNIHILFGKPLNGFGNLHLH-VVPF 270
Query: 242 TSAAF-NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------ 294
+ + N +H I H SFG + + + PL+ + + + F Y++ ++PT
Sbjct: 271 SGQSLQNFSHRINHFSFGDLV---NGQIHPLEAVESVTDIAFTSFQYFVTMVPTKVVNHF 327
Query: 295 -IYERLDGSKL--------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
I E + G G+PGIFF Y++ PL+VKIT + LG +T++
Sbjct: 328 HITETYQYAATLQNRTIDHDAGSHGIPGIFFVYDIFPLVVKITYDRELLGTFFTRLAALA 387
Query: 346 SGTYITFMLVDALLHSCVKKISKVEIG 372
G + T + +L + + + +G
Sbjct: 388 GGIFATVAYLREILSNLPDILLRTRLG 414
>gi|41055383|ref|NP_956701.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Danio rerio]
gi|82188148|sp|Q7T2D4.1|ERGI2_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|32451749|gb|AAH54593.1| ERGIC and golgi 2 [Danio rerio]
gi|182890474|gb|AAI64472.1| Ergic2 protein [Danio rerio]
Length = 376
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 84/180 (46%), Gaps = 18/180 (10%)
Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
N C+I+G+L VN+V+G+FHI G + H H + +N +H I HLSFG
Sbjct: 163 NQPLNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHETYNFSHRIDHLSFG 222
Query: 259 IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSK 303
++ PLDGT + + MF Y+I I+PT + ER
Sbjct: 223 ---EEIPGILNPLDGTEKVSADHNQMFQYFITIVPTKLQTYKVYADTHQYSVTERERVIN 279
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
G G+ GIF Y++S LMVK+TE+ ++ I G + T ++ L+ CV
Sbjct: 280 HAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWQFLVRLCGIIGGIFSTTGMLHNLVGFCV 339
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 1/84 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ LDAF K E + E T GG V+++ + ++ L + Y E VD S
Sbjct: 13 VRELDAFPKVPESYVETTASGGTVSLLAFTAMALLAFFEFFVYRDTWMKYEYEVDKDFTS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
KL I++DI V + C ++ D +D
Sbjct: 73 KLRINIDITV-AMRCQFVGADVLD 95
>gi|452822342|gb|EME29362.1| hypothetical protein Gasu_31910 [Galdieria sulphuraria]
Length = 170
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 91/178 (51%), Gaps = 14/178 (7%)
Query: 41 LICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE 100
LI +V Y++ T L VD +R I+LDI P I C L LD +D++G+ L V
Sbjct: 4 LIISEVGRYWKPQVTTHLVVDYNREESFEIYLDITFPHIGCGALGLDTMDATGDSQLEVV 63
Query: 101 HN-IYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTT-TTELEDPNKCGSCYGAETET 158
++ + K R+ +G + + + + ++G + LE+ C SCYGA+ T
Sbjct: 64 NSKLSKFRVFQNGSQV----------LWNQSIVEKDGKVHSFVLEEATNCKSCYGAQIST 113
Query: 159 RKCCNTC-NEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNR 215
+CCNTC EV AY + W+ +++ QC E + +++ ++GC G +EV +
Sbjct: 114 DQCCNTCEEEVLLAYEWIGWSY-QVEQFEQCHMEGVVQWVQSVLSQGCHFQGTIEVAK 170
>gi|323509323|dbj|BAJ77554.1| cgd8_2900 [Cryptosporidium parvum]
gi|323510503|dbj|BAJ78145.1| cgd8_2900 [Cryptosporidium parvum]
Length = 388
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/406 (21%), Positives = 165/406 (40%), Gaps = 75/406 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD---SS 63
+K DAF+KP +F KT +GG +TI+ + + L ++ Y ++ +E+ VD S+
Sbjct: 1 MKQFDAFSKPISEFRIKTAFGGYLTILSMIAMIILFYSELKYYLNITRKDEVTVDHLSSN 60
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
R L + L+ P + CD L + ++ + +++ DG
Sbjct: 61 RNINLRMQLEF--PKLPCDILGVRIINLQENKEIYLP----------DG----------- 97
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKKWAL 179
++ K+ G+ + + CG CY A + CCNTC ++ Y K L
Sbjct: 98 -GIEFVKI----GSNESNANSSSGCGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKL 152
Query: 180 PELDTIVQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
P + + QC + S +++ N +EGC+I + +V G I+ + +
Sbjct: 153 PHVISFKQCDYDKS-KRISNALSSNLNSEGCKIKVNGYIPKVKGKIEISH--KRWVKYKE 209
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA----------SM 284
+ D++ S FN ++ + +L FG +L R K + + E +
Sbjct: 210 MTDLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAY 269
Query: 285 FNYYIKIIPTIYERLDGS------------------KLGGG----DGGMPGIFFSYELSP 322
++ + IPT Y ++ L G D +PGI +Y+ +P
Sbjct: 270 IDFDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTP 329
Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+VKITE +S T+ I G + ++D + ++K
Sbjct: 330 FLVKITESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 375
>gi|393231429|gb|EJD39021.1| DUF1692-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 518
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 137/349 (39%), Gaps = 61/349 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + + GG +T+ L L+ D+ +Y E VD SR S
Sbjct: 20 LKQFDAFPKVPATYKSRRGEGGLLTLFACLLSVVLVLNDIAEYMWGWPDHEFSVDKSRQS 79
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+PI++D++V + C YL++D D+ G++ LH+ N V
Sbjct: 80 YMPINVDLIV-NMPCHYLSVDIRDAVGDR-LHLSDN-----------------------V 114
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K++ + G T S ++RK + + + +
Sbjct: 115 KREGTVWDVGQATRMANHSQTMMSATEVVRQSRKSRGLFSIFQRSSK------------P 162
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQPYTSAA 245
Q K Y+ + C+++G + V +V+ + HI G YS N H +
Sbjct: 163 QFKPTYNHPNMGKAVGSACRVFGSMFVKKVTANLHITTAGHGYSSNAHTDHTM------- 215
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N +H I SFG + D + PLD A+E + + Y++ ++PT Y +
Sbjct: 216 MNLSHIISEFSFGPFMPDISQ---PLDNLFEVAKEPFTAYQYFLTVVPTTYVAPRSYPMR 272
Query: 306 GGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
PGIFF +++ P+ + + +++ + L +I
Sbjct: 273 TNQYSVTNYKRVFEHGRATPGIFFKFDIDPMQLTVIQRTTTFTQLIIRI 321
>gi|154418008|ref|XP_001582023.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121916255|gb|EAY21037.1| hypothetical protein TVAG_172950 [Trichomonas vaginalis G3]
Length = 371
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 161/380 (42%), Gaps = 53/380 (13%)
Query: 23 KTVYGGAVTIVCWLFISYLICVDV--CDYFQVSTTEEL---FVDSSRGSKLPIHLDIVVP 77
+T GG ++ + L++ +L+ + Y ++ ++ L VD R K I+ DI +
Sbjct: 19 QTFTGGLISFLTTLWVCFLLVGKIHGLIYPEIKSSVVLDKEHVDGQR--KTFINFDITIG 76
Query: 78 TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
+ C L +D + G Q ++ NI R G+ I + ++ V + KK+
Sbjct: 77 S-PCTMLHIDLFEHDGYQKTNIIENISLTRYAQSGEDINDLLEKRVPSKSKKQDFP---- 131
Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
P+ CG+CY + +KCCNTC EV + ++ K QC E +
Sbjct: 132 -------PDYCGNCY--LSTDKKCCNTCREVMDVFKAKGLTYYASFRWEQCIRE----GV 178
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV-HVHDIQPYTSAAFNTTHHIRHLS 256
+ E C+I G L+V + SG+FHIA G + + N+ H HD+ A+ H I L+
Sbjct: 179 LDFGNETCRIKGKLKVKKQSGNFHIALGANTNDNYKGHSHDLSS-VDASHKLNHVIHSLT 237
Query: 257 FG-------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP---TIYERLDGSKLGG 306
FG +L D + + L+G+ M YY+ P + +++D +
Sbjct: 238 FGEPVDYYKPQLTDVEMQLPELNGS------NYWMVTYYLHAAPERISTTDKIDSYRYSA 291
Query: 307 GDG----------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
G PGI F Y+ +P++V S+ + I + G + ++D
Sbjct: 292 FPSRRKVTNKTKKGFPGIVFYYDFAPMIVVYQPTHGSIRSIIVDICGIVGGAFSFAAIID 351
Query: 357 ALLHSCVKKISKVEIGGKTV 376
AL + I + GK
Sbjct: 352 ALAFGALSGIRGKTMIGKAA 371
>gi|432862155|ref|XP_004069750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oryzias latipes]
Length = 373
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 89/191 (46%), Gaps = 18/191 (9%)
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
T V+ T++ ++ C+I+G+L VN+V+G+FHI G S H H +
Sbjct: 146 TAVKGAQPAKTQRDSSSPPNACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSH 205
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
++N +H I HLSFG + PLDGT A + MF Y+I I+PT
Sbjct: 206 DSYNFSHRIDHLSFGEAIPG---LISPLDGTEKIAADYNHMFQYFITIVPTKLNTYKVSA 262
Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ ER G G+ GIF Y++S LMVK+TE+ ++ + G
Sbjct: 263 ETHQYSVTERERVINHAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWKFLVRLCGIVGGI 322
Query: 349 YITFMLVDALL 359
+ T ++ L+
Sbjct: 323 FSTTGMIHGLV 333
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E T GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVESTASGGTVSLIAFTLMAVLAFLEFFVYTNTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
KL I++DI V + C Y+ D +D
Sbjct: 73 KLRINVDITV-AMRCQYIGADVLD 95
>gi|225558748|gb|EEH07032.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 401
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 156/393 (39%), Gaps = 83/393 (21%)
Query: 2 VFSER------LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTT 55
F ER L+ DAF K + T GG TI+ + ++L ++ +++
Sbjct: 13 AFGERPGIGSGLRTFDAFPKTKPTYTTSTRRGGQWTIIVFALCAFLSLNELRTWYRGVEN 72
Query: 56 EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
V+ +L ++LDIVV +SCD L ++ D++G++ L + LD +P
Sbjct: 73 HHFSVEKGVSRELQMNLDIVV-AMSCDALRVNVQDAAGDRILASDL--------LDKQPT 123
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK-CCNTCNEVKEAYRY 174
A +++ E + N+ S E E + E K +Y+
Sbjct: 124 SW-------AAWNRELNGVTSGGGREYQTLNEEDSSRLMEQEADAHVGHALGEAKRSYKR 176
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHV 233
K P+L + + C+IYG LE N+V G FHI A G Y
Sbjct: 177 KFPKGPKLK--------------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYPEFGE 222
Query: 234 HV-HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
H+ HD AFN +H + LSFG PLD T++ F YY+ ++
Sbjct: 223 HLSHD-------AFNFSHMVTELSFGPHYPS---LLNPLDKTISVTPARFFKFQYYLSVV 272
Query: 293 PTIYERL-----------DGSKLGGGDGG-----------------------MPGIFFSY 318
PTIY R D + + + G +PGIFF Y
Sbjct: 273 PTIYTRAGIVDPYNHVLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYHIPGIFFKY 332
Query: 319 ELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ P+++ ++E+ L L +++ ++G +
Sbjct: 333 NIEPILLVVSEERGGLLALLVRLVNVLAGVVVA 365
>gi|410082748|ref|XP_003958952.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
gi|372465542|emb|CCF59817.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
Length = 354
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 158/385 (41%), Gaps = 81/385 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K E++ +K+ GG +++ + F+ ++ + +YF E+ VD
Sbjct: 4 LRTFDAFPKTEEEYQKKSSKGGLSSLLTYFFLIFIAWTEFGNYFGGYIDEQYTVDPEVKE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++DI V I C +L ++A D + ++ L E L L+ P P VN +
Sbjct: 64 DIQINMDIFV-NIPCKWLHINARDMTLDRKLAGEE------LKLEDMPFFIPFDTRVNDI 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKKWALPEL 182
+ T EL+ G AE + R+ + N + K +PE
Sbjct: 117 TE--------IVTPELD--RILGEAIPAEFREKIDMRQFYDENNHDE-----TKHFVPEF 161
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
+ GC ++G + VNRV+G I A G+ Y D +
Sbjct: 162 N--------------------GCHVFGSIPVNRVTGELQITAKGMGYP-------DREKA 194
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLD 300
N H I LSFG D PLD + +E S + Y++ +IPTIY++L
Sbjct: 195 PIDEVNFAHVINELSFGDFYPYID---NPLDNSAKFDQENPISAYVYHMNVIPTIYQKL- 250
Query: 301 GSKLGGGD------------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
G+++ G +PGIF Y PL + +T+K S +++
Sbjct: 251 GAEVDTNQYSVSEYHYTEADNAIRKAGRVPGIFLKYNFEPLSIVVTDKRLSFIQFVIRLV 310
Query: 343 CNISG-TYIT---FMLVDALLHSCV 363
+S YI F+LVD L + +
Sbjct: 311 AILSFIVYIASWLFILVDTALVAAM 335
>gi|145511431|ref|XP_001441642.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408894|emb|CAK74245.1| unnamed protein product [Paramecium tetraurelia]
Length = 329
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 90/191 (47%), Gaps = 24/191 (12%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---- 258
EGCQI GY+ VN+V G+FH++ I H Q + +H I H+SFG
Sbjct: 139 EGCQIAGYIIVNKVPGNFHVSAHAFGGILH---QVFQRSQIQTLDLSHTINHISFGEEDD 195
Query: 259 ---IKLQDDDERRKPLDGT--VAKAEEGASM-FNYYIKIIPTIYERLDGSKLGGGD---- 308
IK Q PLD T VA+ + G M F YYI ++PT Y + G++
Sbjct: 196 LMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPTTYVDVSGNEYYVHQFTAN 255
Query: 309 ------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH-S 361
+P +F Y+LSP+ VK + +S H +I + G + +VD ++H S
Sbjct: 256 SNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTIASIVDGMIHKS 315
Query: 362 CVKKISKVEIG 372
V + K E+G
Sbjct: 316 VVALLKKYEMG 326
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 6/89 (6%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
RL+ LD + K D E T G ++++ LFI+ L Y +V + E+FVD +RG
Sbjct: 8 RLRKLDIYRKLPADLTEPTTAGALISVIIILFITELQA-----YIEVDNSSEMFVDINRG 62
Query: 66 S-KLPIHLDIVVPTISCDYLALDAVDSSG 93
++ ++LDI CD L+LD D G
Sbjct: 63 GEQIRVNLDIEFHKFPCDILSLDVQDYYG 91
>gi|67901384|ref|XP_680948.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
gi|40742675|gb|EAA61865.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
gi|259484020|tpe|CBF79887.1| TPA: COPII-coated vesicle protein (Erv41), putative
(AFU_orthologue; AFUA_2G01530) [Aspergillus nidulans
FGSC A4]
Length = 394
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/376 (23%), Positives = 146/376 (38%), Gaps = 71/376 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + + GG T++ + + + + + T V+
Sbjct: 24 LRTFDAFPKTKPSYTTPSRRGGQWTVLILIICTIFSITEFRTWLKGHETHHFTVEKGVSH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++ D V+ + CD L ++ D++G++ L E + K+ EP +
Sbjct: 84 DLQLNFDAVI-HMPCDALHINIQDAAGDRVLASE--MLKK----------EPTSWKLWMD 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+ ++E T + D + A E + NE++ + K P+L
Sbjct: 131 KRNYHSSEYQTLSDSRGDEERVA----AMEEDVHAGHVLNELRRNGKRKFAKGPKLR--- 183
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ + C+IYG LE N+V G FHI A G Y H+ +A
Sbjct: 184 -----------RGDVVDSCRIYGSLEGNKVQGDFHITARGHGYRDGREHL------DHSA 226
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +H I LSFG PLD T+A E + Y++ I+PTIY R +L
Sbjct: 227 FNFSHIITELSFGPHYPS---LHNPLDKTIATTEFHYYKYQYFLSIVPTIYSRNQNLRLD 283
Query: 306 GGDGG------------------------------MPGIFFSYELSPLMVKITEKSKSLG 335
+PGIFF Y + P+M+ I+E+
Sbjct: 284 ALPSSSSARSNKNLIFTNQYAATSQSDAIPESPYVIPGIFFKYNIEPIMLLISEERTGFL 343
Query: 336 HLWTKIMCNISGTYIT 351
+L +I+ +SG +T
Sbjct: 344 NLLIRIVNTVSGVLVT 359
>gi|358333955|dbj|GAA52416.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Clonorchis sinensis]
Length = 306
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 20/165 (12%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSY-SINHVHVHDIQPYTSAA-FNTTHHIRHLSFGIK 260
+ C I G V +V+G+ H+ PG + HVH I P+ A FN +H I HLSFG +
Sbjct: 86 DACNIVGTFHVQKVAGNMHVLPGRPFDGPGGSHVH-IAPFVRLADFNFSHRINHLSFGAQ 144
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSKL----------GG 306
+ + R PLD + F YYI I+PT + LD + G
Sbjct: 145 VAN---RVNPLDAVEEISYNPMETFRYYISIVPTRVVYAFSSLDTYQYAITVKNRTAEGN 201
Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+PGIFFSY+ PL+V++TE + G ++ + G + T
Sbjct: 202 KSDSIPGIFFSYDTFPLLVQVTESRELFGTFLARLAALVGGLFAT 246
>gi|67623433|ref|XP_667999.1| serologically defined breast cancer antigen 84 like (42.9 kD)
(XQ234) [Cryptosporidium hominis TU502]
gi|54659178|gb|EAL37768.1| serologically defined breast cancer antigen 84 like (42.9 kD)
(XQ234) [Cryptosporidium hominis]
Length = 388
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/406 (21%), Positives = 164/406 (40%), Gaps = 75/406 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD---SS 63
+K DAF+KP +F KT +GG +TI+ + + L ++ Y ++ +E+ VD S+
Sbjct: 1 MKQFDAFSKPISEFRIKTAFGGYLTILSIIAMIILFYSELKYYLNITRKDEVTVDHLSSN 60
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
R L + L+ P + CD L + ++ + +++ DG
Sbjct: 61 RNINLRMQLEF--PKLPCDILGVRIINLQENKEIYLP----------DG----------- 97
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKKWAL 179
++ K+ G+ + + CG CY A CCNTC +V Y K L
Sbjct: 98 -GIEFVKI----GSNESNANSSSGCGPCYDASINNDLGVVNCCNTCKDVFNEYDKKGIKL 152
Query: 180 PELDTIVQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
P + + QC + S +++ N +EGC+I + +V G I+ + +
Sbjct: 153 PHVISFKQCDYDKS-KRISNALSSNLNSEGCKIKVNGYIPKVKGKIEISH--KRWVKYKE 209
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA----------SM 284
+ D++ S FN ++ + +L FG +L R K + + E +
Sbjct: 210 MTDLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFDDAY 269
Query: 285 FNYYIKIIPTIYERLDGS------------------KLGGG----DGGMPGIFFSYELSP 322
++ + IPT Y ++ L G D +PGI +Y+ +P
Sbjct: 270 IDFDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTP 329
Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
+VK+TE +S T+ I G + ++D + ++K
Sbjct: 330 FLVKMTESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 375
>gi|148678794|gb|EDL10741.1| ERGIC and golgi 2, isoform CRA_a [Mus musculus]
Length = 375
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 149/368 (40%), Gaps = 53/368 (14%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 21 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 81 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 138
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 139 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 165
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 166 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 219
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP-----TIYERLDG 301
N +H I HLSFG + PLDGT A + + KI ++ ER
Sbjct: 220 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDLVPTKLHTYKISADTHQFSVTERERI 276
Query: 302 SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
G G+ GIF Y+LS LMV +TE+ + ++ I G + T +LH
Sbjct: 277 INHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST----TGMLHG 332
Query: 362 CVKKISKV 369
K I ++
Sbjct: 333 IGKFIVEI 340
>gi|367025937|ref|XP_003662253.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
42464]
gi|347009521|gb|AEO57008.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
42464]
Length = 380
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 152/368 (41%), Gaps = 67/368 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF-VDSSRG 65
+K DAF K + + T GG T+ FIS ++ + T E F V+
Sbjct: 22 VKAFDAFPKAKPQYVQHTSAGGKWTVAM-AFISLILFWSELARWWRGTEEHTFAVEKGVS 80
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL-----DGKPIQEPQK 120
LPI+LD+VV + C L ++ D++G++ L + +R L DGK + +
Sbjct: 81 HVLPINLDVVV-RMRCADLHVNVQDAAGDRILAA--SALRRDPTLWAHWVDGKGVHRLGR 137
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
+ + +V T G T + ++ +G E + + V + KW+
Sbjct: 138 DA-----QGRVITGEGYTGADHDE------GFGEE-------HVHDIVALGRKRAKWS-- 177
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
T +L + C+IYG LE+N+V G FHI + H ++ +
Sbjct: 178 ------------RTPRLWGAEADSCRIYGSLELNKVQGDFHIT-----ARGHGYMEFGEH 220
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---- 296
AFN +H I LSFG L PLD TV A F Y++ ++PT Y
Sbjct: 221 LDHNAFNFSHIISELSFGPFLP---SLVNPLDRTVNTAPAHFYKFQYFLSVVPTTYSVGH 277
Query: 297 --ERLDGSKLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
ER S L + +PGIF Y++ P+++ I E S K++
Sbjct: 278 PEERGSRSVLTNQYAVTEQSKAVPENTVPGIFVKYDIEPILLNIVETRDSFFVFLIKVIN 337
Query: 344 NISGTYIT 351
+SG +T
Sbjct: 338 VVSGVLVT 345
>gi|440801547|gb|ELR22565.1| serologically defined breast cancer antigen 84 isoform 1, putative
[Acanthamoeba castellanii str. Neff]
Length = 355
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/187 (29%), Positives = 87/187 (46%), Gaps = 24/187 (12%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPG----LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGI 259
GC+++G EV +V G+ HIA G S+ + HVH I P A+FN +H I HLSFG
Sbjct: 151 GCRVFGKAEVQKVKGNLHIAAGSNAPQSHDGHQHHVHHITPEQVASFNVSHFIPHLSFGP 210
Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL--------------- 304
R PL T E A N+ I+++PTIYE G+ +
Sbjct: 211 AF---PRRTDPLSWTRV-IEPNAMQVNHMIQLVPTIYEDWGGNVIEGYQYSAQTNYKHIV 266
Query: 305 -GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
G +PG+F +++SP +++ E +S H T++ GT++ L+ + L
Sbjct: 267 PGASSFPLPGVFIKWDMSPFVIQYRETGRSFAHFLTRLCAITGGTFVVLGLIYSGLTKAF 326
Query: 364 KKISKVE 370
+ V
Sbjct: 327 PALRTVR 333
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 12/117 (10%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVST--------- 54
++RL+ D F K ED E+ G AVTIV L + +L + Y QV T
Sbjct: 10 AKRLRSFDIFPKSVEDVREQASAGAAVTIVGVLVMLFLFVSEFSSYTQVVTEAWRGGAIW 69
Query: 55 --TEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
+ +FVD++R + I+ ++V ++C + +D VD+ G+ +I K+ +D
Sbjct: 70 AEADTIFVDTTREKTMWINFELVFLQLACKEVEVDIVDNFGDPQ-RGRRDIQKQAVD 125
>gi|195997845|ref|XP_002108791.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
gi|190589567|gb|EDV29589.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
Length = 324
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 92/187 (49%), Gaps = 25/187 (13%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLS--YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
+ C+I+G + +N+V+G+FH+ G+S + + H HV D+ P S F +H I L+FG+
Sbjct: 137 DACRIHGNIPLNKVAGNFHVTAGMSINHPMGHAHVSDLVPRESVNF--SHRIDLLAFGVA 194
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLG 305
+ PLDG + M+ Y+IKI+PT + E
Sbjct: 195 APN---VINPLDGVEFITKITDKMYQYFIKIVPTKVKTFSVAIDTYQYSVTEHFSKVDHM 251
Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV---DALLHSC 362
G G+ G+FF Y+LSP+ V++TE G L ++ + G + T ++ +L++
Sbjct: 252 NGKHGVSGLFFKYDLSPISVQVTEARVPFGQLLIRLCGIVGGIFATSGMIHIFSSLIYEA 311
Query: 363 VKKISKV 369
V + K+
Sbjct: 312 VTRRKKL 318
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 1/86 (1%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ +K LDAF K ED E + GG ++ + I+ ++ +++ DY VD
Sbjct: 13 QEVKKLDAFPKIAEDCKESSTSGGTASVTAFFLITIMVIMELVDYSFSGVKYNYSVDKDI 72
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVD 90
SK+ +HLD+ + + C L D +D
Sbjct: 73 QSKMMLHLDLTI-AMKCRDLGADVLD 97
>gi|240275142|gb|EER38657.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Ajellomyces capsulatus H143]
gi|325094499|gb|EGC47809.1| COPII-coated vesicle protein [Ajellomyces capsulatus H88]
Length = 401
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 95/393 (24%), Positives = 157/393 (39%), Gaps = 83/393 (21%)
Query: 2 VFSER------LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTT 55
F ER L+ DAF K + T GG TI+ + ++L ++ +++
Sbjct: 13 AFGERPGIGSGLRTFDAFPKTKPTYTTSTRRGGQWTIIVFALCAFLSLNELRTWYRGVEN 72
Query: 56 EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
V+ +L ++LDIV + CD L ++ D++G++ L + LD
Sbjct: 73 HHFSVEKGVSRELQMNLDIVA-AMPCDALRVNVQDAAGDRILASD------LLD------ 119
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK-CCNTCNEVKEAYRY 174
++P + VT+ G E + N+ S E E + E K +Y+
Sbjct: 120 KQPTSWAAWNRELNGVTSGGGR---EYQTLNEEDSSRLMEQEADAHVGHALGEAKRSYKR 176
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHV 233
K P+L + + C+IYG LE N+V G FHI A G Y
Sbjct: 177 KFPKGPKLK--------------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYPEYGE 222
Query: 234 HV-HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
H+ HD AFN +H + LSFG PLD T++ F YY+ ++
Sbjct: 223 HLSHD-------AFNFSHMVTELSFGPHYPS---LLNPLDKTISVTPARFFKFQYYLSVV 272
Query: 293 PTIYERL-----------DGSKLGGGDGG-----------------------MPGIFFSY 318
PTIY R D + + + G +PGIFF Y
Sbjct: 273 PTIYTRAGIVDPYNHVLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYHIPGIFFKY 332
Query: 319 ELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ P+++ ++E+ SL L +++ ++G +
Sbjct: 333 NIEPILLVVSEERGSLLALLVRLVNVLAGVVVA 365
>gi|212527292|ref|XP_002143803.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
marneffei ATCC 18224]
gi|210073201|gb|EEA27288.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
marneffei ATCC 18224]
Length = 402
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 151/381 (39%), Gaps = 78/381 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K ++ + GG T++ + ++L + ++++ + + V+
Sbjct: 24 LRTFDAFPKTKPNYTTASRRGGQWTVIIFAICTFLTFGEFVNWYRGTENQHFSVEKGVSR 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L +++D+VV + C+ L ++ D+SG+ H + L DG E E +N
Sbjct: 84 QLQMNIDMVV-KMHCNDLRVNVQDASGD------HIMAGMLLMKDGTNW-ELWNEKLNQQ 135
Query: 127 KKKKV---TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
V T N L D + TR R K P+
Sbjct: 136 SSSGVPEYQTLNAEDVKRLMDQEDDAHARHVLSHTR-------------RNPKRKFPK-- 180
Query: 184 TIVQCKNEYSTEKLKNTF-TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
T +L + + T+ C+IYG LE N+V G FHI A G Y N V H
Sbjct: 181 ----------TPRLSSKYPTDSCRIYGSLESNKVHGDFHITARGHGY--NEVGQH----L 224
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
+ FN TH + LSFG PLD TVA E F Y+I ++PTIY + +
Sbjct: 225 DHSNFNFTHMVTELSFGPHYPS---LLNPLDKTVASTETHYYKFQYFINVVPTIYAKGNN 281
Query: 302 S-------------------------------KLGGGDGGMPGIFFSYELSPLMVKITEK 330
+ L PGIFF Y + P+++ ++E+
Sbjct: 282 AVEKYTANPAKAFEKSRNTIFTNQYSATSQSHPLPESPFNTPGIFFKYNIEPILLFVSEE 341
Query: 331 SKSLGHLWTKIMCNISGTYIT 351
S L +++ +SG +T
Sbjct: 342 RGSFLALLVRLVNVVSGVIVT 362
>gi|345325542|ref|XP_001508860.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Ornithorhynchus anatinus]
Length = 372
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 125/288 (43%), Gaps = 44/288 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + +++L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMAFLTVMEFLVYQDTWMKYEYEVDKDFAS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYIGADVLDLAETMVASADGLVYEPVI-FDLSPQQREWQRMLQMI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QNR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + S + + C+I+G+L VN+V+G+FHI G + H H + ++
Sbjct: 159 --RGDLSLQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPGIIN---PLDGTEKIAVDHNQMFQYFITVVPT 256
>gi|340514865|gb|EGR45124.1| predicted protein [Trichoderma reesei QM6a]
Length = 372
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 144/362 (39%), Gaps = 63/362 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + KT GG T+ L S + ++ +++ S V+ G
Sbjct: 21 VSAFDAFPKAKPQYVTKTAGGGKWTVAMLLVSSIFLWSEIGRWWRGSEHHTFAVEKGIGH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I+LDIVV +SC L ++ D+SG++ L G + V
Sbjct: 81 DMQINLDIVVK-MSCGDLHVNVQDASGDR-------------ILAGDKLTRDATNWEQWV 126
Query: 127 KKKKV----TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
K V ENG T +GA E + + V + + KWA
Sbjct: 127 DAKGVHRLGKNENGKLDT-------GAGWHGAHDEGFGEEHVHDIVSLSRKKAKWA---- 175
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQP 240
T K + T+ C++YG L++N+V G FHI A G YS H+ HD
Sbjct: 176 ----------KTPKPRGR-TDSCRMYGSLDLNKVQGDFHITARGHGYSGIGGHLDHD--- 221
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---E 297
FN +H I LS+G PLD TV A F YY+ ++PT+Y
Sbjct: 222 ----KFNFSHIISELSYGPFYP---SLINPLDRTVNTAIVHFHKFQYYLSVVPTVYIASH 274
Query: 298 RLDGSKLGG--------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
R+ + D +PGIFF Y++ P+M+ + E K++ SG
Sbjct: 275 RIVNTNQYAVTEQSKTISDHQVPGIFFKYDIEPIMLSVEETRDGFFAFLLKLVNVFSGVM 334
Query: 350 IT 351
+
Sbjct: 335 VA 336
>gi|119497911|ref|XP_001265713.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
fischeri NRRL 181]
gi|119413877|gb|EAW23816.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
fischeri NRRL 181]
Length = 397
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 156/381 (40%), Gaps = 79/381 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + + GG T++ L ++ + + + + + V+
Sbjct: 24 LKTFDAFPKTKPSYTAPSPRGGQWTVLILLVCTFFSISEFRTWLKGTEKQHFSVEKGISH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LDIVV + CD L ++ D+SG++ L E + KR EP
Sbjct: 84 DLQLNLDIVV-HMPCDTLDVNIQDASGDRVLAGE--LLKR----------EP-------- 122
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKKWALPEL 182
T+ +L + YG +T +++ + +E +EA + L E+
Sbjct: 123 -----------TSWQLWMDKRNFEIYGGAHEYQTLSQEHADRLSE-QEADAHVHHVLGEV 170
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
+ K + + + C+IYG LE N+V G FHI A G Y H+ P+
Sbjct: 171 RRNPRKKFAKGPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGY-------HNSAPH 223
Query: 242 TS-AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-- 298
FN +H I LSFG PLD T+A E+ + Y++ I+PTIY +
Sbjct: 224 LEHKTFNFSHMITELSFGPHYP---TLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGN 280
Query: 299 --LD--------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
LD S + +PGIFF Y + P+++ I+E+
Sbjct: 281 LALDTYANAPPTSRYSKNLIFTNQYAATSQSSAIPENPYFIPGIFFKYNIEPILLMISEE 340
Query: 331 SKSLGHLWTKIMCNISGTYIT 351
S L +++ ISG +T
Sbjct: 341 RTSFLSLLVRLVNTISGVMVT 361
>gi|410907774|ref|XP_003967366.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Takifugu rubripes]
Length = 388
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 85/176 (48%), Gaps = 18/176 (10%)
Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
+T C+I+G+L VN+V+G+FHI G S H H + ++N +H I HLSFG
Sbjct: 162 STSLHACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDHLSFG 221
Query: 259 IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSK 303
+D PLDGT + + +F Y+I I+PT + E+
Sbjct: 222 ---EDLPGIISPLDGTEKVSADSNHIFQYFITIVPTKLNTYRVSAETHQYSVTEQDRAIN 278
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
G G+ GIF Y+++ LMVK+TE+ L ++ I G + T ++ ++
Sbjct: 279 HAAGSHGVSGIFMKYDINSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGIV 334
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E T GG V+++ + ++ L ++ Y E VD GS
Sbjct: 13 VKELDAFPKVPESYVESTASGGTVSLIAFSLMAILAFLEFFVYRDTWMKYEYEVDKDFGS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
KL I++DI V + C Y+ D +D
Sbjct: 73 KLRINVDITV-AMRCQYIGADVLD 95
>gi|261193579|ref|XP_002623195.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
gi|239588800|gb|EEQ71443.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
gi|239613876|gb|EEQ90863.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ER-3]
gi|327349942|gb|EGE78799.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ATCC 18188]
Length = 401
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 159/392 (40%), Gaps = 81/392 (20%)
Query: 2 VFSER------LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTT 55
F ER L+ DAF K + TV GG TI+ + ++L ++ +++
Sbjct: 13 AFGERPGIGSGLRTFDAFPKTKPTYTSSTVRGGQWTIIVFALCAFLSINELRTWYRGVEN 72
Query: 56 EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
V+ +L ++LDIVV + CD L ++ D+ G++ L + LD +P
Sbjct: 73 HHFSVEKGISRELQMNLDIVV-AMPCDALRVNVQDAVGDRILASDL--------LDKQPT 123
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
A +++ + + E + N+ + E E + + + + EA R
Sbjct: 124 SW-------AAWNRELNVVSSGGSREYQTLNEEDAVRLMEQE--EDVHVGHALGEAQRSY 174
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVH 234
K P+ + + +N + C+IYG L N+V G FHI A G Y H
Sbjct: 175 KRKFPKGPKLKRGEN-----------ADSCRIYGSLVGNKVQGDFHITARGHGYFEFGEH 223
Query: 235 V-HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
+ HD +FN +H I LSFG PLD T++ + YY+ I+P
Sbjct: 224 LSHD-------SFNFSHMITELSFGPHYS---TLLNPLDKTISTTPAHFHKYQYYMSIVP 273
Query: 294 TIYERL-----------DGSKLGGGDGG-----------------------MPGIFFSYE 319
TIY R D S + G +PGIFF Y
Sbjct: 274 TIYTRAGVVDPYSQALPDPSTITPSQRGNTIFTNQYAVTSRSHELPDAEYDVPGIFFKYT 333
Query: 320 LSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ P+++ ++E+ SL L +++ ++G +
Sbjct: 334 IEPILLVVSEERGSLLALLVRLVNVLAGVVVA 365
>gi|336269097|ref|XP_003349310.1| hypothetical protein SMAC_05593 [Sordaria macrospora k-hell]
gi|380089883|emb|CCC12416.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 379
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 147/364 (40%), Gaps = 65/364 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ L L + +++ + + V+
Sbjct: 23 VSAFDAFPKSKPQYVTRTTAGGKWTVFVTLISFILFWSEASRWWRGTESHTFAVEKGVSH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDG--KPIQEPQ 119
L I+LDIVV + C + ++ D++G++ LH + +++ +D G K ++ Q
Sbjct: 83 SLDINLDIVV-KMKCQDIHINVQDAAGDRILAASKLHRDPTVWQHWVDNKGIHKLGRDAQ 141
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+VV G + D +G E + + V + KWA
Sbjct: 142 GKVVT-----------GEDYLQGHDEG-----FGEE-------HVHDIVALGRKRAKWA- 177
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
T +L + C+++G LE+N+V G FHI + H ++ Q
Sbjct: 178 -------------RTPRLWGATPDSCRVFGSLELNKVQGDFHIT-----AKGHGYMEFGQ 219
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
+AFN +H I LS+G L PLD TV A F Y+I ++PT+Y
Sbjct: 220 HLDHSAFNFSHIISELSYGPFLP---SLVNPLDQTVNLATSNFHKFQYFISVVPTVYSVS 276
Query: 300 DGSKLGGGDGG------------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
G + +PGIF Y++ P+++ I E+ S K++ ISG
Sbjct: 277 GGRSIVTNQYAVTEQSQEVTERIIPGIFVKYDIEPILLNIVEERDSFLLFLIKVVNVISG 336
Query: 348 TYIT 351
+
Sbjct: 337 ALVA 340
>gi|388583623|gb|EIM23924.1| DUF1692-domain-containing protein [Wallemia sebi CBS 633.66]
Length = 396
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 143/358 (39%), Gaps = 70/358 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + ++ GG T++ + L+ ++ D+ + VD++ +
Sbjct: 10 LREFDAFPKTQASYKIRSKQGGIATVIVIFALVLLVFHEIGDWLYGHNEYQFSVDTTTET 69
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
++ +++D+ V + C YL +D D+ G++ L + +I K DG
Sbjct: 70 EMQLNVDLTV-AMPCHYLNVDIRDAVGDR-LKLSDSIQK-----DG-------------- 108
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
TT E E + GS K VK++ + +KW P
Sbjct: 109 -----------TTFEPEKYRQIGSA--------KQSTLSRIVKDSKKGRKWFRP-----T 144
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
+N + K C+IYG +E +V+G+ HI G YS ++
Sbjct: 145 STRNRFPKTKKLIKDGPACRIYGSVETKKVNGNMHITTLGHGYS-------SLEHTDHKL 197
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N +H I SFG Q +PLD +V + ++ Y++ ++PT Y G L
Sbjct: 198 MNLSHTIDEFSFG---QHFPYISQPLDKSVEITDNHFPVYQYFMHVVPTTYVDASGHSLS 254
Query: 306 GGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
G+PG+FF YEL P+ + ++ + S L ++ I G +
Sbjct: 255 TNQYSAREDIKFIHNHQRGIPGLFFRYELEPIHLSLSATTMSFTKLLIRLTALIGGVW 312
>gi|328862174|gb|EGG11276.1| hypothetical protein MELLADRAFT_33547 [Melampsora larici-populina
98AG31]
Length = 361
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 144/359 (40%), Gaps = 76/359 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ DAF K + E++ GG +TIV I LI ++ +Y + T VD++ G
Sbjct: 14 IREFDAFPKTIPTYKERSSRGGILTIVVGFLIMILIWHELREYLFGAATYSFSVDNTVGH 73
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++ D+ + + C YL++D D+ G++ +H+
Sbjct: 74 DLGLNFDVTI-NMPCHYLSIDVRDAVGDR-MHISDEF----------------------- 108
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
KK+ T + LE N G + V++A W P
Sbjct: 109 -KKEGTEFSIGQAARLETNNDAG------------ISASKMVRDAQ--GGWTRPTF---- 149
Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+K K EG C+I+G V +V+G+ HI ++ H ++ +
Sbjct: 150 --------KKTKPLIPEGPACRIFGSTHVKKVTGNLHIT-----TLGHGYL-SWEHTDHQ 195
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-------- 296
N TH I SFG + +PLD +V ++ +F Y+I ++PT Y
Sbjct: 196 LMNLTHVISEFSFGEFFPN---MVQPLDNSVEITDKPFHIFQYFISVVPTTYINSGGRQV 252
Query: 297 -----ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
D S+ G+PGIFF Y++ P+ + I E++ +L ++ + G +
Sbjct: 253 FTNQYSVTDMSRSTEHGRGVPGIFFKYDIEPMYLTIRERTTTLVQFLVRLAGIVGGIVV 311
>gi|440794754|gb|ELR15909.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 306
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 86/190 (45%), Gaps = 35/190 (18%)
Query: 205 CQIYGYLEVNRVSGSFHIA-----PGLSY--SIN-------HVHVHDIQPYTSAAFNTTH 250
C + G++ V ++ G F I+ P Y S+N H H H P S FN TH
Sbjct: 121 CLLTGHMAVRKIRGQFQISSRRFNPFSIYGSSLNKHTPTEDHPHPH---PEDSLPFNVTH 177
Query: 251 HIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLDGSKLGGGDG 309
IR LSFG K+ D PLDG V EG S ++Y+++I+P Y DG +
Sbjct: 178 RIRELSFGPKVLPD---VGPLDGIVQTMREGERSQYSYFLQIVPASYHYADGRVVESYSF 234
Query: 310 GM-----------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDAL 358
PG+F+ Y+ SP + E KS H T+ I GT++ F L+ AL
Sbjct: 235 AFTMHTESRSELAPGVFWKYDFSPYATSLREVPKSFSHFITRCCAVIGGTFVVFGLLSAL 294
Query: 359 ---LHSCVKK 365
L + KK
Sbjct: 295 ASRLETAAKK 304
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 23 KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGSKLPIHLDIVVPTI 79
KTV GAV+I+C+ + YL +V +Y + T ++ VD++ L + L + P +
Sbjct: 25 KTVSSGAVSILCFFLLGYLFLQEVAEYQKAEVTSQVSVDTTIRNEFDSLLVSLTVEFPNL 84
Query: 80 SCDYLALDAVDSSG 93
C+ +DA D +G
Sbjct: 85 GCEDFGVDAADYTG 98
>gi|123483410|ref|XP_001324018.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121906894|gb|EAY11795.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 384
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 135/320 (42%), Gaps = 60/320 (18%)
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
+ LD+ V + C +L LD +D+ G L++ RL + ++K
Sbjct: 72 VSLDVKV-NMPCYFLHLDVIDNLGFNQLNINTTAKFIRL----------------SAQEK 114
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV-------KEAYRYKKWALPEL 182
++ N T ++ C SCYG E CCN+C + +A K W
Sbjct: 115 ELGYANETISS------ICHSCYGLLPEG-SCCNSCEQTLLLHIMNGKAANTKDWP---- 163
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
QC+ + + +N E C+I G + +N+ G+FHIAPG + + HVHD+
Sbjct: 164 ----QCQGKNPGKVYEN---EKCRIKGKVCLNKAQGNFHIAPGTNMKERYGHVHDLSGQL 216
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYE---R 298
F+ +H I+ + G K+ PL + ++ Y + + P +Y+ R
Sbjct: 217 -PNFDLSHVIQGMRVGPKI---PLTYNPLRYVQQIQNPNQPVVYRYDLVVTPAVYKSGNR 272
Query: 299 LDGSK----------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ G G GG PGI+F Y +P V + ++ ++T I +SG
Sbjct: 273 ILGKGYDYTAMINRFFVGNSGGAPGIYFHYSFTPYGVTVNATYLTIAQIFTSIFGFMSGA 332
Query: 349 YITFMLVDALLHSCVKKISK 368
Y F ++D + K+++K
Sbjct: 333 YAIFSIIDESMFKDDKRMAK 352
>gi|452988546|gb|EME88301.1| hypothetical protein MYCFIDRAFT_25415 [Pseudocercospora fijiensis
CIRAD86]
Length = 380
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 146/366 (39%), Gaps = 68/366 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + E+T GG T+ L +L ++ +++ STT V+ G
Sbjct: 23 VKAFDAFPKTKPSYQERTSTGGIWTVTLILASLFLTWSELARWWKGSTTHTFSVEQGIGH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I+LD+VV ++C+ L ++ D++G++ L G Q+
Sbjct: 83 DLQINLDMVV-MMNCEDLHVNVQDAAGDR-------------ILAGSVFQKDPTIWTRWD 128
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
KK K + + E + G Y E N + + R+ K
Sbjct: 129 KKLKA---HALGHDKQERLGEAGKDYKEE----DVHNYLSVAHHSKRFPK---------- 171
Query: 187 QCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
T K+ +T + C+IYG + N+V G FHI + H ++ + +
Sbjct: 172 -------TPKIPRGWTADSCRIYGTMHGNKVQGDFHIT-----ARGHGYLEFAEHLDHSK 219
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +H I LSFG PLD T A + F Y++ ++PT+Y D L
Sbjct: 220 FNFSHRINELSFGPFYP---SLENPLDNTFATTDINYYKFQYFLSVVPTVYT-TDARALR 275
Query: 306 GGDGG--------------------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
D +PGIF +++ P+ + I E+ S L+ +I+ +
Sbjct: 276 LLDNNFVFTNQYAVTEQSRKVSENFVPGIFIKFDMEPIGLTIAEEWSSFPALFIRIVNVV 335
Query: 346 SGTYIT 351
SG +
Sbjct: 336 SGLLVA 341
>gi|313220803|emb|CBY31643.1| unnamed protein product [Oikopleura dioica]
Length = 289
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 32/189 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC G VN+V G+FH++ H +QP + H I LSFG ++
Sbjct: 111 KGCIFGGTFHVNKVPGNFHVS---------THSSQVQPQNP---DMNHEIHELSFGESMK 158
Query: 263 DDDERRK----PLDGTVAKAEEGASMFNYYIKIIPTIYERL---------------DGSK 303
+ PL+G AE+ AS +Y +K++PT+Y+ + D
Sbjct: 159 GINSNLPANFIPLNGKKTGAEKMASH-DYTLKVVPTVYQDIKKRTKFGYQFTAVYKDFVA 217
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
G G MP I+F YE+SP+ VK TEKSK L H T I GT+ ++D+++ S
Sbjct: 218 FGHGHRVMPAIWFRYEVSPITVKYTEKSKPLYHFLTTFCAIIGGTFTVAGMIDSMIFSAH 277
Query: 364 KKISKVEIG 372
+ + K G
Sbjct: 278 QMVKKAGEG 286
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 1/104 (0%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ D + K +D + T G ++I LFI +L+ + + + EL+VD + G
Sbjct: 5 IRRFDIYRKVPKDLTQPTTAGAVISISSGLFILFLLVSEFLTFMRTDIVSELYVDDPTVG 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
K+P+++ + +P I C +L +D D G + N K ++
Sbjct: 65 DKIPVNIRMSLPGIECKFLGIDIQDEHGRHEVGYLENTRKDPIN 108
>gi|313230728|emb|CBY08126.1| unnamed protein product [Oikopleura dioica]
Length = 289
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 32/189 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC G VN+V G+FH++ H +QP + H I LSFG ++
Sbjct: 111 KGCIFGGTFHVNKVPGNFHVS---------THSSQVQPQNP---DMNHEIHELSFGESMK 158
Query: 263 DDDERRK----PLDGTVAKAEEGASMFNYYIKIIPTIYERL---------------DGSK 303
+ PL+G AE+ AS +Y +K++PT+Y+ + D
Sbjct: 159 GINSNLPANFIPLNGKKTGAEKMASH-DYTLKVVPTVYQDIKKRTKFGYQFTAVYKDFVA 217
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
G G MP I+F YE+SP+ VK TEKSK L H T I GT+ ++D+++ S
Sbjct: 218 FGHGHRVMPAIWFRYEVSPITVKYTEKSKPLYHFLTTFCAIIGGTFTVAGMIDSMIFSAH 277
Query: 364 KKISKVEIG 372
+ + K G
Sbjct: 278 QMVKKAGEG 286
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 1/104 (0%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ D + K +D + T G ++I LFI +L+ + + + EL+VD + G
Sbjct: 5 IRRFDIYRKVPKDLTQPTTTGAVISISSGLFILFLLVSEFLTFMRTDIVSELYVDDPTVG 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
K+P+++ + +P I C +L +D D G + N K ++
Sbjct: 65 DKIPVNIRMSLPGIECKFLGIDIQDEHGRHEVGYLENTRKDPIN 108
>gi|167523643|ref|XP_001746158.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775429|gb|EDQ89053.1| predicted protein [Monosiga brevicollis MX1]
Length = 1400
Score = 83.6 bits (205), Expect = 1e-13, Method: Composition-based stats.
Identities = 86/355 (24%), Positives = 149/355 (41%), Gaps = 82/355 (23%)
Query: 3 FSERLKGLDAFTK--PYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
E++K LD F K P D ++ G VTI+ L I LI ++ Y V E V
Sbjct: 10 LQEQVKQLDVFPKVEPDMDIQTTSISGAVVTIIVGLAIVGLIFTELMYYRTVDVVYEYAV 69
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPI 115
D+ + + +D+ + + C+ +D +D SG Q + VE +K +
Sbjct: 70 DTDLDPHMNLTVDMTI-AMPCENFGVDYIDVSGRSTDALQFMAVEPAHFK---------L 119
Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
Q+E ++ + +V + G+ L+ ++ YG++ E
Sbjct: 120 SPNQQEWLD--QWAEVKAQEGSKG--LDSLHRF--LYGSKREPMPT-------------- 159
Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS--YSINHV 233
A PE+D +GC+++G + V RVS +FH + G S ++ H
Sbjct: 160 --AAPEIDA----------------EPDGCRVHGTMPVARVSSNFHFSAGKSVHHASGHA 201
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR--KPLDGTVAKAEEGASMFNYYIKI 291
HV I P N +H I SF E+R LDG + ++ +F Y++K+
Sbjct: 202 HV-PIDP-NQKTINFSHRIDRFSF------SSEQRGAMALDGDMKVSDSNKQLFQYFLKV 253
Query: 292 IPTIYERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKS 331
+PT +R+D ++ L + +PGI F YE+ P+ V + E++
Sbjct: 254 VPTTTKRMDEAEPFRSNQYSVTEQHHILAANERKLPGIHFKYEIEPIGVLVHEQA 308
>gi|358388143|gb|EHK25737.1| hypothetical protein TRIVIDRAFT_33251 [Trichoderma virens Gv29-8]
Length = 370
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 145/361 (40%), Gaps = 63/361 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + KT GG T+ L S + ++ +++ + V+ G
Sbjct: 21 VSAFDAFPKSKPQYVTKTSGGGKWTVAMLLISSIFLWTEIGRWWRGAEHHTFAVEKGIGH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL---HVEHNIYKRRLDLDGKPIQEPQKEVV 123
+ ++LDIVV + CD L ++ D+SG++ L + + +DGK + K
Sbjct: 81 DMQVNLDIVV-KMDCDDLHINVQDASGDRILAGDKLNRDATTWHQWVDGKGMHRLGK--- 136
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK-KWALPEL 182
+ENG T G + A + +++ R K KWA
Sbjct: 137 ---------SENGKLDT--------GEGWLAAHDEGFGEEHVHDIVALSRKKAKWA---- 175
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
T K + C++YG L++NRV G FHI A G Y H+ HD
Sbjct: 176 ----------KTPSPKGR-PDSCRMYGSLDLNRVQGDFHITARGHGYGGQHLD-HD---- 219
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---ER 298
FN +H I +S+G PLD TV A F YY+ ++PT+Y R
Sbjct: 220 ---KFNFSHIISEMSYGPFYP---SLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYLANNR 273
Query: 299 LDGSKLGG--------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ + D +PGIFF Y++ P+M+ + E KI+ SG +
Sbjct: 274 IVNTNQYAVTEQSKTISDHQVPGIFFKYDIEPIMLSVEESRDGFFTFLVKIVNIFSGVMV 333
Query: 351 T 351
Sbjct: 334 A 334
>gi|57208596|emb|CAI42845.1| ERGIC and golgi 3 [Homo sapiens]
Length = 129
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 69/129 (53%), Gaps = 23/129 (17%)
Query: 270 PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-------------------KLGGG--- 307
PLD T A + + MF Y++K++PT+Y ++DG K+ G
Sbjct: 1 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAPLPPQVLRTNQFSVTRHEKVANGLLG 60
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
D G+PG+F YELSP+MVK+TEK +S H T + I G + L+D+L++ + I
Sbjct: 61 DQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQ 120
Query: 368 -KVEIGGKT 375
K+++G T
Sbjct: 121 KKIDLGKTT 129
>gi|328700149|ref|XP_003241164.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Acyrthosiphon pisum]
gi|328700151|ref|XP_001951220.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Acyrthosiphon pisum]
gi|328700153|ref|XP_003241165.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 3 [Acyrthosiphon pisum]
Length = 289
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 123/292 (42%), Gaps = 48/292 (16%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAV-TIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+K LD+F K E+ +E + Y + T++ +F +L+ ++ + Q D+
Sbjct: 14 VKELDSFPKVQEEIYEPSTYSNVILTVLISVFGLWLLISEIQYFLQEHYIYRFVPDTDYE 73
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
SKLPI++DI V + +CD + D VD++G+
Sbjct: 74 SKLPINIDITVAS-TCDSIGADIVDTTGQ------------------------------- 101
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE-VKEAYRYKKWALPELDT 184
N EL+ + + + + N ++E Y K L D
Sbjct: 102 ---------NMMLFGELKTDDTWWEMTKEQQQHFEKMRKFNAYLREEYHSMKDILWMFDD 152
Query: 185 IVQCKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP-YT 242
KN+ K NT + C+I+G L +N+V G+FHI PG S + HVH P +
Sbjct: 153 YNTLKNKIFVRTDKPNTLPDACRIHGSLILNKVIGNFHITPGKSLIVPGGHVHLTGPFFG 212
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
S A N +H I SFG+ + PL+G + + E A + Y+I ++ T
Sbjct: 213 SEATNFSHRINQFSFGVPTKG---IIYPLEGELYETNENAVSYKYFIDVVAT 261
>gi|452847826|gb|EME49758.1| hypothetical protein DOTSEDRAFT_58941 [Dothistroma septosporum
NZE10]
Length = 402
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 150/395 (37%), Gaps = 97/395 (24%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
S +K DAF K + ++T GG T+V + L ++ ++ TT V+
Sbjct: 20 SSVVKSFDAFPKTKPSYTQRTESGGVWTVVLIVASLLLGWSEISGWWTGKTTHTFAVEQG 79
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
G L I+LD+VV + C L ++ DSSG++ L
Sbjct: 80 VGHDLQINLDVVV-AMQCGDLHVNVQDSSGDRIL------------------------AG 114
Query: 124 NAVKKKKVTTEN-GTTTTEL--EDPNKCGSCY---GAETETRKCCNTCNEVKEAYRYKKW 177
+A+KK T G + L E + S Y GAE E N K ++KK
Sbjct: 115 SALKKDPTTWRQWGGRSHALASEKEERIRSGYDGKGAEYEEEDVHNYLGAAKRQKKFKK- 173
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
P L Q + C+IYG + N+V G FHI A G Y H+
Sbjct: 174 -TPGLPWGAQA--------------DSCRIYGSMHGNKVQGDFHITARGHGYMEFGAHL- 217
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
+ FN +H + LSFG PLD TVA + F YY+ ++PTIY
Sbjct: 218 -----DHSTFNFSHTVNELSFGPFYP---SLTNPLDNTVATTPDHFYKFQYYLSVVPTIY 269
Query: 297 -------ERLDG---SKLGGGDG------------------------------GMPGIFF 316
++D S G DG +PG+F
Sbjct: 270 TTDAKTLRKIDKHHESPSSGEDGLSQYPHRYSRNTVFTNQYAVTEQSHRVPENAVPGVFI 329
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+++ P+ + I E+ S+ L +++ +SG +
Sbjct: 330 KFDIEPIGLTIAEEWSSIPALLIRLVNVVSGLLVA 364
>gi|331239265|ref|XP_003332286.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309311276|gb|EFP87867.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 366
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 156/377 (41%), Gaps = 78/377 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ DAF K ++ +++ GG +T+ I LI ++ +Y VD S
Sbjct: 15 IREFDAFPKTLPNYKQRSSRGGVLTVFVACLILVLIWHELKEYLFGEPKYSFLVDPSIAH 74
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I++D+ V + C YL++D D+ G++ +++ K D + +A
Sbjct: 75 SLGINIDLTV-AMPCHYLSVDIKDAVGDR-MYMNQEFKKEGTHFD----------IGDAK 122
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + + + T++ +K G +G +TR +P+
Sbjct: 123 RIDHNNSTSELSATQILHASKKGQTFG---KTRPL-----------------VPD----- 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
C+IYG +V +V+G+ HI ++ H ++ +
Sbjct: 158 ---------------GPACRIYGNTQVKKVTGNLHIT-----TLGHGYL-SWEHTDHKLM 196
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-ERL------ 299
N +H I SFG Q + +PLD +V ++ +F Y+I ++PT Y +RL
Sbjct: 197 NLSHVITEFSFG---QFFPKIVQPLDNSVELTDKPFHIFQYFISVVPTTYIDRLGRQLHT 253
Query: 300 ------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI--- 350
D S+ G+PG+FF Y++ P+ + + E++ SL ++ I G +
Sbjct: 254 NQYSVTDMSRPVEHGQGIPGLFFKYDMEPMSLILHERTTSLIQFLVRLAGMIGGIVVCTG 313
Query: 351 -TFMLVDALLHSCVKKI 366
TF LVD + V I
Sbjct: 314 WTFRLVDRFVQKIVPGI 330
>gi|254572003|ref|XP_002493111.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv46p [Komagataella pastoris GS115]
gi|238032909|emb|CAY70932.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv46p [Komagataella pastoris GS115]
Length = 333
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 152/393 (38%), Gaps = 84/393 (21%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + ++ DAF K ++ G TI+ FI +LI V++ Y + +
Sbjct: 1 MDYHRTIRVFDAFPKTEPVNTVRSTKGSYSTILMGFFILFLIWVEIGGYVDGYIDRQFML 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D + L I+LD+ V T C+YL + D + ++ L E +L+ +G P
Sbjct: 61 DRNIQRVLNINLDMFVAT-PCNYLHTNVKDITQDRFLAQE------QLNFEGVNFFIPDS 113
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
VN E+ +T +L++ ++ AL
Sbjct: 114 FRVNG-------DESQGSTLDLDEV----------------------------MRESALA 138
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEG----CQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
E + K +FT G C I+G + VN+V G FHI G Y
Sbjct: 139 EF-------------REKKSFTHGDAPACHIFGSIPVNKVHGFFHITGKGYGY------- 178
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
D A N TH I SFG + PLD T + FNYY+ ++PT
Sbjct: 179 RDRSIVPKEALNFTHVISEFSFG---EFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTE 235
Query: 296 YERL----DGSKLG------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
Y++L D ++ G PG+FF+Y+ P+++ I EK S +++
Sbjct: 236 YKKLGIVIDTTQYSMTVTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTIC 295
Query: 346 SGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
G M+V + V K+ +V G + +
Sbjct: 296 GGI----MVVAKWIFRTVDKLIRVVFGNQVANR 324
>gi|213512030|ref|NP_001133523.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
gi|209154344|gb|ACI33404.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
Length = 381
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 81/171 (47%), Gaps = 18/171 (10%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
C+I+G+L VN+V+G+FHI G + H H + +N +H I HLSFG ++
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFG---EE 225
Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGGD 308
PLDGT + MF Y+I I+PT + ER G
Sbjct: 226 IPGIINPLDGTEKVCTDHNQMFQYFITIVPTKLNTYQISADTNQYSVTERERVINHAVGS 285
Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
G+ GIF Y++S LMVK+TE+ L ++ I G + T ++ ++
Sbjct: 286 HGVSGIFMKYDISSLMVKVTEQHMPLWRFLVRLCGIIGGIFSTTGMIHGMV 336
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E T GG V+++ + ++ L ++ Y E VD S
Sbjct: 14 VKELDAFPKVPESYVETTATGGTVSLIAFTAMALLAFLEFFVYRDTWMQYEYEVDKDFSS 73
Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
KL I++DI V + C ++ D +D
Sbjct: 74 KLRINIDITV-AMRCQFVGADVLD 96
>gi|405968654|gb|EKC33703.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Crassostrea gigas]
Length = 345
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 82/176 (46%), Gaps = 24/176 (13%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSI-NHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
+ C++YG LEVN+V+G+FHI G S + H H +N +H I H SFG +
Sbjct: 122 DACRVYGSLEVNKVAGNFHITAGKSVPVFPRGHAHISMMVHEKEYNFSHRIDHFSFGESV 181
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------------IYERLDGSKLG 305
+ PLDG + + +FNY+IKI+PT + +R
Sbjct: 182 KG---IINPLDGEEQVSSDNFHVFNYFIKIVPTEVRTYAAGNIDTYQFSVTQRNRTINHS 238
Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
G G+PGIF Y+L+ L +++ EK + + +C I G V +LH+
Sbjct: 239 KGSHGVPGIFVKYDLNALKIRVVEKHRPFSQFLIR-LCGIVG---GIFAVSGMLHN 290
>gi|145235453|ref|XP_001390375.1| COPII-coated vesicle protein (Erv41) [Aspergillus niger CBS 513.88]
gi|134058058|emb|CAK38286.1| unnamed protein product [Aspergillus niger]
gi|350632895|gb|EHA21262.1| hypothetical protein ASPNIDRAFT_191708 [Aspergillus niger ATCC
1015]
Length = 399
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 145/381 (38%), Gaps = 77/381 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + + GG T++ + + + + S V+ G
Sbjct: 24 LKTFDAFPKTKPSYTAPSRRGGQWTVLILVICTVFTFSEFRTWLHGSENHHFSVEKGVGH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LD+VV + CD L ++ D+SG++ L G +Q + +
Sbjct: 84 DLQLNLDLVV-RMPCDTLDVNIQDASGDR-------------ILAGDLLQRERTSWKLWM 129
Query: 127 KKKKVTTENGT---TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
K+ T G T ED ++ A + EV++ R K P L
Sbjct: 130 DKRNRETSGGVHEYQTLSQEDTDRIS----AREADAHVHHVLGEVRKNPRRKFAKGPRLR 185
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
+ + C+IYG LE N+V G FHI A G Y H+
Sbjct: 186 --------------RGDTVDSCRIYGSLEGNKVQGDFHITARGHGYRNFGEHL------D 225
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER---- 298
FN +H + LSFG PLD T+A E + Y++ ++PT+Y +
Sbjct: 226 HGVFNFSHMVTELSFGPHYP---TLLNPLDKTIATTETHYYKYQYFLSVVPTLYSKGASA 282
Query: 299 LD----------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
LD ++L +PGIFF Y + P+++ I+E+
Sbjct: 283 LDTYTNHPDLIATNRNRNLVFTNQYAATTQATELPENPYFIPGIFFKYNIEPILLMISEE 342
Query: 331 SKSLGHLWTKIMCNISGTYIT 351
S L +++ +SG +T
Sbjct: 343 RTSFLSLLIRLVNTVSGVMVT 363
>gi|320591987|gb|EFX04426.1| copii-coated vesicle protein [Grosmannia clavigera kw1407]
Length = 385
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 151/368 (41%), Gaps = 67/368 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ DAF K + ++T GG T+ + L ++ ++ S V G
Sbjct: 26 VQAFDAFPKAKPQYVQRTAGGGKWTVAMIVVSLLLFWTELRRWWAGSQEHTFAVAKGVGH 85
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
+ I++DIVV + CD L ++ D++G++ L + + + +D G
Sbjct: 86 SMQINMDIVV-KMRCDDLHINVQDAAGDRIMAAAKLQRDATTWAQWVDHGGN------HR 138
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ + + +T E TT P++ G +G E + + V R +W
Sbjct: 139 LGRDTQGRMITGEGWTTL-----PHEEG--FGEE-------HVHDIVALGRRKARWG--- 181
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
T +L+ + C+I+G L++NRV G +HI A G Y + + D
Sbjct: 182 -----------KTPRLRGAAPDSCRIFGSLDLNRVQGDYHITARGHGY----MEMGDHLD 226
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE-RL 299
+TS FN +H + LSFG PLD TV +A F Y++ I+PT+Y
Sbjct: 227 HTS--FNFSHVVNELSFGPFYP---SLVNPLDQTVNEATANFYRFQYFMSIVPTVYSVGH 281
Query: 300 DGSKLGGG----------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
GS+ +PGIFF Y++ P+++ I E KI+
Sbjct: 282 AGSRSARSIVTNQYAVTEQSAEIDQRAIPGIFFKYDIEPILLYIEESRDGFLVFVLKIVN 341
Query: 344 NISGTYIT 351
+SG +
Sbjct: 342 VLSGALVA 349
>gi|392577310|gb|EIW70439.1| hypothetical protein TREMEDRAFT_43159 [Tremella mesenterica DSM
1558]
Length = 435
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 152/379 (40%), Gaps = 55/379 (14%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + ++ G +T + I L+ D+ +Y + VD
Sbjct: 33 IKSFDAFPKVQSTYTSQSRRGAVLTALVGFIIFLLVLNDLGEYLYGAPDYTFDVDQQLQK 92
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L +++D+ V + C +L++D D+ G++ LH+ +
Sbjct: 93 DLQLNVDLTV-AMPCHFLSIDLRDAVGDR-LHLS-----------------------DGF 127
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+ T G T P S + +R+ T R + P+ T
Sbjct: 128 TKEGTTFAVGKAVTSKTHPTPI-SASQVISSSRRRTPTQQRSFSGIRRLLSSRPKRRTRK 186
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ K N C+IYG +EV +V+ + HI ++ H ++ + A
Sbjct: 187 HAMFRPTPNKADNG--PACRIYGSVEVKKVTANLHIT-----TLGHGYM-SFEHTDHALM 238
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
N +H + SFG + PLD T+ ++ + Y+++++PT Y +G KL
Sbjct: 239 NLSHVVHEFSFGPFFPAIAQ---PLDMTMQVSDNPFTAIQYFLRVVPTTYIDANGRKLVT 295
Query: 307 GD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG---TYI 350
G+PGIFF Y+L + V + E++ SL H +++ I G T
Sbjct: 296 SQYAVTDYLRSFQHGQGVPGIFFKYDLEAMAVTVRERTTSLYHFVIRLIGVIVGGVWTVA 355
Query: 351 TFMLVDALLHSCVKKISKV 369
++ L +L+ K+ +KV
Sbjct: 356 SYAL--RVLNRAEKQFTKV 372
>gi|328352874|emb|CCA39272.1| Peroxisomal membrane protein PEX28 [Komagataella pastoris CBS 7435]
Length = 849
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/393 (24%), Positives = 154/393 (39%), Gaps = 84/393 (21%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + ++ DAF K ++ G TI+ FI +LI V++ Y + +
Sbjct: 517 MDYHRTIRVFDAFPKTEPVNTVRSTKGSYSTILMGFFILFLIWVEIGGYVDGYIDRQFML 576
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
D + L I+LD+ V T C+YL + D + ++ L E +L+ +G P
Sbjct: 577 DRNIQRVLNINLDMFVAT-PCNYLHTNVKDITQDRFLAQE------QLNFEGVNFFIPDS 629
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
VN E+ +T +L++ ++E+ AL
Sbjct: 630 FRVNG-------DESQGSTLDLDE----------------------VMRES------ALA 654
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEG----CQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
E + K +FT G C I+G + VN+V G FHI G Y
Sbjct: 655 EF-------------REKKSFTHGDAPACHIFGSIPVNKVHGFFHITGKGYGY------- 694
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
D A N TH I SFG + PLD T + FNYY+ ++PT
Sbjct: 695 RDRSIVPKEALNFTHVISEFSFG---EFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTE 751
Query: 296 YERL----DGSKLG------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
Y++L D ++ G PG+FF+Y+ P+++ I EK S +++
Sbjct: 752 YKKLGIVIDTTQYSMTVTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTIC 811
Query: 346 SGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
G M+V + V K+ +V G + +
Sbjct: 812 GG----IMVVAKWIFRTVDKLIRVVFGNQVANR 840
>gi|146163751|ref|XP_001012240.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila]
gi|146145943|gb|EAR91995.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila
SB210]
Length = 331
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/399 (23%), Positives = 145/399 (36%), Gaps = 107/399 (26%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++GLD F K +D T GG +I+ ++ L ++ DY ++ V
Sbjct: 1 MRGLDFFQKVNQDIDTSTATGGVYSIIAFVVGFILFWNELKDYRTDQMIYKMRVQQLEVE 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ ++D+ + C LALD D G L I K R+ DG
Sbjct: 61 SVKANIDLHIYGSPCTLLALDLQDEVGNHTLDYTDTIKKIRVLKDG-------------- 106
Query: 127 KKKKVTTENGTTTTELE------DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
TELE +PN GS E+ EA
Sbjct: 107 -------------TELESGFGDGNPNYRGSS--------------QEIDEA--------- 130
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
I NE EGC+I GY+ + +V G+FHI+ + + + +P
Sbjct: 131 ----IDAVNNE-----------EGCRINGYINLKKVPGNFHISYHAKMDVMN-RIASTKP 174
Query: 241 YTSAAFNTTHHIRHLSFG---------------IKLQDDDERRKPLDGTVAKAEEGASMF 285
T + N + I HL FG Q+ + P D T G + +
Sbjct: 175 DTYSKINLNYKINHLGFGENTNHMATIFKIMGRTLFQETNTNDYPHDDT-KYINPGKNDY 233
Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITE 329
+ Y+KI+P R D +KL +P IFF YE+SP+ V +
Sbjct: 234 DNYLKILPC---RYDSNKLHMSVSRYKYAMYSTHTPKSSTEIPTIFFRYEISPINVYYST 290
Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
KSKS H +I + G + + ++L + KISK
Sbjct: 291 KSKSFYHFLVQIFAIVGGIFAVMGIFNSLTTGVISKISK 329
>gi|392297516|gb|EIW08616.1| Erv41p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 352
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 147/354 (41%), Gaps = 66/354 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K E + +K+ GG +++ +LF+ ++ + +YF ++ VDS
Sbjct: 4 LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++DI V T CD+L ++ D + ++ L +E + P P VN +
Sbjct: 64 TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N T EL++ G A E R+ +T + E+ K LPE +
Sbjct: 117 --------NEIITPELDE--ILGEAIPA--EFREKLDTRSFFDES-DPNKAHLPEFN--- 160
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
GC I+G + VNRVSG I ++ S+ +V P F
Sbjct: 161 -----------------GCHIFGSIPVNRVSGELQI---IAKSLGYVASRK-APLEELKF 199
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DG 301
N H I SFG D PLD T +E + + YY ++PT++++L D
Sbjct: 200 N--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDT 254
Query: 302 SKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++ D MPGIFF Y PL + +++ S +++
Sbjct: 255 NQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308
>gi|358374656|dbj|GAA91246.1| COPII-coated vesicle protein [Aspergillus kawachii IFO 4308]
Length = 399
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 144/381 (37%), Gaps = 77/381 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + + GG T++ + + + + S V+ G
Sbjct: 24 LKTFDAFPKTKPSYTAPSRRGGQWTVLILVICTVFTFSEFRTWLNGSENHHFSVEKGVGH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LD+VV + CD L ++ D+SG++ L G +Q + +
Sbjct: 84 DLQLNLDLVV-RMPCDTLDVNIQDASGDR-------------ILAGDLLQRERTSWKLWM 129
Query: 127 KKKKVTTENGT---TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
K+ T G T ED ++ A + EV++ R K P L
Sbjct: 130 DKRNRETSGGVHEYQTLSQEDSDRIS----AREADAHVHHVLGEVRKNPRRKFAKGPRLR 185
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
+ + C+IYG LE N+V G FHI A G Y H+
Sbjct: 186 --------------RGDTVDSCRIYGSLEGNKVQGDFHITARGHGYRNFGEHL------D 225
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER---- 298
FN +H + LSFG PLD T+A E + Y++ ++PT+Y +
Sbjct: 226 HGVFNFSHMVTELSFGPHYP---TLLNPLDKTIATTETHYYKYQYFLSVVPTLYSKGASA 282
Query: 299 LD----------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
LD +L +PGIFF Y + P+++ I+E+
Sbjct: 283 LDTYTNHPDLIATNRNRNLVFTNQYAATTQAQELPENPYFIPGIFFKYNIEPILLMISEE 342
Query: 331 SKSLGHLWTKIMCNISGTYIT 351
S L +++ +SG +T
Sbjct: 343 RTSFLSLLIRLVNTVSGVMVT 363
>gi|297262047|ref|XP_001105686.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Macaca mulatta]
Length = 374
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 84/182 (46%), Gaps = 22/182 (12%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+I+G+L VN+V+G+FHI G + H H ++N +H I HLSFG +
Sbjct: 165 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVP 224
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
PLDGT A + MF Y+I ++PT + ER G
Sbjct: 225 ---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTHQFSVTERERIINHAAG 281
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
G+ GIF Y+LS LMV +TE+ + ++ + G + T +LH K I
Sbjct: 282 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST----TGMLHGIGKFIV 337
Query: 368 KV 369
++
Sbjct: 338 EI 339
>gi|322697212|gb|EFY88994.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium acridum
CQMa 102]
Length = 372
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/359 (23%), Positives = 146/359 (40%), Gaps = 57/359 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K ++ +T GG T+ + +L+ ++ +++ S + V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTEGGGKWTVAMAVVSIFLLWAEIARWWRGSESHTFAVEKGISH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE--PQKEVVN 124
+ I+LD V+ + C L ++ D++G++ L +L++D + QK V
Sbjct: 81 SMQINLDTVI-LMKCGDLHINVQDAAGDRIL------AGAKLNMDETSWSQWVNQKGVHK 133
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ + G L+D +G E + + V R KWA
Sbjct: 134 LGRDSEGRVVTGAGWQNLDDEG-----FGEE-------HVHDIVALGQRRAKWA------ 175
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
T ++K + C+IYG L++N+V G FHI A G Y H+ Q
Sbjct: 176 --------KTPRVKGP-PDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHLDHSQ---- 222
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
FN +H I LSFG PLD T+ AE F YY+ ++PT Y S
Sbjct: 223 --FNFSHIISELSFGSYYP---SLVNPLDRTINIAENHFHKFQYYVSVVPTRYSVGSSSI 277
Query: 304 L-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
G + +PGIF Y++ P+++ + E + K++ +SG +
Sbjct: 278 FTNQYAVTEQSKGVSEYNVPGIFVKYDIEPILLSVNEDRDGILMFVVKLINVLSGVLVA 336
>gi|154343635|ref|XP_001567763.1| hypothetical protein, unknown function [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065095|emb|CAM43209.1| hypothetical protein, unknown function [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 309
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 86/184 (46%), Gaps = 22/184 (11%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---I 259
EGC++ GY++V +V G+FHI+ +H H + + N H I HLSFG +
Sbjct: 131 EGCRLEGYIKVGKVPGNFHIS-------SHGRQHLLMTHFPNGTNAEHSIHHLSFGTLDV 183
Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE---------RLDGSKLGGGDGG 310
K D + PLDG ++E ++ Y++ I+PTIYE + G+
Sbjct: 184 KKLDKKAQLHPLDGKEHRSEV-PKIYQYFLDIVPTIYESSFSTAHTYQFTGTSSSSPVPS 242
Query: 311 --MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
M + F Y++SP+ V+ + SL H T + I G Y L+ +HS + +
Sbjct: 243 SQMAAVVFQYQMSPITVRYSSARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQFQR 302
Query: 369 VEIG 372
+G
Sbjct: 303 RILG 306
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 46/109 (42%), Gaps = 2/109 (1%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
D F +D E T G ++I C + L +V Y ++ + D S +
Sbjct: 11 DFFRHIPKDLTESTTSGAIISIACVTVMVLLFVGEVISYVSPRIQSDMIILPDLDETSTI 70
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
+ +DI P + C L LD +D + +I + RLD GKPI +
Sbjct: 71 KVSMDITFPKMPCAILTLDILDVLHNHMFNSMDHITRTRLDPAGKPISD 119
>gi|70988875|ref|XP_749289.1| COPII-coated vesicle protein (Erv41) [Aspergillus fumigatus Af293]
gi|66846920|gb|EAL87251.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
fumigatus Af293]
gi|159128703|gb|EDP53817.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
fumigatus A1163]
Length = 379
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 154/371 (41%), Gaps = 71/371 (19%)
Query: 12 AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIH 71
A TKP + + GG T++ L ++L + + + + + V+ L ++
Sbjct: 13 AKTKP--SYTAPSPRGGQWTVLVLLVCTFLSISEFRTWLKGTEKQHFSVEKGISHDLQLN 70
Query: 72 LDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKV 131
LDIVV +SCD L ++ D+SG++ L + + KR EP + K+
Sbjct: 71 LDIVV-HMSCDMLDVNIQDASGDRILAGQ--LLKR----------EPTSWQLWMDKRNYE 117
Query: 132 TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNE 191
T G + +T +++ + +E +EA + L E+ + K
Sbjct: 118 T---------------YGGAHEYQTLSQEHADRLSE-QEADAHVHHVLGEVRRNPRKKFA 161
Query: 192 YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTH 250
+ + + C+IYG LE N+V G FHI A G Y N H+ FN +H
Sbjct: 162 KGPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGYHNNAPHLEH------KTFNFSH 215
Query: 251 HIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LD------ 300
I LSFG PLD T+A E+ + Y++ I+PTIY + LD
Sbjct: 216 MITELSFGPHY---PTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNLALDTYANAP 272
Query: 301 --------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
S + +PG+FF Y + P+++ I+E+ S L +
Sbjct: 273 PSNRRGKNLVFTNQYAVTSQSSVIPESPYFIPGLFFKYNIEPILLLISEERTSFLSLLVR 332
Query: 341 IMCNISGTYIT 351
++ +SG +T
Sbjct: 333 LVNTVSGVMVT 343
>gi|74189495|dbj|BAE22750.1| unnamed protein product [Mus musculus]
Length = 303
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 84/182 (46%), Gaps = 22/182 (12%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+I+G+L VN+V+G+FHI G + H H ++N +H I HLSFG +
Sbjct: 94 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 153
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
PLDGT A + MF Y+I ++PT + ER G
Sbjct: 154 G---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTHQFSVTERERIINHAAG 210
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
G+ GIF Y+LS LMV +TE+ + ++ I G + T +LH K I
Sbjct: 211 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST----TGMLHGIGKFIV 266
Query: 368 KV 369
++
Sbjct: 267 EI 268
>gi|325184531|emb|CCA19024.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 466
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 86/190 (45%), Gaps = 30/190 (15%)
Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
N EGCQ+YG+L V RV G+FHI H+ H S+ N +H + L FG
Sbjct: 284 NAGPEGCQLYGHLIVKRVPGNFHI---------HLS-HPFYSMNSSLVNASHTVNELWFG 333
Query: 259 IKLQDDDERRKP----LDG-TVAKAEEGASMFNY----YIKIIPTIYERLDGSKLGG--- 306
L + P LD +A+ E A M NY YIK++ Y + +G +
Sbjct: 334 EVLSASALAKLPPNTRLDSHRLARQEFTAYMQNYTYVHYIKVVTNTYVQRNGEVISAYRY 393
Query: 307 --------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDAL 358
+P + F Y+LSP+ V+ITE+S H T I G + ++D L
Sbjct: 394 TAHSNEYLETEDLPSVMFRYDLSPMSVRITERSMPFYHFVTSACAIIGGVFTVIGIIDQL 453
Query: 359 LHSCVKKISK 368
+H V+ ++K
Sbjct: 454 VHQTVRAMNK 463
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 58/118 (49%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK D + K ED T+ G +++IV + L ++ Y V+ ++ +D
Sbjct: 7 LKKWDFYKKIPEDLTVSTLPGVSLSIVGCFIMLILFILEFNAYLSVNHAYDIVIDEGLDE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
K I+ +I +P + C++ ++D D +G + ++ N+ K R+D G+ + EV +
Sbjct: 67 KFEINFNITIPDLPCEFASIDVSDMTGTRKHNMTKNVSKFRIDTKGRLVGFASDEVTH 124
>gi|346322712|gb|EGX92310.1| COPII-coated vesicle protein (Erv41), putative [Cordyceps militaris
CM01]
Length = 376
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 149/359 (41%), Gaps = 54/359 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISY-LICVDVCDYFQVSTTEELFVDSSRG 65
+ DAF K ++ +T GG T+V +FIS L+ +V +++ S T V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTAGGGKWTVVI-VFISLVLMGSEVGRWWRGSETHNFAVEKGIS 79
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ I+LDIVV + C+ L ++ D+SG++ L ++ R + + + +
Sbjct: 80 HDMQINLDIVVHML-CNDLHINVQDASGDRILAA--SMLHRDPTMWSHWVDQAGVHKLGH 136
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+V T G T+ D +G E + + V + KW+
Sbjct: 137 DANGRVNTGEGWTSLAHNDEG-----FGEE-------HVHDIVALGKKRAKWS------- 177
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
T + T + C++YG L++N+V G FHI A G Y H+ Q
Sbjct: 178 -------KTPRFWGT-ADSCRVYGSLDLNKVQGDFHITARGHGYMEFGQHLDHNQ----- 224
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------- 297
FN +H I LS+G PLD TV A F YY+ ++PTIY
Sbjct: 225 -FNFSHVISELSYGAFYP---SLVNPLDRTVNLAAAHFHKFQYYLSVVPTIYSVGSSTIQ 280
Query: 298 -----RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ SK +PGIF Y++ P+++ + E S K++ +SG +
Sbjct: 281 TNQYAVTEQSKEIDEHSAVPGIFVKYDIEPILLAVHESRDSFPVFLLKLINIVSGVLVA 339
>gi|296821254|ref|XP_002850059.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
gi|238837613|gb|EEQ27275.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
Length = 399
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 85/388 (21%), Positives = 154/388 (39%), Gaps = 83/388 (21%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
+ +LK DAF K + + GG T+ + + L C ++ +++ V+
Sbjct: 21 AAKLKTFDAFPKTKPSYTSTSRSGGLWTVFIAILCAILSCSELVTWYRGHENHHFSVERG 80
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
++ ++LD+VV + CD + ++ D+ G+ H+ L G+ + Q+
Sbjct: 81 VSQEMQLNLDVVV-AMPCDDVRINVQDAVGD---HI----------LAGELLT--QQPTS 124
Query: 124 NAVKKKKVTTENGTTTTEL-----EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
A ++ + G + E EDP + + E + EV+ + K
Sbjct: 125 WAAWNREFNRQRGGGSPEYQTLSKEDPFRLEE----QEEDLHVEHVLGEVRRGRKKKFPK 180
Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHD 237
P+L K+ + C+++G LE N+V G+ HI A G Y +
Sbjct: 181 APKLK--------------KSDAVDSCRVFGSLEGNKVQGNLHITARGFGY------LEW 220
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
QP + N TH I LSFG PLD TV+ + Y++ ++PTIY
Sbjct: 221 GQPTNPHSLNFTHLITELSFGPHYA---RLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYT 277
Query: 298 R-----------LDGSKLGGGDG-----------------------GMPGIFFSYELSPL 323
+ D S + D +PGIFF Y + P+
Sbjct: 278 KSGHIDPNHRSLPDPSSITAKDSKTTVSTNQYAVTSYSQPVQPRIESIPGIFFKYNIEPI 337
Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT 351
++ ++++ SL L +++ +SG +T
Sbjct: 338 LLIVSQERDSLLALLVRLVNVVSGVLVT 365
>gi|332373256|gb|AEE61769.1| unknown [Dendroctonus ponderosae]
Length = 382
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 84/381 (22%), Positives = 152/381 (39%), Gaps = 64/381 (16%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
R+K +D F K + + + GG +I+ +L I +L+ ++ Y + D
Sbjct: 17 NRVKKMDIFPKVEDPYKMTSSVGGTFSIISFLIIGWLVYSEISYYLNSKFVFKFSPDVQL 76
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
KL +++DI V + C L D +DS+ + N YK
Sbjct: 77 EDKLDMNIDITV-AMPCSKLGTDVLDSTNQ-------NTYKFG----------------- 111
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
T + T EL D K E +K N+ ++E Y K L +
Sbjct: 112 -------TLKQDDTWFELSDNQK------VHFEHKKHFNSY--LREEYHAIKDLLWKNSF 156
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
Q + + + + C+IYG L +N+V+G+F I+ G Y + +
Sbjct: 157 STQFGDLPPRDHTPSRPHDACRIYGTLGLNKVAGNFLISGGKRYMFGLGYQQFRTLISEG 216
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------- 294
+N TH I SFG PL+G + ++ NY+I+I+PT
Sbjct: 217 EYNFTHRINRFSFG---HSSPGIVHPLEGDELILPDPMTVVNYFIEIVPTTVNTFMYTIS 273
Query: 295 --------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
+ +D +K G G P I+F Y++S L V ++++ LG ++ +
Sbjct: 274 TYQYSVKELTRPIDHNK---GSHGTPAIYFKYDMSALRVTVSQERDHLGMFLARLCSIVG 330
Query: 347 GTYITFMLVDALLHSCVKKIS 367
G Y+ ++++++ + I+
Sbjct: 331 GVYVCSGILNSIVQLLLNFIT 351
>gi|242783317|ref|XP_002480163.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
stipitatus ATCC 10500]
gi|218720310|gb|EED19729.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
stipitatus ATCC 10500]
Length = 400
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/400 (22%), Positives = 162/400 (40%), Gaps = 73/400 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K ++ + GG T++ ++L ++ +++ + + V+
Sbjct: 24 LKTFDAFPKTKPNYTTPSRRGGQWTVIIIAICTFLSIGELITWYRGTENQHFSVEKGVSR 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L +++D+VV + C+ + ++ D+SG+ H+ + + + + E + + V
Sbjct: 84 QLQMNIDMVV-KMPCNDIRVNVQDASGD---HIMAGMLLMKDSTNWEMWNEKLNQQSSGV 139
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + T N T L + + + + TR R + P+
Sbjct: 140 TEYQ--TLNAEDTKRLLEQEEDMHAHHVLSHTR-------------RNPRRKFPK----- 179
Query: 187 QCKNEYSTEKLKNTF-TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
T +L + T+ C+IYG LE N+V G FHI A G Y+ H+
Sbjct: 180 -------TPRLSAKYPTDSCRIYGSLESNKVHGDFHITARGHGYNELGEHL------DHK 226
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
FN TH I LSFG PLD TVA E+ F Y++ ++PTIY + + +
Sbjct: 227 TFNFTHMITELSFGPHYPS---LLNPLDKTVAYTEDHYYKFQYFLNVVPTIYAKGNNAVE 283
Query: 303 -----------------------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
L PGIFF Y + P+++ ++E+ S
Sbjct: 284 KYTANPALAFKKSRNTIFTNQYSATSQSHALPENPYNTPGIFFKYNIEPILLFVSEERGS 343
Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
L +++ +SG +T + L ++ + + GG
Sbjct: 344 FLALLVRLVNVVSGVIVTGGWLYQLSGWAMEVLRRRRRGG 383
>gi|260800124|ref|XP_002594986.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
gi|229280225|gb|EEN50997.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
Length = 292
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 90/199 (45%), Gaps = 32/199 (16%)
Query: 194 TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIR 253
TEK+ GC+ G +N+V G+FH++ H +QP A+ + TH +
Sbjct: 103 TEKVPVNNGLGCRFEGRFWINKVPGNFHMS---------THSAHVQP---ASPDMTHVVH 150
Query: 254 HLSFGIKLQD--DDERR---KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK----- 303
L FG L D + PLD S +Y++KI+PTI+E K
Sbjct: 151 DLRFGEDLAAFLPDHIKGSFNPLDEVERLHANALSSHDYFLKIVPTIFENRSDKKSFAFQ 210
Query: 304 ----------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
G G+ MP I+F Y+LSP+ VK T+K K H T I + GT+
Sbjct: 211 YTYAYKDYISFGHGNRVMPAIWFRYDLSPITVKYTDKRKPFYHFITTICAVVGGTFTVAG 270
Query: 354 LVDALLHSCVKKISKVEIG 372
++D+++ + + K E+G
Sbjct: 271 IIDSVIFTAAEVFKKAELG 289
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 2/93 (2%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--R 64
+K D + K +D + T+ G V+I+ +FI +L+ + + ELFVD+S
Sbjct: 5 VKRFDIYRKIPKDLTQPTLTGALVSILSGMFIVFLLLSEFHAFIMSDIMSELFVDNSGGG 64
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
G ++ + L+I +P + C+ + LD D G +
Sbjct: 65 GGQISVFLNISLPRLKCEVVGLDIQDEMGRHEV 97
>gi|254579156|ref|XP_002495564.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
gi|238938454|emb|CAR26631.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
Length = 353
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 150/381 (39%), Gaps = 80/381 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K E +K+ GG +I +LF+ ++ + YF E VD
Sbjct: 4 LRSFDAFPKTDETHVKKSSNGGLSSIFTYLFLLFIAWTEFGSYFGGYVDEHYEVDDQLRE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
I++D+ V T C YL ++ D++ ++ + L+L+ P P VN +
Sbjct: 64 TFQINMDLYVKT-PCQYLDINVRDTT------MDRKFVSKELNLEDMPFFIPYGSRVNDM 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N T +L++ N + +R K +DT
Sbjct: 117 --------NEIVTPDLDN------------------VLSNAIPAQFREK------IDT-- 142
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
N E+ ++ F C I+G ++VNRV+G I A G YS
Sbjct: 143 ---NNMFDEEERDAFN-SCHIFGSVQVNRVAGELQITAKGHGYS-------SFMRAPPEE 191
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLDGSKL 304
+ +H I LS+G D PLD T + + F Y I+PTIYE+L G+K+
Sbjct: 192 IDFSHVINELSYGEFYPYID---NPLDSTAKFVPDAPRTTFVYDTAIVPTIYEKL-GAKI 247
Query: 305 ------------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
G G PGIF Y+ PL + I++ S +++ +S
Sbjct: 248 DTNQYAVSEYHINPEAQQGKGPIRFPGIFLRYDFEPLSIHISDVRLSFIQFVVRLVAILS 307
Query: 347 GTYIT----FMLVDALLHSCV 363
T F L+D +L +C+
Sbjct: 308 FVIYTASWAFRLIDLVLLTCL 328
>gi|340507573|gb|EGR33515.1| hypothetical protein IMG5_050820 [Ichthyophthirius multifiliis]
Length = 290
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/354 (23%), Positives = 140/354 (39%), Gaps = 102/354 (28%)
Query: 55 TEELFVDSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
T E+FVDS RG K+ ++LDI P CD L+LD D G ++VE +++K R+ G+
Sbjct: 3 TSEMFVDSLRGGQKIRVNLDIDFPKFPCDILSLDFQDIMGSHSVNVEGDLHKTRITKTGE 62
Query: 114 PIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR 173
+++ NK S + + N+V
Sbjct: 63 YFDRHEQQ-----------------------QNKQHSGHAHDQ--------SNQVD---- 87
Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-------PGL 226
L + +Q K EGC++ G++ VNRV G+FHI+ G
Sbjct: 88 -----LQRIQQAIQNK-------------EGCKLSGFMYVNRVPGNFHISCHAFGQILGY 129
Query: 227 SYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR-----------KPLDGTV 275
+ I ++ D+ +H I HLSFG D+DE + P+D V
Sbjct: 130 VFRITGINTIDL----------SHKINHLSFG----DEDEIKIVKKQFTLGVLNPMDKLV 175
Query: 276 AKAEE-----GASMFNYYIKIIPTIY----------ERLDGSKLGGGDGGMPGIFFSYEL 320
++ G S +NYY+ ++PT Y + ++ +P I+F Y+L
Sbjct: 176 KTKQKHFENYGIS-YNYYLNVVPTTYIDEWGYTYYVNQFVFTENQIQTDYIPAIYFRYDL 234
Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
SP+ V + H ++ + G + +D + V ++ K G K
Sbjct: 235 SPVTVMFKKDRMPFLHFLVQVSAIVGGIFTIAAFMDEIAFKIVIQLFKNSEGEK 288
>gi|57208595|emb|CAI42844.1| ERGIC and golgi 3 [Homo sapiens]
Length = 156
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 72/159 (45%), Gaps = 51/159 (32%)
Query: 250 HHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS------- 302
H+I+HLSFG +D PLD T A + + MF Y++K++PT+Y ++DG
Sbjct: 1 HYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAQQERGR 57
Query: 303 KLGGGDGG-----------------------------------------MPGIFFSYELS 321
GG DGG +PG+F YELS
Sbjct: 58 SRGGADGGWSQVLALALAQAPLPPQVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELS 117
Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH 360
P+MVK+TEK +S H T + I G + L+D+L++
Sbjct: 118 PMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIY 156
>gi|403216157|emb|CCK70655.1| hypothetical protein KNAG_0E04020 [Kazachstania naganishii CBS
8797]
Length = 351
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/365 (23%), Positives = 143/365 (39%), Gaps = 72/365 (19%)
Query: 18 EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
E + +K+ GG +I+ +LF+ ++ + YF ++ VDS + ++LD+ V
Sbjct: 10 EQYKQKSSKGGLTSILTYLFLIFIAYSEFGSYFGGYLDQQYIVDSELREDVELNLDVFV- 68
Query: 78 TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
+ CD++ ++ DS+ + I L + P P VN + + +T E
Sbjct: 69 HMPCDFIHVNVRDST------FDRKIVSEELKFEDMPFFIPYDTKVNDIPEI-ITPEMDE 121
Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
E + + + + R + + + LPE +
Sbjct: 122 ILGE-----AIPASFREKVDMRLYYDENDPDTHHH------LPEFN-------------- 156
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLS 256
GC I+G + VNRV G F I A GL Y D+ N H I S
Sbjct: 157 ------GCHIFGSIPVNRVRGEFQITAKGLGY-------RDMNAAPKEKINFAHVINEWS 203
Query: 257 FGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGS-----------KL 304
FG D PLD T ++ + F YY+ ++PTIY++L +
Sbjct: 204 FGDFYPYID---NPLDATAKFDKDDPLTAFVYYLSVVPTIYQKLGAEVDTNQYSVSEYRF 260
Query: 305 GGGD------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS-GTYIT---FML 354
D G +PGIFF Y L + +T++ S +++ +S YI F+L
Sbjct: 261 NSTDKTFRDTGYVPGIFFRYNFESLSIVMTDRRLSFLQFIVRLVAIMSFAVYIASWIFIL 320
Query: 355 VDALL 359
D LL
Sbjct: 321 TDTLL 325
>gi|151946097|gb|EDN64328.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
gi|190408176|gb|EDV11441.1| hypothetical protein SCRG_01831 [Saccharomyces cerevisiae RM11-1a]
gi|259148509|emb|CAY81754.1| Erv41p [Saccharomyces cerevisiae EC1118]
Length = 352
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 142/355 (40%), Gaps = 68/355 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K E + +K+ GG +++ +LF+ ++ + +YF ++ VDS
Sbjct: 4 LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++DI V T CD+L ++ D + ++ L +E + P P VN +
Sbjct: 64 TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N T EL++ G AE + + + + R LPE +
Sbjct: 117 --------NEIITPELDE--ILGEAIPAEFREKLDTRSFFDESDPNRAH---LPEFN--- 160
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
GC I+G + VNRVSG I A L Y + P
Sbjct: 161 -----------------GCHIFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELK 198
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
FN H I SFG D PLD T +E + + YY ++PT++++L D
Sbjct: 199 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 253
Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++ D MPGIFF Y PL + +++ S +++
Sbjct: 254 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308
>gi|6323573|ref|NP_013644.1| Erv41p [Saccharomyces cerevisiae S288c]
gi|2497084|sp|Q04651.1|ERV41_YEAST RecName: Full=ER-derived vesicles protein ERV41
gi|558408|emb|CAA86254.1| unnamed protein product [Saccharomyces cerevisiae]
gi|285813935|tpg|DAA09830.1| TPA: Erv41p [Saccharomyces cerevisiae S288c]
Length = 352
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 145/355 (40%), Gaps = 68/355 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K E + +K+ GG +++ +LF+ ++ + +YF ++ VDS
Sbjct: 4 LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++DI V T CD+L ++ D + ++ L +E + P P VN +
Sbjct: 64 TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N T EL++ G A E R+ +T + E+ K LPE +
Sbjct: 117 --------NEIITPELDE--ILGEAIPA--EFREKLDTRSFFDES-DPNKAHLPEFN--- 160
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
GC ++G + VNRVSG I A L Y + P
Sbjct: 161 -----------------GCHVFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELK 198
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
FN H I SFG D PLD T +E + + YY ++PT++++L D
Sbjct: 199 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 253
Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++ D MPGIFF Y PL + +++ S +++
Sbjct: 254 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308
>gi|349580221|dbj|GAA25381.1| K7_Erv41p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 352
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 145/355 (40%), Gaps = 68/355 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K E + +K+ GG +++ +LF+ ++ + +YF ++ VDS
Sbjct: 4 LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++DI V T CD+L ++ D + ++ L +E + P P VN +
Sbjct: 64 TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N T EL++ G A E R+ +T + E+ K LPE +
Sbjct: 117 --------NEIITPELDE--ILGEAIPA--EFREKLDTRSFFDES-DPNKAHLPEFN--- 160
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
GC ++G + VNRVSG I A L Y + P
Sbjct: 161 -----------------GCHVFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELK 198
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
FN H I SFG D PLD T +E + + YY ++PT++++L D
Sbjct: 199 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 253
Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++ D MPGIFF Y PL + +++ S +++
Sbjct: 254 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308
>gi|121710902|ref|XP_001273067.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
clavatus NRRL 1]
gi|119401217|gb|EAW11641.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
clavatus NRRL 1]
Length = 401
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/382 (22%), Positives = 148/382 (38%), Gaps = 76/382 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + + GG T++ L ++ + + + + V+
Sbjct: 24 LKIFDAFPKTKPSYTAPSHRGGQWTVLILLICTFFSLSEFRAWLRGTEKHHFSVEKGISH 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LDIVV + C+ L ++ D+SG++ L G+ +Q + +
Sbjct: 84 DLQLNLDIVV-DMPCESLDVNIQDASGDR-------------ILAGELLQRERTSWNLWM 129
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K+ G + + + G + + + EV+ R K P L
Sbjct: 130 EKRNYEIHGGAHEYQTLN-QEHGDRLAEQEQDAHVHHVLGEVRRNPRKKFPRGPRLR--- 185
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS-A 244
+ + C+IYG LE N+V G FHI A G Y H P+ +
Sbjct: 186 -----------RGDVVDSCRIYGSLEGNKVQGDFHITARGHGY-------HAAAPHLEHS 227
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LD 300
FN +H + LSFG PLD T+A EE + Y++ ++PTIY + LD
Sbjct: 228 TFNFSHMVTELSFGPHYPTI---LNPLDKTIATTEEHYYKYQYFLSVVPTIYSKGNLALD 284
Query: 301 G-------------------------------SKLGGGDGGMPGIFFSYELSPLMVKITE 329
+ L +PGIFF Y + P+++ I+E
Sbjct: 285 AYSGSAPTLHDPNRNRNRNLIFTNQYAATSQSTALPESPYFVPGIFFKYSIEPILLIISE 344
Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
+ S L +++ +SG +T
Sbjct: 345 ERGSFLTLLVRLVNTVSGVIVT 366
>gi|66773206|ref|NP_080631.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
isoform 2 [Mus musculus]
gi|12854944|dbj|BAB30175.1| unnamed protein product [Mus musculus]
Length = 302
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 44/288 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 73 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPT 256
>gi|302422316|ref|XP_003008988.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
gi|261352134|gb|EEY14562.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
Length = 374
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 155/392 (39%), Gaps = 77/392 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + ++T GG T+ IS ++ + E + S R S
Sbjct: 20 VSAFDAFPKSKPQYVQRTSGGGKWTVAM-AVISVMLFWPELGRGGRGSREPTRLRSRRAS 78
Query: 67 K--LPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPIQEPQ 119
L ++LDIVV + C+ L ++ D+SG+ L E + + D+ G
Sbjct: 79 ATTLQVNLDIVV-KMRCEDLHINVQDASGDLILAATKLREEITSWHQWADITGN------ 131
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ ++ T +G E +G E + + V ++ + +KWA
Sbjct: 132 -HKLGRSPSGRIETNSGYHLDE---------GFGEE-------HVHDIVAQSKKRQKWA- 173
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
T +L+ + C+I+G L++N+V G FHI A G Y H+
Sbjct: 174 -------------RTPRLRGP-PDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL--- 216
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
+FN +H + LSFG + + PLD TV A F YY+ I+PT+Y
Sbjct: 217 ---DHTSFNFSHIVNELSFGAFYPNLE---NPLDRTVNLASANFHKFQYYLSIVPTVYTV 270
Query: 299 LDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
+ GD +PG+F Y++ P+++ + E W K++
Sbjct: 271 GRSASKANTVYTNQFAVTEQSKEVGDHSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVI 330
Query: 343 CNISGTYIT----FMLVDALLHSCVKKISKVE 370
+SG + F L + + KK + +
Sbjct: 331 NVLSGVLVAGHWGFTLSEWFKENWAKKKERTQ 362
>gi|148678795|gb|EDL10742.1| ERGIC and golgi 2, isoform CRA_b [Mus musculus]
Length = 310
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 44/288 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 21 VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + Y+ L D P Q + ++ +
Sbjct: 81 KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 138
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 139 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 165
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI G + H H ++
Sbjct: 166 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 219
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
N +H I HLSFG + PLDGT A + MF Y+I ++PT
Sbjct: 220 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPT 264
>gi|390594538|gb|EIN03948.1| DUF1692-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 551
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/366 (22%), Positives = 146/366 (39%), Gaps = 78/366 (21%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E LK DAF K + ++ G T + +L+ D+ ++ E VD+
Sbjct: 21 ESLKHFDAFPKLPASYKARSESRGLFTALVAFIAFFLVLNDLGEFIWGWPDYEFSVDNEA 80
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
S + I++D+VV + C YL++D D+ G+ RL L +
Sbjct: 81 RSHMNINVDMVV-KMPCQYLSVDLRDAVGD------------RLYLS------------S 115
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
A ++ + G T E + A+ RK + + + D
Sbjct: 116 AFRRDGTLFDIGQATALKE--------HAAQLSARKAVAQSRQSRGLF----------DV 157
Query: 185 IVQCKNEYSTEKLKNTFTE-----GCQIYGYLEVNRVSGSFHIA-PGLSY-SINHVHVHD 237
+++ S + K T+ C+IYG L+V +V+ + HI G Y S+ HV HD
Sbjct: 158 LLR----RSGQGYKPTYNHQPDGGACRIYGTLQVKKVTANLHITTAGHGYASVQHV-PHD 212
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
N +H I SFG D + PLD + + + Y++ ++PT Y
Sbjct: 213 -------QMNLSHVITEFSFGPYFPDITQ---PLDDSFEITTDPFIAYQYFLHVVPTTYV 262
Query: 298 RLDGSKLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
S L G PGIFF +EL PL + + +++ +L L+ +++
Sbjct: 263 APRSSPLKTAQYSVTHYTRVLEHGRGTPGIFFKFELDPLSITVNQRTTTLAQLFIRVIGV 322
Query: 345 ISGTYI 350
+ G ++
Sbjct: 323 VGGIFV 328
>gi|255726548|ref|XP_002548200.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240134124|gb|EER33679.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 355
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/399 (23%), Positives = 158/399 (39%), Gaps = 92/399 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F+ +++ DAF K + ++ GG T+V ++F ++ +++ Y + VD+
Sbjct: 4 FTNKVRTFDAFPKVDPNQQVRSQRGGFSTLVTYMFGLLILWIEIGGYIGGYVDRQFTVDN 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I+LD++V + C++L + D + +++L E L+ +G P
Sbjct: 64 QIRSDLTINLDMIV-GMPCEFLHTNVEDITRDRYLAGE------TLNFEGIHFIVPPSFR 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+N +PN P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129
Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSY-SINHVHVHD 237
D I+Q + E+ ++ + N C I+G + V +V G F I A G Y +HV +
Sbjct: 130 DEIMQESLRAEFRSQGARVNEGAPACHIFGSIPVTQVRGDFRITAKGFGYRDRSHVPIE- 188
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
AFN +H I+ SFG + PLD T EE + YY K++PT+YE
Sbjct: 189 -------AFNFSHVIQEFSFG---EFYPFINNPLDATGKITEEKLQTYLYYAKVVPTMYE 238
Query: 298 RL----DGSKLGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
+L D ++ + G+PGI+F Y+ P+ + I EK
Sbjct: 239 QLGLEIDTNQYSLTESQHVIQVDEQTKRPNGIPGIYFRYDFEPIKLVIREKRIPFFQFIA 298
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
K + I G M+ L +K+ + G K V K
Sbjct: 299 K-LGTIGG---GIMIAAGYLFKLYEKLLLILYGKKYVDK 333
>gi|401427507|ref|XP_003878237.1| hypothetical protein, unknown function [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494484|emb|CBZ29786.1| hypothetical protein, unknown function [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 309
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)
Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
+ EGC++ GY++V +V G+FHI+ +H H + + N H I HLSFG
Sbjct: 128 SVAEGCRLEGYIKVGKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180
Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGGMP 312
+K PLDG ++E ++ Y++ I+PTIYE + + G P
Sbjct: 181 TDVKKLAKKAALHPLDGKEHRSEV-PMVYQYFLDIVPTIYESSFSTVHTYQFTGTSSSTP 239
Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
+ F Y+LSP+ V+ + SL H T + I G Y L+ +HS +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299
Query: 366 ISKVEIG 372
+ +G
Sbjct: 300 FQRRVLG 306
Score = 45.1 bits (105), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 2/109 (1%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
D F D E T G ++I C + ++ L +V Y ++ + D + +
Sbjct: 11 DFFRHIPRDLTESTTAGSIISIACVVLMALLFAGEVISYVFPRIQSDMIIMPDLDDQNTI 70
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
+ +D+ P + C L LD +D + +I + RLD G+PI +
Sbjct: 71 KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119
>gi|398021306|ref|XP_003863816.1| hypothetical protein, unknown function [Leishmania donovani]
gi|322502049|emb|CBZ37133.1| hypothetical protein, unknown function [Leishmania donovani]
Length = 309
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)
Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
+ EGC++ GY++V +V G+FHI+ +H H + + N H I HLSFG
Sbjct: 128 SVAEGCRLEGYIKVAKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180
Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGGMP 312
+K PLDG ++E ++ Y++ I+PTIYE + + G P
Sbjct: 181 IDVKKLAKKAALHPLDGKEHRSEV-PMVYQYFLDIVPTIYESSFSTVHTYQFTGTSSSTP 239
Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
+ F Y+LSP+ V+ + SL H T + I G Y L+ +HS +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299
Query: 366 ISKVEIG 372
+ +G
Sbjct: 300 FQRRVLG 306
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 2/109 (1%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
D F D E T G +++ C + + L +V Y ++ + D + +
Sbjct: 11 DFFRHIPRDLTEPTTAGSIISVACVVVMVLLFAGEVISYVFPRIQSDMIIMPDLDDRNTI 70
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
+ +D+ P + C L LD +D + +I + RLD G+PI +
Sbjct: 71 KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119
>gi|146097219|ref|XP_001468078.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
gi|134072444|emb|CAM71154.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
Length = 309
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)
Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
+ EGC++ GY++V +V G+FHI+ +H H + + N H I HLSFG
Sbjct: 128 SVAEGCRLEGYIKVAKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180
Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGGMP 312
+K PLDG ++E ++ Y++ I+PTIYE + + G P
Sbjct: 181 IDVKKLAKKAALHPLDGKEHRSEV-PMVYQYFLDIVPTIYESSFSTVHTYQFTGTSSSTP 239
Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
+ F Y+LSP+ V+ + SL H T + I G Y L+ +HS +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299
Query: 366 ISKVEIG 372
+ +G
Sbjct: 300 FQRRVLG 306
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 2/109 (1%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
D F D E T G +++ C + + L +V Y ++ + D + +
Sbjct: 11 DFFRHIPRDLTEPTTAGSIISVACVVVMVLLFAGEVISYVFPRIQSDMIIMPDLDDRNTI 70
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
+ +D+ P + C L LD +D + +I + RLD G+PI +
Sbjct: 71 KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119
>gi|294655234|ref|XP_457337.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
gi|199429792|emb|CAG85341.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
Length = 354
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 162/401 (40%), Gaps = 95/401 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGG---AVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
F+ +++ DAF K + ++ GG VTIVC L I + V++ + +
Sbjct: 4 FTTKVRTFDAFPKVDAEHTVRSSRGGFSTLVTIVCGLLI---LWVEIGGFLGGYVDHQFT 60
Query: 60 VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
+D S L +++D++V + C++L + +D + ++ L E L+ +G PQ
Sbjct: 61 IDDKVKSDLSLNIDMLV-AMPCEFLHTNVMDITDDRFLAGE------LLNFEGTNFFLPQ 113
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+N+ NT ++
Sbjct: 114 HFEINS------------------------------------KNTDHDT----------- 126
Query: 180 PELDTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
P+LD ++Q + E+ + N C I+G + VN+V G FHI G Y+
Sbjct: 127 PDLDHVMQETLRAEFRVAGARVNEGAPACHIFGSIPVNQVKGDFHITGKGFGYNDGR--- 183
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
+ P+ A N TH I S+G + PLD T E+ + YY K++PTI
Sbjct: 184 -SVVPF--EALNFTHVISEFSYGDFYPFIN---NPLDFTGKVTEQKLQAYKYYSKVVPTI 237
Query: 296 YERL----DGSKLGGGDG-------------GMPGIFFSYELSPLMVKITEKSKSLGHLW 338
YE+L D ++ + G+PGIFF YE P+ + I+EK
Sbjct: 238 YEKLGMIIDTNQYSLTEQHNVYKVNRFNNVEGIPGIFFKYEFEPIKLIISEKRIPFIQFV 297
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTKR 379
+++ I G ++V L+ +K V + GK T+R
Sbjct: 298 SRLATIIGG----LLIVAGYLYRLYEKFLTV-LFGKRYTER 333
>gi|313247758|emb|CBY15879.1| unnamed protein product [Oikopleura dioica]
Length = 285
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 93/201 (46%), Gaps = 30/201 (14%)
Query: 189 KNEYSTEKL-KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFN 247
KN T+ L KN GC+ +G VN+V G+FH++ H QP+ FN
Sbjct: 98 KNTRKTDMLNKNQQKSGCRFHGEFYVNKVPGNFHVS---------THASKKQPH-KHDFN 147
Query: 248 TTHHIRHLSFGIKLQ--DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI---------- 295
H I L FG L + + L G E S ++Y +KI+PT+
Sbjct: 148 --HKINKLFFGEDLSALELPGNQTSLAGQATTNEPSLS-YDYTLKIVPTVHNDNKRRTTF 204
Query: 296 -YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
Y+ SK G P I+F YE++P+ VK T K K HL T I + GT+ +
Sbjct: 205 GYQYTVTSKTFKNTRGTPAIWFRYEIAPITVKYTHKKKPFYHLLTTICAIVGGTFTVAGM 264
Query: 355 VDALL---HSCVKKISKVEIG 372
+D+++ H VKK S+ ++G
Sbjct: 265 IDSMIFSAHQAVKKASEGKLG 285
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 58/112 (51%), Gaps = 3/112 (2%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
+K D + K +D + T G ++I FI +L+ +V + Q EL+VD + G
Sbjct: 3 IKRFDIYRKLPKDLTQPTTTGALISICSTFFIIFLLVSEVLSFLQEEVVSELYVDDPTTG 62
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
+ +P+ +D+ +P ++C+Y+A+ D+ G + N R+ D+ K Q+
Sbjct: 63 ATIPVIVDLEIPNMACEYVAIPKKDNQGRHEVGYLKNT--RKTDMLNKNQQK 112
>gi|367038975|ref|XP_003649868.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
gi|346997129|gb|AEO63532.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
Length = 380
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 76/372 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ L L ++ +++ + V+
Sbjct: 23 VSAFDAFPKSKPQYVTRTSGGGKWTVAMGLVSLVLFWSELGRWWRGTEEHTFAVEKGVSH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD---------LDGKPIQE 117
L I+LD+VV + C L ++ D++G++ L + RL +DGK + +
Sbjct: 83 VLNINLDVVV-RMRCADLHVNVQDAAGDRILAAD------RLSRDPTAWAHWVDGKGMHK 135
Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
++ + +V T G T E +G E + + V R KW
Sbjct: 136 LGRDA-----QGRVITGEGYTAEHDE-------GFGEE-------HVHDIVALGRRRAKW 176
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
+ T +L + C+IYG LE+N+V G FHI A G Y H+
Sbjct: 177 S--------------RTPRLWGAEPDSCRIYGSLELNKVQGDFHITARGHGYMAFGDHL- 221
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
AFN +H I LSFG L PLD TV A F Y++ ++PT Y
Sbjct: 222 -----DHNAFNFSHIISELSFGPFLP---SLANPLDRTVNIATAHFHKFQYFLSVVPTTY 273
Query: 297 ERLDGSKLGG-----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
LG D +PGIF Y++ P+++ I E
Sbjct: 274 SVGRPGALGARSIFTNQYAVTEQSQEVPDTTIPGIFVKYDIEPILLNIVETRDGFFVFLL 333
Query: 340 KIMCNISGTYIT 351
+++ +SG +
Sbjct: 334 RVINVVSGVLVA 345
>gi|50293697|ref|XP_449260.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528573|emb|CAG62234.1| unnamed protein product [Candida glabrata]
Length = 352
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 90/380 (23%), Positives = 150/380 (39%), Gaps = 81/380 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K E + +K+ GG +I+ ++F+ ++ + +F ++ VD
Sbjct: 4 LRTFDAFPKTDETYKKKSTKGGVTSILTYIFLLFIAWTEFGKFFGGYIDQQYTVDKVVRE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
I++D+ V I C+ + ++ D + ++ L ++ L L+ P P VN V
Sbjct: 64 TAQINMDLYV-NIKCENIHINVRDQTQDRKLVIQD------LKLEDMPFFIPYDSKVNGV 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKKWALPEL 182
N T ++++ G AE +TR+ + + E Y LP+
Sbjct: 117 --------NSIVTPDIDE--ILGEAIPAEFREKLDTRQFYDENDPESEKY------LPKF 160
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
+ GC I+G + VNRV G I A G Y +I
Sbjct: 161 N--------------------GCHIFGSVPVNRVKGELQITASGYGYPGKRAPKEEI--- 197
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL- 299
+ H I LSFG D PLD T E S + YYI +PT+Y++L
Sbjct: 198 -----DFAHAINELSFGDFYPYID---NPLDKTARFDKEHPLSAYMYYISAVPTMYKKLG 249
Query: 300 ----------DGSKLGGGDGG------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ K D +PGIFF Y PL ++IT+ S +++
Sbjct: 250 VEIETFQYSVNDYKYSMTDADPATVRKIPGIFFRYGFEPLSIEITDVRISFLQFIVRLVA 309
Query: 344 NISGTYITFMLVDALLHSCV 363
+S FM V + + + +
Sbjct: 310 ILS----FFMFVVSWIFTII 325
>gi|256269733|gb|EEU05000.1| Erv41p [Saccharomyces cerevisiae JAY291]
Length = 353
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 146/355 (41%), Gaps = 67/355 (18%)
Query: 7 LKGLDAF-TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
LK DAF TK E + +K+ GG +++ +LF+ ++ + +YF ++ VDS
Sbjct: 4 LKTFDAFRTKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVR 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ I++DI V T CD+L ++ D + ++ L +E + P P VN
Sbjct: 64 DTVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVND 116
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ N T EL++ G AE R+ +T + E+ K LPE +
Sbjct: 117 I--------NEIITPELDE--ILGEAIPAEF--REKLDTRSFFDES-DPNKAHLPEFN-- 161
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
GC I+G + VNRVSG I S+ +V P
Sbjct: 162 ------------------GCHIFGSIPVNRVSGELQITAN---SLGYVASRK-APLEELK 199
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
FN H I SFG D PLD T +E + + YY ++PT++++L D
Sbjct: 200 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 254
Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
++ D MPGIFF Y PL + +++ S +++
Sbjct: 255 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 309
>gi|322710423|gb|EFZ01998.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium
anisopliae ARSEF 23]
Length = 372
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 83/359 (23%), Positives = 146/359 (40%), Gaps = 57/359 (15%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K ++ +T GG T+ + +L+ ++ +++ + + V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTEGGGKWTVAMAVVSIFLLWAEIARWWRGAESHTFAVEKGVSH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE--PQKEVVN 124
+ I+LD V+ + C L ++ D++G++ L +L++D + QK V
Sbjct: 81 SMQINLDTVI-LMKCGDLHINVQDAAGDRIL------AGSKLNMDETSWSQWVNQKGVHK 133
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ + G L+D +G E + + V R KWA
Sbjct: 134 LGRDSEGRVITGAGWQNLDDEG-----FGEE-------HVHDIVALGQRRAKWA------ 175
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
T ++K + C+IYG L++N+V G FHI A G Y H+ Q
Sbjct: 176 --------KTPRVKGP-PDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHLDHEQ---- 222
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
FN +H I LSFG PLD T+ AE F YY+ ++PT Y S
Sbjct: 223 --FNFSHIISELSFGSYYP---SLVNPLDRTLNIAENHFHKFQYYVSVVPTRYSVGSSSI 277
Query: 304 L-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
G + +PG+F Y++ P+++ + E + K++ +SG +
Sbjct: 278 FTNQYAVTEQSKGVSEYNVPGVFVKYDIEPILLSVNEDRDGILMFVVKLINVLSGVLVA 336
>gi|213408569|ref|XP_002175055.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
yFS275]
gi|212003102|gb|EEB08762.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
yFS275]
Length = 331
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 146/364 (40%), Gaps = 84/364 (23%)
Query: 5 ERLKGLDAFTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
E ++ DAF K + + +++ GG ++I+ + I+ + ++ YFQ + ++ FV +
Sbjct: 8 EGIRVFDAFPKVAKTYRKQRSSQGGLLSIILAICITCISIMEFFFYFQGTREQQFFVYET 67
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
+ I+LD+ + + C +L +D +D + +
Sbjct: 68 ISEHMNINLDMTI-AMPCKFLQVDVLD------------------------------QTM 96
Query: 124 NAVKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+ V +V T+ TT ++ +P S T + + ++ + K LP+
Sbjct: 97 DHVFATEVFTKQETTVEDMRHEPLPVTS-----TGSFDAADLRRTRRKKFNKKSKTLPDG 151
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
+ C+ YG + V+R G HI APG Y ++++ ++
Sbjct: 152 GS-------------------ACRFYGAVTVHRTQGLLHITAPGWGYGMSNIPLN----- 187
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL-- 299
A N TH I LSFG LDG+ +E A F YY IIPT Y
Sbjct: 188 ---ALNFTHAIDELSFGDYYP---SLVNALDGSYGFTDEHAFAFQYYTSIIPTTYTSTFR 241
Query: 300 ------------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
+ G PGIF SY++ PL + I E SLG+ +I+ ISG
Sbjct: 242 NVQTNQYAVTENSVRRQTGFRSDPPGIFISYDIEPLGIHIRETYPSLGNTILRILA-ISG 300
Query: 348 TYIT 351
+T
Sbjct: 301 GLVT 304
>gi|358390077|gb|EHK39483.1| hypothetical protein TRIATDRAFT_302881 [Trichoderma atroviride IMI
206040]
Length = 372
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 144/363 (39%), Gaps = 65/363 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ L S + ++ +++ V+ G
Sbjct: 21 VSAFDAFPKSKPQYVTQTSGGGKWTVAMLLISSIFMWTELGRWWRGIEAHTFAVERGVGH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
+ I+LDIVV + CD L ++ D+SG++ L E + + +D G
Sbjct: 81 DMQINLDIVV-KMHCDDLHVNVQDASGDRILAADKLAREATTWSQWVDEKGM-------- 131
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
K ENG T L +K +G E + + + R KWA
Sbjct: 132 ------HKLGKNENGQLDTGLGWHSKHDEGFGEE-------HVHDIIALTQRRAKWA--- 175
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQ 239
T + + + C+++G +++N+V G FHI A G Y H+ HD
Sbjct: 176 -----------RTPRPRGK-PDSCRMFGSMDLNKVQGDFHITARGHGYMGMGQHLDHD-- 221
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--- 296
FN +H I +S+G PLD TV A F YY+ ++PT+Y
Sbjct: 222 -----KFNFSHIISEMSYGPYYP---SLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYLAN 273
Query: 297 ERLDGSKLGG--------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
R+ + D +PGIFF Y++ P+++ + E KI+ SG
Sbjct: 274 RRIVNTNQYAVTEHSKTISDHQIPGIFFKYDIEPILLSVEESRDGFLSFVIKIVNIFSGV 333
Query: 349 YIT 351
+
Sbjct: 334 MVA 336
>gi|157874469|ref|XP_001685717.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
Friedlin]
gi|68128789|emb|CAJ08922.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
Friedlin]
Length = 309
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)
Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
+ EGC++ GY++V +V G+FHI+ +H H + + N H I HLSFG
Sbjct: 128 SVAEGCRLEGYIKVAKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180
Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS----KLGGGDGGMP 312
+K PLDG ++E ++ Y++ I+PTIYE + + G P
Sbjct: 181 IDVKKLAKKAALHPLDGKEHRSEM-PMVYQYFLDIVPTIYESSFSTVYTYQFTGTSSSTP 239
Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
+ F Y+LSP+ V+ + SL H T + I G Y L+ +HS +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299
Query: 366 ISKVEIG 372
+ +G
Sbjct: 300 FQRHVLG 306
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 2/109 (1%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
D F D E T G +++ C + + L +V Y ++ + D + +
Sbjct: 11 DFFRHIPRDLTESTTAGSIISVACVVVMVLLFAGEVIAYVFPRIQSDMIIMPDLDDRNTI 70
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
+ +D+ P + C L LD +D + +I + RLD G+PI +
Sbjct: 71 KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119
>gi|453088947|gb|EMF16987.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 404
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 84/385 (21%), Positives = 151/385 (39%), Gaps = 81/385 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + ++T GG T++ + L ++ +++ TT V+ G
Sbjct: 23 VKAFDAFPKTKPSYQQRTSTGGVWTVILIVASVALTWSELARWWKGETTHTFAVEQGVGH 82
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L ++LD VV + C L ++ D++G++ L +++ + DG + A
Sbjct: 83 DLQMNLDTVV-RMKCADLHVNVQDAAGDRIL--AGSVFHK----DGTTWDQW------AG 129
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K + +T+ E ++ GS AE + + + +++ + P +
Sbjct: 130 NRKA----HALGSTKEERLSQKGSAASAEYREEDVHHYLSSARMKHKFGR--TPHIP--- 180
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ + C+IYG + N+V G FHI + H ++ Q + F
Sbjct: 181 -----------RGREADSCRIYGSMHGNKVKGDFHIT-----ARGHGYMEFGQHLDHSTF 224
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---------- 296
N +H I LSFG PLD T A E F YY+ ++PTIY
Sbjct: 225 NFSHRITELSFGPYYP---SLTNPLDNTFATTESNFYKFQYYLSVVPTIYTADAKALRKI 281
Query: 297 ERLDGSKLGGGDG------------------------------GMPGIFFSYELSPLMVK 326
++ S G DG +PGIF +++ P+ +
Sbjct: 282 DKYHESPTSGDDGLSQQPKRYSKNTVFTNQYAVTEQSHPVSESSVPGIFVKFDIEPIQLT 341
Query: 327 ITEKSKSLGHLWTKIMCNISGTYIT 351
I E S+ L +I+ +SG +
Sbjct: 342 IAENWSSVPALLIRIVNVVSGLLVA 366
>gi|198421328|ref|XP_002120997.1| PREDICTED: similar to Endoplasmic reticulum-Golgi intermediate
compartment protein 1 (ER-Golgi intermediate compartment
32 kDa protein) (ERGIC-32) [Ciona intestinalis]
Length = 289
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 89/198 (44%), Gaps = 30/198 (15%)
Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
++EK+ GC ++N+V G+FH++ H QP + TH I
Sbjct: 101 NSEKVPTHDGNGCLFTSRFQINKVPGNFHVS---------THSARSQPDNP---DMTHEI 148
Query: 253 RHLSFGIKLQDDDERRK---PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS------- 302
+ L G + + + L+G + S +Y +KI+PT+YE +DG+
Sbjct: 149 KELRIGDNMVIPGVKSQSFNALEGKTTFDKHPLSSHDYIMKIVPTVYESIDGNLRYLYQY 208
Query: 303 --------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
G G MP I+F YE++P+ VK TE+ K H T + I GT+ +
Sbjct: 209 TNAYKDYIAYGHGQRVMPAIWFRYEMTPITVKYTERRKPFYHFITMVCAIIGGTFTVAGI 268
Query: 355 VDALLHSCVKKISKVEIG 372
+D+++ S + K+ IG
Sbjct: 269 IDSMIFSATEMYKKLTIG 286
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MVF ++ D + K +D + T G A+++ C FISYL+ ++ + + EL+V
Sbjct: 1 MVFD--IRRFDIYRKVPKDLTQPTTTGAAISVGCCFFISYLLISELLGFLTIDVASELYV 58
Query: 61 DSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHN 102
D + G K+P+ + I +P + C+YL +D DS G + + N
Sbjct: 59 DDPQSGDKIPVQIIISLPKMKCEYLGMDIQDSMGRHEVGMVDN 101
>gi|443700340|gb|ELT99344.1| hypothetical protein CAPTEDRAFT_162161 [Capitella teleta]
Length = 110
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 63/110 (57%), Gaps = 18/110 (16%)
Query: 284 MFNYYIKIIPTIYERLDG--------------SKLGGG---DGGMPGIFFSYELSPLMVK 326
MF+YY+K++PT Y R +G K+GGG + G+PG+F +YELSP+MVK
Sbjct: 1 MFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMVK 60
Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
TEK++S H T + I G + LVDA ++ + I K+++G T
Sbjct: 61 YTEKNRSFMHFLTGVCAIIGGVFTVAGLVDAFIYHSARAIQKKIDLGKAT 110
>gi|320580226|gb|EFW94449.1| COPii-coated vesicle-associated protein, putative [Ogataea
parapolymorpha DL-1]
Length = 901
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 142/367 (38%), Gaps = 78/367 (21%)
Query: 23 KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCD 82
++ G TI+ +LF+ +LI V+V Y + + VD L I+LD+VV + C+
Sbjct: 584 RSTRGSYSTIITYLFLLFLIWVEVGGYIDGAIDHQFTVDELVRKDLVINLDLVV-AMPCN 642
Query: 83 YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTEL 142
Y+ + D + ++ L E L+ G P+ +A K T EL
Sbjct: 643 YIHTNVRDLTDDRFLAAE------LLNYQGTTFNIPRWYEQSAKK---------IVTPEL 687
Query: 143 EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFT 202
E L+ +Q + +Y E +
Sbjct: 688 E------------------------------------AVLERSLQARFQYQGEH-HDEGA 710
Query: 203 EGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
C+I+G + VNRV G HI A G Y D + N TH I SFG
Sbjct: 711 PACRIFGAIPVNRVKGELHITAKGYGY-------RDRTRIPAEGLNFTHAISEFSFGEFF 763
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----DGSK----LGGGDGG-MP 312
D PLD T+ + F Y+I ++PT+Y +L D ++ L G +P
Sbjct: 764 PYLD---NPLDMTLKTTDAHLHTFKYHINVVPTLYRKLGVEIDTNQYSLSLTESSGKYVP 820
Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
GIFF YE P+ + + E S ++ + G ++V L+ K+ + +
Sbjct: 821 GIFFQYEFEPIKLVVEETRLSFWQFVVRLATIMGG----ILVVAGWLYKLFDKLILLTL- 875
Query: 373 GKTVTKR 379
GK KR
Sbjct: 876 GKEFAKR 882
>gi|170108190|ref|XP_001885304.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
gi|164639780|gb|EDR04049.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
Length = 398
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 146/358 (40%), Gaps = 66/358 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K + +T G +TI L L+ D+ +Y E VD ++ S
Sbjct: 19 LAKFDAFPKLPSTYKTRTESRGFMTIFVILLAFLLMLNDIGEYIWGWPDFEFSVDDNKSS 78
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L +++D+VV + C ++++D D+ G+ RL L G
Sbjct: 79 FLDVNVDLVV-NMPCKFISVDLRDAMGD------------RLYLSGG------------- 112
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ T N T L++ ++ S A +++RK + +R K
Sbjct: 113 LRRDGTEFNVGQATALKEHSEALSARQAVSQSRKSRGLFANL---FRRNK---------S 160
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
K Y+ + N C+++G L+V RV+ + HI G Y+ ++ HV Q
Sbjct: 161 NFKPTYNYQPHGN----ACRVWGSLQVKRVTANLHITTLGHGYA-SYEHVDHNQ------ 209
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N +H I SFG D + PLD + +E + Y++ ++PT Y + L
Sbjct: 210 MNLSHVITEFSFGPHFPDITQ---PLDNSFESTDERFVAYQYFLHVVPTTYIAPRSAPLQ 266
Query: 306 -------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ G PGIFF ++L PL + +++ + L + + I G ++
Sbjct: 267 THQYSVTHYTRVMQHNQGTPGIFFKFDLDPLAITQHQRTTTFLQLLIRCVGVIGGVFV 324
>gi|367012766|ref|XP_003680883.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
gi|359748543|emb|CCE91672.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
Length = 348
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 149/375 (39%), Gaps = 76/375 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K E +++ GG +++ +LF+ ++ + YF ++ VD
Sbjct: 4 LRSFDAFPKTDETHQQRSFKGGLSSVMTYLFLLFMCWTEFGSYFGGYVDQQYKVDGEVRE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
I++D+ V + C+ L ++ D + ++ + + L + P P +VN +
Sbjct: 64 TFQINMDMYV-NMPCNLLHINVRDKT------MDRKVVSKELSMQNMPFFVPYGTMVNDM 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
KK T +L++ G A+ R + V EA L + V
Sbjct: 117 KK--------IATPDLDE--ILGEAIPAQFRERMDPS----VLEA---------SLGSDV 153
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
TF +GC IYG + VNRV+G I A G Y D + +
Sbjct: 154 -------------TF-DGCHIYGSVPVNRVAGELQITAKGWGY-------QDFEKAPVSE 192
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM-FNYYIKIIPTIYERL----- 299
N +H I S+G D PLD T + M + Y I+PT+YE+L
Sbjct: 193 INFSHVINEFSYGDFFPYID---NPLDNTAKISIVDRLMGYLYDTSIVPTVYEKLGAYVD 249
Query: 300 -----------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG- 347
D G +PGIFF Y+ PL + I ++ S +++ +S
Sbjct: 250 TNQYAVSERQFDQKSTKRGSTTVPGIFFRYDFEPLSISIKDRRLSFIQFIIRLVALLSFV 309
Query: 348 TYI---TFMLVDALL 359
YI TF +VD L
Sbjct: 310 VYIASWTFRMVDLTL 324
>gi|225712562|gb|ACO12127.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Lepeophtheirus salmonis]
Length = 290
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 84/188 (44%), Gaps = 31/188 (16%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC + +N+V G+FH++ H D+QP +N +H I +SFG K++
Sbjct: 112 GCLFEAHFHINKVPGNFHVS---------THSVDVQP---DEYNFSHEIHEVSFGSKIKK 159
Query: 264 DDERR----KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
+ L G + Y +KI+PT YE L G+KL
Sbjct: 160 ISSKNIGTFNSLSGRDSSESGALDSHEYVMKIVPTTYESLGGAKLFAYQYTYAYRSYVSF 219
Query: 307 GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
G GG +P ++F Y+L+P+ VK E + H T + + GT+ ++D+ L + +
Sbjct: 220 GHGGRVVPALWFRYDLNPITVKYHETRPPIYHFLTTVCAIVGGTFTVAGIIDSTLFTATQ 279
Query: 365 KISKVEIG 372
K E+G
Sbjct: 280 LFKKFELG 287
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 3/106 (2%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MVF +K D + K +D + TV G ++I C +FI ++ + + EL V
Sbjct: 1 MVFD--VKRFDVYRKIPKDLTQPTVAGAIISICCTIFIFLMLVTEFWFFITPDVQSELIV 58
Query: 61 DSSRGS-KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
+++ + ++P+ ++I +P + C+YL +D D G + N K
Sbjct: 59 ENANPTDRIPVRINISLPKMKCEYLGIDIQDDMGRHEVGFVENTAK 104
>gi|145546125|ref|XP_001458746.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426567|emb|CAK91349.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 106/220 (48%), Gaps = 34/220 (15%)
Query: 170 EAYRYKKWALPELDTIVQCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLS 227
E Y+ + +D + + + E+ + + EGC + GY+ ++RV G+FHI+
Sbjct: 96 ELYKSRTLNGKVIDKYLSTNDSLNLERAQQAYQQKEGCDLAGYIIISRVPGNFHISA--- 152
Query: 228 YSINHVH---VHDIQPYTS-AAFNTTHHIRHLSFGIK--LQDDDERRK-----PLDGTV- 275
H + V+ + P+ + + +H I+HLSFG + +Q E+ K PLDG
Sbjct: 153 ----HPYGGQVNMVLPFVGLSVIDLSHSIKHLSFGKQNDIQKIREKFKQGLLNPLDGIRR 208
Query: 276 AKAEEGASM---FNYYIKIIPTIYERLDGSKLGGGDGG----------MPGIFFSYELSP 322
K +E ++ YYI I+PT+Y +D + MP ++F Y++SP
Sbjct: 209 IKTQELTNVGVTHQYYISIVPTLYVDIDNKEYFVNQFAANTNEAQTTQMPAVYFRYDISP 268
Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
+ V+ T+ +S H ++ + G + ++D++ ++C
Sbjct: 269 VTVQFTKYYESFNHFIVQLCAILGGVFTIAGIIDSIFYAC 308
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 1/106 (0%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
D + K +D E + G ++ + + L + +Y E+++D ++ KL
Sbjct: 4 FDLYRKLPQDLIEPSKSGALISFTSLILMFILFITEFQEYLTQQVQTEMYIDQNKDDKLL 63
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
+++DI P + CD++++D D G +VE +YK R L+GK I
Sbjct: 64 VNMDISFPNMPCDFISIDQQDVIGTHQQNVEGELYKSR-TLNGKVI 108
>gi|358058634|dbj|GAA95597.1| hypothetical protein E5Q_02253 [Mixia osmundae IAM 14324]
Length = 682
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/359 (22%), Positives = 138/359 (38%), Gaps = 78/359 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + + GG T++ + I L+ + +Y E VD G
Sbjct: 29 LRTFDAFPKTLPTYRSTSSRGGVYTVLLAVAILVLVWYEATEYLFGEPLYEFSVDKGIGK 88
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I++D+ V + C YL +D D+ G++ LHV K + I + Q+ V A
Sbjct: 89 MLQINVDMTV-AMPCHYLTVDIRDAVGDR-LHVSDEFVKDGTTFE---IGQAQRLVTMAF 143
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ DP EAY+ +
Sbjct: 144 ES---------------DP------------------------EAYK----------VVQ 154
Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+ + + E+ + G C+IYG + V +V+G+ HI ++ H ++ +
Sbjct: 155 EARRPRAFEQTYHIVENGPACRIYGTMAVKKVTGNLHIT-----TLGHGYL-SWEHTDHK 208
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-------- 296
N +H I SFG + PLD T+ E +F Y++ I+ T Y
Sbjct: 209 LMNLSHVIHEFSFGPLFPGISQ---PLDNTLEVTESSFHIFQYFMSIVSTTYVDHHRNVL 265
Query: 297 -----ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
D S+ G+PGIF Y+ P+M+ + E++ +LG ++ + G +
Sbjct: 266 ETAQYSVTDMSRATVHGRGVPGIFLKYDPEPMMLTLRERTTTLGQFLIRLAGIVGGVIV 324
>gi|353236810|emb|CCA68797.1| related to ERV41-component of copii vesicles involved in transport
between the ER and golgi complex [Piriformospora indica
DSM 11827]
Length = 559
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/363 (22%), Positives = 143/363 (39%), Gaps = 72/363 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + +T +GG +T+ L+ D+ ++ + E +D+ +
Sbjct: 47 IKQFDAFPKLPASYKSRTKFGGFMTLFVVTLSFLLVLNDIGEFIWGWSDYEFAIDTDQHR 106
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I++D+VV T C L++D D+ G++ LH+ I + D E KE +
Sbjct: 107 LLEINVDLVVNT-PCSILSVDLRDAVGDR-LHLSDTIVRDGTLFDISQAHE-FKEHQRVL 163
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+++ A +R + + +R W
Sbjct: 164 STREIV--------------------AASRRSRGFFSMFKASRPQFR-PTW--------- 193
Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHV-HDIQPYT 242
N +G C++YG V +++G+FHI G Y ++ H HD
Sbjct: 194 ------------NHTPDGGACRVYGSFAVRKLTGNFHITTLGHGYGGHNAHASHD----- 236
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
N +H I SFG D +PLD + +E F Y+I ++PT Y
Sbjct: 237 --NINMSHVITEFSFGPYYPD---IVQPLDYSFETTQEHFVAFQYFITVVPTTYVAPRSK 291
Query: 303 KLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
L G PGIFF Y++ P+ ++I +++ +L +I+ I G +
Sbjct: 292 PLHTHQYSVTHYVKELPHSQGTPGIFFKYDIDPVALEIHQRTTTLTQFLVRIVGVIGGVW 351
Query: 350 ITF 352
+ F
Sbjct: 352 VCF 354
>gi|400594740|gb|EJP62573.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Beauveria bassiana ARSEF 2860]
Length = 374
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 69/366 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISY-LICVDVCDYFQVSTTEELFVDSSRG 65
+ DAF K ++ +T GG T+ +FIS L+ +V +++ T V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTAGGGKWTVAM-IFISLVLMGSEVARWWRGEQTHNFAVEKGIS 79
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
++ I+LDIVV + D L ++ D+SG++ L L P + Q V N
Sbjct: 80 HEMQINLDIVVNMLCAD-LHINVQDASGDRIL--------ASAMLHRDPTKWSQ-WVDNG 129
Query: 126 VKK------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
V K +V T G T+ D +G E + + V + KW+
Sbjct: 130 VHKLGHDANGRVNTGEGWTSLANNDEG-----FGEE-------HVHDIVALGKKRAKWS- 176
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HD 237
T + T + C+IYG L++N+V G FHI A G Y H+ HD
Sbjct: 177 -------------KTPRFWGT-ADSCRIYGSLDLNKVQGDFHITARGHGYMEFGQHLDHD 222
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
FN +H I LS+G PLD TV A F YY+ ++PT+Y
Sbjct: 223 -------KFNFSHVISELSYGAFYP---SLVNPLDRTVNVAAAHFHKFQYYLSVVPTVYS 272
Query: 298 ------------RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
+ SK +PGIF Y++ P+++ + E S K++ +
Sbjct: 273 VGRSTIQTNQYAVTEQSKEIDEHSAVPGIFVKYDIEPILLAVHESRDSFIVFLLKLINVV 332
Query: 346 SGTYIT 351
SG +
Sbjct: 333 SGVLVA 338
>gi|225712696|gb|ACO12194.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Lepeophtheirus salmonis]
Length = 372
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 84/180 (46%), Gaps = 20/180 (11%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+I+G L +N+V+G+FHI+PG + + HVH +N TH I SFG
Sbjct: 172 DACRIHGSLTLNKVAGNFHISPGKTLPLFRAHVHFATFGGDEVYNFTHRIDRFSFGTP-- 229
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------------IYERLDGSKLGG 306
+PL+G A + + + Y I+++PT + E +K
Sbjct: 230 -HGGIVQPLEGEEKIAMQDSMHYQYLIQVVPTDIQGYTDLIWSTYQYSVKEHKRATK-ER 287
Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G G PGI+F Y++S L V ++ + + +++ + G T +V + S ++KI
Sbjct: 288 GSGDTPGIYFKYDMSALKVLASQDREPIFKFLVRLLAAVGGRIATSQIVCVFIKSMIEKI 347
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 1/84 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + EKT G A++I+ + + L+C + + D+ S
Sbjct: 16 VKELDAFPKVPETYVEKTASGAAISIITTILVIVLLCSETSYFMDPGINFRFIPDTDFKS 75
Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
KL I++DI + T C + D +D
Sbjct: 76 KLEINVDITIAT-PCKAIGADVLD 98
>gi|164661257|ref|XP_001731751.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
gi|159105652|gb|EDP44537.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
Length = 454
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/358 (21%), Positives = 146/358 (40%), Gaps = 66/358 (18%)
Query: 20 FHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPT- 78
+ ++T YGG VT+ ++ +I ++ Y + T +DS G + I+LD+VV T
Sbjct: 53 YQKRTSYGGFVTLAVFIATMVVIWYEIQHYLMLKPTYSFDIDSHVGGFMQINLDVVVATP 112
Query: 79 --------------ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
++ +++D D+SG+ E +I K +D + K Q QK +
Sbjct: 113 CGRTYPYDVRFPCILTLSGVSIDLRDASGDTLHFSEDDIVKDPVDFN-KERQRAQKRSLT 171
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
K + ++ ++E +K V R++ D
Sbjct: 172 QYFLKMLHSQY-RNMKKIERKDK------------------KIVAGGPRHRDSGFDFSDP 212
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
+ + C++YG + V +V+G+ HI+ + + V+ H+
Sbjct: 213 MENAEE-----------ARACRVYGSILVKKVTGNLHISTFVP-TFMAVNAHE----NGM 256
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------- 294
+ +H I SFG + E PLD ++ ++ A+ F Y++ ++PT
Sbjct: 257 GIDMSHIIHEFSFGDYFPNIAE---PLDASLELTDDPAAAFQYFLSVVPTHFIHGRRVIK 313
Query: 295 --IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
Y D + G PG++F Y++ PL +K+T KS SL ++ + G +I
Sbjct: 314 TNQYSVHDYKRNPQGSLTFPGLYFKYDIEPLTMKVTHKSVSLVAFIVRVCSVLGGLWI 371
>gi|308806572|ref|XP_003080597.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
gi|116059058|emb|CAL54765.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
Length = 327
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/384 (21%), Positives = 152/384 (39%), Gaps = 86/384 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELFVDSSRG 65
+ LDA T KT G V++ C F++ ++ + D+F T+ VD R
Sbjct: 5 FRSLDALTSAPAHLRRKTSTGAVVSL-CGTFVAVILTLSQTIDFFTPLRTKTTRVDEQRA 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
++ + +D+ + C L +DA D+SG K +D+ G+ + K ++A
Sbjct: 64 GEMTMDIDVTFTRMPCQILYVDAYDASG-----------KHEVDVRGRLM----KTRLDA 108
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSC-YGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ EL + G G R+ +EV++A
Sbjct: 109 AGR------------ELGEYESAGGVDLGGLVLFRRRPEHGSEVRKA------------- 143
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
EGC+++G +E RV+GS I+ G S + +P+
Sbjct: 144 --------------KADMEGCRLHGRVEARRVAGSLRISTG-PESFEFLREMFNEPW--- 185
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------- 297
+ H I+ +FG + PL+G V + E+ + ++ Y++K++PT Y
Sbjct: 186 EIDARHAIKTFAFGPEFPGSV---NPLNG-VKRKEKKSGIYKYFMKVVPTTYANSRNLFG 241
Query: 298 ------RLDGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
R+ ++ + G +P I FSY++S + V + +SKS + TK +
Sbjct: 242 MIPWTMRVRTNQYSVTEHFTESAHWGMLPQILFSYDISAISVNVESQSKSGVYFLTKTIA 301
Query: 344 NISGTYITFMLVDALLHSCVKKIS 367
+ G + +D + V+ S
Sbjct: 302 TVGGVFALTRTIDRYVDLAVRVTS 325
>gi|145551751|ref|XP_001461552.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429387|emb|CAK94179.1| unnamed protein product [Paramecium tetraurelia]
Length = 317
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 92/184 (50%), Gaps = 26/184 (14%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS-AAFNTTHHIRHLSFGIKL 261
EGC++ GY+ ++RV G+FHI+ SY V+ + P+ + + +H I+HLSFG +
Sbjct: 131 EGCEMTGYIIISRVPGNFHISAH-SYG---GQVNIVLPFVEMSTIDLSHTIKHLSFGNQN 186
Query: 262 QDDDERRK-------PLDG-TVAKAEEGASM---FNYYIKIIPTIYERLDGSKL------ 304
R K PLDG + K +E ++ YYI I+PTIY +D +
Sbjct: 187 DIQKIREKFQQGLLNPLDGISRIKTQELKNVGVTHQYYISIVPTIYVDIDNREYFVNQFT 246
Query: 305 ----GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH 360
MP I+F Y++SP+ V+ T+ ++ H ++ + G + ++D++ +
Sbjct: 247 ANTNEAQTNSMPAIYFRYDISPVTVQFTKYYETFNHFIVQLCAILGGVFTIAGIIDSVFY 306
Query: 361 SCVK 364
+ K
Sbjct: 307 ALQK 310
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/106 (22%), Positives = 53/106 (50%), Gaps = 1/106 (0%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
D + K +D E + G ++ + + L + +Y E+++D ++ L
Sbjct: 4 FDLYRKLPQDLIEPSKSGALISFTSLILMFILFITEFQEYLTQQVQTEMYIDQNKDDTLL 63
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
+++DI P + CD++++D D G +V+ + K+R+ L+G+ I
Sbjct: 64 VNMDISFPNMPCDFISIDQQDVIGTHQQNVKGELLKKRI-LNGRVI 108
>gi|219125194|ref|XP_002182871.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405665|gb|EEC45607.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 467
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 91/190 (47%), Gaps = 40/190 (21%)
Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
GCQ+ G+L VNRV G+FH+ A S+++N +A N +H + HLSFG +
Sbjct: 285 GCQVSGHLMVNRVPGNFHLEAKSKSHNLN-----------AAMTNLSHVVNHLSFGEPID 333
Query: 263 DDDERRK--------------PLDGTVAKAEEGASMFNYYIKIIPT-------------I 295
+++ + K P+DG + F++YIK++ T
Sbjct: 334 ENNRKSKRILKQVPEEHRQFAPMDGQAFLTKAFHQAFHHYIKVVSTHLNMGSSDANSMLT 393
Query: 296 YERLDGSKLGG-GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
Y+ L+ S++ D +P FSY+LSP+ V + ++ + T + I GT+ T L
Sbjct: 394 YQFLEQSQIVFYDDVNVPEARFSYDLSPMSVVVEKEGRKWYDYLTSLCAIIGGTFTTLGL 453
Query: 355 VDALLHSCVK 364
+DA L+ +K
Sbjct: 454 IDATLYKVLK 463
Score = 44.3 bits (103), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 21/106 (19%), Positives = 50/106 (47%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ +D + + +D E T G +++ + + L + + + + +D +
Sbjct: 1 MSSVDFYRRVPKDLTEATSLGAIMSVCALVVMGVLFLSETAAFARTGIATSITLDENTSP 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
++ ++ +I + + CDY+++D D+ G +V NI K +LD G
Sbjct: 61 QIRLNFNITLTDLQCDYVSIDVWDALGTNKQNVTKNIDKWQLDAQG 106
>gi|392594239|gb|EIW83563.1| DUF1692-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 506
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 144/358 (40%), Gaps = 68/358 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K + ++ G +TI LI D+ +Y E VD S
Sbjct: 20 LAKFDAFPKLPSSYKSRSESRGFLTIFVGFLCFLLILNDLSEYIWGWPDYEFGVDKQSKS 79
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ +++D+VV + C +L++D D SG++ L++ RR DG
Sbjct: 80 FMDVNVDMVV-NMPCQFLSVDLRDVSGDR-LYLSKGF--RR---DG-------------- 118
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T + T L++ K S A +++RK + K +
Sbjct: 119 -----TLFDIGQATSLKEHAKMLSAQQAVSQSRKSRGFFSWFKRS--------------- 158
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
K E+ C+IYG L V +V+ + H+ G Y+ +H+HV +
Sbjct: 159 --KAEFRPTYNHQPDGSACRIYGTLAVKKVTANLHVTTLGHGYT-SHMHVDHTK------ 209
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
N +H I SFG D + PLD + A++ + F YY+ ++PT Y L
Sbjct: 210 MNLSHVITEFSFGPYFPDISQ---PLDYSFEVAKDPYTAFQYYMHVVPTNYIAPRSKPLE 266
Query: 306 GGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
G+PGIFF ++L P+++ I +++ SL L + + I G +
Sbjct: 267 TNQYSVTHYTHIYKTPHEGIPGIFFKFDLDPMVLSIHQRTTSLTALIIRCVGVIGGVF 324
>gi|260950511|ref|XP_002619552.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
gi|238847124|gb|EEQ36588.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
Length = 347
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 141/380 (37%), Gaps = 89/380 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGG---AVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
FS +++ DAF K + ++ GG +T+ C L I I + + Y +
Sbjct: 4 FSSKVRVFDAFPKVAPEASVRSQRGGFSTILTVFCGLLI---IWIQIGGYLGGYIDRQFS 60
Query: 60 VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
VD+ L I+LD+VV + C +++ + +D + +++L E L+ G P+
Sbjct: 61 VDNETRKDLNINLDMVV-AMPCQFISTNVMDITSDRYLAGE------VLNFQGTGFYVPE 113
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+N EN T
Sbjct: 114 FFALN--------RENNDYDT--------------------------------------- 126
Query: 180 PELDTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
PELD I+Q + EY + N C I+G + VN V G F I P S
Sbjct: 127 PELDEIMQETLRAEYGIAGARVNEDAPACHIFGTIPVNHVRGEFFIVPKGS------MYR 180
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
D A+N +H I SFG PLD T EE + Y+ K++PT Y
Sbjct: 181 DRSSIDPKAYNFSHVISEFSFGDFYP---FITNPLDFTAKVTEENRQAYRYFAKLVPTHY 237
Query: 297 ERL---------DGSKLGGGDGGM----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
E+L +++ D PGIFF Y P+ + I EK ++M
Sbjct: 238 EKLGLVVDTYQYSLTEIHNVDHNRGIPPPGIFFDYSFEPIKLTIREKRIGFFAFVARLMT 297
Query: 344 NISGTYIT----FMLVDALL 359
+SG I F L + LL
Sbjct: 298 VLSGLLIAAGYLFRLYEKLL 317
>gi|45190741|ref|NP_984995.1| AER136Wp [Ashbya gossypii ATCC 10895]
gi|44983720|gb|AAS52819.1| AER136Wp [Ashbya gossypii ATCC 10895]
gi|374108218|gb|AEY97125.1| FAER136Wp [Ashbya gossypii FDAG1]
Length = 340
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 143/375 (38%), Gaps = 76/375 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + ++ GG ++I+ +LF+ ++ + YF E+ +D
Sbjct: 4 LRTFDAFPKTDQQHVRRSSRGGIMSIMMYLFLLFIAWGEFGSYFGGYLDEQYIIDPELRQ 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
I++D++V + C YL + A D + + N +RL P P ++V
Sbjct: 64 TTQINMDVMV-QMPCKYLDVKATDITRDI------NDVSKRLVFKNIPFFVPYGTTFDSV 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
E+ P+ G + + +R +P+ D
Sbjct: 117 N-------------EVRTPDIDGML-------------ADAIPLKFREN---IPDAD--- 144
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
E+ GC IYG + VNRV G HI P G YS HD
Sbjct: 145 -LPEEFEFN--------GCHIYGSIPVNRVKGELHITPKGWRYSSRQRVPHD-------E 188
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----DG 301
N TH SFG D LD A++ + F+Y++ ++PTIY ++ D
Sbjct: 189 INLTHIFNEFSFGEFFPYIDNT---LDQVGRYAQQRLTRFHYFVSVLPTIYRKMGAVVDT 245
Query: 302 SKLGGGDGGM---------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG-TYI- 350
++ + PGIF Y L V + +K S +++ +S YI
Sbjct: 246 NQYSVSHNDITYTSSRLYTPGIFILYNFEALTVVVQDKRISFWAFLIRLVTMLSFIVYIA 305
Query: 351 --TFMLVDALLHSCV 363
F LVD LL S +
Sbjct: 306 AWAFRLVDWLLISTL 320
>gi|410914052|ref|XP_003970502.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Takifugu rubripes]
Length = 290
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/186 (29%), Positives = 85/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FHI+ H QP + TH I L+FG KLQ
Sbjct: 114 GCRFEGEFIINKVPGNFHIS---------THSASAQPQNP---DMTHFIHKLAFGDKLQM 161
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
E+ L G A + +Y +KI+PT+YE L G + +
Sbjct: 162 HQEKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + T I + GT+ ++D+ + + +
Sbjct: 222 TGRIVPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIIDSCIFTASEAW 281
Query: 367 SKVEIG 372
K++IG
Sbjct: 282 KKIQIG 287
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 52/101 (51%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
++ D + K +D + T G ++I+C +FI +L ++ + EL+VD
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + CD + LD D G + H+E+++
Sbjct: 65 SGGKIEVSLNITLPNLHCDLVGLDIQDEMGRHEVGHIENSM 105
>gi|326470603|gb|EGD94612.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
Length = 399
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/385 (22%), Positives = 152/385 (39%), Gaps = 77/385 (20%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
+ +LK DAF K + + GG TI + + L C ++ +++ V+
Sbjct: 21 ATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVERG 80
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEV 122
++ +++D VV + CD + ++ D++G+ H+ L G + QEP
Sbjct: 81 VSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW- 125
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE-TETRKCCNTCNEVKEAYRYKKWALPE 181
A +++ + E + NK S E E + EV+ + + K P+
Sbjct: 126 --AAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQAEDLHVEHVLGEVRRSRKKKFPKAPK 183
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
L K+ + C+++G LE N+V G+ HI A G Y P
Sbjct: 184 LK--------------KSDAVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRATNP 226
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-- 298
++ N TH I LSFG PLD TV+ + YY+ ++PTIY +
Sbjct: 227 HS---LNFTHLITELSFGPHY---GRLLNPLDKTVSSTSINFYKYQYYLSVVPTIYTKSG 280
Query: 299 ---------LDGSKLGGGDG-----------------------GMPGIFFSYELSPLMVK 326
D S + D PGIFF Y + P+++
Sbjct: 281 HIDPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPIQPRIDSTPGIFFKYNIEPILLI 340
Query: 327 ITEKSKSLGHLWTKIMCNISGTYIT 351
++++ SL L +++ +SG +T
Sbjct: 341 VSQERDSLLALMVRLVNVVSGVLVT 365
>gi|387015778|gb|AFJ50008.1| ER Golgi intermediate [Crotalus adamanteus]
Length = 290
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 88/187 (47%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC+ G+ +N+V G+FHI+ H QP + TH I LSFG KLQ
Sbjct: 113 DGCRFEGHFSINKVPGNFHIS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 160
Query: 263 DDD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+ L GT + + +Y +KI+PT+YE + G + +
Sbjct: 161 VPNIHGAFNALGGTDRLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAYS 220
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 280
Query: 366 ISKVEIG 372
K+++G
Sbjct: 281 WKKIQLG 287
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C FI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTFTGAIISICCCFFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ ++L+I +P++ C+ + LD D G + H+++++
Sbjct: 65 SGGKIEVNLNISLPSLHCELIGLDIQDEMGRHEVGHIDNSM 105
>gi|195439332|ref|XP_002067585.1| GK16119 [Drosophila willistoni]
gi|194163670|gb|EDW78571.1| GK16119 [Drosophila willistoni]
Length = 443
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/389 (21%), Positives = 154/389 (39%), Gaps = 52/389 (13%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
E K LDAF K E + E T GG ++++ L I YL+ ++ Y+ + + D +
Sbjct: 16 EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELQYYWHETQIIYQFEPDIA 75
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV----EHNIYKRRLDLDGKPIQEPQ 119
++P+H+DI V +D +D + + ++ D D Q Q
Sbjct: 76 LEEQVPMHVDITVAMPCASLSGVDLMDETQQDVFAYGTLQREGVWWEMSDADRMQFQSAQ 135
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ N +++ + +D + G G + K A
Sbjct: 136 --LTNHYLREQY---HSVADILFKDIMRDGILKGRSDSSAKPA---------------AP 175
Query: 180 P--ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
P L ++ + ++ + F + C+++G L +N+V+G H+ G + H
Sbjct: 176 PPGSLPAVLDLHQDTHLQQPEAKF-DACRLHGTLGINKVAGVLHLVGGAQPVVGLFQDHW 234
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
+ + N TH I LSFG Q +PL+G +E A+ Y++KI+PT E
Sbjct: 235 MIEFRRMPANFTHRINRLSFG---QYSRRIVQPLEGDETIIQEEATTVQYFLKIVPTEIE 291
Query: 298 ------------------RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
+LD + G PGI+F Y+ S L + ++ +
Sbjct: 292 QTFSTINTFQYSVTENVRKLDSER---NSYGSPGIYFKYDWSALKIVVSNDRDHILTFVI 348
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISK 368
++ ISG + +++LL +++ +
Sbjct: 349 RLCSIISGIIVLSGAINSLLLGMQRRLLR 377
>gi|402591333|gb|EJW85263.1| hypothetical protein WUBG_03826, partial [Wuchereria bancrofti]
Length = 244
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 85/188 (45%), Gaps = 29/188 (15%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--I 259
T GC++ G E+++V G+FHI+ H D QP T ++ H I + FG I
Sbjct: 66 TSGCRLEGKFEISKVPGNFHIS---------THAADTQPET---YDMRHTIHSVVFGDDI 113
Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
+ PL A +G+ +Y +KI+P++YE + G+K
Sbjct: 114 STSQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTY 173
Query: 307 --GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
MP ++F YEL P+ +K TE+ + T I + GT+ ++DA L S +
Sbjct: 174 HYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLFSLTE 233
Query: 365 KISKVEIG 372
K ++G
Sbjct: 234 LYRKHQMG 241
>gi|326928384|ref|XP_003210360.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Meleagris gallopavo]
Length = 321
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC+ G+ +N+V G+FH++ H QP + TH I LSFG KLQ
Sbjct: 144 DGCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDKLQ 191
Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+ L+G + + +Y +KI+PT+YE + G + +
Sbjct: 192 VQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAYS 251
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 252 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASEA 311
Query: 366 ISKVEIG 372
K+++G
Sbjct: 312 WKKIQLG 318
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 53/105 (50%), Gaps = 4/105 (3%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
S + G D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 32 LSHCVVGFDIYRKVPKDLTQPTYTGALISVCCCLFILFLFLSELTGFIATEIVNELYVDD 91
Query: 63 ---SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ ++L+I +P + C+ + LD D G + H+++++
Sbjct: 92 PDKDSGGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 136
>gi|345320110|ref|XP_001521132.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like, partial [Ornithorhynchus anatinus]
Length = 283
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC+ G +N+V G+FH++ H QP + TH I LSFG KLQ
Sbjct: 106 DGCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 153
Query: 263 DDD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+ L G ++ + ++Y +KI+PT+YE +G + +
Sbjct: 154 VQNIHGAFNALGGADKRSSNPLASYDYILKIVPTVYEDKNGKQRYSYQYTVANKEYVAYS 213
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 214 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 273
Query: 366 ISKVEIG 372
K+++G
Sbjct: 274 WKKIQLG 280
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
D + K +D + T G +++ C LFI +L ++ + EL+VD G
Sbjct: 1 FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDSGG 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L+I +P + CD + LD D G + H+++++
Sbjct: 61 KIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIDNSM 98
>gi|158292439|ref|XP_313915.3| AGAP005044-PA [Anopheles gambiae str. PEST]
gi|157016993|gb|EAA09437.3| AGAP005044-PA [Anopheles gambiae str. PEST]
Length = 371
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 86/181 (47%), Gaps = 26/181 (14%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
+ C+I+G L +N+V+G+FHI G + +S H+H++ I + + N +H I SFG
Sbjct: 169 DACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSI--FANTQTNFSHRINRFSFG-- 224
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP---------------TIYERLDGSKLG 305
PL+G + G M Y+I+++P T+ E L +
Sbjct: 225 -DHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHSKTYQYTVRENLQLIDID 283
Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G G+ GI+F Y++S L V + + S+ H ++ I+G +++ +L C+
Sbjct: 284 KGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAG----IVVISGMLSKCMHL 339
Query: 366 I 366
I
Sbjct: 340 I 340
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 1/85 (1%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
LDAF K E+F + T GG ++++ L I +LI +V Y D+ SKL
Sbjct: 17 LDAFPKVKEEFVQPTRVGGTLSLISRLVIVFLIYHEVTYYLDSRLVFTFVPDTDLQSKLK 76
Query: 70 IHLDIVVPTISCDYLALDAVDSSGE 94
+H+D+ V + C + D +DS+ +
Sbjct: 77 VHIDLTV-AMPCKSIGADILDSTNQ 100
>gi|344229081|gb|EGV60967.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
gi|344229082|gb|EGV60968.1| hypothetical protein CANTEDRAFT_115996 [Candida tenuis ATCC 10573]
Length = 352
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 146/371 (39%), Gaps = 86/371 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F+ R++ DAF K + +++ G TI + F ++ V+V + + VD
Sbjct: 4 FATRVRTFDAFPKVDSEHTVRSLRGALSTIATYFFALVILWVEVGGFLGGYVDHQFVVDD 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
+ L I++D+ V T+ C+ + + VD + ++ L E L+ +G P +
Sbjct: 64 QIRTNLSINIDMTV-TMPCELIHTNVVDITDDRFLAAE------LLNFEGVHFFAPPQFF 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
++ ++N K++ P+L
Sbjct: 117 -------RINSQN---------------------------------------KEYETPDL 130
Query: 183 DTIVQ--CKNEY--STEKLKNTF-TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
D +++ + E+ S +K+ C I+G + VN V G FHI A G+ Y + +H
Sbjct: 131 DHVMRENIRAEFYISGQKINQVAGAPACHIFGTIPVNHVQGEFHITAKGVGYQ-DSLHT- 188
Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
P+ F +H I+ SFG D PLD + E + YY ++PT+Y
Sbjct: 189 ---PWERMNF--SHVIQEFSFGTFYPMID---NPLDMSGKITHESLQSYKYYSNVVPTLY 240
Query: 297 ERL----DGSKLGGGDGGM-------------PGIFFSYELSPLMVKITEKSKSLGHLWT 339
ERL D ++ + + PGIFF YE P+ + I EK
Sbjct: 241 ERLGIVVDTNQYSISEQHLVIRKDSNGRIYSPPGIFFKYEFEPIKLTIVEKRLPFIQFVA 300
Query: 340 KIMCNISGTYI 350
++ + G I
Sbjct: 301 RLGTILGGLLI 311
>gi|115623567|ref|XP_794044.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Strongylocentrotus purpuratus]
Length = 289
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 88/197 (44%), Gaps = 28/197 (14%)
Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
+T+K+ +GC Y +N+V G+FH++ H + + + H I
Sbjct: 101 NTKKIPLNNGQGCLFYSAFTINKVPGNFHVS-----------THAVGMNQPQSTDFAHII 149
Query: 253 RHLSFGIKLQDD--DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK------- 303
+SFG +Q+ PL+G + + +YY+KI+PT+YE L G+K
Sbjct: 150 HEVSFGDDIQNKTLGASFNPLEGRDKRDSKSDLSHDYYMKIVPTVYEDLWGTKNVSYQYT 209
Query: 304 --------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
G G +P I+F Y++SP+ VK EK T + + GT+ +
Sbjct: 210 YAYKDYGSQGHGRRVLPAIWFRYDISPITVKYHEKRAPFYTFITTVCAIVGGTFTVAGIF 269
Query: 356 DALLHSCVKKISKVEIG 372
D+++ + + K E+G
Sbjct: 270 DSIIFTAAEVFKKAELG 286
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 53/110 (48%), Gaps = 3/110 (2%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MVF R LD + K +D + T G V+++ LFI++L+ + + + EL+V
Sbjct: 1 MVFDFRR--LDVYRKIPKDLTQPTYAGACVSLLSMLFITFLLLSEFMSFIRPEVVSELYV 58
Query: 61 DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
D+ +L + +++ +P + C + LD D G + N K L+
Sbjct: 59 DNPGEIERLTVRVNLSLPKLHCGVVGLDIQDDMGRHEVGYVDNTKKIPLN 108
>gi|224067439|ref|XP_002195791.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Taeniopygia guttata]
Length = 290
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC+ G+ +N+V G+FH++ H QP + TH I LSFG KLQ
Sbjct: 113 DGCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 160
Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+ L+G + + +Y +KI+PT+YE + G + +
Sbjct: 161 VHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAYS 220
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASEA 280
Query: 366 ISKVEIG 372
K+++G
Sbjct: 281 WKKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ ++L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|154415829|ref|XP_001580938.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121915161|gb|EAY19952.1| hypothetical protein TVAG_402060 [Trichomonas vaginalis G3]
Length = 359
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 143/350 (40%), Gaps = 62/350 (17%)
Query: 7 LKGLDAFTKPYE-DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF----VD 61
LK LD F K + +F T+ G ++ + + LI ++ +Y + +L +D
Sbjct: 5 LKELDIFDKFADAEFALHTIGGKFMSAIFSIIAVILIFAELFNYTKPIVYRDLLNIPQLD 64
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
+ + +P C +L DA+DS G + L V ++I +R+ +D + I
Sbjct: 65 KDNTVNFTFSIQVALP---CFFLHFDALDSIGVEMLDVSNDIKFKRMSVDNRFID----- 116
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ L+D C C+G + E +CCNTC+EVK + +
Sbjct: 117 ---------------YSNESLKD--ICLPCHGLKPEG-ECCNTCDEVKAIFEARGEDFNP 158
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV-HVHD--I 238
L QC + K +E C I G + + G FHIAPG + H HD +
Sbjct: 159 L-PFDQCMGNVN---FKKDMSESCLIEGTIHTFKSPGQFHIAPGRNTKFRRTGHQHDTGL 214
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS---MFNYYI-KIIPT 294
P S H I G Q D R P+ G + + + +++ +I K++ T
Sbjct: 215 SPEASCP----HTIHEFYVG---QKYDNVRSPIRGKIFRDRDSLPRIYLYDLFITKVLHT 267
Query: 295 IYERLD----------GSKL-GGGDGGMPGIFFSYELSPLMVKITEKSKS 333
+ L G+K+ G PGI+F Y SP+ I E+S S
Sbjct: 268 FNDALQYTSYEYSYNLGAKIFNPGSFYQPGIYFKYMFSPM--TIVERSIS 315
>gi|291244956|ref|XP_002742359.1| PREDICTED: endoplasmic reticulum-golgi intermediate compartment
(ERGIC) 1-like [Saccoglossus kowalevskii]
Length = 318
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 87/195 (44%), Gaps = 24/195 (12%)
Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
+T K+ GC+ Y ++N+V G+FH++ + S QP + +T H I
Sbjct: 130 NTNKIPLNNNAGCRFEAYFKINKVPGNFHVSTHAAGSR--------QPQKADFVHTIHEI 181
Query: 253 RHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS---------- 302
+ I+ + + PL G S +YY+K++PT+YE + G
Sbjct: 182 I-IGDDIQNKSINAAFNPLAGYDRSDAAAESSHDYYMKVVPTVYEDVWGRVNLSYQYTYA 240
Query: 303 -----KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
G G MP I+F Y++SP+ VK EK T I + GT+ ++D+
Sbjct: 241 YKDYVSYGHGHRVMPAIWFRYDISPITVKYHEKRAPFYTFITTICAIVGGTFTVAGIIDS 300
Query: 358 LLHSCVKKISKVEIG 372
+++S + K EIG
Sbjct: 301 MIYSASEVFKKAEIG 315
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 52/108 (48%), Gaps = 4/108 (3%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
FS R D + K +D + T+ G V+I LFI +L+ + + EL+VD+
Sbjct: 33 FSNRF---DVYRKIPKDLTQPTLAGAMVSICSALFIVFLLLSEFTSFIAPDVRSELYVDN 89
Query: 63 -SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
KL + L+I +P + C+++ LD D G + + N K L+
Sbjct: 90 PGHIEKLNVKLNISLPRLKCEFIGLDIQDDMGRHEVGLVDNTNKIPLN 137
>gi|154335780|ref|XP_001564126.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061160|emb|CAM38182.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 309
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/326 (21%), Positives = 132/326 (40%), Gaps = 66/326 (20%)
Query: 68 LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVK 127
+P+H D++ P +SC+ L++D VD++G + I+K + DG+
Sbjct: 1 MPVHFDVLFPYMSCNRLSIDVVDATGTAKFNCTGTIHKLPISGDGE-------------V 47
Query: 128 KKKVTTENGTTTTELEDPN---KCGSC-----YGAETETR-----KCCNTCNEVKEAYRY 174
+ K T ++ E++D KC C G + R KCC++C+ V E Y+
Sbjct: 48 QYKGTMKDLGNDIEMDDTGGDKKCRRCPSFAFEGVAADVRNAAASKCCDSCDSVFELYKD 107
Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSIN 231
+ P ++ QC + GC + G L++ +V + P G YS+
Sbjct: 108 LEKEFPGIEYFPQCLEQLYER------ARGCNVIGSLDLKKVPVTVIFGPRRTGRRYSLK 161
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
V +T+H I+ L G + + + +PL G + + S Y
Sbjct: 162 DV----------IRLDTSHVIKKLRIGDEAVERFSKHGVAEPLCGH-ERFSKTYSETRYL 210
Query: 289 IKIIPTIYE--RLDGSKLG---------------GGDGGMPGIFFSYELSPLMVKITEKS 331
+K++PT Y R +K G G +P + F++E + + V +
Sbjct: 211 VKVVPTTYRKTRTRDAKASTYEYSAQCSSQAIVVGFSGVVPAVLFAFEPAAIQVNNVFER 270
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDA 357
+ + H ++ + G ++ +D+
Sbjct: 271 QPVSHFLVQLCGIVGGLFVVLGFIDS 296
>gi|443921357|gb|ELU41041.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
solani AG-1 IA]
Length = 579
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 123/312 (39%), Gaps = 69/312 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGG--AVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
L+ +D+ KP E T+Y +VT++ I +++ DY ++ ++ VD SR
Sbjct: 164 LEQVDSVGKP---LRENTLYANRFSVTLISMGIILIFTIIEIIDYRRIGMASDIIVDVSR 220
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G ++ ++++I P + C L+LD D SG+ V H+I K RL+ G I E
Sbjct: 221 GEQISVNMNITFPRVPCYLLSLDITDVSGDIQQDVSHHILKTRLEPSGAMIHE------- 273
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
++ +E G + +E PE D
Sbjct: 274 NTLNYRIKSETGISHQGME---------------------------------LRRPEHDR 300
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
E K + F L +N+V+G+FH +PG S+ H +D+ PY
Sbjct: 301 AGMLLLELIPFKEPHPF---------LRINKVTGNFHFSPGRSFLSQRGHAYDLVPYLKD 351
Query: 245 A--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-------------MFNYYI 289
+ H+I F + +D R+ GT +A G+ M Y++
Sbjct: 352 GNHHDFGHYIHEFHFEGDREIEDRWREGNRGTEWRARVGSDKQPLDGLEQPSNWMIQYFL 411
Query: 290 KIIPTIYERLDG 301
K++ T LDG
Sbjct: 412 KVVSTEVRHLDG 423
>gi|158292441|ref|XP_001688474.1| AGAP005044-PB [Anopheles gambiae str. PEST]
gi|157016994|gb|EDO64057.1| AGAP005044-PB [Anopheles gambiae str. PEST]
Length = 287
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 87/184 (47%), Gaps = 26/184 (14%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
+ C+I+G L +N+V+G+FHI G + +S H+H++ I + + N +H I SFG
Sbjct: 85 DACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSI--FANTQTNFSHRINRFSFG-- 140
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP---------------TIYERLDGSKLG 305
PL+G + G M Y+I+++P T+ E L +
Sbjct: 141 -DHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHSKTYQYTVRENLQLIDID 199
Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH----S 361
G G+ GI+F Y++S L V + + S+ H ++ I+G + ++ +H +
Sbjct: 200 KGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIVVISGMLSKCMHLIGDA 259
Query: 362 CVKK 365
C K+
Sbjct: 260 CCKR 263
>gi|300121843|emb|CBK22417.2| unnamed protein product [Blastocystis hominis]
Length = 251
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 97/218 (44%), Gaps = 34/218 (15%)
Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPE--LDTIVQCKNEYSTEKLKNTFTEGCQIY 208
CYGA E +CCNTC+ + EAY + W+ P L C+N + +F GC I+
Sbjct: 35 CYGAGAEG-QCCNTCSAIVEAYNSRGWS-PHFVLQFSPLCRNSRPSVL---SFKSGCMIW 89
Query: 209 GYLEVNRVSGSFHIAPGLSY-SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDER 267
G ++V++V+G HI I V+D + + ++H I H SFG + +
Sbjct: 90 GAIDVHQVAGDIHIQTTTGMIDILGAPVYDAE--IISKLKSSHFIEHFSFGKHIPGVE-- 145
Query: 268 RKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL------------------GGGDG 309
PL+G A + S Y I+I+P IYER G ++ G
Sbjct: 146 -NPLNGRRFLANQLTS-HAYQIEILPAIYER-GGVEIRSNEISVYETDKVVTVEPSGTAD 202
Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
PG+FF Y +SP I E K L + +C + G
Sbjct: 203 VEPGLFFKYRISPFEHVIREDRKEFWSLVVR-LCGVMG 239
>gi|449272958|gb|EMC82607.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Columba livia]
Length = 297
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC+ G+ +N+V G+FH++ H QP + TH I LSFG KLQ
Sbjct: 120 DGCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 167
Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+ L+G + + +Y +KI+PT+YE + G + +
Sbjct: 168 VHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMGGKQRYSYQYTVANKEYVAYS 227
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 228 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASEA 287
Query: 366 ISKVEIG 372
K+++G
Sbjct: 288 WKKIQLG 294
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 50/98 (51%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
D + K +D + T G +++ C LFI +L ++ + EL+VD G
Sbjct: 15 FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDSGG 74
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ ++L+I +P + C+ + LD D G + H+++++
Sbjct: 75 KIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 112
>gi|149241719|ref|XP_001526345.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450468|gb|EDK44724.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 353
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 137/356 (38%), Gaps = 100/356 (28%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
S+R+K DAF K ++ GG T++ + F ++ V+V + + VD
Sbjct: 5 SKRVKTFDAFPKVDPQHQVRSERGGLSTLLTYFFGLLILWVEVGGFIGGYVDRQFEVDRV 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
S L I++D++V + C+++ + D + ++ L E L+ +G PQ +
Sbjct: 65 VRSDLSINVDMIV-AMPCEFIHTNVEDITRDRFLAGE------TLNFEGIHFFIPQNFKI 117
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
N +PN P+LD
Sbjct: 118 N-------------------NPNDFHET----------------------------PDLD 130
Query: 184 TIVQCKNEYSTEKLKNTFTEG----------CQIYGYLEVNRVSGSFHI-APGLSYSINH 232
++Q E L+ F +G C I+G + VN+V G F I G YS +
Sbjct: 131 EVMQ-------ESLRAEFRQGGQRINEGAPACHIFGSIPVNQVKGDFRITGKGFGYS-DR 182
Query: 233 VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
+HV AA N TH I+ S+G + PLD T EE + Y +++
Sbjct: 183 LHV------PLAALNFTHVIQEFSYG---EFFPFLNNPLDATGKVTEEKLQAYIYNAQVV 233
Query: 293 PTIYERL------------------DGSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
PT+YE+L ++ G+PGI+F YE P+ + I EK
Sbjct: 234 PTLYEKLGLEVDTNQYSLTENHHVIKLDEISNRPQGVPGIYFRYEFEPIKLTIREK 289
>gi|229366152|gb|ACQ58056.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Anoplopoma fimbria]
Length = 290
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 88/187 (47%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IK 260
+GC+ G +N+V G+FH++ H QP + + TH+I L+FG I+
Sbjct: 113 DGCRFEGEFTINKVPGNFHVS---------THSATAQPQSP---DMTHNIHKLAFGEKIQ 160
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+Q L G + + +Y +KI+PT+YE L G + +
Sbjct: 161 VQRVQGAFNALGGADRLSSNPLASHDYILKIVPTVYEDLSGKQRFSYQYTVANKEYVAYS 220
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + + T I + GT+ ++D+ + + +
Sbjct: 221 HAGRIIPAIWFRYDLSPITVKYTERRQPVYRFITTICAIVGGTFTVAGIIDSCIFTASEA 280
Query: 366 ISKVEIG 372
K++IG
Sbjct: 281 WKKIQIG 287
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 52/101 (51%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
++ D + K +D + T G ++I+C +FI +L ++ + EL+VD
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATELVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + CD + LD D G + H+++++
Sbjct: 65 SGGKIEVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIDNSM 105
>gi|357627966|gb|EHJ77470.1| putative PTX1 protein isoform 1 [Danaus plexippus]
Length = 353
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 41/176 (23%), Positives = 83/176 (47%), Gaps = 18/176 (10%)
Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
N + C+++G L +N+V+G+FHI G S + H+H + N +H I LSFG
Sbjct: 138 NRRPDACRLHGVLTLNKVAGNFHITAGKSLHLPRGHIHLNMLFDDTPQNFSHRINRLSFG 197
Query: 259 IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSK 303
+ PL+G + + ++ Y+++++PT + E
Sbjct: 198 ---SPANGIIYPLEGDEKITSDESMLYQYFLEVVPTDVDTTFESIKTFQYSVKELARPIS 254
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
G G+PG+FF Y+++ L V++ ++ ++L ++ I G Y+ ++ ++
Sbjct: 255 HSKGSHGVPGVFFKYDMAALKVQVYQERENLLQFMLRLFSIIGGIYVIISFINTIV 310
>gi|47222972|emb|CAF99128.1| unnamed protein product [Tetraodon nigroviridis]
Length = 288
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FHI+ H QP + TH I L+FG KLQ
Sbjct: 112 GCRFEGEFNINKVPGNFHIS---------THSASAQPQNP---DMTHFIHKLAFGDKLQM 159
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ L G A + +Y +KI+PT+YE L G + +
Sbjct: 160 HQVKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVANKEYVAYSH 219
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + T I + GT+ ++D+ + + +
Sbjct: 220 TGRIVPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIIDSCIFTASEAW 279
Query: 367 SKVEIG 372
K++IG
Sbjct: 280 KKIQIG 285
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
L D + K +D + T G ++I+C +FI +L ++ + EL+VD
Sbjct: 3 LHRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 62
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + CD + LD D G + H+E+++
Sbjct: 63 SGGKIEVSLNITLPNLHCDLVGLDIQDEMGRHEVGHIENSM 103
>gi|296415728|ref|XP_002837538.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633410|emb|CAZ81729.1| unnamed protein product [Tuber melanosporum]
Length = 341
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 81/163 (49%), Gaps = 29/163 (17%)
Query: 205 CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---IK 260
C+IYG + VNR+ G FHI A G Y + H+ +FN +H I LSFG K
Sbjct: 155 CRIYGSMGVNRILGDFHITAKGHGYWEDGAHI------DHRSFNFSHVITELSFGDYYPK 208
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE-RLDGSKLGGGD----------- 308
L + PLDG V+K +E F Y++ I+PT YE + G L
Sbjct: 209 LVN------PLDGVVSKTDENFHKFQYFLSIVPTTYESQTSGKSLLTNQYAVTEQSRKIS 262
Query: 309 -GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+PGI+F Y++ P+ +KI+++ +L +++ +SG +
Sbjct: 263 SHSVPGIYFKYDIEPISLKISDRRTALLAFVVRLVNIVSGILV 305
>gi|19112857|ref|NP_596065.1| COPII-coated vesicle component Erv41 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74582843|sp|O94283.1|ERV41_SCHPO RecName: Full=ER-derived vesicles protein 41
gi|3850069|emb|CAA21880.1| COPII-coated vesicle component Erv41 (predicted)
[Schizosaccharomyces pombe]
Length = 333
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 77/164 (46%), Gaps = 29/164 (17%)
Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
C+IYG L VNRV+G HI APG Y +++ H + N TH+I LSFG +
Sbjct: 151 ACRIYGQLVVNRVNGQLHITAPGWGYGRSNIPFHSL--------NFTHYIEELSFG---E 199
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS---------------KLGGG 307
LDG A + F YY+ ++PT Y+ S +LG G
Sbjct: 200 YYPALVNALDGHYGHANDHPFAFQYYLSVLPTSYKSSFRSFETNQYSLTENSVVRQLGFG 259
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
PGIF Y+L PL V++ +K ++ +I+ ISG IT
Sbjct: 260 SLP-PGIFIDYDLEPLAVRVVDKHPNVASTLLRILA-ISGGLIT 301
>gi|289741661|gb|ADD19578.1| cOPII vesicle protein [Glossina morsitans morsitans]
Length = 418
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/387 (22%), Positives = 159/387 (41%), Gaps = 68/387 (17%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E K LDAF K E + E T GG ++++ L I YLI +V Y+Q + F
Sbjct: 16 ELAKNLDAFKKVPEKYTEATEIGGTLSLISRLLIIYLIYREV-KYYQDAGLVYQFEPDID 74
Query: 65 GSKLPIHLDIVVPTISCDYLA-LDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
K+ +H+DI V + C+ L+ +D +D E Q++V
Sbjct: 75 KEKVQMHVDITV-AMPCNSLSGVDLMD--------------------------ETQQDVF 107
Query: 124 --NAVKKKKV----TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
A++++ V T T ++ N R+ ++ ++ Y +
Sbjct: 108 AYGALRRQGVWWHLTPHERTEFERVQHENHF---------LREEYHSVADLLFKYIIQS- 157
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
PE+D + E + L + C+++G L +N+V+G H+ G ++ + H
Sbjct: 158 --PEVD---ETATEEDEKPLSEEQYDACRLHGTLGINKVAGVLHLVGGTQPVVDLLGEHL 212
Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--- 294
+ + A N TH I LSFG Q +PL+G E ++ Y++ I+PT
Sbjct: 213 MIGFRHIAANFTHRINRLSFG---QYARRIVQPLEGDETFVSEEGTIVQYFLNIVPTEIH 269
Query: 295 ---------IYERLDGSKLGGGDG---GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
Y + ++ D G PGI+F Y+ S L + + ++ ++
Sbjct: 270 KTFTTISTYQYSVTENVRVLDSDRNSYGSPGIYFKYDWSALKIIVRTDRDNMLQFIIRLC 329
Query: 343 CNISGTYITFMLVDALLHSCVKKISKV 369
ISG + +++ L + + I K+
Sbjct: 330 SIISGIVVLSGILNVFLLTLRRNIIKI 356
>gi|326479518|gb|EGE03528.1| COPII-coated vesicle protein [Trichophyton equinum CBS 127.97]
Length = 399
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/386 (22%), Positives = 152/386 (39%), Gaps = 77/386 (19%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+ +LK DAF K + + GG TI + + L C ++ +++ V+
Sbjct: 20 IATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVER 79
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
++ +++D VV + CD + ++ D++G+ H+ L G + QEP
Sbjct: 80 GVSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW 125
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE-TETRKCCNTCNEVKEAYRYKKWALP 180
A +++ + E + NK S E E + EV+ + + K P
Sbjct: 126 ---AAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQAEDLHVEHVLGEVRRSRKKKFPKAP 182
Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQ 239
+L K+ + C+++G LE N+V G+ HI A G Y
Sbjct: 183 KLK--------------KSDAVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRATN 225
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
P+ + N TH I LSFG PLD TV+ + Y++ ++PTIY +
Sbjct: 226 PH---SLNFTHLITELSFGPHY---GRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTKS 279
Query: 300 -----------DGSKLGGGDG-----------------------GMPGIFFSYELSPLMV 325
D S + D PGIFF Y + P+++
Sbjct: 280 GHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPIQPRIDSTPGIFFKYNIEPILL 339
Query: 326 KITEKSKSLGHLWTKIMCNISGTYIT 351
++++ SL L +++ +SG +T
Sbjct: 340 IVSQERDSLLALMVRLVNVVSGVLVT 365
>gi|308198100|ref|XP_001386838.2| predicted protein [Scheffersomyces stipitis CBS 6054]
gi|149388859|gb|EAZ62815.2| putative ER to golgi transport [Scheffersomyces stipitis CBS 6054]
Length = 352
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/348 (23%), Positives = 140/348 (40%), Gaps = 84/348 (24%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F+++++ DAF K ++ GG T++ ++ V++ + + VD+
Sbjct: 4 FAKKVRTFDAFPKVDSQHTVRSQRGGFSTLMTAFCGLLIVWVEIGGFLGGYVDHQFIVDN 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I++D++V + C++L + D + +++L E L+ G P
Sbjct: 64 EIKSSLVINVDMLV-AMPCEFLHTNVEDITKDRYLAGE------TLNFQGTNFITPPTFN 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+N + K T P+L
Sbjct: 117 INNINDKHDT-----------------------------------------------PDL 129
Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
D I+Q + E+S + N C I+G + V+ V G FHI A GL YS + HV
Sbjct: 130 DEIMQDSLRAEFSVSGARINEGAPACHIFGSIPVSHVKGDFHITAKGLGYS-DRSHV--- 185
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
A N +H I+ SFG + PLD + EE ++Y+ K++PT+Y+R
Sbjct: 186 ---PLEALNFSHVIQEFSFGDFYPFIN---NPLDASGKLTEEPLISYSYFAKVVPTLYQR 239
Query: 299 L----DGSKLGGGDG------------GMPGIFFSYELSPLMVKITEK 330
L D ++ + G+PGIFF Y+ P+ + I E+
Sbjct: 240 LGLVVDTNQYSLTENNHVFKLEHKRPTGIPGIFFKYDFEPIKLIIIER 287
>gi|71480113|ref|NP_001025133.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Danio rerio]
gi|78099248|sp|Q4V8Y6.1|ERGI1_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|66911928|gb|AAH97146.1| Zgc:114085 [Danio rerio]
Length = 290
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FH++ H QP + + TH I L+FG KLQ
Sbjct: 114 GCRFEGEFSINKVPGNFHVS---------THSATAQPQSP---DMTHIIHKLAFGAKLQV 161
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ L G + +Y +KI+PT+YE L G + +
Sbjct: 162 QHVQGAFNALGGADRLQSNALASHDYILKIVPTVYEELGGKQRFSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRRPFYRFITTICAIIGGTFTVAGIIDSCIFTASEAW 281
Query: 367 SKVEIG 372
K++IG
Sbjct: 282 KKIQIG 287
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 85/188 (45%), Gaps = 9/188 (4%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
++ D + K +D + T G ++I C +F+ +L ++ + EL+VD
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISICCCVFMLFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNIYKRRLDLDGKPIQEPQKEV 122
G K+ + L+I +P + CD + LD D G + H+E+++ K L+ +G + +
Sbjct: 65 SGGKIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIENSM-KVPLN-NGHGCRFEGEFS 122
Query: 123 VNAVKKK-KVTTENGTTTTELEDPNKC--GSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+N V V+T + T + D +GA+ + + N + A R + AL
Sbjct: 123 INKVPGNFHVSTHSATAQPQSPDMTHIIHKLAFGAKLQVQHVQGAFNALGGADRLQSNAL 182
Query: 180 PELDTIVQ 187
D I++
Sbjct: 183 ASHDYILK 190
>gi|148223633|ref|NP_001084786.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Xenopus laevis]
gi|78099249|sp|Q6NS19.1|ERGI1_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|47125098|gb|AAH70532.1| MGC78834 protein [Xenopus laevis]
Length = 290
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 86/186 (46%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FH++ H QP A + H I LSFG LQ
Sbjct: 114 GCRFEGLFSINKVPGNFHVS---------THSAIAQP---ANPDMRHIIHKLSFGNTLQV 161
Query: 264 DD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
D+ L G A + +Y +KI+PT+YE L+G + +
Sbjct: 162 DNIHGAFNALGGADKLASKALESHDYVLKIVPTVYEDLNGKQQFSYQYTVANKAYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + + T + I GT+ ++D+ + + +
Sbjct: 222 TGRVVPAIWFRYDLSPITVKYTERRQPMYRFITTVCAIIGGTFTVAGILDSFIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI++L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFITFLFLSELTGFIANEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L++ +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVTLNVTLPNLPCEVVGLDIQDEMGRHEVGHIDNSM 105
>gi|410918691|ref|XP_003972818.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Takifugu rubripes]
Length = 378
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 76/171 (44%), Gaps = 18/171 (10%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
C+IYG++ VN+V+G+ HI G H H + +N +H I HLSFG ++
Sbjct: 168 ACRIYGHIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHETYNFSHRIDHLSFGEEITG 227
Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGGD 308
PLDGT + M+ Y+I ++PT + ER G
Sbjct: 228 II---NPLDGTEKITSKHTQMYQYFITVVPTRLVTHKVSADTHQFSVTERERVINHAAGS 284
Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
G+ GIF Y+ S L V +TE+ L ++ + G + T ++ L+
Sbjct: 285 HGVSGIFVKYDTSSLTVTVTEQHMPLWQFLVRLCGIVGGIFSTTGMLHGLV 335
>gi|225685292|gb|EEH23576.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 386
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 146/375 (38%), Gaps = 75/375 (20%)
Query: 12 AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIH 71
A TKP + T GG T+V ++ + L ++ +++ V+ +L ++
Sbjct: 16 AKTKP--TYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVEKGISRELQLN 73
Query: 72 LDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKV 131
LDIVV ++CD L ++ D++G++ L D+ K EP +
Sbjct: 74 LDIVV-AMTCDALRINVQDAAGDRILAS---------DMLNK---EPTSWAAWNRELNVA 120
Query: 132 TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNE 191
+ G L + + G E + + + + EA R K P+
Sbjct: 121 LSGGGREYQTLAEEDA-----GRLMEQEEDMHVGHALGEARRSHKRKFPK---------- 165
Query: 192 YSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTH 250
KLK + C+IYG LE N+V G FHI + H + + AFN +H
Sbjct: 166 --GPKLKRGEMPDSCRIYGSLEGNKVQGDFHIT-----ARGHGYFEFGEHLDHHAFNFSH 218
Query: 251 HIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----------- 299
I LSFG PLD T++ + YY+ I+PTIY R
Sbjct: 219 MITELSFGPHYST---LLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTIDPYSQVLP 275
Query: 300 DGSKLGGGDGG-----------------------MPGIFFSYELSPLMVKITEKSKSLGH 336
D S + +PGIFF Y + P+++ I+E+ SL
Sbjct: 276 DPSTISPSQRKNTIFTNQYAVTSRSHELPDVQFHVPGIFFKYNIEPILLIISEERGSLLA 335
Query: 337 LWTKIMCNISGTYIT 351
L +++ +SG +
Sbjct: 336 LLVRLVNVMSGVVVA 350
>gi|315054535|ref|XP_003176642.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
gi|311338488|gb|EFQ97690.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
Length = 399
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/384 (21%), Positives = 155/384 (40%), Gaps = 75/384 (19%)
Query: 4 SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
+ +LK DAF K + + GG TI + + L C ++ +++ V+
Sbjct: 21 ATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAILCTLLTCSELITWYRGHENHHFSVERG 80
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEV 122
++ +++D VV + CD + ++ D++G+ H+ L G + QEP
Sbjct: 81 VSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW- 125
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
A +++ + E + NK + E E + + + + E R +K P+
Sbjct: 126 --AAWNREMNKRRSGGSPEYQTLNKEDTLRLEEQE--EDLHVEHVLGEVRRSRKKKFPK- 180
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
+ + K+ + C+++G LE N+V G+ HI A G Y P+
Sbjct: 181 ----------APKMKKSDVVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRATNPH 227
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER--- 298
+ N TH I LSFG PLD TV+ + Y++ ++PTIY +
Sbjct: 228 ---SLNFTHLITELSFGPHY---GRLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTKSGH 281
Query: 299 LDGSKLGGGDG-------------------------------GMPGIFFSYELSPLMVKI 327
+D S+ D PGIFF Y + P+++ +
Sbjct: 282 MDPSRRSLPDSSTITAKDSKTTVSTNQYAVTSYSQPIQPRIDSTPGIFFKYNIEPILLIV 341
Query: 328 TEKSKSLGHLWTKIMCNISGTYIT 351
+++ SL L +++ +SG +T
Sbjct: 342 SQERDSLLGLMIRLVNVVSGVLVT 365
>gi|301626814|ref|XP_002942582.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like, partial [Xenopus (Silurana) tropicalis]
Length = 298
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 87/186 (46%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G+ +N+V G+FH++ H QP A + H I LSFG LQ
Sbjct: 122 GCRFEGFFSINKVPGNFHVS---------THSAMAQP---ANPDMRHIIHKLSFGNTLQV 169
Query: 264 DD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
++ L G A + +Y +KI+PT+YE ++G + +
Sbjct: 170 ENIHGAFNALGGADKLASQALESHDYVLKIVPTVYEDMNGEQQFSYQYTVANKAYVAYSH 229
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + + T + I GT+ ++D+ + + +
Sbjct: 230 TGRVVPAIWFRYDLSPITVKYTERRQPIYRFITTVCAIIGGTFTVAGILDSFIFTASEAW 289
Query: 367 SKVEIG 372
K+++G
Sbjct: 290 KKIQLG 295
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 52/98 (53%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
D + K +D + T G ++I C LFI++L ++ + EL+VD + G
Sbjct: 16 FDIYRKVPKDLTQPTYTGAIISICCCLFITFLFLSELTGFIANEIVNELYVDDPDKNSGG 75
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L++ +P ++C+ + LD D G + H+++++
Sbjct: 76 KIEVTLNVSLPNLACEVVGLDIQDEMGRHEVGHIDNSM 113
>gi|432943284|ref|XP_004083140.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oryzias latipes]
Length = 372
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 79/175 (45%), Gaps = 18/175 (10%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+I+G + VN+V+G+ HI G H H + ++N +H I L FG +
Sbjct: 159 DACRIHGDIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHESYNFSHRIDRLCFG---E 215
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
+ PLDGT + M+ Y+I ++PT + ER G
Sbjct: 216 EIPGIINPLDGTEKITYDNNQMYQYFITVVPTKLKTYKITADTHQFSVTERERVINHTAG 275
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
G+ GIFF Y+ S LMV ++E+ L ++ I G Y T ++ +L+ C
Sbjct: 276 SHGVSGIFFKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIYSTTGMLHSLIGFC 330
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K + + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVSDSYVETSTSGGTVSLIAFSTMALLSVLEFFVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
KL I++D+ V + C ++ D +D
Sbjct: 73 KLRINVDVTV-AMRCQHVGADILD 95
>gi|195130281|ref|XP_002009580.1| GI15435 [Drosophila mojavensis]
gi|193908030|gb|EDW06897.1| GI15435 [Drosophila mojavensis]
Length = 433
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/393 (23%), Positives = 163/393 (41%), Gaps = 63/393 (16%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV---D 61
E K LDAF K E + E T GG ++++ L I YL+ ++ Y+ + TE ++ D
Sbjct: 16 EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYW--NETEIIYQFEPD 73
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-------EHNIYKRRLDLDGKP 114
S ++ +H+DI T++ +L VD E L V ++ + D D +
Sbjct: 74 ISLDEQVQMHVDI---TVAMPCASLSGVDLMDETQLDVFAYGTLQREGVWWQMSDADRRH 130
Query: 115 IQEPQKEVVNAVKKKKVTTENGTTTTEL---EDPNKCGSCYGAETETRKCCNTCNEVKEA 171
Q Q + N +++ + ++ P K E++T+
Sbjct: 131 FQSMQ--MTNHYLREEYHSVADILFKDILRERSPPK-------ESDTQSDAAAPPPPG-- 179
Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
AL +L I Q +++Y + C+++G L +N+V+G H+ G +
Sbjct: 180 ------ALQQLQQISQMESKY----------DACRLHGTLGINKVAGVLHLVGGAQPVVG 223
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
H + + N TH I LSFG Q +PL+G E A+ Y+IK+
Sbjct: 224 MFEDHWMIEFRRMPANFTHRINRLSFG---QYSRRIVQPLEGDETIIREEATTVQYFIKV 280
Query: 292 IPT---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
+PT + E + G PGI+F Y+ S L + ++ +L
Sbjct: 281 VPTEIRHTFSTISTFQYAVTENVRKLDAERNSYGSPGIYFKYDWSALKIVVSHDRDNLVT 340
Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
++ ISG + V+ALL + +++ ++
Sbjct: 341 FVIRLCSIISGIIVISGAVNALLVAIQRRLLRM 373
>gi|354545468|emb|CCE42196.1| hypothetical protein CPAR2_807450 [Candida parapsilosis]
Length = 351
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/383 (23%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
FS+R+K DAF K ++ GG T++ + ++ V+V Y + VD
Sbjct: 4 FSKRVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFLGLLILWVEVGGYIGGYVDRQFLVDD 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I+LD++V + C+YL + D + ++ L E L+ +G P
Sbjct: 64 VLRSDLTINLDMIV-AMPCEYLHTNVEDITRDRFLAGE------TLNFEGVKFFIP---- 112
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
PN N N+ E P+L
Sbjct: 113 ----------------------PNFS-------------INNPNDFHET--------PDL 129
Query: 183 DTIVQCKNEYSTEKLKNTFTEG---CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
D ++Q +L EG C I+G + VN+V G F I A G Y D
Sbjct: 130 DEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGDFRITAKGFGY-------RDR 182
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
A N +H I+ S+G PLD T EE + Y+ K++PT+YE+
Sbjct: 183 SFVPLEALNFSHVIQEFSYG---DFYPFLNNPLDATGKVTEENLQTYLYHAKVVPTLYEK 239
Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
L D ++ + + GI+F+YE P+ + I EK K
Sbjct: 240 LGLEVDTTQYSLTENHHVVKVDPHSKRPQEISGIYFAYEFEPIKLIIREKRIPFLQFIAK 299
Query: 341 IMCNISGTYIT----FMLVDALL 359
+ G + F L + LL
Sbjct: 300 LGTIAGGVVVAAGYLFKLYEKLL 322
>gi|312081872|ref|XP_003143209.1| HT034 [Loa loa]
gi|307761627|gb|EFO20861.1| hypothetical protein LOAG_07628 [Loa loa]
Length = 292
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 84/188 (44%), Gaps = 29/188 (15%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--I 259
T GC+ G E+++V G+FH++ H D QP T ++ H I + FG I
Sbjct: 114 TSGCRFEGKFEISKVPGNFHLS---------THAADTQPET---YDMRHTIHSVVFGDNI 161
Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
+ PL A +G+ +Y +KI+P++YE ++G+
Sbjct: 162 ITSQNLGSFNPLKNREALQTDGSFTHDYVLKIVPSVYEDINGNTKYSYQYTYAHKEYVTY 221
Query: 307 --GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
MP ++F YEL P+ +K TE+ + T I + GT+ ++DA L S +
Sbjct: 222 HYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLFSLTE 281
Query: 365 KISKVEIG 372
K +IG
Sbjct: 282 LYRKHQIG 289
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 1/104 (0%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRGSKL 68
LD + K D + T G +++VC FI +++ D+ + + ELFVD R ++
Sbjct: 13 LDIYRKVPRDLTQPTTTGAVISVVCISFILFMVINDLLSFLTLEIRSELFVDDPGREGRI 72
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
+ L+I +P +SC Y+ +D D +G + N K + G
Sbjct: 73 EVQLNISLPYLSCYYIGIDIQDDNGRHEVGFVQNTEKIPIGTSG 116
>gi|170587366|ref|XP_001898447.1| HT034 [Brugia malayi]
gi|158594071|gb|EDP32661.1| HT034, putative [Brugia malayi]
Length = 286
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 84/188 (44%), Gaps = 29/188 (15%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--I 259
T GC+ G ++++V G+FHI+ H D QP T ++ H I + FG +
Sbjct: 108 TSGCRFEGKFDISKVPGNFHIS---------THAADTQPET---YDMRHTIHSVVFGDDV 155
Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
+ PL A +G+ +Y +KI+P++YE + G+K
Sbjct: 156 STSQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTY 215
Query: 307 --GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
MP ++F YEL P+ +K TE+ + T I + GT+ ++DA L S +
Sbjct: 216 HYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLFSLTE 275
Query: 365 KISKVEIG 372
K ++G
Sbjct: 276 LYRKHQMG 283
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 1/100 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
+K D + K D + T G +++VC FI +++ D+ ++ + ELFVD R
Sbjct: 4 IKRFDIYRKVPRDLTQPTTTGAIISVVCISFILFMVINDLLNFLTLEVRSELFVDDPGRE 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
++ + L+I +P +SC Y+ +D D +G + N K
Sbjct: 64 GRIEVQLNISLPYLSCYYIGIDIQDDNGRHEVGFVRNTEK 103
>gi|449489976|ref|XP_004158474.1| PREDICTED: protein disulfide-isomerase 5-3-like [Cucumis sativus]
Length = 224
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 36/212 (16%)
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ NE K GC+I GY+ V +V GS IA + S +H + ++
Sbjct: 20 KSNNETGNVKRPAPSAGGCRIEGYVRVKKVPGSLVIA---ARSESH-------SFDASQM 69
Query: 247 NTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
N +H I HLSFG K+ D ++ P G G S N +Y++I
Sbjct: 70 NMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEHYLQI 129
Query: 292 IPT-IYERLDGSKLG----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
+ T + R G L +P + F + LSP+ V ITE KS H T
Sbjct: 130 VKTEVLTRRSGKLLEEYEYTAHSSVSQSLYIPVVKFHFVLSPMQVVITENQKSFSHFITN 189
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ I G + ++DALLH+ ++ + KVE+G
Sbjct: 190 VCAIIGGVFTVAGILDALLHNTIRLMKKVELG 221
>gi|348516790|ref|XP_003445920.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Oreochromis niloticus]
Length = 290
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC+ G +N+V G+FH++ H QP + TH I L+FG KLQ
Sbjct: 113 DGCRFEGEFTINKVPGNFHVS---------THSATAQPQNP---DMTHTIHKLAFGEKLQ 160
Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+ L G + + +Y +KI+PT+YE L G + +
Sbjct: 161 VQKVQGAFNALGGADKMSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVANKEYVAYS 220
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I G + ++D+ + + +
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGAFTVAGIIDSCIFTASEA 280
Query: 366 ISKVEIG 372
K++IG
Sbjct: 281 WKKIQIG 287
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 52/101 (51%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
++ D + K +D + T G ++I+C +FI +L ++ + EL+VD
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + CD + LD D G + H+E+++
Sbjct: 65 SGGKIEVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIENSM 105
>gi|224117462|ref|XP_002317580.1| predicted protein [Populus trichocarpa]
gi|222860645|gb|EEE98192.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/227 (28%), Positives = 98/227 (43%), Gaps = 43/227 (18%)
Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTE--GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
A+ ++ K E +T+ +K GC+I GY+ V +V G+ I+ +++ H
Sbjct: 266 AMESQRQALEHKPENATQHVKRPAPSAGGCRIEGYVRVKKVPGNLMIS-----ALSGAHS 320
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKL----QDDDERRKPLDGTVAKAEEGASMFNY---- 287
D S N +H I H SFG+K+ D +R P G G S N+
Sbjct: 321 FD-----SKQMNLSHVISHFSFGMKVLPRVMSDVKRLLPYIGRSHDKLNGRSFINHRDVG 375
Query: 288 -------YIKIIPTI---------------YERLDGSKLGGGDGGMPGIFFSYELSPLMV 325
Y++++ T YE S L MP F +ELSP+ V
Sbjct: 376 ANVTIEHYLQVVKTEVVTRRSSSERKLIEEYEYTAHSSLSQ-TVYMPTAKFHFELSPMQV 434
Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
ITE SKS H T + I G + ++D++LH V+ + KVE+G
Sbjct: 435 LITENSKSFSHFITNVCAIIGGVFTVAGILDSILHHTVRMMKKVELG 481
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 64/115 (55%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MV + +LK +D + K D E ++ G ++IV L + +L +++ +Y V+T+ + V
Sbjct: 1 MVSTNKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMMFLFGMELNNYLTVNTSTTVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I +I P++SC++ ++D D G L++ I K +D D KP
Sbjct: 61 DNSSDGEFLRIDFNISFPSLSCEFASVDVSDVLGTNRLNITKTIRKFSIDHDLKP 115
>gi|449468488|ref|XP_004151953.1| PREDICTED: protein disulfide-isomerase 5-4-like [Cucumis sativus]
Length = 481
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 36/212 (16%)
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ NE K GC+I GY+ V +V GS IA + S +H + ++
Sbjct: 277 KSNNETGNVKRPAPSAGGCRIEGYVRVKKVPGSLVIA---ARSESH-------SFDASQM 326
Query: 247 NTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
N +H I HLSFG K+ D ++ P G G S N +Y++I
Sbjct: 327 NMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEHYLQI 386
Query: 292 IPT-IYERLDGSKLG----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
+ T + R G L +P + F + LSP+ V ITE KS H T
Sbjct: 387 VKTEVLTRRSGKLLEEYEYTAHSSVSQSLYIPVVKFHFVLSPMQVVITENQKSFSHFITN 446
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ I G + ++DALLH+ ++ + KVE+G
Sbjct: 447 VCAIIGGVFTVAGILDALLHNTIRLMKKVELG 478
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 60/110 (54%), Gaps = 1/110 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E T+ G ++IV L + +L +++ +Y VST+ + V
Sbjct: 1 MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
D S+ G L + +I P +SC++ A+D D G L++ I K +D
Sbjct: 61 DNSTDGDFLRMDFNISFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSID 110
>gi|440632946|gb|ELR02865.1| hypothetical protein GMDG_05797 [Geomyces destructans 20631-21]
Length = 384
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 86/376 (22%), Positives = 151/376 (40%), Gaps = 67/376 (17%)
Query: 3 FSER---LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
F+E+ + DAF K ++ KT GG T++ + + L ++ +++ +
Sbjct: 14 FAEKGSIVSAFDAFPKSKPEYVTKTSGGGKWTVLMLIISALLTMSELGRWWRGNEDHTFE 73
Query: 60 VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
V+ L ++LD+VV + C + ++ D+SG++ L + K L + +
Sbjct: 74 VEKFVSRDLQVNLDMVV-AMRCPDIHINVQDASGDRIL--ASKVLKTELTNWLQWVNMKG 130
Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
+ + V T+ G + G G E E + + + A R KWA
Sbjct: 131 QHQLGHNADGSVITDEGWESD--------GHDEGFEEE-----HVHDIIYTAMRSNKWA- 176
Query: 180 PELDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
T K+K +G C+I+G + +N+V G FHI A G Y
Sbjct: 177 -------------KTPKIKGHPRDGDSCRIFGSMMLNKVQGDFHITARGHGYQ----EAF 219
Query: 237 DIQPYTSAAFNTTHHIRHLSFGI---KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
+ ++FN +H + SFG KL + PLD T+ Y++ ++P
Sbjct: 220 GTKHLDHSSFNFSHIVSEFSFGAFYPKLIN------PLDQTITTTANQFYKSQYFMSVVP 273
Query: 294 TIYERLDGSKLGG------------------GDGGMPGIFFSYELSPLMVKITEKSKSLG 335
TIY + L + +PGIFF Y++ PLM+ I E+ S
Sbjct: 274 TIYTVSSPNPLSSKSTIFTNQYAVTHEDRKINERTVPGIFFKYDIEPLMLTIEERRDSFL 333
Query: 336 HLWTKIMCNISGTYIT 351
K++ +SG +
Sbjct: 334 RFAIKVVNILSGVLVA 349
>gi|302659461|ref|XP_003021421.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
gi|291185318|gb|EFE40803.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
Length = 427
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 88/407 (21%), Positives = 158/407 (38%), Gaps = 91/407 (22%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+ +LK DAF K + + GG TI + + L C ++ +++ V+
Sbjct: 20 IATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVER 79
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
++ +++D VV + CD + ++ D++G+ H+ L G + QEP
Sbjct: 80 GVSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW 125
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
A +++ + E + NK S E E + + + + E R +K P+
Sbjct: 126 ---AAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQE--EDLHVEHVLGEVRRSRKKKFPK 180
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY-----SINHVHV 235
S + K+ + C+++G LE N+V G+ HI A G Y + N +
Sbjct: 181 -----------SPKLKKSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSM 229
Query: 236 HDIQPYTS-----------------AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
+QP + N TH I LSFG PLD TV+
Sbjct: 230 SLLQPIITCIHGDAKNLTDQLTKLFPGLNFTHLITELSFGPHY---GRLLNPLDKTVSST 286
Query: 279 EEGASMFNYYIKIIPTIYER-----------LDGSKLGGGDG------------------ 309
+ Y++ ++PTIY + D S + D
Sbjct: 287 SINFYKYQYHLSVVPTIYTKSGHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPI 346
Query: 310 -----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
PGIFF Y + P+++ ++++ SL L +++ +SG +T
Sbjct: 347 QPRIDATPGIFFKYNIEPILLIVSQERDSLLALMVRLVNVVSGVLVT 393
>gi|67482091|ref|XP_656395.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473591|gb|EAL51010.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449705171|gb|EMD45274.1| Hypothetical protein EHI5A_018710 [Entamoeba histolytica KU27]
Length = 315
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 86/203 (42%), Gaps = 39/203 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPG-LSYSINHV-------------HVHDIQPYTSAAFNTT 249
GC++YG ++V+RVSG FH+A G +S+ + H+H +FN T
Sbjct: 116 GCRMYGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175
Query: 250 HHIRHLSFGIKLQDD-DERRKPLDG---TVAKAEEGASMFNYYIKIIPTIY--------- 296
H+I HLSF L PL+G T++ + YYI +IPT++
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKKFTLSGFDNARK--TYYINVIPTLFKYPSYTLRT 233
Query: 297 ------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
ER D G PG+FF YELSP +V S H + I G I
Sbjct: 234 YQLSVNER-DVPVTYGASFTQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLI 292
Query: 351 TFMLVDALL---HSCVKKISKVE 370
L+ L H V + ++E
Sbjct: 293 IMGLLSRLFDSKHELVTSVVEME 315
>gi|301093181|ref|XP_002997439.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262110695|gb|EEY68747.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 278
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 79/179 (44%), Gaps = 26/179 (14%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC++YG ++V +V+G A S ++ + FN++H + HL FG ++ D
Sbjct: 109 GCRLYGTVQVQKVAGDLSFAHEGSLTV-------FSFFDFLNFNSSHVVNHLRFGPQIPD 161
Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG----------------SKLGGG 307
PL + + + Y++ ++P+ Y L+G S+ G
Sbjct: 162 ---METPLIDVSKILTKNLATYKYFVSVVPSRYVYLNGRSVTTFQYSVTEHETSSRGPNG 218
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
PG+ FSYE SP+ V+ E S+ H T + G + ++D ++S KK+
Sbjct: 219 QVSFPGVIFSYEFSPIAVEYIESKLSVLHFLTSTSAIVGGVFAVARMIDGAIYSVSKKV 277
Score = 45.1 bits (105), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 54/98 (55%), Gaps = 3/98 (3%)
Query: 8 KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
+ D K E E+T+ GG VT++ + +++L+ ++ ++ VS T + VD+
Sbjct: 4 RRFDLNAKGVEGIQERTIGGGVVTLMSCVAVAFLLLSELSVWWTVSVTHRMHVDTDP-QD 62
Query: 68 LPIHLDIVVPTI--SCDYLALDAVDSSGEQHLHVEHNI 103
PI++++ V + +C +A+D DS G + + ++ +I
Sbjct: 63 FPINIEVDVSFLHEACKEVAMDVSDSKGHKEIMLQKDI 100
>gi|238880883|gb|EEQ44521.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 345
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 153/398 (38%), Gaps = 90/398 (22%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
FS+++K DAF K ++ GG T++ + ++ +++ Y + VD
Sbjct: 4 FSQKVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFCGLLILWIEIGGYIGGYVDRQFTVDD 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I++D++V + C ++ + D + + +L E L+ +G P
Sbjct: 64 QIRSALTINVDMIV-AMPCQFIHTNVEDITHDTYLAGE------TLNFEGIHFFVPDSFK 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+N +PN P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129
Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
D ++Q + E+ +E + N C I+G + VN+V G F I G Y + HV
Sbjct: 130 DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRITGKGFGYR-DRSHV--- 185
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
P+ S N +H I+ SFG + PLD T EE + YY K++PT+YE+
Sbjct: 186 -PFES--LNFSHVIQEFSFG---EFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLYEQ 239
Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
L D ++ + G+PGI+F Y+ P+ + I EK K
Sbjct: 240 LGLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAK 299
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
+ I G ++ L +K+ + G K V +
Sbjct: 300 -LATIGG---GLLIAAGYLFRLYEKLLFIFYGQKAVQQ 333
>gi|448105220|ref|XP_004200441.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|448108351|ref|XP_004201072.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|359381863|emb|CCE80700.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|359382628|emb|CCE79935.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
Length = 344
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/369 (22%), Positives = 145/369 (39%), Gaps = 85/369 (23%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
FS +++ DAF K +++ GG T+V LFI + V++ + + VD
Sbjct: 4 FSTKVRTFDAFPKIDPHKTQRSSSGGFSTLVTALFILLVTWVEIGGFLGGYVDHQFIVDD 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I+LD++V + C+YL + +D + ++ L E L+ G P ++
Sbjct: 64 KLTSDLFINLDMLV-GMPCEYLHTNVMDVTHDRLLAGE------LLNFQGMNFFVP--DI 114
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
V ++ +EN T P+L
Sbjct: 115 V------QMNSENNDHNT---------------------------------------PDL 129
Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
D +++ + E++ + N C IYG + VN+V+G FHI G Y+ H
Sbjct: 130 DEVMRETVRAEFNVAGTRMNEDASACHIYGSIPVNKVAGDFHITGKGFGYADRHR----- 184
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
P+ N +H I SFG + + PLD T A + + Y++ +PT+YE+
Sbjct: 185 VPF--EKLNFSHVIMEFSFG---EFYPMIKNPLDFTGKIASQKLQSYKYFMTAVPTLYEK 239
Query: 299 LD-----------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
L + G +PG++F Y+ + + I EK ++
Sbjct: 240 LGIEVDTYQYSLTEQHRAITTDETGLPSDIPGLYFKYDFDTIKLLIAEKRIPFLQFVARL 299
Query: 342 MCNISGTYI 350
+SG +I
Sbjct: 300 ATIVSGLFI 308
>gi|427788003|gb|JAA59453.1| Putative copii vesicle protein [Rhipicephalus pulchellus]
Length = 285
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 84/191 (43%), Gaps = 28/191 (14%)
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
K GC+ G +++V G+FH++ H QP + TH I L+F
Sbjct: 104 KTPVGSGCRFEGKFFIHKVPGNFHVS---------THAAAKQP---DKIDMTHIIHDLTF 151
Query: 258 GIKLQDDDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG---------- 306
G+K+ D+ LD G +Y +KI+PT+YE+ G ++
Sbjct: 152 GVKMTDEVRGSFNSLDEMDKSGANGIESHDYVMKIVPTVYEKSKGERIESYQYTYAYKSY 211
Query: 307 ---GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
G MP I+F Y+L+P+ VK T + L T + + GT+ +VD+L+ +
Sbjct: 212 VSISHSGRIMPAIWFRYDLTPITVKYTRRGIPLYSFLTSVCAIVGGTFTVAGIVDSLVFT 271
Query: 362 CVKKISKVEIG 372
+ K E+G
Sbjct: 272 ASEVFRKFEMG 282
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 48/106 (45%), Gaps = 3/106 (2%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MVF R D + K +D + TV G ++I+ FIS L + Y EL+V
Sbjct: 1 MVFDVR--RFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELVSELYV 58
Query: 61 DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
D+ S K+P+ ++I + + C + LD D G + N K
Sbjct: 59 DNPSSADKIPVSINITLLKLDCSVVGLDIQDDMGRHEVGFVENTEK 104
>gi|347828541|emb|CCD44238.1| similar to endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Botryotinia fuckeliana]
Length = 381
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/354 (22%), Positives = 136/354 (38%), Gaps = 81/354 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + +T GG T+ L L+ + ++ T V+ G
Sbjct: 21 VKAFDAFPKAKPQYITQTSGGGKWTVAMMLVSFALLVSEFMRWWTGHETHTFVVEKGVGH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLH---VEHNIYKRRLDLDGKPIQEPQKEVV 123
L +++D+VV + C L ++ D++G++ L ++ + +D K + + K+
Sbjct: 81 SLQVNMDMVV-KMKCSELHINVQDAAGDRILAGIMLKEDATNWNQWVDAKGMHQLGKDAH 139
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
V + E G + D G K+ ++ K
Sbjct: 140 GRVITGEEYHEEGFGEEHVHDIVTLGG------------------KKRAKFAK------- 174
Query: 184 TIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIA------PGLSYSINHVHV 235
T ++K G C++YG LEVN+V G FH+ P + + ++H
Sbjct: 175 ----------TPRVKGGPKGGDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHLDH--- 221
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
+AFN +H I LSFG PLD T+A + Y++ I+PT+
Sbjct: 222 --------SAFNFSHIINELSFGPFYP---SLLNPLDRTIAGTPNHFHKYQYFLSIVPTL 270
Query: 296 YERLDGSKLGG--------------------GDGGMPGIFFSYELSPLMVKITE 329
Y + G+ +PGIFF Y++ PL++ + E
Sbjct: 271 YSLSPSTFSPSSSPTLLRTNQYAVTSQEHIVGERSVPGIFFKYDIEPLLLTVEE 324
>gi|432879813|ref|XP_004073560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Oryzias latipes]
Length = 271
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
EGC+ G +N+V G+FH++ H QP + TH I L+FG LQ
Sbjct: 94 EGCRFEGKFTINKVPGNFHVS---------THSATAQPQNP---DMTHSIHKLAFGDTLQ 141
Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+ + L G + + +Y +KI+PT+YE L G + +
Sbjct: 142 VHNVKGAFNALGGADKLSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVANKEYVAYS 201
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + T I + GT+ ++D+ + + +
Sbjct: 202 HTGRIIPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIIDSCIFTASEA 261
Query: 366 ISKVEIG 372
K++IG
Sbjct: 262 WKKIQIG 268
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 4/77 (5%)
Query: 31 TIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGSKLPIHLDIVVPTISCDYLALD 87
+I+C FI +L ++ + EL+VD G K+ + L+I +P + CD + LD
Sbjct: 10 SILCCFFILFLFLSELTGFIATEIVNELYVDDPDKDSGGKIDVSLNISLPNLHCDLVGLD 69
Query: 88 AVDSSGEQHL-HVEHNI 103
D G + H+++++
Sbjct: 70 IQDEMGRHEVGHIDNSM 86
>gi|403337257|gb|EJY67839.1| hypothetical protein OXYTRI_11647 [Oxytricha trifallax]
Length = 279
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 81/345 (23%), Positives = 146/345 (42%), Gaps = 99/345 (28%)
Query: 58 LFVDSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
+FVD+S +L I++DIV P + C+ L LD +D G + + ++YK+ L
Sbjct: 1 MFVDASHHDDRLNINIDIVFPKMPCEVLTLDIMDIMGTHIVDIGGSLYKKGL-------- 52
Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKK 176
++NG +E S G +TR+
Sbjct: 53 ----------------SQNGEFVSET-------SMLGG-IQTRQ---------------- 72
Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
D + + K+E + +GCQ+ G+ +NRV G+FHI+ S+S + V+
Sbjct: 73 ------DLLKRIKDEMDQK-------QGCQLKGFFNINRVPGNFHIS---SHSQKDLIVN 116
Query: 237 -DIQPYTSAAFNTTHHIRHLSFGIK-----LQDDDERR---KPLDGTVAKAEEG------ 281
++Q YT F+ TH I H+SFG + +Q + +++ PLDG A +
Sbjct: 117 LEMQGYT---FDFTHKINHVSFGRQEDFKVIQKNFKQQGVLNPLDGLEFSANQDNKGKPQ 173
Query: 282 ASMFNYYIKIIPTIYERLDGSK--------------LGGGDGGMPGIFFSYELSPLMVKI 327
A N+++ + + Y +D ++ + + FSYELSP+ V
Sbjct: 174 ALATNFFMVAVSSYY--MDTNRNTYNMYQLTSTHKSQSNANVNENMLVFSYELSPIKVLF 231
Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
++ +++ ++ I G + +VD ++H V + K IG
Sbjct: 232 NQEKENIVDFMIQLCAIIGGVFTISSVVDTIIHRSVSLLFKQRIG 276
>gi|50303625|ref|XP_451754.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640886|emb|CAH02147.1| KLLA0B04950p [Kluyveromyces lactis]
Length = 341
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 150/371 (40%), Gaps = 76/371 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K E + + GG +I+ +LF+ ++ + +F ++ VD
Sbjct: 4 LRTFDAFPKTDEQHVKTSSKGGLSSILTYLFLLFIAWSEFGSFFGGYIDQQYVVDDQIKE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I+LD+ V ++C + ++A D +G++ L E+ + ++G P P VN +
Sbjct: 64 TVTINLDLYV-NMACKNIRINARDITGDRGLISEN------IQMEGMPFYIPVGTRVNEM 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N + +L++ G A+ R+ +T +
Sbjct: 117 --------NNIVSPDLDE--ILGEAIPAQF--REAIDTSE-------------------L 145
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+++++ GC I+G + VN+V G HI A G Y D
Sbjct: 146 TGRDDFN----------GCHIFGSVPVNKVKGELHITAHGWGYRSASAIPKD-------Q 188
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
N H I LSFG D PLD T ++E + Y+ I+PT+Y+++
Sbjct: 189 INFNHVINELSFGDFYPYID---NPLDNTAKFSDEKIKAYYYFTSIVPTLYKKMGAEVDT 245
Query: 302 -----SKLGGGDG----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT- 351
S+ G+ G+PGIF Y+ P+ + I++ +++ +S T
Sbjct: 246 NQYALSETEYGESSKATGVPGIFIRYQFEPMKIIISDMRIGFFQFIIRLVAILSFIVYTA 305
Query: 352 ---FMLVDALL 359
F LVD L
Sbjct: 306 SWIFRLVDKSL 316
>gi|323448816|gb|EGB04710.1| hypothetical protein AURANDRAFT_55105 [Aureococcus anophagefferens]
Length = 324
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 87/187 (46%), Gaps = 36/187 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC + G++ VNRV G+FHI + SI+H +A N +H + HLSFG L
Sbjct: 143 GCMVSGHVLVNRVPGNFHIE---ARSIHH-------NLNAAMTNLSHVVNHLSFGTPLAK 192
Query: 264 DDERR----------KPLDGTVAKAEEGASMFNYYIKIIPTIYE---------RLDGSKL 304
D +R+ PLDG + + + + ++Y K++ T +E + G ++
Sbjct: 193 DMQRKVSKYPQFQSVHPLDGGIFVSRDYHQVHHHYSKVVSTHFEVGGMMTKSREIVGYQM 252
Query: 305 GGGDGGM-------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
M P FSY+LSP+ V ++ K + T + I GT+ +VDA
Sbjct: 253 LAQSQIMHYNEMDVPEAKFSYDLSPMAVLVSSKGRRWYDFVTSVCAIIGGTFTVVGIVDA 312
Query: 358 LLHSCVK 364
+L+ +K
Sbjct: 313 VLYKIIK 319
>gi|241953329|ref|XP_002419386.1| COPii-coated vesicle-associated protein, putative [Candida
dubliniensis CD36]
gi|223642726|emb|CAX42980.1| COPii-coated vesicle-associated protein, putative [Candida
dubliniensis CD36]
Length = 345
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 89/398 (22%), Positives = 153/398 (38%), Gaps = 90/398 (22%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F++++K DAF K ++ GG T+V + ++ +++ Y + VD
Sbjct: 4 FAQKVKTFDAFPKVDPHHQVRSQRGGLSTLVTYFCGLLILWIEIGGYIGGYVDRQFTVDD 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I++D++V + C ++ + D + + +L E L+ +G P
Sbjct: 64 QIRSDLTINIDMIV-AMPCQFIHTNVEDITHDTYLAGE------TLNFEGIHFFVPDSFK 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+N +PN P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129
Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
D ++Q + E+ +E + N C I+G + VN+V G F I G Y + HV
Sbjct: 130 DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRITGKGFGYR-DRSHV--- 185
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
P+ S N +H I+ SFG + PLD T EE + YY K++PT+YE+
Sbjct: 186 -PFES--LNFSHVIQEFSFG---EFYPYLNNPLDATGKITEERLQTYMYYAKVVPTLYEQ 239
Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
L D ++ + G+PGI+F Y+ P+ + I EK K
Sbjct: 240 LGLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAK 299
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
+ I G ++ L +K+ + G K V +
Sbjct: 300 -LATIGG---GLLIAAGYLFRLYEKLLFIFYGQKAVQQ 333
>gi|301100294|ref|XP_002899237.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262104154|gb|EEY62206.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 469
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 86/189 (45%), Gaps = 34/189 (17%)
Query: 203 EGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
EGC+++G+L V RV G+FH+ +YS++ S+ N +H + L FG L
Sbjct: 289 EGCRLFGHLYVKRVPGNFHVHLANPAYSMD-----------SSLVNASHTVNELWFGEHL 337
Query: 262 QDDDERRKPLDGTVA------KAEEGASMFN-----YYIKIIPTIYERLDGSKLGG---- 306
D R P + + ++ S++ +YIK++ Y + DGS++
Sbjct: 338 APGDMSRLPREAQTQLYTHRLENQDFTSLYKNHTYVHYIKVVTNSYVQGDGSEINVYKYT 397
Query: 307 -------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
+P + F Y+LSP+ V+I+E + H T I G + +VD ++
Sbjct: 398 AHSNEYLETDDLPSVMFRYDLSPMSVRISEDTVPFYHFVTSACAIIGGVFTVIGIVDQII 457
Query: 360 HSCVKKISK 368
H + ++K
Sbjct: 458 HQTARALNK 466
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 54/109 (49%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK D + K ED T+ G +++I + L ++ Y V ++ +D
Sbjct: 7 LKKWDFYKKIPEDLTVSTLPGVSLSIAGCFIMFLLFILEFNSYLTVDYKYDIVMDEGLDQ 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
+ I+ +I VP + C++ ++D D +G + ++ +I+K RLD G+ +
Sbjct: 67 TMRINFNITVPDLPCEFASVDVSDMTGTRKHNMTSDIFKIRLDQKGRMV 115
>gi|334311203|ref|XP_001380577.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Monodelphis domestica]
Length = 321
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IK 260
EGC+ G +N+V G+FH++ H QP + TH I LSFG ++
Sbjct: 144 EGCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQ 191
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 192 VQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYS 251
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 252 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 311
Query: 366 ISKVEIG 372
K+++G
Sbjct: 312 WKKIQLG 318
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 51/102 (50%), Gaps = 4/102 (3%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS--- 62
RL D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 35 RLTRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEVVNELYVDDPDK 94
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 95 DSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 136
>gi|169860063|ref|XP_001836668.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Coprinopsis cinerea okayama7#130]
gi|116502344|gb|EAU85239.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Coprinopsis cinerea okayama7#130]
Length = 516
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 84/364 (23%), Positives = 141/364 (38%), Gaps = 78/364 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K + ++ G + + + L+ D+ ++ E VD+ +GS
Sbjct: 19 LTKFDAFPKLPSTYKARSESRGFLMVFVIILAFLLMLNDIGEFIWGWPDFEFGVDNDKGS 78
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
LPI+LD+ V + C YL +D D+ G+ RL L N
Sbjct: 79 TLPINLDMTV-NMPCKYLTVDLRDAMGD------------RLFLS------------NGF 113
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ + G T L++ S A ++RK + +R KK
Sbjct: 114 RRDGTIFDVGQATA-LKEHAAALSAQEAVAQSRKSRGFFATL---FRSKK---------- 159
Query: 187 QCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQ 239
K K T+ C+I+G + V +V+ + H+ G Y S HV H +
Sbjct: 160 --------SKFKPTYNHQADASACRIWGTMYVKKVTANLHVTTLGHGYASYEHVDHHLM- 210
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
N +H I+ SFG E +PLD + E + Y++ ++PT Y
Sbjct: 211 -------NLSHVIQEFSFGPHFP---EIVQPLDNSFEATHEHFIAYQYFLHVVPTTYVAP 260
Query: 300 DGSKLGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
+ L + G PGIFF +EL PL + +++ +L L + + I
Sbjct: 261 RTAPLETNQYSVTHYTRVLEHNRGTPGIFFKFELDPLKITQYQRTTTLLQLMIRCVGVIG 320
Query: 347 GTYI 350
G ++
Sbjct: 321 GVFV 324
>gi|407037175|gb|EKE38536.1| hypothetical protein ENU1_163530 [Entamoeba nuttalli P19]
Length = 315
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 87/211 (41%), Gaps = 39/211 (18%)
Query: 196 KLKNTFTEGCQIYGYLEVNRVSGSFHIAPG-LSYSINHV-------------HVHDIQPY 241
K N GC+++G ++V+RVSG FH+A G +S+ + H+H
Sbjct: 108 KFDNRLLGGCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQ 167
Query: 242 TSAAFNTTHHIRHLSFGIKLQDD-DERRKPLDG---TVAKAEEGASMFNYYIKIIPTIY- 296
+FN TH+I HLSF L PL+G T+ + YYI +IPT++
Sbjct: 168 EMKSFNPTHYINHLSFSNILGSTVHSGETPLNGKEFTLNGFDNARK--TYYINVIPTLFK 225
Query: 297 --------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
ER D G PG+FF YELSP +V S H +
Sbjct: 226 YPSYTLRTYQLSVNER-DVPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVG 284
Query: 343 CNISGTYITFMLVDALL---HSCVKKISKVE 370
I G I L+ L H V + ++E
Sbjct: 285 AIIGGVLIIMGLLSRLFDSKHELVTSVVEME 315
>gi|340505495|gb|EGR31815.1| hypothetical protein IMG5_101180 [Ichthyophthirius multifiliis]
Length = 327
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 86/191 (45%), Gaps = 23/191 (12%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---- 258
EGC I G + VN+V G+FHI+ S++ HV + + +H ++HLSFG
Sbjct: 137 EGCNISGTMLVNKVPGNFHIS---SHAYGHVLGQVLSNAGKNTIDLSHKVKHLSFGDEFD 193
Query: 259 ---IKLQDDDERRKPLDGTVAKAEEG---ASMFNYYIKIIPTIY----------ERLDGS 302
IK Q P+D + + YYI I+PT Y + +
Sbjct: 194 LKNIKRQFSQGLLHPMDNKQKDKPQNILNGITYQYYINIVPTTYVDTGNKNYHVYQFTYN 253
Query: 303 KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
+ +P +++ Y+LSP+ VK + + +S H +I I G + +VD++++
Sbjct: 254 SNEQINNHLPTVYYRYDLSPVTVKFSMQKESFLHFLVQICAIIGGIFTVASIVDSIVYRA 313
Query: 363 VKKISKVEIGG 373
V I K + G
Sbjct: 314 VLNILKRDASG 324
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 56/109 (51%), Gaps = 1/109 (0%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+K D + K D + T G V+I+C + + L ++ + + T E+F+D RG
Sbjct: 2 NIKSFDMYRKLPSDLTQSTTSGAVVSIICGIIVLILFISELRSFLAIEETSEMFIDIVRG 61
Query: 66 -SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
K+ ++LDI P CD L+LD D G +++E I KRR+ DG
Sbjct: 62 GQKIKVNLDIDFPKFPCDILSLDMQDIMGSHTVNIEGTINKRRISSDGN 110
>gi|154305556|ref|XP_001553180.1| hypothetical protein BC1G_08547 [Botryotinia fuckeliana B05.10]
Length = 381
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 77/354 (21%), Positives = 136/354 (38%), Gaps = 81/354 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K DAF K + +T GG T+ L L+ + ++ T V+ G
Sbjct: 21 VKAFDAFPKAKPQYITQTSGGGKWTVAMMLVSFALLVSEFMRWWTGHETHTFVVEKGVGH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLH---VEHNIYKRRLDLDGKPIQEPQKEVV 123
L +++D+VV + C L ++ D++G++ L ++ + +D K + + K+
Sbjct: 81 SLQVNMDMVV-KMKCSELHINVQDAAGDRILAGIMLKEDATNWNQWVDAKGMHQLGKDAH 139
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
V + E G + D G K+ ++ K
Sbjct: 140 GRVITGEEYHEEGFGEEHVHDIVTLGG------------------KKRAKFAK------- 174
Query: 184 TIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIA------PGLSYSINHVHV 235
T ++K G C++YG LEVN+V G FH+ P + + ++H
Sbjct: 175 ----------TPRVKGGPKGGDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHLDH--- 221
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
+AFN +H I LSFG PLD T+A + Y++ ++PT+
Sbjct: 222 --------SAFNFSHIINELSFGPFYPS---LLNPLDRTIAGTPNHFHKYQYFLSVVPTL 270
Query: 296 YERLDGSKLGG--------------------GDGGMPGIFFSYELSPLMVKITE 329
Y + G+ +PGIFF Y++ PL++ + E
Sbjct: 271 YSLSPSTFSPSSSPTLLRTNQYAVTSQEHIVGERSVPGIFFKYDIEPLLLTVEE 324
>gi|426200953|gb|EKV50876.1| hypothetical protein AGABI2DRAFT_113626 [Agaricus bisporus var.
bisporus H97]
Length = 542
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 147/360 (40%), Gaps = 69/360 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K F ++ G +TI L L+ D+ +Y + VD
Sbjct: 20 LAKFDAFPKVPSAFKARSESRGFMTIFVMLVALLLMLNDIGEYIWGWPEFKFAVDQDNAP 79
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ ++LD+VV + C YL++D D G+ RL L G +
Sbjct: 80 YMFVNLDMVV-NMQCRYLSVDLRDVVGD------------RLLLSG------------GL 114
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ V G T L++ +K S A +++RK D+++
Sbjct: 115 QRDGVKFNIGEATA-LKEHSKGLSARQALSQSRKSRGF-----------------FDSLL 156
Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTS 243
+ +E + N +G C+IYG + V RV+ + HI G YS ++ HV Q
Sbjct: 157 RRNSEPKFKPTYNHVPDGGACRIYGTMPVKRVTANLHITTVGHGYS-SYQHVDHNQ---- 211
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
N +H I SFG E +PLD + ++ + + Y++ ++PT Y S
Sbjct: 212 --MNLSHVITEFSFGPYF---PEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSP 266
Query: 304 LGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
L + G PGIFF ++L PL + I +K+ +L L + + I G ++
Sbjct: 267 LRTNQYSVTHYTRQVEHNKGTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFV 326
>gi|346469653|gb|AEO34671.1| hypothetical protein [Amblyomma maculatum]
Length = 285
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 28/191 (14%)
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
K GC+ G +++V G+FH++ H QP + TH I L+F
Sbjct: 104 KTPVGSGCRFEGKFFIHKVPGNFHVS---------THAAAKQP---EKIDMTHIIHDLTF 151
Query: 258 GIKLQDDDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG---------- 306
G+K+ D+ + LD G +Y +KI+PT+YE+ G ++
Sbjct: 152 GVKMTDEVKGSFNSLDEMDKSGGNGIESHDYVMKIVPTVYEKSRGERIESYQYTYAYKSY 211
Query: 307 ---GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
G MP I+F Y+L+P+ VK T + L T + + GT+ +VD+L+ +
Sbjct: 212 VSISHTGRIMPAIWFRYDLTPITVKYTRRGVPLYSFLTSVCAIVGGTFTVAGIVDSLIFT 271
Query: 362 CVKKISKVEIG 372
+ K E+G
Sbjct: 272 ASEVFRKFEMG 282
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 1/100 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ D + K +D + TV G ++I+ FIS L + Y EL+VD+ S
Sbjct: 5 VRRFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELVSELYVDNPSSA 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
K+P+ ++I + + C + LD D G + N K
Sbjct: 65 EKIPVSINITLLKLDCSVVGLDIQDDMGRHEVGFVENTEK 104
>gi|115433364|ref|XP_001216819.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189671|gb|EAU31371.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 449
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 128/320 (40%), Gaps = 75/320 (23%)
Query: 68 LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVK 127
L ++LDIVV + CD L ++ D++G++ L E + KR EP + +
Sbjct: 133 LQLNLDIVV-EMPCDTLDVNIQDAAGDRVLAGE--LLKR----------EPTSWQL-WMD 178
Query: 128 KKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQ 187
K+ + G+ + G A+ E + EV+ R K P+L
Sbjct: 179 KRNYESYGGSHEYQTLSQEDAGRLE-AQDEDAHVHHVLGEVRRNPRKKFPKSPKLR---- 233
Query: 188 CKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS-AA 245
+ + C+IYG LE N+V G FHI A G Y D P+
Sbjct: 234 ----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYR-------DFAPHLDHQT 276
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER------- 298
FN +H I LSFG PLD T+A+ E F Y++ ++PTIY +
Sbjct: 277 FNFSHMITELSFGPHYP---TLLNPLDKTIAETETHYYKFQYFLSVVPTIYSKGNRVLDT 333
Query: 299 --------LDGSK-------------------LGGGDGGMPGIFFSYELSPLMVKITEKS 331
D S+ L +PGIFF Y + P+++ I+E+
Sbjct: 334 YSIAPPTLHDNSRHNKNLVFTNQYAATSQSDALPESPFFVPGIFFKYNIEPILLLISEER 393
Query: 332 KSLGHLWTKIMCNISGTYIT 351
S L +++ +SG +T
Sbjct: 394 GSFLSLLIRLVNTVSGVMVT 413
>gi|409083992|gb|EKM84349.1| hypothetical protein AGABI1DRAFT_32491 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 542
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 147/360 (40%), Gaps = 69/360 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K F ++ G +TI L L+ D+ +Y + VD
Sbjct: 20 LAKFDAFPKVPSAFKARSESRGFMTIFVMLVALLLMLNDIGEYIWGWPEFKFAVDQDNAP 79
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ ++LD+VV + C YL++D D G+ RL L G +
Sbjct: 80 YMFVNLDMVV-NMQCRYLSVDLRDVVGD------------RLLLSG------------GL 114
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ V G T L++ +K S A +++RK D+++
Sbjct: 115 QRDGVKFNIGEATA-LKEHSKGLSARQALSQSRKSRGF-----------------FDSLL 156
Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTS 243
+ +E + N +G C+IYG + V RV+ + HI G YS ++ HV Q
Sbjct: 157 RRNSEPKFKPTYNHVPDGGACRIYGTMPVKRVTANLHITTVGHGYS-SYQHVDHNQ---- 211
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
N +H I SFG E +PLD + ++ + + Y++ ++PT Y S
Sbjct: 212 --MNLSHVITEFSFGPYF---PEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSP 266
Query: 304 LGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
L + G PGIFF ++L PL + I +K+ +L L + + I G ++
Sbjct: 267 LRTNQYSVTHYTRQVEHNKGTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFV 326
>gi|68465583|ref|XP_723153.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|68465876|ref|XP_723006.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|46445018|gb|EAL04289.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|46445174|gb|EAL04444.1| likely COPII secretory vesicle component [Candida albicans SC5314]
Length = 345
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 88/398 (22%), Positives = 153/398 (38%), Gaps = 90/398 (22%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F++++K DAF K ++ GG T++ + ++ +++ Y + VD
Sbjct: 4 FAQKVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFCGLLILWIEIGGYIGGYVDRQFTVDD 63
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S L I++D++V + C ++ + D + + +L E L+ +G P
Sbjct: 64 QIRSALTINVDMIV-AMPCQFIHTNVEDITHDTYLAGE------TLNFEGIHFFVPDSFK 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
+N +PN P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129
Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
D ++Q + E+ +E + N C I+G + VN+V G F I G Y + HV
Sbjct: 130 DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRITGKGFGYR-DRSHV--- 185
Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
P+ S N +H I+ SFG + PLD T EE + YY K++PT+YE+
Sbjct: 186 -PFES--LNFSHVIQEFSFG---EFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLYEQ 239
Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
L D ++ + G+PGI+F Y+ P+ + I EK K
Sbjct: 240 LGLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAK 299
Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
+ I G ++ L +K+ + G K V +
Sbjct: 300 -LATIGG---GLLIAAGYLFRLYEKLLFIFYGQKAVQQ 333
>gi|395505103|ref|XP_003756885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Sarcophilus harrisii]
Length = 290
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IK 260
EGC+ G +N+V G+FH++ H QP + TH I LSFG ++
Sbjct: 113 EGCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQ 160
Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
+Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 161 VQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYS 220
Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 280
Query: 366 ISKVEIG 372
K+++G
Sbjct: 281 WKKIQLG 287
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|156065931|ref|XP_001598887.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980]
gi|154691835|gb|EDN91573.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 421
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 81/349 (23%), Positives = 134/349 (38%), Gaps = 71/349 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ DAF K + +T GG T+ + L+ + ++ T V+ G
Sbjct: 21 VQAFDAFPKAKPQYITQTSGGGKWTVAMLIISFALLLSEFSRWWTGYETHTFVVEKGIGH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL---HVEHNIYKRRLDLDGKPIQEPQKEVV 123
L I++D+VV + C L ++ D++G++ L ++ + +D K + + K+
Sbjct: 81 SLQINMDMVV-KMKCSGLHINVQDAAGDRILAGIMLKEDPTNWSQWVDAKGVHQLGKDAH 139
Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
V + E G + D G K+ ++ K
Sbjct: 140 GRVVTGEEYHEEGFGEEHVHDIVALGG------------------KKRAKFAK------- 174
Query: 184 TIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
T +LK G C++YG LEVN+V G FHI A G Y H+
Sbjct: 175 ----------TPRLKGGPRGGDSCRVYGSLEVNKVQGDFHITAKGHGYPELGQHL----- 219
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
AFN +H I LSFG PLD T+A + Y++ I+PT+Y
Sbjct: 220 -DHNAFNFSHIINELSFGPFYPS---LLNPLDRTIAGTPNHFHKYQYFLSIVPTLYSLSP 275
Query: 301 GSKLGG--------------------GDGGMPGIFFSYELSPLMVKITE 329
+ G+ +PGIFF Y++ PL++ + E
Sbjct: 276 STFSPSSSPSLLRTNQYAVTSQEHIVGERNVPGIFFKYDIEPLLLTVEE 324
>gi|255563725|ref|XP_002522864.1| thioredoxin domain-containing protein, putative [Ricinus communis]
gi|223537948|gb|EEF39562.1| thioredoxin domain-containing protein, putative [Ricinus communis]
Length = 478
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 92/217 (42%), Gaps = 41/217 (18%)
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
++ +N + K T GC+I GY+ V +V G+ I+ H P +
Sbjct: 270 LKSENATQSTKRPAPLTGGCRIEGYVRVKKVPGNLIISA-------RSGAHSFDP---SQ 319
Query: 246 FNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFNY-----------YIK 290
N +H I HLSFG+K+ ++ +R P G G S N+ Y++
Sbjct: 320 MNMSHVISHLSFGLKVSPKVMNEAKRLVPYIGGSHDKLNGRSFVNHRDVDANVTIEHYLQ 379
Query: 291 IIPTI---------------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
I+ T YE S L +P F +ELSP+ V ITE KS
Sbjct: 380 IVKTEVVTRRSSREHKLLEEYEYTAHSSLVQS-VYIPAAKFHFELSPMQVLITENPKSFS 438
Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
H T + I G + ++D++LH V+ + KVE+G
Sbjct: 439 HFITNVCAIIGGVFTVAGILDSILHHTVRLMKKVELG 475
>gi|361126303|gb|EHK98312.1| putative ER-derived vesicles protein 41 [Glarea lozoyensis 74030]
Length = 343
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 85/206 (41%), Gaps = 40/206 (19%)
Query: 194 TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA-----AFNT 248
T +L+ + C+IYG LEVN+V G FH+ H Q + + AFN
Sbjct: 140 TPRLRGNVGDSCRIYGNLEVNKVQGDFHLT---------ARGHGYQEWGAGHLDHTAFNF 190
Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGD 308
+H + LSFG PLD TV+ F Y++ ++PT Y +D S D
Sbjct: 191 SHIVNELSFGAFYP---SLLNPLDRTVSTTPNHFHKFQYFLSVVPTAYT-VDSSSRSARD 246
Query: 309 G------------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+PGIFF Y++ P+++ + E S K++ SG +
Sbjct: 247 TIFTNQYAVTEQSHEVNERSVPGIFFKYDIEPMLLTVEESRDSFLRFVVKVVNVFSGVLV 306
Query: 351 T----FMLVDALLHSCVKKISKVEIG 372
F L + + + K+ + +G
Sbjct: 307 AGHWGFTLTEWAVSAFGKRKRSMSVG 332
>gi|406868300|gb|EKD21337.1| copii-coated vesicle protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 382
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 137/352 (38%), Gaps = 67/352 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ DAF K + +T GG T+ + LI + +++ T V+ +
Sbjct: 22 VQAFDAFPKAKPQYVTRTSGGGKWTVAMLIVSFMLIYSEFSRWWRGHETHTFTVEKAVER 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I+LDIVVP + C+ + ++ D++G++ L ++ R P Q Q V
Sbjct: 82 GLQINLDIVVP-MKCEDIHINVQDAAGDRIL--AGVMFTR------NPTQWAQWVHERGV 132
Query: 127 KKKKVTTENGTTTTE--LEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ T E L+ G + + + + R +K A E+D+
Sbjct: 133 HRLGTDANGKIITGEEYLDHDEGFGEEHVHDIVAAAGKLKKAKFAKTPRSRKSA--EMDS 190
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY---SINHVHVHDIQP 240
C+I+G LEVN+V G HI A G Y + H+ H
Sbjct: 191 --------------------CRIFGNLEVNKVQGELHITARGHGYQELAAGHLDHH---- 226
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
AFN +H + LSFG PLD TV+ F Y++ ++PT+Y +D
Sbjct: 227 ----AFNFSHVVSELSFGPFYP---SLHNPLDRTVSTTPNNFHKFQYFLSVVPTVYS-VD 278
Query: 301 GSKLGGG------------------DGGMPGIFFSYELSPLMVKITEKSKSL 334
S + +PGIFF Y+ P+++ + E S
Sbjct: 279 SSTTYSSQTLFTNQYAVTEQSHVVSEFSVPGIFFKYDFEPMLLTVQESRDSF 330
>gi|357474735|ref|XP_003607653.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355508708|gb|AES89850.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 477
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 83/197 (42%), Gaps = 36/197 (18%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
T GC++ GY+ V +V GS ++ D + ++ N +H I HLSFG K+
Sbjct: 288 TGGCRVEGYVRVKKVPGSLVVSAR----------SDAHSFDASQMNMSHVINHLSFGKKV 337
Query: 262 QD----DDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL-- 304
D + P G G S N +YI+++ T G KL
Sbjct: 338 TPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKLIE 397
Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
+P F ELSP+ V ITE KS H T + I G + ++
Sbjct: 398 EYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVFTVAGIL 457
Query: 356 DALLHSCVKKISKVEIG 372
D++LH+ +K + K+EIG
Sbjct: 458 DSILHNTIKAMKKIEIG 474
>gi|21618302|gb|AAM67352.1| unknown [Arabidopsis thaliana]
Length = 317
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 95/211 (45%), Gaps = 37/211 (17%)
Query: 189 KNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFN 247
K++ S+ LK T GC++ GY+ V +V G+ ++ + S +H + S+ N
Sbjct: 114 KSDNSSRTLKKAPSTGGCRVEGYMRVKKVPGNLMVS---ARSGSH-------SFDSSQMN 163
Query: 248 TTHHIRHLSFGIKLQDDD----ERRKPLDGTVAKAEEGASMFN-----------YYIKII 292
+H + HLSFG ++ +R P G +G S N +Y++I+
Sbjct: 164 MSHVVNHLSFGRRIMPQKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIV 223
Query: 293 PTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
T + +G L +P F +ELSP+ V ITE SKS H T +
Sbjct: 224 KTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNV 283
Query: 342 MCNISGTYITFMLVDALLHSCVKKISKVEIG 372
I G + ++D++LH + + K+E+G
Sbjct: 284 CAIIGGAFTVAGILDSILHHSMTLMKKIELG 314
>gi|238480964|ref|NP_680742.2| protein PDI-like 5-4 [Arabidopsis thaliana]
gi|332659898|gb|AEE85298.1| protein PDI-like 5-4 [Arabidopsis thaliana]
Length = 532
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 98/214 (45%), Gaps = 37/214 (17%)
Query: 186 VQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
++ K++ S+ LK T GC++ GY+ V +V G+ ++ + S +H + S+
Sbjct: 326 LEDKSDNSSRTLKKAPSTGGCRVEGYMRVKKVPGNLMVS---ARSGSH-------SFDSS 375
Query: 245 AFNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYI 289
N +H + HLSFG ++ + +R P G +G S N +Y+
Sbjct: 376 QMNMSHVVNHLSFGRRIMPQKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYL 435
Query: 290 KIIPTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+I+ T + +G L +P F +ELSP+ V ITE SKS H
Sbjct: 436 QIVKTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFI 495
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
T + I G + ++D++LH + + K+E+G
Sbjct: 496 TNVCAIIGGVFTVAGILDSILHHSMTLMKKIELG 529
>gi|22328963|ref|NP_567765.2| protein PDI-like 5-4 [Arabidopsis thaliana]
gi|75213708|sp|Q9T042.1|PDI54_ARATH RecName: Full=Protein disulfide-isomerase 5-4; Short=AtPDIL5-4;
AltName: Full=Protein disulfide-isomerase 7; Short=PDI7;
AltName: Full=Protein disulfide-isomerase 8-2;
Short=AtPDIL8-2; Flags: Precursor
gi|4490704|emb|CAB38838.1| putative protein [Arabidopsis thaliana]
gi|7269561|emb|CAB79563.1| putative protein [Arabidopsis thaliana]
gi|15450832|gb|AAK96687.1| putative protein [Arabidopsis thaliana]
gi|20259836|gb|AAM13265.1| putative protein [Arabidopsis thaliana]
gi|332659897|gb|AEE85297.1| protein PDI-like 5-4 [Arabidopsis thaliana]
Length = 480
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 98/214 (45%), Gaps = 37/214 (17%)
Query: 186 VQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
++ K++ S+ LK T GC++ GY+ V +V G+ ++ + S +H + S+
Sbjct: 274 LEDKSDNSSRTLKKAPSTGGCRVEGYMRVKKVPGNLMVS---ARSGSH-------SFDSS 323
Query: 245 AFNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYI 289
N +H + HLSFG ++ + +R P G +G S N +Y+
Sbjct: 324 QMNMSHVVNHLSFGRRIMPQKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYL 383
Query: 290 KIIPTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+I+ T + +G L +P F +ELSP+ V ITE SKS H
Sbjct: 384 QIVKTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFI 443
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
T + I G + ++D++LH + + K+E+G
Sbjct: 444 TNVCAIIGGVFTVAGILDSILHHSMTLMKKIELG 477
>gi|73953406|ref|XP_852891.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 isoform 1 [Canis lupus familiaris]
Length = 290
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 85/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G+ +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|327307836|ref|XP_003238609.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
gi|326458865|gb|EGD84318.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
Length = 399
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 85/383 (22%), Positives = 152/383 (39%), Gaps = 77/383 (20%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+LK DAF K + + GG TI + + L C ++ +++ V+
Sbjct: 23 KLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVERGVS 82
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
++ +++D VV + CD + ++ D++G+ H+ L G + QEP
Sbjct: 83 QEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSWTA- 127
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+++ + E + NK + E E + + + + E R +K P+
Sbjct: 128 --WNREMNQRRSGGSPEYQTLNKEDTFRLEEQE--EDLHVEHVLGEVRRSRKKKFPK--- 180
Query: 185 IVQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
KLK + + C+++G LE N+V G+ HI A G Y P+
Sbjct: 181 ---------APKLKRSDAVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRTTNPH- 227
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL--- 299
+ N TH I LSFG PLD TV+ + Y++ ++PTIY +
Sbjct: 228 --SLNFTHLITELSFGPHY---GRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTKSGHI 282
Query: 300 --------DGSKLGGGDG-----------------------GMPGIFFSYELSPLMVKIT 328
D S + D PGIFF Y + P+++ ++
Sbjct: 283 DPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPIQPRIDATPGIFFKYNIEPILLIVS 342
Query: 329 EKSKSLGHLWTKIMCNISGTYIT 351
++ SL L +++ +SG +T
Sbjct: 343 QEWDSLLALMVRLVNVVSGVLVT 365
>gi|302508773|ref|XP_003016347.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
gi|291179916|gb|EFE35702.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
Length = 427
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/407 (21%), Positives = 157/407 (38%), Gaps = 91/407 (22%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
+ +LK DAF K + + GG TI + + L C ++ +++ V+
Sbjct: 20 IATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVER 79
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
++ +++D VV + CD + ++ D++G+ H+ L G + QEP
Sbjct: 80 GVSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW 125
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+++ + E + NK S E E + + + + E R +K P+
Sbjct: 126 ---GAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQE--EDLHVEHVLGEVRRSRKKKFPK 180
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY-----SINHVHV 235
S + K+ + C+++G LE N+V G+ HI A G Y + N +
Sbjct: 181 -----------SPKLKKSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSM 229
Query: 236 HDIQPYTS-----------------AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
+QP + N TH I LSFG PLD TV+
Sbjct: 230 SLLQPIITCIHGDAKNLTDQLTKLFPGLNFTHLITELSFGPHY---GRLLNPLDKTVSST 286
Query: 279 EEGASMFNYYIKIIPTIYERL-----------DGSKLGGGDG------------------ 309
+ Y++ ++PTIY + D S + D
Sbjct: 287 SINFYKYQYHLSVVPTIYTKSGHIDPNRRSLPDTSTITAKDSKTTVSTNQYAVTSYSQPI 346
Query: 310 -----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
PGIFF Y + P+++ ++++ SL L +++ +SG +T
Sbjct: 347 QPRIDATPGIFFKYNIEPILLIVSQERDSLLALMVRLVNVVSGVLVT 393
>gi|226294628|gb|EEH50048.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb18]
Length = 392
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/383 (23%), Positives = 145/383 (37%), Gaps = 88/383 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + T GG T+V ++ + L ++ +++ V+
Sbjct: 24 LRTFDAFPKTKPTYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVEKGISR 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L ++LDIVV ++CD L ++ D++G++ L D+ K EP
Sbjct: 84 ELQLNLDIVV-AMTCDALRINVQDAAGDRILAS---------DMLNK---EPTSWAAWNR 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + G L + + G E + + + + EA R K P+
Sbjct: 131 ELNVALSGGGREYQTLAEEDA-----GRLMEQEEDMHVGHALGEARRSHKRKFPK----- 180
Query: 187 QCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
KLK + C+IYG LE N+V G FHI A G Y
Sbjct: 181 -------GPKLKRGEMPDSCRIYGSLEGNKVQGDFHITARGHGY---------------- 217
Query: 245 AFNTTHHIRH--LSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL--- 299
F H+ H LSFG PLD T++ + YY+ I+PTIY R
Sbjct: 218 -FEFGEHLDHHELSFGPHYST---LLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTV 273
Query: 300 --------DGSKLGGGDGG-----------------------MPGIFFSYELSPLMVKIT 328
D S + +PGIFF Y + P+++ I+
Sbjct: 274 DPYSQVLPDPSTISPSQRKNTIFTNQYAVTSRSHELPDVQFHVPGIFFKYNIEPILLIIS 333
Query: 329 EKSKSLGHLWTKIMCNISGTYIT 351
E+ SL L +++ ++G +
Sbjct: 334 EERGSLLALLVRLVNVMAGVVVA 356
>gi|224126339|ref|XP_002319814.1| predicted protein [Populus trichocarpa]
gi|222858190|gb|EEE95737.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 95/219 (43%), Gaps = 43/219 (19%)
Query: 186 VQCKNEYSTEKLKNTFTE--GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
++ K E +TE +K GC+I GY+ V +V G+ I+ + S H + S
Sbjct: 274 LEHKPENATEHVKRPAPSAGGCRIEGYVRVKKVPGNLVIS---ARSGAH-------SFDS 323
Query: 244 AAFNTTHHIRHLSFGIKL----QDDDERRKPLDGTVAKAEEGASMFNY-----------Y 288
A N +H I H SFG+K+ D +R P G G S N+ Y
Sbjct: 324 AQMNLSHVISHFSFGMKVLPRVMSDVKRLIPHIGRSHDKLNGRSFINHRDVGANVTIEHY 383
Query: 289 IKIIPTI---------------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
++++ T YE S L MP F +ELSP+ V ITE KS
Sbjct: 384 LQVVKTEVVTRRSSAEHKLIEEYEYTAHSSLAQ-TVYMPTAKFHFELSPMQVLITENPKS 442
Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
H T + I G + ++D++LH+ + + KVE+G
Sbjct: 443 FSHFITNVCAIIGGVFTVAGILDSILHNTFRMMKKVELG 481
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 64/115 (55%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MV + +LK +D + K D E ++ G ++IV L + +L +++ +Y V+T+ + V
Sbjct: 1 MVSTNKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELNNYLTVNTSTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P++SC++ ++D D G L++ I K +D D KP
Sbjct: 61 DNSSDGEFLRIDFNLSFPSLSCEFASVDVSDVLGTNRLNITKTIRKFSIDHDLKP 115
>gi|295663046|ref|XP_002792076.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279251|gb|EEH34817.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 392
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/388 (23%), Positives = 148/388 (38%), Gaps = 98/388 (25%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + T GG T+V ++ + L ++ +++ V+
Sbjct: 24 LRTFDAFPKTKPTYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVEKGISR 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+L ++LDIVV ++CD L ++ D++G++ L + L+ + +E+ A+
Sbjct: 84 ELQLNLDIVV-AMTCDALRINVQDAAGDRILASD------MLNKEPTSWAAWNRELNVAL 136
Query: 127 -----KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
+ + +T E+ E E+ G G EA R K P+
Sbjct: 137 SGGGREYQTLTEEHAGRLMEQEEDMHVGHALG----------------EARRSHKRKFPK 180
Query: 182 LDTIVQCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQ 239
KLK + C+IYG LE N+V G FHI A G Y
Sbjct: 181 ------------GPKLKRGEMPDSCRIYGSLEGNKVQGDFHITARGHGY----------- 217
Query: 240 PYTSAAFNTTHHIRH--LSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
F H+ H LSFG PLD T++ + YY+ I+PTIY
Sbjct: 218 ------FEYGEHLDHHELSFGPHYST---LLNPLDKTMSTTPFNFYKYQYYMSIVPTIYT 268
Query: 298 RL-----------DGSKLGGGDGG-----------------------MPGIFFSYELSPL 323
R D S + +PGIFF Y + P+
Sbjct: 269 RTGTIDPYSQVLPDPSTISPSQRKNTIFTNQYAVTSRSHELPDVQFYVPGIFFKYSIEPI 328
Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT 351
++ I+E+ SL L +++ ++G +
Sbjct: 329 LLIISEERGSLLALLVRLVNVMAGVVVA 356
>gi|432100023|gb|ELK28916.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Myotis davidii]
Length = 298
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 122 GCRFEGQFSINKVPGNFHVS---------THSASAQPQNP---DMTHVIHKLSFGDTLQV 169
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 170 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 229
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 230 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 289
Query: 367 SKVEIG 372
K+++G
Sbjct: 290 KKIQLG 295
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
D + K +D + T G +++ C LFI +L ++ + EL+VD G
Sbjct: 16 FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 75
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L+I +P + CD + LD D G + H+++++
Sbjct: 76 KIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIDNSM 113
>gi|217072996|gb|ACJ84858.1| unknown [Medicago truncatula]
gi|388501234|gb|AFK38683.1| unknown [Medicago truncatula]
Length = 243
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 82/197 (41%), Gaps = 36/197 (18%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK- 260
T GC++ GY+ V +V GS ++ D + ++ N +H I HLSFG K
Sbjct: 54 TGGCRVEGYVRVKKVPGSLVVSAR----------SDAHSFDASQMNMSHVINHLSFGKKV 103
Query: 261 --------------LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL-- 304
L + +R + EG +YI+++ T G KL
Sbjct: 104 TPRAMIDVKHWIPYLGINHDRLNGRSFVNTRDLEGNVTIEHYIQVVKTEVITRKGYKLIE 163
Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
+P F ELSP+ V ITE KS H T + I G + ++
Sbjct: 164 EYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVFTVAGIL 223
Query: 356 DALLHSCVKKISKVEIG 372
D++LH+ +K + K+EIG
Sbjct: 224 DSILHNTIKAMKKIEIG 240
>gi|167382848|ref|XP_001736294.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901464|gb|EDR27547.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 315
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 86/206 (41%), Gaps = 34/206 (16%)
Query: 196 KLKNTFTEGCQIYGYLEVNRVSGSFHIAPG------------LSYSINHV--HVHDIQPY 241
K N GC+++G ++V+RVSG FH+A G ++ + H H+H
Sbjct: 108 KFDNRLLGGCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQ 167
Query: 242 TSAAFNTTHHIRHLSFGIKLQDD-DERRKPLDG---TVAKAEEGASMFNYYIKIIPTIYE 297
+FN TH+I HLSF L PL+G T+ + YYI +IPT+++
Sbjct: 168 EMKSFNPTHYINHLSFSNTLGSTVHSGETPLNGKEFTLNGFDNARK--TYYINVIPTLFK 225
Query: 298 ----RLDGSKLG----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
L +L G PG+FF YELSP +V S H +
Sbjct: 226 YPSYTLRTYQLSVSERDIPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGA 285
Query: 344 NISGTYITFMLVDALLHSCVKKISKV 369
+ G I + L S + ++ V
Sbjct: 286 IVGGVLIIIGWLSKLFDSNRELVTSV 311
Score = 38.1 bits (87), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 27/108 (25%), Positives = 46/108 (42%), Gaps = 4/108 (3%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ LK D F K E T +++ ++ I L+ + ++ + VD+ +
Sbjct: 6 QVLKECDIFLKVPEKLKITTNTTKLFSVISYIIIGLLVFSETYNFLNPQWVSHVDVDTVK 65
Query: 65 GSKLP---IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNI-YKRRL 108
LP I++DI P + CD LD + +G L V I + RL
Sbjct: 66 AGVLPNMYINIDITFPKMKCDDFGLDVTEITGSLQLGVTDGIKFDNRL 113
>gi|224013158|ref|XP_002295231.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969193|gb|EED87535.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 492
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 90/206 (43%), Gaps = 55/206 (26%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----- 258
GCQ+ G+L VNRV G+FHI + S+NH +A N TH + HLSFG
Sbjct: 293 GCQVSGHLMVNRVPGNFHIE---AKSVNH-------NLNAAMTNLTHRVNHLSFGEPITK 342
Query: 259 ------------------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------ 294
++ ++ ++ P+D T + F++YIK++ T
Sbjct: 343 LPPHMENTPFMRKVKRVLKQVPEEHKQFNPMDDTEYVTAQFHQAFHHYIKVVSTHLNMGS 402
Query: 295 ---------------IYERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+Y+ L+ S++ D +P FSY++SP+ V + ++ +
Sbjct: 403 SSKSEYSVNDVNAVTVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWYDYL 462
Query: 339 TKIMCNISGTYITFMLVDALLHSCVK 364
T + I GT+ T L+DA L+ K
Sbjct: 463 TSLCAIIGGTFTTLGLIDATLYKVFK 488
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 22/107 (20%), Positives = 53/107 (49%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ +D + + +D E T G ++I ++ L + + + + + +D +
Sbjct: 13 MSSVDFYRRVPKDLTEATSLGAIMSICAITVMAILFFSETLAFARTAMVTSIALDENDQP 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
++ ++ +I + + CD++++D D+ G +V NI K +LD DG+
Sbjct: 73 QIRLNFNITLMDLHCDFVSVDVWDTLGTNRQNVTKNIEKWQLDEDGQ 119
>gi|13385678|ref|NP_080446.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Mus
musculus]
gi|52000733|sp|Q9DC16.1|ERGI1_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|12835932|dbj|BAB23423.1| unnamed protein product [Mus musculus]
gi|13529617|gb|AAH05516.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
musculus]
gi|26351067|dbj|BAC39170.1| unnamed protein product [Mus musculus]
gi|26353098|dbj|BAC40179.1| unnamed protein product [Mus musculus]
gi|53236959|gb|AAH83144.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
musculus]
gi|71059789|emb|CAJ18438.1| 1200007D18Rik [Mus musculus]
gi|74185526|dbj|BAE30231.1| unnamed protein product [Mus musculus]
gi|148690563|gb|EDL22510.1| RIKEN cDNA 1200007D18 [Mus musculus]
gi|158148953|dbj|BAF82010.1| MAA-136 protein [Mus musculus]
Length = 290
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHTIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|50510831|dbj|BAD32401.1| mKIAA1181 protein [Mus musculus]
Length = 320
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 144 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHTIHKLSFGDTLQV 191
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 192 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 251
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 252 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 311
Query: 367 SKVEIG 372
K+++G
Sbjct: 312 KKIQLG 317
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 35 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 94
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 95 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 135
>gi|301763094|ref|XP_002916978.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Ailuropoda melanoleuca]
Length = 306
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 130 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 177
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 178 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 237
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 238 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 297
Query: 367 SKVEIG 372
K+++G
Sbjct: 298 KKIQLG 303
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGS 66
D + K +D + T G ++I C LFI +L ++ + EL+VD G
Sbjct: 24 FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 83
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 84 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 121
>gi|344265732|ref|XP_003404936.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Loxodonta africana]
Length = 338
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 85/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 162 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 209
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE +G + +
Sbjct: 210 QNVQGAFNALGGADRLHSNPLASHDYILKIVPTVYEDKNGKQRYSYQYTVANKEYVAYSH 269
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 270 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 329
Query: 367 SKVEIG 372
K+++G
Sbjct: 330 KKIQLG 335
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 53 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 112
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 113 SGGKIDVTLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 153
>gi|114603487|ref|XP_001145588.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pan troglodytes]
Length = 424
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 248 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 295
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 296 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 355
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 356 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 415
Query: 367 SKVEIG 372
K+++G
Sbjct: 416 KKIQLG 421
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGS 66
D + K +D + T G ++I C LFI +L ++ + EL+VD G
Sbjct: 142 FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 201
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 202 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 239
>gi|109079798|ref|XP_001099287.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Macaca mulatta]
Length = 379
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 203 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 250
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 251 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 310
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 311 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 370
Query: 367 SKVEIG 372
K+++G
Sbjct: 371 KKIQLG 376
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 94 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 153
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 154 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 194
>gi|194382656|dbj|BAG64498.1| unnamed protein product [Homo sapiens]
Length = 235
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 59 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 106
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 107 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 166
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 167 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 226
Query: 367 SKVEIG 372
K+++G
Sbjct: 227 KKIQLG 232
>gi|156406959|ref|XP_001641312.1| predicted protein [Nematostella vectensis]
gi|156228450|gb|EDO49249.1| predicted protein [Nematostella vectensis]
Length = 287
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 87/190 (45%), Gaps = 36/190 (18%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
EGC I +N+V G+FH++ H QP + + H I ++FG ++
Sbjct: 111 EGCFISTRFTINKVPGNFHVS---------THGAGKQPDSP---DMNHIINAVNFGSRIM 158
Query: 263 DDDERRKPLDGTVAKAEE-----GASMFNYYIKIIPTIYERLDGS--------------- 302
D + P T K + G + +Y +KI+PTIY++LDG+
Sbjct: 159 D----KLPGAFTALKDRKRHDTNGLASHDYILKIVPTIYQKLDGTTTFSYQYTWAYKEYV 214
Query: 303 KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
G +P I+F Y+LSP+ VK E+ + L H T + + GT+ ++D+ + +
Sbjct: 215 SYSHGGQMLPAIWFRYDLSPITVKYIERRQPLYHFITTVCAIVGGTFTVAGIIDSAVFTA 274
Query: 363 VKKISKVEIG 372
+ K ++G
Sbjct: 275 SEMWRKHQLG 284
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 53/104 (50%), Gaps = 1/104 (0%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ D + K +D E T G ++I LFI++L + + ELFVD+ +
Sbjct: 5 VRRFDIYRKVPKDLTEPTFAGAVISICSCLFITFLFLSEFYGFIGTEIASELFVDNPTED 64
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
K+P+ L+I +P + C++ LD D G + + N+ +R ++
Sbjct: 65 DKIPVILNITLPRMKCEFPGLDIQDEMGRHEVGFKENVERREIN 108
>gi|403290258|ref|XP_003936243.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Saimiri boliviensis boliviensis]
Length = 415
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 239 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 286
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 287 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGRQQYSYQYTVANKEYVAYSH 346
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 347 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 406
Query: 367 SKVEIG 372
K+++G
Sbjct: 407 KKIQLG 412
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
L D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 130 LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 189
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 190 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 230
>gi|410349413|gb|JAA41310.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
gi|410349417|gb|JAA41312.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
Length = 290
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 48/101 (47%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P C + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNSQCRLVGLDIQDEMGRHEVGHIDNSM 105
>gi|417409674|gb|JAA51332.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein, partial [Desmodus rotundus]
Length = 318
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 142 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 189
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 190 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 249
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 250 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 309
Query: 367 SKVEIG 372
K+++G
Sbjct: 310 KKIQLG 315
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 33 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 92
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 93 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 133
>gi|390459630|ref|XP_002744599.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Callithrix jacchus]
Length = 342
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 166 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 213
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 214 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 273
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 274 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 333
Query: 367 SKVEIG 372
K+++G
Sbjct: 334 KKIQLG 339
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
L D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 57 LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 116
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 117 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 157
>gi|348575225|ref|XP_003473390.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Cavia porcellus]
Length = 345
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 169 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 216
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 217 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 276
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 277 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 336
Query: 367 SKVEIG 372
K+++G
Sbjct: 337 KKIQLG 342
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 60 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 119
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 120 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 160
>gi|395817675|ref|XP_003782285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Otolemur garnettii]
Length = 356
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 180 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 227
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 228 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 287
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 288 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 347
Query: 367 SKVEIG 372
K+++G
Sbjct: 348 KKIQLG 353
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 71 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 130
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 131 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 171
>gi|355691849|gb|EHH27034.1| hypothetical protein EGK_17136, partial [Macaca mulatta]
gi|355750428|gb|EHH54766.1| hypothetical protein EGM_15664, partial [Macaca fascicularis]
Length = 290
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
L+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 LRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|338713524|ref|XP_001499596.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Equus caballus]
Length = 356
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 180 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 227
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 228 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 287
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 288 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 347
Query: 367 SKVEIG 372
K+++G
Sbjct: 348 KKIQLG 353
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
L+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 71 LRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 130
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 131 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 171
>gi|6330243|dbj|BAA86495.1| KIAA1181 protein [Homo sapiens]
Length = 336
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 160 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 207
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 208 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 267
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 268 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 327
Query: 367 SKVEIG 372
K+++G
Sbjct: 328 KKIQLG 333
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 51 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 110
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 111 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 151
>gi|354477345|ref|XP_003500881.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Cricetulus griseus]
Length = 333
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 157 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 204
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 205 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 264
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 265 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 324
Query: 367 SKVEIG 372
K+++G
Sbjct: 325 KKIQLG 330
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
D + K +D + T G ++I C LFI +L ++ + EL+VD G
Sbjct: 51 FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 110
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 111 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 148
>gi|72534712|ref|NP_001026881.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Homo sapiens]
gi|332248275|ref|XP_003273290.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Nomascus leucogenys]
gi|426351000|ref|XP_004043047.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Gorilla gorilla gorilla]
gi|51701446|sp|Q969X5.1|ERGI1_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|15215343|gb|AAH12766.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[Homo sapiens]
gi|15680269|gb|AAH14490.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[Homo sapiens]
gi|119581826|gb|EAW61422.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1,
isoform CRA_a [Homo sapiens]
gi|208966210|dbj|BAG73119.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[synthetic construct]
gi|410301142|gb|JAA29171.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
gi|410349415|gb|JAA41311.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
Length = 290
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|402873423|ref|XP_003900575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Papio anubis]
gi|380784387|gb|AFE64069.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
gi|383408185|gb|AFH27306.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
gi|384941372|gb|AFI34291.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
Length = 290
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|350594414|ref|XP_003134100.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Sus scrofa]
Length = 313
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 137 GCRFEGQFSINKVPGNFHVS---------THSATAQPPNP---DMTHVIHKLSFGDTLQV 184
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 185 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 244
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 245 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 304
Query: 367 SKVEIG 372
K+++G
Sbjct: 305 KKIQLG 310
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGS 66
D + K +D + T G ++I C LFI +L ++ + EL+VD G
Sbjct: 31 FDIYRKVPKDLTQPTYTGAIISICCCLFIFFLFLSELTGFITTEIVNELYVDDPDKDSGG 90
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 91 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 128
>gi|149052230|gb|EDM04047.1| rCG34297 [Rattus norvegicus]
Length = 283
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 107 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 154
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 155 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 214
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 215 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 274
Query: 367 SKVEIG 372
K+++G
Sbjct: 275 KKIQLG 280
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDY 83
G K+ + L+I +P + C++
Sbjct: 65 SGGKIDVSLNISLPNLHCEH 84
>gi|297803392|ref|XP_002869580.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
lyrata]
gi|297315416|gb|EFH45839.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
lyrata]
Length = 480
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 96/214 (44%), Gaps = 37/214 (17%)
Query: 186 VQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
++ K++ S+ LK T GC+I GY+ V +V G+ ++ + S +H + S+
Sbjct: 274 LEDKSDNSSRTLKKAPSTGGCRIEGYIRVKKVPGNLMVS---ARSGSH-------SFDSS 323
Query: 245 AFNTTHHIRHLSFGIKLQDDD----ERRKPLDGTVAKAEEGASMFN-----------YYI 289
N +H + HLSFG ++ +R P G +G N +Y+
Sbjct: 324 QMNMSHVVNHLSFGQRIMPQKFSELKRLSPYLGLSHDRLDGRPFINQRDLGPNVTIEHYL 383
Query: 290 KIIPTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
+I+ T + +G L +P F +ELSP+ V ITE SKS H
Sbjct: 384 QIVKTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFI 443
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
T + I G + ++D++LH + + K+E+G
Sbjct: 444 TNVCAIIGGVFTVAGILDSILHHSMTLMKKIELG 477
>gi|392331685|ref|XP_003752358.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Rattus norvegicus]
Length = 290
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|356543934|ref|XP_003540413.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 54/198 (27%), Positives = 86/198 (43%), Gaps = 38/198 (19%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--- 258
T GC+I GY+ V +V G+ I+ + + ++ N +H I HLSFG
Sbjct: 291 TGGCRIDGYVRVKKVPGNLIISAR----------SNAHSFDASQMNMSHVINHLSFGRKV 340
Query: 259 -IKLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTI----------- 295
+++ D +R P G+ G S N +Y++I+ T
Sbjct: 341 SLRVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTIEHYLQIVKTEVITRKEYKLVE 400
Query: 296 -YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
YE S + +P F ELSP+ V ITE KS H T + I G + +
Sbjct: 401 EYEYTAHSSVAQS-LHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIIGGIFTVAGI 459
Query: 355 VDALLHSCVKKISKVEIG 372
+DA+ H+ ++ + KVE+G
Sbjct: 460 MDAIFHNTIRLMKKVELG 477
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 63/115 (54%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S ++K +D + K D E ++ G ++IV L + +L +++ Y VST+ ++ V
Sbjct: 1 MISSSKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMIFLFGMELNSYLSVSTSTQVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I +I P +SC++ A+D D G L++ + K +D + +P
Sbjct: 61 DKSSDGDYLRIDFNISFPALSCEFAAVDVSDVLGTNRLNLTKTVRKFSIDSNLRP 115
>gi|397485838|ref|XP_003814045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pan paniscus]
Length = 290
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDMLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|410949214|ref|XP_003981318.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Felis catus]
Length = 398
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 222 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 269
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 270 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 329
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 330 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 389
Query: 367 SKVEIG 372
K+++G
Sbjct: 390 KKIQLG 395
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 54/108 (50%), Gaps = 7/108 (6%)
Query: 3 FSERLKG---LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
FS+ +G D + K +D + T G ++I C LFI +L ++ + EL+
Sbjct: 106 FSKPYEGTPLFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELY 165
Query: 60 VDSS---RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
VD G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 166 VDDPDKDSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 213
>gi|395736490|ref|XP_002816264.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pongo abelii]
Length = 290
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPHLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|351705474|gb|EHB08393.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Heterocephalus glaber]
Length = 305
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 129 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 176
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 177 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQWYSYQYTVANKEYVAYSH 236
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 237 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 296
Query: 367 SKVEIG 372
K+++G
Sbjct: 297 KKIQLG 302
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 52/101 (51%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
++G D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 20 VEGFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 79
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 80 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 120
>gi|355686511|gb|AER98080.1| endoplasmic reticulum-golgi intermediate compartment 1 [Mustela
putorius furo]
Length = 312
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 137 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 184
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 185 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 244
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 245 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 304
Query: 367 SKVEIG 372
K+++G
Sbjct: 305 KKIQLG 310
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 28 FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 87
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 88 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 128
>gi|336370998|gb|EGN99338.1| hypothetical protein SERLA73DRAFT_108802 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383753|gb|EGO24902.1| hypothetical protein SERLADRAFT_449635 [Serpula lacrymans var.
lacrymans S7.9]
Length = 503
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/357 (21%), Positives = 139/357 (38%), Gaps = 66/357 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K + ++ G +TI L+ D +Y E VDS S
Sbjct: 16 LAQFDAFPKLPSTYKSRSESRGFITIFITFLAFLLVLNDFGEYIWGWPDYEFSVDSQSNS 75
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++D+ V + C L++D D G++ L++
Sbjct: 76 FMSINVDMAV-NMPCHLLSVDLRDVVGDR-LYLSKGF----------------------- 110
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ T + T L++ S A +++RK + V
Sbjct: 111 -RRDGTLFDVGQATSLKEHAAMLSARQALSQSRKSRGLLSSV----------------FR 153
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
+ + +Y C+IYG L+V +V+ + HI G Y+ N VHV +
Sbjct: 154 RSQPDYRPTYNYQADGSACRIYGTLQVKKVTANLHITTLGHGYTSN-VHVDHTK------ 206
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI---------- 295
N +H I SFG D + PLD + A++ + Y++ ++PT
Sbjct: 207 MNLSHVITEFSFGPYFPDITQ---PLDYSFEVAKDPFVAYQYFLHVVPTTFIAPRSEPLH 263
Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
Y +++ G G PGIFF ++L P+++ I +++ S L+ + + I G +
Sbjct: 264 TNQYSVTHYTRVLKGHHGTPGIFFKFDLDPMVITIHQRTTSFLQLFIRCVGVIGGVF 320
>gi|392351111|ref|XP_001066818.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Rattus norvegicus]
Length = 497
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 321 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 368
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 369 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 428
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 429 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 488
Query: 367 SKVEIG 372
K+++G
Sbjct: 489 KKIQLG 494
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-- 63
+++ D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 211 KVERFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDK 270
Query: 64 -RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 271 DSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 312
>gi|281351238|gb|EFB26822.1| hypothetical protein PANDA_005115 [Ailuropoda melanoleuca]
Length = 238
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 62 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 109
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 110 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 169
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 170 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 229
Query: 367 SKVEIG 372
K+++G
Sbjct: 230 KKIQLG 235
>gi|393221326|gb|EJD06811.1| DUF1692-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 537
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 150/348 (43%), Gaps = 78/348 (22%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELFVDSSRG 65
L DAF K + ++ G +T++ FIS+L+ V D+ +Y T + +D+ G
Sbjct: 22 LNQFDAFPKLPSTYKARSGGRGFLTVLV-AFISFLLVVNDIGEYIFGWPTYKFGLDNRPG 80
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
L I++D+VV + C +L++D D+ G++ L++ +KR
Sbjct: 81 HYLAINVDLVV-NMPCKHLSVDLRDAVGDR-LYLSDG-FKR------------------- 118
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+GT L D G ++ T + + V +A + + + DTI
Sbjct: 119 ---------DGT----LFD---IGQAQALQSHT-QALDARLAVAQARKSRGF----FDTI 157
Query: 186 VQCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQ 239
++ +N+ +K + T+ C++YG ++ +V+ + HI G Y H HV Q
Sbjct: 158 LR-RNK---DKFRPTYNYKPDGGACRVYGSIQAKKVTANLHITTAGHGYRSMH-HVDHSQ 212
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
N +H I SFG D +PL T E + Y++ ++PT Y
Sbjct: 213 ------MNLSHVITDFSFGPYFPD---MAQPLKNTFELTHEPFIAYQYFLSVVPTTYIAS 263
Query: 300 DGSKLGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSL 334
+G ++ + G PGIFF Y+L PL + I +K+ +L
Sbjct: 264 NGKQVHTSQYSVTHYTRVLQHEQGTPGIFFKYDLEPLQMTIHQKTTTL 311
>gi|297830752|ref|XP_002883258.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
lyrata]
gi|297329098|gb|EFH59517.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 93/204 (45%), Gaps = 39/204 (19%)
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
K T GC++ GY+ V +V G+ I+ H H + S+ N +H + HLSF
Sbjct: 287 KAPVTGGCRVEGYVRVKKVPGNLVISA-------HSGAHS---FDSSQMNMSHVVSHLSF 336
Query: 258 G----IKLQDDDERRKP--------LDGT--VAKAEEGASM-FNYYIKIIPT-IYERLDG 301
G +L D +R P LDG + + E GA++ +Y++I+ T + R G
Sbjct: 337 GRMISPRLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQIVKTEVITRRSG 396
Query: 302 SKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ +P F +ELSP+ + ITE KS H T + I G
Sbjct: 397 QEHSLIEEYEYTAHSSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGV 456
Query: 349 YITFMLVDALLHSCVKKISKVEIG 372
+ ++D++ H+ V+ I KVE+G
Sbjct: 457 FTVAGILDSIFHNTVRLIKKVELG 480
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 61/110 (55%), Gaps = 1/110 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MV S +LK +D + K D E ++ G ++IV LF+ +L +++ Y +V+TT + V
Sbjct: 1 MVSSTKLKSVDFYRKIPRDLTEASLSGAGLSIVAALFMMFLFGMELSSYLEVNTTTAVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
D SS G L I +I P +SC++ ++D D G L++ I K +D
Sbjct: 61 DKSSDGDFLRIDFNISFPALSCEFASVDVSDVLGTNRLNITKTIRKFPID 110
>gi|226497610|ref|NP_001145501.1| uncharacterized protein LOC100278902 [Zea mays]
gi|195657145|gb|ACG48040.1| hypothetical protein [Zea mays]
Length = 110
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 56/99 (56%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK L+AF E +KT G VTI+ L + L ++ Y T ++ VD RG
Sbjct: 7 LKSLNAFPHAEEHLLKKTYSGAVVTILGLLIMITLFVHELQFYLTTYTVHQMSVDLKRGE 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
LPIH+++ P++ C+ L++DA+D SG+ + + NI+K
Sbjct: 67 TLPIHVNMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWK 105
>gi|389749487|gb|EIM90658.1| DUF1692-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 533
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 73/361 (20%), Positives = 138/361 (38%), Gaps = 68/361 (18%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E +K DAF K + ++ G +TI L+ D+ ++ E VD
Sbjct: 18 ESIKSFDAFPKLPATYKSRSESRGFLTIFVAFLAFLLVLNDIGEFIWGWPDHEFAVDRDD 77
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
S + +++D+VV + C +L++D D G+ RL L
Sbjct: 78 SSFMNVNVDLVV-NMPCRWLSVDLRDVVGD------------RLFLS------------K 112
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
++ + G T A E K +T V+++ + + + D
Sbjct: 113 GFRRDGTLFDIGQAT--------------ALKEHAKALSTRQAVRQSRKSRGF----FDL 154
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTS 243
+ ++ Y C++YG LEV +V+ + HI G Y+ + VHV +
Sbjct: 155 FRRSQDIYKPTYNYQADGSACRVYGSLEVKKVTANLHITSLGHGYA-SKVHVDHTK---- 209
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
N +H I SFG D + PLD + + + + Y+++++PT Y +
Sbjct: 210 --INMSHVITEFSFGPHFPDIVQ---PLDNSFEITHDHFTAYQYFMRVVPTTYVAPRSAP 264
Query: 304 LGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
L G PGIFF +E+ P+ + +++ + + + + G +
Sbjct: 265 LNTNQYSVTHYTRTFEQHSGLAPGIFFKFEIEPVRLIQHQRTTTFAQFFVRWAGVVGGVF 324
Query: 350 I 350
+
Sbjct: 325 V 325
>gi|440798302|gb|ELR19370.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 328
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 94/205 (45%), Gaps = 44/205 (21%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAP----------GLSYSINHVHVHDIQP---YTSA----A 245
GC I GY+ V +V G+FH++ + ++IN D P Y S A
Sbjct: 120 SGCSIAGYINVPKVPGNFHLSTHGRNVQAQDIDMQHNINSFFFTD-SPRVFYPSGVSVPA 178
Query: 246 FNTTHH--IRHLSFGIKLQDDDERR----KPLDGTV---AKAEEGASM-FNYYIKIIPTI 295
+ H + L+ + QD D+ +PLDG ++ + G + + YYI+I+PTI
Sbjct: 179 WRNWHSNVVAELNAQARDQDTDDDVVGLFRPLDGITKANSQRKNGVGVSYEYYIQIVPTI 238
Query: 296 YERLDG------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
E DG + + +G P ++F Y++SP+ VKIT SLGH + +C
Sbjct: 239 LEFPDGRTKHTYQFTYNFNDVATPEGKTPSVYFKYDISPITVKITRGRGSLGHFLLQ-LC 297
Query: 344 NISGTYITFMLVDALLHSCVKKISK 368
I G T V L+ S +++K
Sbjct: 298 AIVGGIFT---VSGLIASVTARVAK 319
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 1/88 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG- 65
LK D + + +D + +V G V++VC ++ LI +V Y + T ++ VD+ R
Sbjct: 9 LKSFDLYRRVPKDLTKGSVPGAIVSLVCLTIMAMLISWEVYCYASIKTETQMLVDTPRNL 68
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSG 93
K+ I++++ VP I C +ALD D G
Sbjct: 69 EKIRININVTVPRIPCYVIALDTEDVLG 96
>gi|356517290|ref|XP_003527321.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 84/195 (43%), Gaps = 36/195 (18%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL-- 261
GC++ GY+ V +V G+ I+ D + ++ N +H I +LSFG K+
Sbjct: 293 GCRVEGYVRVKKVPGNLIISAR----------SDAHSFDASQMNMSHFINNLSFGKKVTP 342
Query: 262 --QDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL---- 304
D + P G+ G S N +YI+I+ T +G KL
Sbjct: 343 RAMSDVKLLIPYIGSSHDRLNGRSFTNTHDLGANVTIEHYIQIVKTEVVTRNGYKLIEEY 402
Query: 305 -------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
+P F ELSP+ V ITE +S H T + I G + ++D+
Sbjct: 403 EYTAHSSVAHSVDIPAAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAGILDS 462
Query: 358 LLHSCVKKISKVEIG 372
+LH+ ++ + KVE+G
Sbjct: 463 ILHNTIRMMKKVELG 477
>gi|451847161|gb|EMD60469.1| hypothetical protein COCSADRAFT_98785 [Cochliobolus sativus ND90Pr]
Length = 395
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 86/384 (22%), Positives = 144/384 (37%), Gaps = 90/384 (23%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + + + A T+ + YL ++ ++ +TT+ ++
Sbjct: 22 VSSFDAFPKTKKTYLVQGRNSSAWTVTLIITCIYLTWSEIARWYAGTTTQSFTIEKGVSH 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I+LDI+V + C L ++ D++G++ L E +
Sbjct: 82 DMQINLDIIV-AMKCADLHVNMQDAAGDRTLAGE------------------------LL 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K DP G TE +E + ++++
Sbjct: 117 RK---------------DPTSWSQWTGKNTEKGTHELGKDETTQIPEWEEYGDVHEHLGK 161
Query: 187 QCKNEYS-TEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
K ++S T KL+ T+ C+IYG L N+V G FHI A G Y H+ +
Sbjct: 162 ATKKKFSKTPKLRGP-TDSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHLE------HS 214
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS---MFNYYIKIIPTIYE---- 297
+FN +H IR +SFG PLD T+A A F YY+ I+PTIY
Sbjct: 215 SFNFSHIIREMSFGPYYP---SLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPA 271
Query: 298 -------------------------------RLDGSKLGGGDGGMPGIFFSYELSPLMVK 326
+ D +PGIF +++ P+M+
Sbjct: 272 LMPIMESMVSTNDQPSSNMFRMAHAIKTNQYAVTSQSHKVDDSYVPGIFVKFDIEPIMLA 331
Query: 327 ITEKSKSLGHLWTKIMCNISGTYI 350
I E+SKS L ++ +SG +
Sbjct: 332 IVEESKSFWKLVITLVNVVSGVMV 355
>gi|145524934|ref|XP_001448289.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415833|emb|CAK80892.1| unnamed protein product [Paramecium tetraurelia]
Length = 324
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 97/231 (41%), Gaps = 27/231 (11%)
Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
N + E KK+ DTI+ ++ +K I GY+ VN+V G+F
Sbjct: 99 VVNVEEQRMERQFLKKFIQIMKDTIIIINHQQILRDVK--------IAGYIIVNKVPGNF 150
Query: 221 HIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLS-----FGIKLQDDDERRKPLDGT- 274
H++ I H Q T +T HL IK Q PLD T
Sbjct: 151 HVSAHAFGGILHQVFQRSQISTLDLSHTYQSYSHLVKKDDLVKIKKQFQKGVLNPLDNTK 210
Query: 275 -VAKAEEGASM-FNYYIKIIPTIYERLDGS-----KLGGGDG-----GMPGIFFSYELSP 322
+A+ + G M F YYI ++PT Y + G+ + +P ++F Y+LSP
Sbjct: 211 KIAQPQGGTGMMFQYYISVVPTTYIDVSGNEYYVHQFTANSNEVQTDHLPAVYFRYDLSP 270
Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH-SCVKKISKVEIG 372
+ VK + +S H +I + G + ++D ++H S V + K E+G
Sbjct: 271 VTVKFLQYRESFLHFLVQICAILGGVFTIASIIDGMIHKSVVALLKKYEMG 321
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 54/104 (51%), Gaps = 1/104 (0%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR- 64
RL+ LD + K D E T G ++++ + I L ++ Y +V + E+FVD +R
Sbjct: 8 RLRKLDIYRKLPADLTEPTTAGALISVISTIVIVILFTTELQAYIEVDNSSEMFVDINRG 67
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
G ++ ++LDI CD L+LD D G ++VE +R+
Sbjct: 68 GEQIRVNLDIEFHKFPCDILSLDVQDIMGSHVVNVEEQRMERQF 111
>gi|348667045|gb|EGZ06871.1| hypothetical protein PHYSODRAFT_319561 [Phytophthora sojae]
Length = 469
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 95/231 (41%), Gaps = 46/231 (19%)
Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTF------------TEGCQIYGYLEVNRVSGSF 220
++K+ E+D + K E + KN EGC++YG+L V RV G+F
Sbjct: 247 KFKQLMAGEVDAVEARKKELFEQDKKNAREQGKAIARSAVGPEGCRLYGHLYVKRVPGNF 306
Query: 221 HI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA--- 276
H+ +YS++ S+ N +H + L FG L + P D +
Sbjct: 307 HVHLANPAYSMD-----------SSLVNASHTVNELWFGEHLTSGEMSMLPRDAQMQLYT 355
Query: 277 ---KAEEGASMFN-----YYIKIIPTIYERLDGSKLGG-----------GDGGMPGIFFS 317
++ S + +YIK++ Y + D + + +P I F
Sbjct: 356 HRLDNQDYTSFYKNHTYVHYIKVVTNSYVQSDAADINVYKYTAHSNEYLETDDLPSIMFR 415
Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
Y+LSP+ V+I+E S H T I G + ++D ++H + ++K
Sbjct: 416 YDLSPMSVRISEDSVPFYHFLTSACAIIGGVFTVIGILDQIIHQTARALNK 466
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 57/117 (48%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK D + K ED T+ G +++I + L ++ Y V ++ +D
Sbjct: 7 LKKWDFYKKIPEDLTVSTLPGVSLSIAGCFIMFLLFILEFNSYLTVDYKYDIVMDEGLDQ 66
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
+ I+ +I VP + C++ +D D +G + ++ NIYK RLD G+ + Q++ +
Sbjct: 67 TMRINFNITVPDLPCEFATVDVSDMTGTRKHNMTSNIYKIRLDQKGRSVGLAQEKQI 123
>gi|366997520|ref|XP_003678522.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
gi|342304394|emb|CCC72184.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
Length = 347
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 133/353 (37%), Gaps = 70/353 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K E++ +K+ GG TI +LF+ ++ + YF ++ VD+
Sbjct: 4 LKSFDAFPKTDEEYTKKSTKGGLSTIATYLFLLFIAWSEFGSYFGGFVEQKYVVDNQVRE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
I+LDI V T +C L + D + + + E L + P VN +
Sbjct: 64 VTEINLDIYVNT-TCRLLDVRVFDETKDMRMVSEE------LSFEDMVFFIPFGVKVN-L 115
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ VT + +E + +G ++R+ N + ALP
Sbjct: 116 MNEIVTADIDKILSE-----AVPAQFGPRVDSREFLNQGTD--------DVALPL----- 157
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
EYS C I+G + VNRV+G F I + H QP +
Sbjct: 158 ----EYS----------ACHIFGSIPVNRVAGEFQIT--------TIDRH--QPIENVV- 192
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA-EEGASMFNYYIKIIPTIYERLD----- 300
+ TH I SFG D PLD T +E + + Y++ ++PTIY ++
Sbjct: 193 DFTHVINEFSFGDFFPYVD---NPLDSTAKYVPDEKLTSYQYHLSVVPTIYNKMGVLINT 249
Query: 301 ----------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
+ D PGIF Y L + + ++ +++
Sbjct: 250 NQYSLSEYHYKNITNANDKNSPGIFIKYNFESLTIIVNDRRLGFTQFLIRLIA 302
>gi|440902711|gb|ELR53466.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1,
partial [Bos grunniens mutus]
Length = 290
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FH++ H QP + TH I LSFG LQ
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
L+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 5 LRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|366998832|ref|XP_003684152.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
gi|357522448|emb|CCE61718.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
Length = 349
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 81/360 (22%), Positives = 135/360 (37%), Gaps = 73/360 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K E +K+ GG +I+ + F+ + + YF ++ VD
Sbjct: 4 LKTFDAFPKTEERHVKKSKKGGLSSILTYAFLLLIAWTEFGSYFGGYIDKQYSVDKDIRK 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++DI V + C++L ++ +D + ++ + E I+ + P P VN +
Sbjct: 64 VVQINMDIYV-KMPCEWLHVNVLDDTNDRKIVSEELIF------EDMPFFVPHGSKVNNL 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
N T EL+D AE E +E K P+ I
Sbjct: 117 --------NKVVTPELDD-------ILAEA-------IPAEFREKIETKPLLGPDGKPIF 154
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ GC +YG + VNRV+G I A G Y D+
Sbjct: 155 ELT--------------GCHVYGSVTVNRVAGEMQITAKGYGYRDRKRAPKDL------- 193
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLDG--- 301
+ H + SFG PLDGT S +NY++ ++PT Y++L
Sbjct: 194 IDFNHVVNEFSFG---DFYPYIENPLDGTCKMYPNSPFSSYNYFMSVVPTFYQKLGAEID 250
Query: 302 ---------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
S + +PGIF Y+ PL + I++ + +++ +S
Sbjct: 251 TNQYSIREYHVDLKNSNVNAKLSTIPGIFLKYDFEPLAIIISDVRLTFLQFIVRLVAILS 310
>gi|327265232|ref|XP_003217412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Anolis carolinensis]
Length = 291
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 85/187 (45%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC+ + +N++ G+FH++ H QP + TH I LSFG +LQ
Sbjct: 114 DGCRFESHFSINKIPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDQLQ 161
Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK--------------LGG 306
R L+G + + +Y +KI+PT+YE + G + +
Sbjct: 162 AQKIRGSFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQQYPFQYTVANKEYVVYS 221
Query: 307 GDGGM-PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G + P I+F Y+L+P+ +K E+ + L T I I GT+ + D+ + + +
Sbjct: 222 HTGRITPAIWFRYDLTPITLKYIERRQPLYRFITTICAIIGGTFTVAGIFDSCIFTASEA 281
Query: 366 ISKVEIG 372
K+++G
Sbjct: 282 WKKIQLG 288
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/98 (23%), Positives = 48/98 (48%), Gaps = 4/98 (4%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV---DSSRGS 66
D + K +D + T G +++ C FI +L+ ++ + EL+V D
Sbjct: 9 FDIYRKVPKDLTQPTFTGAIISVCCCFFILFLLLSELTGFIATEVVNELYVEDPDKDSSG 68
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 69 KIEVTLNISLPNLHCELIGLDIQDEMGRHEIGHIDNSV 106
>gi|426246271|ref|XP_004016918.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Ovis aries]
Length = 290
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FH++ H QP + TH I LSFG LQ
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|115497382|ref|NP_001069885.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Bos
taurus]
gi|111308658|gb|AAI20358.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Bos
taurus]
Length = 290
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FH++ H QP + TH I LSFG LQ
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|296475934|tpg|DAA18049.1| TPA: endoplasmic reticulum-golgi intermediate compartment 32 kDa
protein [Bos taurus]
Length = 290
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FH++ H QP + TH I LSFG LQ
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 161
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ L G + +Y +KI+PT+YE G + +
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSH 221
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281
Query: 367 SKVEIG 372
K+++G
Sbjct: 282 KKIQLG 287
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|389640739|ref|XP_003718002.1| hypothetical protein MGG_00949 [Magnaporthe oryzae 70-15]
gi|351640555|gb|EHA48418.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae 70-15]
gi|440464580|gb|ELQ33987.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae Y34]
gi|440481695|gb|ELQ62250.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae P131]
Length = 376
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 71/364 (19%), Positives = 139/364 (38%), Gaps = 63/364 (17%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + +T GG T+ L + L ++ +++ T V+ G
Sbjct: 22 VSAFDAFPKSKPQYVTRTSGGGKWTVAMLLVSAILTWSELARWWRGVETHTFAVEKGVGQ 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++D VV + C Q +HV +Q+ + + A
Sbjct: 82 SMQINMDTVV-HMRC-------------QDIHVN--------------VQDAAGDRIMAA 113
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ K+ + ++ G + T + + + ++ +
Sbjct: 114 ARLKMDDTTWAQWVDGSGVHRLGHDQHGKVVTGEGHEEGFGEEHIH--------DIVALG 165
Query: 187 QCKNEYS-TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
+ + +S T +L + C+I+G L++N+V G FHI + H ++ +A
Sbjct: 166 KKRARWSKTPRLWGATPDSCRIFGSLDLNKVQGDFHIT-----ARGHGYIEFGDHLDHSA 220
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
FN +H + SFG PLD TV E+ F Y++ ++PT+Y +
Sbjct: 221 FNFSHIVNEFSFGDFYP---SLVNPLDKTVNTCEKNFHKFQYFLSVVPTLYSVKSSTGAF 277
Query: 306 G------------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
G + +PGIFF Y++ P+++ I E ++ K++ +SG
Sbjct: 278 GYSTIFTNQYAVTEQSSEISEMNVPGIFFKYDIEPILLDIEESRDTILVFLIKVINILSG 337
Query: 348 TYIT 351
+
Sbjct: 338 AMVA 341
>gi|397641928|gb|EJK74922.1| hypothetical protein THAOC_03372 [Thalassiosira oceanica]
Length = 583
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 89/202 (44%), Gaps = 51/202 (25%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----- 258
GCQ+ G+L VNRV G+FHI + S+NH +A N TH + H+SFG
Sbjct: 388 GCQVSGHLMVNRVPGNFHIE---AKSVNH-------NLNAAMTNLTHRVNHISFGEPITK 437
Query: 259 ------------------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------ 294
++ ++ ++ P+D + F++YIK++ T
Sbjct: 438 LPYHMENTPFMRKVKRVLKQVPEEHKQFNPMDDQEYITTQFHQAFHHYIKVVSTHLNMGS 497
Query: 295 -----------IYERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
+Y+ L+ S++ D +P FSY++SP+ V + ++ + T +
Sbjct: 498 SSTVNDVNSITVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWYDYLTSLC 557
Query: 343 CNISGTYITFMLVDALLHSCVK 364
I GT+ T L+DA L+ K
Sbjct: 558 AIIGGTFTTLGLIDATLYKVFK 579
>gi|451997913|gb|EMD90378.1| hypothetical protein COCHEDRAFT_27091 [Cochliobolus heterostrophus
C5]
Length = 395
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/409 (22%), Positives = 154/409 (37%), Gaps = 88/409 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + + + A T+ + YL ++ ++ +TT+ ++
Sbjct: 22 VSSFDAFPKTKKTYLVQGRNSSAWTVTLIITCIYLTWSEIARWYAGTTTQSFTIEKGVSH 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I+LDI+V + C L ++ D++G++ L E L P Q N
Sbjct: 82 DMQINLDIIV-AMKCADLHVNMQDAAGDRTLAGEL--------LRKDPTSWSQWTGKN-- 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
TE GT +D + E + + + +A
Sbjct: 131 ------TEKGTHELGKDDTTQI-------PEWEEYGDVHEHLGKA--------------- 162
Query: 187 QCKNEYS-TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
K ++S T KL+ T+ C+IYG L N+V G FHI + H ++ + ++
Sbjct: 163 -TKKKFSKTPKLRGP-TDSCRIYGNLVGNKVQGDFHIT-----ARGHGYMEFGEHLDHSS 215
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS---MFNYYIKIIPTIYE----- 297
FN +H IR +SFG PLD T+A A F YY+ I+PTIY
Sbjct: 216 FNFSHIIREMSFGPYYP---SLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPSL 272
Query: 298 ------------------------------RLDGSKLGGGDGGMPGIFFSYELSPLMVKI 327
+ D +PGIF +++ P+M+ I
Sbjct: 273 MPLMESVVSTNDQPSSNMFRMAHAIKTNQYAVTSQSHKVDDTYVPGIFVKFDIEPIMLAI 332
Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
E+SKS L ++ +SG + V + + + K + G V
Sbjct: 333 VEESKSFWKLLITLVNVVSGVMVAGSWVWQMFDWASEFVGKRKRRGDGV 381
>gi|156841160|ref|XP_001643955.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
70294]
gi|156114586|gb|EDO16097.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
70294]
Length = 349
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 87/368 (23%), Positives = 148/368 (40%), Gaps = 78/368 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K E +K+V GG +I+ + + + + YF E+ VD +
Sbjct: 4 LKTFDAFPKTEERHVKKSVNGGLSSILTYFMLLLIAWTEFGSYFGGYIDEQYSVDPTIRE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++D+ + + C + ++A+D + ++ L + P P
Sbjct: 64 TVQINMDMYI-KMPCQLIHVNAMDET------MDRKFVSNELIFEDMPFFVPYG------ 110
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
KV +N + L++ G AE +E +K + + +
Sbjct: 111 --TKVNNKNDIVSPGLDE--IIGEAIPAE------------FREKLDFKSQVDADGNPLF 154
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ +GC IYG +++NRV+G A G Y N P
Sbjct: 155 KV--------------DGCHIYGSVKLNRVAGELQFTAKGWGYRDNGR-----APLDQID 195
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM--FNYYIKIIPTIYERLDGSK 303
FN H I SFG D PLDGT AK E+ S+ + Y ++PTI+++L G++
Sbjct: 196 FN--HVINEFSFGDFYPYID---NPLDGT-AKIEKQKSISRYIYSTSVVPTIFQKL-GAE 248
Query: 304 L------------GGGDG------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
+ DG +PGIFF Y+ PL + I++K S +++ +
Sbjct: 249 VDTNQYSLAEYHTAPKDGKIKLTTSIPGIFFRYDFEPLSIVISDKRLSFVQFIVRLVAIL 308
Query: 346 SGTYITFM 353
S +I +M
Sbjct: 309 S--FILYM 314
>gi|356549839|ref|XP_003543298.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/249 (26%), Positives = 104/249 (41%), Gaps = 42/249 (16%)
Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
Y + +T T + + + LP D KN TE+ + T GC+I GY
Sbjct: 244 SYYGDRDTDSLVKTMENLVASLPSESQKLPLEDKSDVAKN---TERPAPS-TGGCRIDGY 299
Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ----DDDE 266
+ V +V G+ L +S + + ++ N +H I HLSFG K+ D +
Sbjct: 300 VRVKKVPGN------LIFSARS----NAHSFDASQMNMSHVINHLSFGRKVSPRVMSDVK 349
Query: 267 RRKPLDGTVAKAEEGASMFN-----------YYIKIIPTI------------YERLDGSK 303
R P G+ G S N +Y++I+ T YE S
Sbjct: 350 RLIPYVGSSHDRLNGRSFINTHDLGANVTMEHYLQIVKTEVITRKDYKLVEEYEYTAHSS 409
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
+ +P F ELSP+ V ITE KS H T + + G + ++DA+LH+ +
Sbjct: 410 VAQS-LHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIVGGIFTVAGIMDAILHNTI 468
Query: 364 KKISKVEIG 372
+ + KVE+G
Sbjct: 469 RLMKKVELG 477
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 63/115 (54%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S ++K +D + K D E ++ G ++IV L + +L +++ Y V+T+ ++ V
Sbjct: 1 MISSSKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMIFLFGMELNSYLSVTTSTQVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I +I P +SC++ A+D D G L++ + K +D + +P
Sbjct: 61 DKSSDGDYLRIDFNISFPALSCEFAAVDVSDVLGTNRLNLTKTVRKFSIDSNLRP 115
>gi|363738942|ref|XP_414530.3| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 1 [Gallus gallus]
Length = 291
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 86/188 (45%), Gaps = 30/188 (15%)
Query: 203 EGCQIYGYLEVNRVSG-SFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
+GC+ G+ +N+VS H++ H QP + TH I LSFG KL
Sbjct: 113 DGCRFEGHFSINKVSPWXLHVS---------THSATAQPQNP---DMTHIIHKLSFGDKL 160
Query: 262 QDDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGG 306
Q + L+G + + +Y +KI+PT+YE + G + +
Sbjct: 161 QVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAY 220
Query: 307 GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
G +P I+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + +
Sbjct: 221 SHTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASE 280
Query: 365 KISKVEIG 372
K+++G
Sbjct: 281 AWKKIQLG 288
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
+ D + K +D + T G +++ C LFI +L ++ + EL+VD
Sbjct: 5 FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ ++L+I +P + C+ + LD D G + H+++++
Sbjct: 65 SGGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105
>gi|407927953|gb|EKG20833.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
Length = 366
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/360 (21%), Positives = 141/360 (39%), Gaps = 69/360 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + + ++T G T++ + +L + ++ T+ V+ G
Sbjct: 20 LQAFDAFPKTKKTYLQQTTQGANWTLLLIVTCVWLSITETRRWWTGETSHTFSVEKGVGH 79
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
++ I+LDIVV + C L ++ D+SG++ L G + + + V
Sbjct: 80 EMQINLDIVV-AMRCRDLHVNIQDASGDR-------------ILAGVALAKDDTRWLQWV 125
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+K K + +E RY + + +
Sbjct: 126 EKSK------------------------------NVHKLERSQEQKRYDEEDVHDYLGAS 155
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ K T + + + C+IYG L+ NRV G FHI A G Y H+ Q
Sbjct: 156 KSKKFPKTPRYRGV-PDSCRIYGSLDANRVQGDFHITARGHGYMEFGEHLDHSQ------ 208
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIYERLDGS 302
FN +H I LSFG PLD T A ++ F YY+ ++PT+Y +
Sbjct: 209 FNFSHQINELSFGPYYP---SLTNPLDYTRAVTPTPDDHFYKFQYYLSVVPTVYTDNSHT 265
Query: 303 KLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+ + +PG+F +++ P+ + I+E + L +++ +SG +
Sbjct: 266 IVTNQYAVTEQSHSVPEMSVPGVFVKFDIEPIKLTISEYNGGFLALLIRLVNVVSGVMVA 325
>gi|345567560|gb|EGX50490.1| hypothetical protein AOL_s00075g219 [Arthrobotrys oligospora ATCC
24927]
Length = 354
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 85/176 (48%), Gaps = 27/176 (15%)
Query: 205 CQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
C+I+G ++VNRV G FHI A G Y HV HD FN +H + LSFG +
Sbjct: 164 CRIWGSMDVNRVMGDFHITAKGHGYWDPGQHVDHD-------TFNFSHVVNELSFG---E 213
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGG-------M 311
+ PLDG + E+ + Y++ ++PT Y+ L ++ + G +
Sbjct: 214 FYPKLVNPLDGVASVTEDKFYRYQYFMSVVPTTYKAHGRTLQTNQYSVTEQGRSMNPQSV 273
Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT----FMLVDALLHSCV 363
PGIFF +++ P+M+ IT+ +L ++ I G + + + D +L S +
Sbjct: 274 PGIFFKFDIEPIMLTITDTHTPWIYLIVRLANVIGGVMVAGGWLYKISDGVLGSVL 329
Score = 45.1 bits (105), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 1/93 (1%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E LK DAF K + ++ GG +T+V +L+ ++ Y + E V
Sbjct: 8 EGLKSFDAFPKTRVSYTTRSSKGGVITMVFVAICVWLVWGELSLYLDGKSEEHFSVQGGE 67
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
G + I+LD++V + CD L ++ D++G++ L
Sbjct: 68 GHFMQINLDVIV-AMPCDSLHVNVQDAAGDRIL 99
>gi|219111363|ref|XP_002177433.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411968|gb|EEC51896.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 520
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 88/192 (45%), Gaps = 43/192 (22%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----- 258
GC I G+L ++RV G+FHI + HD+ P+ + N +H + HLS G
Sbjct: 338 GCNIAGHLLLDRVPGNFHIQARSPH-------HDLVPHMT---NVSHVVHHLSIGEPVAE 387
Query: 259 -------IKLQDDDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGD-- 308
+ L +D +R+ KP++G +E +++Y+K+I T +DG K G D
Sbjct: 388 RLIEQEKVILPEDVKRKLKPMNGNAYVTKELHEAYHHYLKVITT---NVDGLKFGKRDLR 444
Query: 309 ---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
+P F ++LSP+ V S+ +T I+ I GT+
Sbjct: 445 AYQILQSSQLSFYRNDIIPEAKFVFDLSPVAVSYRTTSRRWYDYFTSILAIIGGTFTVVG 504
Query: 354 LVDALLHSCVKK 365
L+++ +H+ V +
Sbjct: 505 LLESTIHATVAR 516
>gi|18402672|ref|NP_566664.1| protein PDI-like 5-3 [Arabidopsis thaliana]
gi|75273652|sp|Q9LJU2.1|PDI53_ARATH RecName: Full=Protein disulfide-isomerase 5-3; Short=AtPDIL5-3;
AltName: Full=Protein disulfide-isomerase 12;
Short=PDI12; AltName: Full=Protein disulfide-isomerase
8-1; Short=AtPDIL8-1; Flags: Precursor
gi|11994143|dbj|BAB01164.1| unnamed protein product [Arabidopsis thaliana]
gi|15215847|gb|AAK91468.1| AT3g20560/K10D20_9 [Arabidopsis thaliana]
gi|332642877|gb|AEE76398.1| protein PDI-like 5-3 [Arabidopsis thaliana]
Length = 483
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 61/110 (55%), Gaps = 1/110 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MV S +LK +D + K D E ++ G ++IV LF+ +L +++ Y +V+TT + V
Sbjct: 1 MVSSTKLKSVDFYRKIPRDLTEASLSGAGLSIVAALFMMFLFGMELSSYLEVNTTTAVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
D SS G L I +I P +SC++ ++D D G L++ + K +D
Sbjct: 61 DKSSDGDFLRIDFNISFPALSCEFASVDVSDVLGTNRLNITKTVRKFPID 110
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 91/204 (44%), Gaps = 39/204 (19%)
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
K T GC++ GY+ V +V G+ I+ H H + S+ N +H + H SF
Sbjct: 287 KGPVTGGCRVEGYVRVKKVPGNLVISA-------HSGAHS---FDSSQMNMSHVVSHFSF 336
Query: 258 G----IKLQDDDERRKP--------LDGT--VAKAEEGASM-FNYYIKIIPT-IYERLDG 301
G +L D +R P LDG + + E GA++ +Y++ + T + R G
Sbjct: 337 GRMISPRLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQTVKTEVITRRSG 396
Query: 302 SKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
+ +P F +ELSP+ + ITE KS H T + I G
Sbjct: 397 QEHSLIEEYEYTAHSSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGV 456
Query: 349 YITFMLVDALLHSCVKKISKVEIG 372
+ ++D++ H+ V+ + KVE+G
Sbjct: 457 FTVAGILDSIFHNTVRLVKKVELG 480
>gi|449542382|gb|EMD33361.1| hypothetical protein CERSUDRAFT_117979 [Ceriporiopsis subvermispora
B]
Length = 530
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 134/357 (37%), Gaps = 68/357 (19%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
DAF K + ++ G +T+ L+ D+ +Y + VDS S L
Sbjct: 27 FDAFPKLPTTYKARSESRGFLTLFVAFAAFLLVLNDLGEYIWGWPVYDFTVDSDPSSDLK 86
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
I++D++V + C YL++D D+ G+ RL L NA ++
Sbjct: 87 INVDMMV-NMPCAYLSVDLRDAMGD------------RLYLS------------NAFRRD 121
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
+ G TT E + A R+ + + + + +
Sbjct: 122 GTKFDIGQATTLQE--------HAAALSARQVIAQSRKSRGFFS---------NLFRRTN 164
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAAFNT 248
Y C+++G + +V+ + HI G Y+ H HV + N
Sbjct: 165 GGYKATYNHQPDGSACRVFGSITAKKVTANLHITTLGHGYA-THSHVDHSK------MNL 217
Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGD 308
+H I SFG D + PLD + A + + Y++ ++PT Y S L
Sbjct: 218 SHVITEFSFGPHFPDITQ---PLDNSFEVAHDPFVAYQYFLHVVPTTYIAPRSSPLHTHQ 274
Query: 309 GGM---------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+ PGIFF ++L PL +KI +++ SL L + + I G ++
Sbjct: 275 YSVTHYTRILDPSHHRHTPGIFFKFDLDPLAIKIEQRTTSLVQLAIRCVGVIGGVFV 331
>gi|302675040|ref|XP_003027204.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
gi|300100890|gb|EFI92301.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
Length = 528
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/353 (21%), Positives = 136/353 (38%), Gaps = 64/353 (18%)
Query: 2 VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
V L LDAF K + ++ G +T+ L+ D+ +Y E VD
Sbjct: 11 VLPPGLAKLDAFPKLPGTYKARSESRGFLTLFVAFICFILVFNDISEYIWGWPDYEFSVD 70
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
S + I++D+VV + C ++++D D+ G++ H + RR DG
Sbjct: 71 RHSSSFMNINVDMVV-NMPCRFISVDLRDAVGDRLFLSNHGL--RR---DG--------- 115
Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
T + T+L++ + S A + RK + +
Sbjct: 116 ----------TKFDVGQATKLKEHARALSAREAVAQGRKNRGLFSGLFGG---------- 155
Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQP 240
+ K+ + C+++G LEV +V+ + HI G Y+ H +
Sbjct: 156 -----KSKDLFPPTYNYEPHGSACRVWGSLEVKKVTANLHITTAGHGYASREHADHKV-- 208
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
N TH I SFG D + PLD T A++ + YY+ ++PT Y
Sbjct: 209 -----MNLTHVISEFSFGPHFPDIVQ---PLDYTFEVAKDPFVAYQYYLHVVPTTYIAPR 260
Query: 301 GSKLGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
+ L + PGIFF +++ PL ++I +++ S L+ +
Sbjct: 261 SAPLSTNQYSVTHYKKVFEHNQATPGIFFKFDIDPLAIQIHQRTTSFARLFIR 313
>gi|366987569|ref|XP_003673551.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
gi|342299414|emb|CCC67168.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
Length = 355
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 87/377 (23%), Positives = 146/377 (38%), Gaps = 74/377 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K E +K+ GG TI+ ++F ++ + YF E VD
Sbjct: 6 LRVFDAFPKTEEQHEKKSTKGGVSTILIYIFAIFIAWSEFGSYFGGFVGERYVVDGDVKE 65
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++D+ V I C ++ ++ D + ++ L E L+ + P P +N +
Sbjct: 66 TVSINMDLFV-NIPCKWITVNVRDQTMDRKLASEE------LNFEEMPFFIPFDVRINDI 118
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ T +L++ G AE + + + Y LP+ +
Sbjct: 119 AE--------IITPQLDE--ILGEAIPAEFREKLDTRMYYDENDPETYNN--LPDFN--- 163
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
GC I+G L VNRV+G I A G Y+ + P
Sbjct: 164 -----------------GCHIFGSLPVNRVAGELQITAKGYGYA-----DRERTPMDQIK 201
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGSKL 304
FN H I SFG D PLD + E + ++Y + +IPT + +L G+++
Sbjct: 202 FN--HVINEFSFGDFYPYID---NPLDKSAKFDLETPKTAYSYDLSVIPTTFRKL-GTEV 255
Query: 305 G------------GGD------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
G D G +PGIFF Y L + +++ + +++ +S
Sbjct: 256 NTFQYSVAEYHYKGKDSPVPRSGRVPGIFFDYNFESLSIIVSDSRLNFIQFIIRLIAILS 315
Query: 347 -GTYIT---FMLVDALL 359
YI F L D L+
Sbjct: 316 FALYIASWIFTLGDLLI 332
>gi|118386954|ref|XP_001026594.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila]
gi|89308361|gb|EAS06349.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila
SB210]
Length = 712
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/205 (30%), Positives = 95/205 (46%), Gaps = 16/205 (7%)
Query: 5 ERLKGLDAFTKPYEDFH-EKTVYGGAVTI-VCWLFISYLICVDVCDYFQVSTTEELFVDS 62
ER K D F K +D EKT+ GG + +L I+ +I +F T +
Sbjct: 2 ERFKQFDYFRKVQDDLKSEKTLIGGLIGFSTIFLVITLVIYETYQVFFGNYKTFPFINNY 61
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH-NIYKRRLDLDGKPIQEPQKE 121
+ K+ ++L+I I C L++D D SG HL H ++K RLD GK I +
Sbjct: 62 NPNEKVRVNLNITFEEIFCKALSVDYQDVSGA-HLEDMHWTVHKIRLDQFGKFIN---YD 117
Query: 122 VVNAVKKKKVTTENGT-------TTTELEDP-NKCGSCYGAETETRKCCNTCNEVKEAYR 173
N +KK++ G T ++++ + SCYGAE + C TC++V A+
Sbjct: 118 SANDIKKQEQKFYPGNPFFEAVKTNNQVQNQFSNSVSCYGAELYEGQICLTCSDVLIAFA 177
Query: 174 YKKWALPELDTIVQCKNEYSTEKLK 198
+ W P + I QC NE + E K
Sbjct: 178 QRGWPQPMKEQISQC-NEGTKENFK 201
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 68/164 (41%), Gaps = 32/164 (19%)
Query: 203 EGCQIYGYLEVNRVSGSFHIA---PGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF-- 257
E CQIYG+ V +V G+FH++ GL + ++ FN H I L F
Sbjct: 547 EKCQIYGHFYVKKVPGNFHVSFHNEGL-----------LLMNSNLIFNLRHTIHTLEFTT 595
Query: 258 ---GIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL-----------DGSK 303
+ L + PLD T+ G +YY+K++ T++E +
Sbjct: 596 EDGSLTLGKYTKSSNPLDKTIHNPGHGMDT-DYYLKVVNTVFENMLSEHNNIYSFTSLET 654
Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
G D +P + F YE P+ V KS+SL +C I G
Sbjct: 655 SGVRDFRLPSVNFRYEFDPITVLHYRKSRSLTQFIV-TLCAIVG 697
>gi|356545151|ref|XP_003541008.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 453
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 83/195 (42%), Gaps = 36/195 (18%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL-- 261
GC++ GY+ V +V G+ I+ D + ++ N +H I +LSFG K+
Sbjct: 266 GCRVEGYVRVKKVPGNLIISAR----------SDAHSFDASQMNMSHVINNLSFGKKVTP 315
Query: 262 --QDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL---- 304
D + P G+ G S N +YI+I+ T G KL
Sbjct: 316 RAMSDVKLLIPYIGSSHDRLNGRSFINTRDLGANVTIEHYIQIVKTEVVTRKGYKLIEEY 375
Query: 305 -------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
+P F ELSP+ V ITE +S H T + I G + ++D+
Sbjct: 376 EYTAHSSVAHSLDIPVAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAGILDS 435
Query: 358 LLHSCVKKISKVEIG 372
+LH+ ++ + K+E+G
Sbjct: 436 ILHNTIRMVKKIELG 450
>gi|321465392|gb|EFX76393.1| hypothetical protein DAPPUDRAFT_306117 [Daphnia pulex]
Length = 289
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 81/187 (43%), Gaps = 29/187 (15%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+GC +NRV G+FH++ H D QP ++ + H+I L+FG L
Sbjct: 112 KGCIFESRFHINRVPGNFHVS---------THSADKQPDSA---DMAHYITSLTFGEMLD 159
Query: 263 DDD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG-------------- 306
+ + PL + A +Y +KI+PTIYE G+ L
Sbjct: 160 NKNLPGNFNPLARRDRSQADPAESHDYTMKIVPTIYEDSAGTTLVSYQYTYAYSNYVSFS 219
Query: 307 -GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G I+F Y+L+P+ VK E+ + + T + I GT+ ++D+ + + +
Sbjct: 220 LGGRSPAAIWFRYDLNPITVKYHERRQPIYAFLTSVCAIIGGTFTVAGIIDSFVFTASEI 279
Query: 366 ISKVEIG 372
K E+G
Sbjct: 280 FKKFELG 286
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 52/101 (51%), Gaps = 2/101 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ LD + K +D + TV G ++I C F+++L + + ELFVD+ +
Sbjct: 5 LRRLDIYRKVPKDLTQPTVTGAVISICCCAFMTFLFFSEFFHFISPEVVSELFVDNPGNT 64
Query: 67 --KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
K+P+ ++I +P ++C+Y+ +D D G + N K
Sbjct: 65 DEKIPVQINITLPRLACEYVGIDIQDDLGRHDVGFIENTLK 105
>gi|384244593|gb|EIE18093.1| protein disulfide isomerase [Coccomyxa subellipsoidea C-169]
Length = 479
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 61/110 (55%), Gaps = 1/110 (0%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD-SS 63
++L+ +D + K D E T+ G +++V I L+ ++ + + T EEL VD S+
Sbjct: 6 QKLRSVDFYRKIPNDLTEATLAGAGISLVAAFTIVVLLTAELSSFLAIETKEELIVDRSA 65
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
G L I+ +I P++SC++ LD D+ G + +++ I K +D DG+
Sbjct: 66 HGDLLRINFNISFPSLSCEFATLDVSDALGTKRMNLTKTIRKLPIDEDGQ 115
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 82/204 (40%), Gaps = 43/204 (21%)
Query: 202 TEGCQIYGYLEVNRVSGSFHI---APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
T GC + G++ V +V G+ H +PG S+ D Q A N +H + +L FG
Sbjct: 289 TSGCALSGFVLVKKVPGALHFLAKSPGHSF--------DYQ-----AMNMSHVVNYLYFG 335
Query: 259 IKLQD--------------DDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
K D+ L G + + F +Y++++ T E
Sbjct: 336 NKPSPRRHQSLAKLHPAGLSDDWADKLAGQDFFSRAAKATFEHYMQVVLTTIEPSKHRPE 395
Query: 305 GGGDG-------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
D +P F+Y+LSP+ + ++EK ++ H T I G +
Sbjct: 396 LSYDAYEYTVHSHTYDTADIPAAKFTYDLSPIQILVSEKRRAWYHFVTTTCAIIGGVFTV 455
Query: 352 FMLVDALLHSCVKKISKVEIGGKT 375
+VD L+H+ + KVE+G T
Sbjct: 456 AGIVDGLVHTGARFAKKVELGKHT 479
>gi|17570549|ref|NP_508375.1| Protein Y102A11A.6 [Caenorhabditis elegans]
gi|351063407|emb|CCD71590.1| Protein Y102A11A.6 [Caenorhabditis elegans]
Length = 286
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 80/193 (41%), Gaps = 37/193 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ E+N+V G+FH++ H QP ++ +RHL IK D
Sbjct: 110 GCRFESRFEINKVPGNFHLS---------THSAATQP-------ESYDMRHLIHSIKFGD 153
Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
D + PL E G + Y +KI+P+++E G+ L
Sbjct: 154 DVSHKNLKGSFDPLAKRNTSQENGLNTHEYILKIVPSVHEDYSGTILNSYQYTFGHKSYI 213
Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
+P ++F YEL P+ +K TE+ +S T I + GT+ ++D+ +
Sbjct: 214 TYHHSGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTI 273
Query: 363 VKKISKVEIGGKT 375
+ + K +G T
Sbjct: 274 SELVKKQRLGKLT 286
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ D + K +D + T G ++I+C LFIS++I D+ Y + E F+D R
Sbjct: 4 IRRFDIYRKVPKDLTQPTTVGAVISILCVLFISFMIFNDILAYIFIDLRSEFFIDDPGRE 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
K+ + +++ P ++C+YL +D D +G +
Sbjct: 64 GKIDVQVNVSFPHMACEYLGVDIQDENGRHEV 95
>gi|324499844|gb|ADY39943.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Ascaris suum]
Length = 429
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 160/387 (41%), Gaps = 41/387 (10%)
Query: 3 FSERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEE--LF 59
E ++ LDAF K ++ EK G +++VC+ I L+ ++ Y T E
Sbjct: 15 LQEIVQSLDAFDKTTDEIKEEKKTSGAIISVVCFTVIGVLVFGELKTYIYGDTEFEYKFT 74
Query: 60 VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
VD++ + + LD++V T C L ++ E+ + N +KR D E +
Sbjct: 75 VDTAFDEQPELELDMIVAT-PCTNLVAQLSGTAAEEFFLL--NQFKR--DPTRFEFTERE 129
Query: 120 KEVVNAVKK-KKVTTENGTTTTELED----PNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
++ + +K+ VT G LE AE E ++ KE
Sbjct: 130 QKYWDELKRVHGVTKPGGMVFKGLEKMEFVSGHVEEGLKAEAEVKQREEAIAIEKERKNN 189
Query: 175 KK-----WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSG-SFHIAPGLSY 228
K+ A+ + + + +++ K+ T C+++G + VN+V G S I G
Sbjct: 190 KQEDTFGGAILLIGNGINVFHILASDSQKDEGT-ACRVHGRVRVNKVKGDSVIITAGKGA 248
Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
I+ + H S A N +H I L FG + PL GT +E G + Y+
Sbjct: 249 GIDGLFAH--VDGASNAGNISHRIARLHFGPWIGG---LLTPLAGTEQISESGIDEYRYF 303
Query: 289 IKIIPT-IYER--LDGSKL-------------GGGDGGMPGIFFSYELSPLMVKITEKSK 332
+K++PT I+ GS + G + P I YE + L+V++ E
Sbjct: 304 LKVVPTRIFHSGFFGGSTMRYQYSVTKTHKRPSGREHMHPAIAIHYEFAALVVEVRETQT 363
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALL 359
SL L+ ++ + G + T +++ L
Sbjct: 364 SLFQLFVRLCSVVGGVFATSSILNELF 390
>gi|154286632|ref|XP_001544111.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407752|gb|EDN03293.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 315
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 126/318 (39%), Gaps = 75/318 (23%)
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
++LDIVV + CD L ++ D++G++ L + LD +E+
Sbjct: 1 MNLDIVV-AMPCDALRVNVQDAAGDRILASD------LLDKQQTSWAAWNREL------N 47
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
VT+ G L + + S + + E K +Y+ K P+L
Sbjct: 48 GVTSGGGREYQTLNEEDL--SRLMEQEADAHVGHALGEAKRSYKRKFPKGPKLK------ 99
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQPYTSAAFN 247
+ + C+IYG LE N+V G FHI A G Y H+ HD AFN
Sbjct: 100 --------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHLSHD-------AFN 144
Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL-------- 299
+H + LSFG PLD T++ F YY+ ++PTIY R
Sbjct: 145 FSHMVTELSFGPHYP---SLLNPLDKTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNH 201
Query: 300 ---DGSKLGGGDGG-----------------------MPGIFFSYELSPLMVKITEKSKS 333
D + + + G +PGIFF Y + P+++ ++E+ S
Sbjct: 202 VLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYHIPGIFFKYNIEPILLVVSEERGS 261
Query: 334 LGHLWTKIMCNISGTYIT 351
L L +++ ++G +
Sbjct: 262 LLALLVRLVNVLAGVVVA 279
>gi|409048375|gb|EKM57853.1| hypothetical protein PHACADRAFT_116248 [Phanerochaete carnosa
HHB-10118-sp]
Length = 546
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 75/345 (21%), Positives = 135/345 (39%), Gaps = 66/345 (19%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
DAF K + ++ G +T+ L+ D+ +Y E VD R S L
Sbjct: 27 FDAFPKLPSTYKARSEGRGFLTVFVTFMAFLLVLNDLGEYIWGWPDHEFSVDRDRSSDLR 86
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
I++D++V + C YL++D D+ G++ L++ + ++R L KE A+ +
Sbjct: 87 INVDMLV-NMPCQYLSVDLRDAVGDR-LYLS-DSFRRDGTLFDIGQATALKEHAAALSAR 143
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
+V T++ K + T R+ + Y YK
Sbjct: 144 QVVTQS----------RKSRGLF--ATLFRR---NSGGFRPTYNYKPSG----------- 177
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAAFNT 248
C++YG + V +V+ + H+ G Y+ H++ N
Sbjct: 178 -------------SACRVYGSVAVKKVTANLHVTTLGHGYASRQHVDHNL-------MNL 217
Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGG- 307
+H I SFG D + PLD + E+ + YY+ ++PT Y L
Sbjct: 218 SHVITEFSFGPYFPDITQ---PLDNSFELTEDSFVSYQYYLHVVPTTYIAPRSRPLHTHQ 274
Query: 308 ------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
+ G+PGIFF +++ P+ + I +++ SL L +
Sbjct: 275 YSVTHYTRVLKHNNGIPGIFFKFDVDPMSLTIHQRTTSLLQLLIR 319
>gi|299116076|emb|CBN74492.1| DEAD box helicase [Ectocarpus siliculosus]
Length = 865
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 83/189 (43%), Gaps = 34/189 (17%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGI---- 259
GC + G++ VNRV G+FHI + S +H + A N +H + H+SFG
Sbjct: 684 GCMVTGHIMVNRVPGNFHIE---AASKSHT-------FHGATTNLSHIVHHMSFGNDPPR 733
Query: 260 -------KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------RLDGSKLGG 306
+L +D + PLDG V A ++Y++++ ++Y G ++
Sbjct: 734 RTQTKINRLTEDLRQNAPLDGNVYVANAYHQAPHHYLRVVGSMYHLSPMKTPWHGYQIVA 793
Query: 307 GDGGM-------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
M P FSY +SP+ V + + + TK++ + GT+ LVDA +
Sbjct: 794 NSQMMLYDEEEVPEARFSYNISPMSVLVRSEKRPWYDFVTKVLAIVGGTFSMVGLVDAAV 853
Query: 360 HSCVKKISK 368
+K +
Sbjct: 854 FRASRKAGR 862
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 1/104 (0%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
LD + K D + T GG + + + + L V++ + ++ VD+ +KL
Sbjct: 411 LDLYPKIPTDLSQSTAVGGWFSTLTGVIMLLLFQVELFSFMSAPIESQVVVDNVLETKLQ 470
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRLDLDG 112
I+ ++ + C+YL++DA+D G +++ + K LD G
Sbjct: 471 INFNMSFLDLPCEYLSVDALDVLGSNRVNITGKEVQKWHLDPQG 514
>gi|406607484|emb|CCH41148.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Wickerhamomyces ciferrii]
Length = 359
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 86/192 (44%), Gaps = 33/192 (17%)
Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSIN---HVHVHDIQPYTSAAFNTTHHIR 253
K++ C IYG + VN+VSG FHI A G Y N HV + + N TH I
Sbjct: 163 KDSGAPACHIYGSIPVNKVSGDFHITAQGYGYRGNSRSHVGIDGL--------NFTHIIS 214
Query: 254 HLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD------------G 301
SFG + PLD TV +E + YY+ ++PT+Y++L
Sbjct: 215 EFSFG---EFYPYIHNPLDATVQITKEHLQSYQYYLSVVPTVYKKLGVEIETNQYSTSLQ 271
Query: 302 SKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI----TFMLVD 356
KL + G+PG+FF Y+ P+ + + +K ++ G + ++ L D
Sbjct: 272 KKLYSFENKGVPGLFFKYDFEPISLIVEDKRIPFSTFLVRLATIYGGIIVVAKFSYKLFD 331
Query: 357 -ALLHSCVKKIS 367
AL++ K+ +
Sbjct: 332 KALIYFFGKRFA 343
>gi|328771759|gb|EGF81798.1| hypothetical protein BATDEDRAFT_86854 [Batrachochytrium
dendrobatidis JAM81]
Length = 333
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 67/169 (39%), Gaps = 31/169 (18%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-HDIQPYTSAAFNTTHHIRHLSFGIKL 261
+ C+ G + N+V G H L + VH HD A N TH I LSFG +
Sbjct: 158 DACRFRGSFQANKVEGMLHFT-ALGHGYFGVHTPHD-------AINFTHRIDELSFGARY 209
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG--------------- 306
D PLD T+ F Y++ ++PTIY S G
Sbjct: 210 PD---LHNPLDHTLEIGTTNFDSFMYFLGVVPTIYVDKARSLFGATLLTNQYAVTEFSHA 266
Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+PGIF Y + P+ V+ITE L T++ I G ++T
Sbjct: 267 VDPQNPDALPGIFIKYHIEPISVRITESRLGLVQFTTRMCGIIGGAFVT 315
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 45/92 (48%), Gaps = 3/92 (3%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
S+RL LDAF K + + T GG V+++ + YL C ++ + + + VD
Sbjct: 10 LSKRLASLDAFPKIEKQLQQTTKSGGLVSLMMLAVLVYLACTEIYRWRSIDQRYDFIVDQ 69
Query: 63 SRGSK--LPIHLDIVVPTISCDYLALDAVDSS 92
+R + L I++D+ + + C L D D S
Sbjct: 70 TRSHEHSLQINVDLTI-AMDCKVLRADIQDIS 100
>gi|189207969|ref|XP_001940318.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187976411|gb|EDU43037.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 394
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 79/382 (20%), Positives = 147/382 (38%), Gaps = 90/382 (23%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
DAF K + + + A T+ L YL ++ + ST++ V+ +
Sbjct: 24 FDAFPKTKKTYLVQGRNSSAWTVTLILTCIYLSWSEISRWLAGSTSQSFSVEKGISHDMQ 83
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
++LD++V + C L ++ D++G++ L E + +K
Sbjct: 84 LNLDVIV-AMRCADLHVNMQDAAGDRTLAGE-------------------------LLRK 117
Query: 130 KVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
T+ + T LE ++ G G + + ++ +A++ K P +
Sbjct: 118 DPTSWSQWTGRNLERGTHELGIDAGKAQPWEEVWDVHEQLGKAHKRKFSKTPRI------ 171
Query: 189 KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNT 248
+ E T+ C+IYG L+ N+V G FHI + H ++ Q ++FN
Sbjct: 172 RGE----------TDSCRIYGSLDGNKVQGDFHIT-----ARGHGYIEFGQHLDHSSFNF 216
Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIY--------- 296
+H IR +SFG PLD T+A ++ F YY+ I+PTIY
Sbjct: 217 SHIIREMSFGPYYP---SLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPSLIPL 273
Query: 297 -ERLDGSKLGGGDGGM--------------------------PGIFFSYELSPLMVKITE 329
E + + G M PGIF +++ P+++++ E
Sbjct: 274 LELVGSTSNHPGAASMFHGAHAIKTNQYAVTSQSHKVPENYVPGIFVKFDIEPIVLRVVE 333
Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
+ L ++ +SG +
Sbjct: 334 EWGGFWRLIVTLINVVSGVMVA 355
>gi|449530722|ref|XP_004172342.1| PREDICTED: protein disulfide isomerase-like 5-4-like, partial
[Cucumis sativus]
Length = 176
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 60/110 (54%), Gaps = 1/110 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E T+ G ++IV L + +L +++ +Y VST+ + V
Sbjct: 1 MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
D S+ G L + +I P +SC++ A+D D G L++ I K +D
Sbjct: 61 DNSTDGDFLRMDFNISFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSID 110
>gi|402224967|gb|EJU05029.1| DUF1692-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 517
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 74/346 (21%), Positives = 131/346 (37%), Gaps = 72/346 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + ++ GG +T+ L L+ D +Y +TT VD
Sbjct: 20 LKSFDAFPKVPSTYRTRSSGGGFITLGIALLCLLLVLNDWAEYVWGTTTWRFVVDDKIEK 79
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
++ +++DI V + C Y+++D D+ G++ LH+ + D + +++ +
Sbjct: 80 EMMLNVDITV-AMPCHYISVDLRDAVGDR-LHLSDQFKRDGTLFDARQATHIREQYTD-- 135
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
Y A+ R+ + I
Sbjct: 136 -------------------------YSAQQMVREAKTRRGRIG---------------IF 155
Query: 187 QCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQP 240
+ TF C++YG +EV +V + HI G Y N H +
Sbjct: 156 DWLRRRQPSAFQPTFNHVKDGSACRVYGSMEVKKVQANLHITTLGHGYHSNEHTDHSL-- 213
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
N +H I SFG D + PLD T+ +++ + F Y++ ++PT Y
Sbjct: 214 -----MNLSHIITEFSFGPYFPDIVQ---PLDYTIESSDDPFTAFQYFLTVVPTEYRTSK 265
Query: 301 G----SKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSL 334
G ++ G G P IFF Y+L PL + + +++ +L
Sbjct: 266 GVVKTNQYSVGSHMQHIQHGRGTPVIFFKYDLEPLSLIVEQRTTTL 311
>gi|443683891|gb|ELT87978.1| hypothetical protein CAPTEDRAFT_224400 [Capitella teleta]
Length = 292
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 85/200 (42%), Gaps = 32/200 (16%)
Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
+T+K+ EGC+ ++N+V G+FHI+ H QP N H +
Sbjct: 102 NTDKVPINNNEGCRFKSSFKINKVPGNFHIS---------THASKEQPPQP---NMKHIV 149
Query: 253 RHLSFGIKLQDDDERRKPLDGTVA--KAEEGA-SMFNYYIKIIPTIYERLDGSKL----- 304
L FG ++ + + K+E A S +YY+KI+P ++ G L
Sbjct: 150 HELIFGDRVPQTIHIPGSFNPLLEKDKSESNALSSHDYYLKIVPAVFNDYSGKTLMHPYQ 209
Query: 305 -----------GGGDGGMPGIFFSYELSPLMVKITEKSK-SLGHLWTKIMCNISGTYITF 352
GG +P I+F Y+L+P+ VK +E+ H T + + GT+
Sbjct: 210 YTFAYRHSIRQRGGQVVIPAIWFKYKLNPMCVKYSEQRPIPFYHFLTAVCAIVGGTFTVA 269
Query: 353 MLVDALLHSCVKKISKVEIG 372
+ D+ L + + K E+G
Sbjct: 270 GIFDSFLFTAAEIFKKAELG 289
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 25/93 (26%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD--SSR 64
++ LD + K +D + T G +++ LFI+YL ++ Y E++VD ++
Sbjct: 5 IRRLDIYRKIPKDLTQPTKTGACISVGSVLFIAYLFISELTSYLSSEIVTEMYVDDPATN 64
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
++P+ LDI + + C Y+ LD D G +
Sbjct: 65 SERIPVKLDISLLNMECKYIGLDIQDDLGRHEV 97
>gi|341874049|gb|EGT29984.1| hypothetical protein CAEBREN_24080 [Caenorhabditis brenneri]
Length = 286
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 83/193 (43%), Gaps = 37/193 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ E+N+V G+FH++ +++A+ + ++H+ IK D
Sbjct: 110 GCRFESRFEINKVPGNFHLST----------------HSAASQPENYDMKHIIHSIKFGD 153
Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
D + PL + E G S Y +KI+P+++E G+ L
Sbjct: 154 DVSHKNLKGSFDPLANRDSLQENGLSTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYI 213
Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
+P ++F YEL P+ +K TE+ +S T I + GT+ ++D+ +
Sbjct: 214 TYHHSGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTI 273
Query: 363 VKKISKVEIGGKT 375
+ + K ++G T
Sbjct: 274 SELVKKQQMGKLT 286
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ D + K +D + T G ++I C LFIS++I DV Y + E F+D R
Sbjct: 4 IRRFDIYRKVPKDLTQPTTVGALISIFCVLFISFMIFNDVLAYIFIDLRSEFFIDDPGRE 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
K+ + +++ P ++C+YL +D D +G +
Sbjct: 64 GKIDVQVNVSFPHMACEYLGVDIQDENGRHEV 95
>gi|324516732|gb|ADY46617.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Ascaris suum]
Length = 286
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 82/186 (44%), Gaps = 29/186 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ E+N+V G+FH++ H QP +++ H + + FG LQ+
Sbjct: 110 GCRFEANFEINKVPGNFHLS---------THSAASQP---ESYDMRHIVNSVKFGDDLQE 157
Query: 264 DDE--RRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ PL A + + Y +K++P++YE + G +
Sbjct: 158 KAQIGSFNPLQDRTALQGDPLNTHEYILKVVPSVYEDIAGRTKYSYQYTYAHKEYIAYHH 217
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
G +P ++F YEL P+ VK TE+ + L T + + GT+ ++D+ L S +
Sbjct: 218 SGRIIPAVWFKYELQPITVKYTERRQPLYAFITSVCAVVGGTFTVAGIIDSSLFSLSELY 277
Query: 367 SKVEIG 372
K ++G
Sbjct: 278 KKHQLG 283
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 51/92 (55%), Gaps = 1/92 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ LD + K +D + T G ++I+C FI++++ D+ + V ELFVD R
Sbjct: 4 IRRLDIYRKVPKDLTQPTRTGAVISIICVCFIAFMLFNDLRMFLSVDLHSELFVDDPGRE 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
++ +HL+ +P + C+YL +D D +G +
Sbjct: 64 GRIKVHLNATLPYLPCEYLGVDIQDENGRHEV 95
>gi|395326723|gb|EJF59129.1| hypothetical protein DICSQDRAFT_156384 [Dichomitus squalens
LYAD-421 SS1]
Length = 559
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/361 (22%), Positives = 141/361 (39%), Gaps = 72/361 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K E + + G +T+ LI D+ ++ E VD +
Sbjct: 28 LAQFDAFPKLPETYKTHSESRGFLTLFVAFVAFLLILNDLGEFIWGWPDFEFGVDKMPSA 87
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I++D+VV + C YL++D D+ G+ RL L +
Sbjct: 88 NLDINVDMVV-NMPCQYLSIDLRDAVGD------------RLYLS------------DGF 122
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
++ + G T+ E + A R+ V ++ R + + DT++
Sbjct: 123 RRDGTKFDIGQATSLKE--------HAAMLSARQA------VSQSRRSRGF----FDTLL 164
Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHV-HDIQPYT 242
+ + S + N +G C+IYG + RV+ + H+ G Y+ +H HV H
Sbjct: 165 H-RTKSSFKPTYNYQPDGSACRIYGTITAKRVTANLHVTTLGHGYA-SHEHVDHKF---- 218
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
N +H I SFG D + PLD + A + + Y++ ++PT Y
Sbjct: 219 ---MNLSHVITEFSFGPYFPDITQ---PLDNSFEMAHDPFVAYQYFLHVVPTTYIAPRSK 272
Query: 303 KLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
L G PGIFF ++L P+ + I +++ SL + + G +
Sbjct: 273 PLHTNQYSVTHYTRVLDHHRGTPGIFFKFDLEPIHMTIHQRTTSLAAFLLRCAGVVGGVF 332
Query: 350 I 350
+
Sbjct: 333 V 333
>gi|361132020|gb|EHL03635.1| hypothetical protein M7I_0279 [Glarea lozoyensis 74030]
Length = 235
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 71/180 (39%), Gaps = 68/180 (37%)
Query: 207 IYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY----TSAAFNTTHHIRHLSFGIKLQ 262
I G L VN+V G+FHIAPG S+S ++HVHD+ Y +H I HL FG +L
Sbjct: 38 IEGALRVNKVIGNFHIAPGRSFSNGNMHVHDLNNYFDTPVEGGHVFSHTIHHLRFGPQLP 97
Query: 263 DD-------------DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL---------- 299
++ + PLD T E A F Y++K++ T Y L
Sbjct: 98 EELTKKLGTKTNLWTNHHLNPLDDTKQTTTEPAYNFMYFVKVVSTSYLPLGWETQAYKSQ 157
Query: 300 ---------------DGS-------------KLGGGD-------------GGMPGIFFSY 318
DGS L GGD GG+PG+FFSY
Sbjct: 158 LGSEWVGIGSYGHQHDGSVETHQYSVTSHRRSLNGGDDASEGHKEKVHARGGIPGVFFSY 217
>gi|308494873|ref|XP_003109625.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
gi|308245815|gb|EFO89767.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
Length = 286
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 79/193 (40%), Gaps = 37/193 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ E+N+V G+FH++ H QP + +RH IK D
Sbjct: 110 GCRFESRFEINKVPGNFHLS---------THSAATQP-------DNYDMRHTIHSIKFGD 153
Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
D + PL E G + Y +KI+P+++E G+ L
Sbjct: 154 DVSHKNLKGSFDPLANRDTSQENGLNTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYI 213
Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
+P ++F YEL P+ +K TE+ +S T I + GT+ ++D+ +
Sbjct: 214 TYHHSGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTI 273
Query: 363 VKKISKVEIGGKT 375
+ + K ++G T
Sbjct: 274 SELVKKQQMGKLT 286
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
++ D + K +D + T G ++++C FI+++I DV Y + E F+D R
Sbjct: 4 IRRFDIYPKIPKDLTQPTTAGAVISMLCVAFIAFMIFNDVLAYIFIDLRSEFFIDDPGRE 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
K+ + +++ P ++C+YL +D D +G +
Sbjct: 64 GKIDVQVNVSFPHMACEYLGVDIQDENGRHEV 95
>gi|392564830|gb|EIW58008.1| DUF1692-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 539
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/361 (21%), Positives = 141/361 (39%), Gaps = 72/361 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K + ++ G +T+ L+ D+ +Y E VD+ + +
Sbjct: 25 LAQFDAFPKVPSSYKTRSESRGFLTLFVAFVAFLLVLNDIGEYIWGWPDYEFGVDTDQTN 84
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L I++D+V+ + C +L++D D+ G+ RL L +
Sbjct: 85 ALDINVDMVI-NMPCQFLSVDLRDAVGD------------RLFLS------------DGF 119
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV--KEAYRYKKWALPELDT 184
++ + G T+ L++ + S A +++R + + + A RYK
Sbjct: 120 RRDGTKFDIGQATS-LKEHAEALSARQAVSQSRSSRGFFDVLLRRAAVRYKP-------- 170
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHV-HDIQPYT 242
Y + C+++G + RV+ + HI G Y+ + HV H +
Sbjct: 171 ----TYNYQPDG------SACRVFGTITAKRVTANLHITTLGHGYA-SQTHVDHKL---- 215
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
N +H I SFG D + PLD + E + YY+ ++PT Y
Sbjct: 216 ---MNLSHVITEFSFGPYFPDITQ---PLDNSFELTSEPFVAYQYYLHVVPTTYIAPRTK 269
Query: 303 KLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
L G PGIFF ++L P+ + I +++ S L+ + + I G +
Sbjct: 270 PLNTNQYSVTHYTRVLDHHRGTPGIFFKFDLEPMKLTIHQRTTSFVQLFIRTVGVIGGVF 329
Query: 350 I 350
+
Sbjct: 330 V 330
>gi|300122162|emb|CBK22736.2| unnamed protein product [Blastocystis hominis]
Length = 331
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 61/112 (54%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + ++ LD F K D E ++ G +TIVC++ ++ L+ ++ +YF + T + +
Sbjct: 1 MGWRSTVRKLDMFRKVPVDLTEGSICGTILTIVCYILVAALVALEFNNYFTIDTRTDYII 60
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
+ + I+ DI + ++SCD +LD V+ G ++V NI + ++ +G
Sbjct: 61 EQHDDEYIQINFDITMKSLSCDLASLDIVNQMGTHRINVTQNIRRWQVFENG 112
>gi|397568633|gb|EJK46248.1| hypothetical protein THAOC_35093 [Thalassiosira oceanica]
Length = 601
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 98/214 (45%), Gaps = 54/214 (25%)
Query: 196 KLKNTFTE----GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHH 251
K+K+++ E GCQI G+L V+R G+FHI + S N HD+ + + N +H
Sbjct: 396 KVKHSWDEDEHPGCQISGFLLVDRAPGNFHIQ---AQSKN----HDLAAHMT---NVSHI 445
Query: 252 IRHLSFG-------IK--LQDDD----ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE- 297
I HLSFG IK L++ + +P DG V ++Y+K+I T +E
Sbjct: 446 INHLSFGKPFSKYFIKEGLKNTPAGFLDTTRPFDGNVYVTHNEHEAHHHYLKVITTEFEP 505
Query: 298 RLDGSKLGGGDGG--------------------------MPGIFFSYELSPLMVKITEKS 331
+ D K G G +P F+Y+LSP+ V ++K
Sbjct: 506 QRDTKKQYGKKKGFYKPPEPQRAYQILQSSQLSLYRNDIVPEAKFTYDLSPIAVSYSKKY 565
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
++ +T +M I GT+ +V++ L++ KK
Sbjct: 566 RAWYDYFTSLMAIIGGTFTVVGMVESSLYAVSKK 599
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 33/143 (23%), Positives = 64/143 (44%), Gaps = 5/143 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L LD + K D E T G ++ + + ++ L ++ +F S + L +DS+
Sbjct: 80 LASLDMYRKVPVDLLEGTKRGSIMSTLAIMSMATLFFLETRAFFSSSLSTNLALDSNTDQ 139
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ ++ +I + + CDY +D V G Q +V ++ K +D G + Q+ +
Sbjct: 140 NVRVNFNITMMDLRCDYATIDVVSVLGTQQ-NVTQHVQKYPIDQYGVRQRYQQRN----L 194
Query: 127 KKKKVTTENGTTTTELEDPNKCG 149
K+ V + T +ED + G
Sbjct: 195 KQHDVQQFDATVEETIEDLHADG 217
>gi|168012320|ref|XP_001758850.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689987|gb|EDQ76356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 487
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 87/195 (44%), Gaps = 36/195 (18%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK--- 260
GC++ G++ V +V G I+ ++S +H + + + N TH++ SFG K
Sbjct: 300 GCRVEGFVRVKKVPGELMIS---AHSGSH-------SFDATSMNMTHYVGFFSFGRKTSW 349
Query: 261 ---------LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSKLGGG 307
L D L G V +E ++Y++++ T ++ + D L
Sbjct: 350 RSVHWVNEMLPALDSNIDRLTGQVFPSEYENITHDHYLQVVKTEVITLHRKQDLRVLEQY 409
Query: 308 D----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
D +P + F YELSP+ V + E KS H T + I G + ++D+
Sbjct: 410 DYTAHSNMIQSTKVPVVKFHYELSPMQVLVKENPKSFSHFLTNLCAIIGGVFTVAGIIDS 469
Query: 358 LLHSCVKKISKVEIG 372
+LH+ + + KVE+G
Sbjct: 470 MLHNAMHIMKKVELG 484
>gi|330935325|ref|XP_003304912.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
gi|311318248|gb|EFQ86993.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
Length = 395
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/386 (20%), Positives = 144/386 (37%), Gaps = 91/386 (23%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ DAF K + + + A T+ L YL ++ ++ ST + V+
Sbjct: 21 VSSFDAFPKTKKTYLVQGRNSSAWTVTLILTCIYLSWSEISRWYAGSTWQSFAVEKGVSH 80
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I+LDI+V + C L ++ D++G++ L E +
Sbjct: 81 DMQINLDIIV-AMRCADLHVNMQDAAGDRTLAGE-------------------------L 114
Query: 127 KKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+K T+ + T LE ++ G+ G + + ++ +A+
Sbjct: 115 LRKDPTSWSQWTGRNLERGTHELGTEAGDAPSWEEAWDVREQLGKAH------------- 161
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
K ++S + C+IYG L+ N+V G FHI + H ++ + ++
Sbjct: 162 ---KRKFSKTPRIRGNPDSCRIYGSLDGNKVQGDFHIT-----ARGHGYMEFGEHLDHSS 213
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIYE----- 297
FN +H IR +SFG PLD T+A ++ F YY+ I+PTIY
Sbjct: 214 FNFSHIIREMSFGPYYP---SLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPTL 270
Query: 298 ----RLDGSKLGGGDGG----------------------------MPGIFFSYELSPLMV 325
S G G +PG+F +++ P+M+
Sbjct: 271 IPYLEAVSSTAGNHPGAASIFHGARAIKTNQYAVTSQSHKVPENYVPGVFVKFDIEPIML 330
Query: 326 KITEKSKSLGHLWTKIMCNISGTYIT 351
+ E+ L ++ +SG +
Sbjct: 331 AVVEEWSGFWRLIVTLVNVVSGVMVA 356
>gi|241560364|ref|XP_002401002.1| COPII vesicle protein, putative [Ixodes scapularis]
gi|215501827|gb|EEC11321.1| COPII vesicle protein, putative [Ixodes scapularis]
gi|442749161|gb|JAA66740.1| Putative copii vesicle protein [Ixodes ricinus]
Length = 285
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 81/185 (43%), Gaps = 28/185 (15%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +++V G+FH++ H QP + TH I L+FG K+ +
Sbjct: 110 GCRFEGKFYIHKVPGNFHMS---------THAAAKQP---DKIDMTHIIHDLTFGNKMVE 157
Query: 264 DDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG-------------GDG 309
LD G +Y +KI+PT++E+ ++
Sbjct: 158 GVRGSFNSLDEMDKSEANGLESHDYVMKIVPTVFEKSPSERIESYQYTYAYKSYVSISHS 217
Query: 310 G--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
G MP I+F Y+L+P+ VK T +S L T + + GT+ +VD+L+ + +
Sbjct: 218 GRIMPAIWFRYDLTPITVKYTRRSVPLYSFLTSVCAIVGGTFTVAGIVDSLVFTASEIFK 277
Query: 368 KVEIG 372
K E+G
Sbjct: 278 KYEMG 282
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 3/106 (2%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MVF R D + K +D + TV G ++I+ FIS L + Y ELFV
Sbjct: 1 MVFDVR--RFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELASELFV 58
Query: 61 DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
D+ S K+P+ ++I + + C + LD D G + N K
Sbjct: 59 DNPSSADKIPVSINITLLKLDCSAVGLDIQDDMGRHEVGFVENTEK 104
>gi|357452761|ref|XP_003596657.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355485705|gb|AES66908.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 482
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 84/198 (42%), Gaps = 40/198 (20%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----I 259
GC+I GY+ V +V G+ I+ D + ++ N +H + HLSFG
Sbjct: 293 GCRIEGYVRVKKVPGNLIISAR----------SDAHSFDASQMNMSHAVHHLSFGKKLSP 342
Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTI------------Y 296
KL D +R P G +G S N +Y++I+ T Y
Sbjct: 343 KLMSDVQRLIPYVGNSHDRLDGLSFINSHDFGANVTLEHYLQIVKTEVITRQGYQLVEEY 402
Query: 297 ERLDGSKLGGGDGGMPGIFFSYELSPLMV--KITEKSKSLGHLWTKIMCNISGTYITFML 354
E S L +P F +LSP+ V ITE KS H T + + G + +
Sbjct: 403 EYTAHSSLAHS-LHVPVARFHLQLSPMQVCVLITEDHKSFSHFITNVCAIVGGVFTVAGI 461
Query: 355 VDALLHSCVKKISKVEIG 372
+++LH+ ++ + KVE+G
Sbjct: 462 TESILHNTIRLMRKVELG 479
>gi|393908149|gb|EJD74928.1| hypothetical protein LOAG_17836 [Loa loa]
Length = 430
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 89/396 (22%), Positives = 166/396 (41%), Gaps = 56/396 (14%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
E ++ DAF K ++ EK GG + + +L I+ L+ ++ +YF VD
Sbjct: 22 EVVRDFDAFNKTVDEVSEEKRATGGFLASLSFLIIAALVFGELQNYFYGDEGHYYRFSVD 81
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
++ + +D++V T + +A +S H N +K D + +KE
Sbjct: 82 TAFSEHPELEVDMIVATPCTNLMAHLTGTAS---HEFNSMNGFK----YDPTRFEFTEKE 134
Query: 122 VV--NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY----RYK 175
+ N +KK + T+ GTT + ++ G E K + +EA+ + K
Sbjct: 135 AMYWNELKKVQHRTKEGTTL--FKSLDEMTFVSGRVEEGLKTEAETKQREEAHAIQLQRK 192
Query: 176 KWALPELD--TIVQCKNEYSTEKLKNTFTE-----GCQIYGYLEVNRVSG-SFHIAPGLS 227
K LD T++ N ++ + + +E C+I+G + VN+V G SF I+ G
Sbjct: 193 KNPKQSLDGGTLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFIISTGKG 252
Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
++ + H S+ N +H I +FG ++ PL G +E G F Y
Sbjct: 253 LDVDGIFAHF--GGVSSPSNISHRIERFNFGPRIYG---LVTPLAGIEQISETGVDEFRY 307
Query: 288 YIKIIPTIYERLDGSKLGGGDG-------------------GMPGIFFSYELSPLMVKIT 328
++KI+PT R+ S L GG I YE + ++++
Sbjct: 308 FLKIVPT---RIYHSGLFGGSTLTYQYSVTFMKKTPKKDVHKHTAIIIHYEFAATVIEVR 364
Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
SL + ++ + G + T +L++++ C++
Sbjct: 365 HVQSSLLQMLVRLCSAVGGVFATSILLNSI---CIR 397
>gi|145540599|ref|XP_001455989.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423798|emb|CAK88592.1| unnamed protein product [Paramecium tetraurelia]
Length = 322
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 76/383 (19%), Positives = 144/383 (37%), Gaps = 89/383 (23%)
Query: 7 LKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
L+ LD F K D + + GG +T + + ++ L + +F + +D+
Sbjct: 3 LRQLDFFRKLNTDIGDTSSALGGFLTTIAFALVTILTMNECRLFFSTELNYQTVIDNDTE 62
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ +HLD++V C L+LD D G + V + K LD D + V+ +
Sbjct: 63 QFIKVHLDMIVGA-PCMVLSLDQQDEVGVHVMDVSGTLKKISLDKD--------RHVLPS 113
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+ D N+ + G+E E N+
Sbjct: 114 I-----------------DSNERPNYEGSEQELLDAIEAINQ------------------ 138
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQPYTSA 244
E CQ+ G+ +VN+V G+FH++ Y + +H D+ +
Sbjct: 139 ----------------GEQCQLKGFFQVNKVPGNFHVSYHAHHYLLQRIHQRDLSVFRKM 182
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRK----------PLDGTVAKAEEGASM-FNYYIKIIP 293
+ H I L FG ++ + RK V A EG + YYI +P
Sbjct: 183 KLD--HSIYELRFG-EITTTSKMRKYSKSLQKFQNSWKQIVKSAPEGEKQDYEYYIDALP 239
Query: 294 T-IYERLDGS-----KLGGGDGGMP-------GIFFSYELSPLMVKITEKSKSLGHLWTK 340
Y+ + + K + MP I+F Y++SP+ + + + KS+ H +
Sbjct: 240 VRFYDENERNYQTLYKYSINEAQMPRTFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQ 299
Query: 341 IMCNISGTYITFMLVDALLHSCV 363
++ I G + ++++++ +
Sbjct: 300 LLAIIGGVFAVIGILNSIVQKAI 322
>gi|348690307|gb|EGZ30121.1| COPII vesicle trafficking protein [Phytophthora sojae]
Length = 306
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/205 (21%), Positives = 80/205 (39%), Gaps = 49/205 (23%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+++G ++V +V+G A S ++ + FN++H + HL FG ++ D
Sbjct: 108 GCRLFGTVQVQKVAGDLSFAHEGSLTV-------FSFFDFLNFNSSHVVNHLRFGPQIPD 160
Query: 264 D--------------------------DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
D L +A + + Y++ ++P+ Y
Sbjct: 161 METPLIDVSKILERNCTQESCWLARSWDSVAALLTSFIALLLFTVATYKYFVNVVPSRYV 220
Query: 298 RLDG----------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
L+G S+ G PG+ FSYE SP+ V+ E S+ H T
Sbjct: 221 YLNGRSVTTFQYSVTEHETSSRGPNGQVSFPGVIFSYEFSPIAVEYIESKPSVLHFLTST 280
Query: 342 MCNISGTYITFMLVDALLHSCVKKI 366
+ G + ++D ++S KKI
Sbjct: 281 SAIVGGVFAVARMIDGAIYSVSKKI 305
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 53/105 (50%)
Query: 8 KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
+ D K E E+T+ GG VT++ + +++L+ + ++ VS T + VD+
Sbjct: 4 RRFDLNVKGVEGIQERTIGGGVVTLLSCVVVAFLLLSEFSVWWTVSVTHRMHVDTDPDYP 63
Query: 68 LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
+ I +D+ +C +ALD DS G + + ++ +I + +G
Sbjct: 64 INIEVDVSFLHEACKEVALDVSDSKGHKEILLKKDIQEEPFGENG 108
>gi|363748002|ref|XP_003644219.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
DBVPG#7215]
gi|356887851|gb|AET37402.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
DBVPG#7215]
Length = 340
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 116/294 (39%), Gaps = 59/294 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K + +K+ GG +IV +LF+ ++ + YF E+ VD +
Sbjct: 4 LRTFDAFPKTEQQHVKKSSKGGLTSIVIYLFLLFIAWSEFGSYFGGYIDEQYIVDDEIRT 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
I+++I V + C YL + A D +G+ I RL+ + P
Sbjct: 64 TAQINMNIYV-KMPCKYLEVTARDQTGDLQ------IVSERLNFQDIHFRVPYG------ 110
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
K+T N + +L+D A+ + +PEL I
Sbjct: 111 --TKMTEFNDVISPDLDD--ILADAIPAQFTSD-------------------MPELPMI- 146
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
+ +GC IYG + VN+VSG I A G +Y P+ +
Sbjct: 147 -----------EGINFDGCSIYGSVPVNKVSGELQITAKGWTYMSTRR-----TPF--SV 188
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
N +H I LSFG D LDG A+E + Y+ ++PT Y+++
Sbjct: 189 LNFSHVINELSFGDFFPYIDNT---LDGVGRIADEPLKAYYYFTSVLPTAYKKM 239
>gi|162462518|ref|NP_001105762.1| protein disulfide isomerase12 [Zea mays]
gi|59861281|gb|AAX09970.1| protein disulfide isomerase [Zea mays]
gi|414590455|tpg|DAA41026.1| TPA: putative thioredoxin superfamily protein [Zea mays]
Length = 483
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++IV L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 85/200 (42%), Gaps = 42/200 (21%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+I G++ V RV GS I+ + S +H + + N +H++ SFG +L
Sbjct: 292 GCRIEGFVRVKRVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKRLSP 341
Query: 264 D---------------DERRKPLDGTVAKAEEGASM-FNYYIKIIPTI------------ 295
+R TV E A++ +Y++++ T
Sbjct: 342 RMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRSSKELKV 401
Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
YE S L +P + F +E SP+ V +TE KS H T + I G +
Sbjct: 402 LEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVA 460
Query: 353 MLVDALLHSCVKKISKVEIG 372
++D++ H+ ++ + K+E+G
Sbjct: 461 GILDSIFHNTLRMVKKIELG 480
>gi|195639434|gb|ACG39185.1| PDIL5-4 - Zea mays protein disulfide isomerase [Zea mays]
Length = 485
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++IV L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 85/200 (42%), Gaps = 42/200 (21%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+I G++ V RV GS I+ + S +H + + N +H++ SFG +L
Sbjct: 294 GCRIEGFVRVKRVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKRLSP 343
Query: 264 D---------------DERRKPLDGTVAKAEEGASM-FNYYIKIIPTI------------ 295
+R TV E A++ +Y++++ T
Sbjct: 344 RMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRSSKELKV 403
Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
YE S L +P + F +E SP+ V +TE KS H T + I G +
Sbjct: 404 LEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVA 462
Query: 353 MLVDALLHSCVKKISKVEIG 372
++D++ H+ ++ + K+E+G
Sbjct: 463 GILDSIFHNTLRMVKKIELG 482
>gi|325187435|emb|CCA21973.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 283
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 75/178 (42%), Gaps = 24/178 (13%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
EGC+ G L + ++ G G S SI ++ FN++H I L+FG+ +
Sbjct: 115 EGCRYKGTLTIQKLQGDIFFCHGGSLSIFNL-------MEMFRFNSSHVITKLNFGLSIP 167
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS--------------KLGGGD 308
+ + PL + + Y+ K++P+ Y LDG K+ G
Sbjct: 168 ---KMQTPLTDVHKTVLAQVATYKYFAKVVPSRYVYLDGKSTMTYQYSVTEHLLKMDGFV 224
Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
+PG+ SY+ SP+ V E ++ H T + G + DA L+S KK+
Sbjct: 225 TNIPGVIISYDFSPIAVDYIETKPNIFHFITNTCAILGGVIAVARIFDAALYSMSKKL 282
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/98 (24%), Positives = 53/98 (54%), Gaps = 3/98 (3%)
Query: 8 KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
+ DA+ K E E+T+ GG +T++ +F+ +L ++ ++ V+ + VD++
Sbjct: 5 RRFDAYAKAVEGIQERTIGGGIITLLSCVFVCFLFISEISVWWTVNVVHRMHVDTAPQES 64
Query: 68 LPIHLDIVVPTI--SCDYLALDAVDSSGEQHLHVEHNI 103
PI LD+ + + +C + +D DS G+ + + +N+
Sbjct: 65 -PITLDVDISMLHETCRDIKVDVSDSQGDGSILIANNL 101
>gi|357122608|ref|XP_003563007.1| PREDICTED: protein disulfide isomerase-like 5-4-like [Brachypodium
distachyon]
Length = 485
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++IV L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDRNLVP 115
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 88/202 (43%), Gaps = 40/202 (19%)
Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
T GC++ G++ V +V GS I+ + S +H + + N +H++ SFG +
Sbjct: 291 MTSGCRVEGFVRVKKVPGSVIIS---ARSGSH-------SFDPSQINVSHYVTQFSFGNR 340
Query: 261 LQ----DDDERRKPLDG-----------TVAKAEEGASM-FNYYIKIIPTIYERLDGSKL 304
L + +R P G V + A++ +Y++I+ T L SK
Sbjct: 341 LSPNMFSELKRLIPYVGGHHDRLAGQSYIVKHGDNNANVTIEHYLQIVKTELVTLRSSKE 400
Query: 305 GG--------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+P + F +E SP+ V +TE KS H T + I G +
Sbjct: 401 LKVFEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVFT 460
Query: 351 TFMLVDALLHSCVKKISKVEIG 372
++D++LH+ ++ + KVE+G
Sbjct: 461 VAGILDSILHNTLRLVKKVELG 482
>gi|388501278|gb|AFK38705.1| unknown [Medicago truncatula]
Length = 148
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 18/137 (13%)
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
N +H I LSFG K PLD T + + F YYIKI+PT Y + L
Sbjct: 10 NVSHVIHDLSFGPKYPGI---HNPLDETSRILHDASGTFKYYIKIVPTEYRYISKEVLPT 66
Query: 307 G---------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
D P ++F Y+LSP+ V I E+ +S H T++ + GT+
Sbjct: 67 NQFSVTEYFSPITSQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAV 126
Query: 352 FMLVDALLHSCVKKISK 368
++D ++ V+ +K
Sbjct: 127 TGMLDRWMYRLVEAATK 143
>gi|414590456|tpg|DAA41027.1| TPA: putative thioredoxin superfamily protein [Zea mays]
Length = 439
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++IV L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115
>gi|414590454|tpg|DAA41025.1| TPA: putative thioredoxin superfamily protein [Zea mays]
Length = 435
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++IV L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115
>gi|268577857|ref|XP_002643911.1| Hypothetical protein CBG02175 [Caenorhabditis briggsae]
Length = 282
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 81/193 (41%), Gaps = 38/193 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ E+N+V G+FH++ H QP + +RH+ IK D
Sbjct: 107 GCRFESRFEINKVPGNFHLS---------THSATTQP-------DGYDMRHIIHSIKFGD 150
Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
D + PL AK E G + Y +KI+P+++E G+ L
Sbjct: 151 DVSHKNLKGSFDPLANREAK-ESGLNTHEYILKIVPSVHEDYSGNILNSYQYTYGHKSYV 209
Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
+P ++F YEL P+ +K TE +S T I + GT+ ++D+ +
Sbjct: 210 TYHHSGKIIPAVWFKYELQPITLKQTEHRQSFYIFLTSICAVVGGTFTVAGIIDSTFFTI 269
Query: 363 VKKISKVEIGGKT 375
+ + K ++G T
Sbjct: 270 SEMVKKQQMGKLT 282
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 47/88 (53%), Gaps = 1/88 (1%)
Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRGSKLP 69
D + K +D + T G ++I+C FI+++I D+ Y + E F+D R K+
Sbjct: 5 DIYRKVPKDLTQPTTAGAVISILCVAFITFMIFNDILAYIFIDLRSEFFIDDPGREGKID 64
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHL 97
+ +++ P ++CDY+ +D D +G +
Sbjct: 65 VQVNVSFPHMACDYIGVDIQDENGRHEV 92
>gi|344301277|gb|EGW31589.1| hypothetical protein SPAPADRAFT_62204 [Spathaspora passalidarum
NRRL Y-27907]
Length = 353
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 87/221 (39%), Gaps = 36/221 (16%)
Query: 180 PELDTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
PELD I+Q + E+ + + N C I+G + +N+V G F I A G Y
Sbjct: 127 PELDEIMQESLRAEFRVQGQRVNENAPACHIFGSIPINQVKGDFRITAKGYGY------- 179
Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
D+ N +H I+ S+G + PLD T EE + Y K++PT
Sbjct: 180 RDVIAAPIDKLNFSHVIQEFSYG---EFYPFINNPLDATGKVTEEKFQKYMYSAKVVPTS 236
Query: 296 YER------------------LDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
YE+ L + G G+PGI+ Y+ P+ + I EK
Sbjct: 237 YEKLGLIVETNQYSVTENHQVLQKNSQTGVPIGVPGIYIKYDFEPIKMVIKEKRMPFMQF 296
Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
K+ G IT + L +KI V G K V K
Sbjct: 297 VAKLATIAGGILIT----ASYLFRLYEKILGVVFGKKYVEK 333
>gi|388517493|gb|AFK46808.1| unknown [Lotus japonicus]
Length = 156
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 67/153 (43%), Gaps = 26/153 (16%)
Query: 246 FNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYIK 290
N +H + HL+FG K+ D +R P G+ G S N +YI+
Sbjct: 1 MNMSHVVNHLTFGKKVTPRAISDMQRLIPHIGSSHDRLNGRSFVNTHNLEANVTIEHYIQ 60
Query: 291 IIPTIYERLDGSKL-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
I+ T +G KL +P F ELSP+ V ITE KS H T
Sbjct: 61 IVKTEVVTRNGYKLIEDYEYTAHSSVAHSLDIPVAKFHLELSPMQVLITENQKSFSHFIT 120
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ I G + +VD++LH+ ++ I KVE+G
Sbjct: 121 NVCAIIGGVFTVAGIVDSILHNTIRMIKKVELG 153
>gi|297847442|ref|XP_002891602.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
lyrata]
gi|297337444|gb|EFH67861.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
lyrata]
Length = 484
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 88/215 (40%), Gaps = 40/215 (18%)
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ N ST K K + GC+I GY+ +V G I+ H H + ++
Sbjct: 278 KSDNAASTIK-KAPVSGGCRIEGYVRAKKVPGELVISA-------HSGAHS---FDASQM 326
Query: 247 NTTHHIRHLSFGI----KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
N +H + HLSFG +L D +R P G G S N +Y++I
Sbjct: 327 NMSHIVTHLSFGTMVSERLWTDMKRLLPYLGQSHDRLNGKSFINQRKFDVNVTIEHYLQI 386
Query: 292 IPT-IYERLDGSKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
+ T + R G + P F +ELSP+ V I+E KS H
Sbjct: 387 VKTEVISRRSGKEHSLIEEYEYTAHSSVAHSYHYPEAKFHFELSPMQVLISENPKSFSHF 446
Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
T + I G + ++D++ + V+ + K+E+G
Sbjct: 447 ITNVCAIIGGVFTVAGILDSIFQNTVRMVKKIELG 481
>gi|224030141|gb|ACN34146.1| unknown [Zea mays]
Length = 483
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++IV L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEVSLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 85/200 (42%), Gaps = 42/200 (21%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+I G++ V RV GS I+ + S +H + + N +H++ SFG +L
Sbjct: 292 GCRIEGFVRVKRVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKRLSP 341
Query: 264 D---------------DERRKPLDGTVAKAEEGASM-FNYYIKIIPTI------------ 295
+R TV E A++ +Y++++ T
Sbjct: 342 RMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRSSKELKV 401
Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
YE S L +P + F +E SP+ V +TE KS H T + I G +
Sbjct: 402 LEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVA 460
Query: 353 MLVDALLHSCVKKISKVEIG 372
++D++ H+ ++ + K+E+G
Sbjct: 461 GILDSIFHNTLRMVKKIELG 480
>gi|440293957|gb|ELP87004.1| hypothetical protein EIN_318630 [Entamoeba invadens IP1]
Length = 316
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 77/189 (40%), Gaps = 34/189 (17%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPG-LSY---SINHV----------HVHDIQPYTSAAFNTT 249
GC+++G ++V+RVSG FH+A G ++Y N V H H +FN T
Sbjct: 117 GCRMHGTMKVSRVSGEFHVAFGKIAYRQQRTNQVITATQKHTQMHTHQFTMQEMKSFNPT 176
Query: 250 HHIRHLSF-GIKLQDDDERRKPLDGT--VAKAEEGASMFNYYIKIIPT------------ 294
H I +L+F PL+G K + A + YYI +IPT
Sbjct: 177 HFINNLAFSNTPSYTTHAGETPLNGKEYTLKGYDNAR-YTYYINVIPTLNKYPTHTTRSY 235
Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
I ER G PG+FF YELSP +V S H I G +I
Sbjct: 236 QLSINERFVPVTY-GPTFTQPGVFFKYELSPYIVINEMMDHSFAHSIASTAAIIGGVWII 294
Query: 352 FMLVDALLH 360
F + L+
Sbjct: 295 FGWISRFLN 303
>gi|443920575|gb|ELU40475.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
solani AG-1 IA]
Length = 506
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 65/298 (21%), Positives = 119/298 (39%), Gaps = 63/298 (21%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
++ DAF K ++ +T GG +T++ + L+ D+ DY E VD++ +
Sbjct: 15 VRQFDAFPKVRPNYKARTTGGGLMTVLVAVISFILVLNDLGDYLWGWREYEFTVDNNLAT 74
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQ-HLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ +++D+VV + C +L++D D++G++ L EH ++R
Sbjct: 75 VMYVNVDLVV-NMPCHFLSVDLRDAAGDRLFLTDEHGGFRR------------------- 114
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
+G T S Y K + EV A + + L +
Sbjct: 115 ---------DGAT-----------SAYALNFRDSKVSVSPQEVVSASKRSQRGL--FSSF 152
Query: 186 VQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQ 239
+ K+ + T+ C+++G + V +V+ + HI G Y H +
Sbjct: 153 KKPKD----PTFRPTYNHIPDASACRVFGTVAVKKVTANLHITTLGHGYRSAEHTDHTL- 207
Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
N TH I SFG + D + PLD + E + F Y+I ++PT Y+
Sbjct: 208 ------MNLTHVINEFSFGPFIPDLSQ---PLDYSFEVTHEHFTAFQYFITVVPTTYQ 256
>gi|403413226|emb|CCL99926.1| predicted protein [Fibroporia radiculosa]
Length = 546
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 143/360 (39%), Gaps = 70/360 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L DAF K + ++ G +TI L LI D+ +Y + E VDS +
Sbjct: 27 LAQFDAFPKLPSTYKARSESRGFLTIFVALVAFLLILNDLGEYLWGWSDHEFSVDSDTTN 86
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
L +++D++V + C YL++D D+ G++ L + + + D V +A
Sbjct: 87 GLNLNVDLMV-NMPCQYLSVDLRDAVGDR-LFLSRGFRRDGIKFD----------VGHA- 133
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
T L++ S A ++RK + + ++K +
Sbjct: 134 -------------TALKEHAAALSAQQAIAQSRKSRGFFSTL-----FRK-------DVA 168
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV---HDIQPYTS 243
Q + ++ +K + C+IYG + + + + HI +I H + H Y
Sbjct: 169 QYRPTHNYQKDGSA----CRIYGTITAKKATANLHIT-----TIGHGYASRDHVDHKY-- 217
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
N +H I SFG E +PLD + A + + YY+ ++PT Y +
Sbjct: 218 --MNLSHVINEFSFGPFFP---EIVQPLDNSFELALDPFVAYQYYLHVVPTTYIAPRSTP 272
Query: 304 LG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
L G PGIFF ++L P+ + I +++ +L + + + G ++
Sbjct: 273 LHTHQYSVTHYTRTMSTHQGTPGIFFKFDLEPMHLTIHQRTTTLAQFLIRCVGVVGGIFV 332
>gi|47219772|emb|CAG03399.1| unnamed protein product [Tetraodon nigroviridis]
Length = 378
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 83/220 (37%), Gaps = 79/220 (35%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--------------------------LSYSINHV--H 234
C+I+G+L VN+V+G+FHI G LS SI H H
Sbjct: 130 RACRIHGHLYVNKVAGNFHITVGKYVTSLLGYSVVSLHSIPIGVTLFLLLSRSIPHPRGH 189
Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS----------- 283
H + ++N +H I HLSFG +D PLDGT + + +
Sbjct: 190 AHLAALVSHDSYNFSHRIDHLSFG---EDLPGIISPLDGTEKVSADCTAVLSLTPLHRCD 246
Query: 284 ---------------------MFNYYIKIIPT---------------IYERLDGSKLGGG 307
+F Y+I I+PT + E+ G
Sbjct: 247 FFLPRLFFKMCDFRFSLLANHIFQYFITIVPTKLNTYKVSAETHQYSVTEQDRAINHAAG 306
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
G+ GIF Y++S LMVK+TE+ L + +C I G
Sbjct: 307 SHGVSGIFMKYDISSLMVKVTEQHMPLWQFLVR-LCGIVG 345
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 57/117 (48%), Gaps = 8/117 (6%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E T GG V+++ + ++ L ++ Y E VD GS
Sbjct: 13 VKELDAFPKVPESYVESTASGGTVSLIAFSLMAILAFLEFFVYRDTWMKYEYEVDKDFGS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR--RLDLDGKPIQEPQKE 121
KL I++DI V D + + + ++ L VEH++ + + G P +PQ +
Sbjct: 73 KLRINVDITV----ADEMPMTLLHI--QERLKVEHSLQDLIFKTAMKGAPPPQPQTD 123
>gi|42562656|ref|NP_175508.2| protein Disulfide Isomerase (PDIa) family, redox active TRX
domain-containing protein [Arabidopsis thaliana]
gi|332194483|gb|AEE32604.1| protein Disulfide Isomerase (PDIa) family, redox active TRX
domain-containing protein [Arabidopsis thaliana]
Length = 484
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 88/215 (40%), Gaps = 40/215 (18%)
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ N ST K K + GC+I GY+ +V G I+ H H + ++
Sbjct: 278 KSDNAASTFK-KAPVSGGCRIEGYVRAKKVPGELVISA-------HSGAHS---FDASQM 326
Query: 247 NTTHHIRHLSFGI----KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
N +H + HL+FG +L D +R P G G S N +Y++I
Sbjct: 327 NMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQI 386
Query: 292 IPT-IYERLDGSKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
I T + R G + P F +ELSP+ V I+E KS H
Sbjct: 387 IKTEVISRRSGQEHSLIEEYEYTAHSSVARSYHYPEAKFHFELSPMQVLISENPKSFSHF 446
Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
T + I G + ++D++ + V+ + K+E+G
Sbjct: 447 ITNVCAIIGGVFTVAGILDSIFQNTVRMVKKIELG 481
>gi|302808800|ref|XP_002986094.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
gi|300146242|gb|EFJ12913.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
Length = 475
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 91/199 (45%), Gaps = 40/199 (20%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL-- 261
GC+I G++ +V G+ I+ ++S +H + ++A N TH++ SFG +L
Sbjct: 284 GCRIEGFIRAKKVPGNIIIS---AHSGSH-------SFDASAMNMTHYVSQFSFGRELNF 333
Query: 262 --QDDDERRKP------------LDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSK 303
+ + R P L G + ++ ++Y++++ T + +R + S
Sbjct: 334 WMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHYLQVVKTEVVSLQKRKEFSL 393
Query: 304 LGGGD----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
L D +P F YELSP+ V + E KS H T + I G +
Sbjct: 394 LEQYDYTSHSNTVQNTNVPVAKFHYELSPMQVLVKENPKSFSHFITNVCAIIGGVFTVAG 453
Query: 354 LVDALLHSCVKKISKVEIG 372
+VD++LH ++ + K+E+G
Sbjct: 454 IVDSMLHGAMRMVKKIELG 472
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/114 (23%), Positives = 60/114 (52%), Gaps = 1/114 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + ++K +D + K D E ++ G ++++ + +L +++ +Y VS+T + V
Sbjct: 1 MTTTSKIKSIDFYRKIPRDLTEASLSGAGLSLIAAFAMIFLFGMELNNYLTVSSTTNVVV 60
Query: 61 DSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
D S+ G L I ++ P +SC++ ++D D+ G ++ + K +D + K
Sbjct: 61 DRSKDGEYLRIQFNMSFPALSCEFASVDVSDALGTNRYNLTKTVRKYPIDPNLK 114
>gi|255714272|ref|XP_002553418.1| KLTH0D16324p [Lachancea thermotolerans]
gi|238934798|emb|CAR22980.1| KLTH0D16324p [Lachancea thermotolerans CBS 6340]
Length = 340
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 71/366 (19%), Positives = 133/366 (36%), Gaps = 74/366 (20%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ DAF K E K+ GG +I+ ++F+ ++ + +F E+ V
Sbjct: 4 LRTFDAFPKTEEQHVRKSSKGGYTSILTYVFLIFIAWSEFGSFFGGYVDEQYGVSKDLRE 63
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+ I++D+ V + C +L + D +G++ L + L ++ P P VN
Sbjct: 64 AVQINMDMFV-HMPCQWLDVIVQDHTGDRKL------VREELKMESIPFFLPFGTAVNER 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + + + +G+E E+++
Sbjct: 117 NEIASLGLDEVLAEAIPGQFRDQIDFGSEDESKEF------------------------- 151
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
GC ++G + VN V G I P V D A
Sbjct: 152 ----------------NGCHVFGTITVNMVKGDLIIIP------RSQSVRDFGRMPPDAI 189
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
N +H I SFG D PLD + E + F+Y+ ++PTI+++L G+++
Sbjct: 190 NLSHVINEFSFGDFYPYID---NPLDRSARITAEHTTSFHYHTSVVPTIFQKL-GAEVNT 245
Query: 307 GDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
+ P I FSY L + I ++ S +++ +S +I +
Sbjct: 246 NQYSLSETKHETPPSGLRVPAIIFSYSFEALTITIRDERISFWQFIVRLVAILS--FIVY 303
Query: 353 MLVDAL 358
++ A
Sbjct: 304 IMTWAF 309
>gi|12321801|gb|AAG50943.1|AC079284_18 hypothetical protein [Arabidopsis thaliana]
Length = 451
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 88/215 (40%), Gaps = 40/215 (18%)
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
+ N ST K K + GC+I GY+ +V G I+ H H + ++
Sbjct: 245 KSDNAASTFK-KAPVSGGCRIEGYVRAKKVPGELVISA-------HSGAHS---FDASQM 293
Query: 247 NTTHHIRHLSFGI----KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
N +H + HL+FG +L D +R P G G S N +Y++I
Sbjct: 294 NMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQI 353
Query: 292 IPT-IYERLDGSKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
I T + R G + P F +ELSP+ V I+E KS H
Sbjct: 354 IKTEVISRRSGQEHSLIEEYEYTAHSSVARSYHYPEAKFHFELSPMQVLISENPKSFSHF 413
Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
T + I G + ++D++ + V+ + K+E+G
Sbjct: 414 ITNVCAIIGGVFTVAGILDSIFQNTVRMVKKIELG 448
>gi|422295540|gb|EKU22839.1| hypothetical protein NGA_0271420 [Nannochloropsis gaditana CCMP526]
Length = 405
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 50/192 (26%)
Query: 195 EKLKNT-FTE----GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTT 249
EK++ T F E GC + G+L VNRV G+FHI Y H++ P + N +
Sbjct: 217 EKIERTLFAEAEHPGCLLSGFLLVNRVPGNFHIEARSKY-------HNLNPTLT---NVS 266
Query: 250 HHIRHLSFGIKLQDD------------DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
H + L+FG + + + R PL V + F++Y+K++ T YE
Sbjct: 267 HVVHDLTFGPPVTREYREKLALLPKGFQQTRSPLADQVYVVSKVHHAFHHYLKVVSTHYE 326
Query: 298 RLDGSKLGGG--------------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
S+ GG D +P FSY++SPL I+ K ++
Sbjct: 327 ---VSRTFGGQKSTVLQYQMVANSQVMHYQDDEVPEAKFSYDISPLATVISSKKRAWYEF 383
Query: 338 WTKIMCNISGTY 349
T +M I GT+
Sbjct: 384 LTSLMAIIGGTF 395
>gi|391338468|ref|XP_003743580.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Metaseiulus occidentalis]
Length = 292
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 63/121 (52%), Gaps = 2/121 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
L+ LD + K D + T +G A+++ C +FI+ L+ + ++F +L+VD+ S
Sbjct: 5 LRRLDVYRKVPADLTQPTYFGAAISVGCIIFITTLLIYETYNFFSPELVSDLYVDNPAPS 64
Query: 67 -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
K+ + L+I +P +SCD + LD D +G + N K L+ DGK K +N
Sbjct: 65 EKIIVFLNISLPKLSCDVVGLDIQDENGRHEVGHIDNTEKTVLN-DGKGCNFVSKFTINK 123
Query: 126 V 126
V
Sbjct: 124 V 124
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 82/201 (40%), Gaps = 33/201 (16%)
Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
+TEK +GC +N+V G+FH++ H QP + +H I
Sbjct: 101 NTEKTVLNDGKGCNFVSKFTINKVPGNFHVS---------THAAKTQP---DDIDMSHEI 148
Query: 253 RHLSFGIKL-----QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG- 306
L+FG +L D L +G +Y +KI+PT+YE G L G
Sbjct: 149 HSLTFGEQLIYELGDDIKGSFNALQNHDRLKADGKESHDYVMKIVPTVYELSSGDSLVGY 208
Query: 307 ---------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
+P I+F Y+L+P+ V+ +++ L T + + GT+
Sbjct: 209 QYTHAHKSYITLSFSAGRIIPAIWFKYDLNPITVRYHRRTQPLYSFLTNVCAIVGGTFTV 268
Query: 352 FMLVDALLHSCVKKISKVEIG 372
+++++ + + K E+G
Sbjct: 269 VGIINSICFTAGEVFRKFEMG 289
>gi|299469370|emb|CBG91903.1| putative PDI-like protein [Triticum aestivum]
gi|299469398|emb|CBG91917.1| putative PDI-like protein [Triticum aestivum]
Length = 485
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++I L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIFAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L I ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRIDFNLSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDRNLVP 115
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 87/203 (42%), Gaps = 42/203 (20%)
Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
T GC+I G++ V +V GS I+ + S +H + + N +H++ SFG +
Sbjct: 291 MTGGCRIEGFVRVKKVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTTFSFGKR 340
Query: 261 LQ----DDDERRKPLDGTVAKAEEGAS------------MFNYYIKIIPTI--------- 295
L ++ +R P G G S +Y++I+ T
Sbjct: 341 LSSKMFNELKRLFPYVGGHHDRLAGQSYIVKHGDVNANVTIEHYLQIVKTELVTLRYAKE 400
Query: 296 ------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
YE S L +P + F +E SP+ V +TE KS H T + I G +
Sbjct: 401 LKVLEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459
Query: 350 ITFMLVDALLHSCVKKISKVEIG 372
++D++LH+ ++ + KVE+G
Sbjct: 460 TVAGILDSILHNTLRLVKKVELG 482
>gi|145536478|ref|XP_001453961.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421705|emb|CAK86564.1| unnamed protein product [Paramecium tetraurelia]
Length = 592
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 79/179 (44%), Gaps = 27/179 (15%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
++ F K ++ + + +GG + ++ + I I ++ + Q T +L VD + S++
Sbjct: 2 INLFPKIQDNQYNRQSWGGLLFLITIICIVVFIWAEITNALQ--GTIQLQVDPAIDSRIR 59
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
++LD V+ C L L+ D G V+H I K R+ D E V+ +
Sbjct: 60 VNLDAVIQA-PCQALTLNIQDMMGSYLQDVQHTIIKTRIVDDNL-------EYVDVKQNV 111
Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
T SCYGAE + C +C +V A+ ++W P ++IVQC
Sbjct: 112 NFT-----------------SCYGAELLIDQKCYSCQDVMMAFAQRRWRQPNFESIVQC 153
>gi|123407515|ref|XP_001303026.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884369|gb|EAX90096.1| hypothetical protein TVAG_396530 [Trichomonas vaginalis G3]
Length = 234
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 102/236 (43%), Gaps = 29/236 (12%)
Query: 147 KCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYS-TEKLKNTFTEGC 205
+CGSCYGA + CCN+C EV +A++ + + P I QC+N +S + L N + C
Sbjct: 14 ECGSCYGA---SNGCCNSCKEVLDAFQKIEKSHPPTAMIQQCRNTFSDADSLIN---DSC 67
Query: 206 QIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG------- 258
+ L V GSF I G + + D N TH S G
Sbjct: 68 TLGITLTVPHTHGSFFITIGQNTTNTSA---DYLGVPKENLNFTHSFDFFSMGGGYHPAQ 124
Query: 259 -----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDGGMPG 313
+K+Q + R K + +A + ++ + T Y+R +PG
Sbjct: 125 ILQNYMKVQKEYGRYKAM--YYIRATRILNDYDTQYSLSVTSYDRYRDES----SDKLPG 178
Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
+F +Y++SPL+++ + + + +M I G + +L+D + + + S++
Sbjct: 179 VFINYDISPLILQYV-LDRPIYQIIIDMMAIIGGIFAFGLLIDNIYLASTLQSSQI 233
>gi|393908150|gb|EJD74929.1| hypothetical protein, variant [Loa loa]
Length = 368
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 153/363 (42%), Gaps = 40/363 (11%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
E ++ DAF K ++ EK GG + + +L I+ L+ ++ +YF VD
Sbjct: 22 EVVRDFDAFNKTVDEVSEEKRATGGFLASLSFLIIAALVFGELQNYFYGDEGHYYRFSVD 81
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
++ + +D++V T + +A +S H N +K D + +KE
Sbjct: 82 TAFSEHPELEVDMIVATPCTNLMAHLTGTAS---HEFNSMNGFK----YDPTRFEFTEKE 134
Query: 122 VV--NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY----RYK 175
+ N +KK + T+ GTT + ++ G E K + +EA+ + K
Sbjct: 135 AMYWNELKKVQHRTKEGTTL--FKSLDEMTFVSGRVEEGLKTEAETKQREEAHAIQLQRK 192
Query: 176 KWALPELD--TIVQCKNEYSTEKLKNTFTE-----GCQIYGYLEVNRVSG-SFHIAPGLS 227
K LD T++ N ++ + + +E C+I+G + VN+V G SF I+ G
Sbjct: 193 KNPKQSLDGGTLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFIISTGKG 252
Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
++ + H S+ N +H I +FG ++ PL G +E G F Y
Sbjct: 253 LDVDGIFAH--FGGVSSPSNISHRIERFNFGPRIYG---LVTPLAGIEQISETGVDEFRY 307
Query: 288 YIKIIPTIYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
++KI+PT R+ S L GG +Y+ S +K T K H I +
Sbjct: 308 FLKIVPT---RIYHSGLFGGST------LTYQYSVTFMKKTPKKDVHKHTAIIIHYEFAA 358
Query: 348 TYI 350
T I
Sbjct: 359 TVI 361
>gi|255074657|ref|XP_002501003.1| predicted protein [Micromonas sp. RCC299]
gi|226516266|gb|ACO62261.1| predicted protein [Micromonas sp. RCC299]
Length = 515
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 86/210 (40%), Gaps = 54/210 (25%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
T GC I G VNRV G+F++ P H H++ P N TH ++HLSFG +
Sbjct: 311 TSGCIIDGSFRVNRVPGAFYVTP-------HSMGHNLNP---DVINMTHTVKHLSFGKHV 360
Query: 262 QD------DDERR------KPLDGTVAK-------AEEGASMFNYYIKIIPTIYERLDGS 302
+ RR K L G A +EE ++ +Y+KI+ +E L+G
Sbjct: 361 PGRPSYVPRNLRRVWNRVPKDLGGRFAAGDEATFYSEEPNTVHEHYLKIVSRTFEPLEGQ 420
Query: 303 KLG-----------------GGDGGM------PGIFFSYELSPLMVKITEKSKSLGHLWT 339
+ DG P I FSY++SP+ V + E K L W
Sbjct: 421 AVQLYEYTFNSNRFRLNPPLAADGDPDQHVDGPMIKFSYDVSPMSVVLKEVKKPLLD-WI 479
Query: 340 KIMCN-ISGTYITFMLVDALLHSCVKKISK 368
MC + G Y L++ L S V + +
Sbjct: 480 LGMCALLGGVYTCAGLLETFLQSSVCAVKR 509
>gi|323449499|gb|EGB05387.1| hypothetical protein AURANDRAFT_31008 [Aureococcus anophagefferens]
Length = 445
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 78/183 (42%), Gaps = 35/183 (19%)
Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
NT GC + G+L VNRV G+FH+ +++S +H + N +H + HLSFG
Sbjct: 266 NTDHPGCLVSGFLLVNRVPGNFHV---MAHSRHH-------SLNTLRTNLSHTVHHLSFG 315
Query: 259 IKLQDDDERR-----------KPLDGTVAKAEEGASMFNYYIKIIPTIY-------ERLD 300
+ L D R+ LDG ++ + +++ I+PT Y +R
Sbjct: 316 VPLTDAQHRKLATIDVRHARTDTLDGEDYYHDDYHYAYQHFVHIVPTKYNLGVFWRDRFA 375
Query: 301 GSK-------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
+ L + P FSY++SP+ V + T ++ + GT+ F
Sbjct: 376 AFQTLHSHHLLKYAEHVPPEARFSYDISPMAVVVDTVRVKWYDFLTSLLAIVGGTFALFK 435
Query: 354 LVD 356
L +
Sbjct: 436 LAN 438
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/104 (24%), Positives = 53/104 (50%)
Query: 9 GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKL 68
+D + K ++ E + GG +++ ++ + ++ + + ++ VD+ GS+L
Sbjct: 1 AMDFYRKVPDELKEASRTGGLLSLCACGVVALTLVTEIGAFLRTEVRTKIDVDTFAGSQL 60
Query: 69 PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
++ ++ P + CDY ++D D G +V NI K +LD DG
Sbjct: 61 RVNFNLSFPHLHCDYASVDLWDKIGRNQANVTQNIEKWQLDEDG 104
>gi|223995687|ref|XP_002287517.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976633|gb|EED94960.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 457
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 89/206 (43%), Gaps = 48/206 (23%)
Query: 189 KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNT 248
++EYS LKN GCQI G+L V+R G+FHI HD+ + + N
Sbjct: 266 ESEYSV--LKNH--PGCQISGFLLVDRAPGNFHIQA-------QSKGHDLAAHMT---NV 311
Query: 249 THHIRHLSFGIK-----LQDDD--------ERRKPLDGTVAKAEEGASMFNYYIKIIPT- 294
+H I HLSFG L+D E KP DG V + ++Y+K+I T
Sbjct: 312 SHIINHLSFGKPFSKYFLKDGLKNTPPGFLETTKPFDGNVYITQNEHEAHHHYLKVITTE 371
Query: 295 -------------------IYERLDGSKLGGGDGGM-PGIFFSYELSPLMVKITEKSKSL 334
Y+ L S+L + P F+Y+LSP+ V +K +
Sbjct: 372 FEPEKGAQNSKYNKKEPSRAYQILQSSQLSLYRSDIVPEAKFTYDLSPIAVSYNKKYRHW 431
Query: 335 GHLWTKIMCNISGTYITFMLVDALLH 360
+T +M I GT+ ++++ +H
Sbjct: 432 YDYFTSLMAIIGGTFTVVGMLESGIH 457
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/95 (23%), Positives = 42/95 (44%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+ LD + K D E T G ++ + ++ L ++ YF + L +DS+
Sbjct: 1 IANLDMYRKVPVDLLEGTRRGSILSTIAIFTMTTLFFLETKAYFSSTLATSLALDSNSDP 60
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH 101
+ ++ +I + + CDY +D V G Q +H
Sbjct: 61 NIRVNFNITMMDLKCDYATIDVVSVLGTQQNVTQH 95
>gi|299115405|emb|CBN74236.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 447
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 86/192 (44%), Gaps = 33/192 (17%)
Query: 196 KLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHL 255
+LK + GCQ+ G++ VNRV G+FHI + +H I P A N +H ++ L
Sbjct: 264 RLKQDY-PGCQLSGFIMVNRVPGNFHIEARSA-------LHSIDP---TAANISHVVKTL 312
Query: 256 SFGIKLQDDDER----------RKPLDGTVAKAEEGASMFNYYIKIIPTI---------- 295
FG ++ R L+ V + + ++YIK++ T
Sbjct: 313 KFGTQVPVRGRRVIESGVELEGLPALEDRVYSIDSLHTAPHHYIKVVSTFVGGLAKTDNL 372
Query: 296 -YERLDGSK-LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
Y+ + S+ + +P FSY+LSP+ V I ++ + T ++ + GT+
Sbjct: 373 QYQMMVSSQTMPYEQDQVPEAKFSYDLSPMSVHIKQRRRKWYDFLTSVLAIVGGTFTVVG 432
Query: 354 LVDALLHSCVKK 365
++D +L VK+
Sbjct: 433 VLDNILFRVVKQ 444
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 56/109 (51%), Gaps = 6/109 (5%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE---ELFVDSS 63
+K D + K D E T+ G AV C LF ++ + +C+ T E + +DS+
Sbjct: 4 IKTFDFYRKIPLDLTETTLQG-AVMSGCALFC--MLILFLCELRAFLTPEVYTTVAIDSN 60
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
+ SKL I+ +I + + CDY ++D +D G +++ NI K D +G
Sbjct: 61 QDSKLRINFNITMLALPCDYASVDVLDLLGTNKVNMTQNIVKWHTDENG 109
>gi|326503558|dbj|BAJ86285.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 60/115 (52%), Gaps = 1/115 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M+ S +LK +D + K D E ++ G ++I L + +L +++ Y V+TT + V
Sbjct: 1 MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIFAALAMVFLFGMELSSYLAVNTTTSVIV 60
Query: 61 D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
D SS G L + ++ P +SC++ ++D D G L++ + K +D + P
Sbjct: 61 DRSSDGEFLRMDFNLSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDRNLVP 115
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 90/203 (44%), Gaps = 42/203 (20%)
Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
T GC+I G++ V +V GS I+ + S +H + + N +H++ SFG +
Sbjct: 291 MTGGCRIEGFVRVKKVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTTFSFGKR 340
Query: 261 LQ----DDDERRKPLDG-----------TVAKAEEGASM-FNYYIKIIPTI--------- 295
L ++ +R P G V + A++ +Y++I+ T
Sbjct: 341 LSSKMFNELKRLFPYVGGHHDRLAGQSYVVKHGDVNANVTIEHYLQIVKTELVTLRYSKE 400
Query: 296 ------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
YE S L +P + F +E SP+ V +TE KS H T + I G +
Sbjct: 401 LKVLEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459
Query: 350 ITFMLVDALLHSCVKKISKVEIG 372
++D++LH+ ++ + KVE+G
Sbjct: 460 TVAGILDSILHNTLRLVKKVELG 482
>gi|123454020|ref|XP_001314836.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121897494|gb|EAY02613.1| hypothetical protein TVAG_260730 [Trichomonas vaginalis G3]
Length = 356
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 86/377 (22%), Positives = 144/377 (38%), Gaps = 54/377 (14%)
Query: 7 LKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS--- 62
+K D F K ++ KT+ GG +TI+ ++ I + + + D E L ++
Sbjct: 1 MKNFDLFPKVKNEYQGVKTISGGIITILTFILIQFSLIFFIKDALNYKIQESLHQNNTIL 60
Query: 63 SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
S ++L + +I V C++L + D SG + K+ LD D P + Q
Sbjct: 61 SGDTELWLSFNITVDA-PCNFLQVYITDESGHHRKQSIRALMKQNLDKDYCPYGDFQ--- 116
Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
T + D +CG CYG + + +CC TC +V + A P L
Sbjct: 117 --------------LFTKNISDNGECGYCYGHKYQ--ECCYTCLDVVYGHIATYRAPPSL 160
Query: 183 DTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
+ I QCK + N + G C + G G I+ + + D
Sbjct: 161 EGISQCKRDL------NFYNNGSKCLLMGSTRTPYAYGQLIISMNSQNQVPKKTLID-NT 213
Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPT-IY-- 296
+ N +H I H FG ++ + PLD + + + + Y + +I T IY
Sbjct: 214 LVTKYLNLSHTIGHFFFG---KESKFIKNPLDSYIQIQNDTKYHQYIYRLSLIQTSIYYP 270
Query: 297 ----------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
L PGI F + + P+ KIT L L + I
Sbjct: 271 DQIFATTQYSAHFSDKILEKKSEERPGIIFKFSIYPINSKITVTKTKLHFLLLSVCSIIG 330
Query: 347 GTYITFMLVDALLHSCV 363
G + ++ +L+HSC+
Sbjct: 331 GGF----MISSLIHSCL 343
>gi|118357982|ref|XP_001012239.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila]
gi|89294006|gb|EAR91994.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila
SB210]
Length = 323
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 80/387 (20%), Positives = 148/387 (38%), Gaps = 91/387 (23%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
+ + DAF K +D + GG +I+ L C + ++ + + +L V S
Sbjct: 2 QSFRKFDAFQKVNQDIDSSSSVGGLFSIIALAIGFILFCHEFQEWNKYTIVRKLEVQSLN 61
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
+ + ++D+ + C ++LD + G+Q L +++ R+ LD
Sbjct: 62 QAIIKANIDLTFFNVPCSLISLDVLYQDGQQVLQ-DYSSTLTRIKLDR------------ 108
Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
+ K++ TE TT E+E N
Sbjct: 109 --QNKEIGTE--TTYVEVEQENS------------------------------------- 127
Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
Q K E E++KN E C+I+G L +N + GSF + + +
Sbjct: 128 --QQKIEEVLEQIKN--KEQCRIHGQLLLNTIPGSFKFRI--------LQMKGLDEQLLK 175
Query: 245 AFNTTHHIRHLSFGIKLQDDD-ERRKPLDGTVAKAEEGASMFNY--------YIKIIPTI 295
N H I LSFG ++ E+ LD + ++A + S +NY YIKI+P
Sbjct: 176 QLNINHKINKLSFGDTIKTKKIEKVLGLDKSDSEAFD-ESRYNYEYRCSYDNYIKILPLN 234
Query: 296 --------YERLDGSKLGGGDGGMPG-------IFFSYELSPLMVKITEKSKSLGHLWTK 340
Y R + + +P + F+Y++SP+ + K+KS +
Sbjct: 235 AENIKELGYIRTNSFRFTMYQQVIPKEQTDIIEVSFNYQVSPINIVYQTKNKSFYSFVVQ 294
Query: 341 IMCNISGTYITFMLVDALLHSCVKKIS 367
+ I G + F +++ L+ + + I+
Sbjct: 295 VCAIIGGIFCVFGVINTLVLNIISSIN 321
>gi|170588701|ref|XP_001899112.1| hypothetical protein [Brugia malayi]
gi|158593325|gb|EDP31920.1| conserved hypothetical protein [Brugia malayi]
Length = 430
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 161/397 (40%), Gaps = 52/397 (13%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
E ++ DAF K ++ EK GG + + +L I+ L+ ++ +YF VD
Sbjct: 23 EVVRDFDAFNKTVDEVSEEKRAAGGFLASLSFLIIAALVFGELRNYFYGDEGHYYRFSVD 82
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
++ + LD++V T + +A +S H N +K D + +KE
Sbjct: 83 TAFSEHPELELDMIVATPCTNLMAHLTGTTS---HEFSSMNEFKH----DPTRFEFTEKE 135
Query: 122 VV--NAVKKKKVTTENGTT-------TTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
+ N +KK + T+ GTT T + + G AET+ R+ + K+
Sbjct: 136 AMYWNELKKVQHRTKEGTTLFKSLDEMTFISGQVEEGLKNEAETKQREEAHAIQLEKKKN 195
Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSG-SFHIAPGLSYS 229
+ L I N + + EG C+I+G + VN+V G SF ++ G
Sbjct: 196 PKESMDGGMLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLG 255
Query: 230 INHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
++ + H S N +H I +FG + PL G +E G F Y++
Sbjct: 256 VDGIFAH--FGGVSNPGNLSHRIERFNFGPTIYG---LVTPLAGIEQISETGIDEFRYFL 310
Query: 290 KIIPTIYERLDGSKLGGGDG-------------------GMPGIFFSYELSPLMVKITEK 330
K++PT R+ S L GG I YE + ++++
Sbjct: 311 KVVPT---RIYHSGLFGGSTLTYQYSVTFMKKTPKKDVHKHAAIVIHYEFAATVIEVRRI 367
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
SL + ++ + G + T +L++++ CV+ ++
Sbjct: 368 QSSLLQMLIRLCSAVGGVFATSVLLNSI---CVRVLT 401
>gi|225461068|ref|XP_002281649.1| PREDICTED: protein disulfide isomerase-like 5-4 [Vitis vinifera]
gi|297735969|emb|CBI23943.3| unnamed protein product [Vitis vinifera]
Length = 482
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 88/199 (44%), Gaps = 39/199 (19%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
T GC+I G++ V +V G+ I+ + S +H + + N +H I HLSFG K+
Sbjct: 292 TGGCRIEGFVRVKKVPGNLVIS---ARSGSH-------SFDPSQMNMSHVISHLSFGRKI 341
Query: 262 ----QDDDERRKPLDGTVAKAEEGASMFNY------------YIKIIPTI---------- 295
D +R P G G S ++ Y++++ T
Sbjct: 342 APRVMSDMKRVLPYIGGSHDRLNGRSYISHPSDSNANVTIEHYLQVVKTEVITTRDHKLV 401
Query: 296 --YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
YE S L +P F +ELSP+ V +TE KS H T + I G +
Sbjct: 402 EEYEYTAHSSLVQS-LYIPVAKFHFELSPMQVLVTENRKSFWHFITNVCAIIGGVFTVAG 460
Query: 354 LVDALLHSCVKKISKVEIG 372
++D++LH+ ++ + K+E+G
Sbjct: 461 ILDSVLHNTMRLMKKIELG 479
>gi|224013160|ref|XP_002295232.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969194|gb|EED87536.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 488
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 86/193 (44%), Gaps = 38/193 (19%)
Query: 195 EKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRH 254
E+ + GC I G+L VNRV G F I + S+NH +H SA N TH +
Sbjct: 297 EEFEEDHHPGCLISGHLMVNRVPGRFQIE---ARSVNH-ELH------SAMTNLTHRVHD 346
Query: 255 LSFGI----------------KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---- 294
L+FG + + + P+ E F++++KII T
Sbjct: 347 LTFGALSGPPGHMLHVLPFFDTVPEKYKHTNPMQDKYYPTYEFHQAFHHHLKIISTHIDY 406
Query: 295 -------IYERLDGSKLGG-GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
+Y+ L+ S+L + +P I FS++LSP+ V ++++ + T + I
Sbjct: 407 LFSRSTVLYQILEQSQLVFYEEVNVPEIQFSFDLSPMSVNVSKEGRKWYEYVTSLCAIIG 466
Query: 347 GTYITFMLVDALL 359
GTY T L++A L
Sbjct: 467 GTYTTLGLINATL 479
>gi|426372082|ref|XP_004052960.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Gorilla gorilla
gorilla]
Length = 354
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 64/151 (42%), Gaps = 22/151 (14%)
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
H H ++N +H I HLSFG + PLDGT A + MF Y+I ++P
Sbjct: 176 HAHLAALVNHESYNFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVP 232
Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
T + ER G G+ GIF Y+LS LMV +TE+ +
Sbjct: 233 TKLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFF 292
Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKV 369
++ + G + T +LH K I ++
Sbjct: 293 VRLCGIVGGIFST----TGMLHGIGKFIVEI 319
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 59/123 (47%), Gaps = 2/123 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 22 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 82 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139
Query: 127 KKK 129
+ +
Sbjct: 140 QSR 142
>gi|313227239|emb|CBY22386.1| unnamed protein product [Oikopleura dioica]
Length = 380
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 89/382 (23%), Positives = 153/382 (40%), Gaps = 75/382 (19%)
Query: 3 FSERLKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE-ELFV 60
F E+ + LDAFTK E+ +T +GG T+V + + L+ ++ +F + + E V
Sbjct: 21 FLEKFRELDAFTKITEEAESPQTSHGGVCTMVTFTIMLLLLLGEMTVWFTTTKIKYEFDV 80
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
DS SK+ +++DI + C ++ + VDSSG+ Y +L D + ++
Sbjct: 81 DSEYESKMHLNMDITFNS-PCHMISAEIVDSSGDAW------GYSFQLQEDAADFELTKE 133
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVK--EAYRYKKWA 178
+ + K K+ + DPN + ++VK E R K
Sbjct: 134 KALERAKLLKM-------KESMTDPNM----------RDQLLREGHDVKHLEFSRKKNKK 176
Query: 179 LPE---LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP----------G 225
+ E + +VQ L +GC+++G +E+ +++G+ I G
Sbjct: 177 MMEQGMMHKVVQI-------NLDPNEPQGCRVWGSVELQKIAGTIKIQAGGFGGMGGIPG 229
Query: 226 LSYSINHVHVH----------DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV 275
LS ++ + IQ A F +H I H SFG LDG +
Sbjct: 230 LSGGLDAIMGMFMMPMMGMGAQIQDGKKANF--SHRIDHFSFG---DPSSGLVYGLDGDI 284
Query: 276 AKAEEGASMFNYYIKIIPT----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMV 325
E+ Y +K++PT Y+ +G D P + Y+ S L V
Sbjct: 285 QIQEKENDDTTYVVKVVPTDLKTFKFQQKAYQYAVTQHVGKSD--KPAVTIKYDFSGLGV 342
Query: 326 KITEKSKSLGHLWTKIMCNISG 347
ITE +S L T++ + G
Sbjct: 343 SITEYRESFVGLLTRLAGILGG 364
>gi|303279378|ref|XP_003058982.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460142|gb|EEH57437.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 486
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 58/111 (52%), Gaps = 1/111 (0%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-R 64
+L+ +D + K D E TV G ++I L I+ L+ ++ Y + ++ VD S
Sbjct: 8 KLRSVDFYRKIPRDMSEGTVPGSVISIGSALLIALLLVSEIGRYATPTWKTKVVVDRSLD 67
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
G + I+ ++ P +SC++ ++D D+ G ++ ++KR L DG P+
Sbjct: 68 GDMMKINFNVSFPALSCEFASVDVGDAMGLNRYNLTKTVFKRALARDGTPL 118
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/226 (21%), Positives = 85/226 (37%), Gaps = 40/226 (17%)
Query: 177 WALPELD-----TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
W + E D +V + E ++ GC + G++ +V PG +
Sbjct: 268 WKIEEADKTESRAVVTREEALRHESVRAVKGPGCSVTGFVLAKKV-------PGHVWITA 320
Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDD----ERRK---------PLDGTVAKA 278
+ + H P N TH + HL FG +L + ERR+ L G ++
Sbjct: 321 NSNSHSFHP---EEMNMTHTVNHLFFGNQLGRNKLKALERRERGASSNWHDKLAGVTFRS 377
Query: 279 EEGASMFNYYIKIIPTI------------YERLDGSKLGGGDGGMPGIFFSYELSPLMVK 326
+ +Y++ + T YE S +P F + SP+ V
Sbjct: 378 LQTNVTHEHYLQTVLTTLRPAGSYVAYHAYEYTQHSHALVTTRELPRAKFHFNPSPVQVV 437
Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+TE+ + H T +M + G Y + D +H+ + + K E+G
Sbjct: 438 VTEEREPFYHFITTLMAIVGGVYSVCGIADGFVHNTLNMMRKFELG 483
>gi|402595088|gb|EJW89014.1| hypothetical protein WUBG_00081 [Wuchereria bancrofti]
Length = 578
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 161/397 (40%), Gaps = 52/397 (13%)
Query: 5 ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
E ++ DAF K ++ EK GG + + +L I+ L+ ++ +YF VD
Sbjct: 172 EVVRDFDAFNKTVDEVSEEKRATGGFLASLSFLIIAALVFGELRNYFYDGEGHYYRFSVD 231
Query: 62 SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
++ + LD++V T C L ++ + V N +K D + +KE
Sbjct: 232 TAFSEHPELELDMIVAT-PCTNLMAHLTGTTSHEFSSV--NEFKH----DPTRFEFTEKE 284
Query: 122 VV--NAVKKKKVTTENGTT-------TTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
+ N +KK + T+ GTT T + + G AET+ R+ + K+
Sbjct: 285 AMYWNELKKVQHRTKEGTTLFKSLDEMTFISGQVEEGLKNEAETKQREEAHAIQLEKKKN 344
Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSG-SFHIAPGLSYS 229
+ L I N + + EG C+I+G + VN+V G SF ++ G
Sbjct: 345 PKESMDGGMLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLG 404
Query: 230 INHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
++ + H S N +H I +FG + PL G +E G F Y++
Sbjct: 405 VDGIFAHF--GGLSNPGNVSHRIERFNFGPTIYG---LVTPLAGIEQISETGMDEFRYFL 459
Query: 290 KIIPTIYERLDGSKLGGGDG-------------------GMPGIFFSYELSPLMVKITEK 330
K++PT R+ S L GG I YE + ++++
Sbjct: 460 KVVPT---RIYHSGLFGGSTLTYQYSVTFMKKTPKKDVHKHAAIIIHYEFAATVIEVRRI 516
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
SL + ++ + G + T +L++++ CV+ ++
Sbjct: 517 QSSLLQMLIRLCSAVGGVFATSVLLNSI---CVRVLT 550
>gi|365759132|gb|EHN00939.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|401842937|gb|EJT44934.1| ERV41-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 285
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 76/175 (43%), Gaps = 33/175 (18%)
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINH-VHVHDIQPYTSAAFN 247
+E +K K GC I+G + VNRVSG I A G Y+ +H + D+ N
Sbjct: 79 DENDPDKAKLLDFNGCHIFGSVPVNRVSGVLQITAKGFGYADSHRASLEDL--------N 130
Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGSKLG- 305
H I SFG D PLD T +E + + YY ++PT++++L G+++
Sbjct: 131 FAHVINEFSFGDFYPYID---NPLDNTAQFDQDEPLTTYLYYTSVVPTLFKKL-GAEVDT 186
Query: 306 -----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
G+ +PGIFF Y PL + +++ S +++
Sbjct: 187 NQYSVNDYRYLNKDSSVKGNRRVPGIFFKYNFEPLSIVVSDVRISFIQFLVRLVA 241
>gi|344250048|gb|EGW06152.1| UPF0474 protein C5orf41-like [Cricetulus griseus]
Length = 745
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 66/148 (44%), Gaps = 29/148 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
GC+ G +N+V G+FH++ H QP + TH I LSFG +++
Sbjct: 133 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 180
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
Q+ L G + +Y +KI+PT+YE G + +
Sbjct: 181 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 240
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSL 334
G +P I+F Y+LSP+ VK TE+ + L
Sbjct: 241 TGRIIPAIWFRYDLSPITVKYTERRQPL 268
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 50/101 (49%), Gaps = 4/101 (3%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
L D + K +D + T G ++I C LFI +L ++ + EL+VD
Sbjct: 24 LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 83
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
G K+ + L+I +P + C+ + LD D G + H+++++
Sbjct: 84 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 124
>gi|308807242|ref|XP_003080932.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
gi|116059393|emb|CAL55100.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
Length = 533
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 65/125 (52%), Gaps = 6/125 (4%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-RG 65
++G+D + K +F E T+ G ++I+ + + YL ++ Y S ++ VD S G
Sbjct: 26 IRGMDFYRKVPREFSEGTLGGSIISILSAVLMLYLFLSELGKYSTSSFETKVVVDRSVDG 85
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-----K 120
L I+ ++ P +SC++ ++D D+ G ++ ++KR +D + PI Q K
Sbjct: 86 ELLRINFNLSFPALSCEFASVDVGDALGLNRFNLTKTVFKRAIDAEMNPIGPLQWDRAVK 145
Query: 121 EVVNA 125
EV+ A
Sbjct: 146 EVLKA 150
>gi|145549492|ref|XP_001460425.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428255|emb|CAK93028.1| unnamed protein product [Paramecium tetraurelia]
Length = 320
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/398 (21%), Positives = 142/398 (35%), Gaps = 115/398 (28%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG- 65
LK +D + K + E T G V+I+ + ++ +I + +Y + E+ VD
Sbjct: 4 LKSIDLYGKVPKGLAEPTSSGAVVSIITLILLALMIINEGIEYITIDVQSEIIVDQKLSK 63
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
++ ++LDI CD+L +D D+ G+ + RLD + + I E
Sbjct: 64 DRVQVNLDIKFIKAPCDFLEIDQQDAMGQSLSQQFMELKYYRLDSNERRISE-------- 115
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
T N E+ED R N
Sbjct: 116 ------YTRNSNNWVEIED-------------ARTAIN---------------------- 134
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
EK +GC++ G L+VNRV G SYS Y A
Sbjct: 135 ---------EK------QGCEVIGNLKVNRVRGKISFGAHRSYS-----------YIGAV 168
Query: 246 FNT------THHIRHLSFGIKLQDDDERRK--------PLD---GT--VAKAEEGASMFN 286
N +H SFG D+D +K LD GT + K E +
Sbjct: 169 GNLNLPLDYSHKFVSFSFG----DEDALKKVKSLFQQGQLDSFAGTQRIKKPELASQSMQ 224
Query: 287 --YYIKIIPTIYERLDG------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSK 332
++I IIPT Y L+ +++ + G + Y+ +P V + +
Sbjct: 225 HEHFISIIPTHYTLLNKQVYSVYQYTANHNEVRSNNYG--NVQLRYDFAPTTVTYWQTKE 282
Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
+ H + +I I G + +++A ++ ++ + KVE
Sbjct: 283 DILHFYVQICAVIGGIFTVSSMIEACVYKVMRMLLKVE 320
>gi|145543941|ref|XP_001457656.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425473|emb|CAK90259.1| unnamed protein product [Paramecium tetraurelia]
Length = 322
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/381 (20%), Positives = 144/381 (37%), Gaps = 87/381 (22%)
Query: 7 LKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
L+ LD F K D + + GG +T++ + ++ + +F + +D+
Sbjct: 3 LRQLDFFRKLNTDIGDTSSSLGGFLTMIAFALVTIFTMNECRLFFSTELNYQTVIDNDTE 62
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ ++LD +V C L+LD D G + V N+ K LD
Sbjct: 63 QFIKVYLDAIVGA-PCMVLSLDQQDEVGVHVMDVSGNLKKIALD---------------- 105
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
K++ V T E PN GS EL
Sbjct: 106 -KERHVLP----TIDNNERPNYRGSD----------------------------QELVDA 132
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQPYTSA 244
++ N+ E CQ G+ VN+V G+FHI+ + I +H D+ Y
Sbjct: 133 IEAINQ----------GEQCQFKGFFSVNKVPGNFHISYHAHHHLIQRIHQRDLSTYRKL 182
Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLD--------GTVAK-AEEGASM-FNYYIKIIP- 293
+ H I L FG ++ P ++AK A EG + YYI +P
Sbjct: 183 KLD--HTIYELRFGDNSSSFKMKKYPKSLQKFQSSWNSIAKTAPEGEKQDYEYYINALPV 240
Query: 294 -----------TIYE-RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
T+Y+ ++ +++ + I+F Y++SP+ + + + KS+ H ++
Sbjct: 241 RFYDDKERNYQTLYKYSINEAQMTRSFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQL 300
Query: 342 MCNISGTYITFMLVDALLHSC 362
+ + G + +V++++
Sbjct: 301 LAIVGGVFAVIGIVNSIIQKA 321
>gi|396485364|ref|XP_003842153.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
gi|312218729|emb|CBX98674.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
Length = 486
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 83/204 (40%), Gaps = 46/204 (22%)
Query: 202 TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
T+ C+I+G +E N+V G FHI A G Y VH+ FN +H IR LSFG
Sbjct: 268 TDSCRIFGSIEGNKVQGDFHITARGHGYIEYGVHL------DHKTFNFSHIIRELSFGPY 321
Query: 261 LQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIY-------ERLDGSKLGGGDGG 310
PLD T+A ++ F Y++ I+PTIY LD G +
Sbjct: 322 YP---SLTNPLDNTIAITPTPDDHFYKFQYFLSIVPTIYTDDPSLIPYLDILNRYGKNPD 378
Query: 311 M--------------------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
+ PG+F +++ P+M+ + E+ L +++
Sbjct: 379 LFNSAHAVKTNQYAVTSQSHPVSEYYVPGVFVKFDIEPIMLNVVEEWGGFWRLLVRLVNV 438
Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
ISG + L+ ++ + +
Sbjct: 439 ISGVMVAGSWAWQLMDWAIEVMGR 462
>gi|323303637|gb|EGA57425.1| Erv41p [Saccharomyces cerevisiae FostersB]
Length = 284
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 28/158 (17%)
Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
GC I+G + VNRVSG I A L Y + P FN H I SFG
Sbjct: 93 GCHIFGSIPVNRVSGELQITAKSLXYVASRK-----APLEELKFN--HVINEFSFGDFYP 145
Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
D PLD T +E + + YY ++PT++++L D ++ D
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202
Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
MPGIFF Y PL + +++ S +++
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDXRLSFIQFLVRLVA 240
>gi|302800507|ref|XP_002982011.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
gi|300150453|gb|EFJ17104.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
Length = 476
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/199 (23%), Positives = 83/199 (41%), Gaps = 39/199 (19%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+I G++ +V PG H H + ++A N TH++ +FG +L
Sbjct: 284 GCRIEGFIRAKKV------VPGNIIISAHSGSHS---FDASAMNMTHYVSQFTFGRELNF 334
Query: 264 DDERR----------------KPLDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSK 303
R L G + ++ ++Y++++ T + +R + S
Sbjct: 335 WMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHYLQVVKTEVVSLRKRKEFSL 394
Query: 304 LGGGD----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
L D +P F YELSP+ V + E KS H T + I G +
Sbjct: 395 LEQYDYTSHSNTIQNTNVPVAKFHYELSPMQVLVKENPKSFSHFITNVCAIIGGVFTVAG 454
Query: 354 LVDALLHSCVKKISKVEIG 372
+VD++LH ++ + K+E+G
Sbjct: 455 IVDSMLHGAMRMVKKIELG 473
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/114 (23%), Positives = 60/114 (52%), Gaps = 1/114 (0%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
M + ++K +D + K D E ++ G ++++ + +L +++ +Y VS+T + V
Sbjct: 1 MTTASKIKSIDFYRKIPRDLTEASLSGAGLSLIAAFAMIFLFGMELNNYLTVSSTTNVVV 60
Query: 61 DSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
D S+ G L I ++ P +SC++ ++D D+ G ++ + K +D + K
Sbjct: 61 DRSKDGEYLRIQFNMSFPALSCEFASVDVSDALGTNRYNLTKTVRKYPIDPNLK 114
>gi|357474783|ref|XP_003607677.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355508732|gb|AES89874.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 156
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 64/153 (41%), Gaps = 26/153 (16%)
Query: 246 FNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFN-----------YYIK 290
N +H I HLSFG K+ D + P G G S N +YI+
Sbjct: 1 MNMSHVINHLSFGKKVTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQ 60
Query: 291 IIPTIYERLDGSKL-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
++ T G KL +P F ELSP+ V ITE KS H T
Sbjct: 61 VVKTEVITRKGYKLIEEYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFIT 120
Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ I G + ++D++LH+ +K + K+EIG
Sbjct: 121 NVCAIIGGVFTVAGILDSILHNTIKAMKKIEIG 153
>gi|339233696|ref|XP_003381965.1| conserved hypothetical protein [Trichinella spiralis]
gi|316979152|gb|EFV61980.1| conserved hypothetical protein [Trichinella spiralis]
Length = 331
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 77/187 (41%), Gaps = 23/187 (12%)
Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI-QPYTSAAFNT 248
N STE + C+I+GY +N++ G I + + V I + FN
Sbjct: 134 NASSTE---TAIVDACRIHGYFLMNKLRGKLRIKFKETVRLEAVSNFIIFARRQNEGFNF 190
Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL---- 304
+H I FG ++ PLDG ++ + MF YYI+++PT L+G +
Sbjct: 191 SHRIEKFGFGPRIAGIIN---PLDGFQKESFDRRDMFYYYIQVVPTKITDLNGMETFTSQ 247
Query: 305 ------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
G G GIF ++ +P+MV I + SL +I + G +
Sbjct: 248 YSVTHKRRIIDHDQGSHGSCGIFIYFDFAPMMVLIRKSKTSLFVFALRICAIVGGIFACT 307
Query: 353 MLVDALL 359
+ AL+
Sbjct: 308 DFIIALM 314
>gi|167383125|ref|XP_001736415.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901233|gb|EDR27345.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 116
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 16/107 (14%)
Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG----------------SKLGGGDGGMP 312
P+DG V SM+ Y+++++P Y LD L + G+P
Sbjct: 3 NPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNRIINTNGYSVTEHYRPGNLKSPEQGIP 62
Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
G+F Y++S + V E+ S GHL T I I G + F L+D +
Sbjct: 63 GVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFI 109
>gi|365991164|ref|XP_003672411.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
gi|343771186|emb|CCD27168.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
Length = 341
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 121/297 (40%), Gaps = 64/297 (21%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
+L DAF K E+ +K+ GG +I+ +LF+ ++I +V YF ++ VD
Sbjct: 3 KLGAFDAFPKTEEEHVKKSTRGGLSSILTYLFLLFMIYNEVGRYFGGFIEQQYIVDIEIQ 62
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
+ I+ DI + T +CD + + VD + + +++ ++ + + P +N
Sbjct: 63 ERAQINFDIFLNT-TCDLIDVRIVDLTSD---NMKRSV-SDEISFEDLTFYIPYGTRINI 117
Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCY--GAETETRKCCNTCNEVKEAYRYKKWALPELD 183
+ NG TTE ++ Y G + R PE D
Sbjct: 118 L--------NGIYTTEFDEVLTQAIPYEFGMRIDERP-------------------PEDD 150
Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
+ N C ++G ++VNR+ G I+ + +IN
Sbjct: 151 -------------MPN--INACHLFGSVDVNRLPGILEISTNSTGNIND---------NG 186
Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGASMFNYYIKIIPTIYERL 299
+F H I LSFG D PLD T ++ + ++YY+ +IPTIYE+L
Sbjct: 187 KSF--AHVINELSFGEFFPFID---NPLDNTAKVLPDQPLTTYSYYLTVIPTIYEKL 238
>gi|397568493|gb|EJK46164.1| hypothetical protein THAOC_35181 [Thalassiosira oceanica]
Length = 480
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/233 (23%), Positives = 100/233 (42%), Gaps = 48/233 (20%)
Query: 172 YRYKKWALPEL--DTIVQCKNEYSTEKL-------KNTFTEGCQIYGYLEVNRVSGSFHI 222
Y + + +P+ D V EY+T +L GCQ+ G+L VNRV G+ H+
Sbjct: 258 YEHGRAVMPDYKGDRTVGALVEYATRRLGEGQEDESEDHHPGCQVSGHLMVNRVPGNLHM 317
Query: 223 -APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK----------------LQDDD 265
A + + IN SA N TH + HLSFG + + D+
Sbjct: 318 EAKSIHHEIN-----------SAMTNLTHRVDHLSFGDERGPQGHFLDRFAFLGGVPDEF 366
Query: 266 ERRKPLDGTVAKAEEGASMFNYYIKIIPT----------IYERLDGSKLGGGD-GGMPGI 314
+ P+ G + + F++++K++ T +Y+ L S+L + +P I
Sbjct: 367 KHTNPMKGRLFQTHRFHESFHHHLKVVTTTIDYLFRPTALYQILAESQLVLYELQEVPEI 426
Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
F +++SP+ +++ + + T + + G Y + L++ L + K S
Sbjct: 427 KFLWDMSPMGIEVDVERRPWYDYITTCLAIVGGAYASLGLINRALLAMFKPKS 479
>gi|323307814|gb|EGA61076.1| Erv41p [Saccharomyces cerevisiae FostersO]
Length = 284
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 28/158 (17%)
Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
GC I+G + VNRVSG I A L Y + P FN H I SFG
Sbjct: 93 GCHIFGSIPVNRVSGELQITAKSLXYVASRK-----APLEELKFN--HVINEFSFGDFYP 145
Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
D PLD T +E + + YY ++PT++++L D ++ D
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202
Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
MPGIFF Y PL + +++ S +++
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDIRLSFIQFLVRLVA 240
>gi|351707253|gb|EHB10172.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Heterocephalus glaber]
Length = 211
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 56/129 (43%), Gaps = 19/129 (14%)
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
H H ++N +H I HLSFG + PLDGT A + MF Y+I ++P
Sbjct: 80 HAHLAALVNHDSYNFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITVVP 136
Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
T + ER G G+ GIF Y+LS LMV +TE+ +
Sbjct: 137 TKLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFF 196
Query: 339 TKIMCNISG 347
+ +C I G
Sbjct: 197 VR-LCGIVG 204
>gi|207342541|gb|EDZ70277.1| YML067Cp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|323336174|gb|EGA77445.1| Erv41p [Saccharomyces cerevisiae Vin13]
gi|323347070|gb|EGA81345.1| Erv41p [Saccharomyces cerevisiae Lalvin QA23]
Length = 284
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 28/158 (17%)
Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
GC I+G + VNRVSG I A L Y + P FN H I SFG
Sbjct: 93 GCHIFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELKFN--HVINEFSFGDFYP 145
Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
D PLD T +E + + YY ++PT++++L D ++ D
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202
Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
MPGIFF Y PL + +++ S +++
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 240
>gi|313241668|emb|CBY33893.1| unnamed protein product [Oikopleura dioica]
Length = 380
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 88/382 (23%), Positives = 152/382 (39%), Gaps = 75/382 (19%)
Query: 3 FSERLKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE-ELFV 60
F E+ + LDAFTK E+ +T +GG T+ + + L+ ++ +F + + E V
Sbjct: 21 FLEKFRELDAFTKITEEAESPQTSHGGVCTMFTFTIMLLLLLGEMTVWFTTTKIKYEFDV 80
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
DS SK+ +++DI + C ++ + VDSSG+ Y +L D + ++
Sbjct: 81 DSEYESKMHLNMDITFNS-PCHMISAEIVDSSGDAW------GYSFQLQEDAADFELTKE 133
Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVK--EAYRYKKWA 178
+ + K K+ + DPN + ++VK E R K
Sbjct: 134 KALERAKLLKM-------KESMTDPNM----------RDQLLREGHDVKHLEFSRKKNKK 176
Query: 179 LPE---LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP----------G 225
+ E + +VQ L +GC+++G +E+ +++G+ I G
Sbjct: 177 MMEQGMMHKVVQI-------NLDPNEPQGCRVWGSVELQKIAGTIKIQAGGFGGMGGIPG 229
Query: 226 LSYSINHVHVH----------DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV 275
LS ++ + IQ A F +H I H SFG LDG +
Sbjct: 230 LSGGLDAIMGMFMMPMMGMGAQIQDGKKANF--SHRIDHFSFG---DPSSGLVYGLDGDI 284
Query: 276 AKAEEGASMFNYYIKIIPT----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMV 325
E+ Y +K++PT Y+ +G D P + Y+ S L V
Sbjct: 285 QIQEKENDDTTYVVKVVPTDLKTFKFQQKAYQYAVTQHVGKSD--KPAVTIKYDFSGLGV 342
Query: 326 KITEKSKSLGHLWTKIMCNISG 347
ITE +S L T++ + G
Sbjct: 343 SITEYRESFVGLLTRLAGILGG 364
>gi|169614774|ref|XP_001800803.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
gi|111060809|gb|EAT81929.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
Length = 404
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 76/382 (19%), Positives = 138/382 (36%), Gaps = 90/382 (23%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
DAF K + + + A T+ L YL ++ ++ STT+ V+ +
Sbjct: 25 FDAFPKTKKTYLVQGRNSSAWTVTLILTCIYLSWSEITRWYAGSTTQSFSVEKGVSHDMQ 84
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
I+LDI+V ++C L ++ D++G+ + + + +
Sbjct: 85 INLDIIV-AMNCHDLRVNMQDAAGD-------------------------RTLAGDLLRN 118
Query: 130 KVTTENGTTTTELEDP-NKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
T + T ++E ++ G G + + ++ +A + K
Sbjct: 119 DPTNWSQWTGRKMEKGMHELGKDDGVNPGWEELWDVHEQLGKAKKRK------------- 165
Query: 189 KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFN 247
+S + C+I+G L+ N+V G FHI A G Y Q FN
Sbjct: 166 ---FSKTPRVRGAPDACRIFGSLDGNKVQGDFHITARGHGY-----QEFGEQHLDHKTFN 217
Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVAKA---EEGASMFNYYIKIIPTIYERLDG--- 301
+H IR +SFG PLD T+A ++ F YY+ I+PTIY G
Sbjct: 218 FSHIIREMSFGPYYP---SLTNPLDNTIATTPTDQDHFYKFQYYLSIVPTIYTDNPGLLP 274
Query: 302 --------------------------------SKLGGGDGGMPGIFFSYELSPLMVKITE 329
+ +PG+F +++ P+M+ + E
Sbjct: 275 LLESVNRDPSAHPAKSIFSTHAIKTNQYAVTSQSHTVPENYVPGVFVKFDIEPIMLAVVE 334
Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
+ L +I+ +SG +
Sbjct: 335 EWGGFWRLLVRIVNVVSGVMVA 356
>gi|159464951|ref|XP_001690702.1| hypothetical protein CHLREDRAFT_180779 [Chlamydomonas reinhardtii]
gi|158270379|gb|EDO96229.1| predicted protein [Chlamydomonas reinhardtii]
Length = 656
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 78/194 (40%), Gaps = 35/194 (18%)
Query: 208 YGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP------YTSAAFNTTHHIRHLSFGIKL 261
Y +V RV+G H+ S++ V + P + N +H I+HL FG
Sbjct: 84 YHTPQVKRVAGRLHL------SVHQNMVFQMLPQLLGTHHIPKILNMSHVIKHLGFGPHY 137
Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGG-------------- 307
+ PLDG V + Y++K++PT Y ++LG
Sbjct: 138 PG---QLNPLDGYVRMVGREPFSYKYFLKVVPTEYY----NRLGRATETHQYSVTEYAQP 190
Query: 308 --DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
G P + Y+LSP+++ I E+ SL H ++ + G + L D + V+
Sbjct: 191 LQRGYAPAVDVHYDLSPIVMTINERPPSLLHFVVRLCAVVGGVFAITRLTDRWVDWLVRL 250
Query: 366 ISKVEIGGKTVTKR 379
++K G + R
Sbjct: 251 VNKAAARGPSFVDR 264
>gi|558407|emb|CAA86253.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 284
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 66/158 (41%), Gaps = 28/158 (17%)
Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
GC ++G + VNRVSG I A L Y + P FN H I SFG
Sbjct: 93 GCHVFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELKFN--HVINEFSFGDFYP 145
Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
D PLD T +E + + YY ++PT++++L D ++ D
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202
Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
MPGIFF Y PL + +++ S +++
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 240
>gi|323332255|gb|EGA73665.1| Erv41p [Saccharomyces cerevisiae AWRI796]
gi|323352959|gb|EGA85259.1| Erv41p [Saccharomyces cerevisiae VL3]
gi|365763687|gb|EHN05213.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 250
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 67/157 (42%), Gaps = 26/157 (16%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC I+G + VNRVSG I + S+ +V P FN H I SFG
Sbjct: 59 GCHIFGSIPVNRVSGELQIT---AKSLGYVASRK-APLEELKFN--HVINEFSFGDFYPY 112
Query: 264 DDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD---------- 308
D PLD T +E + + YY ++PT++++L D ++ D
Sbjct: 113 ID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVAA 169
Query: 309 --GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
MPGIFF Y PL + +++ S +++
Sbjct: 170 KGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 206
>gi|255082155|ref|XP_002508296.1| predicted protein [Micromonas sp. RCC299]
gi|226523572|gb|ACO69554.1| predicted protein [Micromonas sp. RCC299]
Length = 507
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 46/221 (20%), Positives = 88/221 (39%), Gaps = 44/221 (19%)
Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
DT + + T+ +K GC + G++ V +V G + A S+S +
Sbjct: 297 DTELAIRQPVETQTVKKIDGPGCSVTGFVLVKKVPGHLWVTATSKSHS-----------F 345
Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLD-------------------GTVAKAEEGA 282
+ + N +H + H FG +L +R++ LD GT E+
Sbjct: 346 HAESMNMSHVVHHFYFGQQLTP--QRKRYLDRFHSREKDPKGDWHDKLAGGTFTSEEDNV 403
Query: 283 SMFNYYIKIIPTI-----------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKS 331
+ +Y ++ TI YE S + +P F ++ SP+ + ++E+
Sbjct: 404 THEHYLQTVLTTIKPSGSPAPFNVYEYTQHSHSLRSEKELPRAKFHFDPSPVQISVSEER 463
Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ H T +M + G Y + D +H+ ++ K E+G
Sbjct: 464 QKFYHFITTLMAIVGGVYSVMGIADGFVHNSIQAWKKKELG 504
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/139 (21%), Positives = 67/139 (48%), Gaps = 1/139 (0%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD-SSR 64
+ K +D + K +D E T+ G ++++ L I L+ +V Y + +D S+
Sbjct: 8 KFKNVDFYRKIPKDMTEGTIPGSVISMLAALVIGLLLVSEVGSYLTPKFDTRVVIDRSAD 67
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
G + I+ ++ P +SC++ ++D D+ G ++ ++KR +D P+ Q E +
Sbjct: 68 GEMMRINFNVSFPALSCEFASVDVGDAMGLNRFNLTKTVFKRAIDAKLNPLGPIQWERGH 127
Query: 125 AVKKKKVTTENGTTTTELE 143
+K+ ++ T ++
Sbjct: 128 ENRKEPEHADDAATAVAIK 146
>gi|302841900|ref|XP_002952494.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
nagariensis]
gi|300262133|gb|EFJ46341.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
nagariensis]
Length = 478
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 54/101 (53%), Gaps = 1/101 (0%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD-SSR 64
+LK +D F K D E T+ G ++I+ + + +L ++ + +TT +L VD S +
Sbjct: 7 KLKAIDFFKKIPSDLTEATLTGAWISILAAVIMVFLFTAEMMSFLSTTTTTQLIVDRSPQ 66
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
L ++ +I P +SC++ +D D+ G + +++ + K
Sbjct: 67 NELLKLNFNISFPALSCEFATVDVSDTLGTKRMNLTKTVRK 107
Score = 44.7 bits (104), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 43/195 (22%), Positives = 79/195 (40%), Gaps = 30/195 (15%)
Query: 202 TEGCQIYGYLEVNRVSGSFHI---APGLSYS---INHVH-VHDIQPYTSAAFNTTHHIRH 254
T GC + G++ V +V G+ + + G S+ +N H VH T + ++
Sbjct: 287 TPGCNLAGFVMVKKVPGTLTVVARSEGHSFDHTWMNMTHLVHTFHVGTRPSPRKYQQLKR 346
Query: 255 LSFGIKLQDD----DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDG- 309
L + + D E+R+ + E S +Y++I+ T E G D
Sbjct: 347 LHPAGEGEGDLFWWREKRE------KRGEHPQSTHEHYLQIVLTSIEPRRSRHSGNYDAY 400
Query: 310 ------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
+P F+Y+LSP+ + + E ++ T I G + ++DA
Sbjct: 401 EYTAHSHTYQSDAIPSARFTYDLSPIQILVQETARPWYQFLTTSCAIIGGVFTVAGILDA 460
Query: 358 LLHSCVKKISKVEIG 372
LL+ K + K+ +G
Sbjct: 461 LLYQSFKVVKKLNLG 475
>gi|442614645|ref|NP_001259099.1| CG4293, isoform E [Drosophila melanogaster]
gi|440216271|gb|AGB94945.1| CG4293, isoform E [Drosophila melanogaster]
Length = 439
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 62/138 (44%), Gaps = 16/138 (11%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERL------------DGSKLGGGDG 309
+PL+G E A+ Y++K++PT I++ + KL
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTIYAFQYAVTENVRKLERNSY 314
Query: 310 GMPGIFFSYELSPLMVKI 327
G PGI+F Y+ S L + +
Sbjct: 315 GSPGIYFKYDWSALKIIV 332
>gi|115472445|ref|NP_001059821.1| Os07g0524100 [Oryza sativa Japonica Group]
gi|75118816|sp|Q69SA9.1|PDI54_ORYSJ RecName: Full=Protein disulfide isomerase-like 5-4;
Short=OsPDIL5-4; AltName: Full=Protein disulfide
isomerase-like 8-1; Short=OsPDIL8-1; Flags: Precursor
gi|50508559|dbj|BAD30858.1| thioredoxin family-like protein [Oryza sativa Japonica Group]
gi|113611357|dbj|BAF21735.1| Os07g0524100 [Oryza sativa Japonica Group]
gi|215704615|dbj|BAG94243.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218199742|gb|EEC82169.1| hypothetical protein OsI_26259 [Oryza sativa Indica Group]
gi|222637167|gb|EEE67299.1| hypothetical protein OsJ_24505 [Oryza sativa Japonica Group]
Length = 485
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 85/202 (42%), Gaps = 40/202 (19%)
Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
T GC+I G++ V +V GS I+ + S +H + + N +H++ SFG +
Sbjct: 291 LTSGCRIEGFVRVKKVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKR 340
Query: 261 LQ----DDDERRKPLDGTVAKAEEGAS------------MFNYYIKIIPTIYERLDGSKL 304
L ++ +R P G G S +Y++I+ T L SK
Sbjct: 341 LSAKMFNELKRLTPYVGGHHDRLAGQSYIVKHGDVNANVTIEHYLQIVKTELVTLRSSKE 400
Query: 305 GG--------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
+P + F +E SP+ V +TE KS H T + I G +
Sbjct: 401 LKLVEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVFT 460
Query: 351 TFMLVDALLHSCVKKISKVEIG 372
++D++ H+ ++ + KVE+G
Sbjct: 461 VAGILDSIFHNTLRLVKKVELG 482
>gi|410046954|ref|XP_003952285.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Pan troglodytes]
Length = 333
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 75/363 (20%), Positives = 133/363 (36%), Gaps = 86/363 (23%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 22 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 82 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ + L++ + K A++ ALP
Sbjct: 140 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 166
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
E + + C+I+G+L VN+V+G+FHI + + Y
Sbjct: 167 ------PREDDSSQSPDACRIHGHLYVNKVAGNFHITVD----------NQMFQYFITVV 210
Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
T H +S ER + ++ A G S
Sbjct: 211 PTKLHTYKISADTHQFSVTERERIINH--AAGSHGVS----------------------- 245
Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
GIF Y+LS LMV +TE+ + ++ + G + T +LH K I
Sbjct: 246 ------GIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST----TGMLHGIGKFI 295
Query: 367 SKV 369
++
Sbjct: 296 VEI 298
>gi|431918151|gb|ELK17379.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Pteropus alecto]
Length = 313
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 66/155 (42%), Gaps = 29/155 (18%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
GC+ G +N+V G+FH++ H QP + TH I LSFG LQ
Sbjct: 144 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 191
Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
+ L G + +Y +KI+PT+YE G + +
Sbjct: 192 RNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 251
Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
G +P I+F Y+LSP+ VK TE+ + L T +
Sbjct: 252 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTV 286
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 55/108 (50%), Gaps = 7/108 (6%)
Query: 9 GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRG 65
G D + K +D + T G ++I C +FI +L ++ + EL+VD G
Sbjct: 37 GFDIYRKVPKDLTQPTYTGAIISICCCVFILFLFLSELTGFLTTEVVNELYVDDPDKDSG 96
Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNIYKRRLDLDG 112
K+ + L+I +P + C+ + LD D G + H+++++ ++ L+G
Sbjct: 97 GKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM---KIPLNG 141
>gi|145347301|ref|XP_001418112.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578340|gb|ABO96405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 534
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 66/137 (48%), Gaps = 5/137 (3%)
Query: 7 LKGLDAFT-KPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
L+ LD + P E F E+TV GG TIV L L + V F + ++ VD +
Sbjct: 27 LRRLDMYAHAPPEISGFTERTVGGGLFTIVVSLIFIALFTMQVSALFAATYVTDIVVDHT 86
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRLDLDGKPIQEPQKEV 122
+KL +++ + P + C++L LD VD+ G + ++ N+YK L K +
Sbjct: 87 ADAKLRVNVRVDFPFVECEFLHLDVVDAIGSRKTNISGENVYKHPLSGPMKYMNIQHAAP 146
Query: 123 VNAVKKKKVTTENGTTT 139
VNA + E GTTT
Sbjct: 147 VNA-ETLDDAFEYGTTT 162
Score = 45.1 bits (105), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 38/128 (29%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
T GC + G VNRV G+F+ P S+S+ A + TH +RHLSFG
Sbjct: 349 TPGCSVNGQFNVNRVPGAFYFVPRSRSHSL-------------ADVDMTHVVRHLSFGEH 395
Query: 261 LQDDD-----ERRK-----PLD--GTVAKAEEGA------------SMFNYYIKIIPTIY 296
+ RK P+D G AK + G + F +Y+K+IP +
Sbjct: 396 VPGKPSFIPRHLRKAWSLIPVDMGGRFAKKDNGGGGAQFDARENRRTAFEHYMKVIPRTF 455
Query: 297 ERLDGSKL 304
+DG+ +
Sbjct: 456 APIDGAPI 463
>gi|385302753|gb|EIF46868.1| putative copii secretory vesicle component [Dekkera bruxellensis
AWRI1499]
Length = 203
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 205 CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDD 264
C+I+G L VNRV GS +I G + + QP T N TH I SFG
Sbjct: 81 CRIFGTLPVNRVRGSLYIT-GKGFGSTFLRS---QPQT---LNFTHQITEFSFGDFYPFF 133
Query: 265 DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
D PLD T EE A F Y + +IPT YE+L
Sbjct: 134 D---NPLDMTYQVTEENAHTFQYKLSVIPTQYEKL 165
>gi|195402035|ref|XP_002059616.1| GJ14724 [Drosophila virilis]
gi|194147323|gb|EDW63038.1| GJ14724 [Drosophila virilis]
Length = 434
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 60/138 (43%), Gaps = 15/138 (10%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + + N TH I LSFG Q
Sbjct: 200 DACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEFRRMPANFTHRINRLSFG---Q 256
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------IYERLDGSKLGGGDGG 310
+PL+G E ++ Y++K++PT Y + G
Sbjct: 257 YSRRIVQPLEGDETIIHEESTTVQYFLKVVPTEIQHTFSTISTFQYAVTENVHSERNSYG 316
Query: 311 MPGIFFSYELSPLMVKIT 328
PGI+F Y+ S L + ++
Sbjct: 317 SPGIYFKYDWSALKIVVS 334
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 1/91 (1%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
E K LDAF K E + E T GG ++++ L I YL+ ++ Y+ + + D +
Sbjct: 16 EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYWNETEIIYQFEPDMA 75
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
++ +HLDI V +D +D + +
Sbjct: 76 LDEQVQMHLDITVAMPCASLSGVDLMDETQQ 106
>gi|195042004|ref|XP_001991346.1| GH12601 [Drosophila grimshawi]
gi|193901104|gb|EDV99970.1| GH12601 [Drosophila grimshawi]
Length = 434
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 24/144 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + + N TH I LSFG Q
Sbjct: 194 DACRLHGTLGINKVAGVLHLVGGAQPVVGMFDDHWMIEFRRMPANFTHRINRLSFG---Q 250
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------------------RLDGSKL 304
+PL+G E A+ Y+IK++PT + +LD +
Sbjct: 251 YSRRIVQPLEGDETTITEEATTVQYFIKVVPTEIQQTFSTVSTFQYAVTENVRKLDSER- 309
Query: 305 GGGDGGMPGIFFSYELSPLMVKIT 328
G PGI+F Y+ S L V I+
Sbjct: 310 --NSYGSPGIYFKYDWSALKVVIS 331
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 45/91 (49%), Gaps = 1/91 (1%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
E K LDAF K E + E T GG ++++ L I YL+ ++ Y+ + + + D S
Sbjct: 16 EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYWSETNIIYQFEPDMS 75
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
++ +H+DI V +D +D + +
Sbjct: 76 LDEQVQMHVDITVAMPCASLSGVDLMDETQQ 106
>gi|119616999|gb|EAW96593.1| ERGIC and golgi 2, isoform CRA_b [Homo sapiens]
Length = 215
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 96/230 (41%), Gaps = 43/230 (18%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E + GG V+++ + ++ L ++ Y E VD S
Sbjct: 13 VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
KL I++DI V + C Y+ D +D + + +Y+ + D P Q+ + ++ +
Sbjct: 73 KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL-PELDTI 185
+ + L++ + K A++ AL P D
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
Q N C+I+G+L VN+V+G+FHI G + + +H+
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGQFHILVVMHI 200
>gi|198468706|ref|XP_001354796.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
gi|198146533|gb|EAL31851.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
Length = 445
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 66/150 (44%), Gaps = 24/150 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 203 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG---Q 259
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
+PL+G + E A+ Y++K++PT I++ +LD +
Sbjct: 260 YSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQTFTTINTFQYAVTENVRKLDSER- 318
Query: 305 GGGDGGMPGIFFSYELSPLMVKITEKSKSL 334
G PGI+F Y+ S L + ++ L
Sbjct: 319 --NSYGSPGIYFKYDWSALKIVVSNDRDHL 346
>gi|194768867|ref|XP_001966532.1| GF22223 [Drosophila ananassae]
gi|190617296|gb|EDV32820.1| GF22223 [Drosophila ananassae]
Length = 448
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 77/181 (42%), Gaps = 18/181 (9%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 204 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 260
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
+PL+G +E A+ Y++K++PT + E +
Sbjct: 261 YSRRIVQPLEGDETIIQEEATTVQYFLKVVPTEIRQTFSTINTFQYSVTENVRKLDSERN 320
Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
G PGI+F Y+ S L + + L ++ ISG + +++LL + +++
Sbjct: 321 SYGSPGIYFKYDWSALKIVVDNDRDHLATFVIRLCSIISGIIVISGAINSLLIAIQRRLL 380
Query: 368 K 368
+
Sbjct: 381 R 381
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 1/91 (1%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
E K LDAF K E + E T GG ++++ L I YL+ ++ Y+ + + D S
Sbjct: 16 EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWHETDIVYQFQPDMS 75
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
++ +H+DI V +D +D + +
Sbjct: 76 LDDQVQMHVDITVAMPCASLSGVDLMDETQQ 106
>gi|195165324|ref|XP_002023489.1| GL20164 [Drosophila persimilis]
gi|194105594|gb|EDW27637.1| GL20164 [Drosophila persimilis]
Length = 445
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 66/150 (44%), Gaps = 24/150 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 203 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG---Q 259
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
+PL+G + E A+ Y++K++PT I++ +LD +
Sbjct: 260 YSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQTFTTINTFQYAVTENVRKLDSER- 318
Query: 305 GGGDGGMPGIFFSYELSPLMVKITEKSKSL 334
G PGI+F Y+ S L + ++ L
Sbjct: 319 --NSYGSPGIYFKYDWSALKIVVSNDRDHL 346
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 45/88 (51%), Gaps = 4/88 (4%)
Query: 8 KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSSRGS 66
+ LDAF K E + E T GG ++++ L I YL+ ++ Y+ + + D S
Sbjct: 19 RNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWHETDIVYQFEPDISLDE 78
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGE 94
++ +H+DI T++ +AL VD E
Sbjct: 79 QVQMHVDI---TVAMPCVALSGVDLMDE 103
>gi|195469521|ref|XP_002099686.1| GE16580 [Drosophila yakuba]
gi|194187210|gb|EDX00794.1| GE16580 [Drosophila yakuba]
Length = 430
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
+PL+G E A+ Y++K++PT I++ +LD +
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 313
Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIMV 334
>gi|365986066|ref|XP_003669865.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
gi|343768634|emb|CCD24622.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
Length = 353
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/377 (20%), Positives = 141/377 (37%), Gaps = 75/377 (19%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
LK DAF K + +K+ GG +I+ ++ I ++ + YF ++ VD
Sbjct: 5 LKVFDAFPKIEDQNKKKSTKGGITSILTYVLIIFIAWSEFGSYFGGFVDQQYIVDGMLRE 64
Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
+PI+LD+ V + C+++ ++ D + ++ + L + P P +N
Sbjct: 65 TVPINLDLYV-NVPCEWVHVNVRDQT------LDRKFASQELKFEEMPFFIPFDVRLND- 116
Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
+ VT E E + + + +TR + N K LP+ +
Sbjct: 117 NPEIVTPELDEILGE-----AIPAEFREKLDTRMFFDENNPDKS-------HLPDFN--- 161
Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
GC I+G + VN+V+G + A G Y+ D
Sbjct: 162 -----------------GCHIFGSVNVNQVAGELQVTAKGHGYA-------DYHRAPLEK 197
Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGSKL 304
N H I SFG D PLD + ++ + + Y +IP IY ++ G+++
Sbjct: 198 VNFAHVINEFSFGEFFPYID---NPLDNSAKFNMDDPLTAYVYDTSVIPMIYRKM-GAEV 253
Query: 305 GGGDGG------------------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
+PGIFF Y L + ++++ +++ +S
Sbjct: 254 DTFQYSVAEHQYKSKESSSSNSFRVPGIFFQYNFENLSIVVSDRRLGFIQFIVRLVAILS 313
Query: 347 -GTYIT---FMLVDALL 359
YI F+L D +
Sbjct: 314 FAVYIASWLFILADMFI 330
>gi|18921097|ref|NP_569847.1| CG4293, isoform A [Drosophila melanogaster]
gi|24638890|ref|NP_726677.1| CG4293, isoform B [Drosophila melanogaster]
gi|85724768|ref|NP_001033816.1| CG4293, isoform D [Drosophila melanogaster]
gi|85724770|ref|NP_001033817.1| CG4293, isoform C [Drosophila melanogaster]
gi|2961397|emb|CAA18090.1| EG:65F1.1 [Drosophila melanogaster]
gi|7290051|gb|AAF45518.1| CG4293, isoform A [Drosophila melanogaster]
gi|7290052|gb|AAF45519.1| CG4293, isoform B [Drosophila melanogaster]
gi|15292011|gb|AAK93274.1| LD35174p [Drosophila melanogaster]
gi|84798360|gb|ABC67159.1| CG4293, isoform C [Drosophila melanogaster]
gi|84798361|gb|ABC67160.1| CG4293, isoform D [Drosophila melanogaster]
gi|220955778|gb|ACL90432.1| CG4293-PA [synthetic construct]
Length = 441
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 62/143 (43%), Gaps = 24/143 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------IY----------ERLDGSKL 304
+PL+G E A+ Y++K++PT IY +LD +
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTIYAFQYAVTENVRKLDSER- 313
Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIIV 334
>gi|195629654|gb|ACG36468.1| hypothetical protein [Zea mays]
Length = 76
Score = 53.5 bits (127), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 44/70 (62%)
Query: 3 FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
F +RLK LDA+ K EDF++ T++GG VT+V + + L + YF +T +L VD+
Sbjct: 4 FLQRLKRLDAYPKVNEDFYKWTLFGGIVTLVAAVVMLLLFISETRSYFYSATETKLVVDT 63
Query: 63 SRGSKLPIHL 72
SR +L +++
Sbjct: 64 SRRERLRVNV 73
>gi|303275141|ref|XP_003056869.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461221|gb|EEH58514.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 604
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 88/210 (41%), Gaps = 54/210 (25%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
T GC I G + VNRV G+F Y H H+I N TH +RHLSFG +
Sbjct: 400 TSGCIIEGSVRVNRVPGAF-------YVTAHSKGHNIN---VDVVNMTHVLRHLSFGKTV 449
Query: 262 QDDDE------RR------KPLDG--TVAKAEEGAS------MFNYYIKIIPTIYERLDG 301
RR K + G VA AEE + + +Y+K++ +E +DG
Sbjct: 450 PGRPSYVPRHMRRVWSKIPKDMGGRFAVAGAEETFASAEPYTVHEHYLKVVSHAFEPIDG 509
Query: 302 S--------------KLGGGDGG--------MPGIFFSYELSPLMVKITEKSKSLGHLWT 339
KL G P I FSY++SP+ V + E++K + WT
Sbjct: 510 DAVQLYEYTFNSNRFKLAPAAYGDEDDAHVDGPMIKFSYDVSPMRVVLREETKPVLD-WT 568
Query: 340 KIMCNI-SGTYITFMLVDALLHSCVKKISK 368
MC + G Y L++A + + V + +
Sbjct: 569 LGMCALMGGVYTCSGLLEAFISNGVSVVKR 598
>gi|194911936|ref|XP_001982403.1| GG12755 [Drosophila erecta]
gi|190648079|gb|EDV45372.1| GG12755 [Drosophila erecta]
Length = 441
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
+PL+G E A+ Y++K++PT I++ +LD +
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTIQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 313
Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIVV 334
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 1/91 (1%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
E K LDAF K E + E T GG ++++ L I YL+ ++ Y+ + + + D +
Sbjct: 16 EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELYYYWHETAIVYQFEPDIA 75
Query: 64 RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
++ +H+DI V +D +D + +
Sbjct: 76 LDEQVQMHVDITVAMPCASLSGVDLMDETQQ 106
>gi|444732203|gb|ELW72509.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Tupaia chinensis]
Length = 250
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 18/112 (16%)
Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
H H ++N +H I HLSFG + PLDGT A + MF Y+I ++P
Sbjct: 114 HAHLAALVNHDSYNFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVP 170
Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
T + ER G G+ GIF Y+LS LMV +TE+
Sbjct: 171 TKLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEE 222
>gi|195564437|ref|XP_002105825.1| GD16474 [Drosophila simulans]
gi|194203186|gb|EDX16762.1| GD16474 [Drosophila simulans]
Length = 441
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
+PL+G E A+ Y++K++PT I++ +LD +
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 313
Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIMV 334
>gi|12006037|gb|AAG44724.1|AF267855_1 HT034 [Homo sapiens]
Length = 199
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 64/138 (46%), Gaps = 17/138 (12%)
Query: 252 IRHLSFG--IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK------ 303
I LSFG +++Q+ L G + +Y +KI+PT+YE G +
Sbjct: 59 IHKLSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQY 118
Query: 304 -------LGGGDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
+ G +P I+F Y+LSP+ VK TE+ + L T I I GT+ +
Sbjct: 119 TVANKEYVAYSHTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGI 178
Query: 355 VDALLHSCVKKISKVEIG 372
+D+ + + + K+++G
Sbjct: 179 LDSCIFTASEAWKKIQLG 196
>gi|195347402|ref|XP_002040242.1| GM19035 [Drosophila sechellia]
gi|194121670|gb|EDW43713.1| GM19035 [Drosophila sechellia]
Length = 437
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
+ C+++G L +N+V+G H+ G + H + N TH I LSFG Q
Sbjct: 194 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 250
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
+PL+G E A+ Y++K++PT I++ +LD +
Sbjct: 251 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 309
Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
G PGI+F Y+ S L + +
Sbjct: 310 --NSYGSPGIYFKYDWSALKIMV 330
>gi|378726952|gb|EHY53411.1| hypothetical protein HMPREF1120_01605 [Exophiala dermatitidis
NIH/UT8656]
Length = 326
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 48/96 (50%), Gaps = 9/96 (9%)
Query: 203 EGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
+ C+IYG LE N+V G FHI A G Y + H + FN +HHI LSFG
Sbjct: 86 DSCRIYGSLEGNKVQGDFHITARGHGYMEFGMQQH----LDHSRFNFSHHINELSFGPHY 141
Query: 262 QDDDERRKPLDGTVAKAEEGASM-FNYYIKIIPTIY 296
PLD T A + M + YY+ I+PTI+
Sbjct: 142 PG---LLNPLDKTSAVTTDVHFMRYQYYLSIVPTIF 174
>gi|412994089|emb|CCO14600.1| predicted protein [Bathycoccus prasinos]
Length = 528
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/110 (24%), Positives = 57/110 (51%), Gaps = 1/110 (0%)
Query: 6 RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-R 64
+ KG+D + K D + T G ++I+ I +L+ + Y + + ++ VD S
Sbjct: 6 KAKGMDFYRKIPRDMTQGTYLGTILSILATSLIVFLLIAETRAYLKTTFETKVVVDRSVD 65
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
G L I+ ++ P +SC++ ++D D+ G ++ ++KR +D + +P
Sbjct: 66 GELLRINFNVSFPALSCEFASVDVGDALGLTRYNLTKTVFKRPIDGNFRP 115
>gi|119928709|ref|XP_001256294.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Bos taurus]
Length = 144
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 56/118 (47%), Gaps = 15/118 (12%)
Query: 270 PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGDGG--MPGI 314
P +V + + +Y +KI+PT+YE G + + G +P I
Sbjct: 24 PTPASVRRTFRALASHDYILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSHTGRIIPAI 83
Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+F Y+LSP+ VK TE+ + L T I I GT+ ++D+ + + + K+++G
Sbjct: 84 WFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAWKKIQLG 141
>gi|157872987|ref|XP_001685013.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128084|emb|CAJ08215.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 341
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 52/245 (21%), Positives = 94/245 (38%), Gaps = 33/245 (13%)
Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
C+ TET + + + +P + Y + ++ + +GC + G
Sbjct: 90 CHRIATETVSVFAHDEQTERDTHVSLYHIPYGSYVSNSSAAYISGEVLSGTEDGCLVTGT 149
Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD---DDER 267
+ SF+I + D + S + I H S G D R
Sbjct: 150 APIAAKPSSFNII-----------LKDYRVEDSRKYRPDFQIHHFSGGNAYDDWGVPQVR 198
Query: 268 RK---PLDG-TVAKAEEGASMFNYYIKIIPTIYERLDG--SKLG------------GGDG 309
R+ P+ G A+A +G F +++++IPT + L G S+ G G G
Sbjct: 199 RQTLEPMSGLKSARALQGPYFFQFFLQLIPTTVD-LAGKDSRFGYQYTAFHSMLRYNGHG 257
Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
PG++FSY+LSP + + ++ H + + G Y +V+A L +K
Sbjct: 258 RAPGLYFSYKLSPFSMDCAVQYDTMSHFVVNLCAVVGGVYTVAEMVEAGLEWLARKRRLR 317
Query: 370 EIGGK 374
E+ +
Sbjct: 318 EVSAR 322
>gi|388497088|gb|AFK36610.1| unknown [Medicago truncatula]
Length = 457
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 67/174 (38%), Gaps = 36/174 (20%)
Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
T GC++ GY+ V +V GS ++ D + ++ N +H I HLSFG K+
Sbjct: 288 TGGCRVEGYVRVKKVPGSLVVSAR----------SDAHSFDASQMNMSHVINHLSFGKKV 337
Query: 262 QD----DDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL-- 304
D + P G G S N +YI+++ T G KL
Sbjct: 338 TPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKLIE 397
Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
+P F ELSP+ V ITE KS H T + I G +
Sbjct: 398 EYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGCF 451
>gi|412989304|emb|CCO15895.1| predicted protein [Bathycoccus prasinos]
Length = 674
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 56/109 (51%), Gaps = 4/109 (3%)
Query: 4 SERLKGLDAFTK--PYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
++ LK +D F + E+F E + GG +T++ FI L+ V F S +L V
Sbjct: 50 TQTLKTVDVFKRNDALEEFSKEGSNKGGVLTLLFAWFIFGLVTSQVQKLFATSMRTDLSV 109
Query: 61 DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRL 108
D L + D+ P I+C++L++D VD+ G + ++ +IYK +
Sbjct: 110 DHDMDPTLVMQFDVSFPAINCEHLSVDLVDAVGHRAFNLSGESIYKHSM 158
>gi|323445875|gb|EGB02274.1| hypothetical protein AURANDRAFT_69033 [Aureococcus anophagefferens]
Length = 329
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/103 (24%), Positives = 53/103 (51%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
+D + K ++ E + GG +++ ++ + ++ + + ++ VD+ GS+L
Sbjct: 1 MDFYRKVPDELKEASRTGGLLSLCACGVVALTLVTEIGAFLRTEVRTKIDVDTFAGSQLR 60
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
++ ++ P + CDY ++D D G +V NI K +LD DG
Sbjct: 61 VNFNLSFPHLHCDYASVDLWDKIGRNQANVTQNIEKWQLDEDG 103
Score = 44.7 bits (104), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 10/70 (14%)
Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
NT GC + G+L VNRV G+FH+ +++S +H + N +H + HLSFG
Sbjct: 265 NTDHPGCLVSGFLLVNRVPGNFHV---MAHSRHH-------SLNTLRTNLSHTVHHLSFG 314
Query: 259 IKLQDDDERR 268
+ L D R+
Sbjct: 315 VPLTDAQHRK 324
>gi|323449341|gb|EGB05230.1| hypothetical protein AURANDRAFT_72293 [Aureococcus anophagefferens]
Length = 221
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 22/105 (20%)
Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
GC + G++ VNRV G+FHI A L +++N +A N +H + HLSFG L
Sbjct: 128 GCMVSGHVLVNRVPGNFHIEARSLHHNLN-----------AAMTNLSHIVNHLSFGTPLA 176
Query: 263 DDDERR----------KPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
D +R+ PLDG + ++Y K++ T +E
Sbjct: 177 RDLQRKVSKYPQFQSAHPLDGGSFINRDYHQAHHHYSKVVSTHFE 221
>gi|159483443|ref|XP_001699770.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
gi|158281712|gb|EDP07466.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
Length = 474
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 77/201 (38%), Gaps = 39/201 (19%)
Query: 202 TEGCQIYGYLEVNRVSGSFH-IAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
T GC + G++ V +V G+ H +A +S +H + N TH I G +
Sbjct: 284 TPGCNLAGFVMVKKVPGTVHFVARSEGHSFDHTWM-----------NMTHMIHSFHVGTR 332
Query: 261 LQDDD----ERRKP----------LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
+R P L + +E S +Y++++ T E G
Sbjct: 333 PSPRKYQQLKRLHPAGLTADWADKLHDQLFVSEHTQSTHEHYLQVVLTTIEPRHSRHTGN 392
Query: 307 GDG-------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
D +P F+Y+LSP+ + + E SK T I G +
Sbjct: 393 YDAYEYTAHSHSYQSDSIPSARFTYDLSPIQILVHETSKPWYQFLTTSCAIIGGVFTVAG 452
Query: 354 LVDALLHSCVKKISKVEIGGK 374
++DALL+ K + K+ +G +
Sbjct: 453 ILDALLYQSFKVVKKLNLGKQ 473
>gi|428175103|gb|EKX43995.1| hypothetical protein GUITHDRAFT_159761 [Guillardia theta CCMP2712]
Length = 475
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 7/117 (5%)
Query: 3 FSERLKGLDAFTKPYEDFH----EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEEL 58
F + LK +D + K D E +V G A++I+ + + L+ ++ Y V + +
Sbjct: 4 FLQGLKSVDFYRKLKRDLQQELTEASVSGAALSIIAAVIMIGLVAAELTAYLTVQSESRV 63
Query: 59 FVD---SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
+D SS L ++ + P + CDY ++DA + G + + K RLD +G
Sbjct: 64 VLDHFESSSDDTLQVNFNFTFPHLKCDYASVDATNFMGTHDAGLAARVSKIRLDKNG 120
>gi|388493200|gb|AFK34666.1| unknown [Medicago truncatula]
Length = 106
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 50/106 (47%), Gaps = 20/106 (18%)
Query: 284 MFNYYIKIIPTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKI 327
M Y+IK++PT+Y E S+LG +PG+FF Y++SP+ V
Sbjct: 1 MCQYFIKVVPTVYTDIRGRVIHSNQYSVTEHFKSSELGAA---VPGVFFFYDISPIKVNF 57
Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
E+ H T I I G + +VD+ ++ K I K+EIG
Sbjct: 58 KEEHIPFLHFLTNICAIIGGIFTIAGIVDSSIYYGQKTIKKKMEIG 103
>gi|32566449|ref|NP_510494.2| Protein C18B12.6 [Caenorhabditis elegans]
gi|25809204|emb|CAA20929.2| Protein C18B12.6 [Caenorhabditis elegans]
Length = 428
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 71/167 (42%), Gaps = 23/167 (13%)
Query: 215 RVSGSFHIAPGLSYSI-----NHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRK 269
R+ G F + G I N + + D Q S N +H I +FG ++
Sbjct: 227 RLHGKFKVRKGKEEKIVMSISNPMMMFDHQEKQSG--NISHRIEKFNFGPRIPG---LVT 281
Query: 270 PLDGTVAKAEEGASMFNYYIKIIPT-IYERLD------------GSKLGGGDGGMPGIFF 316
PL G +E G ++ Y+IKI+PT IY +L G+ GI F
Sbjct: 282 PLAGAEHISESGQDIYRYFIKIVPTKIYGYFSYTMAYQYSVTFLKKQLKEGEHSHGGILF 341
Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
YE + ++++ + S +L +I + G Y T +V+ +L C+
Sbjct: 342 EYEFTANVIEVHKTSITLISYLIRICSILGGVYATSTIVNNILQFCL 388
>gi|444706692|gb|ELW48018.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Tupaia chinensis]
Length = 821
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 15/102 (14%)
Query: 286 NYYIKIIPTIYERLDGSK-------------LGGGDGG--MPGIFFSYELSPLMVKITEK 330
+Y +KI+PT+YE G + + G +P I+F Y+LSP+ VK TE+
Sbjct: 717 DYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSHTGRIIPAIWFRYDLSPITVKYTER 776
Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ L T I I GT+ ++D+ + + + KV++G
Sbjct: 777 RQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAWKKVQLG 818
>gi|390370794|ref|XP_001186477.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like, partial [Strongylocentrotus purpuratus]
Length = 221
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 13/113 (11%)
Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
+T+K+ GC Y +N+V G+FH++ H + + + H I
Sbjct: 101 NTKKIPLNNGLGCLFYSAFTINKVPGNFHVS-----------THAVGMNQPQSTDFAHII 149
Query: 253 RHLSFGIKLQDD--DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
+SFG +Q+ PL+G + + +YY+KI+PT+YE L G+K
Sbjct: 150 HEVSFGDDIQNKTLGASFNPLEGRDKRDSKSDLSHDYYMKIVPTVYEDLWGTK 202
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 48/96 (50%), Gaps = 3/96 (3%)
Query: 1 MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
MVF R LD + K +D + T G V+++ LFI++L+ + + + EL+V
Sbjct: 1 MVFDFR--RLDVYRKIPKDLTQPTYAGACVSLLSMLFITFLLLSEFMSFIRPEVVSELYV 58
Query: 61 DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQ 95
D+ +L + +++ +P + C + LD D G
Sbjct: 59 DNPGEIERLTVRVNLSLPKLHCGVVGLDIQDDMGRH 94
>gi|50545267|ref|XP_500171.1| YALI0A17600p [Yarrowia lipolytica]
gi|49646036|emb|CAG84103.1| YALI0A17600p [Yarrowia lipolytica CLIB122]
Length = 337
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 62/158 (39%), Gaps = 25/158 (15%)
Query: 205 CQIYGYLEVNRVSGSFHI--APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
C+I G + +N V G+ I P Y IN + D N TH I LSFG
Sbjct: 151 CRISGSVPINHVEGALQIFNLPDNQYFINPMKASD-------GLNLTHAIHELSFGDYF- 202
Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE-------------RLDGSKLGGGDG 309
+ PLDG +E + Y++ +P Y + + L
Sbjct: 203 --PKVLNPLDGVSTVTDEPLMSYQYFLSAVPVEYSSGRKKIHTYQYAVKKQTTNLQEHFV 260
Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
P IFF Y+ P+ +KI + ++L K++ + G
Sbjct: 261 TRPAIFFHYKYEPVTLKIQDSRETLTVFVVKLLSILGG 298
>gi|339244785|ref|XP_003378318.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Trichinella spiralis]
gi|316972786|gb|EFV56437.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Trichinella spiralis]
Length = 334
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 285 FNYYIKIIPTIYERLDGS---------------KLGGGDGGMPGIFFSYELSPLMVKITE 329
++Y +KI+PT+YE + G+ ++ P ++F Y+ +P+ VK E
Sbjct: 155 YDYILKIVPTVYENIAGNMKHAYQYTYARKTYIEMSFTGQTNPTLWFRYDFTPITVKYHE 214
Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
+ + L T I I GT+ L+D+ + + KVE+G
Sbjct: 215 RRQPLYIFLTSICAIIGGTFTVAGLIDSFFFTASQLYKKVELG 257
>gi|145544034|ref|XP_001457702.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425520|emb|CAK90305.1| unnamed protein product [Paramecium tetraurelia]
Length = 463
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 3/98 (3%)
Query: 93 GEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPN-KCGS 150
G+ L V+ I R R++LD IQ P + + ++ T + PN S
Sbjct: 44 GKISLLVDSTIDSRIRVNLDAT-IQAPCQALFQHIRYDGFLFIRSTFEEAIFKPNVNFTS 102
Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
CYGAE + C +C +V A+ ++W P ++IVQC
Sbjct: 103 CYGAELIVDQRCYSCQDVMMAFAQRRWTQPNFESIVQC 140
>gi|223646904|gb|ACN10210.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
gi|223672767|gb|ACN12565.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
Length = 238
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
C+I+G+L VN+V+G+FHI G + H H + +N +H I HLSFG ++
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFG---EE 225
Query: 264 DDERRKPLDGT 274
PLDGT
Sbjct: 226 IPGIINPLDGT 236
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 7 LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
+K LDAF K E + E T GG V+++ + ++ L ++ Y E VD S
Sbjct: 14 VKELDAFPKVPESYVETTATGGTVSLIAFTAMALLAFLEFFVYRDTWMQYEYEVDKDFSS 73
Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
KL I++DI V + C ++ D +D
Sbjct: 74 KLRINIDITV-AMRCQFVGADVLD 96
>gi|312374049|gb|EFR21698.1| hypothetical protein AND_16520 [Anopheles darlingi]
Length = 252
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 5 ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
E + LDAF K E+F E T GG ++++ L I +LI +V Y D+
Sbjct: 12 EAVSQLDAFPKVKEEFVEATRVGGTLSLISRLVIIFLIYHEVTYYLDSRLVFTFKPDTDL 71
Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQ 95
SKL +H+D+ V + C + D +DS+ +
Sbjct: 72 HSKLKVHIDLTV-AMPCKSIGADILDSTNQN 101
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
+ C+I+G L +N+V+G+FHI G + ++ H+H++ I + + N +H I SFG
Sbjct: 169 DACRIHGVLTLNKVAGNFHITVGKTIHFARGHIHLNSI--FANTQTNFSHRINRFSFG 224
>gi|325185550|emb|CCA20033.1| thioredoxinlike protein putative [Albugo laibachii Nc14]
Length = 503
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 47/217 (21%), Positives = 80/217 (36%), Gaps = 42/217 (19%)
Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI---APGLSYSINHVHVHDIQPYT 242
+ N KL EGC++ G L VNRV + LS+ + +
Sbjct: 299 LNANNPEKNVKLPVGSVEGCEVSGSLNVNRVPSRLVFTARSKDLSFDLRGI--------- 349
Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRK---------PLDGTVAKAEEGASMFNYYIKIIP 293
N TH + HLSFG + + PLDG + E +++ +I
Sbjct: 350 ----NVTHVVHHLSFGQVTRKQSTKSTQLSMSFDHFPLDGKTFRTENENITVEHFLSVIG 405
Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
Y+ + S +P F++++SPL+++++ S
Sbjct: 406 VDHMEAKSKHMGLVERTYQIVARSNQYNATDMLPAALFTFDISPLVIQMSSDSTPFYRFL 465
Query: 339 TKIMCNISGTYITFM-LVDALLHSCVKKISKVEIGGK 374
T +C I G +T + VDA + + I + GK
Sbjct: 466 TS-LCAIVGGMVTIIGFVDAGAYHAMNSIKRKRQLGK 501
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/99 (24%), Positives = 46/99 (46%)
Query: 10 LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
D F K E E++ G T++ + YLI V+ Y S + +D + +L
Sbjct: 11 FDLFRKVPEHLSERSSLGTVFTVLTLVLSVYLITVNFRSYQDTSIHSIVVMDDHQEDQLR 70
Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
I+ +I + I C + ++D D G Q +++ ++ +L
Sbjct: 71 INFNISLLAIPCQFASVDVSDYIGMQLINITRHLRHFQL 109
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,192,350,207
Number of Sequences: 23463169
Number of extensions: 267442456
Number of successful extensions: 589453
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 795
Number of HSP's successfully gapped in prelim test: 284
Number of HSP's that attempted gapping in prelim test: 585261
Number of HSP's gapped (non-prelim): 1757
length of query: 379
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 235
effective length of database: 8,980,499,031
effective search space: 2110417272285
effective search space used: 2110417272285
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)