BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy4691
(380 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|340713692|ref|XP_003395373.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
terrestris]
Length = 971
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 211/416 (50%), Positives = 263/416 (63%), Gaps = 92/416 (22%)
Query: 29 SQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P + E++I N+EFHP + + Q+ +P +EF+SF AV
Sbjct: 242 SKGYIIQKKESKPTTDGKENFIYTNIEFHPFLFE---------QYADYPYKEFDSFDVAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQERDALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 293 DEYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDKQKAELISRNQA 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLETNHISLLLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ DS E KP L+D+DL +A+ NA ++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNATKYYNQKRSAAKKQQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---------- 313
I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513
Query: 314 ------------------------------GNVAWDAKVVTN----------AW------ 327
GN + K + AW
Sbjct: 514 ELIVKRYLKSGDIYVHADLTGASSVVIKNPGNDSVPPKTLAEAGTMAVAYSIAWDAKVVA 573
Query: 328 ---WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WV DQVSKTAPTGE+LTTGSFMIRGKKN+ PPCQL MG+ FLF+LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIER 629
>gi|350409527|ref|XP_003488770.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
impatiens]
Length = 971
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 213/416 (51%), Positives = 265/416 (63%), Gaps = 92/416 (22%)
Query: 29 SQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P + ED+I N+EFHP + + Q+ +P +EF+SF AV
Sbjct: 242 SKGYIIQKKESKPTADGKEDFIYTNIEFHPFLFE---------QYTNYPYKEFDSFDVAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQERDALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 293 DEYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDKQKAELISRNQT 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKVLLKEAESRGDPVASAIKQLKLETNHISLLLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ DS E KP L+D+DL +A+ NA ++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNATKYYNQKRSAAKKQQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---------- 313
I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513
Query: 314 ----------GNVAWDAK-------VVTN-----------------------AW------ 327
G++ A V+ N AW
Sbjct: 514 ELIVKRYLKSGDIYVHADLTGASSVVIKNPGSDSVPPKTLAEAGTMAVAYSIAWDAKVVA 573
Query: 328 --W-VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
W V DQVSKTAPTGE+LTTGSFMIRGKKN+ PPCQL MG+ FLF+LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIER 629
>gi|345495372|ref|XP_001603770.2| PREDICTED: nuclear export mediator factor NEMF homolog [Nasonia
vitripennis]
Length = 972
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 218/444 (49%), Positives = 274/444 (61%), Gaps = 97/444 (21%)
Query: 6 DVSSDIHKCGPTLHVLVSNLNVS----SQGYLVQKKEKRP---NSEDYIVANMEFHPMIL 58
DV+ D+ K L LN + S+GY++QKKE++P E+++ AN+EFHP +
Sbjct: 215 DVAKDVDKLYSALENAEKMLNNAKKDVSKGYIIQKKEEKPIKDGEEEFMYANIEFHPFLF 274
Query: 59 QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
+ QH +E+E+F AVDE+FST E K+DLK +QQERDALKKL+NVK+D
Sbjct: 275 E-----QCKNQH----YKEYETFDKAVDEYFSTMEGQKLDLKVLQQERDALKKLDNVKKD 325
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
H+ RL L +TQ DK+KAELI NQE VD AILA++ +ANQ+SW+DI+ +K+AQ
Sbjct: 326 HDQRLVTLGKTQEADKQKAELITRNQELVDNAILAMQSALANQMSWQDIQTLLKEAQAKG 385
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK---KPSLVD 235
DPVAS IK LKL NHIT+LL +DP+ DS + + KP VD
Sbjct: 386 DPVASAIKHLKLESNHITMLL-------------------SDPYEDSDDDEPELKPMTVD 426
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKAR 295
+DL SA++NA R++D KRSAAKKQQKTI+S KALKSAE+KTKQTLK+VQ + +INKAR
Sbjct: 427 IDLAHSAFSNATRYYDQKRSAAKKQQKTIESQGKALKSAERKTKQTLKEVQAIHSINKAR 486
Query: 296 KVYWFEKFYWFISSENYL--------------------GNVAWDAK-------VVTN--- 325
KVYWFEKFYWFI+SENYL G+V A VV N
Sbjct: 487 KVYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLRSGDVYVHADLTGASSVVVKNPNG 546
Query: 326 --------------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKN 356
AW WV +DQVSKTAPTGE+LTTGSFMIRGKKN
Sbjct: 547 GPVPPKSLAEAGTMAVAYSIAWEAKVIAGSYWVNSDQVSKTAPTGEYLTTGSFMIRGKKN 606
Query: 357 FFPPCQLAMGISFLFKLEESSISR 380
+ PPCQL MG+ FLF+LE+SSI R
Sbjct: 607 YLPPCQLIMGLGFLFRLEDSSIER 630
>gi|403277934|ref|XP_003930597.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Saimiri
boliviensis boliviensis]
Length = 1056
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 208/414 (50%), Positives = 256/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKRETKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|390469065|ref|XP_003734045.1| PREDICTED: nuclear export mediator factor NEMF isoform 2
[Callithrix jacchus]
Length = 1056
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 207/414 (50%), Positives = 256/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA +SAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|345804334|ref|XP_863447.2| PREDICTED: nuclear export mediator factor NEMF isoform 6 [Canis
lupus familiaris]
Length = 1056
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 204/414 (49%), Positives = 255/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREVKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
+ E++ + + + +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTTGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|426376842|ref|XP_004055191.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Gorilla
gorilla gorilla]
Length = 1056
Score = 369 bits (948), Expect = e-99, Method: Compositional matrix adjust.
Identities = 207/414 (50%), Positives = 256/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|397523544|ref|XP_003831789.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
paniscus]
Length = 1055
Score = 369 bits (948), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 207/414 (50%), Positives = 255/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|395838618|ref|XP_003792209.1| PREDICTED: nuclear export mediator factor NEMF [Otolemur garnettii]
Length = 1056
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 206/414 (49%), Positives = 256/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N + +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFNGKGYIIQKREIKPSLEADKPAEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVATAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + S + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVVDDVSVEKNETEPSKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|194388162|dbj|BAG65465.1| unnamed protein product [Homo sapiens]
Length = 1055
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 207/414 (50%), Positives = 255/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|332842178|ref|XP_003314363.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
troglodytes]
Length = 1055
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 207/414 (50%), Positives = 255/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDIFTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645
>gi|426233098|ref|XP_004010554.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Ovis
aries]
Length = 1055
Score = 368 bits (945), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 202/412 (49%), Positives = 254/412 (61%), Gaps = 68/412 (16%)
Query: 28 SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+ +GY++QK+E +P+ E + I+ EFHP + QH P EFESF
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413
Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
+ E++ + + + +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533
Query: 307 ISSENYL--------------------------------GNV------AWDAKVVTNAWW 328
ISSENYL G + AWDA+V+T+AWW
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSAWW 593
Query: 329 VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
V QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 594 VYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFSFLFKVDESCVWR 645
>gi|338717943|ref|XP_001496390.3| PREDICTED: nuclear export mediator factor NEMF [Equus caballus]
Length = 827
Score = 367 bits (943), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 203/414 (49%), Positives = 254/414 (61%), Gaps = 68/414 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N+ +GY++QK+E +P+ E I+ EFHP + QH P EFES
Sbjct: 12 NLKVKGYIIQKREMKPSLEVDKPTQDILTYEEFHPFLFS---------QHSQCPYIEFES 62
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 63 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELI 122
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 123 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 182
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
+ E++ + + + +P LVDVDL LSAYA
Sbjct: 183 NPYLLSEEEDDDVDGDISVEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 242
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 243 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 302
Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
WFISSENYL G + AWDA+V+T+A
Sbjct: 303 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 362
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 363 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 416
>gi|328781799|ref|XP_395865.4| PREDICTED: serologically defined colon cancer antigen 1 homolog
isoform 1 [Apis mellifera]
Length = 970
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 199/416 (47%), Positives = 256/416 (61%), Gaps = 92/416 (22%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P ++F SF AV
Sbjct: 242 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKKFASFDVAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 293 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAAKKQQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL D +
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513
Query: 322 -------VVTNAWWVKADQVSKTA-----PTG---------------------------- 341
+ T +V AD ++ P G
Sbjct: 514 ELIVKRYLKTGDIYVHADLTGASSVIIKNPGGSTVPPKTLAEAGTMAVAYSIAWDAKVVA 573
Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+LTTGSFMIRGKKN+ PPCQL MG+ FLF+LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIER 629
>gi|380024993|ref|XP_003696268.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
homolog [Apis florea]
Length = 970
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 200/416 (48%), Positives = 255/416 (61%), Gaps = 92/416 (22%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P +EF SF AV
Sbjct: 242 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 293 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQT 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAAKKQQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL D +
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513
Query: 322 -------VVTNAWWVKADQVSKTA-----PTG---------------------------- 341
+ T +V AD ++ P G
Sbjct: 514 ELIVKRYLKTGDIYVHADLTGASSVIIKNPGGGSVPPKTLAEAGTMAVAYSIAWDAKVVA 573
Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+LTTGSFMIRGKKN+ PPCQL MG+ FLF LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFXLEESSIER 629
>gi|307209071|gb|EFN86238.1| Serologically defined colon cancer antigen 1-like protein
[Harpegnathos saltator]
Length = 989
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 198/417 (47%), Positives = 257/417 (61%), Gaps = 92/417 (22%)
Query: 28 SSQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+S+GY++QKKE RP E++I AN+EFHP++ + Q++ P +EF+SF A
Sbjct: 241 ASKGYIIQKKEARPTQDGKEEFIFANIEFHPLLFE---------QYKDQPYKEFDSFDAT 291
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDE+FST E K+DLKA+QQER+ALKKLENV++DH+ RL LE+TQ VDK+KAELI NQ
Sbjct: 292 VDEYFSTMEGQKLDLKALQQEREALKKLENVRKDHDQRLITLEKTQEVDKQKAELISRNQ 351
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
VD AILAI+ +ANQ+SW DI+ +K+AQ +DPVAS IKQLKL NHI+LLL
Sbjct: 352 TLVDNAILAIQSALANQMSWPDIQVLLKEAQARSDPVASAIKQLKLETNHISLLLH---- 407
Query: 205 RSNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK 262
DP+ +S E KP ++DVDL +A+ NA++++ KRSAAKKQQK
Sbjct: 408 ---------------DPYEESDEESELKPMIIDVDLAHTAFGNARKYYSQKRSAAKKQQK 452
Query: 263 TIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWD 319
TI+S KALKSAEKKTKQTLK+VQT+ +I K RKVYWFEKFYWFI+SENYL G
Sbjct: 453 TIESHGKALKSAEKKTKQTLKEVQTIHSIIKLRKVYWFEKFYWFITSENYLVIGGRDQQQ 512
Query: 320 AKVVTNAW------WVKADQVSKTA-----PTG--------------------------- 341
+++ + +V AD ++ PTG
Sbjct: 513 NELIVKRYLRAGDLYVHADLTGASSVVIKNPTGGFVPPKSLAEAGTMAIAYSVAWDAKVV 572
Query: 342 ------------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+LTTGSFMIRGKKN+ PP QL MG+ +F+LEE+SI R
Sbjct: 573 ANAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLPPSQLIMGLGIMFRLEENSIER 629
>gi|332237024|ref|XP_003267700.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
[Nomascus leucogenys]
Length = 1058
Score = 348 bits (893), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 199/416 (47%), Positives = 250/416 (60%), Gaps = 70/416 (16%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW--FEK 302
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYW F K
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWXVFSK 531
Query: 303 FYWFISSENYLG--------------------------------------NVAWDAKVVT 324
+S EN+L + AWDA+V+T
Sbjct: 532 LLGRLSQENHLNPGGEDLQRTEVLILCIVIPGEPIPPRTLTEAGTMALCYSAAWDARVIT 591
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+AWWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 592 SAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 647
>gi|195451571|ref|XP_002072981.1| GK13887 [Drosophila willistoni]
gi|194169066|gb|EDW83967.1| GK13887 [Drosophila willistoni]
Length = 1004
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 200/430 (46%), Positives = 255/430 (59%), Gaps = 97/430 (22%)
Query: 20 VLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
+L S+GY++Q KE++P DY N+EFHP + Q + ++
Sbjct: 274 ILSDGRKGPSKGYIMQVKEEKPTDGGDVDYFFRNVEFHPFLFS---------QLKHLEVE 324
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E E+F AVDEFFS ES +ID+K + QERDALKKL N+K DH RL L + Q VDK K
Sbjct: 325 EHETFMTAVDEFFSKQESQRIDMKTLGQERDALKKLSNIKNDHAQRLEDLNKVQSVDKRK 384
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
AELI NQ VD AILA++ IA+QL W DI VK+AQ + D VA+ IKQLKL NHI+
Sbjct: 385 AELITCNQSLVDKAILAVQSAIASQLPWPDIRQLVKEAQANGDIVANSIKQLKLETNHIS 444
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFA-------DSSETKKPSLVDVDLDLSAYANAKRF 249
L+L +DP++ D E+++P +VDVDL LSA+ANA+R+
Sbjct: 445 LIL-------------------SDPYSANDSDEDDDEESEEPMIVDVDLALSAWANARRY 485
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
+DLKRSAAKK+QKT+ ++EKALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISS
Sbjct: 486 YDLKRSAAKKEQKTVDASEKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFYWFISS 545
Query: 310 ENYLGNVAWDAK---------------------------VVTN----------------- 325
ENYL DA+ ++ N
Sbjct: 546 ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIRNPNADEIPPKTLLEAGTM 605
Query: 326 ------AWWVK---------ADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW K +DQVSKTAPTGE+L TGSFMIRGKKNF P C L MG+SFL
Sbjct: 606 AISYSVAWDAKVITNAYWVTSDQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSFL 665
Query: 371 FKLEESSISR 380
FKLE+S + R
Sbjct: 666 FKLEDSFVQR 675
>gi|195151655|ref|XP_002016754.1| GL21904 [Drosophila persimilis]
gi|194111811|gb|EDW33854.1| GL21904 [Drosophila persimilis]
Length = 966
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 198/427 (46%), Positives = 259/427 (60%), Gaps = 92/427 (21%)
Query: 18 LHVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFP 74
L+++ N S+GY++ KE++P + +Y + N+EF P + Q +
Sbjct: 249 LNLIKDGNNGESKGYIIHVKEEKPIEDGKIEYFLRNIEFQPFLFA---------QFKDNE 299
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
FESF AVDEF+ST ES KID+K +QQER+ALKKL NVK+DH RL L + Q DK
Sbjct: 300 FSMFESFLEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVKKDHAKRLEELTKVQDDDK 359
Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
+KAELI +NQ VD AI A++ IA+QL+W DI VK+AQ + D VAS IKQLKL INH
Sbjct: 360 KKAELITSNQSLVDNAIRAVQSAIASQLTWPDIHELVKEAQTNGDVVASSIKQLKLEINH 419
Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSE--TKKPSLVDVDLDLSAYANAKRFFDL 252
I+L+L +DP+ +E + ++VDVDL LSA+ANA+R++DL
Sbjct: 420 ISLIL-------------------SDPYVSQNEKDCEDLTVVDVDLALSAWANARRYYDL 460
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
KRSAA+K+QKT+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSEN+
Sbjct: 461 KRSAAQKEQKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENF 520
Query: 313 LGNVAWDAK---------------------------VVTN-------------------- 325
L DA+ V+ N
Sbjct: 521 LVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVVIRNTTGEDIPPKTLVEAGSMAIS 580
Query: 326 ---AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AW WV +DQVSKTAPTGE+L TGSFMIRGKKNF P C L MG+S LFKL
Sbjct: 581 YSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKL 640
Query: 374 EESSISR 380
EES ++R
Sbjct: 641 EESFVAR 647
>gi|242018711|ref|XP_002429817.1| Serologically defined colon cancer antigen, putative [Pediculus
humanus corporis]
gi|212514835|gb|EEB17079.1| Serologically defined colon cancer antigen, putative [Pediculus
humanus corporis]
Length = 1024
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 202/427 (47%), Positives = 260/427 (60%), Gaps = 81/427 (18%)
Query: 21 LVSNLNVS-SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
+V N ++S S+GY++QK EKRP D+ V N EFHP + RQH P
Sbjct: 234 IVENASLSVSKGYIIQKVEKRPTESGMSDFHV-NTEFHPFLF---------RQHVKNPFN 283
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E E+F AVD FFS+ ES KID+KA+ QE++ALKK+ENV+RDH RL L +TQ +D+ K
Sbjct: 284 ECETFLKAVDSFFSSLESQKIDMKAINQEKEALKKIENVRRDHNQRLQQLFETQELDRIK 343
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
AELI N VD A+LAIR IANQ+SW DI+ VK+ + DPVAS IK+LKL+INHIT
Sbjct: 344 AELITTNLTLVDQAVLAIRTAIANQISWPDIDILVKEGKNAGDPVASSIKKLKLDINHIT 403
Query: 197 LLLR-VHKERSNIKDSWVLHHDNAD-PFADSSETKKPSLVDVD--LDLSAYANAKRFFDL 252
L L ++ S+ + N D P + K P ++DVD LDL+A+ANA++++D+
Sbjct: 404 LQLSDPYRSDSSSSEEEEEEETNDDKPI----KIKVPKIIDVDIDLDLTAFANARKYYDM 459
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF--------Y 304
KRSAAKKQQKTI+S +KALKSAEKKTKQ LK+++T+ NI K RK +WFEKF Y
Sbjct: 460 KRSAAKKQQKTIESQDKALKSAEKKTKQALKEMKTIVNITKVRKTFWFEKFFWFISSENY 519
Query: 305 WFISSENYLGN------------------------------------------------- 315
I+ + + N
Sbjct: 520 LVIAGRDMMQNELLVKRYMKSGDLYVHADIHGASSVIIKNPSNEPVPPKTLNEAGVMAIS 579
Query: 316 --VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AW+AKVVT+AWWV QVSKTAPTGE+L TGSFMIRGKKN+ PP L MG SFLFKL
Sbjct: 580 YSQAWEAKVVTSAWWVHNTQVSKTAPTGEYLGTGSFMIRGKKNYLPPANLIMGFSFLFKL 639
Query: 374 EESSISR 380
+++S+SR
Sbjct: 640 DDNSLSR 646
>gi|195388566|ref|XP_002052950.1| GJ23608 [Drosophila virilis]
gi|194151036|gb|EDW66470.1| GJ23608 [Drosophila virilis]
Length = 966
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 198/415 (47%), Positives = 257/415 (61%), Gaps = 81/415 (19%)
Query: 28 SSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+ +G+++Q KE++P+S D+ N+EFHP + QH+ P +E+ +F A
Sbjct: 251 NCKGFIIQIKEEKPSSTESTDHFYRNVEFHPYLFS---------QHKKMPFKEYNTFMEA 301
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDEFFST ES KID+K +QQER+ALKKL NVK DH RL L + Q +DK+KAELI NQ
Sbjct: 302 VDEFFSTQESQKIDMKTLQQEREALKKLSNVKNDHTRRLEELNKVQDLDKKKAELITCNQ 361
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
VD AILAI+ IA+QLSW DI+ VK+AQ + D VA IK+LKL INHI+LLL +
Sbjct: 362 SLVDKAILAIQSAIASQLSWPDIQELVKEAQANGDIVARSIKKLKLEINHISLLLTDPYK 421
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
N + N + AD+ ++ L+DVDL LSA+ANA R++DLKRSAA K++KTI
Sbjct: 422 CGN-------EYLNDENGADNDDS---LLIDVDLALSAWANACRYYDLKRSAALKEKKTI 471
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK--- 321
+++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKF+WF+SSENYL DA+
Sbjct: 472 DASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFFWFVSSENYLIIGGRDAQQNE 531
Query: 322 ------------------------VVTN-----------------------AWWVK---- 330
++ N AW K
Sbjct: 532 LIVKRYMRPKDVYVHADIQGASSVIIRNSTGGDIPPKTLLEAGTMAISYSVAWDAKVVTN 591
Query: 331 -----ADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+DQVSKTAPTGE+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 592 SYWVYSDQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSILFKLEDSFIQR 646
>gi|357620683|gb|EHJ72794.1| hypothetical protein KGM_20428 [Danaus plexippus]
Length = 1001
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 195/421 (46%), Positives = 257/421 (61%), Gaps = 84/421 (19%)
Query: 26 NVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
N ++GY++QK+E RPN + D+ + N EFHP++ Q++ E+E+F
Sbjct: 244 NQMAKGYIIQKREDRPNQDGGPDFFLTNQEFHPLL---------YLQNKDQVYVEYETFD 294
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLK +Q ER+A+KKL+N++ DHE RLS LE+ QL D+ AE+I
Sbjct: 295 RAVDEFYSALEGQKIDLKTIQVEREAMKKLQNIRTDHEKRLSNLEKVQLEDRRAAEMIAR 354
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVH 202
N+ V+ A LAI+ IANQ+SW+DI+ VK AQ + DPVAS IKQLKLN NHITLLL
Sbjct: 355 NEPLVEQARLAIQTAIANQMSWDDIKLLVKAAQDNKDPVASAIKQLKLNTNHITLLL--- 411
Query: 203 KERSNIKDSWVLHHDNADPFADSS-----ETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
KD + D+ D D+ E +P +VD+DL L+A+ANA+R++D KRSAA
Sbjct: 412 ------KDPYDDDDDDDDDDDDNDGGGDKERLEPMMVDIDLSLTAFANARRYYDQKRSAA 465
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---- 313
KKQQKT++S +KALKSAEKKTKQTLK+ Q +++I+KAR+ YWFEKFYWFISS+NYL
Sbjct: 466 KKQQKTLESADKALKSAEKKTKQTLKEAQAISSISKARRNYWFEKFYWFISSDNYLVIAG 525
Query: 314 ----------------------GNVAWDAKVVTN-----------------------AW- 327
+V+ + VV AW
Sbjct: 526 RDQQQNELLVKRYMRSTDVYVHADVSGASSVVIKCPSGPPPPRTLSEAGQAAVAYSVAWE 585
Query: 328 --------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
WV QVSK+APTGE+L+TGSFMIRGKKN+ P L G SF+F+LE+SSI
Sbjct: 586 AKVLTRAWWVHGHQVSKSAPTGEYLSTGSFMIRGKKNYLLPEHLQFGFSFMFRLEDSSID 645
Query: 380 R 380
R
Sbjct: 646 R 646
>gi|348544245|ref|XP_003459592.1| PREDICTED: nuclear export mediator factor Nemf-like [Oreochromis
niloticus]
Length = 1074
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 197/430 (45%), Positives = 249/430 (57%), Gaps = 85/430 (19%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N S +GY++QK EK+P+ SE+ + + EFHP + QH P EF+
Sbjct: 242 NFSGKGYIIQKTEKKPSLTPGKPSEELLTYD-EFHPFLFA---------QHAKSPYLEFD 291
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
+F AVDEFFS ES KIDLKA+QQE+ ALKKLENVK+DHE RL AL Q Q VD+ K EL
Sbjct: 292 TFDKAVDEFFSKMESQKIDLKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDRIKGEL 351
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
+ N VD A+ +R +ANQ+ W +I VK+AQ DPVA IK+LKL NHIT+LL
Sbjct: 352 VEMNLPVVDRALQVVRSALANQVDWTEIGVLVKEAQAAGDPVACAIKELKLQTNHITMLL 411
Query: 200 R----------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
+ +++ ++ + + + KP LVDVDL LSAYANAK++
Sbjct: 412 KNPYISEEDQEEEEKKEIVETKGKKNKNKEKGQNKKLQRNKPMLVDVDLGLSAYANAKKY 471
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
+D KRSA KK+QKTI++ +KA+KSAEKKT+QTLK+VQT+T I KARKVYWFEKF WFISS
Sbjct: 472 YDSKRSAEKKEQKTIEAADKAMKSAEKKTQQTLKEVQTVTTIQKARKVYWFEKFLWFISS 531
Query: 310 ENYL--------------------GNV-----------------------------AWDA 320
ENYL G++ A
Sbjct: 532 ENYLVIAGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSGNPIPPRTLTEAGTM 591
Query: 321 KVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
V +A W V QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG FL
Sbjct: 592 AVCYSAAWDAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFGFL 651
Query: 371 FKLEESSISR 380
FK+++ S+ R
Sbjct: 652 FKVDDQSVFR 661
>gi|334310399|ref|XP_001370312.2| PREDICTED: nuclear export mediator factor NEMF isoform 1
[Monodelphis domestica]
Length = 1094
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 201/433 (46%), Positives = 249/433 (57%), Gaps = 88/433 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N S +GY++QKKEK+P+ SED I+ EFHP + QH P EFE
Sbjct: 241 NFSGKGYIIQKKEKKPSLEPDKQSED-ILTYEEFHPFLFS---------QHSKCPYLEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKLEGQKIDLKALQQEKQALKKLDNVRKDHEHRLEALHQAQEIDKIKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N +VD AI +R +ANQ+ W +I VK+AQ D VA+ I++LKL NH+T+LL
Sbjct: 351 IEMNLHTVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDTVANAIRELKLQTNHVTMLL 410
Query: 200 R---VHKERSNIKDSWVLHHDNADP----------FADSSETKKPSLVDVDLDLSAYANA 246
+ + + + + + +P + KP LVDVDL LSAYANA
Sbjct: 411 KNPYLVSDEEEEDEINIEKEETEEPKGKKKKQKNKQLQKLQKNKPLLVDVDLSLSAYANA 470
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AA+K QKT+++ EKA KSAEKKTKQTLK+VQ +T I KARKVYWFEKF WF
Sbjct: 471 KKYYDHKRHAARKTQKTVEAAEKAFKSAEKKTKQTLKEVQMVTTIQKARKVYWFEKFLWF 530
Query: 307 ISSENYL--------------------GNVAWDAK-------VVTN-------------- 325
ISSENYL G+V A V+ N
Sbjct: 531 ISSENYLIIGGRDQQQNEMIVKRYLTPGDVYVHADLHGATSCVIKNPTGEPIPPRTLTEA 590
Query: 326 ---------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
AW WV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG
Sbjct: 591 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 650
Query: 368 SFLFKLEESSISR 380
SFLFK++E + R
Sbjct: 651 SFLFKVDEPCVWR 663
>gi|307173031|gb|EFN64173.1| Serologically defined colon cancer antigen 1-like protein
[Camponotus floridanus]
Length = 988
Score = 330 bits (846), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 191/416 (45%), Positives = 248/416 (59%), Gaps = 92/416 (22%)
Query: 29 SQGYLVQKKEKR---PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE + ED+I AN+EFHP + + Q++ P +EF+SF AAV
Sbjct: 242 SKGYIIQKKEAKMTQDGKEDFIFANIEFHPFLFE---------QYKNQPYKEFDSFDAAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLK +QQER+AL+KLE VK+DH+ RL LE++Q +DK+KAELI NQ
Sbjct: 293 DEYFSTMEGQKLDLKVLQQEREALQKLERVKKDHDQRLVTLEKSQELDKQKAELISRNQI 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ+SW DI+ +K+AQ DPVAS IKQLKL NHITLLL
Sbjct: 353 LVDNAILAIQSALANQMSWPDIQILLKEAQVIGDPVASAIKQLKLETNHITLLLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ DS E KP L+D+DL +A++NAK ++ K+SAA+K QKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFSNAKNYYSQKKSAARKHQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDA 320
I+S KALKSAEKKTKQTLK+VQT+ INK RK YWFEKFYWFI+SENYL G
Sbjct: 454 IESQGKALKSAEKKTKQTLKEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQN 513
Query: 321 KVVTNAW------WVKADQVSKTA-----PTGE--------------------------- 342
+++ + +V AD ++ P+G+
Sbjct: 514 ELIVKRYLKAGDLYVHADLTGASSVVIKNPSGDPVPPKSLAEAGTMAVAYSIAWDSKVIA 573
Query: 343 ------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+LTTGSFMIRGKKN+ QL MG+ +F+LEESSI R
Sbjct: 574 SAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLTQSQLIMGLGVMFRLEESSIER 629
>gi|432938285|ref|XP_004082515.1| PREDICTED: nuclear export mediator factor Nemf-like [Oryzias
latipes]
Length = 1089
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 197/431 (45%), Positives = 251/431 (58%), Gaps = 87/431 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +G+++QK EK+P+ +E+ + + EFHP + QH P E +
Sbjct: 242 NFNGKGFIIQKSEKKPSVAPGKPAEELLTYD-EFHPFLFV---------QHAKSPFLELD 291
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEFFS E KID+KA+QQE+ ALKKLENVK+DHE RL AL Q Q VD+ K EL
Sbjct: 292 SFNKAVDEFFSKMEGQKIDMKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDRLKGEL 351
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
+ N V+ A+ +R +ANQ+ W +I VK+AQ DPVA IK+LKL+ NHIT+LL
Sbjct: 352 VEINLAVVERALQVVRSALANQVDWAEIGHIVKEAQAAGDPVACAIKELKLHSNHITMLL 411
Query: 200 R------VHKERSNIKDSWVLHHDNADPFADSSETK-----KPSLVDVDLDLSAYANAKR 248
+ +E +KD+ V + D + K KP LVDVDL LSAYANAK+
Sbjct: 412 KNPYISEEEQEDEEMKDA-VEEKGKKNKNRDKGQKKKLQRNKPMLVDVDLGLSAYANAKK 470
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
++D KRSA KKQQKT+++ +KA+KSAEKKT++TLK+VQT+T I KARKVYWFEKF WFIS
Sbjct: 471 YYDHKRSAEKKQQKTLEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFIS 530
Query: 309 SENYL--------------------GNV-----------------------------AWD 319
+ENYL G++ A
Sbjct: 531 AENYLVIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNPSGDPIPPRTLTEAGT 590
Query: 320 AKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
V +A W V QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG F
Sbjct: 591 MAVCYSAAWDAKIITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFGF 650
Query: 370 LFKLEESSISR 380
LFK+EE S+ R
Sbjct: 651 LFKVEEQSVFR 661
>gi|149051344|gb|EDM03517.1| rCG61611 [Rattus norvegicus]
Length = 899
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 198/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 75 NFNGKGYIIQKREVKPSLDANKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 124
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 125 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 184
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NHIT+LL
Sbjct: 185 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLL 244
Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
R + +E D + + D P + K KP LVDVDL LSAYAN
Sbjct: 245 RNPYLLSEEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 304
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 305 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 364
Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
FISSENYL G +++ + +V AD T+ PTG
Sbjct: 365 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGDPIPPRTLTE 424
Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 425 AGTMALCYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 484
Query: 367 ISFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 485 FSFLFKVDESCVWR 498
>gi|281604208|ref|NP_001164057.1| serologically defined colon cancer antigen 1 [Rattus norvegicus]
Length = 1065
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 198/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREVKPSLDANKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NHIT+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLL 410
Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
R + +E D + + D P + K KP LVDVDL LSAYAN
Sbjct: 411 RNPYLLSEEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 470
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 471 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 530
Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
FISSENYL G +++ + +V AD T+ PTG
Sbjct: 531 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGDPIPPRTLTE 590
Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 591 AGTMALCYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 650
Query: 367 ISFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 651 FSFLFKVDESCVWR 664
>gi|148704665|gb|EDL36612.1| mCG3169, isoform CRA_a [Mus musculus]
Length = 1083
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 196/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 260 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 309
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 310 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 369
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 370 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 429
Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
R + +E D+ + + D P + K KP LVDVDL LSAYAN
Sbjct: 430 RNPYLLSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 489
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 490 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 549
Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
FISSENYL G +++ + +V AD T+ PTG
Sbjct: 550 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 609
Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 610 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 669
Query: 367 ISFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 670 FSFLFKVDESCVWR 683
>gi|32130521|ref|NP_079717.2| nuclear export mediator factor Nemf [Mus musculus]
gi|47606756|sp|Q8CCP0.2|NEMF_MOUSE RecName: Full=Nuclear export mediator factor Nemf; AltName:
Full=Serologically defined colon cancer antigen 1
homolog
Length = 1064
Score = 327 bits (837), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 196/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410
Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
R + +E D+ + + D P + K KP LVDVDL LSAYAN
Sbjct: 411 RNPYLLSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 470
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 471 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 530
Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
FISSENYL G +++ + +V AD T+ PTG
Sbjct: 531 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 590
Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 591 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 650
Query: 367 ISFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 651 FSFLFKVDESCVWR 664
>gi|157116544|ref|XP_001658543.1| hypothetical protein AaeL_AAEL007639 [Aedes aegypti]
gi|108876416|gb|EAT40641.1| AAEL007639-PA [Aedes aegypti]
Length = 995
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 193/439 (43%), Positives = 257/439 (58%), Gaps = 75/439 (17%)
Query: 5 EDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRP----NSEDYIVANMEFHPMILQP 60
ED+++ + +L + S+G+++QKKE +P E++ N+E+HP +
Sbjct: 246 EDMTALMCAINDAETMLRKAMKEPSRGFIIQKKELKPAKDKEQEEFYFTNLEYHPFLYN- 304
Query: 61 GSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
Q++ P++EF+SF AAVDEF+ST E KIDLKA QER+ALKKL NV+ DH
Sbjct: 305 --------QYKEDPVKEFDSFTAAVDEFYSTLEGQKIDLKAFAQEREALKKLSNVRTDHA 356
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
RL L + QL D++KAELI NQ VD+AILA++ +A+Q+SW DI+ VK AQ +NDP
Sbjct: 357 KRLEDLTKAQLEDRKKAELITRNQNLVDSAILAVQSALASQMSWSDIQDLVKAAQANNDP 416
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VAS IKQLKL INHI+L+L K+ D D+ + D +P +VDVDL +
Sbjct: 417 VASCIKQLKLEINHISLML---KDPYGALDEDFEDDDDEEEREDGEGKLEPMVVDVDLAM 473
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+A+ANA+R++D +R AA+K+QKTI+S+ KALK+AEKKT QTLKDV+T T I+KARKVYWF
Sbjct: 474 TAFANARRYYDQRRFAARKEQKTIESSSKALKNAEKKTMQTLKDVRTQTTISKARKVYWF 533
Query: 301 EKFYWFISSENYLGNVAWD--------------AKVVTNAWWVKADQVSKTAPTGE---- 342
EKFYWFISSENYL D + + +A A V P+GE
Sbjct: 534 EKFYWFISSENYLVIGGRDQQQNELIVKRYMRPSDIYVHAEIQGASSVIIKNPSGEDIPP 593
Query: 343 --FLTTGS---------------------------------------FMIRGKKNFFPPC 361
L G+ FMIRGKKNF PPC
Sbjct: 594 KTLLEAGTMAISYSVAWDAKVVTSAYWVKSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPC 653
Query: 362 QLAMGISFLFKLEESSISR 380
L +G+SF+FKLEESSI R
Sbjct: 654 HLVLGLSFMFKLEESSIER 672
>gi|403277932|ref|XP_003930596.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Saimiri
boliviensis boliviensis]
Length = 1077
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 196/435 (45%), Positives = 246/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKRETKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|354506443|ref|XP_003515270.1| PREDICTED: nuclear export mediator factor Nemf, partial [Cricetulus
griseus]
Length = 699
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 197/434 (45%), Positives = 248/434 (57%), Gaps = 89/434 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 70 NFHGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 119
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 120 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 179
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LL
Sbjct: 180 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLL 239
Query: 200 R----VHKERSNIKDSWVLHHD----------NADPFADSSETKKPSLVDVDLDLSAYAN 245
R + +E + D+ V D + + KP LVDVDL LSAYAN
Sbjct: 240 RNPYLLSEEEDDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 299
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 300 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 359
Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
FISSENYL G +++ + +V AD T+ PTG
Sbjct: 360 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 419
Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 420 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 479
Query: 367 ISFLFKLEESSISR 380
SFLFK++ES I R
Sbjct: 480 FSFLFKVDESCIWR 493
>gi|125858778|gb|AAI29514.1| LOC733300 protein [Xenopus laevis]
Length = 906
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 199/439 (45%), Positives = 248/439 (56%), Gaps = 103/439 (23%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +G+++QK+EK+P+ SED I N EFHP + QH E +
Sbjct: 241 NFNGKGFIIQKREKKPSLEPDKASED-IFTNEEFHPFLFA---------QHANSTYIELD 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF VDEFFS E KID+KA+QQE+ ALKKL+NV++DHE RL +L+ Q DK K EL
Sbjct: 291 SFNKTVDEFFSKLEGQKIDIKALQQEKQALKKLDNVRKDHEHRLESLQYAQDADKAKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NHIT++L
Sbjct: 351 IEMNLDIVDRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMML 410
Query: 200 RVHKERSNIKDSWVLHHDNADPFADSS-------------------ETKKPSLVDVDLDL 240
K+ +VL + ++ D + KP LVDVDL L
Sbjct: 411 ---------KNPYVLSEEESEDEEDEKEEEPKGKKKKAKNKQPKKVQKNKPVLVDVDLSL 461
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
SAYANAK+++D KR AAKK QKTI++ EKA KSAEKKTKQTLK+VQT++ I KARKVYWF
Sbjct: 462 SAYANAKKYYDHKRHAAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWF 521
Query: 301 EKFYWFISSENYL--------------------GNVAWDAK-------VVTN-------- 325
EKF WFISSENYL G+V A V+ N
Sbjct: 522 EKFLWFISSENYLIIAGRDQQQNELIVKRYLNPGDVYVHADLHGATSCVIKNPTGEPVPP 581
Query: 326 ---------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
AW WV +QVSKTAPTGE+LTTGSFMIRGKKNF PP
Sbjct: 582 RTLTEAGTMAVCYSAAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPS 641
Query: 362 QLAMGISFLFKLEESSISR 380
L MG FLFK++E+ + R
Sbjct: 642 YLMMGFGFLFKVDETCVWR 660
>gi|326921280|ref|XP_003206889.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
[Meleagris gallopavo]
Length = 1080
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 200/442 (45%), Positives = 247/442 (55%), Gaps = 105/442 (23%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
+ + +GY++QKKEK+P+ E + I EFHP + QH P EF+S
Sbjct: 253 DFNGKGYIIQKKEKKPSLEPDKPAEDIYTYEEFHPFLFS---------QHSKCPYLEFDS 303
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F A DEF+S E KIDLKA+QQE+ ALKKLENV+RDHE RL AL+Q Q VDK K ELI
Sbjct: 304 FNKAADEFYSKLEGQKIDLKALQQEKQALKKLENVRRDHEQRLEALQQAQEVDKIKGELI 363
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N E V+ AI +R +ANQ+ W +I A VK+AQ DPVA+ IK+LKL NHIT+LLR
Sbjct: 364 EMNLEIVNRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLR 423
Query: 201 VHKERSNIKDSWVL-----HHDNAD------------------PFADSSETKKPSLVDVD 237
+ +VL ++AD + KPSLVDVD
Sbjct: 424 ---------NPYVLSEEEEEGEDADLEKEETEEPKGKKKKNKNKQLKKPQKNKPSLVDVD 474
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
L LSAYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T I KARKV
Sbjct: 475 LSLSAYANAKKYYDHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKV 534
Query: 298 YWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG-- 341
YWFEKF WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 535 YWFEKFLWFISSENYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPTGEP 594
Query: 342 -------------------------------------------EFLTTGSFMIRGKKNFF 358
E+LTTGSFMIRGKKNF
Sbjct: 595 IPPRTLTEAGTMALCYSAAWDARVVTSAWWVSHNQVSKTAPTGEYLTTGSFMIRGKKNFL 654
Query: 359 PPCQLAMGISFLFKLEESSISR 380
PP L MG SFLFK++ES + R
Sbjct: 655 PPSYLMMGFSFLFKVDESCVWR 676
>gi|348572143|ref|XP_003471853.1| PREDICTED: nuclear export mediator factor NEMF-like [Cavia
porcellus]
Length = 1076
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 196/435 (45%), Positives = 246/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPAEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHETRL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHETRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETELPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEVVPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|73962860|ref|XP_851229.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Canis
lupus familiaris]
Length = 1077
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 193/435 (44%), Positives = 244/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREVKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
+ E++ + + + +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT---------- 340
WFISSENYL D + + T +V AD T+ PT
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 341 -----------------------------------GEFLTTGSFMIRGKKNFFPPCQLAM 365
GE+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|417405795|gb|JAA49597.1| Putative rna-binding protein [Desmodus rotundus]
Length = 1081
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 194/435 (44%), Positives = 243/435 (55%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREVKPSLEVDKPAEEILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+N ++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNFRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 352 EMNLPVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
++ E+S + + + +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGEINVEKSETEPPKGKKKKQKNKQLQRPQKNRPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +V+ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEVIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARIITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK+EES R
Sbjct: 652 GFSFLFKVEESCAWR 666
>gi|296214948|ref|XP_002753922.1| PREDICTED: nuclear export mediator factor NEMF isoform 1
[Callithrix jacchus]
Length = 1077
Score = 323 bits (828), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA +SAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|449504623|ref|XP_002200475.2| PREDICTED: nuclear export mediator factor Nemf [Taeniopygia
guttata]
Length = 1213
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 196/433 (45%), Positives = 243/433 (56%), Gaps = 87/433 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+EK+P+ E + I EFHP + QH P EF+S
Sbjct: 379 NFSGKGYVIQKREKKPSLEPDKPAEDIYTYEEFHPFLFS---------QHSKCPYLEFDS 429
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F A DEF+S E KIDLKA+QQE+ ALKKLENV+RDHE RL AL+Q Q DK K ELI
Sbjct: 430 FNKATDEFYSKLEGQKIDLKALQQEKQALKKLENVRRDHEHRLEALQQAQEADKLKGELI 489
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N VD AI +R +ANQ+ W +I A VK+AQ DPVA+ IK+LKL NHIT+LLR
Sbjct: 490 EMNLAVVDRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVATAIKELKLQTNHITMLLR 549
Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
E+ ++ N + KPSLVDVDL+LSAYANA
Sbjct: 550 NPYVLSEEEEEEDDADIEKEETEEPKGKKKKNKTKQLKKPQKNKPSLVDVDLNLSAYANA 609
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AAKK QKT+++ EKA KSAEKKTKQTL++VQT+T I KARKVYWFEKF WF
Sbjct: 610 KKYYDHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLREVQTVTTIQKARKVYWFEKFLWF 669
Query: 307 ISSENYLGNVAWDAK---VVTNAW------WVKADQVSKTA-----PTGE---------- 342
ISSENYL D + ++ + +V AD T+ P+GE
Sbjct: 670 ISSENYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPSGEPIPPRTLTEA 729
Query: 343 -----------------------------------FLTTGSFMIRGKKNFFPPCQLAMGI 367
+LTTGSFMIRGKKNF PP L MG
Sbjct: 730 GTMALCYSAAWDARVVTSAWWVSHSQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 789
Query: 368 SFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 790 SFLFKVDESCVWR 802
>gi|431893718|gb|ELK03539.1| Serologically defined colon cancer antigen 1 [Pteropus alecto]
Length = 1077
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 192/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
++ E+ + + + +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDINVEKIETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|301773240|ref|XP_002922036.1| PREDICTED: serologically defined colon cancer antigen 1-like
[Ailuropoda melanoleuca]
Length = 1077
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 193/435 (44%), Positives = 243/435 (55%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPTEDIFTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
+ E++ + + + +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT---------- 340
WFISSENYL D + + T +V AD T+ PT
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 341 -----------------------------------GEFLTTGSFMIRGKKNFFPPCQLAM 365
GE+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|291403822|ref|XP_002718277.1| PREDICTED: serologically defined colon cancer antigen 1
[Oryctolagus cuniculus]
Length = 1076
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 197/436 (45%), Positives = 248/436 (56%), Gaps = 91/436 (20%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N +GY++QK+E +P+ SED I+ EFHP + QH P EFE
Sbjct: 241 NFRGKGYIIQKREIKPSLEVDKPSED-ILTYEEFHPFLFS---------QHSQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL+ NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLHTNHVTMLL 410
Query: 200 R-----VHKERSNIKDSWVLHHDNADPFADSSETK-----------KPSLVDVDLDLSAY 243
R +E ++ + + +P + + KP LVDVDL LSAY
Sbjct: 411 RNPYLLSEEEDDDVDGDVTVEKNENEPLKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAY 470
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
ANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 471 ANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKF 530
Query: 304 YWFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT--------- 340
WFISSENYL D + + T +V AD T+ PT
Sbjct: 531 LWFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTL 590
Query: 341 ------------------------------------GEFLTTGSFMIRGKKNFFPPCQLA 364
GE+LTTGSFMIRGKKNF PP L
Sbjct: 591 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 650
Query: 365 MGISFLFKLEESSISR 380
MG SFLFK++ES + R
Sbjct: 651 MGFSFLFKVDESCVWR 666
>gi|281343421|gb|EFB19005.1| hypothetical protein PANDA_010972 [Ailuropoda melanoleuca]
Length = 1058
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 193/435 (44%), Positives = 243/435 (55%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I EFHP + QH P EFES
Sbjct: 222 NFSGKGYIIQKREIKPSLEVDKPTEDIFTYEEFHPFLFS---------QHSQCPYIEFES 272
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 273 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 332
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 333 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 392
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
+ E++ + + + +P LVDVDL LSAYA
Sbjct: 393 NPYLLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 452
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 453 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 512
Query: 305 WFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT---------- 340
WFISSENYL D + + T +V AD T+ PT
Sbjct: 513 WFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 572
Query: 341 -----------------------------------GEFLTTGSFMIRGKKNFFPPCQLAM 365
GE+LTTGSFMIRGKKNF PP L M
Sbjct: 573 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 632
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 633 GFSFLFKVDESCVWR 647
>gi|269849764|sp|O60524.4|NEMF_HUMAN RecName: Full=Nuclear export mediator factor NEMF; AltName:
Full=Antigen NY-CO-1; AltName: Full=Serologically
defined colon cancer antigen 1
Length = 1076
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|426376840|ref|XP_004055190.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Gorilla
gorilla gorilla]
Length = 1077
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|397523542|ref|XP_003831788.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
paniscus]
Length = 1076
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|355693257|gb|EHH27860.1| hypothetical protein EGK_18167 [Macaca mulatta]
Length = 1077
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|395745874|ref|XP_002824790.2| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
[Pongo abelii]
Length = 1061
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 220 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 270
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 271 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 330
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 331 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 390
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + KP LVDVDL LSAYA
Sbjct: 391 NPYLLSEEEDDDVDDDVNVEKNETEPPKGKKKKQKSKQLQKPQKNKPLLVDVDLSLSAYA 450
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 451 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 510
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 511 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 570
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 571 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 630
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 631 GFSFLFKVDESCVWR 645
>gi|355778566|gb|EHH63602.1| hypothetical protein EGM_16603 [Macaca fascicularis]
gi|380817886|gb|AFE80817.1| nuclear export mediator factor NEMF [Macaca mulatta]
gi|383422753|gb|AFH34590.1| nuclear export mediator factor NEMF [Macaca mulatta]
gi|384950256|gb|AFI38733.1| nuclear export mediator factor NEMF [Macaca mulatta]
Length = 1077
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|344273431|ref|XP_003408525.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor
NEMF-like [Loxodonta africana]
Length = 1000
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 193/436 (44%), Positives = 246/436 (56%), Gaps = 89/436 (20%)
Query: 25 LNVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
LN S +GY++QK+E +P+ E + I+ EFHP + QH P EFE
Sbjct: 240 LNFSGKGYIIQKREVKPSLEIDKPTEDILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I + VK+AQ DPVA+ IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGSIVKEAQAQGDPVANAIKELKLQTNHVTMLL 410
Query: 200 R----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
R + E++ + + + KP VDVDL LSAY
Sbjct: 411 RNPYLLSEEEDDDVDGDISIEKNETEPLKGKKKKQKNKQLQKPQKNKPLPVDVDLSLSAY 470
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
ANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 471 ANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKF 530
Query: 304 YWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG-------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 531 LWFISSENYLIIGGRDQQQNEIIVKRYLTAGDIYVHADLHGATSCVIKNPTGEPIPPRTL 590
Query: 342 -------------------------------------EFLTTGSFMIRGKKNFFPPCQLA 364
E+LTTGSFMIRGKKNF PP L
Sbjct: 591 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 650
Query: 365 MGISFLFKLEESSISR 380
MG SFLFK++ES + R
Sbjct: 651 MGFSFLFKVDESCVWR 666
>gi|32130516|ref|NP_004704.2| nuclear export mediator factor NEMF [Homo sapiens]
gi|119586148|gb|EAW65744.1| serologically defined colon cancer antigen 1, isoform CRA_d [Homo
sapiens]
gi|148922399|gb|AAI46282.1| Serologically defined colon cancer antigen 1 [synthetic construct]
gi|151556560|gb|AAI48733.1| Serologically defined colon cancer antigen 1 [synthetic construct]
Length = 1076
Score = 322 bits (824), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|55640675|ref|XP_509934.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
troglodytes]
gi|410223614|gb|JAA09026.1| nuclear export mediator factor [Pan troglodytes]
gi|410263654|gb|JAA19793.1| nuclear export mediator factor [Pan troglodytes]
gi|410263656|gb|JAA19794.1| nuclear export mediator factor [Pan troglodytes]
gi|410299008|gb|JAA28104.1| nuclear export mediator factor [Pan troglodytes]
gi|410354861|gb|JAA44034.1| nuclear export mediator factor [Pan troglodytes]
Length = 1076
Score = 322 bits (824), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDIFTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|119586150|gb|EAW65746.1| serologically defined colon cancer antigen 1, isoform CRA_f [Homo
sapiens]
Length = 1010
Score = 322 bits (824), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|402876104|ref|XP_003901819.1| PREDICTED: nuclear export mediator factor NEMF [Papio anubis]
Length = 1048
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 194/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 212 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 262
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 263 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 322
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T++LR
Sbjct: 323 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMMLR 382
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V E++ + + + KP LVDVDL LSAYA
Sbjct: 383 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 442
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 443 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 502
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 503 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 562
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 563 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 622
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 623 GFSFLFKVDESCVWR 637
>gi|426233096|ref|XP_004010553.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Ovis
aries]
Length = 1076
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+ +GY++QK+E +P+ E + I+ EFHP + QH P EFESF
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413
Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
+ E++ + + + +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533
Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
ISSENYL G +++ + +V AD T+ PTG
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593
Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653
Query: 368 SFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666
>gi|345306303|ref|XP_001515044.2| PREDICTED: nuclear export mediator factor NEMF [Ornithorhynchus
anatinus]
Length = 1076
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 193/432 (44%), Positives = 244/432 (56%), Gaps = 86/432 (19%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+EK+P+ E + I+ EFHP + QH +P EFES
Sbjct: 243 NFSGKGYIIQKREKKPSLEPDKPAEDILTYEEFHPFLFS---------QHSKYPYVEFES 293
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKLENV++DHE RL AL Q Q +DK K ELI
Sbjct: 294 FDKAVDEFYSKLEGQKIDLKALQQEKQALKKLENVRKDHEHRLEALHQAQEIDKVKGELI 353
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LL+
Sbjct: 354 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLK 413
Query: 201 -------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAK 247
E+ ++ D + KP +VDVDL LSAYANAK
Sbjct: 414 NPYVMSEEEDDDGEDIEKEETEEPKGKKKKQKDKQLKKPQKNKPLVVDVDLSLSAYANAK 473
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFI 307
+++D KR AA+K QKT+++ EKA KSAEKKTKQTLK+VQT+T I KARKVYWFEKF WFI
Sbjct: 474 KYYDHKRHAARKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFI 533
Query: 308 SSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG------------ 341
SSENYL G +++ + +V AD T+ PTG
Sbjct: 534 SSENYLIIGGRDQQQNEMIVKRYLNSGDIYVHADLHGATSCVIKNPTGEAIPPRTLTEAG 593
Query: 342 ---------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGIS 368
E+LTTGSFMIRGKKNF PP L MG S
Sbjct: 594 TMALCYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 653
Query: 369 FLFKLEESSISR 380
FLFK+EE+ + R
Sbjct: 654 FLFKVEETCVWR 665
>gi|119586145|gb|EAW65741.1| serologically defined colon cancer antigen 1, isoform CRA_a [Homo
sapiens]
Length = 828
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666
>gi|440907236|gb|ELR57405.1| Serologically defined colon cancer antigen 1 [Bos grunniens mutus]
Length = 1077
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+ +GY++QK+E +P+ E + I+ EFHP + QH P EFESF
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413
Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
+ E++ + + + +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533
Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
ISSENYL G +++ + +V AD T+ PTG
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593
Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653
Query: 368 SFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666
>gi|329664770|ref|NP_001192434.1| nuclear export mediator factor NEMF [Bos taurus]
Length = 1076
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+ +GY++QK+E +P+ E + I+ EFHP + QH P EFESF
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413
Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
+ E++ + + + +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533
Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
ISSENYL G +++ + +V AD T+ PTG
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593
Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653
Query: 368 SFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666
>gi|356640194|ref|NP_001239258.1| serologically defined colon cancer antigen 1 [Gallus gallus]
Length = 1071
Score = 320 bits (821), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 199/442 (45%), Positives = 245/442 (55%), Gaps = 105/442 (23%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
+ + +GY++QKKEK+P+ E + I EFHP + QH P EF+S
Sbjct: 242 DFNGKGYIIQKKEKKPSLEPDKPAEDIYTYEEFHPFLFS---------QHSKCPYLEFDS 292
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F A DEF+S E KIDLKA+QQE+ ALKKLENV+RDHE RL AL+Q Q VDK K ELI
Sbjct: 293 FNKAADEFYSKLEGQKIDLKALQQEKQALKKLENVRRDHEQRLEALQQAQEVDKIKGELI 352
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N E V AI +R +ANQ+ W +I A VK+AQ DPVA+ IK+LKL NHIT+LLR
Sbjct: 353 EMNLEIVSRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLR 412
Query: 201 VHKERSNIKDSWVL-----HHDNAD------------------PFADSSETKKPSLVDVD 237
+ +VL ++AD + KPSLVDVD
Sbjct: 413 ---------NPYVLSEEEEEGEDADLEKEETEEPKGKKKKNKSKQLKKPQKNKPSLVDVD 463
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
L LSAYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T I KARKV
Sbjct: 464 LSLSAYANAKKYYDHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKV 523
Query: 298 YWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG-- 341
YWFEKF WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 524 YWFEKFLWFISSENYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPTGEP 583
Query: 342 -------------------------------------------EFLTTGSFMIRGKKNFF 358
E+LTTGSFMIRGKKNF
Sbjct: 584 IPPRTLTEAGTMALCYSAAWDARVVTSAWWVSHNQVSKTAPTGEYLTTGSFMIRGKKNFL 643
Query: 359 PPCQLAMGISFLFKLEESSISR 380
P L MG SFLFK++ES + R
Sbjct: 644 QPSYLMMGFSFLFKVDESCVWR 665
>gi|296483277|tpg|DAA25392.1| TPA: hypothetical protein BOS_10863 [Bos taurus]
Length = 1076
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+ +GY++QK+E +P+ E + I+ EFHP + QH P EFESF
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413
Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
+ E++ + + + +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533
Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
ISSENYL G +++ + +V AD T+ PTG
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593
Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653
Query: 368 SFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666
>gi|194375658|dbj|BAG56774.1| unnamed protein product [Homo sapiens]
Length = 999
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 194/435 (44%), Positives = 244/435 (56%), Gaps = 89/435 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 199 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 249
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 250 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 309
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 310 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 369
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 370 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 429
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ KA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 430 NAKKYYDHKRYAAKKTQKTVEAAGKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 489
Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 490 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 549
Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
E+LTTGSFMIRGKKNF PP L M
Sbjct: 550 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 609
Query: 366 GISFLFKLEESSISR 380
G SFLFK++ES + R
Sbjct: 610 GFSFLFKVDESCVWR 624
>gi|410898599|ref|XP_003962785.1| PREDICTED: nuclear export mediator factor Nemf-like [Takifugu
rubripes]
Length = 1029
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 193/424 (45%), Positives = 246/424 (58%), Gaps = 79/424 (18%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK EK+P+ SE+ + + EFHP + QH P EF+
Sbjct: 242 NFTGKGYIIQKSEKKPSVTPGKPSEELLTYD-EFHPFLFA---------QHSKSPYLEFD 291
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEFFS ES KID+KA+Q E+ A+KKLENVK+DHE RL AL Q Q +D+ K EL
Sbjct: 292 SFDKAVDEFFSKMESQKIDMKALQLEKHAMKKLENVKKDHEQRLEALHQAQEIDRIKGEL 351
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N V+ A+ + +ANQ+ W +I VK+AQ DPVA IK+LKL NHITLLL
Sbjct: 352 IEMNLAIVERALQVVCSALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHITLLL 411
Query: 200 R--VHKERSNIKDSWV--LHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
+ E +D V N + + + KP LVDVDL LSAYANAK+++D KRS
Sbjct: 412 KNPYVSEDDEQEDDVVEETGRKNKNKKSKKFQKNKPMLVDVDLSLSAYANAKKYYDNKRS 471
Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-- 313
A +K+ KTI++ +KA+KSAEKKT++TLK+VQT+T I KARKVYWFEKF WFIS+ENYL
Sbjct: 472 AKRKEFKTIEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFISAENYLVI 531
Query: 314 ------------------GNV-----------------------------AWDAKVVTNA 326
G++ A V +A
Sbjct: 532 AGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSGDPVPPRTLTEAGTMAVCYSA 591
Query: 327 WW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
W V QVSKTAPTGE+LTTGSFMIRGKKN+ PP L MG FLFK++E
Sbjct: 592 AWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFGFLFKVDEH 651
Query: 377 SISR 380
S+ R
Sbjct: 652 SVFR 655
>gi|194379038|dbj|BAG58070.1| unnamed protein product [Homo sapiens]
Length = 782
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 194/436 (44%), Positives = 245/436 (56%), Gaps = 90/436 (20%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N+ +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 12 NLKLKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 62
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 63 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 122
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 123 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 182
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 183 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 242
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 243 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 302
Query: 305 WFISSENYL---GNVAWDAKVVTNAW-------WVKADQVSKTA-----PTG-------- 341
WFISSENYL G +++ + +V AD T+ PTG
Sbjct: 303 WFISSENYLIIGGRDQQQNEIIVKRYLTPVGDIYVHADLHGATSCVIKNPTGEPIPPRTL 362
Query: 342 -------------------------------------EFLTTGSFMIRGKKNFFPPCQLA 364
E+LTTGSFMIRGKKNF PP L
Sbjct: 363 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 422
Query: 365 MGISFLFKLEESSISR 380
MG SFLFK++ES + R
Sbjct: 423 MGFSFLFKVDESCVWR 438
>gi|311245467|ref|XP_001924665.2| PREDICTED: nuclear export mediator factor NEMF [Sus scrofa]
Length = 1076
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSEDY-----IVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+ +GY++QK+E +P+ E I+ EFHP + QH P EFESF
Sbjct: 243 NGKGYIIQKREVKPSLEVDKPTVDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413
Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
++ E++ + + + +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDINTEKNESEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533
Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
ISSENYL G +++ + +V AD T+ PTG
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593
Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
E+LTTGSFMIRGKKNF PP L MG
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653
Query: 368 SFLFKLEESSISR 380
SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666
>gi|301617501|ref|XP_002938173.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
[Xenopus (Silurana) tropicalis]
Length = 951
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 194/427 (45%), Positives = 242/427 (56%), Gaps = 85/427 (19%)
Query: 29 SQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+G+++QK+EK+P+ SED I N EFHP + QH E +SF
Sbjct: 244 GKGFIIQKREKKPSLEPDKPSED-IFTNEEFHPFLFA---------QHCNNTYIELDSFN 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEFFS E +IDLKA+QQE+ ALKKLENV++DHE RL +L+ Q DK K ELI
Sbjct: 294 KAVDEFFSKMEGQRIDLKALQQEKQALKKLENVRKDHEERLESLQHAQDADKAKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W++I VK+AQ D VA IK+LKL NHIT+LL+
Sbjct: 354 NLDIVDRAIQVVRSALANQIDWKEIGLIVKEAQIQGDSVALAIKELKLQTNHITMLLKNP 413
Query: 201 --------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDL 252
+E ++ + + + KP LVDVDL LSAYANAK+++D
Sbjct: 414 YTLSEEGSEDEEEEKEEEPKGKKKKSKNKQPKKVQKNKPVLVDVDLSLSAYANAKKYYDH 473
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
KR AAKK QKTI++ EKA KSAEKKTKQTLK+VQT++ I KARKVYWFEKF WFISSENY
Sbjct: 474 KRHAAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWFEKFLWFISSENY 533
Query: 313 L--------------------GNV-----------------------------AWDAKVV 323
L G++ A V
Sbjct: 534 LVIAGRDQQQNELIVKRYLNPGDLYVHADLHGATSCVIKNPTGEPVPPRTLTEAGTMAVC 593
Query: 324 TNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
+A W V +QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG FLFK+
Sbjct: 594 YSAAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFGFLFKV 653
Query: 374 EESSISR 380
+E + R
Sbjct: 654 DEPCVWR 660
>gi|410962212|ref|XP_003987668.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
[Felis catus]
Length = 1080
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 193/438 (44%), Positives = 243/438 (55%), Gaps = 92/438 (21%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQ---ERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F AVDEF+S E KIDLKA+QQ + ALKKL+NV++DHE RL AL+Q Q +DK K
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQVWYXKQALKKLDNVRKDHENRLEALQQAQEIDKLKG 351
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
ELI N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+
Sbjct: 352 ELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTM 411
Query: 198 LLR----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
LLR + E++ + + + +P LVDVDL LS
Sbjct: 412 LLRNPYLLSEEEDDDVDGDITVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLS 471
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFE 301
AYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFE
Sbjct: 472 AYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFE 531
Query: 302 KFYWFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT------- 340
KF WF+SSENYL D + + T +V AD T+ PT
Sbjct: 532 KFLWFVSSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPR 591
Query: 341 --------------------------------------GEFLTTGSFMIRGKKNFFPPCQ 362
GE+LTTGSFMIRGKKNF PP
Sbjct: 592 TLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSY 651
Query: 363 LAMGISFLFKLEESSISR 380
L MG SFLFK++ES I R
Sbjct: 652 LMMGFSFLFKVDESCIWR 669
>gi|194742419|ref|XP_001953700.1| GF17891 [Drosophila ananassae]
gi|190626737|gb|EDV42261.1| GF17891 [Drosophila ananassae]
Length = 999
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 188/427 (44%), Positives = 247/427 (57%), Gaps = 94/427 (22%)
Query: 20 VLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
+L + ++GY+VQ KE++P ++ N+EFHP Q + F
Sbjct: 269 LLTEGASGKTKGYIVQVKEEKPTENGKVEFFFRNIEFHPY---------QFVQFKDFECA 319
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
FESF AVDEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q D++K
Sbjct: 320 TFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDEDRKK 379
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
AELI +NQ VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+
Sbjct: 380 AELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHIS 439
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSS----ETKKPSLVDVDLDLSAYANAKRFFDL 252
L+L +DP+ ++ +T + ++VDVDL LSA+ANA+R++DL
Sbjct: 440 LIL-------------------SDPYGENEDEDLDTPEVTVVDVDLALSAWANARRYYDL 480
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
KRSAA+K++KT+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENY
Sbjct: 481 KRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENY 540
Query: 313 LGNVAWDAK--------------VVTNAWWVKADQVSKTAPTG----------------- 341
L DA+ + +A A V PTG
Sbjct: 541 LVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIRNPTGEEIPPKTLLEAGSMAIS 600
Query: 342 ----------------------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
E+L TGSFMIRGKKNF P C L MG+S LFKL
Sbjct: 601 YSVAWDAKVVTNSYWVTSEQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLIMGLSLLFKL 660
Query: 374 EESSISR 380
E+S I+R
Sbjct: 661 EDSFIAR 667
>gi|189239405|ref|XP_001813943.1| PREDICTED: similar to CG11847 CG11847-PA [Tribolium castaneum]
gi|270010510|gb|EFA06958.1| hypothetical protein TcasGA2_TC009916 [Tribolium castaneum]
Length = 972
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 191/424 (45%), Positives = 244/424 (57%), Gaps = 101/424 (23%)
Query: 30 QGYLVQKKEKRPNSED-----YIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+GY++QKKE+R S D Y +N EFHP++ + QH+ +EF SF +A
Sbjct: 243 KGYIIQKKEERVVSADNPEKEYYYSNQEFHPVLYE---------QHKSSISKEFPSFNSA 293
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDEFFS+ ES K++LKA+QQER+ALKKLENVK+DH RL ALE+TQ +DK+KAELI NQ
Sbjct: 294 VDEFFSSLESQKLELKALQQEREALKKLENVKKDHSQRLLALEKTQEIDKQKAELITRNQ 353
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
E VD AILA++ +A Q+SW D+ +K+A D +A IK+LKL NHI+L L
Sbjct: 354 ELVDKAILAVQTALATQISWSDLADLIKEAASQGDEIAQRIKELKLETNHISLYL----- 408
Query: 205 RSNIKDSWVLHHDNADPFADSSETKK---------PSLVDVDLDLSAYANAKRFFDLKRS 255
DP+A+ P +VDVDLDLSA+AN +R++D KR+
Sbjct: 409 --------------TDPYAEDDSESDDEDNDDKIPPMVVDVDLDLSAFANGRRYYDQKRN 454
Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGN 315
AAKKQQKTI+S KA KSAEKKTKQTLKDVQT+TNINKARKVYWFEKF+WFISSENYL
Sbjct: 455 AAKKQQKTIESQSKAFKSAEKKTKQTLKDVQTITNINKARKVYWFEKFFWFISSENYLVI 514
Query: 316 VAWDAK---------VVTNAWWVKAD-----QVSKTAPTGEFL----------------- 344
D + + + +V AD V P+G+ +
Sbjct: 515 AGRDQQQNELIVKRYMKSTDVYVHADVHGASSVVIKNPSGQAVPPKTLNEAGTMAICYSV 574
Query: 345 ----------------------TTGSFM------IRGKKNFFPPCQLAMGISFLFKLEES 376
TG ++ IRGKKNF P L +G+SFLFKLEES
Sbjct: 575 AWDAKVVTNAYWVWGEQVSKTAPTGEYLSTGSFMIRGKKNFLPLSHLILGLSFLFKLEES 634
Query: 377 SISR 380
I +
Sbjct: 635 CIEK 638
>gi|170055538|ref|XP_001863626.1| serologically defined colon cancer antigen 1 [Culex
quinquefasciatus]
gi|167875449|gb|EDS38832.1| serologically defined colon cancer antigen 1 [Culex
quinquefasciatus]
Length = 995
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 186/429 (43%), Positives = 246/429 (57%), Gaps = 86/429 (20%)
Query: 20 VLVSNLNVSSQGYLVQKKEKRP----NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
+L + S+G+++QKKE RP E+Y + N+E+ P + Q++ P
Sbjct: 260 MLRQGMREPSKGFIIQKKELRPAKEGEPEEYYLTNLEYQPYLYN---------QYKDEPY 310
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
QEF SF AAVDEF+ST E KIDLK+ QER+ALKKL NV+ DH RL L + QL D++
Sbjct: 311 QEFASFTAAVDEFYSTLEGQKIDLKSFAQEREALKKLSNVRTDHAKRLDDLIKAQLEDRK 370
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
KAELI NQ VD+A+LA++ +A+Q++W DI+ VK AQ +NDP+AS I+QLKL INHI
Sbjct: 371 KAELITRNQNLVDSALLAVQSALASQMAWSDIQDLVKAAQANNDPIASCIRQLKLEINHI 430
Query: 196 TLLLRVHKERSNIKDSWVL-----HHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF 250
+LLL KD + + + D + +P +VDVDL LSA+ANA++++
Sbjct: 431 SLLL---------KDPYAVLDEEEEEEEDSDREDDEQKLEPMVVDVDLALSAFANARKYY 481
Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSE 310
D +R AA+K+QKTI+S+ KALK+AEKKT QTLKDV+T T I+KARKVYWFEKFYWFISSE
Sbjct: 482 DQRRFAARKEQKTIESSSKALKNAEKKTLQTLKDVRTQTTISKARKVYWFEKFYWFISSE 541
Query: 311 NYLGNVAWD--------------AKVVTNAWWVKADQVSKTAPTG------EFLTTGSFM 350
NYL D A + +A A V P+G L G+
Sbjct: 542 NYLVIGGRDQQQNELLVKRYMRPADIYVHAEIQGASSVVIKNPSGAEIPPKTLLEAGTMA 601
Query: 351 I---------------------------------------RGKKNFFPPCQLAMGISFLF 371
I RGKKNF PPC L +G+SF+F
Sbjct: 602 ISYSVAWDAKVVTSAYWVRSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPCHLVLGLSFMF 661
Query: 372 KLEESSISR 380
KLEESS+ R
Sbjct: 662 KLEESSVER 670
>gi|195504496|ref|XP_002099104.1| GE23561 [Drosophila yakuba]
gi|194185205|gb|EDW98816.1| GE23561 [Drosophila yakuba]
Length = 996
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 188/416 (45%), Positives = 240/416 (57%), Gaps = 88/416 (21%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++Q KE++P ++ N+EFHP + Q + F FESF AV
Sbjct: 276 SKGYIIQVKEEKPTENGKVEFFFRNIEFHPYLFT---------QFKNFETATFESFMEAV 326
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q VD++KAELI +NQ
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+L+L +
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
D + K P L VDVDL LSA+ANA+R++D+KRSAA+K++KT
Sbjct: 447 DEDDDDDL---------------KAPELTVVDVDLALSAWANARRYYDMKRSAAQKEKKT 491
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL DA+
Sbjct: 492 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 551
Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
+ +A A V PTG
Sbjct: 552 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 611
Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 612 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 667
>gi|195107152|ref|XP_001998180.1| GI23827 [Drosophila mojavensis]
gi|193914774|gb|EDW13641.1| GI23827 [Drosophila mojavensis]
Length = 962
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 189/423 (44%), Positives = 251/423 (59%), Gaps = 88/423 (20%)
Query: 20 VLVSNLNVSSQGYLVQ-KKEKRPNSE--DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
++ N SS+G+++Q K+EK N+E ++ N+EFHP + Q++ P +
Sbjct: 247 IMAIGRNGSSKGFIIQVKEEKLTNAENTEHFYRNIEFHPYLFS---------QYKKLPFK 297
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E+E+F AVDEFFS+ ES KID+K +QQER+ALKKL NVK+DH RL L + Q DK+K
Sbjct: 298 EYETFMEAVDEFFSSQESQKIDIKTLQQEREALKKLSNVKKDHTKRLEELNRVQDDDKKK 357
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
AELI +NQ VD AILAI+ IA+QLSW DI+ VK+AQ + D VAS IKQLKL INHI+
Sbjct: 358 AELITSNQCLVDKAILAIQSAIASQLSWPDIQELVKEAQANGDIVASSIKQLKLEINHIS 417
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
LLL + N +DNAD +VD+DL LSA+ANA+R++DLKRSA
Sbjct: 418 LLLSDPYKNEN-------ENDNADSV----------IVDIDLALSAWANARRYYDLKRSA 460
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL--- 313
A K++KTI +++KALKSAE+KT+QTLK+V+T++NI KARK++WFEKF+WFISSENYL
Sbjct: 461 ALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKIFWFEKFFWFISSENYLVIG 520
Query: 314 GNVAWDAKVVTNAW------WVKAD-----------QVSKTAPTGEFLTTG--------- 347
G A +++ + +V AD + P L G
Sbjct: 521 GRDAQQNELIVKRYMRPKDIYVHADIQGASSVIIRNTTGEEIPPKTLLEAGTMAISYSVA 580
Query: 348 ------------------------------SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
SFMIRGKKNF P C L MG+S LFKLE+S
Sbjct: 581 WDAKVVTNSYWVYSHQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSLLFKLEDSF 640
Query: 378 ISR 380
+ R
Sbjct: 641 LQR 643
>gi|291230458|ref|XP_002735180.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 834
Score = 314 bits (804), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 190/445 (42%), Positives = 253/445 (56%), Gaps = 89/445 (20%)
Query: 4 NEDVSSDIHKCGPTL----HVLVSNLNVSSQGYLVQKKEKRPNS-----EDYIVANMEFH 54
N D+S+D+ + L +VL + S+GY++QKKEK+ + + ++ E+H
Sbjct: 223 NVDISTDLPRIMAALQDMENVLKKIVESPSKGYVIQKKEKKTSKLSGDVPEELITYAEYH 282
Query: 55 PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
PM+ + QHQ E ESF AVDEFFS + K+D+KA+QQE+ A+KKLEN
Sbjct: 283 PMLFE---------QHQKSLYIELESFGKAVDEFFSQMGTQKLDIKALQQEKSAIKKLEN 333
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
VK+DHE R+ L+ +Q VD KA+LI N VD AI ++ IANQ+ W +I VK+A
Sbjct: 334 VKKDHEKRIQQLQASQNVDMVKAQLIEINLPLVDRAIQVVQSAIANQIDWAEIWDIVKEA 393
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
Q D VA IK LKL+ NHITLLLR S++ D D P +
Sbjct: 394 QTQGDEVAKSIKSLKLDKNHITLLLRDPFVSSDVDDE------------DKHSGIGPLKI 441
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
D+DLDLSAYANA+++++ K+ +A K+QKT+ +++KALKSAE KTKQTLKDV T+T+INKA
Sbjct: 442 DIDLDLSAYANARKYYEAKKHSAVKEQKTLAASQKALKSAEIKTKQTLKDVATVTSINKA 501
Query: 295 RKVYWFEKFYWFISSENYL--------------------GNV------------------ 316
RK YWFEKF WFISSENYL G++
Sbjct: 502 RKTYWFEKFIWFISSENYLIIGGRDQQQNEIVVRKYLNKGDIYVHADLHGASSVIIKNPT 561
Query: 317 -----------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKK 355
A + +A W V +QVSKTAPTGE+LTTGSFM+RGKK
Sbjct: 562 GADIPPKTLNEAGSMAICYSAAWQARVVTSAWWVYHNQVSKTAPTGEYLTTGSFMVRGKK 621
Query: 356 NFFPPCQLAMGISFLFKLEESSISR 380
N+ PP L MG FLFK++E S+ R
Sbjct: 622 NYLPPSYLVMGFGFLFKVDEDSLWR 646
>gi|195354790|ref|XP_002043879.1| GM17806 [Drosophila sechellia]
gi|194129117|gb|EDW51160.1| GM17806 [Drosophila sechellia]
Length = 972
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 186/416 (44%), Positives = 241/416 (57%), Gaps = 90/416 (21%)
Query: 29 SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++Q KE++P + ++ N+EFHP + Q + F FESF AV
Sbjct: 256 SKGYIIQVKEEKPAENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 306
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q VD++KAELI +NQ
Sbjct: 307 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 366
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+L+L +
Sbjct: 367 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 426
Query: 206 SNIKDSWVLHHDNADPFADSSETKKP--SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
+ K P ++VDVDL LSA+ANA+R++D+KRSAA+K++KT
Sbjct: 427 DE-----------------DDDLKAPEVTVVDVDLALSAWANARRYYDMKRSAAQKEKKT 469
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL DA+
Sbjct: 470 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 529
Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
+ +A A V PTG
Sbjct: 530 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 589
Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 590 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 645
>gi|194908933|ref|XP_001981863.1| GG11364 [Drosophila erecta]
gi|190656501|gb|EDV53733.1| GG11364 [Drosophila erecta]
Length = 994
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 187/416 (44%), Positives = 239/416 (57%), Gaps = 90/416 (21%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++Q KE++P ++ N+EFHP + Q + F FESF AV
Sbjct: 276 SKGYIIQVKEEKPTENGKVEFFFKNIEFHPYLFI---------QFKNFEKATFESFMDAV 326
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q VD++KAELI +NQ
Sbjct: 327 DEFYSTQESQKIDIKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+L+L +
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
+ K P L VDVDL LSA+ANA+R++D+KRSAA+K++KT
Sbjct: 447 DE-----------------DDDLKAPELTVVDVDLALSAWANARRYYDMKRSAAQKEKKT 489
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL DA+
Sbjct: 490 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 549
Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
+ +A A V PTG
Sbjct: 550 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 609
Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 610 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 665
>gi|281362528|ref|NP_001163721.1| caliban, isoform B [Drosophila melanogaster]
gi|281362530|ref|NP_651341.2| caliban, isoform C [Drosophila melanogaster]
gi|332319785|sp|Q9VBX1.2|NEMF_DROME RecName: Full=Nuclear export mediator factor NEMF homolog; AltName:
Full=Protein Caliban
gi|157816462|gb|ABV82224.1| IP12923p [Drosophila melanogaster]
gi|272477156|gb|ACZ95015.1| caliban, isoform B [Drosophila melanogaster]
gi|272477157|gb|AAF56406.2| caliban, isoform C [Drosophila melanogaster]
Length = 992
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 185/414 (44%), Positives = 241/414 (58%), Gaps = 86/414 (20%)
Query: 29 SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++Q KE++P + ++ N+EFHP + Q + F FESF AV
Sbjct: 276 SKGYIIQVKEEKPTENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 326
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q VD++KAELI +NQ
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+L+L +
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
D + + ++VDVDL LSA+ANA+R++D+KRSAA+K++KT+
Sbjct: 447 DE---------------DDDLKDPEVTVVDVDLALSAWANARRYYDMKRSAAQKEKKTVD 491
Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---- 321
+++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL DA+
Sbjct: 492 ASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQNEL 551
Query: 322 ----------VVTNAWWVKADQVSKTAPTG------------------------------ 341
+ +A A V PTG
Sbjct: 552 IVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVTNS 611
Query: 342 ---------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 612 YWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 665
>gi|284005983|gb|ADB57053.1| MIP15468p [Drosophila melanogaster]
Length = 939
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 185/414 (44%), Positives = 241/414 (58%), Gaps = 86/414 (20%)
Query: 29 SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++Q KE++P + ++ N+EFHP + Q + F FESF AV
Sbjct: 276 SKGYIIQVKEEKPTENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 326
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q VD++KAELI +NQ
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+L+L +
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
D + + ++VDVDL LSA+ANA+R++D+KRSAA+K++KT+
Sbjct: 447 DE---------------DDDLKDPEVTVVDVDLALSAWANARRYYDMKRSAAQKEKKTVD 491
Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---- 321
+++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL DA+
Sbjct: 492 ASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQNEL 551
Query: 322 ----------VVTNAWWVKADQVSKTAPTG------------------------------ 341
+ +A A V PTG
Sbjct: 552 IVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVTNS 611
Query: 342 ---------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 612 YWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 665
>gi|240978882|ref|XP_002403060.1| conserved hypothetical protein [Ixodes scapularis]
gi|215491284|gb|EEC00925.1| conserved hypothetical protein [Ixodes scapularis]
Length = 651
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 181/415 (43%), Positives = 236/415 (56%), Gaps = 94/415 (22%)
Query: 29 SQGYLVQKKEKRPNSED----YIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
S+GY++ + EKR D I + EFHP + RQH+ + E SF+AA
Sbjct: 243 SKGYILVRVEKRVTPADDGSTEITSYQEFHPFL---------WRQHEKERVVELASFSAA 293
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VD+FFS+ E +I LKA Q+E++ALKKLEN++ DHE R+ ALEQ Q DK KAELI N
Sbjct: 294 VDQFFSSLEMQRISLKAHQKEKEALKKLENIRMDHEKRIVALEQVQREDKHKAELIEINL 353
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
+ V+ A+L +R +ANQ+ W +I +++AQ DPVA IKQLKL+ NH +LLR
Sbjct: 354 DLVERALLVLRSALANQIGWAEITELLREAQEQGDPVAQSIKQLKLDTNHFAMLLR---- 409
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
DP+ E + +LVD+DLDLSAYANA+R++D KR AA KQQKT+
Sbjct: 410 ---------------DPY---EEDARDTLVDIDLDLSAYANARRYYDQKRHAAGKQQKTL 451
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK--- 321
+S+ KA KSAEKKTK+ LK V +NI +ARK +WFEKF+WFISSE+YL DA+
Sbjct: 452 ESSTKAYKSAEKKTKEALKQVALTSNIARARKAFWFEKFFWFISSEDYLVIGGRDAQQNE 511
Query: 322 ------------------------VVTN-----------------------AW------- 327
V+ N AW
Sbjct: 512 MIVKRHLNPGDVYVHADLHGASSIVIKNPGGGSVPPKTLNEAGTMAICYSAAWDAKVVTS 571
Query: 328 --WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WV QVSKTAPTG++LT G+FMIRGKKN+ PP L MG FL+KL+E S+ R
Sbjct: 572 AWWVHHHQVSKTAPTGQYLTPGAFMIRGKKNYLPPSYLIMGFGFLYKLDEDSVER 626
>gi|195573753|ref|XP_002104856.1| GD21177 [Drosophila simulans]
gi|194200783|gb|EDX14359.1| GD21177 [Drosophila simulans]
Length = 972
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 184/416 (44%), Positives = 240/416 (57%), Gaps = 90/416 (21%)
Query: 29 SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
+GY++Q KE++P + ++ N+EFHP + Q + F FESF AV
Sbjct: 256 GKGYIIQVKEEKPAENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 306
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q VD++KAELI +NQ
Sbjct: 307 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 366
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+L+L +
Sbjct: 367 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 426
Query: 206 SNIKDSWVLHHDNADPFADSSETKKP--SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
+ K P ++VDVDL +SA+ANA+R++D+KRSAA+K++KT
Sbjct: 427 DE-----------------DDDLKAPEVTVVDVDLAMSAWANARRYYDMKRSAAQKEKKT 469
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL DA+
Sbjct: 470 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 529
Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
+ +A A V PTG
Sbjct: 530 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 589
Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 590 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 645
>gi|291190355|ref|NP_001167106.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
gi|223648156|gb|ACN10836.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
Length = 1069
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 182/426 (42%), Positives = 239/426 (56%), Gaps = 83/426 (19%)
Query: 29 SQGYLVQKKEKRPN----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+GY++QK +K+P+ + ++ EFHP + QH EF++F A
Sbjct: 246 GKGYIIQKCDKKPSLAPEKPEELLTYEEFHPFLFA---------QHANSHYVEFDTFDKA 296
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDE++S ES +ID+KA+QQE+ ALKKL+NVKRDH RL AL Q Q VD+ + EL+ N
Sbjct: 297 VDEYYSKMESQRIDVKALQQEKQALKKLDNVKRDHVQRLEALHQLQEVDRLRGELVEMNL 356
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR---- 200
V+ A+ +R +ANQ+ W +I VK+AQ DPVA IK+LKL NHIT+LL+
Sbjct: 357 PIVERALQVVRSALANQVDWAEIGLIVKEAQAAGDPVACAIKELKLQTNHITMLLKNPYI 416
Query: 201 -------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
+ + + + +P LVDVDL LSAYANAK+++D K
Sbjct: 417 VPDEVEEEDVAEVAEEKKGKKNKNKDKGQKGKPKKDQPMLVDVDLSLSAYANAKKYYDHK 476
Query: 254 RSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
R+AAKK+QKT+++ +KA KSAEKKTKQTLK+VQT+T I KARKVYWFEKF WFISSENYL
Sbjct: 477 RTAAKKEQKTVEAAQKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSENYL 536
Query: 314 --------------------GNV-----------------------------AWDAKVVT 324
G++ A V
Sbjct: 537 IIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNASGVPIPPRTLTEAGTMAVCY 596
Query: 325 NAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+A W V QV+K+APTGE+LTTGSFMIRGKKNF PP L MG SFLFK++
Sbjct: 597 SAAWDAKVITSAWWVHHHQVTKSAPTGEYLTTGSFMIRGKKNFMPPSYLMMGFSFLFKVD 656
Query: 375 ESSISR 380
E + R
Sbjct: 657 EQCVFR 662
>gi|383852746|ref|XP_003701886.1| PREDICTED: nuclear export mediator factor NEMF homolog [Megachile
rotundata]
Length = 970
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 157/290 (54%), Positives = 211/290 (72%), Gaps = 33/290 (11%)
Query: 29 SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P +E++I N+EFHP + + Q++ +P +EF+SF A+V
Sbjct: 242 SKGYIIQKKEVKPVVDGTEEFIYTNIEFHPYLFE---------QYKDYPFKEFDSFDASV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+D+K +QQER+ALKKL+NVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 293 DEYFSTMEGQKLDMKVLQQEREALKKLDNVKKDHDQRLITLEKTQELDKQKAELISRNQM 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL+ NHI+LLL
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLDTNHISLLLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ +S E KP L+D+DL +A+ NA+++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEESDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAAKKQQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
I+S +KALKSAEKKTKQTLK+VQ + +INK RK+YWFEKFYWFISSENYL
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQAIHSINKLRKIYWFEKFYWFISSENYL 503
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 51/66 (77%), Positives = 57/66 (86%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
++AWDAKVV AWWV DQVSKTAPTGE+LTTGSFMIRGKKN+ PCQL MG+ FLF+LE
Sbjct: 564 SIAWDAKVVAGAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLSPCQLVMGLGFLFRLE 623
Query: 375 ESSISR 380
ESSI R
Sbjct: 624 ESSIER 629
>gi|395504204|ref|XP_003756446.1| PREDICTED: nuclear export mediator factor NEMF [Sarcophilus
harrisii]
Length = 996
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 180/375 (48%), Positives = 229/375 (61%), Gaps = 52/375 (13%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N S +GY++QKKEK+P+ SED I+ EFHP + QH P EFE
Sbjct: 241 NFSGKGYIIQKKEKKPSLEPDKQSED-ILTYEEFHPFLFS---------QHSKCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKLEGQKIDLKALQQEKQALKKLDNVRKDHEHRLEALHQAQEIDKIKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ D VA+ I++LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDIVANAIRELKLQTNHVTMLL 410
Query: 200 R--------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
+ ++ E+ ++ + + KP LVDVDL LSAYAN
Sbjct: 411 KNPYLISDEEEEDDEINIEKEETEEPKGKKKKQKNKQLQKLQKNKPLLVDVDLSLSAYAN 470
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
AK+++D KR AA+K QKT+++ EKA +SAEKKTKQTLK+VQ +T I KARKVY
Sbjct: 471 AKKYYDHKRHAARKTQKTVEAAEKAFRSAEKKTKQTLKEVQMVTTIQKARKVYCIS---L 527
Query: 306 FISSENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
I S N+ + QVSKTAPTGE+LTTGSFMIRGKKNF PP L M
Sbjct: 528 VIES-----NIIF--------------QVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 568
Query: 366 GISFLFKLEESSISR 380
G SFLFK++E + R
Sbjct: 569 GFSFLFKVDEPCVWR 583
>gi|119586147|gb|EAW65743.1| serologically defined colon cancer antigen 1, isoform CRA_c [Homo
sapiens]
Length = 1001
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 185/404 (45%), Positives = 228/404 (56%), Gaps = 84/404 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
EFHP + QH P EFESF AVDEF+S E KIDLKA+QQE+ ALKK
Sbjct: 263 EFHPFLFS---------QHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKK 313
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+NV++DHE RL AL+Q Q +DK K ELI N + VD AI +R +ANQ+ W +I V
Sbjct: 314 LDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIV 373
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLR----------------VHKERSNIKDSWVLH 215
K+AQ DPVAS IK+LKL NH+T+LLR V+ E++ +
Sbjct: 374 KEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNETEPPKGKK 433
Query: 216 HDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE 275
+ + KP LVDVDL LSAYANAK+++D KR AAKK QKT+++ EKA KSAE
Sbjct: 434 KKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAE 493
Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW----- 327
KKTKQTLK+VQT+T+I KARKVYWFEKF WFISSENYL G +++ +
Sbjct: 494 KKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGD 553
Query: 328 -WVKADQVSKTA-----PTG---------------------------------------- 341
+V AD T+ PTG
Sbjct: 554 IYVHADLHGATSCVIKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSK 613
Query: 342 -----EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 614 TAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 657
>gi|119586146|gb|EAW65742.1| serologically defined colon cancer antigen 1, isoform CRA_b [Homo
sapiens]
Length = 1067
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 185/404 (45%), Positives = 228/404 (56%), Gaps = 84/404 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
EFHP + QH P EFESF AVDEF+S E KIDLKA+QQE+ ALKK
Sbjct: 263 EFHPFLFS---------QHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKK 313
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+NV++DHE RL AL+Q Q +DK K ELI N + VD AI +R +ANQ+ W +I V
Sbjct: 314 LDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIV 373
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLR----------------VHKERSNIKDSWVLH 215
K+AQ DPVAS IK+LKL NH+T+LLR V+ E++ +
Sbjct: 374 KEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNETEPPKGKK 433
Query: 216 HDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE 275
+ + KP LVDVDL LSAYANAK+++D KR AAKK QKT+++ EKA KSAE
Sbjct: 434 KKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAE 493
Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW----- 327
KKTKQTLK+VQT+T+I KARKVYWFEKF WFISSENYL G +++ +
Sbjct: 494 KKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGD 553
Query: 328 -WVKADQVSKTA-----PTG---------------------------------------- 341
+V AD T+ PTG
Sbjct: 554 IYVHADLHGATSCVIKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSK 613
Query: 342 -----EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 614 TAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 657
>gi|159155700|gb|AAI54741.1| Zgc:153813 protein [Danio rerio]
Length = 883
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 183/428 (42%), Positives = 239/428 (55%), Gaps = 97/428 (22%)
Query: 26 NVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S QG+++QK E++PN + + ++ EFHP + QH EFES
Sbjct: 242 NFSGQGFIIQKSEQKPNVCAGDAAEELLTYEEFHPFLFC---------QHVKSRYVEFES 292
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEFFS ES K+D++A+QQE+ ALKKLENV++DH+ RL AL Q Q V++ K EL+
Sbjct: 293 FNKAVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELV 352
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N V A+ +R +ANQ+ W +I V +AQ DPVA IK+LKL NHITLLLR
Sbjct: 353 ELNLPVVQRALQVVRSALANQVDWVEIGQMVTEAQAAGDPVACAIKELKLQSNHITLLLR 412
Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK---------KPSLVDVDLDLSAYANAKRFFD 251
+ A P ++E + K LVD+D++LSA+ANAKR++D
Sbjct: 413 ---------------NPEACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYD 457
Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN 311
KRSAAKK+QKT+++ +KA KSAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSEN
Sbjct: 458 SKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSEN 517
Query: 312 YLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PTGE--------------- 342
YL D + + +V AD T+ P+GE
Sbjct: 518 YLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAV 577
Query: 343 ------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
+LTTGSFMIRGKKNF PP L MG FLFK
Sbjct: 578 CYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFK 637
Query: 373 LEESSISR 380
+++ S+ R
Sbjct: 638 VDDQSVFR 645
>gi|71679669|gb|AAI00005.1| Zgc:153813 protein [Danio rerio]
Length = 881
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 183/428 (42%), Positives = 239/428 (55%), Gaps = 97/428 (22%)
Query: 26 NVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S QG+++QK E++PN + + ++ EFHP + QH EFES
Sbjct: 242 NFSGQGFIIQKSEQKPNVCAGDAAEELLTYEEFHPFLFC---------QHVKSRYVEFES 292
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEFFS ES K+D++A+QQE+ ALKKLENV++DH+ RL AL Q Q V++ K EL+
Sbjct: 293 FNKAVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELV 352
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N V A+ +R +ANQ+ W +I V +AQ DPVA IK+LKL NHITLLLR
Sbjct: 353 ELNLPVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLR 412
Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK---------KPSLVDVDLDLSAYANAKRFFD 251
+ A P ++E + K LVD+D++LSA+ANAKR++D
Sbjct: 413 ---------------NPEACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYD 457
Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN 311
KRSAAKK+QKT+++ +KA KSAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSEN
Sbjct: 458 SKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSEN 517
Query: 312 YLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PTGE--------------- 342
YL D + + +V AD T+ P+GE
Sbjct: 518 YLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAV 577
Query: 343 ------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
+LTTGSFMIRGKKNF PP L MG FLFK
Sbjct: 578 CYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFK 637
Query: 373 LEESSISR 380
+++ S+ R
Sbjct: 638 VDDQSVFR 645
>gi|115529351|ref|NP_001070202.1| uncharacterized protein LOC767767 [Danio rerio]
gi|115313121|gb|AAI24465.1| Zgc:153813 [Danio rerio]
Length = 694
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 183/428 (42%), Positives = 239/428 (55%), Gaps = 97/428 (22%)
Query: 26 NVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S QG+++QK E++PN + + ++ EFHP + QH EFES
Sbjct: 242 NFSGQGFIIQKSEQKPNVCAGDAAEELLTYEEFHPFLFC---------QHVKSRYVEFES 292
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEFFS ES K+D++A+QQE+ ALKKLENV++DH+ RL AL Q Q V++ K EL+
Sbjct: 293 FNKAVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELV 352
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N V A+ +R +ANQ+ W +I V +AQ DPVA IK+LKL NHITLLLR
Sbjct: 353 ELNLPVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLR 412
Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK---------KPSLVDVDLDLSAYANAKRFFD 251
+ A P ++E + K LVD+D++LSA+ANAKR++D
Sbjct: 413 ---------------NPEACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYD 457
Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN 311
KRSAAKK+QKT+++ +KA KSAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSEN
Sbjct: 458 SKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSEN 517
Query: 312 YLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PTGE--------------- 342
YL D + + +V AD T+ P+GE
Sbjct: 518 YLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAV 577
Query: 343 ------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
+LTTGSFMIRGKKNF PP L MG FLFK
Sbjct: 578 CYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFK 637
Query: 373 LEESSISR 380
+++ S+ R
Sbjct: 638 VDDQSVFR 645
>gi|322784867|gb|EFZ11647.1| hypothetical protein SINV_03144 [Solenopsis invicta]
Length = 985
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 187/416 (44%), Positives = 244/416 (58%), Gaps = 92/416 (22%)
Query: 29 SQGYLVQKKEKR---PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE + ED+I AN+EFHP + Q+ P +EF+SF AAV
Sbjct: 242 SKGYILQKKESKLTQDGKEDFIFANIEFHPFLFD---------QYNNQPYKEFDSFDAAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE++ST E KIDLKA+QQER+AL+KLE V++DH RL LE+TQ +DK+KAELI NQ
Sbjct: 293 DEYYSTMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDKQKAELISRNQA 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ+SW DI+ +K+AQ DPVAS IKQLKL NHI LLL
Sbjct: 353 LVDNAILAIQSALANQMSWPDIQVLLKEAQARGDPVASAIKQLKLETNHIALLLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ DS E KP ++D+DL +A++NAK+++ K+SAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMIIDIDLAHTAFSNAKKYYSQKKSAAKKQQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDA 320
I+S KALKSAEKKTKQTLK+VQT+ INK RK+YWFEKFYWFI+SENYL G
Sbjct: 454 IESHGKALKSAEKKTKQTLKEVQTIHTINKLRKMYWFEKFYWFITSENYLVIGGRDQQQN 513
Query: 321 KVVTNAW------WVKADQVSKTA-----PTGEFL------------------------- 344
+++ + +V AD ++ P+G +
Sbjct: 514 ELIVKRYLKAGDLYVHADLTGASSVVIKNPSGNPVPPKSLAEAGTMAVAYSIAWDSKVIA 573
Query: 345 --------------TTGSF------MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
TG + MIRGKKN+ QL MG+ +F+LE+SSI R
Sbjct: 574 SAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLTQSQLIMGLGLMFRLEDSSIER 629
>gi|332016223|gb|EGI57136.1| Serologically defined colon cancer antigen 1 [Acromyrmex
echinatior]
Length = 990
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 186/416 (44%), Positives = 242/416 (58%), Gaps = 92/416 (22%)
Query: 29 SQGYLVQKKEKR---PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE + ED+I AN+EFHP + + Q+ P +EF SF AAV
Sbjct: 242 SKGYIIQKKESKLTQDGKEDFIFANIEFHPFLFE---------QYNNQPYKEFNSFDAAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FS E KIDLKA+QQER+AL+KLE V++DH RL LE+TQ +DK+KAELI NQ
Sbjct: 293 DEYFSMMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDKQKAELISRNQV 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ+SW DI+ +K+AQ DPVAS IKQLKL NHI L+L
Sbjct: 353 LVDNAILAIQSALANQMSWPDIQVLLKEAQTRGDPVASAIKQLKLETNHIALMLH----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
DP+ DS E KP ++D+DL +A++NAK+++ K+SAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESKLKPMMIDIDLAHTAFSNAKKYYSQKKSAAKKQQKT 453
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDA 320
I+S KALKSAEKKTKQTLK+VQT+ INK RK YWFEKFYWFI+SENYL G
Sbjct: 454 IESQGKALKSAEKKTKQTLKEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQN 513
Query: 321 KVVTNAW------WVKADQVSKTA-----PTGEFL------------------------- 344
+++ + +V AD ++ P+G +
Sbjct: 514 ELIVKRYLKAGDLYVHADLTGASSVVIKNPSGNPVPPKSLAEAGTMAVAYSIAWDSKVIA 573
Query: 345 --------------TTGSF------MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
TG + MIRGKKN+ QL MG+ +F+LE+SSI R
Sbjct: 574 SAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLTHSQLIMGLGIMFRLEDSSIER 629
>gi|390331684|ref|XP_003723334.1| PREDICTED: nuclear export mediator factor Nemf-like
[Strongylocentrotus purpuratus]
Length = 1116
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 175/411 (42%), Positives = 241/411 (58%), Gaps = 83/411 (20%)
Query: 30 QGYLVQKKEKRPN-SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
+GY+VQK+EK+P S+D A + IL S P + EF++F A DEF
Sbjct: 244 KGYIVQKREKKPAASQDGEDAEL-LTEFILYTNSQPYL----------EFDTFDQAADEF 292
Query: 89 FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
FS ES K+D+K +QQER ALKKL+NVK+DHE R+S+L+Q Q ++++K LI N V+
Sbjct: 293 FSKMESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLVE 352
Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
A+ +R +ANQ+ W++I++ +K+AQ DPVA IK L+L+ NH +LLR
Sbjct: 353 QALRVVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIKSLRLDTNHFQMLLR-------- 404
Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTE 268
D + +D+AD + +P LVD+D+ SAYANA+++F K+++ KK+QKT++S+
Sbjct: 405 -DPYK-QYDDAD--EGEEDVARPMLVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSS 460
Query: 269 KALKSAEKKTKQTLKDVQTMTNINKARKV------YWF--EKFYWFISSEN--------- 311
KA+KSAEKKT Q LKDV T+ +INK+RK YW + Y I+ +
Sbjct: 461 KAIKSAEKKTMQALKDVATVASINKSRKTYWFEKYYWCISSENYIIIAGRDQQQNEIVVK 520
Query: 312 -YLG-----------------------------------------NVAWDAKVVTNAWWV 329
YL +VAWDAKV+T+AWWV
Sbjct: 521 KYLSPGDIYVHADIHGASSVIIKNPKGGPVPPKTLQEAGTMAVCYSVAWDAKVITSAWWV 580
Query: 330 KADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+ DQVSKTAPTGEFLTTGSFM+RGKKNF PP QL MG FL K++ES R
Sbjct: 581 RHDQVSKTAPTGEFLTTGSFMVRGKKNFLPPTQLVMGFGFLMKIDESCAWR 631
>gi|195038845|ref|XP_001990823.1| GH19576 [Drosophila grimshawi]
gi|193895019|gb|EDV93885.1| GH19576 [Drosophila grimshawi]
Length = 983
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 182/415 (43%), Positives = 237/415 (57%), Gaps = 92/415 (22%)
Query: 28 SSQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+S GY++Q KE++P + ++ N+EFHP + Q++ P +++E+F A
Sbjct: 275 NSNGYIIQVKEEKPLDTENTEHFYRNVEFHPYLFV---------QNKDQPFKKYETFMEA 325
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDEFFST ES KID+K +QQER+ALKKL NVK DH RL L + Q +DK KAELI +NQ
Sbjct: 326 VDEFFSTQESQKIDIKTLQQEREALKKLSNVKNDHTKRLDELNKLQDIDKRKAELITSNQ 385
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
VD AILAI+ IA+QLSW DI+ VK+AQ + D VAS IKQLKL INHI+LLL
Sbjct: 386 SLVDKAILAIQSAIASQLSWPDIQELVKEAQTNGDVVASSIKQLKLEINHISLLL----- 440
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
DP+ +VDVDL LSA+ANA+R++D KRSAA K++KTI
Sbjct: 441 --------------TDPY--ECNDDDSIIVDVDLALSAWANARRYYDQKRSAALKEKKTI 484
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAK 321
+++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWF+SSENYL G A +
Sbjct: 485 DASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFYWFVSSENYLVIGGRDAQQNE 544
Query: 322 VVTNAW------WVKAD-----------QVSKTAPTGEFLTTGSF--------------- 349
++ + +V AD P L G+
Sbjct: 545 LIVKRYMRPKDIYVHADIQGASSVIIRNATGGDIPPKTLLEAGTMAISYSVAWDAKVVTN 604
Query: 350 ------------------------MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
MIRGKKNF P C L MG+S LFKLEE + R
Sbjct: 605 SYWVYSNQVSKTAPSGEYLGTGSFMIRGKKNFLPSCHLIMGLSLLFKLEEGFVQR 659
>gi|198422494|ref|XP_002122733.1| PREDICTED: similar to serologically defined colon cancer antigen 1
[Ciona intestinalis]
Length = 1103
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 172/412 (41%), Positives = 232/412 (56%), Gaps = 92/412 (22%)
Query: 26 NVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
++ +GY+VQK E + + E + N+EFHP V QH+ +QEFESF AV
Sbjct: 251 SLQCKGYIVQKIETKSDGE--LKTNVEFHPF---------VFNQHKHRNLQEFESFNKAV 299
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEFF + ES K D+K++Q+ER A++KLENV++DHE+RLS L Q D+ KA LI N
Sbjct: 300 DEFFGSLESQKNDMKSLQRERAAMRKLENVRKDHESRLSGLRSEQESDEMKAALIETNLH 359
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD +IL +R IANQ+ W++I+ VK+AQ DPVAS IK LKL N + + LR H +
Sbjct: 360 LVDQSILVVRSAIANQVDWDEIKLLVKEAQGRGDPVASCIKTLKLETNSMVMALRSHDD- 418
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
+ +KP+ +++DL LSAYANA++++ KR+AAKK+QKTI
Sbjct: 419 ---------------------DDQKPTKIEIDLSLSAYANARKYYGRKRNAAKKEQKTID 457
Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKV 322
++ KA KSAEKKTKQTLK+ + NI KARKVYWFEKF WFISSENYL G A +V
Sbjct: 458 ASTKAFKSAEKKTKQTLKEAAAVRNILKARKVYWFEKFLWFISSENYLVIGGREAQQNEV 517
Query: 323 VTNAW------WVKADQVSKTA-----PTG------------------------------ 341
+ + +V AD T+ P+G
Sbjct: 518 LVKKYLNQGDIYVHADLHGATSCIIKNPSGQPIPPKTLNEAGTMATCHSAAWDAKVVTSA 577
Query: 342 ---------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
E+LTTGSF+IRGKKN+ PP L G FLFK++E+ +
Sbjct: 578 WWVHHDQVSKTAPSGEYLTTGSFLIRGKKNYLPPSYLVYGFGFLFKVDETCV 629
>gi|443707183|gb|ELU02895.1| hypothetical protein CAPTEDRAFT_151175 [Capitella teleta]
Length = 1023
Score = 290 bits (742), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 178/413 (43%), Positives = 228/413 (55%), Gaps = 88/413 (21%)
Query: 28 SSQGYLVQKKEKRPNSE--DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
SS+GY++QKKEK+ +S D ++ EFHP + +QH+ P EFESF+ AV
Sbjct: 242 SSKGYIIQKKEKKASSTEGDELITYNEFHPYL---------YKQHESCPHLEFESFSKAV 292
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEFFS ES K+D+K +QQE+ L+KLENV++DH RL AL Q D K +LI N
Sbjct: 293 DEFFSKIESQKLDMKTLQQEKSVLRKLENVRKDHAQRLQALANEQEKDNIKGQLIEMNLP 352
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
V+ AIL ++ +ANQL W DI VK+AQ DPVAS I L+L NH T++LR
Sbjct: 353 LVERAILVVQSALANQLDWADINQLVKEAQAQGDPVASSISSLQLQSNHFTMMLR----- 407
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
D + E P+ V +DL LSAYANA++++D K+ AA+K+QKT+
Sbjct: 408 --------------DCYEGDEEDMLPAQKVQIDLGLSAYANARKYYDKKKHAAQKEQKTV 453
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISSENYLGN- 315
++ KALKSAEKKTKQTLK+VQ I K RK + WF + Y I + N
Sbjct: 454 AASTKALKSAEKKTKQTLKEVQVAATIRKQRKTHWFEKFLWFISSENYLVIGGRDQQQNE 513
Query: 316 ------------------------------------------------VAWDAKVVTNAW 327
AWDAKVVT+AW
Sbjct: 514 LLVKRHLRPGDLYVHADLHGASSVIIKNPSGVPPKTLNEAGTMALCHSAAWDAKVVTSAW 573
Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WV QVSKTAPTGE+LTTGSFMIRGKKNF PP L G FLFK++++SI R
Sbjct: 574 WVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIYGFGFLFKVDDTSIFR 626
>gi|26333303|dbj|BAC30369.1| unnamed protein product [Mus musculus]
Length = 641
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 179/407 (43%), Positives = 229/407 (56%), Gaps = 89/407 (21%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410
Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
R + +E D+ + + D P + K KP LVDVDL LSAYAN
Sbjct: 411 RNPYLLSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 470
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 471 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 530
Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
FISSENYL G +++ + +V AD T+ PTG
Sbjct: 531 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 590
Query: 342 -----------------------------------EFLTTGSFMIRG 353
E+LTTGSFMIRG
Sbjct: 591 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 637
>gi|340374096|ref|XP_003385574.1| PREDICTED: nuclear export mediator factor Nemf-like [Amphimedon
queenslandica]
Length = 1137
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 176/423 (41%), Positives = 236/423 (55%), Gaps = 99/423 (23%)
Query: 26 NVSSQGYLVQKKEKRPN----------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
++ S+GYL+QKK+ P + + N+EFHP++ RQH
Sbjct: 242 SMPSKGYLIQKKDVAPTVGVATSDTPTTSTEVTTNIEFHPLL---------YRQHLSSCY 292
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
+E E+F AVDEFFS+ S K D+K +Q ++ A+KKLEN+K+DHE R+ AL ++Q D+
Sbjct: 293 KEVETFDRAVDEFFSSKSSQKQDVKVIQLQKSAVKKLENIKQDHEKRIEALRKSQDEDRY 352
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
KAELI N + V+ A L IR +A+ + W DIE V AQ DPVA+ I+ LKL+ N I
Sbjct: 353 KAELIEWNTDLVERACLVIRSAVASSMDWGDIELLVHDAQGRGDPVANSIQGLKLHSNLI 412
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
TL L+ E D+ D VD+DL LS YANA+R++D+K+
Sbjct: 413 TLWLKAPYEE-----------DDDDSIK----------VDIDLGLSVYANARRYYDMKKQ 451
Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY--- 312
AAKK+QKT +S+ KALKSAE+KTKQTLK+ ++ I KARKV+WFEKFYWFISSEN+
Sbjct: 452 AAKKEQKTSESSNKALKSAERKTKQTLKEAAVISRITKARKVHWFEKFYWFISSENFVVI 511
Query: 313 -----------------------------------------------LGNVAWDAKVVTN 325
L A ++
Sbjct: 512 GGRDQQQNELLVKKYLNEHDVYVHADLHGATSVIVKNHSGGPVPPKTLNEAGVMAVCYSS 571
Query: 326 AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW WV A+QVSKTAP+GE+LTTGSFMIRGKKNF PPC L +G S +FK++ES
Sbjct: 572 AWEAKIVTSAWWVYANQVSKTAPSGEYLTTGSFMIRGKKNFLPPCHLVLGFSIMFKVDES 631
Query: 377 SIS 379
S++
Sbjct: 632 SLA 634
>gi|119586149|gb|EAW65745.1| serologically defined colon cancer antigen 1, isoform CRA_e [Homo
sapiens]
Length = 628
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 199/309 (64%), Gaps = 30/309 (9%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531
Query: 305 WFISSENYL 313
WFISSENYL
Sbjct: 532 WFISSENYL 540
>gi|321467512|gb|EFX78502.1| hypothetical protein DAPPUDRAFT_305191 [Daphnia pulex]
Length = 997
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 153/315 (48%), Positives = 206/315 (65%), Gaps = 34/315 (10%)
Query: 10 DIHKCGPTLHV-------LVSNLNVSSQGYLVQKKEKRPN----SEDYIVANMEFHPMIL 58
DI + P LH+ ++ N+ S+G +VQK+E RP ++D+ N EFHPM+
Sbjct: 217 DITRDLPKLHLALKSADSIMQNIGSISKGIVVQKRESRPTPSGENQDFFT-NQEFHPML- 274
Query: 59 QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
+QH+ P E SF AVDEFFS ES K+DLK VQQERDA+KKL N+++D
Sbjct: 275 --------YKQHESHPFIELPSFNQAVDEFFSKMESQKLDLKVVQQERDAMKKLANIRQD 326
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
HE RL+ L Q +D+ KA LI NQ +D AI +R +ANQ+SW++I+ V++A R
Sbjct: 327 HEKRLANLHHVQEIDELKARLIEMNQPLIDHAIQVVRSALANQVSWKEIDELVEEATRKG 386
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
DPVA IIK+LKL+ NHI+L+L H DS +S E+ KP LVD+DL
Sbjct: 387 DPVAKIIKKLKLSTNHISLMLS-HPYAEQDSDS------------ESDESYKPQLVDIDL 433
Query: 239 DLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY 298
DL+A+ANA+++F K++A+KK+QKTI+S+ KA KSAEKK KQTLK+ + I KARKV
Sbjct: 434 DLTAFANARKYFGEKKNASKKEQKTIESSHKAFKSAEKKAKQTLKESAAIATIRKARKVL 493
Query: 299 WFEKFYWFISSENYL 313
WFEKFYWFISS+NY+
Sbjct: 494 WFEKFYWFISSDNYI 508
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 45/66 (68%), Positives = 56/66 (84%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AW+AKV+T AWWV++ QVSKTAP+GE+LTTGSFMIRGKKNF PP + +G LF+LE
Sbjct: 570 SAAWEAKVMTTAWWVESSQVSKTAPSGEYLTTGSFMIRGKKNFLPPLPIVLGFGLLFRLE 629
Query: 375 ESSISR 380
ESSI+R
Sbjct: 630 ESSIAR 635
>gi|347968346|ref|XP_312244.5| AGAP002680-PA [Anopheles gambiae str. PEST]
gi|333468048|gb|EAA08148.6| AGAP002680-PA [Anopheles gambiae str. PEST]
Length = 1053
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 150/289 (51%), Positives = 197/289 (68%), Gaps = 13/289 (4%)
Query: 29 SQGYLVQKKEKRP----NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
S GY++QKKE +P + E+Y N+E+ P + Q+QG P + F+SF A
Sbjct: 268 SPGYIIQKKEVKPTKQGDEEEYYFTNLEYQPYMYN---------QYQGEPFKAFDSFTTA 318
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDEF+S+ ES KIDLKA QER+ALKKL NVK DH R+ L + QL D+++AELI NQ
Sbjct: 319 VDEFYSSLESQKIDLKAFAQEREALKKLSNVKTDHAKRIEELTKAQLEDRKRAELITRNQ 378
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
+ VD A+LA++ +A Q+SW DI+ VK AQ + DPVAS I+QLKL INHI+L L
Sbjct: 379 DLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDPVASCIRQLKLEINHISLHLTDPYA 438
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
+ + S + D P +VDVDL LSA+ANA+R++D +R AA+K+QKTI
Sbjct: 439 SLDEQASDEEEEEEDSEREDDEAKLVPMVVDVDLALSAFANARRYYDQRRFAARKEQKTI 498
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
+S+ KALK+AE+KT QTLKDV+T T I+K RKVYWFEKFYWF+SSENYL
Sbjct: 499 ESSSKALKNAERKTIQTLKDVRTQTTISKVRKVYWFEKFYWFVSSENYL 547
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 50/66 (75%), Positives = 61/66 (92%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+VAWDAKVVT+A+WV ++QVSKTAPTGE+LTTGSFMIRG+KNF PPC L +G+SFLFKLE
Sbjct: 608 SVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFLPPCHLVLGLSFLFKLE 667
Query: 375 ESSISR 380
+SS+ R
Sbjct: 668 DSSVER 673
>gi|312384850|gb|EFR29482.1| hypothetical protein AND_01485 [Anopheles darlingi]
Length = 1109
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 150/305 (49%), Positives = 198/305 (64%), Gaps = 55/305 (18%)
Query: 29 SQGYLVQKKEKRP----NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
S GY++QKKE RP E+Y N+E+ P + Q+QG P +EF+SF +A
Sbjct: 268 SPGYIIQKKEVRPAKEGEKEEYYFTNLEYQPYMYS---------QYQGEPCKEFDSFTSA 318
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDEF+S+ L+ + QER+ALKKL NVK DH R+ L + QL D++KAELI NQ
Sbjct: 319 VDEFYSS-------LETLAQEREALKKLSNVKTDHAKRIEELTKAQLGDRKKAELITRNQ 371
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
+ VD A+LA++ +A Q+SW DI+ VK AQ + DPVAS I+QLKL INHI+L L
Sbjct: 372 DLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDPVASCIRQLKLEINHISLYL----- 426
Query: 205 RSNIKDSWVLHHDNADPFA--DSSETK--------------KPSLVDVDLDLSAYANAKR 248
+DP+A D +E+ +P +VDVDL LSA+ANA+R
Sbjct: 427 --------------SDPYAFLDENESDNEEDSDREEDEEKLEPMVVDVDLALSAFANARR 472
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
++D +R AA+K+QKTI+S+ KALK+AE+KT QTLKDV+T T I+K RKVYWFEKFYWFIS
Sbjct: 473 YYDQRRFAARKEQKTIESSSKALKNAERKTIQTLKDVRTQTTISKVRKVYWFEKFYWFIS 532
Query: 309 SENYL 313
SENYL
Sbjct: 533 SENYL 537
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 50/66 (75%), Positives = 61/66 (92%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+VAWDAKVVT+A+WV ++QVSKTAPTGE+LTTGSFMIRG+KNF PPC L +G+SFLFKLE
Sbjct: 598 SVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFLPPCHLVLGLSFLFKLE 657
Query: 375 ESSISR 380
+SS+ R
Sbjct: 658 DSSVER 663
>gi|351702906|gb|EHB05825.1| Serologically defined colon cancer antigen 1, partial
[Heterocephalus glaber]
Length = 762
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 158/352 (44%), Positives = 198/352 (56%), Gaps = 75/352 (21%)
Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
+E+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI N + VD AI +R +ANQ+
Sbjct: 1 KEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQVVDRAIQVVRSALANQID 60
Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR----------------VHKERSN 207
W +I VK+AQ DPVAS IK+LKL NH+T+LLR V E++
Sbjct: 61 WTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDADGDVSVEKNE 120
Query: 208 IKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQST 267
+ + + KP LVDVDL LSAYANAK+++D KR AAKK QKT+++
Sbjct: 121 TEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAA 180
Query: 268 EKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVT 324
EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WFISSENYL G +++
Sbjct: 181 EKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIV 240
Query: 325 NAW------WVKADQVSKTA-----PTG-------------------------------- 341
+ +V AD T+ PTG
Sbjct: 241 KRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEVGTMALCYSAAWDARVITSAWW 300
Query: 342 -------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 301 VYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 352
>gi|405952718|gb|EKC20496.1| Serologically defined colon cancer antigen 1-like protein
[Crassostrea gigas]
Length = 1084
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 170/452 (37%), Positives = 235/452 (51%), Gaps = 108/452 (23%)
Query: 7 VSSDIHKCGPTLHVLVSNLNVSS----QGYLVQKKEKRP------NSEDYIVANMEFHPM 56
V+ D+ K + + S L S QG +VQKKEKR N+E ++ EFHPM
Sbjct: 217 VTEDMDKLTSAIELAESLLKTLSEEPCQGVIVQKKEKRAAVKEGENAE--LLTYEEFHPM 274
Query: 57 ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVK 116
+ +Q + P F++F +VDEFFS ES K+D+KA+QQE+ ALKKL+N+K
Sbjct: 275 LF---------KQFENKPHSIFDNFNKSVDEFFSQIESQKLDMKALQQEKSALKKLDNIK 325
Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
+DHE R+ L++ Q D K LI N VD A+L +R +ANQ+ W +IE V +AQ
Sbjct: 326 KDHEKRIEGLQKEQETDINKGRLIELNLPLVDQALLIVRSALANQIDWTEIENLVHEAQL 385
Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK------- 229
DPVAS I LKL+ N ITLLLR DP+ S +
Sbjct: 386 QGDPVASCITGLKLDSNMITLLLR-------------------DPYRYSDDEYDDDDDDD 426
Query: 230 --KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQT 287
KP+ +D+D+ +SAY N++++FD K++AAKK+QKTI ++ KALKSAE+KTK+TLK+V T
Sbjct: 427 VLKPTKIDIDISMSAYGNSRKYFDKKKTAAKKEQKTIDASAKALKSAERKTKETLKEVAT 486
Query: 288 MTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA 338
INKARK YWFEKF WFI+SENYL G +++ + +V AD ++
Sbjct: 487 AATINKARKTYWFEKFLWFITSENYLVIGGRDQQQNEMIVKRYLRPGDLYVHADLHGASS 546
Query: 339 -----PTGEFL---------------------------------------TTGSFMIRG- 353
P+GE + +G ++ G
Sbjct: 547 CVLKNPSGEPVPPKSLNEAGTMAICNSVAWDAKVVTSAWWVYHDQVSKTAPSGEYLTTGS 606
Query: 354 -----KKNFFPPCQLAMGISFLFKLEESSISR 380
KKN+ PP L G LFKLE+ SI R
Sbjct: 607 FMIRGKKNYLPPTHLVYGFGLLFKLEDDSIER 638
>gi|47230001|emb|CAG10415.1| unnamed protein product [Tetraodon nigroviridis]
Length = 582
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 157/370 (42%), Positives = 206/370 (55%), Gaps = 51/370 (13%)
Query: 28 SSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
+ +GY++QK EK+P+ E+ + + EFHP + QH P EF+SF
Sbjct: 244 TGKGYIIQKSEKKPSVTPGKPCEELLTYD-EFHPFLFA---------QHSKSPYLEFDSF 293
Query: 82 AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
AVDEFFS ES KID+KA+Q E+ ALKKLENVK+DHE RL AL Q Q +D+ K ELI
Sbjct: 294 DKAVDEFFSKMESQKIDMKALQLEKHALKKLENVKKDHEQRLEALHQAQEIDRIKGELIE 353
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR- 200
N V+ A+ + +ANQ+ W +I VK+AQ DPVA IK+LKL NHITLLL+
Sbjct: 354 MNLAIVERALQVVCGALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHITLLLKN 413
Query: 201 VHKERSNIKDSWVLHHDN---ADPFADSSETKKPSLVDVDLDLSAYANAKRF-------- 249
+ + ++ VL + KP LVDVDL LSAYANAK+F
Sbjct: 414 PYISEDDEQEDDVLEETGRKNKNKKNKKFHKNKPVLVDVDLSLSAYANAKKFEKFLWFIS 473
Query: 250 -FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
+ A + QQ+ ++ L++ E +TL + TM A
Sbjct: 474 AENYLVIAGRDQQQNEMIVKRYLRAGEPIPPRTLTEAGTMAVCYSA-------------- 519
Query: 309 SENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW+AK+VT+AWWV QVSKTAPTGE+LTTGSFMIRGKKN+ PP L MG
Sbjct: 520 --------AWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFG 571
Query: 369 FLFKLEESSI 378
FLFK+ + +
Sbjct: 572 FLFKVTLTGV 581
>gi|324502310|gb|ADY41017.1| Serologically defined colon cancer antigen 1 [Ascaris suum]
Length = 958
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 160/412 (38%), Positives = 217/412 (52%), Gaps = 92/412 (22%)
Query: 29 SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
S G+++ K + R + + + EFHP +Q + ++EFE+F+ VDE+
Sbjct: 245 SHGFIIYKLDTRADGHSF-ESYQEFHPYRF---------KQFESENLREFENFSECVDEY 294
Query: 89 FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
FS ES + D +A+ ER+ALKKLENVKRD + R+ +LE Q+ ++ AELI N + VD
Sbjct: 295 FSKIESQRADQRALNAEREALKKLENVKRDQQERIESLELAQVEKRQMAELIELNSDLVD 354
Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
A+L IR IANQLSWE IE +A DP+AS I L LN N +TL LR
Sbjct: 355 KALLIIRSAIANQLSWEMIEEMRIKASEAGDPIASSIVGLNLNSNEMTLSLR-------- 406
Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTE 268
DP+ D S KK V +D+ LSAY N+++F K++A K+QKTI S+
Sbjct: 407 -----------DPYHDDSSPKK---VPIDIALSAYQNSRKFHSEKKAAVDKKQKTISSSA 452
Query: 269 KALKSAEKKTKQTLKDVQTMTNINKAR------KVYWF--EKFYWFISSEN--------- 311
KALKSA+ K K+TL V+ ++ K+R K +WF + Y I +
Sbjct: 453 KALKSAQLKAKETLATVRAKADVVKSRRQMWFEKFFWFVSSENYLVIGGRDAQQNELLVK 512
Query: 312 -YL------------------------------------GNVA------WDAKVVTNAWW 328
YL G++A W+AKV+ AWW
Sbjct: 513 RYLRTGDIYVHADVRGASSVVIRNKVNGGEIPPKTLNEAGSMAVCYSSSWEAKVIAAAWW 572
Query: 329 VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
V QVS+TAPTGE+LT GSFMIRGKKNF P CQL MG +FKL+E S+ R
Sbjct: 573 VYHHQVSRTAPTGEYLTPGSFMIRGKKNFLPSCQLQMGFGLMFKLDEDSVER 624
>gi|313211850|emb|CBY15998.1| unnamed protein product [Oikopleura dioica]
Length = 699
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 157/446 (35%), Positives = 236/446 (52%), Gaps = 82/446 (18%)
Query: 2 VSNEDVSSD---IHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMIL 58
V+ VS D I +C + +++ QG++ K + D ++ +++ P +
Sbjct: 220 VATASVSQDADEILECLQNCYEFLNSGKTKFQGFISTK------TNDNVLQYVDYQPFLF 273
Query: 59 QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
S + E E F+ AVD+F+ +S K + K +Q E+ A+KKLENVK D
Sbjct: 274 NQSQLDSTI---------ELEKFSLAVDKFYGEIQSQKAEQKMMQAEKSAMKKLENVKLD 324
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
H RL +L+ Q + KA+LI N + VD+A+ +R +A+Q+ WE+IE +++ Q
Sbjct: 325 HMKRLESLKLAQADNVRKAQLIEMNLDLVDSALNQVRSAVASQIGWEEIEDFLEEGQDEG 384
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS----LV 234
DPV+ I++LKL N I ++L + S + + + S+ + S +
Sbjct: 385 DPVSIAIRELKLKTNQIVMMLSEPMYDDSDSSSEEEENPSESEYTKSARVTEGSEIIIYI 444
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
+DL LSA+ NAK F+D KR+AA K+ KTI +++KALKSAEKKT ++LK++QT+ + K
Sbjct: 445 FLDLSLSAFGNAKAFYDSKRAAADKESKTIDASKKALKSAEKKTNESLKNIQTVRQVTKV 504
Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAK---------------------------VVTN-- 325
RK WFEKF+WFISSENYL DA+ +V N
Sbjct: 505 RKQMWFEKFFWFISSENYLVIAGKDAQQNETIVKKYLKNGDVYVHADIHGASSCIVKNID 564
Query: 326 ----------------------AW--------W-VKADQVSKTAPTGEFLTTGSFMIRGK 354
AW W V A+QVSKTAP+GE+L+TGSFMIRGK
Sbjct: 565 PSKPVSPVTLHEVGHAAVCHSAAWNAKVLTSAWWVHANQVSKTAPSGEYLSTGSFMIRGK 624
Query: 355 KNFFPPCQLAMGISFLFKLEESSISR 380
KN+ PP QL +G FLFKL+++ ++R
Sbjct: 625 KNYLPPSQLVLGFGFLFKLDDACVAR 650
>gi|391330989|ref|XP_003739933.1| PREDICTED: nuclear export mediator factor NEMF homolog [Metaseiulus
occidentalis]
Length = 956
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/290 (47%), Positives = 181/290 (62%), Gaps = 36/290 (12%)
Query: 47 IVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER 106
I + EFHP + +Q +G ++ F +F AVD FFS E KIDL+A E+
Sbjct: 267 IASYNEFHPFLF---------KQLEGSRVEHFATFGQAVDAFFSMQEQQKIDLRAHNLEK 317
Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
+A+KKLENVK DHE RL+ALE TQ D EKA LI NN E V+ A+ A+R +A+Q SW++
Sbjct: 318 EAVKKLENVKLDHEKRLNALEGTQRTDLEKAMLIENNLELVEKALYAVRSFVASQYSWDE 377
Query: 167 IEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS 226
I +K+AQ DPVA IK L L+ N +LL ++S
Sbjct: 378 IGHMIKEAQHMGDPVACTIKALHLDRNQFGMLL-----------------------SNSF 414
Query: 227 ETK-KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
E PS+VD+D+DLSAYANA+R+FD+K+ AA+KQQKTI+S+ KALKSA+KKTK+ LK V
Sbjct: 415 ENDLSPSVVDIDIDLSAYANARRYFDMKKHAARKQQKTIESSAKALKSAQKKTKEILKQV 474
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
+ TNI + RK YWFEKF+WFISSENYL G A +V+ + K D
Sbjct: 475 ELTTNIARTRKSYWFEKFFWFISSENYLVIGGRDAQQNEVIVKKYMTKGD 524
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 41/64 (64%), Positives = 52/64 (81%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AW+AKVVT+AWWV QV+KTAP+GE+LT GSFMIRGKKN+ PP L MG F+F+L+
Sbjct: 572 SAAWEAKVVTSAWWVHHHQVTKTAPSGEYLTAGSFMIRGKKNYLPPLYLIMGFGFMFRLD 631
Query: 375 ESSI 378
E S+
Sbjct: 632 EESV 635
>gi|330841435|ref|XP_003292703.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
gi|325077022|gb|EGC30763.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
Length = 1084
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 142/376 (37%), Positives = 209/376 (55%), Gaps = 73/376 (19%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P + +Q++ E++SF AVD+FFS ES K++ + +QQE+ LKKL+ VK D + R+
Sbjct: 284 PFLYKQYESKKYIEYDSFDGAVDQFFSEIESQKLEQQRIQQEQTVLKKLDKVKEDQQRRI 343
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PVA 182
+L + + KAELI N + VD IL IR +AN ++W+ + +K+ ++ N VA
Sbjct: 344 DSLFANEAENVRKAELIEANLQEVDQCILIIRSGVANSMNWDTLNQLLKEEKKKNPYSVA 403
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+ I++LKL N ITL L L+ D+ + + KKP+L+DVD+ LSA
Sbjct: 404 TKIQRLKLESNQITLALT----------DGFLYDDDEEV---NKTNKKPTLIDVDISLSA 450
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ANA++++D K+ + +K QKTI E ALK+AE KT+Q L +V++ ++ + RKV+WFEK
Sbjct: 451 FANARKYYDTKKQSHEKAQKTISQAEFALKAAESKTRQQLSEVKSKHSMIQMRKVFWFEK 510
Query: 303 FYWFISSENYLGNVAWDAK----------------------------------------- 321
F+WFISS+NY+ DA+
Sbjct: 511 FHWFISSDNYIVVSGRDAQQNELLFKKYLEKDDVYVHADIFGSTSCVIKNPNGGEIPPNT 570
Query: 322 ---------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
+NAW WV + QVSKTAP+GE+LTTGSFMIRGKKN+ P QL
Sbjct: 571 LIQAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGEYLTTGSFMIRGKKNYLPHSQL 630
Query: 364 AMGISFLFKLEESSIS 379
MG F+FK++ES I
Sbjct: 631 VMGFGFMFKIDESCIG 646
>gi|341901167|gb|EGT57102.1| hypothetical protein CAEBREN_19463 [Caenorhabditis brenneri]
Length = 920
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 147/393 (37%), Positives = 204/393 (51%), Gaps = 94/393 (23%)
Query: 60 PGSTPSVMRQHQGF-PI---------QEFESFAAAVDEFFSTAESHKIDLKAVQQERDAL 109
P T + ++ +Q F PI +E SF +VDEF+S E+ K + KA+ E+ AL
Sbjct: 261 PSPTSAPIQLYQEFNPIPMPLTSKFTKELPSFCESVDEFYSRIETQKQEQKAINMEKQAL 320
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
KKLENV++D + R+ AL+ TQ + A II NQE V+ A+L IR +ANQ SW+ IE
Sbjct: 321 KKLENVEKDQKDRIEALQMTQEQREHMANRIILNQELVEKALLLIRSALANQFSWQTIEE 380
Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
K A ++ DPVA I K N + L DP+ D +E
Sbjct: 381 MKKTAAKNGDPVAKSIDSFKFESNEFVMTL-------------------GDPYDDEAEIL 421
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
K V +D+ ++A NA+R F K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+ +T
Sbjct: 422 K---VPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVKIVT 478
Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
+ K+RK WFEKF WFISSE Y+ DA+
Sbjct: 479 EVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVV 538
Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
+NAW WV+ +QVS+TAPTGE+L +G
Sbjct: 539 IRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVRPEQVSRTAPTGEYLPSG 598
Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
SFMIRGKKNF PP QL MG+ LF+++E SI R
Sbjct: 599 SFMIRGKKNFMPPSQLVMGLGVLFRMDEESIER 631
>gi|32565397|ref|NP_497411.2| Protein Y82E9BR.18 [Caenorhabditis elegans]
gi|373220360|emb|CCD73050.1| Protein Y82E9BR.18 [Caenorhabditis elegans]
Length = 921
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 153/394 (38%), Positives = 209/394 (53%), Gaps = 94/394 (23%)
Query: 57 ILQPGSTPSVMRQHQGF-PI---------QEFESFAAAVDEFFSTAESHKIDLKAVQQER 106
+L P STP ++ +Q F PI +E SF AVDEF+S E+ K + KAV E+
Sbjct: 262 VLSPISTP--IQIYQDFNPISMEFTAKLSKELSSFCEAVDEFYSRIETQKQEQKAVNMEK 319
Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
ALKKLENV++D + R+ AL+ TQ ++ A II N E V+ A+L IR +ANQ SW+
Sbjct: 320 QALKKLENVEKDQKDRIEALQLTQSQREQMANRIILNTELVEKALLLIRSALANQFSWQT 379
Query: 167 IEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS 226
IE K A + DPVA I K N + L ADP+ D +
Sbjct: 380 IEEMRKTAAGNGDPVAKSIDSFKFENNEFMMSL-------------------ADPYDDEA 420
Query: 227 ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
E K V +D+ L+A NA+R F K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+
Sbjct: 421 EVLK---VPIDISLNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVK 477
Query: 287 TMTNINKARKVYWFEKFYWFISSENY---------------------------------- 312
+ + K+RK WFEKF WFISSE +
Sbjct: 478 IVVEVKKSRKSMWFEKFRWFISSEGFIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGAS 537
Query: 313 ---LGNVAWDAKV--------------VTNAW---------WVKADQVSKTAPTGEFLTT 346
+ N ++DA++ +NAW WV DQVS+TAPTGE+L +
Sbjct: 538 SVVIRNKSFDAEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPDQVSRTAPTGEYLPS 597
Query: 347 GSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
GSFMIRGKKNF PP QL MG+ LF+++E SI R
Sbjct: 598 GSFMIRGKKNFMPPSQLVMGLGILFRMDEESIER 631
>gi|281200297|gb|EFA74518.1| DUF814 family protein [Polysphondylium pallidum PN500]
Length = 1134
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 149/418 (35%), Positives = 221/418 (52%), Gaps = 73/418 (17%)
Query: 22 VSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
+ N+ + S+GY+V + K P D V EF P + +Q+ P QE++SF
Sbjct: 225 IFNVTMQSRGYIVLRPPKTPT--DTTVIYEEFVPFLY---------KQYHSKPNQEYDSF 273
Query: 82 AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
AVD+FFS E+ +++ + + QE+ LKKL+ V+ D + R+ +L + + KA+LI
Sbjct: 274 DQAVDQFFSEIEAQRVEQQRIAQEQTVLKKLDKVREDQQRRIDSLFAAEADNVRKAQLIE 333
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PVASIIKQLKLNINHITLLLR 200
N + VD I I+ + + W + +K+ ++ N VA+II +LKL N I L L
Sbjct: 334 ANLQEVDQCITIIKSGVNASMDWTALSQLLKEEKKKNPYSVANIIHKLKLESNQIQLALN 393
Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQ 260
+ + +D + + KKP+LVDV++ L+AYANA+ ++D K+ A +K
Sbjct: 394 DNYDDDYDEDEDDDEEEEKK--QQKKDKKKPTLVDVNIALTAYANAREYYDSKKHANEKA 451
Query: 261 QKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
KTIQ E A+K+AEKKT+Q L +V+ + + + RKV+WFEKF+WF+SS+NYL DA
Sbjct: 452 NKTIQQAEFAMKAAEKKTRQQLSEVKAKSAMIQMRKVFWFEKFHWFLSSDNYLVISGKDA 511
Query: 321 K--------------------------------------------------VVTNAW--- 327
+ +NAW
Sbjct: 512 QQNEMLFKKYLEKDDIYVHADIFGSTSCVIKNHGGGAIPPNTLIQAGTMTMCYSNAWSAK 571
Query: 328 ------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
WV A+QVSKTAP+GEFLTTGSFMIRGKKN+ P QL MG F+FKL+ES I+
Sbjct: 572 VVTSAYWVYANQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVMGFGFMFKLDESCIA 629
>gi|440797731|gb|ELR18808.1| isoform 2 of serologically defined colon cancer antigen 1 family
protein [Acanthamoeba castellanii str. Neff]
Length = 1138
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 139/369 (37%), Positives = 195/369 (52%), Gaps = 81/369 (21%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P V +Q +G P EF SF AVD FFS A+ ++++K QQE+ VK+DHETR+
Sbjct: 330 PYVYKQFEGRPRLEFPSFDEAVDIFFSKAQEQQVEVKKEQQEK-------TVKKDHETRI 382
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
+AL + + +KA LI N VDAAI ++A + W + VK+A++ DP+A+
Sbjct: 383 AALTKAEEECIKKAHLIETNVSDVDAAIKVTCSELARGMDWAQLTRVVKEAKKAGDPIAN 442
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDL 240
+I L N ITLLL DP AD+S + V+VD+
Sbjct: 443 LIHSLDFANNRITLLL-------------------VDPLEAAADASGAMQK--VEVDIGQ 481
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+AYANA+ F+ + A K KT+ S++ A+K+AEKK ++ +KDV I K RK YWF
Sbjct: 482 TAYANAQEFYAEAKRRAHKHAKTVASSQMAVKAAEKKARREIKDVGVKAAIQKVRKAYWF 541
Query: 301 EKFYWFISSENYLGNVAWDAK--------------------------------------- 321
EKF+WFISSENY+ DA+
Sbjct: 542 EKFHWFISSENYVVISGRDAQQNELIVKRYLRKGDAYVHADLHGAATCVVKNPHPDKPIP 601
Query: 322 -----------VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ T+AWWV +QVSKTAP+GE+L TGSFMIRGKKNF PP QL MG +++
Sbjct: 602 ALTLAEAGSMTIPTSAWWVHPEQVSKTAPSGEYLVTGSFMIRGKKNFLPPSQLVMGFAYM 661
Query: 371 FKLEESSIS 379
FK++ +S++
Sbjct: 662 FKVDPTSVA 670
>gi|393907053|gb|EJD74501.1| serologically defined colon cancer antigen 1 [Loa loa]
Length = 1568
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 153/416 (36%), Positives = 213/416 (51%), Gaps = 98/416 (23%)
Query: 28 SSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
++QG+LV +++ R + E Y EFHP + Q + F+SF+
Sbjct: 244 AAQGFLVYREDARMDGVIVETY----QEFHPFMFS---------QFSDMQTKHFDSFSEC 290
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDEFFS E K D+KA+ E++A+KKL NV +D + R++AL+ QL +E AELI N
Sbjct: 291 VDEFFSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNS 350
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
+ VD A+L IR IANQLSWE IE A +P+A+ I L LN N +TLLLR
Sbjct: 351 DLVDKALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR---- 406
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
DP+ + KK V +D+ LS+Y NA++ K++A +K+QKTI
Sbjct: 407 ---------------DPYRPEIDPKK---VTIDIALSSYQNARKLHTEKKAAQQKEQKTI 448
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKV------YWF--EKFYWFISSEN----- 311
++ KALKS + K K+TL V + + K R+V +WF + Y I +
Sbjct: 449 CASSKALKSTKVKIKETLNVVHSKAEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNE 508
Query: 312 -----YL--GNV----------------------------------------AWDAKVVT 324
YL G++ AW+AKV +
Sbjct: 509 LLVKRYLRPGDIYMHADTRGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTS 568
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
AWWV QVS+TAPTGE+LT GSFMIRGKKN+ P CQL MG +F+L+E S+ R
Sbjct: 569 AAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLER 624
>gi|449681046|ref|XP_002157080.2| PREDICTED: nuclear export mediator factor NEMF-like, partial [Hydra
magnipapillata]
Length = 1467
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 134/297 (45%), Positives = 184/297 (61%), Gaps = 35/297 (11%)
Query: 26 NVSS---QGYLVQKKEKRPNS--EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N+SS +GYL+QKKE N+ E ++ EFHP + QH+ P F S
Sbjct: 699 NISSKEGEGYLIQKKESNANAVGEKDLLIYTEFHPFLYH---------QHKSLPFIHFHS 749
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F VDEFFS ES KIDLKA+QQE+ ALK+LENV+ DHE R+ +L++TQ + +A+LI
Sbjct: 750 FNKCVDEFFSKLESQKIDLKALQQEKAALKRLENVREDHEKRIHSLKETQDKEARRAKLI 809
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N ++ AI+ + IANQL WE+IE +K+A+ DPVA+IIK L+L N IT+ +
Sbjct: 810 ELNLPLIERAIIIVNSAIANQLDWEEIEDLLKEAKLKGDPVANIIKSLQLKTNQITISVN 869
Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF----DLKRSA 256
+E + + +DL L+A+ NA+R++ D +R+A
Sbjct: 870 DEEET-----------------ESDEDDLDEVDIIIDLGLTAFGNARRYYYILHDKRRNA 912
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
A K++KTIQ+++KALKSAE KTK+TLK+VQ INK RK +WFEKFYWFISSENYL
Sbjct: 913 ATKEEKTIQASKKALKSAEYKTKETLKEVQNAKIINKTRKTFWFEKFYWFISSENYL 969
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/64 (73%), Positives = 55/64 (85%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+A+V+T+AWWV +QVSKTAP+GE+LTTGSFMIRGKKNF PP L MG S LFKL+ES
Sbjct: 1032 AWEARVITSAWWVYHNQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFSVLFKLDES 1091
Query: 377 SISR 380
ISR
Sbjct: 1092 CISR 1095
>gi|312082754|ref|XP_003143575.1| serologically defined colon cancer antigen 1 [Loa loa]
Length = 899
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 153/416 (36%), Positives = 213/416 (51%), Gaps = 98/416 (23%)
Query: 28 SSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
++QG+LV +++ R + E Y EFHP + Q + F+SF+
Sbjct: 244 AAQGFLVYREDARMDGVIVETY----QEFHPFMFS---------QFSDMQTKHFDSFSEC 290
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDEFFS E K D+KA+ E++A+KKL NV +D + R++AL+ QL +E AELI N
Sbjct: 291 VDEFFSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNS 350
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
+ VD A+L IR IANQLSWE IE A +P+A+ I L LN N +TLLLR
Sbjct: 351 DLVDKALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR---- 406
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
DP+ + KK V +D+ LS+Y NA++ K++A +K+QKTI
Sbjct: 407 ---------------DPYRPEIDPKK---VTIDIALSSYQNARKLHTEKKAAQQKEQKTI 448
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKV------YWF--EKFYWFISSEN----- 311
++ KALKS + K K+TL V + + K R+V +WF + Y I +
Sbjct: 449 CASSKALKSTKVKIKETLNVVHSKAEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNE 508
Query: 312 -----YL--GNV----------------------------------------AWDAKVVT 324
YL G++ AW+AKV +
Sbjct: 509 LLVKRYLRPGDIYMHADTRGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTS 568
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
AWWV QVS+TAPTGE+LT GSFMIRGKKN+ P CQL MG +F+L+E S+ R
Sbjct: 569 AAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLER 624
>gi|328723949|ref|XP_001945685.2| PREDICTED: serologically defined colon cancer antigen 1 homolog
[Acyrthosiphon pisum]
Length = 987
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 134/323 (41%), Positives = 193/323 (59%), Gaps = 36/323 (11%)
Query: 6 DVSSDIHK---CGPTLHVLVSNLNVSSQGYLVQKKEKR--PNS-EDYIVANMEFHPMILQ 59
++ +DI K C + N+ +G+++QK +++ P+ + N E+HP +
Sbjct: 220 NIDTDIQKLMNCFCIAEKFLDNITTLKEGFIIQKIDQQLLPDGIMKELCTNQEYHPFLFA 279
Query: 60 PGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
QHQ P + +ESF AVDEF+S ES K D+K +QQE+ A+KKL+N+ +DH
Sbjct: 280 ---------QHQKLPSKTYESFNEAVDEFYSNLESQKYDVKCMQQEKGAVKKLQNIVKDH 330
Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
E RL L+ TQ K KAELI NN + VD I +RQ +A QL W++I ++Q +D
Sbjct: 331 EERLKKLQDTQDEHKFKAELITNNLDLVDNTIQFVRQAVAKQLHWDEIWDMIRQLNFEDD 390
Query: 180 PVA-SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS-LVDVD 237
+I+K LKL++NHITL L DP+ + ++ ++ S L+D+D
Sbjct: 391 GCTYAIVKNLKLSVNHITLQL-------------------FDPYNEENKNEENSQLIDID 431
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
L SA+ NA+R++ K+ +A K++KTI S+ LK AEKKTKQTLKD+Q + +INK RK
Sbjct: 432 LGQSAFGNAERYYGSKKQSAIKEKKTIDSSSTVLKMAEKKTKQTLKDMQVVASINKVRKT 491
Query: 298 YWFEKFYWFISSENYLGNVAWDA 320
YWFEKFYWFISSENYL DA
Sbjct: 492 YWFEKFYWFISSENYLVIAGRDA 514
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/66 (68%), Positives = 55/66 (83%), Gaps = 1/66 (1%)
Query: 316 VAWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
V+W K+ + NA+WVK +QVSKTAPTGE+LTTGSFMIRGKKN+ P L +G+SFLFKLE
Sbjct: 569 VSWTMKINLQNAFWVKPEQVSKTAPTGEYLTTGSFMIRGKKNYLPATHLILGLSFLFKLE 628
Query: 375 ESSISR 380
+SSI R
Sbjct: 629 DSSIPR 634
>gi|268571229|ref|XP_002640975.1| Hypothetical protein CBG11722 [Caenorhabditis briggsae]
Length = 894
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 145/391 (37%), Positives = 200/391 (51%), Gaps = 94/391 (24%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
EF+P+ S P ++ + E SF +VDEF+S E+ K + KAV E+ ALKK
Sbjct: 248 EFNPI-----SMPLTLKLQK-----ELRSFCESVDEFYSRIETQKQEQKAVNMEKQALKK 297
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LENV++D + R+ AL+ TQ + A II NQE V+ A+L IR +ANQ SW+ IE
Sbjct: 298 LENVEKDQKERIEALQLTQEQREHMANRIILNQELVEKALLLIRSALANQFSWQTIEEMR 357
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
K A + DPVA I K N + L DP+ + +E K
Sbjct: 358 KSAAANGDPVAKSIDSFKFENNEFFMKL-------------------GDPYDEEAELLK- 397
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
V +D+ ++A NA+R F K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+ +T +
Sbjct: 398 --VPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKCTLEQVKIVTEV 455
Query: 292 NKARKVYWFEKFYWFISSENYLGNVAWDAK------------------------------ 321
K+RK WFEKF WFISSE Y+ DA+
Sbjct: 456 KKSRKTMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVIIR 515
Query: 322 -----------------------VVTNAW---------WVKADQVSKTAPTGEFLTTGSF 349
+NAW WV QV++TAPTGE+L +GSF
Sbjct: 516 NKSFEESMEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPSQVTRTAPTGEYLPSGSF 575
Query: 350 MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
MIRGKKNF PP QL MG+ LF+++E SI R
Sbjct: 576 MIRGKKNFMPPSQLVMGLGILFRMDEESIER 606
>gi|66804841|ref|XP_636153.1| DUF814 family protein [Dictyostelium discoideum AX4]
gi|60464500|gb|EAL62645.1| DUF814 family protein [Dictyostelium discoideum AX4]
Length = 1268
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 141/397 (35%), Positives = 210/397 (52%), Gaps = 73/397 (18%)
Query: 56 MILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENV 115
M++ P + +Q++ EFESF AAVD+FFS ES K++ + + QE+ LKKL+ V
Sbjct: 300 MVIYEEFVPYLYKQYELKKFIEFESFDAAVDQFFSEIESQKVEQQRIAQEQVVLKKLDKV 359
Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
K D + R+ +L ++ + KA+LI N + VD IL IR +A+ + WE + +K+ +
Sbjct: 360 KEDQQRRIDSLFANEVENIRKAQLIEANLQEVDQCILIIRSGVASSMDWETLNQLLKEEK 419
Query: 176 RHND-PVASIIKQLKLNINHITLLL-------------RVHKERSNIKDSWVLHHDNADP 221
+ N VA+ I +LKL N ITL L E S+ + +
Sbjct: 420 KKNPYSVATKIHRLKLESNQITLSLTDNFLYDDNDGDDDDDDEESDEESDEEDQNTKKSI 479
Query: 222 FADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
+ +KP+L+DVD+ LSA+ANA++++D K+ + +K QKTI E ALK+AEKKT+Q
Sbjct: 480 KKSKTSNQKPNLIDVDISLSAFANARKYYDTKKQSHEKAQKTISQAEFALKAAEKKTRQQ 539
Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-------------------- 321
L + ++ ++ RK++WFEKF+WFISS+NY+ DA+
Sbjct: 540 LSETKSKNSMIAMRKIFWFEKFHWFISSDNYIVVSGRDAQQNELLYKKYLEKDDIYVHAD 599
Query: 322 ------------------------------VVTNAW---------WVKADQVSKTAPTGE 342
+NAW WV + QVSKTAP+GE
Sbjct: 600 IFGSTSCVIKNPNGGEIPPNTLIQAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGE 659
Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
FLTTGSFMIRGKKN+ P QL MG F+FK+++S +
Sbjct: 660 FLTTGSFMIRGKKNYLPHSQLVMGFGFMFKIDDSCLG 696
>gi|225432064|ref|XP_002273922.1| PREDICTED: nuclear export mediator factor Nemf-like [Vitis
vinifera]
Length = 1110
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 144/377 (38%), Positives = 199/377 (52%), Gaps = 76/377 (20%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ Q + +FE+F AA+DEF+S ES + + + +E A++KL ++ D E R
Sbjct: 333 CPILLNQFKSREFVKFETFDAALDEFYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENR 392
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L++ + AELI N E VDAAILA+R +AN ++WED+ VK+ ++ +PVA
Sbjct: 393 VHTLKKEVDHCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEEKKSGNPVA 452
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I +L L N +TLLL +N D D +T V+VDL LSA
Sbjct: 453 GLIDKLYLERNCMTLLLS----------------NNLDEMDDDEKTLPVDKVEVDLALSA 496
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ANA+R+++ K+ KQ+KT+ + EKA K+AEKKT+ L +T+ I+ RKV+WFEK
Sbjct: 497 HANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQLSQEKTVATISHMRKVHWFEK 556
Query: 303 FYWFISSENYLGNVAWDAK---------------------------VVTN---------- 325
F WFISSENYL DA+ V+ N
Sbjct: 557 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLHGASSTVIKNHKPEHPVPPL 616
Query: 326 --------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
AW WV QVSKTAPTGE+LT GSFMIRGKKNF PP
Sbjct: 617 TLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 676
Query: 363 LAMGISFLFKLEESSIS 379
L MG LF+L+ESS+
Sbjct: 677 LIMGFGLLFRLDESSLG 693
>gi|168034467|ref|XP_001769734.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679083|gb|EDQ65535.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 147/388 (37%), Positives = 200/388 (51%), Gaps = 85/388 (21%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
EF P+ L+ +VMR E++ AA+DEFFS E + + + QE A K
Sbjct: 315 EFSPLHLKQFDDRTVMR---------METYDAALDEFFSKIEGQRAEQQRKAQEDSAFSK 365
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ ++ D R+ L+Q AELI N E VD AILA+R +A+ + W+D+ +
Sbjct: 366 LDKIRADQTQRVEVLKQEVDQTVRMAELIEYNLEDVDNAILAVRSTVASGMDWKDLARMI 425
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
K+ ++ +PVA +I L+L N ITLLL +N D D +T+
Sbjct: 426 KEEKKAGNPVAGLIHSLQLEKNQITLLL----------------SNNLDDMDDDEKTQPV 469
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
S VDVD+ LSA+ANA+R+F+ K+ A KQ KT + EKA K+AEKKT Q L +++ I
Sbjct: 470 SKVDVDIGLSAHANARRWFEQKKKHAVKQDKTKAAHEKAFKAAEKKTLQQLAQAKSVAAI 529
Query: 292 NKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------VVT 324
+ RKV+WFEKF WF+SSENYL DA+ V+
Sbjct: 530 SHMRKVHWFEKFNWFVSSENYLIISGRDAQQNELVVKRYMRKGDLYVHADLHGASSTVIQ 589
Query: 325 N------------------------AW--------W-VKADQVSKTAPTGEFLTTGSFMI 351
N AW W V+A QVSKTAPTGE+LT GSFM+
Sbjct: 590 NHNPLYPIPPLTINQAGVFTVCRSQAWDSKIVTSAWWVEAHQVSKTAPTGEYLTVGSFMV 649
Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
RGKKNF PP L MG LF+L++SSI+
Sbjct: 650 RGKKNFLPPNPLVMGFGVLFRLDDSSIA 677
>gi|328864957|gb|EGG13343.1| DUF814 family protein [Dictyostelium fasciculatum]
Length = 1244
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 140/384 (36%), Positives = 209/384 (54%), Gaps = 67/384 (17%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P + +Q+Q E++SF AVD+FFS ES K++ + + QE+ LKKL+ V+ D + R
Sbjct: 348 VPYLYKQYQDKKYLEYDSFDLAVDQFFSEIESQKVEQQRMSQEQTVLKKLDKVREDQQRR 407
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PV 181
+ +L ++ + +KA+LI +N + VD IL IR +A + W ++ +K+ ++ N V
Sbjct: 408 IDSLYASEGENIKKAQLIESNLQDVDQCILIIRSGVAASMDWGNLNQLLKEEKKKNPYSV 467
Query: 182 ASIIKQLKLNINHITL-LLRVH------KERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
A+ I +LKL+ N ITL L +H +E + + KKP +
Sbjct: 468 ANKIHKLKLDTNQITLSLTDLHLDDDEDEEDEDENSDDDSEDEEKKKKNQKKNAKKPVFI 527
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
DVD+ LSAYANA+ F+D K+ + +K +KTIQ + ALK+AEKK +Q L +V+T +++ +
Sbjct: 528 DVDISLSAYANARNFYDSKKQSHEKAEKTIQQADFALKAAEKKARQQLSEVKTKSSMQQM 587
Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAK--------------------------------- 321
RKV+WFEKF+WFISS+NY+ DA+
Sbjct: 588 RKVFWFEKFHWFISSDNYIVISGKDAQQNELLFKKYLDKDDVYVHADIFGSTSCVIKNPK 647
Query: 322 -----------------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKK 355
+NAW WV + QVSKTAP+GEFLTTGSFMIRGKK
Sbjct: 648 GGEIPPNTLIQAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGEFLTTGSFMIRGKK 707
Query: 356 NFFPPCQLAMGISFLFKLEESSIS 379
N+ P QL MG F+FK+++S I+
Sbjct: 708 NYLPHSQLVMGFGFMFKIDDSCIA 731
>gi|167516076|ref|XP_001742379.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779003|gb|EDQ92617.1| predicted protein [Monosiga brevicollis MX1]
Length = 1051
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 149/401 (37%), Positives = 205/401 (51%), Gaps = 75/401 (18%)
Query: 55 PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
P+++ P +M+Q + P SF A+D +FS E+ K+ ++A+QQE ALKKLE
Sbjct: 268 PVLVLDDVIPFMMKQFEDRPHIHLPSFDRAIDRYFSELETQKLQMRAMQQEAAALKKLEA 327
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
VK HE + Q ++ KA+++ N E VD AI IR +AN+L W +I VK+A
Sbjct: 328 VKASHEKHVEGYRLAQEANERKAQVLEANLEQVDRAIEIIRSMVANKLDWVEIAELVKEA 387
Query: 175 QRHNDPVASIIKQLKLNINHITLLL---RVHKERSNIKDSWVLHHDN------------A 219
Q+ DP A II LKL+ NH+T+ L H E S S + A
Sbjct: 388 QQQGDPDARIIDGLKLDKNHMTIRLPNPEAHAESSESDSSSASDSEEEEEEEEQKAIAAA 447
Query: 220 DPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
+S P L +D+DL L+AYANA + K+ +A K+QK +TE A++SAE+KT
Sbjct: 448 SKKRGTSSATDPFLTIDLDLALTAYANACNMYQHKKISAVKEQKARDATELAIQSAERKT 507
Query: 279 KQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL--------------------GNV-- 316
+Q L+ T +NK RK+YWFEKF WFISSENYL G+V
Sbjct: 508 QQQLQQNNVTTAVNKQRKIYWFEKFLWFISSENYLVIGGRDRQQNEILVRRYLKKGDVYV 567
Query: 317 ---------------------------AWDAKVVTNAWW----------VKADQVSKTAP 339
A V+ + W V DQVSKTAP
Sbjct: 568 HADLHGAASVIVKNPRGGDVPPITLQEAGHMAVIYSGSWEARMPTSAWWVHHDQVSKTAP 627
Query: 340 TGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
GE+L+TGSFMIRGKKN+ P +L MG + LFK++E S++R
Sbjct: 628 AGEYLSTGSFMIRGKKNYLPKVELVMGFAILFKVDEGSVAR 668
>gi|308480173|ref|XP_003102294.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
gi|308262220|gb|EFP06173.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
Length = 917
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 143/394 (36%), Positives = 200/394 (50%), Gaps = 92/394 (23%)
Query: 57 ILQPGSTP-SVMRQHQGFPI-------QEFESFAAAVDEFFSTAESHKIDLKAVQQERDA 108
IL P S P + + P+ +E SF +VDEF+S E+ K + KAV E+ A
Sbjct: 256 ILSPTSQPIQLYNEFNPIPMPFTSKLQKELPSFCESVDEFYSRIETQKQEQKAVNMEKQA 315
Query: 109 LKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIE 168
LKKLENV++D + R+ AL+ TQ + A II NQ+ V+ A+L IR +ANQ SW+ IE
Sbjct: 316 LKKLENVEKDQKERIEALQLTQEQREHMANRIILNQDLVEKALLLIRSALANQFSWQTIE 375
Query: 169 ARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSET 228
K A + D VA I + N + L DP+ + +E
Sbjct: 376 EMRKNAAMNGDLVAKSIDSFRFENNEFFMNL-------------------GDPYDEEAEL 416
Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM 288
K V +D+ ++A NA+R F K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+ +
Sbjct: 417 LK---VPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVKIV 473
Query: 289 TNINKARKVYWFEKFYWFISSENYLGNVAWDAK--------------------------- 321
T + K+RK WFEKF WFISSE Y+ DA+
Sbjct: 474 TEVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSV 533
Query: 322 --------------------------VVTNAW---------WVKADQVSKTAPTGEFLTT 346
+NAW WV +QVS+TAPTGE+L +
Sbjct: 534 IIRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPNQVSRTAPTGEYLPS 593
Query: 347 GSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
GSFMIRGKKNF PP QL MG+ LF++++ SI R
Sbjct: 594 GSFMIRGKKNFMPPSQLVMGLGVLFRMDDESIER 627
>gi|256080624|ref|XP_002576579.1| hypothetical protein [Schistosoma mansoni]
gi|353229334|emb|CCD75505.1| hypothetical protein Smp_052790 [Schistosoma mansoni]
Length = 1009
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 139/424 (32%), Positives = 217/424 (51%), Gaps = 81/424 (19%)
Query: 18 LHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
L + + + GY+ K + +S++ + EFHP + + Q++ P
Sbjct: 268 LEPFLCDRQATPHGYIFGKSYQ--SSDEGLRNYEEFHPFMFE---------QYRDKPHLA 316
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F+SF AVD +FS ES K + + E+ A +K+EN+K+D E RL L+ Q +D KA
Sbjct: 317 FDSFNKAVDAYFSKIESQKTLEQISRNEQKASRKVENIKKDQERRLMLLKTEQELDMRKA 376
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
L+ N+ VD I+ I ++NQ+ W+++E V+ A++ +DP+A I +LKL + +
Sbjct: 377 YLLEANRRLVDNIIIMINHALSNQIDWKELELIVEDAKQRDDPLACHIVELKLQTSQAVI 436
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
L+ E S+ D ++ N D + + V VD+D++A NA++++D KR+A+
Sbjct: 437 RLKDPFESSSDVDETLVRSGNKDEYTE---------VVVDIDVNALTNARKYYDKKRAAS 487
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVA 317
KK++KTI + K LKSA + +K +T+ I + RK WFEKF+WFISSENYL
Sbjct: 488 KKEEKTINVSRKVLKSAIHNAEIKMKTAKTVAQITEVRKPMWFEKFFWFISSENYLVVAG 547
Query: 318 WDAK----------------------------------------------------VVTN 325
D++ V+++
Sbjct: 548 HDSQQNEVLVKRYLKPGDLFVHADIHGASTVIIKARHLTSEEVDSPNHQEAGNMAVVLSS 607
Query: 326 AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW WV DQVSKTAP+GE+LT+G+FMIRGKKN+ PPC G +FKL E
Sbjct: 608 AWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGAFMIRGKKNYLPPCPFDYGFGIMFKLHED 667
Query: 377 SISR 380
SI++
Sbjct: 668 SIAK 671
>gi|344257308|gb|EGW13412.1| Serologically defined colon cancer antigen 1-like [Cricetulus
griseus]
Length = 554
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 171/311 (54%), Gaps = 73/311 (23%)
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 2 NLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNP 61
Query: 201 --VHKERSNIKDSWVLHHD----------NADPFADSSETKKPSLVDVDLDLSAYANAKR 248
+ +E + D+ V D + + KP LVDVDL LSAYANAK+
Sbjct: 62 YLLSEEEDDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 121
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WFIS
Sbjct: 122 YYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 181
Query: 309 SENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTGE------------ 342
SENYL G +++ + +V AD T+ PTGE
Sbjct: 182 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEAGT 241
Query: 343 ---------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISF 369
+LTTGSFMIRGKKNF PP L MG SF
Sbjct: 242 MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 301
Query: 370 LFKLEESSISR 380
LFK++ES I R
Sbjct: 302 LFKVDESCIWR 312
>gi|356529076|ref|XP_003533123.1| PREDICTED: nuclear export mediator factor Nemf-like [Glycine max]
Length = 1131
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 147/407 (36%), Positives = 211/407 (51%), Gaps = 87/407 (21%)
Query: 44 EDYIVA---NMEFHPMILQPGS--------TPSVMRQHQGFPIQEFESFAAAVDEFFSTA 92
E YI+ NM I QPGS P ++ Q + +FE+F AA+DEF+S
Sbjct: 288 EGYILMQNKNMGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDEFYSKI 347
Query: 93 ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
ES + + + +E A +KL +++D E R+ AL + + AELI N E VDAAIL
Sbjct: 348 ESQRSEQQQKAKENSASQKLNRIRQDQENRVHALRKEADHCVKMAELIEYNLEDVDAAIL 407
Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSW 212
A+R +A ++W+D+ VK+ ++ +PVA +I +L L+ N +TLLL
Sbjct: 408 AVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLDRNCMTLLL------------- 454
Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
+N D D +T V+VDL LSA+ANA+R+++ K+ KQ KT+ + EKA K
Sbjct: 455 ---SNNLDEMDDDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQGKTVTAHEKAFK 511
Query: 273 SAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISSEN----------YL- 313
+AE+KT+ L +T+ +I+ RKV+ WF + Y IS + Y+
Sbjct: 512 AAERKTRLQLNQEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 571
Query: 314 -GNV----------------------------------------AWDAKVVTNAWWVKAD 332
G++ AWD+K+VT+AWWV
Sbjct: 572 KGDLYIHADLHGASSTVIKNHKPAQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPH 631
Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ESS+
Sbjct: 632 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 678
>gi|302768961|ref|XP_002967900.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
gi|300164638|gb|EFJ31247.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
Length = 1083
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 149/421 (35%), Positives = 218/421 (51%), Gaps = 89/421 (21%)
Query: 22 VSNLNVSSQGYLV---QKKEKRPNSEDYIVANM-EFHPMILQPGSTPSVMRQHQGFPIQE 77
V+ + + +GY+ K K+ N+E EF P++L+ S R++ +
Sbjct: 281 VTTGDFTPEGYITFHPNKTAKKKNAESAEEKMFDEFSPLLLKQ----SAHREYV-----K 331
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F++F AA+DEFFS E ++D + QE A KLE ++ D +R+ +L++ A
Sbjct: 332 FDTFDAALDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRADQRSRVESLKREVDQAVHTA 391
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
ELI N VD AI A+R +AN + W+D+ +K+ ++ +PVA +I L+L NHITL
Sbjct: 392 ELIEYNLADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHITL 451
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
LL +N D D +TK V+VDL LSA+ANA+++FD+K+ A
Sbjct: 452 LL----------------SNNLDDMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQA 495
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISS 309
KQ+KT+ + EKA K+AE+KT+Q L + + I+ RKV+ WF + Y IS
Sbjct: 496 LKQEKTVAAHEKAFKAAERKTQQQLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISG 555
Query: 310 EN----------YL--GNV----------------------------------------A 317
+ Y+ G++ A
Sbjct: 556 RDAQQNEQIVKRYMKKGDLYVHADLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQA 615
Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
WD+K++T+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG F+L+ESS
Sbjct: 616 WDSKIITSAWWVYDHQVSKTAPTGEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESS 675
Query: 378 I 378
I
Sbjct: 676 I 676
>gi|302761200|ref|XP_002964022.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
gi|300167751|gb|EFJ34355.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
Length = 1052
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 147/421 (34%), Positives = 216/421 (51%), Gaps = 89/421 (21%)
Query: 22 VSNLNVSSQGYLV---QKKEKRPNSEDYIVANM-EFHPMILQPGSTPSVMRQHQGFPIQE 77
V+ + + +GY+ K K+ N+E EF P++L+ + ++
Sbjct: 267 VTTGDFTPEGYITFHPNKTAKKKNAESAEEKMFDEFSPLLLKQSAHREYIK--------- 317
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F++F AA+DEFFS E ++D + QE A KLE ++ D +R+ +L++ A
Sbjct: 318 FDTFDAALDEFFSKIEGQRLDQQRKTQEDSAFSKLEKIRADQRSRVESLKREVDQAVHTA 377
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
ELI N VD AI A+R +AN + W+D+ +K+ ++ +PVA +I L+L NHITL
Sbjct: 378 ELIEYNLADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHITL 437
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
LL +N D D +TK V+VDL LSA+ANA+++FD+K+ A
Sbjct: 438 LL----------------SNNLDDMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQA 481
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISS 309
KQ+KT+ + EKA K+AE+KT+Q L + + I+ RKV+ WF + Y IS
Sbjct: 482 LKQEKTVAAHEKAFKAAERKTQQQLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISG 541
Query: 310 EN----------YL--GNV----------------------------------------A 317
+ Y+ G++ A
Sbjct: 542 RDAQQNEQIVKRYMKKGDLYVHADLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQA 601
Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
WD+K++T+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG F+L+ESS
Sbjct: 602 WDSKIITSAWWVYDHQVSKTAPTGEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESS 661
Query: 378 I 378
I
Sbjct: 662 I 662
>gi|260803888|ref|XP_002596821.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
gi|229282081|gb|EEN52833.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
Length = 834
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 173/288 (60%), Gaps = 38/288 (13%)
Query: 30 QGYLVQKKEKRPNSE----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
QGY+VQK+EK+P + + ++ EFHP +QH+ P EF SF AV
Sbjct: 88 QGYIVQKREKKPKQDGGPAEELLTYAEFHPFQF---------KQHEKSPCVEFPSFNKAV 138
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEFFS ES ++DLKA+QQE+ A+KKLENVK+DHE RL L++ Q DK KA+LI N +
Sbjct: 139 DEFFSQLESQRLDLKALQQEKVAIKKLENVKKDHERRLETLQKVQDEDKHKAQLIELNLD 198
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AIL +R IANQ+ W +I VK+AQ DPVAS IK LKL+ NHITL+LR
Sbjct: 199 LVDKAILVVRSAIANQIDWTEIWDIVKEAQAQGDPVASTIKSLKLDSNHITLVLR--NPF 256
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
S + D A ++S + +P +D+DL LSAYANAK+++D KR AAKK+QKTI
Sbjct: 257 SGYESDSEGDDDKAGVGREAS-SDRPMKIDIDLALSAYANAKKYYDQKRHAAKKEQKTID 315
Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
++EK + V FEKF WFI+SENYL
Sbjct: 316 ASEKCHEFV----------------------VERFEKFLWFITSENYL 341
>gi|410695644|gb|AFV74962.1| serologically defined colon cancer antigen1-like protein, partial
[Apis cerana]
Length = 273
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 116/234 (49%), Positives = 157/234 (67%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P +EF SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+A +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKALLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|410695646|gb|AFV74963.1| serologically defined colon cancer antigen1-like protein, partial
[Apis florea]
Length = 273
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 115/234 (49%), Positives = 156/234 (66%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P +EF SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|387175444|gb|AFJ66839.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175452|gb|AFJ66843.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175454|gb|AFJ66844.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175462|gb|AFJ66848.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175464|gb|AFJ66849.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175466|gb|AFJ66850.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175470|gb|AFJ66852.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175474|gb|AFJ66854.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175476|gb|AFJ66855.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175478|gb|AFJ66856.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175480|gb|AFJ66857.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175482|gb|AFJ66858.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175484|gb|AFJ66859.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175486|gb|AFJ66860.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175494|gb|AFJ66864.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175498|gb|AFJ66866.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175506|gb|AFJ66870.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175508|gb|AFJ66871.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175510|gb|AFJ66872.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175514|gb|AFJ66874.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175516|gb|AFJ66875.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175518|gb|AFJ66876.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175520|gb|AFJ66877.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175522|gb|AFJ66878.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175528|gb|AFJ66881.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
Length = 273
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 115/234 (49%), Positives = 156/234 (66%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P +EF SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|387175512|gb|AFJ66873.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175524|gb|AFJ66879.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175526|gb|AFJ66880.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
Length = 273
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 115/234 (49%), Positives = 156/234 (66%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P +EF SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|387175502|gb|AFJ66868.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
Length = 273
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 115/234 (49%), Positives = 155/234 (66%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P +EF SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKXKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|390356696|ref|XP_001200483.2| PREDICTED: nuclear export mediator factor Nemf-like
[Strongylocentrotus purpuratus]
Length = 334
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 158/221 (71%), Gaps = 12/221 (5%)
Query: 93 ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
ES K+D+K +QQER ALKKL+NVK+DHE R+S+L+Q Q ++++K LI N V+ A+
Sbjct: 2 ESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLVEQALR 61
Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSW 212
+R +ANQ+ W++I++ +K+AQ DPVA I+ L+L+ NH +LLR D +
Sbjct: 62 VVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIRSLRLDTNHFQMLLR---------DPY 112
Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
+D+AD + +P LVD+D+ SAYANA+++F K+++ KK+QKT++S+ KA+K
Sbjct: 113 -KQYDDAD--EGEEDGARPMLVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSSKAIK 169
Query: 273 SAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
SAEKKT Q LKDV T+ +INK+RK YWFEK+YW ISSENY+
Sbjct: 170 SAEKKTMQALKDVATVASINKSRKTYWFEKYYWCISSENYI 210
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 46/59 (77%), Positives = 52/59 (88%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
+VAWDAKV+T+AWWV+ DQVSKTAPTGEFLTTGSFM+RGKKNF PP QL MG FL K+
Sbjct: 271 SVAWDAKVITSAWWVRHDQVSKTAPTGEFLTTGSFMVRGKKNFLPPTQLVMGFGFLMKV 329
>gi|387175434|gb|AFJ66834.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175436|gb|AFJ66835.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175438|gb|AFJ66836.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175440|gb|AFJ66837.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175442|gb|AFJ66838.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175448|gb|AFJ66841.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175450|gb|AFJ66842.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175488|gb|AFJ66861.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175496|gb|AFJ66865.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
Length = 273
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/234 (48%), Positives = 156/234 (66%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P ++F SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKKFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|387175446|gb|AFJ66840.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175458|gb|AFJ66846.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175460|gb|AFJ66847.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175468|gb|AFJ66851.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175472|gb|AFJ66853.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175490|gb|AFJ66862.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175492|gb|AFJ66863.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175500|gb|AFJ66867.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
gi|387175504|gb|AFJ66869.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
Length = 273
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/234 (48%), Positives = 155/234 (66%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P + F SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKXFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+ALKKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|387175456|gb|AFJ66845.1| serologically-defined colon cancer antigen 1-like protein, partial
[Apis mellifera]
Length = 273
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/234 (48%), Positives = 155/234 (66%), Gaps = 33/234 (14%)
Query: 29 SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++QKKE +P ++ D+I N+EFHP + + Q++ P +EF SF AV
Sbjct: 68 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DE+FST E K+DLKA+QQER+A KKLENVK+DH+ RL LE+TQ +DK+KAELI NQ
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREAXKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AILAI+ +ANQ++W DI+ +K+A+ DPVAS IKQLKL NHI+LLL
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233
Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
DP+ DS E KP L+D+DL +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273
>gi|242085896|ref|XP_002443373.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
gi|241944066|gb|EES17211.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
Length = 1158
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 138/377 (36%), Positives = 198/377 (52%), Gaps = 76/377 (20%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ Q + EF +F AA+DEF+S ES K++ + +E A ++L +K D E R
Sbjct: 352 CPILLNQFKSREYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENR 411
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L + + AELI N E VDAAILA+R +AN++SWE + +K+ ++ +PVA
Sbjct: 412 VHTLRKEVDHCVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVA 471
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I +L N ITLLL +N D + +T V+VD+ LSA
Sbjct: 472 GLIDKLNFERNCITLLL----------------SNNLDDMDEDEKTAPVEKVEVDIALSA 515
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY---- 298
+ANA+R++++K+ KQ+KTI + EKA K+AEKKT+ L +T+ I RKV+
Sbjct: 516 HANARRWYEMKKKQESKQEKTITAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEK 575
Query: 299 --WF--EKFYWFISSEN----------YL--GNV-------------------------- 316
WF + Y IS + Y+ G++
Sbjct: 576 FNWFISSENYLIISGRDAQQNELIVKRYMSKGDLYVHAELHGASSTIIKNHKPDTPIPPL 635
Query: 317 --------------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP
Sbjct: 636 TLNQAGCFTVCHSKAWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 695
Query: 363 LAMGISFLFKLEESSIS 379
L MG LF+L+ESS++
Sbjct: 696 LVMGFGILFRLDESSLA 712
>gi|115489110|ref|NP_001067042.1| Os12g0564600 [Oryza sativa Japonica Group]
gi|108862839|gb|ABA98970.2| zinc knuckle family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113649549|dbj|BAF30061.1| Os12g0564600 [Oryza sativa Japonica Group]
Length = 1159
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 137/376 (36%), Positives = 198/376 (52%), Gaps = 76/376 (20%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P ++ Q + EFE+F AA+DEF+S ES +++ + +E A ++L +K D E R+
Sbjct: 354 PVLLNQFKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRV 413
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L + + AELI N E VDAAI+A+R +AN +SW+ + +K+ ++ +PVA
Sbjct: 414 HTLRKEVDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAG 473
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
+I +L N ITLLL +N D + +T V+VDL LSA+
Sbjct: 474 LIDKLSFERNCITLLL----------------SNNLDDMDEEEKTAPVEKVEVDLSLSAH 517
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY----- 298
ANA+R+++LK+ KQ+KT+ + EKA K+AEKKT+ L +T+ I RKV+
Sbjct: 518 ANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKF 577
Query: 299 -WF--EKFYWFISSEN----------YL--GNV--------------------------- 316
WF + Y IS + Y+ G++
Sbjct: 578 NWFISSENYLIISGRDAQQNELIVKRYMSKGDLYVHAELHGASSTIIKNHKPDNPIPPLT 637
Query: 317 -------------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L
Sbjct: 638 LNQAGSFTVCHSKAWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 697
Query: 364 AMGISFLFKLEESSIS 379
MG LF+L+ESS++
Sbjct: 698 VMGFGILFRLDESSLA 713
>gi|453084374|gb|EMF12418.1| hypothetical protein SEPMUDRAFT_149103 [Mycosphaerella populorum
SO2202]
Length = 1130
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 141/400 (35%), Positives = 211/400 (52%), Gaps = 84/400 (21%)
Query: 52 EFHPM-ILQPGSTPS-VMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDAL 109
+FHP Q P+ V +H+GF I AVDEFFS+ E K++ K +++ A
Sbjct: 311 DFHPFKPAQLAEDPANVFLEHEGFNI--------AVDEFFSSIEGQKLESKLAEKQESAR 362
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
K++E+ K++ E R++ L+Q Q + KA+ I N E V+ A A+ IA + WEDI
Sbjct: 363 KRIEHAKKEQEQRINGLQQVQELHVRKAQAIEANVERVEEATAAVNGLIAQGMDWEDIGR 422
Query: 170 RVKQAQRHNDPVASIIK-QLKLNINHITLLLRV-----HKERSNIKDSWVLHHDNADPFA 223
++Q Q+ ++PVA +IK LKL+ N +TLLL E +N DS D+ D
Sbjct: 423 LIEQEQKRHNPVAELIKLPLKLHENTMTLLLSELGADDEDEEANETDSE--PSDSEDEGT 480
Query: 224 DSSETK---KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
++++ K K +D+DL SA+ NA++++D KR+AA KQ+KT+ +++KA+KS E+K
Sbjct: 481 NAAQVKHDAKRLTIDIDLAGSAWVNARQYYDQKRTAAVKQEKTVLASKKAIKSTEQKVMA 540
Query: 281 TL-KDVQTMTNI-NKARKVYWFEKFYWFISSENYL--------------------GNV-- 316
TL KD++ ++ RK +WFEKF +F+SS+ YL G+V
Sbjct: 541 TLKKDLKQEKDVLRPVRKQFWFEKFIYFVSSDGYLVLAGKDAQQNEILYRRYLKKGDVYI 600
Query: 317 -----------------------------AWDAKVVT-NAW---------WVKADQVSKT 337
D V T +AW WV ADQVSKT
Sbjct: 601 HADLDGAASVIIKNKLNPEDPIPPSTLAQGGDLAVCTSSAWDSKAVMSAWWVNADQVSKT 660
Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
APTGE+L G F++RGKKNF PP +L +G +F++ E S
Sbjct: 661 APTGEYLAAGGFIVRGKKNFLPPAKLLLGFGVMFQISEES 700
>gi|156338807|ref|XP_001620041.1| hypothetical protein NEMVEDRAFT_v1g149359 [Nematostella vectensis]
gi|156204309|gb|EDO27941.1| predicted protein [Nematostella vectensis]
Length = 287
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 133/349 (38%), Positives = 166/349 (47%), Gaps = 113/349 (32%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P +M Q++ P EF SF VD+FFS+ S K+D+KA+ QE+ ALKKLENVK+DHE R+
Sbjct: 13 PFLMTQYKDHPYLEFPSFDKTVDDFFSSIGSQKLDVKALNQEKSALKKLENVKKDHEKRI 72
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L+ Q D KA+LI N + VD AIL + IANQ+ W +I VK+AQ DPVAS
Sbjct: 73 QQLQSAQEADVRKAQLIEINLDLVDRAILVVNSAIANQIDWSEILNLVKEAQIQGDPVAS 132
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
I++LKL NHIT+LLR L N P V VD+DL A+
Sbjct: 133 AIRELKLQTNHITMLLRYVS----------LASINGRP------------VRVDIDLLAH 170
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
NA+RF EKF
Sbjct: 171 LNARRF---------------------------------------------------EKF 179
Query: 304 YWFISSENYL--------------------GNVAWDAKVVTNAWWVKADQ---------- 333
WFISSENY+ GN + + + A + + Q
Sbjct: 180 LWFISSENYVVIGGRDQQQNELVVKRHLQPGNATCNT-IFSQATLICSYQSQLSTTAINH 238
Query: 334 ---------VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
VSKTAPTGE+LTTGSFMIRGKKNF PPC L MG SFLF++
Sbjct: 239 SYQSQLSITVSKTAPTGEYLTTGSFMIRGKKNFLPPCHLIMGFSFLFRV 287
>gi|339236819|ref|XP_003379964.1| serologically defined colon cancer antigen 1-like protein
[Trichinella spiralis]
gi|316977305|gb|EFV60421.1| serologically defined colon cancer antigen 1-like protein
[Trichinella spiralis]
Length = 789
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 144/414 (34%), Positives = 199/414 (48%), Gaps = 90/414 (21%)
Query: 29 SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
S GYL E+ E + + EFHP Q ++F++F AVDE+
Sbjct: 246 SCGYLKMTVEENAAGERIEIFH-EFHPYFFS---------QFANSECKQFDTFNGAVDEY 295
Query: 89 FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
FS +S K K +QQER ALK+LENV++DHE RL+ L+ Q++ + A + N E+V+
Sbjct: 296 FSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERMAVAVELNSETVE 355
Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
A+ +R IA +L W I ++ A+ DPVA I L L N + L V
Sbjct: 356 QALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFVMRLPV------- 408
Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQS 266
D F + E VDV++D LS++ N++R+F + +A KQ+KTI +
Sbjct: 409 -----------DVFDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAA 457
Query: 267 TEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK----- 321
KALKSAE +TK+ LK + TNI K RK++WFEKF+WF SS+ L DAK
Sbjct: 458 GGKALKSAELRTKEQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEIL 517
Query: 322 ---------------------------------------------VVTNAWW-------- 328
V +A W
Sbjct: 518 VKRYLKPGDLYVHADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSA 577
Query: 329 --VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
VK DQVSK+AP+GE+L TG FMIRGKKN+ QL MG LF+L+ S +R
Sbjct: 578 WWVKHDQVSKSAPSGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAAR 631
>gi|339260826|ref|XP_003368211.1| serologically defined colon cancer antigen 1-like protein
[Trichinella spiralis]
gi|316964832|gb|EFV49764.1| serologically defined colon cancer antigen 1-like protein
[Trichinella spiralis]
Length = 749
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 144/414 (34%), Positives = 199/414 (48%), Gaps = 90/414 (21%)
Query: 29 SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
S+GYL E+ E + + EFHP Q ++F++F AVDE+
Sbjct: 206 SRGYLKMTVEENAAGERIEIFH-EFHPYFFS---------QFASSECKQFDTFNGAVDEY 255
Query: 89 FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
FS +S K K +QQER ALK+LENV++DHE RL+ L+ Q++ + A + N E+V+
Sbjct: 256 FSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERMAVAVELNSETVE 315
Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
A+ +R IA +L W I ++ A+ DPVA I L L N + L V
Sbjct: 316 QALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFVMRLPV------- 368
Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQS 266
D F + E VDV++D LS++ N++R+F + +A KQ+KTI +
Sbjct: 369 -----------DVFDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAA 417
Query: 267 TEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK----- 321
KALKSAE TK+ LK + TNI K RK++WFEKF+WF SS+ L DAK
Sbjct: 418 GGKALKSAELHTKEQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEIL 477
Query: 322 ---------------------------------------------VVTNAWW-------- 328
V +A W
Sbjct: 478 VKRYLKPGDLYVHADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSA 537
Query: 329 --VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
VK DQVSK+AP+GE+L TG FMIRGKKN+ QL MG LF+L+ S +R
Sbjct: 538 WWVKHDQVSKSAPSGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAAR 591
>gi|452981583|gb|EME81343.1| hypothetical protein MYCFIDRAFT_114319, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1087
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 140/430 (32%), Positives = 218/430 (50%), Gaps = 84/430 (19%)
Query: 29 SQGYLVQKKEKRPNSEDYI--------VANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
++GY++ K+++ ED V FHP P+ + + Q F E++
Sbjct: 264 AKGYILGKRKQSSKQEDTDGTADEGKDVMYEHFHPF------KPAQLAEDQSFVFLEYDG 317
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEFFS+ E K++ + ++E +A K++E+ +++ E R+ L+Q Q + KA+ I
Sbjct: 318 FNVAVDEFFSSIEGQKLESRLQEREDNAKKRIEHARKEQEQRIEGLQQVQELHVRKAQAI 377
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N E V+ A A+ IA + W DI + ++ Q ++PVA +IK LKL+ N ITLLL
Sbjct: 378 EANVERVEEATAAVNGLIAQGMDWADIGSLIENEQARHNPVAELIKLPLKLHENTITLLL 437
Query: 200 ----RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKR 254
R E ++ DS ++ D + + L VD+DL SA++NA++++D KR
Sbjct: 438 SEIGRDADEEMDVTDSEPSDSEDGDAETAPARAEDKRLTVDIDLAASAWSNARQYYDQKR 497
Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTL-KDVQTMTNI-NKARKVYWFEKFYWFISSENY 312
+AA KQ++T +++KALKS E+ L KD++ ++ RK +WFEKF +FISS+ Y
Sbjct: 498 TAASKQERTEAASKKALKSTEQNVMAKLKKDLKQEKDVLRPVRKQFWFEKFIYFISSDGY 557
Query: 313 L--------------------GNV--------------------------------AWDA 320
L G+V A D
Sbjct: 558 LVLAGRDDLQNEMLYRRHLRKGDVYVHADLNGASSVVIKNSPHTPCAPIPPSTLAQAGDL 617
Query: 321 KVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
V ++ W V A+QVSKTA TGE+L GSF+IRGKKNF PP +L +G +
Sbjct: 618 VVCRSSAWDSKAVMSAWWVNAEQVSKTADTGEYLAVGSFIIRGKKNFLPPARLLLGFGVM 677
Query: 371 FKLEESSISR 380
F++ E S +R
Sbjct: 678 FQISEESKAR 687
>gi|452822547|gb|EME29565.1| RNA-binding protein [Galdieria sulphuraria]
Length = 1067
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 142/436 (32%), Positives = 219/436 (50%), Gaps = 100/436 (22%)
Query: 30 QGYLVQKKEKRPNSEDYIVANME------FHPMILQPGSTPSVMRQHQGFPIQEFESFAA 83
+GYL +K+ ++ D ++N E F P +L S S + EF +F
Sbjct: 291 RGYLFYRKD--AHTTDVSMSNSERLLYEDFSPFLLCHLSNTSHI---------EFPTFNE 339
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
AVD +F+ E + + A +QE KK++++++D E R+ LE+ + + + AE I N
Sbjct: 340 AVDIYFANLEKERAQIVASKQESVVSKKVDSLRKDLERRIDELERAKEENFKIAEAIELN 399
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHK 203
+ VD AI +R IAN ++W++++ +++ + +PVA I L L+ N ITL+L +
Sbjct: 400 ADEVDKAIWVVRAMIANGVAWDELDKMLEEEKEKGNPVAETIHSLHLDRNEITLMLPIDP 459
Query: 204 --ERSNIKDSWVLHHDNADPFADSSETK---------------KP-SLVDVDLDLSAYAN 245
E + +++ ++ + D+ ET+ KP L DVDL LSA+AN
Sbjct: 460 ILEDEFVNENFQYQSEDITYYDDTDETEEHFQTERMVAELNASKPVVLADVDLSLSAFAN 519
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-----KDVQTMTNINKARKVYWF 300
A R+F+ ++ A +K++KT+++T++AL AEKK + + + ++ I + RK WF
Sbjct: 520 AARYFESRKRAQEKKEKTMEATKRALNVAEKKASKQMERSQQRSLKPAVAIREIRKPAWF 579
Query: 301 EKFYWFISSENYLGNVAWDAK---------------------------VVTN-------- 325
EKF WFISSEN+L DA+ VV N
Sbjct: 580 EKFDWFISSENFLVIAGKDAQQNEQVVKRYMKTFDVYVHADIHGASSVVVKNRFRDKPVP 639
Query: 326 ----------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
AW WV A QVSKTAP+GE+LTTGSFMIRGKKN+ PP
Sbjct: 640 LQTLIEAGAFAMCHSSAWSSKIVSSAWWVHASQVSKTAPSGEYLTTGSFMIRGKKNYLPP 699
Query: 361 CQLAMGISFLFKLEES 376
QL MG LFK++ S
Sbjct: 700 SQLVMGYGILFKMDPS 715
>gi|89130574|gb|AAI14230.1| Zgc:153813 protein [Danio rerio]
Length = 556
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 121/287 (42%), Positives = 153/287 (53%), Gaps = 83/287 (28%)
Query: 162 LSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADP 221
+ W +I V +AQ DPVA IK+LKL NHITLLLR + A P
Sbjct: 99 VDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLR---------------NPEACP 143
Query: 222 FADSSETK---------KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
++E + K LVD+D++LSA+ANAKR++D KRSAAKK+QKT+++ +KA K
Sbjct: 144 EGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKAFK 203
Query: 273 SAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---------VV 323
SAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSENYL D + +
Sbjct: 204 SAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRYLR 263
Query: 324 TNAWWVKADQVSKTA-----PTGE------------------------------------ 342
+V AD T+ P+GE
Sbjct: 264 AGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQHDQ 323
Query: 343 ---------FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+LTTGSFMIRGKKNF PP L MG FLFK+++ S+ R
Sbjct: 324 VSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFR 370
>gi|429858117|gb|ELA32948.1| duf814 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 1040
Score = 197 bits (500), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 133/399 (33%), Positives = 202/399 (50%), Gaps = 77/399 (19%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP + P+ + EF+ F VDEFFS+ E K++ K ++E A +K
Sbjct: 294 DFHPFL------PNKFANDPTIKVLEFDGFNKTVDEFFSSLEGQKLESKLSEREAAAKRK 347
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE + D R+ L++ Q ++ +KA I N E V A+ A+ + + W DI +
Sbjct: 348 LEAARNDQAKRIEGLQEVQSLNVQKATAIEANVERVQEAMDAVNGLLQQGMDWIDISKLI 407
Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
++ Q+ +PVA IIK L LN+ N ITLLL ++ + ++ D +D +++ K
Sbjct: 408 EREQKRGNPVAEIIK-LPLNLADNTITLLLGEEEDIEDEDSNYETDSDASDSEDEAASNK 466
Query: 230 KPSL----VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
+ + VDV++ L+ YANA+ +++ KRSAAKK++KT+Q TE ALK+AE+K + L+
Sbjct: 467 QKTAKHLEVDVNIGLTPYANAREYYEQKRSAAKKEEKTVQQTEIALKNAEQKIQAELRKG 526
Query: 286 --QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAWDAKV- 322
Q + RK WFEKF WFIS++ YL G+V A +
Sbjct: 527 LKQEKAVLAPIRKQIWFEKFIWFISTDGYLVLGGKDAQQNEMLYKRYLRKGDVYIHADIH 586
Query: 323 --------------------------------VTNAWWVKA---------DQVSKTAPTG 341
++AW KA DQVSK+APTG
Sbjct: 587 GAATVIIKNTPSDPDAPIPPSTLAQAGTLAVCSSSAWDSKAGMGAWWVKADQVSKSAPTG 646
Query: 342 EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+L TGSFM+RG+KNF PP QL +G ++K+ E S +R
Sbjct: 647 EYLPTGSFMVRGQKNFLPPAQLLLGFGIMWKISEESKAR 685
>gi|452840445|gb|EME42383.1| hypothetical protein DOTSEDRAFT_73267 [Dothistroma septosporum
NZE10]
Length = 1122
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 144/453 (31%), Positives = 220/453 (48%), Gaps = 121/453 (26%)
Query: 22 VSNLNVSSQGYLVQKKEKRPNSEDYIVANME--------FHPMILQPGSTPSVMRQHQGF 73
+++L V+ +GY++ K+ K P ++ A + FHP +P Q +G
Sbjct: 258 ITSLPVA-RGYILAKR-KVPKADANTTATEDNQNVMYEDFHP--FKPA-------QLEGD 306
Query: 74 PIQEF---ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQ 130
P F E F AVD+FFS+ E K++ + ++E +A +++E +++ E R++ L+Q Q
Sbjct: 307 PANVFIEHEGFNKAVDDFFSSIEGQKLESRLQEREENAKRRIEQARQEQEKRITGLQQVQ 366
Query: 131 LVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLK 189
++ KA+ I N E V+ A+ A+ IA + W DI ++ Q+ ++PVA +IK LK
Sbjct: 367 ELNIRKAQAIEANVERVEEAVAAVNGLIAQGMDWVDIGRLIENEQKRHNPVAEMIKLPLK 426
Query: 190 LNINHITLLL---------------------RVHKERSNIKDSWVLHHDNADPFADSSET 228
L+ N TLLL ++NIK S+V +
Sbjct: 427 LHENTATLLLSELADADDEDMDETDSEPSDSEDEDHQANIKKSFVPEDERL--------- 477
Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--Q 286
VD+DL S ++NA++++D +R+AA KQ+KT Q+ +KALKS E+K LK Q
Sbjct: 478 ----TVDIDLAASGWSNARQYYDQRRTAATKQEKTAQAAQKALKSTEQKVMADLKKGLKQ 533
Query: 287 TMTNINKARKVYWFEKFYWFISSENYL--------------------GNV---------- 316
+ RK +WFEKF +FISS+ YL G+V
Sbjct: 534 EKEVLRPVRKQFWFEKFIYFISSDGYLVLAGKDAQQNEMLYRRHLRKGDVYVHADMHGAA 593
Query: 317 ----------------------AWDAKVVTNAWW----------VKADQVSKTAPTGEFL 344
A + V T++ W V ADQVSKTAPTGE+L
Sbjct: 594 SVIIKNNPATPQAPIPPSSLSQAGNLSVCTSSAWDSKAVMSAWWVNADQVSKTAPTGEYL 653
Query: 345 TTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
TTG FM+RGKKNF PP QL +G + +F++ E S
Sbjct: 654 TTGGFMVRGKKNFLPPAQLLLGFALVFQISEDS 686
>gi|384249421|gb|EIE22903.1| hypothetical protein COCSUDRAFT_16391 [Coccomyxa subellipsoidea
C-169]
Length = 1029
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 137/407 (33%), Positives = 201/407 (49%), Gaps = 97/407 (23%)
Query: 43 SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTA---------- 92
S D +V + F P+ILQ S V+R F ++ AA+DEF++ A
Sbjct: 267 SGDCLVYD-SFDPLILQQNSGQEVLR---------FPTYNAALDEFYAKARPAPLCLTMS 316
Query: 93 -ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAI 151
E K + +Q E+ AL KL+ ++ D R AL++ + KA+LI N E+VD AI
Sbjct: 317 VEGQKAEQARLQAEQAALSKLDRIRIDQTGRAEALDREAKEAEAKAQLIEANAEAVDQAI 376
Query: 152 LAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDS 211
A+R +A LSW ++E ++ + VA ++ L L+ N +TLL DS
Sbjct: 377 NAVRVALAQGLSWAELERLIRDEAAAGNQVAGLVHALHLDRNAVTLL-----------DS 425
Query: 212 WVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
NA+ ++ +LV+VDLDL+A NA+ + +++ + KQ KT+ + ++AL
Sbjct: 426 ------NAESNDETGTDVPTALVEVDLDLNAQQNARAWHSDRKARSAKQAKTLDANKRAL 479
Query: 272 KSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---------- 321
A+KK + L V+ + + + RK WFEKF WF++SENYL DA+
Sbjct: 480 VEADKKVQVQLSKVKAVAAVQQLRKPAWFEKFNWFVTSENYLVVSGRDAQQNELLVKRYL 539
Query: 322 -----------------VVTN-----------------------AW---------WVKAD 332
VV N AW WV A
Sbjct: 540 RKDDLYVHAELHGASTTVVRNHNPSRPGMALVSQAGTACVCRSQAWDAKIVTSAWWVHAH 599
Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
QVSK+AP+GE+L TGSFMIRG+KNF PP L MG++FLFKL+ES I+
Sbjct: 600 QVSKSAPSGEYLPTGSFMIRGRKNFLPPHPLIMGLTFLFKLDESCIA 646
>gi|156059014|ref|XP_001595430.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980]
gi|154701306|gb|EDO01045.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1063
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 144/436 (33%), Positives = 205/436 (47%), Gaps = 90/436 (20%)
Query: 28 SSQGYLV-QKKEKRPNSEDYIVANM------EFHPMILQPGSTPSVMRQHQGFPIQEFES 80
+S+GY++ +KK+ + S+D + +FHP P EFE
Sbjct: 257 TSKGYIIAKKKDSQVTSDDNQAEDRKGLLYDDFHPF------KPRQFEDDPTLVFLEFEG 310
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F VDEFFS+ E K++ + ++E +A KK++ + + RL L++ Q +++ KA +
Sbjct: 311 FNKTVDEFFSSIEGQKLESRLEERELNAKKKIQAARNEQAKRLGGLQEIQALNERKASAL 370
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N E V A A+ IA + W +I ++ Q+ +PVAS+IK LKL+ N +TLLL
Sbjct: 371 QANVERVQEARDAVNGLIAQGMDWFEIGRLIELEQKRKNPVASMIKLPLKLDQNTVTLLL 430
Query: 200 RVHKERSNIKDSWVLHHDNADPFADSSETKKPS-----------LVDVDLDLSAYANAKR 248
+ S+ D +D D + KP +D++L LS +ANA+
Sbjct: 431 DEEVFNDDEDSSYETDSDVSDS-EDEEKAAKPVEKEEKATETRLAIDINLSLSPWANARN 489
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWF 306
+FD KRSAA K+ KT+QS+ KALKS E K Q LK Q T + RK WFEKF WF
Sbjct: 490 YFDQKRSAASKEDKTLQSSSKALKSTEAKIAQDLKKGLKQEKTILRPVRKQIWFEKFVWF 549
Query: 307 ISSENYLGNVAWDAK--------------------------------------------- 321
ISS+ YL DA+
Sbjct: 550 ISSDGYLVLAGKDAQQSEILYKRYLRKGDMYLHADISGAASVIVRNNPKTPDAPIPPQTL 609
Query: 322 -------VVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLA 364
V T++ W V ADQVSK APTGEFL G F I+GKKNF PP QL
Sbjct: 610 SQAGTLVVATSSAWDSKAGMSAWWVNADQVSKAAPTGEFLPAGKFTIQGKKNFLPPAQLL 669
Query: 365 MGISFLFKLEESSISR 380
+G LF++ + S +R
Sbjct: 670 LGFGILFQISDESKAR 685
>gi|407928362|gb|EKG21221.1| protein of unknown function DUF814 [Macrophomina phaseolina MS6]
Length = 1094
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 139/437 (31%), Positives = 213/437 (48%), Gaps = 95/437 (21%)
Query: 29 SQGYLVQKKEKRP--------NSEDYIVANM----EFHPMILQPGSTPSVMRQHQGFPIQ 76
++GY+V KK +P SE+ A +FHP P
Sbjct: 261 AKGYIVAKKTGQPAPTEVSEEGSEEKAPAEKLLYEDFHPF------KPKQFEADPTLTFL 314
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
EFE F VDEFFS+ E K++ + ++E +A +KLE K++H RL L++ Q ++ K
Sbjct: 315 EFEGFNKTVDEFFSSIEGQKLESRLQEREENAKRKLEQAKQEHLKRLGGLQRAQELNIRK 374
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
AE I N + V A++A+ I + W +I+ +++ Q H +PVA +IK LKL N +
Sbjct: 375 AEAIQANVDRVQEAVMAVNGLIDKGMDWIEIDRLIEREQTHGNPVAQMIKVPLKLRENTV 434
Query: 196 TLLL---RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL--------VDVDLDLSAYA 244
TLLL V ++ + + S + D + + KKP++ +D+DL LS +A
Sbjct: 435 TLLLDEPGVEEDEEDFEGSETESEPSDD-EEEQQQRKKPAVKPQDNRLTIDIDLGLSPWA 493
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEK 302
NAK +FD K++AA K+++T+++++KALKS +KK + LK Q + RK +WFEK
Sbjct: 494 NAKTYFDQKKTAAAKEERTLEASQKALKSTQKKIEADLKKGLKQEKELLRPVRKQFWFEK 553
Query: 303 FYWFISSENYL--------------------GNV-------------------------- 316
F +FISS+ YL G++
Sbjct: 554 FIYFISSDGYLVIGGKDAQQNEILYRRHLKKGDIYVHADLSAAAVVIIKNRPSTPDDPIP 613
Query: 317 ------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
A + V T+ W V ADQVSKT +GE+L G F+I GKKNF PP
Sbjct: 614 PSTLSQAGNLSVSTSTAWDSKAVMSAWWVNADQVSKTTSSGEYLAAGGFVINGKKNFLPP 673
Query: 361 CQLAMGISFLFKLEESS 377
QL +G + +F++ E S
Sbjct: 674 AQLLLGFAVMFQITEES 690
>gi|303312187|ref|XP_003066105.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735
delta SOWgp]
gi|240105767|gb|EER23960.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735
delta SOWgp]
Length = 1125
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 134/426 (31%), Positives = 210/426 (49%), Gaps = 82/426 (19%)
Query: 29 SQGYLVQKKEKRPNSEDYIVAN--------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
++GY+V + E +P+ +++HP P I F+S
Sbjct: 264 TRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPF------APRQFADGNDISILTFDS 317
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDE++S+ E+ K++ + ++E +KLE KRDHE R+ AL+Q Q + KAE I
Sbjct: 318 FNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEIHTRKAEAI 377
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N V+ + A+ IA + W +I ++ Q +PVA +IK LKL N +T+LL
Sbjct: 378 ATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLYENTVTVLL 437
Query: 200 RVHKERSNIKDSWVLHHDNADPFADS-SETKKPSL--VDVDLDLSAYANAKRFFDLKRSA 256
+ DS ++ + ++ ++ ++P + VD+DL L+ +ANA +++D K++A
Sbjct: 438 PEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQYYDQKKTA 497
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL- 313
A K++KTI+++++ALKSAEKK LK Q + AR +WFEKFY+FISS+ YL
Sbjct: 498 AVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPVLRPARIPFWFEKFYFFISSDGYLV 557
Query: 314 -------------------GNV-------------------AWDAKV------------- 322
G+V A DA +
Sbjct: 558 LGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKNKPGASDAPIPPGTLAQAGTFTV 617
Query: 323 -VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
+ AW WV ADQVSKT P+GE+L TG +IRG KN P QL +G + +F+
Sbjct: 618 ATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPGQLILGFAVMFQ 677
Query: 373 LEESSI 378
+ S+
Sbjct: 678 ISPESV 683
>gi|297297786|ref|XP_002805097.1| PREDICTED: serologically defined colon cancer antigen 1-like
[Macaca mulatta]
Length = 856
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 127/341 (37%), Positives = 175/341 (51%), Gaps = 38/341 (11%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E + I+ EFHP + QH P EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411
Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
V+ E++ + + + KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
NAK+F + + I ++ K T D+ +++ A
Sbjct: 472 NAKKFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTG 531
Query: 305 WFISSEN--------YLGNVAWDAKVVTNAWWVKADQVSKT 337
I + AWDA+V+T+AWWV Q+ ++
Sbjct: 532 EPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQIIRS 572
>gi|73853411|gb|AAZ86776.1| IP12823p [Drosophila melanogaster]
Length = 489
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/237 (46%), Positives = 150/237 (63%), Gaps = 27/237 (11%)
Query: 29 SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S+GY++Q KE++P + ++ N+EFHP + Q + F FESF AV
Sbjct: 276 SKGYIIQVKEEKPTENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 326
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+ST ES KID+K +QQER+ALKKL NVK DH RL L + Q VD++KAELI +NQ
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD AI A++ IA+QLSW DI VK+AQ + D VAS IKQLKL NHI+L+L +
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK 262
D + + ++VDVDL LSA+ANA+R++D+KRSAA+K++K
Sbjct: 447 DE---------------DDDLKDPEVTVVDVDLALSAWANARRYYDMKRSAAQKKKK 488
>gi|367021400|ref|XP_003659985.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
42464]
gi|347007252|gb|AEO54740.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
42464]
Length = 1085
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/374 (33%), Positives = 189/374 (50%), Gaps = 71/374 (18%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F+ + VDEFFS+ E K++ + ++E A +KLE +RD E R+ L++ Q+++ KA
Sbjct: 312 FDGYNKTVDEFFSSLEGQKLESRLQEREAIAKRKLEAARRDQEQRIEGLQEAQMLNLRKA 371
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
I N E V A+ A+ + + W D+ V++ Q+ ++PVA II+ ++L+ N IT
Sbjct: 372 AAIEANIERVQEAMDAVNGLLQQGMDWVDVNKLVEREQKLHNPVAEIIQLPMRLHENVIT 431
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSS---ETKKPS---LVDVDLDLSAYANAKRFF 250
LLL +E +D +D + AD + K P+ +D++L LS + NA+ ++
Sbjct: 432 LLLGEEEEEGEAEDKLDFDYDTDEEAADDGVPDKAKGPAKRLAIDINLKLSPWNNAREYY 491
Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
+ KR+AA+KQQKT+Q +E ALK+AE+K + LK Q + RK WFEKF WFIS
Sbjct: 492 EQKRTAAEKQQKTVQQSEIALKNAEQKIAEDLKKGLKQEKPVLQPIRKQLWFEKFIWFIS 551
Query: 309 SENYLGNVAWDAK----------------------------------------------- 321
S+ YL DA+
Sbjct: 552 SDGYLVLGGRDAQQNEILYKRYLRKGDVYVHADMHGAPTVIVKNNPKTPDAPIPPSTLAQ 611
Query: 322 ------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
+NAW WV ADQVSK+AP GE+L GSFM+RGK+N PP L +G
Sbjct: 612 AGSLSVCCSNAWDSKAAMGAYWVNADQVSKSAPAGEYLPVGSFMVRGKRNPLPPALLMLG 671
Query: 367 ISFLFKLEESSISR 380
+FK+ E S +R
Sbjct: 672 FGLMFKVSEESKAR 685
>gi|384489957|gb|EIE81179.1| hypothetical protein RO3G_05884 [Rhizopus delemar RA 99-880]
Length = 1044
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 108/262 (41%), Positives = 161/262 (61%), Gaps = 11/262 (4%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
EFHP + +Q +EF +F AVDEFFS+ E+ K++LK +QE ALKK
Sbjct: 250 EFHPHLY---------KQFSNRKFKEFSTFDQAVDEFFSSIEAQKLELKTRRQEEAALKK 300
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE VK + E R+ +L QL + KA+LI N + VDAAI IR +A+Q+ W+D+ V
Sbjct: 301 LEAVKLEQEKRVESLLNQQLTNTRKAQLIELNLQFVDAAITIIRNAVASQMDWQDLNDLV 360
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
K+ +R +P+A II LKL N +TLLL +E + + + + K+
Sbjct: 361 KEEKRRGNPIALIIDTLKLETNQVTLLLTDPEEHEESESDDEEEEEEEE--EKEEKPKEI 418
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
+DVD+ L+A+ANA+++++ K++ A K +KTI+++ KALKSAE+K ++ LK+ + I
Sbjct: 419 FKIDVDIGLTAFANARKYYEQKKTTASKHEKTIEASTKALKSAERKIRKDLKETKITATI 478
Query: 292 NKARKVYWFEKFYWFISSENYL 313
NK RK +WFEKF WFIS+E YL
Sbjct: 479 NKIRKPFWFEKFQWFISTEGYL 500
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/63 (71%), Positives = 54/63 (85%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+A+WV DQVSK+AP+GE+LTTGSFMIRGKKNF PP QL G +LFKL+ES
Sbjct: 566 AWDSKIVTSAYWVYPDQVSKSAPSGEYLTTGSFMIRGKKNFLPPVQLVYGFGYLFKLDES 625
Query: 377 SIS 379
SI
Sbjct: 626 SIG 628
>gi|440466993|gb|ELQ36234.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
Y34]
gi|440486785|gb|ELQ66618.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
P131]
Length = 1095
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 143/450 (31%), Positives = 210/450 (46%), Gaps = 94/450 (20%)
Query: 17 TLHVLVSNLNVSSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGST---------P 64
TL + S ++ GY+V K S ED + P + PGS+ P
Sbjct: 273 TLAGITSAAQIT--GYIVAKTRDGAASVQNEDRVSEGALVKPFV--PGSSKDLLYEDFQP 328
Query: 65 SVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
+ +Q P I EFE F VDEF+S+ E K++ + ++E A KKL+ + +
Sbjct: 329 FLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAK 388
Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
R+ LE++QL++ KA I N E V A+ A+ + N + W DI V++ Q+ N+PV
Sbjct: 389 RIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPV 448
Query: 182 ASIIKQLKLNINHITLLLRVH---------KERSNIKDSWVLHHDNADPFADSSETKKPS 232
A+IIK L +++ + T+ LR+ + + + D+ + A S + K
Sbjct: 449 AAIIK-LPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRE 507
Query: 233 L-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMT 289
L VD+ L+LS ++NA ++D KRSAA+K++KTI + ALKSA +K + L+ Q
Sbjct: 508 LEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKP 567
Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
I R WFEK+ WF+SS+ YL DA+
Sbjct: 568 VIQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVI 627
Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
+NAW WV ADQVSK APTGEFL G
Sbjct: 628 IKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAG 687
Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
SFMI+GKKN PP L +G LF++ E S
Sbjct: 688 SFMIKGKKNELPPATLVIGFGLLFRISEES 717
>gi|159129335|gb|EDP54449.1| DUF814 domain protein, putative [Aspergillus fumigatus A1163]
Length = 1116
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 142/430 (33%), Positives = 213/430 (49%), Gaps = 92/430 (21%)
Query: 31 GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFP---IQE 77
GY+V K++KRP++E A +FHP +P RQ +G P I E
Sbjct: 265 GYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHP--FRP-------RQFEGNPEVKILE 315
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F +F A VDE+FS+ E+ K++ + ++E A +KL+ V+++HE RL AL++ Q + KA
Sbjct: 316 FSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRKA 375
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
I +N V A+ A+ IA + W +I ++ Q +PVA IIK LKL N IT
Sbjct: 376 AAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTIT 435
Query: 197 LLLRVHKERSNIKDS--WVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDL 252
L+L E + D W + ++ + K + +D+DL LS +ANA ++++
Sbjct: 436 LVLGEASEEQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPWANATQYYEQ 495
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
K+ AA K+QKT QS+ KALKS EKK + LK Q + ARK +WFEKF +FISSE
Sbjct: 496 KKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFLFFISSE 555
Query: 311 NYL--------------------GNV--------------------------------AW 318
YL G++ A
Sbjct: 556 GYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQAG 615
Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
+ V T++ W V A+QV+KT TG L TG F I+G+KNF P QL +G +
Sbjct: 616 NLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNFLAPSQLVLGFA 674
Query: 369 FLFKLEESSI 378
+F++ ++S+
Sbjct: 675 VMFQISKNSL 684
>gi|451850505|gb|EMD63807.1| hypothetical protein COCSADRAFT_182004 [Cochliobolus sativus
ND90Pr]
Length = 1128
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 137/432 (31%), Positives = 208/432 (48%), Gaps = 91/432 (21%)
Query: 30 QGYLVQKKE---KRPNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEFESFAA 83
+GY++ K +P+ E + + +FHP Q ++ + EF+ F
Sbjct: 265 KGYILAKPNPSASKPDDESSDKSRLLYDDFHPFRPQ-------QFENTDYTFLEFDGFNK 317
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
AVDEFFS+ E K++ K ++E+ A KKLE +++HE R+ L+Q Q ++ KAE I+ N
Sbjct: 318 AVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIGGLQQVQELNFRKAEAILAN 377
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL--- 199
V A A+ I + W DIE +++ Q + VA +I+ LKL+ N ITLLL
Sbjct: 378 VHRVTEATEAVNGLIRQGMDWVDIERLIEREQNSGNAVAQLIRLPLKLHENTITLLLNET 437
Query: 200 ---------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRF 249
E S++ + D + +P L +D+DL LSA+AN+ +
Sbjct: 438 NWEKGGEEEDEGNETSSVSEDTDDEDDRPRKTSPPKPVARPQLAIDIDLGLSAWANSTEY 497
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
FD K++AA K+ +T+Q++ KALKS EKK + LK Q + RK +WFEKF +FI
Sbjct: 498 FDQKKTAADKEGRTLQASSKALKSHEKKVAEDLKKGLKQEKEVLRPVRKQHWFEKFIYFI 557
Query: 308 SSENYL--------------------GNV------------------------------- 316
SS+ YL G+V
Sbjct: 558 SSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLS 617
Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
A + + T+ W V++DQVSKT TGEFL G F I+GKK F PP QL +
Sbjct: 618 QAGNLSICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNIKGKKEFLPPAQLVV 677
Query: 366 GISFLFKLEESS 377
G++ +F++ +SS
Sbjct: 678 GLAVMFEISDSS 689
>gi|452000540|gb|EMD93001.1| hypothetical protein COCHEDRAFT_1172752 [Cochliobolus
heterostrophus C5]
Length = 1128
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 132/404 (32%), Positives = 196/404 (48%), Gaps = 85/404 (21%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP Q ++ + EF+ F AVDEFFS+ E K++ K ++E+ A KK
Sbjct: 293 DFHPFRPQ-------QFENTDYTFLEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKK 345
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE +++HE R+ L+Q Q ++ KAE I+ N V A A+ I + W DIE +
Sbjct: 346 LEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIERLI 405
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL------------RVHKERSNIKDSWVLHHDN 218
++ Q + VA +I+ LKL+ N ITLLL E S++ + D
Sbjct: 406 EREQNSGNAVAQLIRLPLKLHENTITLLLNETNWEEGGEEEDEGNETSSVSEDTDDEDDR 465
Query: 219 ADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
+ +P L +D+DL LSA+AN+ +FD K++AA K+ +T+Q++ KALKS EKK
Sbjct: 466 PRKTSPPKPVARPQLAIDIDLGLSAWANSTEYFDQKKTAADKEGRTLQASTKALKSHEKK 525
Query: 278 TKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GN 315
+ LK Q + RK +WFEKF +FISS+ YL G+
Sbjct: 526 VAEDLKKGLKQEKEVLRPVRKQHWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRFLRKGD 585
Query: 316 V--------------------------------AWDAKVVTNAWW----------VKADQ 333
V A + + T+ W V++DQ
Sbjct: 586 VYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLSICTSDAWDSKAVMSAWWVRSDQ 645
Query: 334 VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
VSKT TGEFL G F I+GKK F PP QL +G++ +F++ +SS
Sbjct: 646 VSKTGQTGEFLPAGMFNIKGKKEFLPPAQLVVGLAVMFEISDSS 689
>gi|389646873|ref|XP_003721068.1| nuclear export mediator factor [Magnaporthe oryzae 70-15]
gi|351638460|gb|EHA46325.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
70-15]
Length = 1074
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 142/450 (31%), Positives = 210/450 (46%), Gaps = 94/450 (20%)
Query: 17 TLHVLVSNLNVSSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGST---------P 64
TL + S ++ GY+V K S ED + P + PGS+ P
Sbjct: 252 TLAGITSAAQIT--GYIVAKTRDGAASVQNEDRVSEGALVKPFV--PGSSKDLLYEDFQP 307
Query: 65 SVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
+ +Q P I EFE F VDEF+S+ E K++ + ++E A KKL+ + +
Sbjct: 308 FLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAK 367
Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
R+ LE++QL++ KA I N E V A+ A+ + N + W DI V++ Q+ N+PV
Sbjct: 368 RIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPV 427
Query: 182 ASIIKQLKLNINHITLLLRVH---------KERSNIKDSWVLHHDNADPFADSSETKKPS 232
A+II +L +++ + T+ LR+ + + + D+ + A S + K
Sbjct: 428 AAII-ELPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRE 486
Query: 233 L-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMT 289
L VD+ L+LS ++NA ++D KRSAA+K++KTI + ALKSA +K + L+ Q
Sbjct: 487 LEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKP 546
Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
I R WFEK+ WF+SS+ YL DA+
Sbjct: 547 VIQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVI 606
Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
+NAW WV ADQVSK APTGEFL G
Sbjct: 607 IKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAG 666
Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
SFMI+GKKN PP L +G LF++ E S
Sbjct: 667 SFMIKGKKNELPPATLVIGFGLLFRISEES 696
>gi|119193306|ref|XP_001247259.1| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
gi|392863500|gb|EAS35746.2| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
Length = 1125
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 138/432 (31%), Positives = 207/432 (47%), Gaps = 94/432 (21%)
Query: 29 SQGYLVQKKEKRP------------NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
++GY+V + E +P + +YI ++HP P I
Sbjct: 264 TRGYIVARNENKPPENPSFSGEAKPDKSNYI----DYHPF------APRQFVDGNDTSIL 313
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
F+SF AVDE++S+ E+ K++ + ++E +KLE KRDHE R+ AL+Q Q + +
Sbjct: 314 TFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEIHTRR 373
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
AE I N V+ + A+ IA + W +I ++ Q +PVA +IK LKL N +
Sbjct: 374 AEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLYENTV 433
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-----VDVDLDLSAYANAKRFF 250
T+LL + + + +D D D KKP VD+DL L+ +ANA +++
Sbjct: 434 TVLL--PEGQPDGEDDDSEESGEEDEENDGEAKKKPQRPEVLSVDIDLGLTPWANASQYY 491
Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
D K++AA K+ KTI+++++ALKSAEKK LK Q + AR +WFEKFY+FIS
Sbjct: 492 DQKKTAAIKEDKTIKASKQALKSAEKKLTTDLKRGLKQEKPVLRPARIPFWFEKFYFFIS 551
Query: 309 SENYL--------------------GNV-------------------AWDAKV------- 322
S+ YL G+V A DA +
Sbjct: 552 SDGYLVLGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKNKPGASDAPIPPGTLAQ 611
Query: 323 -------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
+ AW WV ADQVSKT P+GE+L TG +IRG KN P QL +G
Sbjct: 612 AGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPGQLILG 671
Query: 367 ISFLFKLEESSI 378
+ +F++ S+
Sbjct: 672 FAVMFQISPESV 683
>gi|86196391|gb|EAQ71029.1| hypothetical protein MGCH7_ch7g436 [Magnaporthe oryzae 70-15]
Length = 1095
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 142/450 (31%), Positives = 210/450 (46%), Gaps = 94/450 (20%)
Query: 17 TLHVLVSNLNVSSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGST---------P 64
TL + S ++ GY+V K S ED + P + PGS+ P
Sbjct: 273 TLAGITSAAQIT--GYIVAKTRDGAASVQNEDRVSEGALVKPFV--PGSSKDLLYEDFQP 328
Query: 65 SVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
+ +Q P I EFE F VDEF+S+ E K++ + ++E A KKL+ + +
Sbjct: 329 FLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAK 388
Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
R+ LE++QL++ KA I N E V A+ A+ + N + W DI V++ Q+ N+PV
Sbjct: 389 RIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPV 448
Query: 182 ASIIKQLKLNINHITLLLRVH---------KERSNIKDSWVLHHDNADPFADSSETKKPS 232
A+II +L +++ + T+ LR+ + + + D+ + A S + K
Sbjct: 449 AAII-ELPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRE 507
Query: 233 L-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMT 289
L VD+ L+LS ++NA ++D KRSAA+K++KTI + ALKSA +K + L+ Q
Sbjct: 508 LEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKP 567
Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
I R WFEK+ WF+SS+ YL DA+
Sbjct: 568 VIQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVI 627
Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
+NAW WV ADQVSK APTGEFL G
Sbjct: 628 IKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAG 687
Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
SFMI+GKKN PP L +G LF++ E S
Sbjct: 688 SFMIKGKKNELPPATLVIGFGLLFRISEES 717
>gi|326434920|gb|EGD80490.1| hypothetical protein PTSG_13144 [Salpingoeca sp. ATCC 50818]
Length = 947
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 186/376 (49%), Gaps = 63/376 (16%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P ++Q +G +EF+SF AVD +FS ESHK++++A+QQER A +KLE +R H R+
Sbjct: 111 PFPLKQFEGRTFKEFDSFDVAVDTYFSEIESHKLEMRALQQERAARQKLEQARRSHHDRV 170
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
++ +L D+ KA LI N E V+ AI I + + N L W +IE V++++ DPVA+
Sbjct: 171 KGYKEARLEDEYKATLIELNHELVNEAIDVINKMVGNHLDWREIEELVQESRVRGDPVAN 230
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
I +LKL N I + H ++ + + D D + K +LV++DL +A+
Sbjct: 231 AISKLKLKKNAIVM----HLTEPSMGGADDDSWSDEDEDEDEDDNTKGALVEIDLAETAH 286
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
NA++ + K++ K++K + STE+AL+S EK+ LK Q I+K+R WFEKF
Sbjct: 287 GNARKLHERKKTIRSKEEKALASTEQALRSVEKRAMDRLKKTQITATISKSRAPLWFEKF 346
Query: 304 YWFISSENYLGNVAWD---------AKVVTNAWWVKADQ------VSKTAPTGEF----- 343
WFISSENYL D + + +V AD V K + TGE
Sbjct: 347 NWFISSENYLVLAGRDRLQNEALVRKHLTQHDLYVHADMNGASSVVVKNSNTGEIPPKTL 406
Query: 344 ---------------------------------------LTTGSFMIRGKKNFFPPCQLA 364
L GSF I G K+F P QLA
Sbjct: 407 SEAATFAVAHSPAWENNQQADAWWVHANQVEKTSSEGKPLGAGSFRITGAKHFIPIRQLA 466
Query: 365 MGISFLFKLEESSISR 380
+ + LFK+++ S R
Sbjct: 467 LAYAILFKVDDESAKR 482
>gi|71001140|ref|XP_755251.1| DUF814 domain protein [Aspergillus fumigatus Af293]
gi|66852889|gb|EAL93213.1| DUF814 domain protein, putative [Aspergillus fumigatus Af293]
Length = 1116
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 141/430 (32%), Positives = 212/430 (49%), Gaps = 92/430 (21%)
Query: 31 GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFP---IQE 77
GY+V K++KRP++E A +FHP +P RQ +G P I E
Sbjct: 265 GYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHP--FRP-------RQFEGNPEVKILE 315
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F +F A VDE+FS+ E+ K++ + ++E A +KL+ V+++HE RL AL++ Q + KA
Sbjct: 316 FSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRKA 375
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
I +N V A+ A+ IA + W +I ++ Q +PVA IIK LKL N IT
Sbjct: 376 AAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTIT 435
Query: 197 LLLRVHKERSNIKDS--WVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDL 252
L+L + D W + ++ + K + +D+DL LS +ANA ++++
Sbjct: 436 LVLGEASREQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPWANATQYYEQ 495
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
K+ AA K+QKT QS+ KALKS EKK + LK Q + ARK +WFEKF +FISSE
Sbjct: 496 KKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFLFFISSE 555
Query: 311 NYL--------------------GNV--------------------------------AW 318
YL G++ A
Sbjct: 556 GYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQAG 615
Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
+ V T++ W V A+QV+KT TG L TG F I+G+KNF P QL +G +
Sbjct: 616 NLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNFLAPSQLVLGFA 674
Query: 369 FLFKLEESSI 378
+F++ ++S+
Sbjct: 675 VMFQISKNSL 684
>gi|340516439|gb|EGR46688.1| predicted protein [Trichoderma reesei QM6a]
Length = 1078
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 143/470 (30%), Positives = 215/470 (45%), Gaps = 94/470 (20%)
Query: 2 VSNEDVSS---DIHKCGPTLHVLVSNLN------------VSSQGYLVQKKEKRPNSEDY 46
+N D+S+ DI P L LV++L+ + +GY+ K+ P+S
Sbjct: 224 ANNFDISAKPADIVDNEPLLDALVNHLSEARDVVENIIASSTCKGYIFAKRRTTPSSAPD 283
Query: 47 IVANMEFHPMILQPGSTPSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQ 103
+ H +L P V ++ + P + EF+ + VDEFFS+ E K++ +
Sbjct: 284 DAEQAQKHEGLLYEDFHPFVPQKFKNDPSIQVLEFDGYNRTVDEFFSSLEGQKLESRLTG 343
Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
+E A KKLE +++ R+ L+ Q ++ KA I N E V A+ A+ +A +
Sbjct: 344 REEAARKKLEAARQEQAKRIQGLQDAQAMNYRKAAAIEANVERVQEAMDAVNGLLAQGMD 403
Query: 164 WEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWV---LHHDNA 219
W DI +++ ++ +PVA II LKL N ITLLL + + D++
Sbjct: 404 WVDIGKLIEREKKRQNPVAEIISLPLKLADNTITLLLAEEAFDEDEAEEEEDNPFETDDS 463
Query: 220 DPFADSS--------ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
D D +T K VD+ L++S ++NA+ +++ +RSAA KQ+KT Q KAL
Sbjct: 464 DSEEDQGGKATSKDKKTDKLLTVDIVLNMSPWSNAREYYEERRSAAMKQEKTQQQATKAL 523
Query: 272 KSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL---------------- 313
KS E+K + LK Q + RK WFEKF WFISS+ YL
Sbjct: 524 KSTEQKIAEDLKKGLKQEKALLQPIRKQMWFEKFLWFISSDGYLVLGGKDPQQSEILYRR 583
Query: 314 ----GNVAWDAKV---------------------------------VTNAW--------- 327
G+V A + + AW
Sbjct: 584 YLRKGDVYCHADIRGAANIVIKNNPNMPDAPIPPATLSQAGSLSVCTSEAWDSKAGMGAW 643
Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
WV ADQVSKT P+G+ L G+F I+GKKN+ PP QL +G+ F FK+ E S
Sbjct: 644 WVNADQVSKTTPSGDILPAGTFTIQGKKNYLPPTQLLLGLGFAFKISEQS 693
>gi|330929686|ref|XP_003302734.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
gi|311321722|gb|EFQ89181.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
Length = 1133
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 136/409 (33%), Positives = 199/409 (48%), Gaps = 90/409 (22%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP Q ++ + EF+ F AVDEFFS+ E K++ K ++E+ A KK
Sbjct: 292 DFHPFRPQ-------QFENSDYTFLEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKK 344
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE +++HE R+ L+Q Q ++ KAE I+ N V A A+ I + W DI +
Sbjct: 345 LEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIARLI 404
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVH--KERSNIKD-----SWVLHHDNADPFA 223
++ Q + VA +IK LKLN N ITLLL +E ++D S V + +
Sbjct: 405 EREQNSGNAVAQLIKLPLKLNENTITLLLDETNWEEGQEVEDEGNETSSVSEDSDEEAAG 464
Query: 224 DSSETKK----------PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
+ KK P L +D+DL L+A+AN+ +FD K++AA K+ +T+Q++ +ALK
Sbjct: 465 EEDGAKKKSAPVKVSARPQLAIDIDLSLTAWANSTEYFDQKKTAANKEDRTLQASTRALK 524
Query: 273 SAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL----------------- 313
S EKK + LK Q + RK +WFEKF +FISS+ YL
Sbjct: 525 SHEKKVAEDLKKGLKQEKEVLRPVRKQHWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRF 584
Query: 314 ---GNV-------------------AWDAKV--------------VTNAWWVKA------ 331
G+V DA + ++AW KA
Sbjct: 585 LRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLCICTSDAWDSKAVMSAWW 644
Query: 332 ---DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
DQVSKT TGEFL G F ++GKK F PP QL +G++ +F++ ESS
Sbjct: 645 VRSDQVSKTGQTGEFLPAGMFNVKGKKEFLPPAQLVVGLAVMFEISESS 693
>gi|449299546|gb|EMC95559.1| hypothetical protein BAUCODRAFT_71160 [Baudoinia compniacensis UAMH
10762]
Length = 1052
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 133/407 (32%), Positives = 199/407 (48%), Gaps = 88/407 (21%)
Query: 52 EFHPM-ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALK 110
+FHP Q S PS+ E E F DEFFS+ E K++ + ++E +A +
Sbjct: 295 DFHPFKPAQLASDPSIT-------FLEHEGFNKTCDEFFSSLEGQKLESRLQEREDNAKR 347
Query: 111 KLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170
K+E +++ R+ L+ Q ++ KA+ I N E V+ A+ A+ IA + W DI
Sbjct: 348 KIEQARQEQAKRIDGLQHVQELNVRKAQAIEANVERVEEAVAAVNGLIAQGMDWMDIGRL 407
Query: 171 VKQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSW------------VLHHD 217
++ Q ++ VA +IK LKL N +TLLL E + +++ + D
Sbjct: 408 IENEQSRHNAVAEMIKLPLKLYENTVTLLL---SEYAGLEEDYDDMADETESEESEDEAD 464
Query: 218 NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
P S +K VDVDL LS ++NA++++D KR+AA+KQ++T Q+++KALKS E+K
Sbjct: 465 TQAPRHTSKPEEKRLAVDVDLALSPWSNARQYYDQKRTAAEKQERTAQASQKALKSTEQK 524
Query: 278 TKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GN 315
LK Q + RK +FEKF +FISS+ YL G+
Sbjct: 525 VMADLKKGLKQEKDVLRPVRKQMYFEKFNYFISSDGYLVLAGRDAQQNEMLYRRYLKKGD 584
Query: 316 V--------------------------------AWDAKVVTNAWW----------VKADQ 333
V A + V T+ W V ++Q
Sbjct: 585 VYIHADLHGAASVIVKNDPQTPEAPIPPSTLGQAGNLAVCTSTAWDSKAVMSAWWVGSEQ 644
Query: 334 VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
VSKTAPTGE+LTTG F+IRGKKN+ PP QL +G + LF++ E S +R
Sbjct: 645 VSKTAPTGEYLTTGGFVIRGKKNYLPPAQLLLGFAVLFRISEESKAR 691
>gi|361131825|gb|EHL03460.1| putative Nuclear export mediator factor Nemf [Glarea lozoyensis
74030]
Length = 1063
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 135/438 (30%), Positives = 207/438 (47%), Gaps = 86/438 (19%)
Query: 22 VSNLNVSSQGYLVQKKEKR------PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
V++ V++ G+++ KK++ P V +FHP P+
Sbjct: 257 VTSCEVAT-GFIIAKKKEGYEENSDPEHSSKNVLYDDFHPF------RPAQFESDPATVF 309
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
++E F VDEFFS+ E +++ K ++E +A +K++ ++D E RL L+ Q +++
Sbjct: 310 LQYEGFNKTVDEFFSSIEGQRLESKLEERELNAQRKIQAARQDQERRLDGLQAVQSLNER 369
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINH 194
KA I N E V A+ A+ +A + W +I ++ ++ ++PVAS+IK LKL N
Sbjct: 370 KASAIQANVERVQEAMDAVNGLVAQGMDWVEIGKLIEVEKKRSNPVASMIKLPLKLEENT 429
Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS--------LVDVDLDLSAYANA 246
ITLLL + ++ +D + ++ K+P VD+DL L+ + NA
Sbjct: 430 ITLLLDEEVFDEDEDSAYETDDAPSDSEDEVTKQKEPKEKGVEKRLTVDIDLGLTPWKNA 489
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFY 304
+ +FD KR AA K+QKT++S+ KALKS E K LK Q + R++ WFEKF
Sbjct: 490 REYFDEKRQAATKEQKTLESSTKALKSQEAKIAHDLKKGLQQEKAVLRPVRRLMWFEKFI 549
Query: 305 WFISSENYLGNVAWDAK---------------------------VVTN------------ 325
WFISS+ YL DA+ VV N
Sbjct: 550 WFISSDGYLVLGGKDAQQNEMLYKKYMKKGDAFLHADIQGAATVVVRNDPRTPDAPIPPS 609
Query: 326 --------------AWWVKAD---------QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
AW KA Q+SK AP+G+FL GSF + GKKNF PP Q
Sbjct: 610 TLSQAGSLVVSCSVAWDSKAGMSAWWASATQISKAAPSGDFLPPGSFSVNGKKNFLPPSQ 669
Query: 363 LAMGISFLFKLEESSISR 380
L +G +F++ ESS S+
Sbjct: 670 LLLGFGVIFRISESSKSK 687
>gi|67539818|ref|XP_663683.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
gi|40738864|gb|EAA58054.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
Length = 1588
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 135/431 (31%), Positives = 207/431 (48%), Gaps = 89/431 (20%)
Query: 31 GYLVQKKEKRP-----------NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
G++V K++ RP + + +FHP P GF I E+
Sbjct: 265 GFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPF------KPRQFEGKDGFTILEYP 318
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
S A VDE+FS+ ES K++ + ++E A KKL++++ +HE R+ ALEQ Q + KA
Sbjct: 319 SMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKASA 378
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
I +N + V A+ A+ +A + W +I V+ Q+ +PVAS+IK LKL+ N ITLL
Sbjct: 379 IQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITLL 438
Query: 199 LR-VHKERSNIKDSWVLHHDNADPF----ADSSETKKPS--LVDVDLDLSAYANAKRFFD 251
LR E +++ + +S KKP +D+DL LS +ANA ++++
Sbjct: 439 LREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYYE 498
Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEKFYWFISS 309
K+ AA K +KT QS+ KALKS E+K + LK Q + ARK +WFEKF +F+SS
Sbjct: 499 QKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQVLRPARKPFWFEKFLFFVSS 558
Query: 310 ENYL--------------------GNV--------------------------------A 317
E YL G+V A
Sbjct: 559 EGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKNKPGALSSSISPTTLSQA 618
Query: 318 WDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
+ V T+ W V A QVSKT+ G+ L G F+++G+KNF P QL +G
Sbjct: 619 GNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQLVLGF 678
Query: 368 SFLFKLEESSI 378
+ ++++ + S+
Sbjct: 679 AVMWQISKGSL 689
>gi|406864313|gb|EKD17358.1| serologically defined colon cancer antigen 1 [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 1052
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 142/431 (32%), Positives = 211/431 (48%), Gaps = 92/431 (21%)
Query: 29 SQGYLVQKKEK---RPNSEDY---IVANMEFHPMILQPGSTPSVMRQHQGFPIQ---EFE 79
++GY++ KK++ + ED V +FHP +P RQ + P EF+
Sbjct: 263 AKGYIIAKKKEGYEEASPEDQARKFVIYEDFHP--FRP-------RQFENDPATVFLEFQ 313
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
F D+FFS+ E K++ + ++E+ A +K+E ++D RL L++ Q ++ KA
Sbjct: 314 GFNKTADQFFSSIEGQKLESRLQEREQMAKRKIEAARQDQAKRLGGLQEVQELNIRKAGA 373
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
+ N E V A+ A+ +A + W +I V+ Q+ N+PVASIIK LKL N I+LL
Sbjct: 374 LQANAERVQEAMDAVNGLVAQGMDWVEIGKLVEIEQKRNNPVASIIKLPLKLQENTISLL 433
Query: 199 LRVHKERSNIKDSWVLHHDNADPFADSSETKKPS--------LVDVDLDLSAYANAKRFF 250
L ++ + + ++ D +D D + K+P +DV+L LS +ANA+ ++
Sbjct: 434 LDEEEDADDDESNYETDSDVSDS-EDEAPKKEPKQKTVEKRLTIDVNLALSPWANAREYY 492
Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
D KR+AA+K+QKT+QS+ KALKS E K L+ Q + R+ WFEKF WFIS
Sbjct: 493 DQKRTAAEKEQKTLQSSTKALKSQEAKIAHDLRKGLKQEKAVLRPVRRQMWFEKFTWFIS 552
Query: 309 SENYL--------------------GNVAWDAKV-------------------------- 322
S+ YL G+V A+V
Sbjct: 553 SDGYLVLAGKDPQQKETLYRRYLKKGDVYVHAEVQGAASVVIRNNPKTPDAPIPPSTLSQ 612
Query: 323 -------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
++AW WV ADQVSK A TGEFL GSF I+GKKNF PP L +G
Sbjct: 613 AGTLSISCSSAWEAKAGMSAWWVNADQVSKAASTGEFLPAGSFNIKGKKNFLPPAVLLLG 672
Query: 367 ISFLFKLEESS 377
+F + E S
Sbjct: 673 FGVIFLISEES 683
>gi|320169195|gb|EFW46094.1| serologically defined colon cancer antigen 1 [Capsaspora owczarzaki
ATCC 30864]
Length = 1151
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/331 (35%), Positives = 173/331 (52%), Gaps = 51/331 (15%)
Query: 21 LVSNLNVSSQ--GYLVQKKEKRPNSEDY------------------------IVANMEFH 54
L+ ++ SS+ GY++Q+KE R + + I E+H
Sbjct: 240 LLIDVGASSEVPGYIIQRKESRATAANKNKGVGDEQAAVAAALASASGDASDIFVFDEYH 299
Query: 55 PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
P + + QH+ P+ F +F AVDEF+S E ++D+K + ER+ LKKLE
Sbjct: 300 PFLFE---------QHKARPVVHFPTFDRAVDEFYSRIEGQRLDMKHIGDERNVLKKLEK 350
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
K + E +L L TQ + + +LI++NQ A+L IR +A+ + W +I V+ A
Sbjct: 351 FKLEQERKLVGLRTTQEEEALRGQLILDNQTK---ALLVIRSALAHAVDWSEISDMVEAA 407
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL- 233
+ DPVASII +LKL+ N ITL+L + ++ + AD S
Sbjct: 408 KEQKDPVASIIHKLKLDSNIITLML-TSPDAVEEEEDDNSEDEGADQAVSSKGKGSAKGG 466
Query: 234 -----------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL 282
VD+D+ S +ANA+ +F K+ AA K+Q+TI ++ KALKSAE++TKQ L
Sbjct: 467 KKGHHQQTRMKVDIDITASVHANAESYFSRKKQAAAKEQRTIDASSKALKSAERQTKQQL 526
Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYL 313
K V +NK RKV WFEKF WFI+SENYL
Sbjct: 527 KQVAVKATVNKVRKVLWFEKFLWFITSENYL 557
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 45/64 (70%), Positives = 56/64 (87%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWDAKV+T+AWWV A+QVSKTAPTGE+LTTGSFMIRG+KNF PP L +G FL++L+ES
Sbjct: 621 AWDAKVITSAWWVAANQVSKTAPTGEYLTTGSFMIRGRKNFLPPSPLILGFGFLYRLDES 680
Query: 377 SISR 380
I++
Sbjct: 681 CIAK 684
>gi|398396540|ref|XP_003851728.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
gi|339471608|gb|EGP86704.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
Length = 1060
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 130/410 (31%), Positives = 194/410 (47%), Gaps = 102/410 (24%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP P+ + + +EF F VDEFFS+ E K++ K ++E A ++
Sbjct: 293 DFHPF------KPAQLTAEETIVFREFSGFNKTVDEFFSSIEGQKLESKLQEREDHAKRR 346
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
+E + + R+ L++ Q ++ KA+ I N E V+ A A+ IA + W DI +
Sbjct: 347 IEQAREEQAKRIDGLQEVQELNIRKAQAIEANVERVEEATAAVNGLIAQGMDWVDIGKLI 406
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFAD------ 224
+ Q+ ++ VA +IK LKL+ N +TLLL S + D D A+
Sbjct: 407 ENEQKRHNAVAELIKLPLKLHENTVTLLL-----------SELDAADGGDDEANETDSEP 455
Query: 225 -------------SSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
E K+ + +D+DL S +ANA++++D KRSAA KQ+KT Q+++KAL
Sbjct: 456 DDSDDEDAAPAAKGGEDKRLT-IDIDLAASGWANARQYYDQKRSAATKQEKTAQASQKAL 514
Query: 272 KSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL---------------- 313
KS E++ LK Q + RK +WFEKF +F+SS+ YL
Sbjct: 515 KSTEQRVMADLKKGLKQEKDVLRPVRKQFWFEKFIYFLSSDGYLVLAGKDAQQNEILYRR 574
Query: 314 ----GNV--------------------------------AWDAKVVTNAWW--------- 328
G+V A + V T++ W
Sbjct: 575 YLKKGDVYVNADLQGAASVIIKNNPATPEAPIPPSTLSQAGNLAVCTSSAWESKAVMSAW 634
Query: 329 -VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
V ADQVSKTAPTGE+LT G F+IRGKKN PP QL +G +F++ E S
Sbjct: 635 WVNADQVSKTAPTGEYLTNGGFVIRGKKNHLPPAQLLLGFGVMFQISEES 684
>gi|358369883|dbj|GAA86496.1| DUF814 domain protein [Aspergillus kawachii IFO 4308]
Length = 1157
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 140/435 (32%), Positives = 210/435 (48%), Gaps = 92/435 (21%)
Query: 28 SSQGYLVQKKEKRPNSE-------------DYIVANMEFHPMILQPGSTPSVMRQHQGFP 74
S GY+V K++ RP+++ Y+ + FHP P G
Sbjct: 312 SHPGYIVAKEDTRPSADSPAQGEEDAARKPGYLYED--FHPF------KPKQFEGKPGVT 363
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
I E+ SF A VDE+FS+ E+ K++ + ++E A +KLE V+++H R+ AL++ Q +
Sbjct: 364 ILEYPSFNATVDEYFSSIETQKLESRLTEREETAKRKLEAVRQEHAKRIGALKEVQELHI 423
Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNIN 193
KA I +N V A+ A+ IA + W +I ++ Q +PVA+IIK LKL N
Sbjct: 424 RKAGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYEN 483
Query: 194 HITLLLRVHKERSNIKDSW--VLHHDNADPFADSSETKKPS----LVDVDLDLSAYANAK 247
ITL+L E + + ++ D ++++ +K S +D+DL LS +ANA
Sbjct: 484 TITLMLGESGEEQDEGEDLFSDDESESEDEQEEAAKAQKQSNNMLTIDIDLGLSPWANAT 543
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
++++ K+ AA K+QKT QS+ KALKS EKK Q LK Q + ARK +WFEKF +
Sbjct: 544 QYYEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLF 603
Query: 306 FISSENYL--------------------GNV----------------------------- 316
FISSE YL G+V
Sbjct: 604 FISSEGYLVLGGRDAMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSSNAPIPPST 663
Query: 317 ---AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
A + V T++ W V A QVSKTA G L TG F+I+G+KNF P QL
Sbjct: 664 LSQAGNLCVATSSAWDSKAIMSAYWVTASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQL 723
Query: 364 AMGISFLFKLEESSI 378
+G +F++ + S+
Sbjct: 724 VLGFGVMFQVSKESL 738
>gi|350636898|gb|EHA25256.1| hypothetical protein ASPNIDRAFT_49657 [Aspergillus niger ATCC 1015]
Length = 1515
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 140/433 (32%), Positives = 204/433 (47%), Gaps = 88/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSEDYIVANME-----------FHPMILQPGSTPSVMRQHQGFPIQ 76
S GY+V K + RP+++ E FHP P G I
Sbjct: 262 SHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPF------KPKQFEGKPGVTIL 315
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E+ SF A VDE+FS+ E+ K++ + ++E A KKL+ V+++H R+ AL++ Q + K
Sbjct: 316 EYPSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRK 375
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
A I +N V A+ A+ IA + W +I ++ Q +PVA+IIK LKL N I
Sbjct: 376 AGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTI 435
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADS--SETKKPS----LVDVDLDLSAYANAKRF 249
TL+L E + + D+ ++ +K S +D+DL LS +ANA ++
Sbjct: 436 TLMLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQY 495
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
++ K+ AA K+QKT QS+ KALKS EKK Q LK Q + ARK +WFEKF +FI
Sbjct: 496 YEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLFFI 555
Query: 308 SSENYL--------------------GNV------------------------------- 316
SSE YL G+V
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615
Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
A + V T++ W V A QVSKTA G L TG F+I+G+KNF P QL +
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675
Query: 366 GISFLFKLEESSI 378
G +F++ + S+
Sbjct: 676 GFGVMFQVSKESL 688
>gi|367042422|ref|XP_003651591.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
gi|346998853|gb|AEO65255.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
Length = 1094
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 137/431 (31%), Positives = 209/431 (48%), Gaps = 80/431 (18%)
Query: 28 SSQGYLVQKKEKRPNSEDYIVANMEFHPM----ILQPGSTPSVMRQHQGFP---IQEFES 80
S++GY++ K+ P + D + E +L P + +Q + P + F+
Sbjct: 257 SARGYIIAKRN--PRAADQVADGEETTKEKAQNLLYEDFQPFLPKQFEDDPTCQVLSFDG 314
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F+ VDEFFS+ E K++ + ++E A +KLE +RD R+ L++ QL++ KA I
Sbjct: 315 FSKTVDEFFSSLEGQKLESRLQEREATAKRKLEAARRDQAQRIEGLQEAQLLNLRKAAAI 374
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N E V A+ A+ + + W DI V++ QR ++PVA IIK ++L + ITLLL
Sbjct: 375 EANVERVQEAMDAVNGLLQQGMDWVDINKLVEREQRLHNPVAEIIKLPMRLEESIITLLL 434
Query: 200 RVHKERSNIKDSWVLHH---DNADPFADSSETKKPS---LVDVDLDLSAYANAKRFFDLK 253
+E + + + + + A + + K P +D++L LS + NA+ +++ K
Sbjct: 435 GEEEEEAEAEANMDFDYDTDEEAAEETAAGKAKGPDKRLAIDINLKLSPWNNAREYYEQK 494
Query: 254 RSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSEN 311
R+AA KQQKTIQ +E AL++AEKK + LK Q + RK WFEKF WFISS+
Sbjct: 495 RTAADKQQKTIQQSEIALRNAEKKISEDLKKGLKQEKPVLQLIRKQMWFEKFLWFISSDG 554
Query: 312 YLGNVAWDAK-------------------------------------------------- 321
YL DA+
Sbjct: 555 YLVLGGRDAQQNEILYKRYLRKGDVYVHADMHGAPSVIIKNNPKTPDAPIPPSTLAQAGS 614
Query: 322 ---VVTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
++AW KA DQVSK+APTGE+L GSFM+RGK+N PP L +G
Sbjct: 615 LSVCCSSAWDSKAVMGAWWVNADQVSKSAPTGEYLPAGSFMVRGKRNALPPALLTLGFGL 674
Query: 370 LFKLEESSISR 380
+FK+ E S S+
Sbjct: 675 MFKISEDSKSK 685
>gi|225563152|gb|EEH11431.1| DUF814 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 1158
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 137/438 (31%), Positives = 210/438 (47%), Gaps = 85/438 (19%)
Query: 20 VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
V+ N+N S + GY+V K E + + + VA ++FHP
Sbjct: 233 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTDPSKSSNVAYIDFHPF------E 286
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P G I F++F AVDE+FS+ ES K++ + ++E A +KLE K D + R+
Sbjct: 287 PKQFESEPGTSILRFDTFNKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAKTDQDKRV 346
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L++ Q + KA+ I N V+ A+ A+ IA + W +I ++ Q +PVA
Sbjct: 347 GVLKEAQELHIRKAQAIEANLLRVEEAVNAVNGLIAQGMDWGEIARLIEMEQSRQNPVAK 406
Query: 184 IIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFAD--------------SSET 228
+IK LKL N +TLLL E D + + S +T
Sbjct: 407 VIKLPLKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSQKT 466
Query: 229 KKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLK 283
++P L +D+DL +S +ANA+++++ K++AA K++KT+ ST+KA+KS EKK KQ LK
Sbjct: 467 RQPLLSIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKKAIKSTEKKVAADLKQALK 526
Query: 284 DVQTMTNINK----ARKVYWFEKFYW--------FISSE--------------------- 310
+ + + R V E Y F+ ++
Sbjct: 527 QEKPVLRPTRTPFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPIIVKNKPGTPDAPIP 586
Query: 311 ----NYLGNV------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
+ GN+ AWD+K V AWWV ADQVSKT P GE+L TG F+I G+KN PP
Sbjct: 587 PGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFVICGEKNQLPP 646
Query: 361 CQLAMGISFLFKLEESSI 378
QL +G + +F++ SI
Sbjct: 647 AQLLLGFAVMFQISGESI 664
>gi|259479735|tpe|CBF70228.1| TPA: DUF814 domain protein, putative (AFU_orthologue; AFUA_2G09170)
[Aspergillus nidulans FGSC A4]
Length = 1100
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 135/431 (31%), Positives = 207/431 (48%), Gaps = 89/431 (20%)
Query: 31 GYLVQKKEKRP-----------NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
G++V K++ RP + + +FHP P GF I E+
Sbjct: 265 GFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPF------KPRQFEGKDGFTILEYP 318
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
S A VDE+FS+ ES K++ + ++E A KKL++++ +HE R+ ALEQ Q + KA
Sbjct: 319 SMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKASA 378
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
I +N + V A+ A+ +A + W +I V+ Q+ +PVAS+IK LKL+ N ITLL
Sbjct: 379 IQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITLL 438
Query: 199 LR-VHKERSNIKDSWVLHHDNADPF----ADSSETKKPS--LVDVDLDLSAYANAKRFFD 251
LR E +++ + +S KKP +D+DL LS +ANA ++++
Sbjct: 439 LREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYYE 498
Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEKFYWFISS 309
K+ AA K +KT QS+ KALKS E+K + LK Q + ARK +WFEKF +F+SS
Sbjct: 499 QKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQVLRPARKPFWFEKFLFFVSS 558
Query: 310 ENYL--------------------GNV--------------------------------A 317
E YL G+V A
Sbjct: 559 EGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKNKPGALSSSISPTTLSQA 618
Query: 318 WDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
+ V T+ W V A QVSKT+ G+ L G F+++G+KNF P QL +G
Sbjct: 619 GNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQLVLGF 678
Query: 368 SFLFKLEESSI 378
+ ++++ + S+
Sbjct: 679 AVMWQISKGSL 689
>gi|317033383|ref|XP_001395552.2| hypothetical protein ANI_1_620104 [Aspergillus niger CBS 513.88]
Length = 1108
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 140/433 (32%), Positives = 204/433 (47%), Gaps = 88/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSEDYIVANME-----------FHPMILQPGSTPSVMRQHQGFPIQ 76
S GY+V K + RP+++ E FHP P G I
Sbjct: 262 SHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPF------KPKQFEGKPGVTIL 315
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E+ SF A VDE+FS+ E+ K++ + ++E A KKL+ V+++H R+ AL++ Q + K
Sbjct: 316 EYPSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRK 375
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
A I +N V A+ A+ IA + W +I ++ Q +PVA+IIK LKL N I
Sbjct: 376 AGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTI 435
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADS--SETKKPS----LVDVDLDLSAYANAKRF 249
TL+L E + + D+ ++ +K S +D+DL LS +ANA ++
Sbjct: 436 TLMLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQY 495
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
++ K+ AA K+QKT QS+ KALKS EKK Q LK Q + ARK +WFEKF +FI
Sbjct: 496 YEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLFFI 555
Query: 308 SSENYL--------------------GNV------------------------------- 316
SSE YL G+V
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615
Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
A + V T++ W V A QVSKTA G L TG F+I+G+KNF P QL +
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675
Query: 366 GISFLFKLEESSI 378
G +F++ + S+
Sbjct: 676 GFGVMFQVSKESL 688
>gi|119480773|ref|XP_001260415.1| hypothetical protein NFIA_084700 [Neosartorya fischeri NRRL 181]
gi|119408569|gb|EAW18518.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 1116
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 139/430 (32%), Positives = 210/430 (48%), Gaps = 92/430 (21%)
Query: 31 GYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGSTPSVMRQHQGFP---IQE 77
GY++ K++KRP +E + +FHP +P RQ +G P I E
Sbjct: 265 GYIIAKEDKRPTAESTADTGDPSQKAGLLYEDFHP--FRP-------RQFEGKPEVTILE 315
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F +F A VDE+FS+ E+ K++ + ++E A +KL+ V+++HE RL AL++ Q + KA
Sbjct: 316 FSTFNATVDEYFSSLETQKLESRLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRKA 375
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
I +N V + A+ IA + W +I ++ Q +PVA IIK LKL N IT
Sbjct: 376 AAIEDNVYRVQEVMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTIT 435
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSET--KKPSL--VDVDLDLSAYANAKRFFDL 252
L+L E + D + ++ E K P + +D+DL LS +ANA ++++
Sbjct: 436 LVLGEASEEQDAADDLFSDESEEESESEEQEAARKAPEMLTIDIDLGLSPWANATQYYEQ 495
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
K+ AA K+QKT QS+ KALKS EKK + LK Q + ARK +WFEKF +FISSE
Sbjct: 496 KKMAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFLFFISSE 555
Query: 311 NYL--------------------GNV--------------------------------AW 318
YL G++ A
Sbjct: 556 GYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQAG 615
Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
+ V T++ W V A+QV+KT TG L TG F ++G+KNF P QL +G +
Sbjct: 616 NLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEVKGEKNFLAPSQLVLGFA 674
Query: 369 FLFKLEESSI 378
+F++ + S+
Sbjct: 675 VMFQISKESL 684
>gi|134080270|emb|CAK97173.1| unnamed protein product [Aspergillus niger]
Length = 1180
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 140/433 (32%), Positives = 204/433 (47%), Gaps = 88/433 (20%)
Query: 28 SSQGYLVQKKEKRPNSEDYIVANME-----------FHPMILQPGSTPSVMRQHQGFPIQ 76
S GY+V K + RP+++ E FHP P G I
Sbjct: 334 SHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPF------KPKQFEGKPGVTIL 387
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E+ SF A VDE+FS+ E+ K++ + ++E A KKL+ V+++H R+ AL++ Q + K
Sbjct: 388 EYPSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRK 447
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
A I +N V A+ A+ IA + W +I ++ Q +PVA+IIK LKL N I
Sbjct: 448 AGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTI 507
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADS--SETKKPS----LVDVDLDLSAYANAKRF 249
TL+L E + + D+ ++ +K S +D+DL LS +ANA ++
Sbjct: 508 TLMLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQY 567
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
++ K+ AA K+QKT QS+ KALKS EKK Q LK Q + ARK +WFEKF +FI
Sbjct: 568 YEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLFFI 627
Query: 308 SSENYL--------------------GNV------------------------------- 316
SSE YL G+V
Sbjct: 628 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 687
Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
A + V T++ W V A QVSKTA G L TG F+I+G+KNF P QL +
Sbjct: 688 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 747
Query: 366 GISFLFKLEESSI 378
G +F++ + S+
Sbjct: 748 GFGVMFQVSKESL 760
>gi|378733722|gb|EHY60181.1| translation factor [Exophiala dermatitidis NIH/UT8656]
Length = 1147
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 140/429 (32%), Positives = 207/429 (48%), Gaps = 81/429 (18%)
Query: 29 SQGYLVQKKEKRPNSEDYIVANMEFHPM-----ILQPGSTPSVMRQHQGFP---IQEFES 80
S+GY++ K K P+SE A + P +L P Q +G P I EF+
Sbjct: 264 SKGYIIAKI-KSPSSEAPQEAEAQTQPSSGRDNLLYEDFHPFRPSQFEGKPDLRILEFDG 322
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F VDEF+S+ ES K++ + ++E A KKL+ K + E RL AL+ Q + KA+ I
Sbjct: 323 FNRTVDEFYSSIESQKLESRLTEREEAARKKLQAAKEEQEKRLGALQHVQELHVRKAQAI 382
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N V+ A A+ I + W DI ++ Q+ + VA +IK LKL N +TLLL
Sbjct: 383 EANTHRVEEACAAVNGLIGQGMDWVDIGKLIENEQKRGNVVAQMIKLPLKLEENTVTLLL 442
Query: 200 ---RVHKERSNIKDSWVLHHDNADPFADSSETKKPS----LVDVDLDLSAYANAKRFFDL 252
++E + +N+D TK + +D+DL LS +ANA+++++
Sbjct: 443 DEPGFNEESEEDEPDETDEEENSDEDTRKKPTKPATDKRLAIDIDLGLSPWANARQYYEQ 502
Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
K++AA K+++T+++ ALKSAE+K + LK Q + ARK +WFEKF +FISS+
Sbjct: 503 KKNAAVKEKRTLEAATMALKSAERKIEADLKRGLKQEKAALRPARKQFWFEKFLYFISSD 562
Query: 311 NYL--------------------GNV--------------------------------AW 318
YL G+V A
Sbjct: 563 GYLVIGGKDAQQNELLYRRYLKRGDVYVHADLQGASSVIVKNNPRTPDAPIPPSTLSQAG 622
Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
V T++ W V A+QVSKTAP+GE+LTTG F+IRG KN PP QL +G
Sbjct: 623 ALTVCTSSAWDSKAVMGAWWVNAEQVSKTAPSGEYLTTGGFIIRGHKNLLPPSQLLLGFG 682
Query: 369 FLFKLEESS 377
L+ + E S
Sbjct: 683 VLWLISEES 691
>gi|328774280|gb|EGF84317.1| hypothetical protein BATDEDRAFT_8510 [Batrachochytrium
dendrobatidis JAM81]
Length = 695
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 126/429 (29%), Positives = 195/429 (45%), Gaps = 111/429 (25%)
Query: 20 VLVSNLNVSSQGYLVQK-------------KEKRPNSEDYIVANMEFHPMILQPGSTPSV 66
+L S+++ QGY+V + P + EFHP + +
Sbjct: 186 ILSSSIDTPQQGYIVTSDSMVSQQLASSDTAQSSPTTT--FTTYQEFHPYRFEQFNQDRS 243
Query: 67 MRQHQGFPIQ----EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P Q EF SF AVDE+FS ES +++++A Q E A+KKLENVK+ H+ +
Sbjct: 244 TSLSAELPKQTRFMEFVSFDKAVDEYFSKMESQRLEIRAHQAELAAVKKLENVKKSHQAQ 303
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ + +++ A+ I + E +D+ + ++ +A+ + W+D+E VK+ + + +A
Sbjct: 304 IQNFQSNVESNEQYAQAIESRLEDIDSVLRTVQSFLASGMDWKDLEDLVKEETNNGNALA 363
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I KLN+ +D+D+ +A
Sbjct: 364 KMIIGFKLNMEFFK-------------------------------------IDLDIYSTA 386
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFE 301
YANA+R++ K+ A KQ KT++ + K +K AE K Q L VQ T +I K RK YWFE
Sbjct: 387 YANARRYYGAKKVAITKQSKTMEQSAKVVKMAEMKIFQHLASVQKTAVSITKIRKPYWFE 446
Query: 302 KFYWFISSENYL-----------------------------------------GNVAWDA 320
KF WF+SSEN+L V DA
Sbjct: 447 KFLWFVSSENFLVVGGKDATQSNMLVTRYLKKGDAYVHSDLPGAASVIVKCMQSCVGTDA 506
Query: 321 -------------KVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
K++T+A+W +A QVSKT TG+ L G+FMIRGKKN+ PP QL G+
Sbjct: 507 GTMSVCQSRAWDAKIITSAYWAEAHQVSKTTSTGDTLPLGTFMIRGKKNWLPPVQLIYGM 566
Query: 368 SFLFKLEES 376
+ LF+ + S
Sbjct: 567 AMLFQTDHS 575
>gi|358379255|gb|EHK16935.1| hypothetical protein TRIVIDRAFT_10609, partial [Trichoderma virens
Gv29-8]
Length = 1079
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 136/436 (31%), Positives = 205/436 (47%), Gaps = 79/436 (18%)
Query: 21 LVSNL--NVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFP---I 75
+V N+ + + +GY+ KK+ P+S + H +L P V ++ + P +
Sbjct: 257 IVENIIASATCKGYIFAKKKTAPSSGPDETDQAQKHEGLLYEDFHPFVPQKFKNDPSIQV 316
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
EFE + VDEFFS+ E K++ + +E A KKLE + + R+ L+ Q ++
Sbjct: 317 LEFEGYNRTVDEFFSSLEGQKLESRLSGREEAAKKKLEAARHEQAKRIEGLQDAQAMNLR 376
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINH 194
KA I N E V A+ A+ +A + W DI +++ ++ +PVA II LKL N
Sbjct: 377 KAAAIEANVERVQEAMDAVNGLLAQGMDWVDIGKLIEREKKRQNPVAEIISLPLKLAENT 436
Query: 195 ITLLLRVHKERSNIKDSW-VLHHDNADPFADSSETK--------KPSLVDVDLDLSAYAN 245
ITLLL + + D++D A++SE K VD+ L+ S ++N
Sbjct: 437 ITLLLAEEEFDEDEAAEDNPFETDDSDSEAEASEVTPTKDKKADKLLTVDIVLNTSPWSN 496
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
A+ +++ +RSAA K++KT KALKS E+K + LK Q + RK WFEKF
Sbjct: 497 AREYYEERRSAAMKEEKTQLQANKALKSTEQKIAEDLKKGLKQEKALLQPIRKQMWFEKF 556
Query: 304 YWFISSENYL--------------------GNVAWDAKV--------------------- 322
WFISS+ YL G+V A +
Sbjct: 557 IWFISSDGYLVLGGKDPQQSEMLYRRYLRKGDVYCHADIRGAAHIVIKNNPNTPDAPIPP 616
Query: 323 ------------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
++AW WV ADQVSK+ PTG+ L G+F I+GKKN+ PP
Sbjct: 617 ATLSQAGSLSVCTSDAWDSKAGMGGWWVNADQVSKSTPTGDILPAGNFTIQGKKNYLPPT 676
Query: 362 QLAMGISFLFKLEESS 377
QL +G+ F FK+ E S
Sbjct: 677 QLLLGLGFTFKISEQS 692
>gi|255556494|ref|XP_002519281.1| conserved hypothetical protein [Ricinus communis]
gi|223541596|gb|EEF43145.1| conserved hypothetical protein [Ricinus communis]
Length = 1092
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 116/320 (36%), Positives = 177/320 (55%), Gaps = 37/320 (11%)
Query: 16 PTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
P ++L+ N NV + P+SE EF P++L MR++
Sbjct: 300 PEGYILMQNKNVG---------KNHPSSESAFKIYDEFCPILLNQFK----MREYV---- 342
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
+F++F AA+DEF+S ES + + + +E A++KL ++ D E R+ L + +
Sbjct: 343 -KFDTFDAALDEFYSKIESQRAEQQQKTKENSAIQKLNKIRLDQENRVLTLRKEVDLCVR 401
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
KAELI N E VDAAILA+R +A +SWED+ VK+ ++ +PVAS+I +L L N +
Sbjct: 402 KAELIEYNLEDVDAAILAVRVALAKGMSWEDLTRMVKEEKKLGNPVASLIDKLHLERNCM 461
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
TLLL +N D D +T V++DL LSA+ANA+R++++K+
Sbjct: 462 TLLL----------------SNNLDDMDDDEKTLPVDKVEIDLALSAHANARRWYEMKKK 505
Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-- 313
KQ KT+ + EKA K+AE+KT+ L +++ I+ RKV+WFEKF WFISSENYL
Sbjct: 506 QESKQGKTVTAHEKAFKAAERKTRLQLSQEKSVATISHMRKVHWFEKFNWFISSENYLVI 565
Query: 314 -GNVAWDAKVVTNAWWVKAD 332
G A +++ + K D
Sbjct: 566 SGRDAQQNEMIVKRYMSKGD 585
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 43/63 (68%), Positives = 50/63 (79%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF P L MG LF+L+ES
Sbjct: 627 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLSPHPLIMGFGLLFRLDES 686
Query: 377 SIS 379
S+
Sbjct: 687 SLG 689
>gi|212529000|ref|XP_002144657.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
gi|210074055|gb|EEA28142.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
Length = 1117
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 138/434 (31%), Positives = 199/434 (45%), Gaps = 93/434 (21%)
Query: 30 QGYLVQKKEKRPNSEDYIVANM--------EFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
+GY+V K+ R ED V + +FHP P G+ I E++SF
Sbjct: 263 KGYIVAKE--RAKKEDQQVDSTSKENLLFEDFHPF------RPRQFEGKPGYHILEYDSF 314
Query: 82 AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
VDE+FS+ ES K++ + + E A +KLE + DH+ R AL+Q Q + KAE I
Sbjct: 315 NKTVDEYFSSIESQKLESRLAEHEETAKRKLETARADHQNRAGALKQAQELHIRKAEAIQ 374
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLR 200
N V A A+ IA + W +I ++ Q+ N+PVA IK LKL N ITLLL
Sbjct: 375 ANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQQRNNPVAQTIKLPLKLYENTITLLLS 434
Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK------------KPSLVDVDLDLSAYANAKR 248
+ + SE + +P +D+DL LS ++NA +
Sbjct: 435 EENTEVEEEQEEFSESEPEVSEDSDSENEIEKDEGPKQKIAEPLAIDIDLSLSPWSNATQ 494
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEKFYWF 306
+++ KR+AA K+QKTIQS+EKALKS EKK + LK Q + +RK +WFEK+ +F
Sbjct: 495 YYEQKRTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPSRKPFWFEKYLYF 554
Query: 307 ISSENYL--------------------GNV------------------------------ 316
ISSE YL G+V
Sbjct: 555 ISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTPDAPIPPGTL 614
Query: 317 --AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLA 364
A V T+ W V A QVS+T GE L G+FM++G+KN+ P Q
Sbjct: 615 TQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLANGAFMVKGEKNYLAPGQPI 674
Query: 365 MGISFLFKLEESSI 378
+G + LF++ + S+
Sbjct: 675 LGFAVLFQISKESV 688
>gi|226292279|gb|EEH47699.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 1261
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 129/403 (32%), Positives = 196/403 (48%), Gaps = 82/403 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP P G I F +F AVDE+FS+ ES K++ + ++E A +K
Sbjct: 400 DFHPF------KPKQFENVPGMTILTFNTFNKAVDEYFSSVESQKLEYRLTEREEIARRK 453
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE ++D E R+ AL++ Q + KA+ I N V+ AI A+ IA + W +I +
Sbjct: 454 LEAAQKDQENRVGALKEVQELHVRKAQAIEANLLRVEEAINAVNGLIAQGMDWVEIARLI 513
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL----------RVHKERSNIKDSWVLHHDNAD 220
+ + +PVA +IK LKL N +TLLL E +S N
Sbjct: 514 EMEKSRQNPVAKVIKLPLKLYENTVTLLLGEPTEDEEPADESDEEEEDSESGDEDGGNKV 573
Query: 221 PFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
S + ++ L +D+DL +S +ANA+++++ +++AA K++KT++ST+KA+KS EKK
Sbjct: 574 KLERSKKAQQQLLSIDIDLGISPWANARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVT 633
Query: 280 QTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVA 317
LK Q + R +WFEKF +F+SS+ YL G+V
Sbjct: 634 TDLKHALKQEKPILRPTRTPFWFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVY 693
Query: 318 WDAKV---------------------------------VTNAW---------WVKADQVS 335
A V ++AW WV ADQVS
Sbjct: 694 VHADVQGATPIFVKNKPGTPDAPIPPGTLSQAGNLCVATSSAWDSKAVMGAWWVNADQVS 753
Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
KTAP+GEF+ TG F+IRG+K+ PP QL +G + +F++ E SI
Sbjct: 754 KTAPSGEFVGTGGFVIRGEKHQLPPAQLLLGYAVMFQISEDSI 796
>gi|169612956|ref|XP_001799895.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
gi|111061751|gb|EAT82871.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
Length = 1132
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 132/408 (32%), Positives = 194/408 (47%), Gaps = 86/408 (21%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP Q + + EF+ F AVDEFFS+ E K++ + ++E+ A KK
Sbjct: 297 DFHPFRPQ-------QFEESDYTFLEFDGFNKAVDEFFSSIEGQKLESRLTEREQQAKKK 349
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE +R+HE RL L+Q Q V+ KAE I+ N V A A+ I + W DI + +
Sbjct: 350 LEKARREHEERLGGLQQVQEVNFRKAEAILANVHRVAEATEAVNGLIRQGMDWGDIASLI 409
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL------RVHKERSNIKDSWVLHHDNADPFAD 224
++ Q H + VA IK LKL+ N ITLLL ++ ++ + D+ D
Sbjct: 410 EREQSHGNAVAETIKLPLKLHENTITLLLDETDFDHAEEDDDEGNETSSVSEDSEDEDEG 469
Query: 225 SSETKKPSL--------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEK 276
+ P+ +D+DL LS +AN+ ++D K++AA K+ +T+Q++ KALKS EK
Sbjct: 470 PKKKAAPAKPAARPKLAIDIDLALSPWANSTEYYDQKKTAASKEDRTLQASTKALKSHEK 529
Query: 277 KTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK------------- 321
K + LK Q + RK WFEKF +FISS+ YL DA+
Sbjct: 530 KVAEDLKKGLKQEKDILRPVRKQQWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRYFRKG 589
Query: 322 --------------VVTN--------------------------AWWVKA---------D 332
++ N AW KA D
Sbjct: 590 DVYVHADLKGAVPMIIKNKPTTPDAPIPPSTLSQAGHLSVCSSDAWESKAVMSAWWVLAD 649
Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
QVSKT TGEFL G F I+GKK + PP QL +G++ +F++ E+S +R
Sbjct: 650 QVSKTGQTGEFLPPGLFNIKGKKEYLPPAQLIVGLAVMFEISEASKAR 697
>gi|240275734|gb|EER39247.1| DUF814 domain-containing protein [Ajellomyces capsulatus H143]
Length = 1183
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 135/438 (30%), Positives = 210/438 (47%), Gaps = 85/438 (19%)
Query: 20 VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
V+ N+N S + GY+V K E + + + VA ++FHP
Sbjct: 258 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPF------E 311
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P G I F++F+ AVDE+FS+ ES K++ + ++E A +KLE ++D + R+
Sbjct: 312 PKQFESEPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRV 371
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L++ Q + KA+ I N V+ AI A+ IA + W +I ++ Q +PVA
Sbjct: 372 GVLKEAQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAK 431
Query: 184 IIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFAD--------------SSET 228
+IK LKL N +TLLL E D + + S +T
Sbjct: 432 VIKLPLKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKT 491
Query: 229 KKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLK 283
++P L +D+DL +S +ANA+++++ K++AA K++KT+ ST+ A+KS EKK KQ LK
Sbjct: 492 RQPLLSIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALK 551
Query: 284 DVQTMTNINK----ARKVYWFEKFYW--------FISSE--------------------- 310
+ + + R V E Y F+ ++
Sbjct: 552 QEKPVLRPTRTPFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPIIVKNKPGTPDAPIP 611
Query: 311 ----NYLGNV------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
+ GN+ AWD+K V AWWV ADQVSKT P GE+L TG F+I G+KN P
Sbjct: 612 PGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFVICGEKNHLSP 671
Query: 361 CQLAMGISFLFKLEESSI 378
QL +G + +F++ SI
Sbjct: 672 AQLLLGFAVMFQISGESI 689
>gi|391869409|gb|EIT78607.1| putative RNA-binding protein [Aspergillus oryzae 3.042]
Length = 1103
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 133/439 (30%), Positives = 211/439 (48%), Gaps = 90/439 (20%)
Query: 24 NLNVSSQ----GYLVQKKEKRP-------NSEDYIVANM---EFHPMILQPGSTPSVMRQ 69
N +S+Q GY+V K + R N + N+ +FHP P
Sbjct: 253 NTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNLLYEDFHPF------KPRQFEG 306
Query: 70 HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
G I EF S A VDE+FS+ E+ K++ + ++E A +KLE V+++HE ++ AL++
Sbjct: 307 KPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALKEQ 366
Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QL 188
Q + KA I +N V A+ A+ IA + W +I ++ Q +PVA IIK L
Sbjct: 367 QELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKLPL 426
Query: 189 KLNINHITLLLRVHKERSNIKDSWVL--HHDNADPFADSSETKKPS---LVDVDLDLSAY 243
KL+ N ITLLL + + D + ++ D+ E+++P +D+DL +S +
Sbjct: 427 KLHENTITLLLGEAGDEQDEGDELFSSDESEKSEDEQDNGESQQPPSVLTIDIDLGISPW 486
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFE 301
ANAK++++ K+ AA K+Q+T QS+ KALKS EKK + LK + + + R+ +WFE
Sbjct: 487 ANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFWFE 546
Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
KF +FISSE YL G++
Sbjct: 547 KFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTAPI 606
Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
A + V T++ W V+A Q++KTA G L G F+++G+KNF
Sbjct: 607 PPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNFLA 666
Query: 360 PCQLAMGISFLFKLEESSI 378
P QL +G F++ + S+
Sbjct: 667 PSQLVLGFGVTFQISKDSL 685
>gi|342879256|gb|EGU80511.1| hypothetical protein FOXB_08971 [Fusarium oxysporum Fo5176]
Length = 1060
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 138/460 (30%), Positives = 209/460 (45%), Gaps = 91/460 (19%)
Query: 2 VSNEDVSSDIHKCGPTLHVLVSNLNVS--SQGYLVQKKEKRPNSEDYI---------VAN 50
++NE + +D+ K V L S GY+ K+ +R + +
Sbjct: 218 LANETLLADLVKSLQEARQSVEELTSSEACTGYIFAKRRERTEGAEATDESKTQRDNLLY 277
Query: 51 MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALK 110
+FHP + P +++ + EF + VDEFFS+ E +++ + ++E A +
Sbjct: 278 EDFHPFV------PYKLKKDPTIEVLEFTGYNETVDEFFSSLEGQRLESRLSEREAAAKR 331
Query: 111 KLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170
KLE + + R+ L++ Q ++ KA I N E V A+ A+ ++ + W D+
Sbjct: 332 KLEAARNEQSKRIEGLQEAQALNFRKAAAIEANAERVQEAMDAVNGLLSQGMDWVDVGKL 391
Query: 171 VKQAQRHNDPVASIIKQLKLNI--NHIT-------LLLRVHKERSNIKDSWVLHHDNADP 221
V++ ++ ++PVA IIK L LN+ N IT D L D
Sbjct: 392 VEREKKRHNPVAEIIK-LPLNLAENLITLELAEEEFEPEEDDPYETDDDDSALGDDEGTS 450
Query: 222 FADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
A + K V+++L LS ++NA+ +FD +++AA K++KT Q +ALK+AE+K +
Sbjct: 451 AAKGKQANKALSVEINLGLSPWSNAREYFDQRKTAAVKEEKTQQQASRALKNAEQKITED 510
Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAWD 319
LK Q + RK WFEKF WFISS+ YL G+V
Sbjct: 511 LKKGLKQEKALLQPIRKPMWFEKFVWFISSDGYLVIGGKDAQQNEMIYKKYLRKGDVYCH 570
Query: 320 AKV---------------------------------VTNAW---------WVKADQVSKT 337
A + +NAW WV ADQVSK+
Sbjct: 571 ADLHGASSVIIKNNPKTPDAPIPPATLSQAGSLAVCSSNAWDSKAGMSAWWVNADQVSKS 630
Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
APTGEFL GSFMIRGKKNF PP QL +G+ FK+ E S
Sbjct: 631 APTGEFLPAGSFMIRGKKNFLPPAQLLLGLGVAFKISEES 670
>gi|295673284|ref|XP_002797188.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226282560|gb|EEH38126.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 1258
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 136/435 (31%), Positives = 210/435 (48%), Gaps = 91/435 (20%)
Query: 29 SQGYLVQKKEKR-------PNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEF 78
+ GY++ K E + +++ NM +FHP P G I +F
Sbjct: 461 TPGYIISKAESKTGEAITEADTDSPKPKNMLYHDFHPF------EPKQFENVPGMTILKF 514
Query: 79 ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
++F AVDE+FS+ ES K++ + ++E A +KLE ++D E R+ AL++ Q + KA+
Sbjct: 515 KTFNKAVDEYFSSVESQKLEYRLTEREEIARRKLEAAQKDQENRIGALKEVQELHVRKAQ 574
Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITL 197
I N V+ AI A+ IA + W +I ++ + +PVA++IK LKL N +TL
Sbjct: 575 AIEANLLRVEEAIKAVNGLIAQGMDWVEIARLIEMEKSRQNPVANVIKLPLKLYENTVTL 634
Query: 198 LL---------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAK 247
LL E +S N S + ++ L +D+DL +S +ANA+
Sbjct: 635 LLGEPTEDEEPADESEEEEDSESDDEDGGNKVKLEGSKKAQQQLLSIDIDLGISPWANAR 694
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
++++ KR AA K++KT++ST+KA+KS EKK LK Q + R +WFEKF +
Sbjct: 695 QYYEQKRVAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPILRPTRTPFWFEKFMF 754
Query: 306 FISSENYL--------------------GNVAWDAKV----------------------- 322
F+SS+ YL G+V A V
Sbjct: 755 FVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPGTPDAPIPPGT 814
Query: 323 ----------VTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
++AW KA DQVSKTAP+GEF+ TG F+IRG+K+ PP QL
Sbjct: 815 LSQAGNLCVASSSAWDSKAVMGAWWVNADQVSKTAPSGEFVGTGGFVIRGEKHQLPPAQL 874
Query: 364 AMGISFLFKLEESSI 378
+G + +F++ E SI
Sbjct: 875 LLGFAVMFQISEDSI 889
>gi|355718192|gb|AES06188.1| serologically defined colon cancer antigen 1 [Mustela putorius
furo]
Length = 547
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 102/210 (48%), Positives = 120/210 (57%), Gaps = 59/210 (28%)
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
+P LVDVDL LSAYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T
Sbjct: 39 RPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVT 98
Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-- 338
+I KARKVYWFEKF WFISSENYL D + + T +V AD T+
Sbjct: 99 SIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCV 158
Query: 339 ---PTG---------------------------------------------EFLTTGSFM 350
PTG E+LTTGSFM
Sbjct: 159 IKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFM 218
Query: 351 IRGKKNFFPPCQLAMGISFLFKLEESSISR 380
IRGKKNF PP L MG SFLFK++ES + R
Sbjct: 219 IRGKKNFLPPSYLMMGFSFLFKVDESCVWR 248
>gi|121698891|ref|XP_001267840.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
gi|119395982|gb|EAW06414.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
Length = 1111
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 142/431 (32%), Positives = 208/431 (48%), Gaps = 94/431 (21%)
Query: 31 GYLVQKKEKRPNSEDYIVANME----------FHPMILQPGSTPSVMRQHQGFP---IQE 77
GY+V K++ RP S N E FHP +P RQ +G P I E
Sbjct: 265 GYIVAKEDTRP-SAGPTAENQEGKKSGLLYQDFHP--FKP-------RQFEGKPEIKILE 314
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F SF A VDE+FS+ E+ K++ + ++E A +KL+ V+++HE RL AL+Q Q + KA
Sbjct: 315 FGSFNATVDEYFSSLETQKLESRLTEREEAAKRKLDAVRQEHEKRLGALKQAQELHIRKA 374
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
I +N V A+ A+ IA + W +I ++ Q +PVA IIK LKL N IT
Sbjct: 375 GAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQDRGNPVARIIKLPLKLYENTIT 434
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSE---TKKPSL--VDVDLDLSAYANAKRFFD 251
L+L E + D ++ + + K P + +D+DL LS +ANA ++++
Sbjct: 435 LVLGEASEEQDEADELFSDDESEEDSESEEQEAIKKAPEMLTIDIDLGLSPWANATQYYE 494
Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISS 309
K+ AA K+QKT QS+ KALKS EKK + LK Q + ARK +WFEKF +FISS
Sbjct: 495 QKKMAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFIFFISS 554
Query: 310 ENYL--------------------GNV--------------------------------A 317
E YL G++ A
Sbjct: 555 EGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQA 614
Query: 318 WDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
+ V T++ W V A+QV+KT TG L TG F +G+K+F P QL +G
Sbjct: 615 GNLSVATSSAWDSKAVMAAWWVNANQVTKTT-TGGLLPTGEFETKGEKSFLAPSQLVLGF 673
Query: 368 SFLFKLEESSI 378
+ +F++ + S+
Sbjct: 674 AVMFQISKESL 684
>gi|169783790|ref|XP_001826357.1| hypothetical protein AOR_1_1306054 [Aspergillus oryzae RIB40]
gi|83775101|dbj|BAE65224.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1103
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 133/439 (30%), Positives = 212/439 (48%), Gaps = 90/439 (20%)
Query: 24 NLNVSSQ----GYLVQKKEKRP-------NSEDYIVANM---EFHPMILQPGSTPSVMRQ 69
N +S+Q GY+V K++ R N + N+ +FHP P
Sbjct: 253 NTRLSTQESHPGYIVAKEDNRSVSQSANENEKPSETGNLLYEDFHPF------KPRQFEG 306
Query: 70 HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
G I EF S A VDE+FS+ E+ K++ + ++E A +KLE V+++HE ++ AL++
Sbjct: 307 KPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALKEQ 366
Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QL 188
Q + KA I +N V A+ A+ IA + W +I ++ Q +PVA IIK L
Sbjct: 367 QELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKLPL 426
Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA--DSSETKKPS---LVDVDLDLSAY 243
KL+ N ITLLL + + D ++ + D+ E+++P +D+DL +S +
Sbjct: 427 KLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLGISPW 486
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFE 301
ANAK++++ K+ AA K+Q+T QS+ KALKS EKK + LK + + + R+ +WFE
Sbjct: 487 ANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFWFE 546
Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
KF +FISSE YL G++
Sbjct: 547 KFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTAPI 606
Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
A + V T++ W V+A Q++KTA G L G F+++G+KNF
Sbjct: 607 PPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNFLA 666
Query: 360 PCQLAMGISFLFKLEESSI 378
P QL +G F++ + S+
Sbjct: 667 PSQLVLGFGVTFQISKDSL 685
>gi|302854251|ref|XP_002958635.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
nagariensis]
gi|300256024|gb|EFJ40301.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
nagariensis]
Length = 744
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 133/398 (33%), Positives = 188/398 (47%), Gaps = 80/398 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
EF P+ L P + G P E +F A+DEF+S E + + ER AL K
Sbjct: 276 EFSPLPLLP---------YSGQPCLELSTFDDALDEFYSKIEGQRAGIARADAERAALSK 326
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ +K D TR AL + + KA+LI N E VDA +LA+ Q +A + W + V
Sbjct: 327 LDKIKLDQGTRAEALLRQAEECELKAQLITYNLEMVDAVLLAVNQMLATGMDWSALADLV 386
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLL--------RVHKERSNIKDSWVLHHDNADPFA 223
+ +R +PVA++I L+L N +++LL +E + + + + + P
Sbjct: 387 RNERRAGNPVAALIASLELENNRVSVLLANTLDDTGEEGEEEAMTRKAVKVASEECFPQH 446
Query: 224 DSSETKKPS----LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL----KSAE 275
T++ + LV VDL LSA ANA +F+ +R K KT+ + E AL K E
Sbjct: 447 TQRHTQRHTHTHILVFVDLSLSAAANASTYFEARRRHLAKHAKTLAANEAALAAAEKKVE 506
Query: 276 KKTKQTLKDVQTMTNINKA---RKVYWF--EKFYWFISSEN----------YL--GNV-- 316
+ KQ + + K + +WF + Y +S + Y G+V
Sbjct: 507 AQLKQVRAAPPALQPVRKPMWFERFHWFISSENYLVVSGRDAQQNELLVKRYFRKGDVYV 566
Query: 317 ------------------------------------AWDAKVVTNAWWVKADQVSKTAPT 340
AWD+K+VT+AWWV QVSKTAPT
Sbjct: 567 HAELHGTTICVRWRSGGPIPPLTLQQAGCACVCRSRAWDSKLVTSAWWVHHQQVSKTAPT 626
Query: 341 GEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
GE+LTTGSFMIRGKKNF PP L MG FLFKL++SSI
Sbjct: 627 GEYLTTGSFMIRGKKNFLPPQPLVMGFGFLFKLDDSSI 664
>gi|428183447|gb|EKX52305.1| hypothetical protein GUITHDRAFT_65529, partial [Guillardia theta
CCMP2712]
Length = 703
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 147/258 (56%), Gaps = 16/258 (6%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P V+RQ+ G + F+SF A+D +FS AE K++ + VQQ++ A+ K+E VKR HE +
Sbjct: 274 PVVLRQYAGKKLDMFDSFDKAMDAYFSIAEDKKLEQQKVQQKKAAVSKVERVKRAHEASI 333
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
AL++ + + +A LI N VD AIL I ++ + W ++ VK+ R +P+A
Sbjct: 334 QALQEEEAENYHRATLIEANLSDVDNAILVINSMLSQGMDWASLKKLVKEEGRKGNPIAQ 393
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
+I LKL+ N ITLLL D + +T VDVDL ++AY
Sbjct: 394 MIHGLKLDSNQITLLLTF----------------GLDAMEEEEQTLPVVAVDVDLGMNAY 437
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
NA+ ++ K+ A K +KT+Q+ KA+K AE+K K+ LK T +I + RK +WFEKF
Sbjct: 438 QNAQSYYSSKKKVALKAEKTMQAAGKAIKGAERKAKEDLKKADTKASIQQIRKTHWFEKF 497
Query: 304 YWFISSENYLGNVAWDAK 321
WFISSEN+L DA+
Sbjct: 498 IWFISSENFLVLCGRDAQ 515
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 40/63 (63%), Positives = 53/63 (84%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWDAK+VT+A+WV +QVSK+APTGE+L+TGSFMIRGKKN+ PP L MG LF+++ES
Sbjct: 570 AWDAKMVTSAYWVHPEQVSKSAPTGEYLSTGSFMIRGKKNYLPPNSLIMGFGLLFRIDES 629
Query: 377 SIS 379
++
Sbjct: 630 CLA 632
>gi|238493615|ref|XP_002378044.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
gi|220696538|gb|EED52880.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
Length = 1105
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 133/439 (30%), Positives = 211/439 (48%), Gaps = 90/439 (20%)
Query: 24 NLNVSSQ----GYLVQKKEKRP-------NSEDYIVANM---EFHPMILQPGSTPSVMRQ 69
N +S+Q GY+V K + R N + N+ +FHP P
Sbjct: 255 NTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNLLYEDFHPF------KPRQFEG 308
Query: 70 HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
G I EF S A VDE+FS+ E+ K++ + ++E A +KLE V+++HE ++ AL++
Sbjct: 309 KPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALKEQ 368
Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QL 188
Q + KA I +N V A+ A+ IA + W +I ++ Q +PVA IIK L
Sbjct: 369 QELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKLPL 428
Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA--DSSETKKPS---LVDVDLDLSAY 243
KL+ N ITLLL + + D ++ + D+ E+++P +D+DL +S +
Sbjct: 429 KLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLGISPW 488
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFE 301
ANAK++++ K+ AA K+Q+T QS+ KALKS EKK + LK + + + R+ +WFE
Sbjct: 489 ANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFWFE 548
Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
KF +FISSE YL G++
Sbjct: 549 KFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTAPI 608
Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
A + V T++ W V+A Q++KTA G L G F+++G+KNF
Sbjct: 609 PPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNFLA 668
Query: 360 PCQLAMGISFLFKLEESSI 378
P QL +G F++ + S+
Sbjct: 669 PSQLVLGFGVTFQISKDSL 687
>gi|225681027|gb|EEH19311.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis
Pb03]
Length = 1161
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 128/403 (31%), Positives = 194/403 (48%), Gaps = 82/403 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP P G I F +F AVDE+FS+ ES K+ + ++E A +K
Sbjct: 301 DFHPF------KPKQFENVPGMTILTFNTFNKAVDEYFSSVESQKLKYRLTEREEVARRK 354
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE ++D E R+ AL++ Q + KA+ I N V+ AI A+ IA + W +I +
Sbjct: 355 LEAAQKDQENRVGALKEVQELHVRKAQAIEANLLRVEEAINAVNGLIAQGMDWVEIARLI 414
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL----------RVHKERSNIKDSWVLHHDNAD 220
+ + +PVA +IK LKL N +TLLL E +S N
Sbjct: 415 EMEKSSQNPVAKVIKLPLKLYENTVTLLLGEPTEDEEPADESDEEEEDSESGDEDGGNKV 474
Query: 221 PFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
S + ++ L +D+DL +S +ANA+++++ +++AA K++KT++ST+KA+KS EKK
Sbjct: 475 KLEGSKKAQQQLLSIDIDLGISPWANARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVT 534
Query: 280 QTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVA 317
LK Q + R +WFEKF +F+SS+ YL G+V
Sbjct: 535 TDLKHALKQEKPILRPTRTPFWFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVY 594
Query: 318 WDAKV---------------------------------VTNAW---------WVKADQVS 335
A V ++AW WV A QVS
Sbjct: 595 VHADVQGATPIFVKNKPGTPDAPIPPGTLSQAGNLCVATSSAWDSKAVMGAWWVNAGQVS 654
Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
KTAP+GEF+ TG F+IRG+K+ PP QL +G + +F++ E SI
Sbjct: 655 KTAPSGEFVGTGGFVIRGEKHQLPPAQLLLGFAVMFQISEDSI 697
>gi|115443352|ref|XP_001218483.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114188352|gb|EAU30052.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 858
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 136/424 (32%), Positives = 210/424 (49%), Gaps = 76/424 (17%)
Query: 31 GYLVQKKEKRPNSED---YIVANMEFH-PMILQPGSTPSVMRQHQGFP---IQEFESFAA 83
GY++ K++ R + + + AN E P +L P RQ +G P I EF S A
Sbjct: 11 GYIIAKEDTRSKAREEGQAVEANGEGKKPGLLYEDFHPFKPRQFEGKPGVTILEFPSMNA 70
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
VDE+FS+ ES +++ + ++E A KKL+ V+++HE ++ AL+ Q + KA I +N
Sbjct: 71 TVDEYFSSIESQRLESRLTEREEAAKKKLDAVRQEHEKKIGALKHAQELHIRKAGAIEDN 130
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
V A+ A+ IA + W +I ++ Q +PVA IIK LKL N ITLLL
Sbjct: 131 VYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVAKIIKLPLKLYENTITLLLGEA 190
Query: 203 KERSNIKDSWVLHHDNADPFADSSET--KKPSL--VDVDLDLSAYANAKRFFDLKRSAAK 258
E + + ++ ++++T KP + +D+DL LS +ANA ++++ K+ AA
Sbjct: 191 GEEQEEAEELFSESEESEDEQETTQTAQNKPEVLTIDIDLGLSPWANATQYYEQKKMAAV 250
Query: 259 KQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--- 313
K+Q+T QS+ KALKS EKK + LK Q + ARK +WFEKF +F+SSE YL
Sbjct: 251 KEQRTAQSSTKALKSHEKKVTEDLKKSLKQEKQVLRPARKPFWFEKFLFFVSSEGYLVLG 310
Query: 314 -----------------GNV--------------------------------AWDAKVVT 324
G++ A + V T
Sbjct: 311 GRDAMQSELLYRRHLKKGDIFVHADLEGATPIVVKNRPGTPNAPIPPSTLSQAGNLCVAT 370
Query: 325 NAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
++ W V A QVSKTA G L G F+++G+KNF P QL +G + +F++
Sbjct: 371 SSAWDSKAVMPAWWVHAHQVSKTAEAGGLLPAGDFLVKGEKNFLAPSQLVLGFAVMFQIS 430
Query: 375 ESSI 378
+ S+
Sbjct: 431 KESL 434
>gi|358398026|gb|EHK47384.1| hypothetical protein TRIATDRAFT_238226 [Trichoderma atroviride IMI
206040]
Length = 1068
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 137/445 (30%), Positives = 209/445 (46%), Gaps = 95/445 (21%)
Query: 21 LVSNLNVSSQ--GYLVQKKEKRPNSEDYIVANM----------EFHPMILQPGSTPSVMR 68
+V N+ S+ GY+ KK K +S + N +FHP I P +
Sbjct: 257 IVENITASATCTGYIFAKK-KATSSSGLVEGNASQKREGLLYDDFHPFI------PHKFK 309
Query: 69 QHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQ 128
+ F I EFE + VDEFFS+ E K++ + +E A KKLE+ + + R+ L+
Sbjct: 310 KDSSFEILEFEGYNRTVDEFFSSLEGQKLESRLTGREEAAKKKLEDARHEQGKRIQGLQD 369
Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-Q 187
Q ++ KA I N E V A+ A+ IA + W DI +++ ++ +PVA I
Sbjct: 370 AQAMNLRKAAAIEANVERVQEAMDAVNGLIAQGMDWIDIGKLIEREKKRQNPVAETINLP 429
Query: 188 LKLNINHITLLLRVHKERSNIKDSWVLH------HDNADPFADSSETK--KPS---LVDV 236
LKL+ N ITLLL + + ++ + D+ + ++++ TK KP+ VD+
Sbjct: 430 LKLSENTITLLLAEEEFDEDEDEAQEANPYETDESDSEEGLSEANATKDTKPAKLLTVDI 489
Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKA 294
L++S ++NA+ +++ +RSAA K++KT Q KALKS E K + LK Q +
Sbjct: 490 VLNVSPWSNAREYYEQRRSAAIKEEKTQQQATKALKSTEHKIAEDLKKGLKQEKALLQPI 549
Query: 295 RKVYWFEKFYWFISSENYL--------------------GNVAWDAKV------------ 322
RK WFEKF WFISS+ YL G++ A +
Sbjct: 550 RKQLWFEKFLWFISSDGYLVLGGKDPQQSEILYRRYLRKGDIYCHADIRGAANIVIKNNP 609
Query: 323 ---------------------VTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIR 352
+ AW KA DQVSK+A TGE + G+F+I
Sbjct: 610 NTPDAPIPPATLSQAGSLSVCSSEAWDSKAGMGAWWVNTDQVSKSASTGEIMPAGNFIIE 669
Query: 353 GKKNFFPPCQLAMGISFLFKLEESS 377
GKKN+ PP QL +G+ F F++ E S
Sbjct: 670 GKKNYLPPTQLLLGLGFAFRISEQS 694
>gi|296083204|emb|CBI22840.3| unnamed protein product [Vitis vinifera]
Length = 993
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/273 (38%), Positives = 159/273 (58%), Gaps = 19/273 (6%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ Q + +FE+F AA+DEF+S ES + + + +E A++KL ++ D E R
Sbjct: 316 CPILLNQFKSREFVKFETFDAALDEFYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENR 375
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L++ + AELI N E VDAAILA+R +AN ++WED+ VK+ ++ +PVA
Sbjct: 376 VHTLKKEVDHCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEEKKSGNPVA 435
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I +L L N +TLLL +N D D +T V+VDL LSA
Sbjct: 436 GLIDKLYLERNCMTLLL----------------SNNLDEMDDDEKTLPVDKVEVDLALSA 479
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ANA+R+++ K+ KQ+KT+ + EKA K+AEKKT+ L +T+ I+ RKV+WFEK
Sbjct: 480 HANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQLSQEKTVATISHMRKVHWFEK 539
Query: 303 FYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
F WFISSENYL G A +++ + K D
Sbjct: 540 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 572
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 44/63 (69%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 614 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 673
Query: 377 SIS 379
S+
Sbjct: 674 SLG 676
>gi|189211034|ref|XP_001941848.1| serologically defined colon cancer antigen 1 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187977941|gb|EDU44567.1| serologically defined colon cancer antigen 1 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 1151
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 134/408 (32%), Positives = 194/408 (47%), Gaps = 89/408 (21%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP Q ++ + EF+ F AVDEFFS+ E K++ K ++E+ A +K
Sbjct: 311 DFHPFRPQ-------QFENSDYTFIEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKRK 363
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE +++HE R+ L+Q Q ++ KAE I+ N V A A+ I + W DI +
Sbjct: 364 LEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIARLI 423
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL-------RVHKERSNIKDSWVLHHDNADPFA 223
++ Q + VA +IK LKLN N ITLLL E + S V + D
Sbjct: 424 EREQNSGNAVAQLIKLPLKLNENTITLLLDETNWEEGEEVEDEGNETSSVSEDSDEDAGE 483
Query: 224 DSSETKK---------PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKS 273
+ KK P L +D+DL L+A+AN+ +FD K++AA K+ +T+Q++ +ALKS
Sbjct: 484 EDGAKKKSAPVKVSARPQLAIDIDLSLTAWANSTEYFDQKKTAANKEDRTLQASTRALKS 543
Query: 274 AEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL------------------ 313
EKK + LK Q + RK WFEKF +FISS+ YL
Sbjct: 544 HEKKVAEDLKKGLKQEKEVLRPVRKQQWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRFL 603
Query: 314 --GNV-------------------AWDAKV--------------VTNAWWVKA------- 331
G+V DA + ++AW KA
Sbjct: 604 RKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLCICTSDAWDSKAVMSAWWV 663
Query: 332 --DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
DQVSKT TGEFL G F ++GKK F P QL +G++ +F++ ESS
Sbjct: 664 RSDQVSKTGQTGEFLPAGMFNVKGKKEFLPLAQLVVGLAVMFEISESS 711
>gi|148704666|gb|EDL36613.1| mCG3169, isoform CRA_b [Mus musculus]
Length = 658
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/210 (48%), Positives = 121/210 (57%), Gaps = 59/210 (28%)
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
KP LVDVDL LSAYANAK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T
Sbjct: 49 KPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVT 108
Query: 290 NINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-- 338
+I KARKVYWFEKF WFISSENYL G +++ + +V AD T+
Sbjct: 109 SIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCV 168
Query: 339 ---PTG---------------------------------------------EFLTTGSFM 350
PTG E+LTTGSFM
Sbjct: 169 IKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFM 228
Query: 351 IRGKKNFFPPCQLAMGISFLFKLEESSISR 380
IRGKKNF PP L MG SFLFK++ES + R
Sbjct: 229 IRGKKNFLPPSYLMMGFSFLFKVDESCVWR 258
>gi|242764776|ref|XP_002340841.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
gi|218724037|gb|EED23454.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
Length = 1111
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 136/439 (30%), Positives = 204/439 (46%), Gaps = 99/439 (22%)
Query: 29 SQGYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEF 78
++GY+V K+ + ++ D+ +N +FHP P G+ I E+
Sbjct: 262 TKGYIVAKERAKKDT-DFDQSNDSASKENLLFEDFHPF------RPRQFEGKPGYHILEY 314
Query: 79 ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
++F VDE+FS+ ES K++ + + E A +KLE + DH R AL+Q Q + KAE
Sbjct: 315 DNFNKTVDEYFSSIESQKLESRLAEHEETAKRKLEAARADHLDRAGALKQAQELHIRKAE 374
Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITL 197
I N V A A+ IA + W +I ++ Q N+PVA IK LKL N ITL
Sbjct: 375 AIQANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQERNNPVAKTIKLPLKLFENTITL 434
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETK--------------KPSLVDVDLDLSAY 243
LL E S + ++P S ++ +P +D+DL LS +
Sbjct: 435 LL---SEESAKGEGDKEEFSESEPEGSDSNSESEFEKDGGPKRKNAEPLAIDIDLSLSPW 491
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFE 301
+NA ++++ K++AA K+QKTIQS+EKALKS EKK + LK Q + +RK +WFE
Sbjct: 492 SNATQYYEQKKTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPSRKPFWFE 551
Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
K+ +FISSE YL G+V
Sbjct: 552 KYLYFISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTSNAPI 611
Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
A V T+ W V A QVS+T GE L +G FM++G+KN+
Sbjct: 612 PPGTLTQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLASGGFMVKGEKNYLA 671
Query: 360 PCQLAMGISFLFKLEESSI 378
P Q +G + LF++ + S+
Sbjct: 672 PGQPVLGFAVLFQISKESV 690
>gi|402074990|gb|EJT70461.1| serologically defined colon cancer antigen 1 [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 1086
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 129/392 (32%), Positives = 195/392 (49%), Gaps = 81/392 (20%)
Query: 63 TPSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
+P + +Q G P + FE F VDEFFS+ E K++ + ++E A +KL+ KR+H
Sbjct: 303 SPFLPQQFAGDPSNVVLTFEGFNKTVDEFFSSLEGQKLESRLTEREAGAKRKLDAAKREH 362
Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
E R+ L++ QL++ KA I N E V A+ A+ + + W D+ V++ Q+ ++
Sbjct: 363 EKRIEGLQEYQLLNLRKAAAIEANVERVQEAMDAVIGLLEQGMDWVDVGKLVEREQKRHN 422
Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS-----ETKKPSLV 234
PVA II +L +++ + T+ L + ++ DS + + D +T K +
Sbjct: 423 PVAEII-ELPMDLANNTITLVIAEQDDVDDDSEDGYETESSASDDDDDAAAVQTGKAKTL 481
Query: 235 DVD--LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
+VD L L+ + NA ++D KRSAA KQ+KT+Q + ALKSA++K KD+Q
Sbjct: 482 EVDIKLSLTPWGNAGEYYDQKRSAAVKQEKTVQQSSIALKSAQEKIA---KDLQKGLKKE 538
Query: 293 K-----ARKVYWFEKFYWFISSENYL--------------------GNV----------- 316
K AR+ WFEKF+WFISS+ YL G+V
Sbjct: 539 KPVMQLARRQMWFEKFHWFISSDGYLVLGGRDAQQNEILYRRYLKRGDVYVHADLHGAPS 598
Query: 317 --------AWDAKV--------------VTNAWWVK---------ADQVSKTAPTGEFLT 345
DA V ++AW K ADQVSK+APTGEFL
Sbjct: 599 VIIKNNPRTPDAPVPPSTLSQAGQLAVCASSAWESKAGMGAYWVGADQVSKSAPTGEFLP 658
Query: 346 TGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
TGSFM+RGK+N PP L +G +F++ + S
Sbjct: 659 TGSFMVRGKRNELPPAPLIVGFGVMFRISDES 690
>gi|85091915|ref|XP_959135.1| hypothetical protein NCU09191 [Neurospora crassa OR74A]
gi|28920536|gb|EAA29899.1| conserved hypothetical protein [Neurospora crassa OR74A]
gi|29150083|emb|CAD79644.1| conserved hypothetical protein [Neurospora crassa]
Length = 1097
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 124/386 (32%), Positives = 182/386 (47%), Gaps = 83/386 (21%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+F P + P + + I F + VDEFFS+ E +++ K ++E A +K
Sbjct: 297 DFQPFL------PKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRK 350
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE + D R+ L++ ++++ KA I N E V A+ A+ + + W DI +
Sbjct: 351 LEAARMDQAKRIEGLQEMEMLNYRKAATIQANIERVQEAMDAVNGLLQEGMDWVDITKLI 410
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKD--------SWVLHHDNADPF 222
++ Q+ +PVA IIK +KL N ITLLL E + D + D+AD
Sbjct: 411 EKEQKQGNPVAEIIKLPMKLKENTITLLLGEGVEEEDEADQDKEDDEFDYSDSEDDAD-G 469
Query: 223 ADSSETKKPSL---VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
AD+++ K P VD++L LS + NA+ ++D KR+AA K QKT+Q + ALK+AE+K
Sbjct: 470 ADTTKDKAPVKRLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIA 529
Query: 280 QTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVA 317
+ L+ Q + RK WFEKF WFISS+ YL G+V
Sbjct: 530 EDLRKGLKQEKPVLQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVY 589
Query: 318 WDAKV---------------------------------VTNAW---------WVKADQVS 335
A V ++AW WV ADQVS
Sbjct: 590 VHADVHGAASVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVS 649
Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPC 361
K+AP GE+L GSFM+RGK+N PP
Sbjct: 650 KSAPAGEYLPVGSFMVRGKRNLLPPA 675
>gi|396473834|ref|XP_003839430.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
JN3]
gi|312215999|emb|CBX95951.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
JN3]
Length = 1115
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 126/382 (32%), Positives = 184/382 (48%), Gaps = 79/382 (20%)
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
EF+ F AVDEFFS+ E K++ K ++E+ A KKLE +++HE R+ L+Q Q ++ K
Sbjct: 311 EFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEERIGGLQQVQEMNFRK 370
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
AE I+ N V A A+ I + W DI +++ Q + VA I+ LKL+ N I
Sbjct: 371 AEAILANVHRVTEATEAVNGLIRQGMDWVDISRLIEREQAQGNAVAQSIRLPLKLHQNTI 430
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPF-------------ADSSETKKPSL-VDVDLDLS 241
TLLL ++ + ++ A + T++P L +D+DL LS
Sbjct: 431 TLLLNETDWDHEEEEEDEGNETSSVSEDSEEEEEGSKKKAAPTKVTQQPQLAIDIDLGLS 490
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYW 299
A+AN+ ++D K++AA K+ +T ++ KALKS EKK + LK Q + RK W
Sbjct: 491 AWANSTEYYDQKKTAASKEDRTAAASSKALKSHEKKVTEDLKKGLKQEKEVLRPVRKQQW 550
Query: 300 FEKFYWFISSENYL--------------------GNV----------------------- 316
FEK+ +FISS+ YL G+V
Sbjct: 551 FEKYIYFISSDGYLVLGGKDAQQNEIIYKRFLRKGDVYVHADLKGAVPMIIKNKPDTPDA 610
Query: 317 ---------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
A V T+ W V++ QVSKT TGEFL G F I GKK F
Sbjct: 611 PIPPSTLSQAGHLSVCTSEAWESKAVMSAWWVRSTQVSKTGQTGEFLPAGMFNITGKKEF 670
Query: 358 FPPCQLAMGISFLFKLEESSIS 379
PP QL +G++ +F++ ESSIS
Sbjct: 671 LPPAQLVVGLAVMFEISESSIS 692
>gi|449017191|dbj|BAM80593.1| unknown RNA-binding protein, conserved [Cyanidioschyzon merolae
strain 10D]
Length = 1371
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 126/418 (30%), Positives = 196/418 (46%), Gaps = 100/418 (23%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
TP ++RQHQ + F SF AVDE+F+ E + + ++R A LE ++R+ ETR
Sbjct: 412 TPRLLRQHQHLEPRMFPSFDEAVDEYFARMEELRYRQEIENRQRQAQGTLERMRRELETR 471
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L+Q + KA LI N VD A+ IR +A+ + W++++ + +R +PVA
Sbjct: 472 VLTLKQQEERCLRKAALIETNLVDVDNALQVIRAALASGIDWKELDQMLVLERRRGNPVA 531
Query: 183 SIIKQLKLNINHITLLL---------------------RVHKER--SNIKDSWVLHHDNA 219
+I L+L N +TL+L R + R SN + + +
Sbjct: 532 QLIHSLQLQENQMTLMLADDSGSVDNTDAETGSSSRQRRPAETRDLSNEDSASSVESASE 591
Query: 220 DPFADSSE--TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
D DS+ + + LV VDL LSA+ANA+R+++ ++ AA+K KT++++ +AL++AEKK
Sbjct: 592 DESGDSTSVCSSRVELVQVDLSLSAFANARRYYEQRKKAAEKGTKTMEASAQALRAAEKK 651
Query: 278 TKQTL---------KDVQTMTNINKARKVYWFEKFYWFISSENYL--------------- 313
+ L K + + RK WFEKF +FI+SENYL
Sbjct: 652 ALEVLAGTASKNKRKKATPLNTLKAIRKPLWFEKFRYFITSENYLVIAGKDSQQNEQLVR 711
Query: 314 -------GNVAWDAKV------------------------------VTNAW--------- 327
G++ A V ++AW
Sbjct: 712 RYLEENTGDLYMHADVHGAASVIIKGKKNRPAPPLSIQEAAIFAAACSSAWDAKVAVNAY 771
Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-----PCQLAMGISFLFKLEESSISR 380
WV +QVS+TAP+G +L GSF+IRG +N+ P L MG FLF+L S+ R
Sbjct: 772 WVYPEQVSRTAPSGMYLQQGSFVIRGSRNYVPVTTSGSGPLVMGFGFLFRLAPESVWR 829
>gi|327348881|gb|EGE77738.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 1166
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 201/437 (45%), Gaps = 95/437 (21%)
Query: 31 GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
GY+V K E + + + A ++FHP P I +F++
Sbjct: 267 GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPF------EPKQFENQADMAILKFDT 320
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDE+FS+ E K++ + ++E A +KLE ++D E R+ L++ + + KA+ I
Sbjct: 321 FNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQAI 380
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N V+ A+ A+ IA + W +I ++ Q +PVA +IK LKL N +TLLL
Sbjct: 381 EANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLLL 440
Query: 200 RVHKERSNIKDSWVLHHDNADPFADS------SETKKPSL--------VDVDLDLSAYAN 245
E D ++ + D +KKP +D+DL +S +AN
Sbjct: 441 GEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWAN 500
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
A+++++ K++AA K++KT+ S +KA+KS EKK LK Q + R +WFEKF
Sbjct: 501 ARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEKF 560
Query: 304 YWFISSENYLGNVAWDAK------------------------------------------ 321
+FISS+ YL DA+
Sbjct: 561 IYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIPP 620
Query: 322 ----------VVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
V T++ W V ADQVSKT P+GE+L TG F+IRG+KN PP
Sbjct: 621 GTLSQAGNLCVATSSAWHSKAVMGAWWVNADQVSKTTPSGEYLETGGFVIRGEKNQLPPA 680
Query: 362 QLAMGISFLFKLEESSI 378
QL +G + +F++ SI
Sbjct: 681 QLLLGFAVMFQISSESI 697
>gi|350296215|gb|EGZ77192.1| hypothetical protein NEUTE2DRAFT_99766 [Neurospora tetrasperma FGSC
2509]
Length = 1095
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 122/385 (31%), Positives = 181/385 (47%), Gaps = 81/385 (21%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+F P + P + + I F + VDEFFS+ E +++ K ++E A +K
Sbjct: 297 DFQPFL------PKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRK 350
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE + D R+ L++ ++++ KA I N E V A+ A+ + + W DI +
Sbjct: 351 LEAARMDQAKRIEGLQEMEMLNYRKAATIQANTERVQEAMDAVNGLLQEGMDWVDITKLI 410
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNI----KDSWVLHHDNADPFADSS 226
++ Q+ +PVA IIK +KL N ITLLL E + K+ + +++ AD +
Sbjct: 411 EKEQKQGNPVAEIIKLPMKLKENTITLLLGEGVEEEDEGDQDKEDDEFDYSDSEDDADGA 470
Query: 227 ETKKPSL------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
ET K VD++L LS + NA+ ++D KR+AA K QKT+Q + ALK+AE+K +
Sbjct: 471 ETTKHKAPVKRLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIAE 530
Query: 281 TLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAW 318
L+ Q + RK WFEKF WFISS+ YL G+V
Sbjct: 531 DLRKGLKQEKPVLQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYV 590
Query: 319 DAKV---------------------------------VTNAW---------WVKADQVSK 336
A V ++AW WV ADQVSK
Sbjct: 591 HADVHGAASVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSK 650
Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPC 361
+AP GE+L GSFM+RGK+N PP
Sbjct: 651 SAPAGEYLPVGSFMVRGKRNLLPPA 675
>gi|449485009|ref|XP_004157045.1| PREDICTED: nuclear export mediator factor NEMF homolog [Cucumis
sativus]
Length = 1090
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 32/310 (10%)
Query: 30 QGY-LVQKKEKRPNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
+GY L+QKK+ + + AN EF P++L Q +FE+F AA+
Sbjct: 297 EGYILMQKKDVKKEESEAATANEIYDEFCPILLN---------QFMSRKYTKFETFDAAL 347
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+S ES + + + +E A KL ++ D R+ L+Q + AELI N E
Sbjct: 348 DEFYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLE 407
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VDA ILA+R +A +SWED+ VK+ ++ +PVA +I +L L N +TLLL
Sbjct: 408 DVDAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLS----- 462
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
+N D D +T+ V+VD+ LSA+ANA+R+++LK+ KQ+KTI
Sbjct: 463 -----------NNLDEMDDDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTIT 511
Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKV 322
+ EKA K+AE+KT+ L +T+ I+ RKV+WFEKF WFISSENYL G A ++
Sbjct: 512 AHEKAFKAAERKTRLQLSQEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 571
Query: 323 VTNAWWVKAD 332
+ + K D
Sbjct: 572 IVKRYMSKGD 581
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 44/63 (69%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 623 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 682
Query: 377 SIS 379
S+
Sbjct: 683 SLG 685
>gi|449441522|ref|XP_004138531.1| PREDICTED: nuclear export mediator factor Nemf-like [Cucumis
sativus]
Length = 1119
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 32/310 (10%)
Query: 30 QGY-LVQKKEKRPNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
+GY L+QKK+ + + AN EF P++L Q +FE+F AA+
Sbjct: 297 EGYILMQKKDVKKEESEAATANEIYDEFCPILLN---------QFMSRKYTKFETFDAAL 347
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
DEF+S ES + + + +E A KL ++ D R+ L+Q + AELI N E
Sbjct: 348 DEFYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLE 407
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VDA ILA+R +A +SWED+ VK+ ++ +PVA +I +L L N +TLLL
Sbjct: 408 DVDAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLS----- 462
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
+N D D +T+ V+VD+ LSA+ANA+R+++LK+ KQ+KTI
Sbjct: 463 -----------NNLDEMDDDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTIT 511
Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKV 322
+ EKA K+AE+KT+ L +T+ I+ RKV+WFEKF WFISSENYL G A ++
Sbjct: 512 AHEKAFKAAERKTRLQLSQEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 571
Query: 323 VTNAWWVKAD 332
+ + K D
Sbjct: 572 IVKRYMSKGD 581
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 44/63 (69%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 623 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 682
Query: 377 SIS 379
S+
Sbjct: 683 SLG 685
>gi|327287378|ref|XP_003228406.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
[Anolis carolinensis]
Length = 635
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/180 (50%), Positives = 117/180 (65%), Gaps = 14/180 (7%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N +GY++QK+EK+P+ E + I+ EFHP + Q+ P EF+S
Sbjct: 240 NFDGKGYIIQKREKKPSLEPEKPAEEILTYEEFHPFLFS---------QYTKCPFVEFDS 290
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKLENV++DHE RL AL Q Q +DK K EL+
Sbjct: 291 FNKAVDEFYSKLEGQKIDLKALQQEKQALKKLENVRKDHEHRLEALHQAQEIDKVKGELV 350
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N E VD AI +R +ANQ+ W +I A VK+AQ DP+AS IK+LKL NHIT+LL+
Sbjct: 351 EMNLEMVDRAITVVRSALANQIDWTEIGALVKEAQAQGDPLASAIKELKLQTNHITMLLK 410
>gi|336276025|ref|XP_003352766.1| hypothetical protein SMAC_01600 [Sordaria macrospora k-hell]
gi|380094654|emb|CCC08036.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1086
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/421 (31%), Positives = 193/421 (45%), Gaps = 97/421 (23%)
Query: 29 SQGYLVQKKEKRPNSEDYIVA----------NM---EFHPMILQPGSTPSVMRQHQGFPI 75
+ GY++ K RP ED V N+ +F P + P ++ + I
Sbjct: 263 ANGYIIAKP--RPGFEDAEVVVDAPPAEKAKNLLYEDFQPFL------PKQFENNKDYRI 314
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
F + VDEFFS+ E +++ K ++E A +KLE + D R+ L++ ++++
Sbjct: 315 LPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRKLEAARMDQAKRIEGLQEMEMLNYR 374
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINH 194
KA I N E V A+ A+ + + W DI +++ Q+ +PVA IIK +KL N
Sbjct: 375 KAATIQANIERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPVAEIIKLPMKLKENT 434
Query: 195 ITLLL-----RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV-----DVDLDLSAYA 244
ITLLL + + +D + D+ D D E K V D++L LS +
Sbjct: 435 ITLLLGEGVEEEEEGDEDKEDDEFDYSDDEDE-GDVGEPKDKVPVNRLEIDINLTLSVWN 493
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEK 302
NA+ ++D KR+AA K QKT+Q + ALKSAE+K + L+ Q + RK WFEK
Sbjct: 494 NAREYYDQKRTAAHKAQKTVQQSVIALKSAEQKISEDLRKGLKQEKPVLQPIRKAMWFEK 553
Query: 303 FYWFISSENYL--------------------GNVAWDAKV-------------------- 322
F WFISS+ YL G+V A V
Sbjct: 554 FTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAASVIIKNNPKTPDAPIP 613
Query: 323 -------------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
++AW WV ADQVSK+APTGE+L GSFM+RGK+N PP
Sbjct: 614 PSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPTGEYLPVGSFMVRGKRNLLPP 673
Query: 361 C 361
Sbjct: 674 A 674
>gi|357161759|ref|XP_003579195.1| PREDICTED: nuclear export mediator factor Nemf-like [Brachypodium
distachyon]
Length = 1163
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 106/273 (38%), Positives = 158/273 (57%), Gaps = 19/273 (6%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P +++Q + EFE+F AA+DEF+S ES +++ + +E A+++L +K D E R
Sbjct: 354 CPILLKQFKAREYDEFETFDAALDEFYSKIESQRVNQQQKAKEDSAVQRLNKIKLDQENR 413
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L + + AELI N E VDAAI+A+R +AN +SWE + +K+ +R +PVA
Sbjct: 414 VHTLRKEADHCIKMAELIEYNLEDVDAAIVAVRVSLANGMSWEALARMIKEERRAGNPVA 473
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I +L N ITLLL +N D + +T V+VDL LSA
Sbjct: 474 GLIDKLSFENNCITLLL----------------SNNLDDMDEDEKTAPVEKVEVDLSLSA 517
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ANA+R++++K+ KQ+KTI + +KA K+AEKKT+ L +T+ I RKV+WFEK
Sbjct: 518 HANARRWYEMKKKQETKQEKTITAHDKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEK 577
Query: 303 FYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
F WFISSENYL G A ++V + K D
Sbjct: 578 FNWFISSENYLIVSGRDAQQNELVVKRYMSKGD 610
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 43/63 (68%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 652 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDES 711
Query: 377 SIS 379
++
Sbjct: 712 CLA 714
>gi|117938818|gb|AAH06001.1| SDCCAG1 protein [Homo sapiens]
Length = 398
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 91/180 (50%), Positives = 116/180 (64%), Gaps = 14/180 (7%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 199 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 249
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 250 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 309
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 310 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 369
>gi|34784822|gb|AAH56687.1| SDCCAG1 protein, partial [Homo sapiens]
Length = 426
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/180 (50%), Positives = 116/180 (64%), Gaps = 14/180 (7%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P E + I+ EFHP + QH P EFES
Sbjct: 218 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 268
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 269 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 328
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVAS IK+LKL NH+T+LLR
Sbjct: 329 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 388
>gi|299471369|emb|CBN79324.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1380
Score = 167 bits (424), Expect = 5e-39, Method: Composition-based stats.
Identities = 120/379 (31%), Positives = 183/379 (48%), Gaps = 86/379 (22%)
Query: 64 PSVMRQHQGFP-IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ QH+G I F SF AVD FF K+ A+ E +K+ ++ D E R
Sbjct: 454 PQLLAQHEGGAVIHSFASFDQAVDAFFGRIVEQKLKQTAMAAEAAVERKVAWIRNDQERR 513
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ ALE+ Q A+L + V+ A++ +R +AN + W+D+E VK + +P+A
Sbjct: 514 VLALEERQEKMLRHAQLAEAWADEVEKALMVVRSALANGMDWQDLEDLVKAETANGNPIA 573
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
S+I +L+L+ N + L L P A+ E + LV+VD+ LSA
Sbjct: 574 SLIHELRLDRNQVVLSL---------------------PTAEDGEDDQ--LVEVDIMLSA 610
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ANA+ ++ K+ A K+ KT+ ++EK LK AE++ ++TL+ ++ ARKVYWFEK
Sbjct: 611 HANARVMYENKKLARAKELKTLTASEKVLKIAEQQAERTLQRQAHKRSLQVARKVYWFEK 670
Query: 303 FYWFISSENYL---GNVAWDAKVVTNAW------WVKADQ-------VSKTAPTGE---- 342
F WFISSENYL G A +VV + +V AD V P+G+
Sbjct: 671 FNWFISSENYLVISGRNAQQNEVVVKKYLRPGDIYVHADLHGASSCVVRNKDPSGKRAVS 730
Query: 343 ------------------------------------------FLTTGSFMIRGKKNFFPP 360
+L TGSFM+RG+K+F PP
Sbjct: 731 PLALEEAGCMTVCRSGAWGAKMVTSAWWVYADQVSKTAPTGEYLVTGSFMVRGRKHFLPP 790
Query: 361 CQLAMGISFLFKLEESSIS 379
L MG + LFKL++S ++
Sbjct: 791 RALEMGFALLFKLDDSCLA 809
>gi|125579741|gb|EAZ20887.1| hypothetical protein OsJ_36526 [Oryza sativa Japonica Group]
Length = 1176
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 104/272 (38%), Positives = 156/272 (57%), Gaps = 19/272 (6%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P ++ Q + EFE+F AA+DEF+S ES +++ + +E A ++L +K D E R+
Sbjct: 354 PVLLNQFKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRV 413
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L + + AELI N E VDAAI+A+R +AN +SW+ + +K+ ++ +PVA
Sbjct: 414 HTLRKEVDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAG 473
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
+I +L N ITLLL +N D + +T V+VDL LSA+
Sbjct: 474 LIDKLSFERNCITLLL----------------SNNLDDMDEEEKTAPVEKVEVDLSLSAH 517
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
ANA+R+++LK+ KQ+KT+ + EKA K+AEKKT+ L +T+ I RKV+WFEKF
Sbjct: 518 ANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKF 577
Query: 304 YWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
WFISSENYL G A +++ + K D
Sbjct: 578 NWFISSENYLIISGRDAQQNELIVKRYMSKGD 609
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 44/63 (69%), Positives = 52/63 (82%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 668 AWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDES 727
Query: 377 SIS 379
S++
Sbjct: 728 SLA 730
>gi|336464133|gb|EGO52373.1| hypothetical protein NEUTE1DRAFT_71883 [Neurospora tetrasperma FGSC
2508]
Length = 1095
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 122/385 (31%), Positives = 180/385 (46%), Gaps = 81/385 (21%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+F P + P + + I F + VDEFFS+ E ++ K ++E A +K
Sbjct: 297 DFQPFL------PKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLKSKLSEREAAAKRK 350
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE + D R+ L++ ++++ KA I N E V A+ A+ + + W DI +
Sbjct: 351 LEAARMDQAKRIEGLQEMEMLNYRKAATIQANIERVQEAMDAVNGLLQEGMDWVDITKLI 410
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL----RVHKERSNIKDSWVLHHDNADPFADSS 226
++ Q+ +PVA IIK +KL N ITLLL +E K+ + +++ AD +
Sbjct: 411 EKEQKQGNPVAEIIKLPMKLKENTITLLLGEGVEEEEEGDQDKEDDEFDYSDSEDDADGA 470
Query: 227 ETKKPSL------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
ET K VD++L LS + NA+ ++D KR+AA K QKT+Q + ALK+AE+K +
Sbjct: 471 ETTKDKAPVKRLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIAE 530
Query: 281 TLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAW 318
L+ Q + RK WFEKF WFISS+ YL G+V
Sbjct: 531 DLRKGLKQEKPVLQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYV 590
Query: 319 DAKV---------------------------------VTNAW---------WVKADQVSK 336
A V ++AW WV ADQVSK
Sbjct: 591 HADVHGAASVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSK 650
Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPC 361
+AP GE+L GSFM+RGK+N PP
Sbjct: 651 SAPAGEYLPVGSFMVRGKRNLLPPA 675
>gi|125537046|gb|EAY83534.1| hypothetical protein OsI_38746 [Oryza sativa Indica Group]
Length = 1153
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 104/272 (38%), Positives = 156/272 (57%), Gaps = 19/272 (6%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P ++ Q + EFE+F AA+DEF+S ES +++ + +E A ++L +K D E R+
Sbjct: 331 PVLLNQFKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRV 390
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L + + AELI N E VDAAI+A+R +AN +SW+ + +K+ ++ +PVA
Sbjct: 391 HTLRKEVDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAG 450
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
+I +L N ITLLL +N D + +T V+VDL LSA+
Sbjct: 451 LIDKLSFERNCITLLL----------------SNNLDDMDEEEKTAPVEKVEVDLSLSAH 494
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
ANA+R+++LK+ KQ+KT+ + EKA K+AEKKT+ L +T+ I RKV+WFEKF
Sbjct: 495 ANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKF 554
Query: 304 YWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
WFISSENYL G A +++ + K D
Sbjct: 555 NWFISSENYLIISGRDAQQNELIVKRYMSKGD 586
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 44/63 (69%), Positives = 52/63 (82%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 645 AWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDES 704
Query: 377 SIS 379
S++
Sbjct: 705 SLA 707
>gi|328909421|gb|AEB61378.1| serologically defined colon cancer antigen 1-like protein, partial
[Equus caballus]
Length = 302
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/180 (50%), Positives = 116/180 (64%), Gaps = 14/180 (7%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E I+ EFHP + QH P EFES
Sbjct: 103 NFSGKGYIIQKREMKPSLEVDKPTQDILTYEEFHPFLFS---------QHSQCPYIEFES 153
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 154 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELI 213
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 214 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 273
>gi|346325475|gb|EGX95072.1| serologically defined colon cancer antigen 1 [Cordyceps militaris
CM01]
Length = 1048
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 123/400 (30%), Positives = 192/400 (48%), Gaps = 81/400 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP + P + + I FE F VD+FFS+ E K+ + V++E A +K
Sbjct: 295 DFHPFV------PQKFQNNSKIEILRFEGFNRTVDDFFSSLEGQKLQSRVVEREAAAQRK 348
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ K+D E RL L+ +Q + KA I N E V A+ +I +A + W DI V
Sbjct: 349 LDAAKQDQENRLKGLQTSQSDNFRKAAAIEANIERVQEAMDSINGLLAQGMDWVDIGKLV 408
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKE---RSNIKDSWVLHHDNADPFADSSET 228
+ Q+ N+ VA++I L L++ + +R+ +E S ++D + +AD D +
Sbjct: 409 AREQKKNNAVANLI-CLPLSLADNVISIRLSEEDDAGSEVEDPFETDDSDADSETDLNAA 467
Query: 229 K-------KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
K K +V++ L LS ++NA+ ++D +++A K++KT ++A+KS E+K K
Sbjct: 468 KSVQNYSDKTIIVELTLTLSPWSNAREYYDQRKTAVVKEEKTQLQADRAIKSTEQKIKHD 527
Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNV--- 316
LK Q + R + WFEKFYWFISS+ YL G++
Sbjct: 528 LKRALKQEKALLQPIRNLMWFEKFYWFISSDGYLVVGAKDKSQAEILYRRHLGSGDIFCH 587
Query: 317 --AWDAKVV----------------------------TNAWWVKAD---------QVSKT 337
A +A +V + AW KA QVSK+
Sbjct: 588 ADANNAAIVIVKNNSNTEDAHIAPATLAQAGQLSICSSEAWDSKAGIGAWWVNSSQVSKS 647
Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
PTG+ L G+F I G+KNF PP QL +G+S +FK+ E S
Sbjct: 648 TPTGDILQPGNFNISGEKNFLPPGQLILGLSIMFKISEES 687
>gi|51593729|gb|AAH80716.1| Sdccag1 protein, partial [Mus musculus]
Length = 443
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410
Query: 200 R 200
R
Sbjct: 411 R 411
>gi|74152610|dbj|BAE42589.1| unnamed protein product [Mus musculus]
Length = 438
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410
Query: 200 R 200
R
Sbjct: 411 R 411
>gi|26334499|dbj|BAC30950.1| unnamed protein product [Mus musculus]
Length = 438
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410
Query: 200 R 200
R
Sbjct: 411 R 411
>gi|54887337|gb|AAH37106.2| Sdccag1 protein [Mus musculus]
Length = 415
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410
Query: 200 R 200
R
Sbjct: 411 R 411
>gi|414878087|tpg|DAA55218.1| TPA: hypothetical protein ZEAMMB73_985047 [Zea mays]
Length = 608
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 104/273 (38%), Positives = 156/273 (57%), Gaps = 19/273 (6%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ Q + EF +F AA+DEF+S ES K++ + +E A ++L +K D E R
Sbjct: 351 CPVLLNQFKSREYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENR 410
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L + + AELI N E VDAAILA+R +AN++SWE + +K+ ++ +PVA
Sbjct: 411 VHTLRKEVDHCVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVA 470
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I +L N ITLLL +N D + +T V+VD+ LSA
Sbjct: 471 GLIDKLNFERNCITLLL----------------SNNLDDMDEDEKTAPVEKVEVDIALSA 514
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ANA+R++++K+ KQ+KTI + +KA K+AEKKT+ L +T+ I RKV+WFEK
Sbjct: 515 HANARRWYEMKKKQESKQEKTITAHDKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEK 574
Query: 303 FYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
F WFISSENYL G A +++ + K D
Sbjct: 575 FNWFISSENYLIISGRDAQQNELIVKRYMSKGD 607
>gi|356558107|ref|XP_003547349.1| PREDICTED: nuclear export mediator factor NEMF homolog [Glycine
max]
Length = 1119
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 168/303 (55%), Gaps = 30/303 (9%)
Query: 44 EDYIVA---NMEFHPMILQPGS--------TPSVMRQHQGFPIQEFESFAAAVDEFFSTA 92
E YI+ N+ I QPGS P ++ Q + +FE+F AA+DEF+S
Sbjct: 288 EGYILMQNKNLGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDEFYSKI 347
Query: 93 ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
ES + + + +E A +KL +++D E R+ L + + AELI N E VDAAIL
Sbjct: 348 ESQRAEQQQKSKENSAAQKLNKIRQDQENRVHVLRKEADHCVKMAELIEYNLEDVDAAIL 407
Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSW 212
A+R +A ++W+D+ VK+ ++ +PVA +I +L L N + LLL
Sbjct: 408 AVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLERNCMNLLL------------- 454
Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
+N D D +T V+VDL LSA+ANA+R+++ K+ KQ+KT+ + EKA K
Sbjct: 455 ---SNNLDEMDDDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQEKTVTAHEKAFK 511
Query: 273 SAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWV 329
+AE+KT+ L +T+ +I+ RKV+WFEKF WFISSENYL G A +++ +
Sbjct: 512 AAERKTRLQLNQEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNELIVKRYMS 571
Query: 330 KAD 332
K D
Sbjct: 572 KGD 574
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 44/63 (69%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 616 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 675
Query: 377 SIS 379
S+
Sbjct: 676 SLG 678
>gi|349605644|gb|AEQ00813.1| Serologically defined colon cancer antigen 1-like protein, partial
[Equus caballus]
Length = 388
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 89/179 (49%), Positives = 115/179 (64%), Gaps = 14/179 (7%)
Query: 26 NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
N S +GY++QK+E +P+ E I+ EFHP + QH P EFES
Sbjct: 219 NFSGKGYIIQKREMKPSLEVDKPTQDILTYEEFHPFLFS---------QHSQCPYIEFES 269
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 270 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELI 329
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LL
Sbjct: 330 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLL 388
>gi|113911846|gb|AAI22665.1| SDCCAG1 protein [Bos taurus]
Length = 443
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 88/178 (49%), Positives = 116/178 (65%), Gaps = 14/178 (7%)
Query: 28 SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+ +GY++QK+E +P+ E + I+ EFHP + QH P EFESF
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N + VD AI +R +ANQ+ W +I VK+AQ DPVA+ IK+LKL NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411
>gi|12837616|dbj|BAB23886.1| unnamed protein product [Mus musculus]
Length = 438
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 89/181 (49%), Positives = 116/181 (64%), Gaps = 16/181 (8%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF VDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKPVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410
Query: 200 R 200
R
Sbjct: 411 R 411
>gi|116193227|ref|XP_001222426.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
gi|88182244|gb|EAQ89712.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
Length = 1115
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 189/377 (50%), Gaps = 71/377 (18%)
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
I F+ + VD FFS+ E K++ + ++E A +KLE +RD R+ L+ Q+++
Sbjct: 317 ILSFDGYNKTVDNFFSSLEGQKLESRLQEREITAKRKLEAARRDQAQRIEGLQDVQMLNL 376
Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNIN 193
KA + N E V A+ A+ I + W DI V++ Q+ ++PVA +IK +KL+ +
Sbjct: 377 RKAAAVEANIERVQEAMDAVNGLIQQGMDWVDINKLVEREQKQHNPVAEMIKLPMKLHES 436
Query: 194 HITLLLRVHKERSNIKDSWVLHHDNADPFADSS---ETKKPS---LVDVDLDLSAYANAK 247
ITLLL +E +++ +D + AD + ++K P +D++L LS NA+
Sbjct: 437 VITLLLGEEEEEGKVEEEMDFDYDTDEETADDAAEEKSKGPDKRLAIDINLKLSPRNNAR 496
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
++D KR+AA KQ+KT+Q +E ALK+AE+K + LK Q + RK WFEKF W
Sbjct: 497 YYYDQKRTAADKQEKTVQRSEIALKNAEQKIAEDLKKGLKQEKPILQPIRKQMWFEKFTW 556
Query: 306 FISSENYL--------------------GNV----------------------------- 316
F+SS+ YL G+V
Sbjct: 557 FVSSDGYLVLGGRDAQQNEILYKRYLRKGDVYVHADMHGASSVVIKNNPKTPDAPIPPST 616
Query: 317 ---AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
A + V ++ W V ADQVSK+AP+GE+L GSFM+RGK+N PP L
Sbjct: 617 LAQAGNLSVCCSSAWDSKAAMGAWWVNADQVSKSAPSGEYLPVGSFMVRGKRNLLPPSLL 676
Query: 364 AMGISFLFKLEESSISR 380
+G LFK+ E S SR
Sbjct: 677 MLGFGLLFKISEESKSR 693
>gi|12857277|dbj|BAB30959.1| unnamed protein product [Mus musculus]
Length = 415
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/181 (49%), Positives = 116/181 (64%), Gaps = 16/181 (8%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +GY++QK+E +P+ +ED I+ EFHP + QH P EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF AVDEF+S E KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL N +T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNQVTMLL 410
Query: 200 R 200
R
Sbjct: 411 R 411
>gi|171684415|ref|XP_001907149.1| hypothetical protein [Podospora anserina S mat+]
gi|170942168|emb|CAP67820.1| unnamed protein product [Podospora anserina S mat+]
Length = 1070
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 122/400 (30%), Positives = 186/400 (46%), Gaps = 81/400 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+F P + P QG F+ + VDEFFS+ E K++ K ++E A +K
Sbjct: 296 DFQPFL------PKQFEDDQGLTTLSFDGYNKTVDEFFSSIEGQKLESKLQEREATAKRK 349
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ ++D R+ L Q ++ KA I N E V A+ A+ + + W +I V
Sbjct: 350 LDAARQDQAKRIEGLVGFQTLNLRKAAAIEANIERVQEAMDAVNGLLEQGMDWVNINKLV 409
Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLL-RVHKERSNIKDSWVLHHDNADPFADSS-- 226
++ Q +PVA IIK L +N+ + ITLLL +E + + ++D + D++
Sbjct: 410 EREQAQGNPVAEIIK-LPVNLAESTITLLLGEEEEEEAGEDEDMEFNYDTDEEVVDAAPE 468
Query: 227 --ETKKPS---LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
+ K P +D++L LS + NA+ +++ KR+AA K++KT+ + ALKSAE+K +
Sbjct: 469 PEKAKGPDKRLAIDINLKLSVWNNAREYYEQKRTAADKEKKTVAQSVIALKSAEQKITED 528
Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK------------------ 321
L+ Q + RK WFEKF WFISS+ YL DA+
Sbjct: 529 LRKGLKQEKPVLQLIRKQMWFEKFVWFISSDGYLVLGGRDAQQNEILYKRYLKKGDVYVH 588
Query: 322 -----------------------------------VVTNAWWVKA---------DQVSKT 337
++AW KA DQVSK+
Sbjct: 589 ADMHGASTVIIKNSPKTPDAPIPPSTLAQAGSLSVCCSSAWDSKAAMGAWWVNADQVSKS 648
Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
APTGE+L GSFM+RGKKN PP L +G +F++ E S
Sbjct: 649 APTGEYLPAGSFMVRGKKNPLPPALLMLGFGLMFRISEES 688
>gi|154281559|ref|XP_001541592.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150411771|gb|EDN07159.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 1177
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 135/455 (29%), Positives = 211/455 (46%), Gaps = 102/455 (22%)
Query: 20 VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
V+ N+N S + GY+V K E + + + VA ++FHP
Sbjct: 257 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTVPSKSSNVAYIDFHPF------E 310
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P G I F++F AVDE+FS+AES K++ + ++E A +KLE ++D + R+
Sbjct: 311 PKQFESEPGTSILRFDTFNKAVDEYFSSAESQKLESRLTEREEIAKRKLEAAQKDQDKRV 370
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L++ Q + KA+ I N V+ AI A+ IA + W +I ++ Q +PVA+
Sbjct: 371 GVLKEAQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQGRQNPVAN 430
Query: 184 IIK-QLKLNINHITLLLRVHKERSN-----IKDSWVLHHDNADPFADSSETKKPSL---- 233
+IK LKL N +TLLL E ++ V + + D KKP +
Sbjct: 431 VIKLPLKLYENAVTLLLGEPTENEEPMDESEDEAEVEEEEEQESSEDEDSGKKPGVSKKP 490
Query: 234 ------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-- 285
+D+DL +S +ANA+++++ K+ AA K++KT+ ST++A+KS +KK LK
Sbjct: 491 RQPLLSIDIDLGISPWANARQYYEQKKVAAVKEKKTLNSTKEAIKSTKKKVAADLKQALK 550
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNV--------- 316
Q + R +WFEKF +F+SS+ YL G+V
Sbjct: 551 QEKPVLRPTRTPFWFEKFIFFLSSDGYLVLGGRDVQQTEILYRRHLKRGDVFVHADVQGA 610
Query: 317 -----------------------AWDAKVVTNAWW----------VKADQVSKTAPTGEF 343
A + V T+ W A+QVSKT P GE+
Sbjct: 611 IPVIVKNKPGTLDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWANANQVSKTTPLGEY 670
Query: 344 LTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
L TG F+I G+KN PP QL +G + +F++ SI
Sbjct: 671 LVTGGFVICGEKNQLPPAQLLLGFAVMFQISGESI 705
>gi|357448763|ref|XP_003594657.1| Serologically defined colon cancer antigen-like protein [Medicago
truncatula]
gi|355483705|gb|AES64908.1| Serologically defined colon cancer antigen-like protein [Medicago
truncatula]
Length = 1146
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 106/283 (37%), Positives = 158/283 (55%), Gaps = 29/283 (10%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH--- 119
P ++ Q + +FE+F A+DEF+S ES + + + +E AL+KL ++ D
Sbjct: 318 CPILLNQFKSRDHTKFETFDLALDEFYSKIESQRSEQQHTAKENSALQKLNKIRNDQVGT 377
Query: 120 -------ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E R+ L + + AELI N E VDAAILA+R +A +SW+D+ VK
Sbjct: 378 HVQTSTIENRVHTLRKEADNCIKMAELIEYNLEDVDAAILAVRVSLAKGMSWDDLARMVK 437
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ ++ +PVA +I +L L N +TLLL +N D D +T
Sbjct: 438 EEKKAGNPVAGLIDKLHLERNCMTLLLS----------------NNLDEMDDDEKTLPAD 481
Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
V+VDL LSA+ANA+R+++LK+ KQ+KTI + EKA K+AE+KT+ L +T+ +I+
Sbjct: 482 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHEKAFKAAERKTRLQLNQEKTVASIS 541
Query: 293 KARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
RKV+WFEKF WFISSENYL G A +++ + K D
Sbjct: 542 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 584
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 43/63 (68%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKN+ PP L MG LF+L+ES
Sbjct: 626 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNYLPPHPLIMGFGLLFRLDES 685
Query: 377 SIS 379
S+
Sbjct: 686 SLG 688
>gi|224101503|ref|XP_002312307.1| predicted protein [Populus trichocarpa]
gi|222852127|gb|EEE89674.1| predicted protein [Populus trichocarpa]
Length = 796
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/284 (37%), Positives = 160/284 (56%), Gaps = 28/284 (9%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
EF P++L MR+H +F++F AA+DEF+S ES K + + +E A++K
Sbjct: 4 EFCPLLLNQFR----MREHV-----KFDAFDAALDEFYSKIESQKSEHQQKTKEGSAIQK 54
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L ++ D E R+ L + + AELI N E V++AILA+R +A + WED+ V
Sbjct: 55 LNKIRLDQENRVEMLRKEVDHSVKMAELIEYNLEDVNSAILAVRVALAKGMGWEDLARMV 114
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
K ++ +PVA +I +L N +TLLL +N D D +T
Sbjct: 115 KDEKKAGNPVAGLIDKLHFEKNCMTLLLS----------------NNLDEMDDDEKTFPV 158
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
V+VDL LSA+ANA+R+++LK+ KQ+KT+ + EKA K+AEKKT+ L +++ I
Sbjct: 159 DKVEVDLALSAHANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLSQEKSVATI 218
Query: 292 NKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
+ RKV+WFEKF WFISSENYL G A +++ + K D
Sbjct: 219 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYVSKGD 262
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 44/63 (69%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 304 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 363
Query: 377 SIS 379
S+
Sbjct: 364 SLG 366
>gi|63054438|ref|NP_588145.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe
972h-]
gi|48475020|sp|Q9USN8.2|YJY1_SCHPO RecName: Full=Uncharacterized protein C132.01c
gi|157310510|emb|CAA22870.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe]
Length = 1021
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 131/398 (32%), Positives = 191/398 (47%), Gaps = 77/398 (19%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP QP R+ FP ++ VDEFFS+ E+ K+ +A + A ++
Sbjct: 315 DFHP--FQPLQLVQANRKCMEFP-----TYNECVDEFFSSIEAQKLKKRAHDRLATAERR 367
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE+ K D +L +L+ Q +A+ I N E V+A I I + + W DIE ++
Sbjct: 368 LESAKEDQARKLQSLQDAQATCALRAQAIEMNPELVEAIISYINSLLNQGMDWLDIE-KL 426
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKK 230
Q+Q+ PVA+ I+ LKL N +T+ L + N +S D+ D D ++ K+
Sbjct: 427 IQSQKRRSPVAAAIQIPLKLIKNAVTVFLPNPESVDNSDESSETSDDDLDDSDDDNKVKE 486
Query: 231 PSL------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD 284
+ V++DL L A+ANA++ ++L+R A K+ KT ++ KALKS ++K +Q LK
Sbjct: 487 GKVSSKFIAVELDLSLGAFANARKQYELRREALIKETKTAEAASKALKSTQRKIEQDLKR 546
Query: 285 VQT--MTNINKARKVYWFEKFYWFISSENYLGNVAWDAK--------------------- 321
T I RK ++FEKF+WFISSE YL DA+
Sbjct: 547 STTADTQRILLGRKTFFFEKFHWFISSEGYLVLGGRDAQQNELLFQKYCNTGDIFVCADL 606
Query: 322 ------VVTN------------------------AW--------W-VKADQVSKTAPTGE 342
+V N AW W V+ D+VSK APTGE
Sbjct: 607 PKSSIIIVKNKNPHDPIPPNTLQQAGSLALASSKAWDSKTVISAWWVRIDEVSKLAPTGE 666
Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
L TGSF IR KKN+ PP L MG L++L+E S R
Sbjct: 667 ILPTGSFAIRAKKNYLPPTVLIMGYGILWQLDEKSSER 704
>gi|297795761|ref|XP_002865765.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
gi|297311600|gb|EFH42024.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
Length = 1080
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 171/311 (54%), Gaps = 33/311 (10%)
Query: 30 QGYLVQKKEKRPN---SEDYIVANM--EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+GY++ +K+ N SE V M EF ++L Q + ++FE+F AA
Sbjct: 285 EGYILMQKQILANDTPSESGGVKKMYDEFCSILLN---------QFKSRVYEKFETFDAA 335
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
+DEF+S ES + + + +E A +KL +++D E R+ L++ AELI N
Sbjct: 336 LDEFYSKIESQRSEQQQKAKEDSASQKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNL 395
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
E VDAAILA+R +A + W+D+ VK+ ++ +PVA +I +L L N +TLLL
Sbjct: 396 EDVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGLIDKLYLEKNCMTLLL----- 450
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
+N D D +T V+VDL LSA+ NA+R++++K+ KQ+KT+
Sbjct: 451 -----------CNNLDEMDDDEKTLPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTV 499
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAK 321
+ EKA ++AEKKT+ L + + I+ RKV+WFEKF WFISSENYL G A +
Sbjct: 500 SAHEKAFRAAEKKTRHQLSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 559
Query: 322 VVTNAWWVKAD 332
++ + K D
Sbjct: 560 MIVKRYMSKGD 570
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 43/63 (68%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QV+KTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 612 AWDSKIVTSAWWVYPHQVTKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 671
Query: 377 SIS 379
S+
Sbjct: 672 SLG 674
>gi|290975413|ref|XP_002670437.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
gi|284083996|gb|EFC37693.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
Length = 1146
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 117/376 (31%), Positives = 184/376 (48%), Gaps = 75/376 (19%)
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
EF+SF + DEFFS E+ KI+ + E KK+ VKR+ E +L L+ + +
Sbjct: 397 EFDSFDRSADEFFSAIEAKKIESQKSSIENTVEKKMSKVKREQELKLQELQASFDKYETI 456
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
A LI + E VD AI I +A SWE I+ +K+ R DP+A++I +LKL + IT
Sbjct: 457 ATLIETHYEIVDQAIQVICSALAQSQSWETIKQIIKE-HRDVDPIAAMIHKLKLESSQIT 515
Query: 197 LLL---------------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
+ L E ++ +D + + +D + ++P +D+D+ L+
Sbjct: 516 VTLPPPSIDDDDEDEFEYEESDEENDDEDEESDDEEKKEKKSDKKKKEEPMRIDIDISLT 575
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFE 301
A+ANA +++ L++ + + ++K +++KA+K E+KT ++ K Q + I RK +WFE
Sbjct: 576 AHANAAKYYSLRKKSGENKEKAAFASKKAIKKTEQKTLESAKKSQIKSEITIRRKRFWFE 635
Query: 302 KFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTGE----- 342
KFYWFI+SENYL G A ++V + ++ AD ++ PTGE
Sbjct: 636 KFYWFITSENYLVLGGRDAQQNELVVKRYMRKGDIYIHADVHGASSCIIKNPTGEPIPPL 695
Query: 343 ----------------------------------------FLTTGSFMIRGKKNFFPPCQ 362
+LT GSFMIRGKKNF PP
Sbjct: 696 SLQEAGMFCVCRSVAWDNKVMSSAYWVYDHQVSKTAPTGEYLTVGSFMIRGKKNFLPPSP 755
Query: 363 LAMGISFLFKLEESSI 378
L MG + +FK++ES I
Sbjct: 756 LVMGFAVMFKVDESCI 771
>gi|15240582|ref|NP_199804.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
gi|8777424|dbj|BAA97014.1| unnamed protein product [Arabidopsis thaliana]
gi|332008489|gb|AED95872.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
Length = 1080
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 170/311 (54%), Gaps = 33/311 (10%)
Query: 30 QGYLVQKKEKRPN---SEDYIVANM--EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+GY++ +K+ N SE V M EF ++L Q + ++FE+F AA
Sbjct: 285 EGYILMQKQILANDTTSESGGVKKMYDEFCSILLN---------QFKSRVYEKFETFDAA 335
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
+DEF+S ES + + + +E A KL +++D E R+ L++ AELI N
Sbjct: 336 LDEFYSKIESQRSEQQQKAKEDSASLKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNL 395
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
E VDAAILA+R +A + W+D+ VK+ ++ +PVA +I +L L N +TLLL
Sbjct: 396 EDVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGVIDRLYLEKNCMTLLL----- 450
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
+N D D +T V+VDL LSA+ NA+R++++K+ KQ+KT+
Sbjct: 451 -----------CNNLDEMDDDEKTVPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTV 499
Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAK 321
+ EKA ++AEKKT+ L + + I+ RKV+WFEKF WFISSENYL G A +
Sbjct: 500 SAHEKAFRAAEKKTRHQLSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 559
Query: 322 VVTNAWWVKAD 332
++ + K D
Sbjct: 560 MIVKRYMSKGD 570
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 43/63 (68%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QV+KTAPTGE+LT GSFMIRGKKNF PP L MG LF+L+ES
Sbjct: 612 AWDSKIVTSAWWVYPHQVTKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 671
Query: 377 SIS 379
S+
Sbjct: 672 SLG 674
>gi|170576547|ref|XP_001893673.1| Serologically defined colon cancer antigen 1 [Brugia malayi]
gi|158600188|gb|EDP37492.1| Serologically defined colon cancer antigen 1, putative [Brugia
malayi]
Length = 307
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/268 (36%), Positives = 130/268 (48%), Gaps = 82/268 (30%)
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
QA P+A+ I L LN N +TLLL DP+ + KK
Sbjct: 4 QACEAGSPIAASIVGLNLNSNQMTLLL-------------------GDPYRPEIDPKK-- 42
Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
V +D+ LS+Y NA++ K++A +K+QKTI ++ KALKS + K K+TLK V T +
Sbjct: 43 -VTIDIALSSYQNARKLHTEKKAAQQKEQKTICASSKALKSTKMKMKETLKVVHTKAEVM 101
Query: 293 KARKVYWFEKFYWFISSENYL--------------------GNVAWDAKV---------- 322
K R V WFEKF+WF+SSENYL G++ A V
Sbjct: 102 KKRHVMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRPGDIYMHADVRGASSIIIRN 161
Query: 323 ---------------------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIR 352
++AW WV QVS+TAPTGE+LT GSFMIR
Sbjct: 162 KLGGGDVPPRTLNEAATMAISYSSAWEAKITSSAWWVHQHQVSRTAPTGEYLTPGSFMIR 221
Query: 353 GKKNFFPPCQLAMGISFLFKLEESSISR 380
GKKN+ P CQL MG +F+L+E S+ R
Sbjct: 222 GKKNYLPTCQLQMGFGVMFQLDEESLER 249
>gi|347828082|emb|CCD43779.1| similar to serologically defined colon cancer antigen 1
[Botryotinia fuckeliana]
Length = 674
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/330 (34%), Positives = 173/330 (52%), Gaps = 33/330 (10%)
Query: 28 SSQGYLVQKKE--KRPNSEDYI-----VANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
+++GY+V KK+ + P+ E+ + +FHP P + EFE
Sbjct: 336 TAKGYIVAKKKDPQTPSDENETDIRKGLLYDDFHPF------KPKQFQDDPSLVFLEFEG 389
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F VDEFFS+ E K++ K ++E+ A KK++ + + RL L++ Q +++ KA +
Sbjct: 390 FNKTVDEFFSSIEGQKLESKLEEREKQAQKKIQAARNEQAKRLGGLQEIQALNERKASAL 449
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N E V A A+ IA + W +I +++ Q+ N+PVAS+IK LKL N +T+LL
Sbjct: 450 QANVERVQEATDAVNGLIAQGMDWFEIGRLIEREQKFNNPVASMIKLPLKLEENTVTILL 509
Query: 200 RVHKERSNIK-----DSWVLHHDNADPFADS-------SETKKPSLVDVDLDLSAYANAK 247
DS V ++ D A + ++T+ P +D+DL LS +ANA+
Sbjct: 510 DEEAFDEEEDSTYETDSDVSESEDEDDTAKTNKKKEKVADTRIP--IDIDLALSPWANAR 567
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
+FD KRSAA K+ KT+QS+ KALKS E K Q LK Q + RK WFEKF W
Sbjct: 568 NYFDQKRSAASKEDKTLQSSSKALKSTEAKIAQDLKKGLKQEKAILRPVRKQMWFEKFIW 627
Query: 306 FISSENYL---GNVAWDAKVVTNAWWVKAD 332
FISS+ YL G A ++++ + K D
Sbjct: 628 FISSDGYLVLAGKDAQQSEILYKRYLKKGD 657
>gi|308808798|ref|XP_003081709.1| zinc knuckle (CCHC-type) family protein (ISS) [Ostreococcus tauri]
gi|116060174|emb|CAL56233.1| zinc knuckle (CCHC-type) family protein (ISS), partial
[Ostreococcus tauri]
Length = 1090
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 171/324 (52%), Gaps = 28/324 (8%)
Query: 63 TPSVMRQHQGFPIQEFE---------SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
P ++Q++G + +E +F VDE+F+ +S + L+ + E A+ +LE
Sbjct: 426 APFELKQNEGRRTKTYELPKGLDPALAFDHYVDEYFNELDSQSVILQRRKAEAQAIARLE 485
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
RD + R+ LE+ + +++++A LI N E+VD AI A+ +A+ +SW+++EA +K+
Sbjct: 486 KTLRDQKNRVEQLERERELEEQRAVLIEYNHEAVDVAIEAVNSALASGMSWDELEAMIKE 545
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
+R +PVA +IK + L N IT+ L H + L D D+ KK
Sbjct: 546 ERRLGNPVAGMIKSMDLANNEITITLENHLDE--------LGEDE-----DALGKKKRVA 592
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V VDL LSA+ANA F K+ A K +KT+ + KA+ +AE K K ++ + +
Sbjct: 593 VSVDLGLSAHANASVRFAAKKKNADKFEKTLNAQNKAVAAAESKMKSAMERAANVVVATR 652
Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVKADQVSKTAP-TGEFLTTGSFMI- 351
AR+ WFEKF+WFI+ W + +AWW+ ++VSK +P G+ L G +
Sbjct: 653 ARQPLWFEKFHWFITXXXX--XXXW--RGCDSAWWIPIERVSKISPVAGDALPPGVVHVA 708
Query: 352 RGKKNFFPPCQLAMGISFLFKLEE 375
+K F P QL MG +F + E
Sbjct: 709 HAEKQFLPHTQLVMGFGLMFVVSE 732
>gi|320581674|gb|EFW95893.1| hypothetical protein HPODL_2176 [Ogataea parapolymorpha DL-1]
Length = 940
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 127/447 (28%), Positives = 201/447 (44%), Gaps = 91/447 (20%)
Query: 7 VSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEK--RPNSEDYIVANME-----FHPMILQ 59
V++ ++ C + L + GY++ K +P ED N+E FHP +
Sbjct: 238 VTAAVNSCEQEFYRLTNLPAGELSGYIIGKHNPFFKP-EEDASYDNLEYVYDEFHP--FE 294
Query: 60 PGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
P V ++ + ++E + + +D+FFST ES K LK QQ+ +A K+L+ VK +H
Sbjct: 295 P-----VHKKKENTRVEEVKGYNRTLDKFFSTLESSKAVLKIQQQQANAAKRLQTVKDEH 349
Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
T+L LE+ Q ++ K ELI + E ++ +++ + Q+ W +IE V Q+ +
Sbjct: 350 MTKLQRLEEQQAINYRKGELITFHSEQIEQCKQSVQALLDQQMDWTNIEKLVAMEQKRRN 409
Query: 180 PVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
P+A++IK L LN+ N IT+LL D ++D ++ P V +D
Sbjct: 410 PIANMIK-LPLNLAKNEITVLL---------PDIEEQSDSDSDSDSEEKRKSGPVAVAID 459
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM----TNINK 293
L LSAYANA R+FD R+A KQ KT S A+K+ E+ +Q LK +Q + + +
Sbjct: 460 LSLSAYANATRYFDAMRAALDKQNKTKNSASIAIKNTERTIQQDLKRMQKKSQEPSGLKQ 519
Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------------------VKAD 332
R +WFEKF+WFI+S+N+L D V ++ VK
Sbjct: 520 IRAKFWFEKFWWFITSDNHLCIAGRDDTQVDLIYYRYFDKNNDVLVSNDLDGLKVIVKNP 579
Query: 333 QVSKTAPTGEFLTTGSFM---------------------------------------IRG 353
+K P L G F I+G
Sbjct: 580 FKNKDIPPSTLLQAGIFSLSASKAWDNKMVTSPWMVKGTQVSKKDFDGSIVPAGMLNIQG 639
Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
+K F PPCQL MG L+ +E + +
Sbjct: 640 EKTFLPPCQLVMGFGLLWLGDEETTRK 666
>gi|320040092|gb|EFW22026.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 1136
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/299 (33%), Positives = 165/299 (55%), Gaps = 20/299 (6%)
Query: 29 SQGYLVQKKEKRPNSEDYIVAN--------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
++GY+V + E +P+ +++HP P I F+S
Sbjct: 264 TRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPF------APRQFADGNDISILTFDS 317
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDE++S+ E+ K++ + ++E +KLE KRDHE R+ AL+Q Q + KAE I
Sbjct: 318 FNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEIHTRKAEAI 377
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N V+ + A+ IA + W +I ++ Q +PVA +IK LKL N +T+LL
Sbjct: 378 ATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLYENTVTVLL 437
Query: 200 RVHKERSNIKDSWVLHHDNADPFADS-SETKKPSL--VDVDLDLSAYANAKRFFDLKRSA 256
+ DS ++ + ++ ++ ++P + VD+DL L+ +ANA +++D K++A
Sbjct: 438 PEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQYYDQKKTA 497
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
A K++KTI+++++ALKSAEKK LK Q + AR +WFEKFY+FISS+ YL
Sbjct: 498 AVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPVLRPARIPFWFEKFYFFISSDGYL 556
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 31/62 (50%), Positives = 43/62 (69%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW++K + AWWV ADQVSKT P+GE+L TG +IRG KN P QL +G + +F++
Sbjct: 633 AWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPGQLILGFAVMFQISPE 692
Query: 377 SI 378
S+
Sbjct: 693 SV 694
>gi|68533893|gb|AAH99277.1| LOC733300 protein [Xenopus laevis]
Length = 453
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 85/181 (46%), Positives = 112/181 (61%), Gaps = 16/181 (8%)
Query: 26 NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N + +G+++QK+EK+P+ SED I N EFHP + QH E +
Sbjct: 241 NFNGKGFIIQKREKKPSLEPDKASED-IFTNEEFHPFLFA---------QHANSTYIELD 290
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
SF VDEFFS E KID+KA+QQE+ ALKKL NV++DHE RL +L+ Q DK K EL
Sbjct: 291 SFNKTVDEFFSKLEGQKIDIKALQQEKQALKKLGNVRKDHEHRLESLQYAQDADKAKGEL 350
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
I N + VD AI +R +ANQ+ W +I VK+AQ DPVA IK+LKL NHIT++L
Sbjct: 351 IEMNLDIVDRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMML 410
Query: 200 R 200
+
Sbjct: 411 K 411
>gi|425773025|gb|EKV11400.1| hypothetical protein PDIG_50370 [Penicillium digitatum PHI26]
gi|425782195|gb|EKV20118.1| hypothetical protein PDIP_19610 [Penicillium digitatum Pd1]
Length = 1107
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/305 (35%), Positives = 163/305 (53%), Gaps = 23/305 (7%)
Query: 26 NVSSQGYLVQKKEKRPNSEDYIVANM------EFHPMILQPGSTPSVMRQHQGFPIQEFE 79
S GY+V K++ RP E + +FHP P G I EFE
Sbjct: 260 GASHPGYIVAKEDTRPIPEGETSSKAAGLLYEDFHPF------KPRQFENKPGIKILEFE 313
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
F A VDE+FS+ ES +++ + ++E A KKLE+V+ +H+ R+ L+ Q + KA
Sbjct: 314 RFNATVDEYFSSLESQRLESRLTEREEAAKKKLESVRFEHKKRIDELKNVQELHIRKANA 373
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
I +N V A+ A+ +A + W +I ++ Q +PVA IIK LKL N ++LL
Sbjct: 374 IQDNVYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQDRGNPVAQIIKLPLKLYENTVSLL 433
Query: 199 L-------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFF 250
L +E S+ +S + AD S+E + L +D+DL LS +ANA +++
Sbjct: 434 LGEAGDDEDEEEEFSSSDESDSDSENEADQETSSAERESKLLTIDIDLGLSPWANASQYY 493
Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
D K+ A++K+Q+T QS+ KALKS EKK LK + + +AR +WFEKF +FIS
Sbjct: 494 DQKKQASEKEQRTTQSSTKALKSHEKKVTTELKRGLKKEKQVLRQARTPFWFEKFVFFIS 553
Query: 309 SENYL 313
SE YL
Sbjct: 554 SEGYL 558
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 1/65 (1%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGE-FLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
+ AWD+K V +AWW A QVSK A G + TG F I+G+KNF P QL +G +F++
Sbjct: 638 STAWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKGEKNFLAPSQLVLGFGIMFQV 697
Query: 374 EESSI 378
+ S+
Sbjct: 698 SQESV 702
>gi|261195108|ref|XP_002623958.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239587830|gb|EEQ70473.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 1150
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 190/411 (46%), Gaps = 79/411 (19%)
Query: 31 GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
GY+V K E + + + A ++FHP P I +F++
Sbjct: 267 GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPF------EPKQFENQADMAILKFDT 320
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDE+FS+ E K++ + ++E A +KLE ++D E R+ L++ + + KA+ I
Sbjct: 321 FNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQAI 380
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N V+ A+ A+ IA + W +I ++ Q +PVA +IK LKL N +TLLL
Sbjct: 381 EANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLLL 440
Query: 200 RVHKERSNIKDSWVLHHDNADPFADS------SETKKPSL--------VDVDLDLSAYAN 245
E D ++ + D +KKP +D+DL +S +AN
Sbjct: 441 GEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWAN 500
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
A+++++ K++AA K++KT+ S +KA+KS EKK LK Q + R +WFEKF
Sbjct: 501 ARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEKF 560
Query: 304 YWFISSENYLGNVAWDAK--------------VVTNA-------WWVKADQVSKTAP--- 339
+FISS+ YL DA+ V +A ++VK + AP
Sbjct: 561 IYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIPP 620
Query: 340 ----------------------TGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
GE+L TG F+IRG+KN PP QL +G +
Sbjct: 621 GTLSQAGNLCVATSSAWHSKAVMGEYLETGGFVIRGEKNQLPPAQLLLGFA 671
>gi|50555916|ref|XP_505366.1| YALI0F13277p [Yarrowia lipolytica]
gi|49651236|emb|CAG78173.1| YALI0F13277p [Yarrowia lipolytica CLIB122]
Length = 1134
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 174/380 (45%), Gaps = 75/380 (19%)
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
+ FE + VD +FST ES + L+ QE+ A K+L + + + R+ L+Q Q
Sbjct: 328 VDTFEGYNLTVDRYFSTVESTRYSLRVNAQEQIAEKRLNAARNETKKRVDGLQQVQDRSI 387
Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
+ V+ AI A++Q + W+D+E + ++ +PVA ++ + L N
Sbjct: 388 LMGTALQTYAGRVEEAIAAVKQLQDQGMDWKDMEHLIDLEKKKGNPVAQMVSSMNLEKNR 447
Query: 195 ITLLL-----------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
+TL+L + ++ ++ +E+K V+V+LDL+AY
Sbjct: 448 VTLILPNPDVEDESDSDSDSDMDETDSEGESEESGSESDSNKNESKTLK-VEVNLDLTAY 506
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFE 301
ANA +FD+K+ AA+KQ+KT +++ ALKSAE+K K LK Q + R YWFE
Sbjct: 507 ANANNYFDIKKVAAQKQEKTEKNSATALKSAEQKVKLDLKRSLAQEQHALRPMRPSYWFE 566
Query: 302 KFYWFISSENYL---GNVAWDAKVVTNAWWVKAD-----------------QVSKTAP-- 339
KF+WF SS+ YL G A +++ ++ K D + TAP
Sbjct: 567 KFWWFFSSDGYLVIGGKDAQQNEMLYKRYFRKGDAYVHAEIQGASTVIVKNHLGPTAPLP 626
Query: 340 ---------------------------------------TGEFLTTGSFMIRGKKNFFPP 360
+GEFL TGSFMIRGKKNF PP
Sbjct: 627 PSTLSQAGSLSICTSKAWDSKVLISAWWVEHGQVSKSAPSGEFLPTGSFMIRGKKNFLPP 686
Query: 361 CQLAMGISFLFKLEESSISR 380
L +G++ L+ +E S ++
Sbjct: 687 TSLDVGLAILWIADEDSTAK 706
>gi|239610682|gb|EEQ87669.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 1131
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 190/411 (46%), Gaps = 79/411 (19%)
Query: 31 GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
GY+V K E + + + A ++FHP P I +F++
Sbjct: 267 GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPF------EPKQFENQADMAILKFDT 320
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AVDE+FS+ E K++ + ++E A +KLE ++D E R+ L++ + + KA+ I
Sbjct: 321 FNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQAI 380
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
N V+ A+ A+ IA + W +I ++ Q +PVA +IK LKL N +TLLL
Sbjct: 381 EANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLLL 440
Query: 200 RVHKERSNIKDSWVLHHDNADPFADS------SETKKPSL--------VDVDLDLSAYAN 245
E D ++ + D +KKP +D+DL +S +AN
Sbjct: 441 GEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWAN 500
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
A+++++ K++AA K++KT+ S +KA+KS EKK LK Q + R +WFEKF
Sbjct: 501 ARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEKF 560
Query: 304 YWFISSENYLGNVAWDAK--------------VVTNA-------WWVKADQVSKTAP--- 339
+FISS+ YL DA+ V +A ++VK + AP
Sbjct: 561 IYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIPP 620
Query: 340 ----------------------TGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
GE+L TG F+IRG+KN PP QL +G +
Sbjct: 621 GTLSQAGNLCVATSSAWHSKAVMGEYLETGGFVIRGEKNQLPPAQLLLGFA 671
>gi|255941192|ref|XP_002561365.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211585988|emb|CAP93725.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1160
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 108/311 (34%), Positives = 165/311 (53%), Gaps = 26/311 (8%)
Query: 28 SSQGYLVQKKEKRPNSEDYIVANM------EFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
S GY+V K++ RP E + +FHP P G I EFE F
Sbjct: 315 SHPGYIVAKEDTRPVPEGETASKAPALLYEDFHPF------KPRQFENKPGTKILEFERF 368
Query: 82 AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
A VDE+FS+ ES +++ + ++E A KKLE+V+ +H+ R+ L+ Q + KA+ I
Sbjct: 369 NATVDEYFSSLESQRLESRLTEREEAAKKKLESVRSEHKKRIDELKNVQEIHIRKADAIQ 428
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL- 199
+N V A+ A+ +A + W +I ++ Q +PVA IK LKL N +TL+L
Sbjct: 429 DNVYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQGRGNPVAQTIKLPLKLYENTVTLVLG 488
Query: 200 ------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
+E S+ + ++ A + K +D+DL LS +ANA +++D K
Sbjct: 489 EAGDDEDEDEEFSSSDEESDSENEAEQETARAERESKLLTIDIDLGLSPWANASQYYDQK 548
Query: 254 RSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
+ A++K+Q+T QS+ KALKS EKK K+ LK + + + +AR +WFEKF +FISS
Sbjct: 549 KQASEKEQRTTQSSAKALKSHEKKVTTDLKRGLKKEKQV--LRQARTPFWFEKFIFFISS 606
Query: 310 ENYLGNVAWDA 320
E YL A DA
Sbjct: 607 EGYLVIGARDA 617
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/63 (46%), Positives = 40/63 (63%), Gaps = 1/63 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGE-FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
AWD+K V +AWW A QVSK A G + TG F I+G+KNF P QL +G +F++ +
Sbjct: 676 AWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKGEKNFLAPSQLVLGFGIMFQISQ 735
Query: 376 SSI 378
S+
Sbjct: 736 ESV 738
>gi|156101618|ref|XP_001616502.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805376|gb|EDL46775.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 2067
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 129/437 (29%), Positives = 197/437 (45%), Gaps = 94/437 (21%)
Query: 26 NVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
V +G + + KE+R E +A EF P+IL + + F+ F V
Sbjct: 421 GVGGEGEM-EAKEERSGEE---IAFTEFSPIILNNHKNKV---EENKLEVVHFDDFNKCV 473
Query: 86 DEFFSTAESHKIDLKAVQQE----RDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
D +FS E K D QQE + +L K++ +K DHE R+ LE+ ++K LI
Sbjct: 474 DTYFSRMELSKYDK---QQEVIKIKKSLTKMDKIKLDHERRIDQLEKEVSTLRKKISLIQ 530
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRV 201
N E V+ AI +R +A +WE I +K ++ N P+A I + N + LLL
Sbjct: 531 MNDELVEQAIQLMRAAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDD 590
Query: 202 HKERS-NIKDSWVLHHDNADPFADSSE--------------TKKPSLVDVDLDLSAYANA 246
+E DS + + DP ++++E T V ++L+ S Y N
Sbjct: 591 GEENGLGSDDSSEANGRSDDPSSEANEQPSKGKKSSNKKAATNNRFAVTINLNNSVYGNV 650
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-----QTMTNINKARKVYWFE 301
+ + L++ A +K +KT ST A+K EKK K+ +T+ I K RKVYWFE
Sbjct: 651 EDYQKLRKKAEEKIRKTKISTNFAVKKVEKKKKEKENKQKGKHNKTVGQIQKIRKVYWFE 710
Query: 302 KF--------YWFISSENYLGN-------------------------------------- 315
KF Y I+ + L N
Sbjct: 711 KFHWFISSENYLVIAGRDALQNEILFRRYFQKNDVYVHADIHGASTCIIKNPHKDIPIPE 770
Query: 316 --------------VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
AW+ K++T+AWWV QVSK+APTGE+L TGSF+IRGKKN+ P
Sbjct: 771 KTLSEAGQLAICRSSAWNNKIITSAWWVHYHQVSKSAPTGEYLKTGSFVIRGKKNYLPHV 830
Query: 362 QLAMGISFLFKLEESSI 378
+L MG+ +F+++ +++
Sbjct: 831 KLEMGLCIIFQVDNAAL 847
>gi|297736760|emb|CBI25961.3| unnamed protein product [Vitis vinifera]
Length = 321
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/185 (45%), Positives = 105/185 (56%), Gaps = 47/185 (25%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V+VDL LSA+ANA+R+++ K+ KQ+KTI + EKA K+AEKK+ L V
Sbjct: 11 VEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQV-------G 63
Query: 294 ARKVYWFEKFYWFISSENYL--------------------GNV----------------- 316
++WFEKF WFISSENYL G++
Sbjct: 64 EHYIHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLHGASRCFTVCH 123
Query: 317 --AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
AWD+K+VT+AWWV QVSKTA TGE+LT GSFMIRG KNF PP L MG LF L+
Sbjct: 124 SQAWDSKIVTSAWWVYPHQVSKTASTGEYLTVGSFMIRG-KNFLPPHPLMMGFGLLFCLD 182
Query: 375 ESSIS 379
ESS+
Sbjct: 183 ESSLG 187
>gi|322693747|gb|EFY85597.1| serologically defined colon cancer antigen 1 [Metarhizium acridum
CQMa 102]
Length = 1063
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 119/403 (29%), Positives = 184/403 (45%), Gaps = 81/403 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP I P +++ EF+ + VDEFFS+ E K++ + ++E A +K
Sbjct: 295 DFHPFI------PHKLQRDPSIKALEFKGYNQTVDEFFSSLEGQKLETRLNEREAAAKRK 348
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ K D R+ L+ Q ++ KA I N E V A+ A+ +A + W DI V
Sbjct: 349 LDAAKADQAKRIEGLQDAQTLNMRKAAAIEANVEWVQEAMDAVNGLLAQGMDWVDIGKLV 408
Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERS-------NIKDSWVLHHDNADPF 222
++ ++ +PVA II L LN+ N ITL L +E DS + A
Sbjct: 409 EREKKRKNPVADII-VLPLNLAENLITLSLAEEEEEEAEEADPFETDDSDSEDENEASTI 467
Query: 223 ADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAA----KKQQKTIQSTEKALKSAEKK 277
+ SE L V+++L LS ++NA+ +++ +R+A K QQ+ ++ + A + +
Sbjct: 468 SKKSEKPAKGLNVEINLKLSPWSNAREYYEQRRTAVVKEEKTQQQASRALKNAEQKIVED 527
Query: 278 TKQTLKDVQTMTN-INKA---RKVYWF--EKFYWFISSEN----------YL-------- 313
K+ LK + + I K K WF Y + ++ YL
Sbjct: 528 LKKGLKQEKALLQPIRKQLWFEKFLWFISSDGYLVLGGKDAQQNEILYKRYLRKGDVYCH 587
Query: 314 ------------------------------GNV------AWDAKVVTNAWWVKADQVSKT 337
GN+ AWD K AWWVKADQVSK+
Sbjct: 588 ADLRGAPSVIIKNNPSTPDAPIPPATLAQAGNLSVCASEAWDQKAGMGAWWVKADQVSKS 647
Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
P G+FL TG+FM+RG+KNF P QL +G+ +FK+ E S +R
Sbjct: 648 GPAGDFLPTGNFMVRGQKNFLAPAQLLLGLGIMFKISEESKAR 690
>gi|322712137|gb|EFZ03710.1| DUF814 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 959
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 120/403 (29%), Positives = 184/403 (45%), Gaps = 81/403 (20%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP I P ++ + EF+ + VDEFFS+ E K++ + ++E A +K
Sbjct: 190 DFHPFI------PHKLQGDPSIKVLEFKGYNRTVDEFFSSLEGQKLETRLNEREAAAKRK 243
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ K D R+ L+ Q ++ KA I N E V A+ A+ +A + W DI V
Sbjct: 244 LDATKADQAKRIEGLQDAQTLNMRKAAAIEANVEWVQEAMDAVNGLLAQGMDWVDIGKLV 303
Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERS-------NIKDSWVLHHDNADPF 222
++ ++ + VA II L LN+ N ITL L +E DS + A
Sbjct: 304 EREKKRKNAVAGII-VLPLNLAENLITLSLAEEEEEEEEEADPFETDDSDSEDENEASTI 362
Query: 223 ADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAA----KKQQKTIQSTEKALKSAEKK 277
+ SE L V+++L LS ++NA+ +++ +R+A K QQ+ ++ + A + +
Sbjct: 363 SKKSEKPVRGLNVEINLKLSPWSNAREYYEQRRTAVVKEEKTQQQASRALKNAEQKIAED 422
Query: 278 TKQTLKDVQTMTN-INKA---RKVYWF--EKFYWFISSEN----------YL-------- 313
K+ LK + + I K K WF Y + ++ YL
Sbjct: 423 LKKGLKQEKALLQPIRKQLWFEKFLWFISSDGYLVLGGKDAQQNEILYKRYLRKGDVYCH 482
Query: 314 ------------------------------GNV------AWDAKVVTNAWWVKADQVSKT 337
GN+ AWD K AWWVKADQVSK+
Sbjct: 483 ADLRGAPSVIIKNNPSTPDAPIPPATLAQAGNLSVCASEAWDQKAGMGAWWVKADQVSKS 542
Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
AP G+FL TGSFM+RG+KNF P QL +G+ +FK+ E S +R
Sbjct: 543 APAGDFLPTGSFMVRGQKNFLAPAQLLLGLGIMFKISEESKAR 585
>gi|325093107|gb|EGC46417.1| DUF814 domain-containing protein [Ajellomyces capsulatus H88]
Length = 1136
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 167/328 (50%), Gaps = 40/328 (12%)
Query: 20 VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
V+ N+N S + GY+V K E + + + VA ++FHP
Sbjct: 266 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPF------E 319
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P G I F++F+ AVDE+FS+ ES K++ + ++E A +KLE ++D + R+
Sbjct: 320 PKQFESEPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRV 379
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L++ Q + KA+ I N V+ AI A+ IA + W +I ++ Q +PVA
Sbjct: 380 GVLKEAQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAK 439
Query: 184 IIK-QLKLNINHITLLLRVHKE--------------RSNIKDSWVLHHDNADPFADSSET 228
+IK LKL N +TLLL E + D+ S +T
Sbjct: 440 VIKLPLKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKT 499
Query: 229 KKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-- 285
++P L +D+DL +S +ANA+++++ K++AA K++KT+ ST+ A+KS EKK LK
Sbjct: 500 RQPLLSIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALK 559
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL 313
Q + R +WFEKF +F+SS+ YL
Sbjct: 560 QEKPVLRPTRTPFWFEKFIFFLSSDGYL 587
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/64 (53%), Positives = 44/64 (68%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AWD+K V AWWV ADQVSKT P GE+L TG F+I G+KN P QL +G + +F++
Sbjct: 672 STAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFVICGEKNHLSPAQLLLGFAVMFQIS 731
Query: 375 ESSI 378
SI
Sbjct: 732 GESI 735
>gi|440634980|gb|ELR04899.1| hypothetical protein GMDG_00158 [Geomyces destructans 20631-21]
Length = 1072
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 106/329 (32%), Positives = 166/329 (50%), Gaps = 38/329 (11%)
Query: 31 GYLVQKKEKRPN--------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
GY++ KK++R + +E + +FHP P FE F
Sbjct: 264 GYIIAKKQERTDKVASRDEETERQALLYEDFHPF------KPRQFENDPACTFVPFEGFN 317
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
VDEFFS+ E +++ + ++E A KKL+ K D + RL L++ Q +++ KA I
Sbjct: 318 NTVDEFFSSIEGQRLESRLYEREVTAKKKLQAAKDDQQKRLGGLQEIQTLNERKAGAIET 377
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL-- 199
N + V A A+ IA + W +I + Q+ +PVASIIK LKL+ N +TLLL
Sbjct: 378 NVQRVQEATDAVNGLIAQGMDWIEIGKLIDIEQKRGNPVASIIKLPLKLHENTVTLLLDE 437
Query: 200 ---------RVHKERSNIKDSWVLHHDNADPFADSSE--TKKPSLVDVDLDLSAYANAKR 248
++ S++ DS D A P ++ + K +D++L S ++NA+
Sbjct: 438 EIFVEDLNDEAYETGSDVSDS----EDEA-PIKEAVKKVVDKRLAIDINLGASPWSNARE 492
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWF 306
++ +RSAA+K++KT++S+ KALKS K +Q LK Q + RK WFEKF WF
Sbjct: 493 YYGQRRSAAEKEKKTLESSTKALKSTSHKIEQDLKKGLKQEKAILRPVRKHMWFEKFMWF 552
Query: 307 ISSENYL---GNVAWDAKVVTNAWWVKAD 332
ISS+ YL G A +++ + K D
Sbjct: 553 ISSDGYLVLGGRDAQQNEILYKRYLRKGD 581
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 40/61 (65%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW++K AWW ADQVSK+APTG++ GSF +RGKKNF PP L +G +F +
Sbjct: 625 AWESKAGMPAWWANADQVSKSAPTGDYFKPGSFDVRGKKNFLPPAPLLLGFGVMFHVSNE 684
Query: 377 S 377
S
Sbjct: 685 S 685
>gi|340975808|gb|EGS22923.1| hypothetical protein CTHT_0014010 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1116
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 125/443 (28%), Positives = 196/443 (44%), Gaps = 90/443 (20%)
Query: 2 VSNEDVSSDIHKCGPTLHVLVSNLNVSS--QGYLVQKKE-----------KRPNSEDYIV 48
+ NED+ + K ++ + S +GY++ K + PN + +
Sbjct: 229 LKNEDLFESLFKALEQGRAILDEITSSPVLKGYIIAKPNPHAQEQASEGGEAPNGKASSL 288
Query: 49 ANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA 108
+F P + P + + F+ F VDEFFS+ E K+ + ++E A
Sbjct: 289 LYEDFQPFL------PKQFEEDPNLEVLTFDGFNKTVDEFFSSLEGQKLQSRLQEREATA 342
Query: 109 LKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIE 168
KKLE ++D R+ L++ Q+++ KA I N E V A+ A+ + + W DI
Sbjct: 343 KKKLEAARQDQAKRIEGLQEAQVLNLRKAAAIEANIERVQEAMDAVNGLLQQGMDWVDIN 402
Query: 169 ARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSE 227
V++ Q+ ++PVA IIK ++L+ N ITLLL +E + +D + A+ +
Sbjct: 403 KLVEREQKLHNPVAEIIKLPMRLHENIITLLLGEEEEEGPEDEEMDFEYDTDEEAANDPQ 462
Query: 228 TKKPS------LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
+K VD++L LS + NA+ +++ KRSAA K QKTIQ E ALK+AE K +
Sbjct: 463 PEKAKGPDKRLAVDINLKLSPWNNAREYYEQKRSAADKAQKTIQQAEIALKNAEMKIAKD 522
Query: 282 L-KDV-QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK------------------ 321
L KD+ Q + R+ WFEKF WFISS+ YL DA+
Sbjct: 523 LKKDLKQEKPILQPIRQQLWFEKFIWFISSDGYLVLGGRDAQQNEILYKRYFKKGDVFVH 582
Query: 322 -----------------------------------VVTNAWWVKA---------DQVSKT 337
++AW KA D+VSK
Sbjct: 583 SDVKGAATVIIKNDPKTPDAPIPPATLTQAGCLSVCCSSAWDSKAAMGAWWVTADKVSKL 642
Query: 338 APTGEFLTTGSFMIRGKKNFFPP 360
PTG+ + G+FMI G++N P
Sbjct: 643 GPTGDPMPEGTFMINGERNPLEP 665
>gi|389585510|dbj|GAB68240.1| hypothetical protein PCYB_131140 [Plasmodium cynomolgi strain B]
Length = 1898
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 122/422 (28%), Positives = 192/422 (45%), Gaps = 92/422 (21%)
Query: 37 KEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHK 96
KE+ N E ++A EF P+IL + + F+ F VD +FS E K
Sbjct: 453 KEEDKNGE--VIAFTEFSPIILNNHKNKV---EENKLEVINFDDFNKCVDTYFSRMELSK 507
Query: 97 IDLKAVQQE----RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
D QQE + +L K++ +K DHE R+ LE+ K+K LI N E V+ AI
Sbjct: 508 YDK---QQEVIKIKKSLTKMDKIKLDHERRIEQLEKEVSSLKKKISLIQMNDELVEQAIQ 564
Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL---RVHKERSNIK 209
+R +A +WE I +K ++ N P+A I + N + LLL ++ S+
Sbjct: 565 LMRAAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDDEEATEQGSDDL 624
Query: 210 DSWVLHHDNADPFADSSE-------------TKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
S + DP ++++E T+ V ++L+ S Y N + + L++ A
Sbjct: 625 SSEANEQGSDDPSSEANEQQSKGKASNREVATRSRFAVTINLNNSVYGNVEDYQKLRKKA 684
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF--------YWFIS 308
+K +KT ST A+K EKK K+ + + +T ++VYWFEKF Y I+
Sbjct: 685 EEKIRKTKISTNFAVKKVEKKKKKKINRRENITR----QRVYWFEKFHWFISSENYLVIA 740
Query: 309 SENYLGN----------------------------------------------------V 316
+ L N
Sbjct: 741 GRDALQNEILFRRYFQKNDIYVHADIHGASTCIIKNPHKDIPIPEKTLSEAGQLAICRSS 800
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+ K++T+AWWV QVSK+AP GE+L TGSF+IRGKKN+ P +L MG+ +F+++ +
Sbjct: 801 AWNNKIITSAWWVHYHQVSKSAPAGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNA 860
Query: 377 SI 378
++
Sbjct: 861 AL 862
>gi|258574555|ref|XP_002541459.1| predicted protein [Uncinocarpus reesii 1704]
gi|237901725|gb|EEP76126.1| predicted protein [Uncinocarpus reesii 1704]
Length = 1070
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 133/432 (30%), Positives = 211/432 (48%), Gaps = 85/432 (19%)
Query: 19 HVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEF 78
+++ N N S+ Q E +P+ YI ++HP P + I
Sbjct: 244 YIITRNENKPSEPP-TQGTETKPDKSSYI----DYHPF------EPKQFADNPDTRILPL 292
Query: 79 ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
ESF AVDE++S+ E+ K++ + +E +KLE KRDHE R+ AL++ Q ++ KA+
Sbjct: 293 ESFNKAVDEYYSSVEAQKLESRLTDREETMKRKLEATKRDHEKRVGALKEVQQLNVRKAQ 352
Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITL 197
I N V+ AI A IA + W +I ++ Q +P+A +IK LKL N IT+
Sbjct: 353 AIEANLSKVEEAINAANSLIAQGMDWVEIARLIEMEQSRRNPIAKMIKLPLKLYENTITI 412
Query: 198 LL--RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLK 253
LL + + + +S ++ ++++P + +D+DL L+ +ANA +++D K
Sbjct: 413 LLPDGMPVDDESESESEDEDEEDESGDEPEKKSREPEVLSIDIDLALTPWANASQYYDQK 472
Query: 254 RSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
++AA K+ KTI++++KALKSAEKK KQ LK Q + AR +WFEKF++FISS
Sbjct: 473 KTAAMKEDKTIKASKKALKSAEKKVTADLKQGLK--QEKPVLRPARTPFWFEKFFFFISS 530
Query: 310 ENYL--------------------GNV-------------------AWDAKV-------- 322
+ YL G+V A+D +
Sbjct: 531 DGYLVLGGQDARQDEILYHRHLQKGDVYVHTDTEGAMPMIIKNKPGAFDDPIPPGTLAQA 590
Query: 323 ------VTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
+ AW KA +QVS+T TGE+L T S +I G+KN P QL +G
Sbjct: 591 GTFTVATSRAWDTKALLGAWWVKAEQVSRTTATGEYLPT-SVVISGEKNHLAPGQLILGF 649
Query: 368 SFLFKLEESSIS 379
+ LF++ S++
Sbjct: 650 AVLFQISPESVA 661
>gi|118350963|ref|XP_001008760.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89290527|gb|EAR88515.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 1213
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 178/378 (47%), Gaps = 83/378 (21%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
+P + H+G E SF A+VD++F + KI + E A KK EN+K D R
Sbjct: 372 SPLYLTCHEGKKFIENNSFNASVDKYFQVM-AQKIQEEQNDVESIAWKKYENIKNDQLNR 430
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L+ Q KA+LI N + VDA I I+ ++ SW+ I + + +++ DP+A
Sbjct: 431 IQKLKNEQEEYVVKAQLIEMNIDYVDAIINIIKTLKSSGESWDKITKMINEGKKNGDPMA 490
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I L N I++LL DP D E ++V +D+ SA
Sbjct: 491 YLIHSLDFENNEISVLL-------------------GDPCDDMEEY---TVVAIDIAYSA 528
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARK------ 296
+ NA+ +++ K+ K++KT+ +++ ALK AEK + +++++ N+ RK
Sbjct: 529 HQNARNYYENKKKNIVKEKKTLDASKLALKQAEKTALKEIENLKLKNNVVNTRKQYWFEK 588
Query: 297 VYWF--EKFYWFISSEN----------YL--GNV-------------------------- 316
YWF + Y IS + Y+ G++
Sbjct: 589 FYWFISSENYLVISGRDMQQNEIIVKKYMRKGDIYMHADFHGAASTIIKNPFKDIPVSQQ 648
Query: 317 --------------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
AW+AK++ +AWWV QVSK A TGE+L +GSFMIRGKKNF P +
Sbjct: 649 TIEEAAIATICRSKAWEAKIIASAWWVYDHQVSKRAETGEYLPSGSFMIRGKKNFIYPAR 708
Query: 363 LAMGISFLFKLEESSISR 380
+ MG + L+KL++ + +
Sbjct: 709 MEMGCTLLYKLDDQFVEK 726
>gi|213403135|ref|XP_002172340.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
gi|212000387|gb|EEB06047.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
Length = 1013
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 117/397 (29%), Positives = 178/397 (44%), Gaps = 77/397 (19%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP P + Q+ EF S+ VDEFFS+ ES KI+ + + + K+
Sbjct: 310 DFHPF------KPLQLLQNNNRTCIEFPSYNECVDEFFSSLESQKIEKQNHDRLKTFAKR 363
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
+EN KRD E +L L++ Q + ++KA+ I N + V+ AI + + + W DIE +
Sbjct: 364 IENAKRDVENKLKELQKAQELSEKKAQAIELNPQLVEGAIEYVNSLVGQAMDWLDIEKLI 423
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR--------VHKERSNIKDSWVLHHDNADPF 222
QR AS+I+ L+L N ITL+L E+S + P
Sbjct: 424 TVQQRRQHAFASVIRLPLQLKKNLITLVLPDPNPLAVDEESEQSESESDSEPESTIITPV 483
Query: 223 ADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL 282
K V+VDL L A+ANA+ ++ +R AA K++KTI+S+ KA+K+ +K+ + L
Sbjct: 484 QRRLIQPKGLAVEVDLALGAFANARVHYNNRRLAALKEEKTIESSSKAIKNTQKRAEADL 543
Query: 283 KD--VQTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAWDA 320
K + + +R+ ++FEKF+WFISS+ YL G+V A
Sbjct: 544 KTAAAEAKQALTASRRTFFFEKFHWFISSDGYLVLGGRDNQQRELLYEKYCNKGDVYVSA 603
Query: 321 KV-------------------------------VTNAW---------WVKADQVSKTAPT 340
+ + AW WV VSK T
Sbjct: 604 DLPNSSSVIIKNRNENDPIPPNTLQQAGALALATSKAWDTKTVISAWWVPIHAVSKVDQT 663
Query: 341 GEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
+ L TG F I +KN+ PP L MG L+ L+E S
Sbjct: 664 KQILPTGHFWINEEKNYLPPTNLVMGYGILWFLDEVS 700
>gi|380483775|emb|CCF40411.1| hypothetical protein CH063_10996 [Colletotrichum higginsianum]
Length = 1087
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 150/272 (55%), Gaps = 17/272 (6%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP + PS + EF+ + VDEFFS+ E K++ K ++E A +K
Sbjct: 300 DFHPFL------PSKFANDPTVKVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRK 353
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ + D E R+ L Q ++ +KA I N E V A+ A+ + + W DI +
Sbjct: 354 LDAARSDQEKRIEGLRGAQSINVQKATAIEANVERVQEAMDAVNGLLQQGMDWVDISKLI 413
Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSS--- 226
++ Q+ +PVA IIK L LN+ N ITLLL ++ + + ++ D +D + S
Sbjct: 414 EREQKRRNPVAEIIK-LPLNLAENKITLLLGEEEDIEDDESNYETDSDASDSENEESSNN 472
Query: 227 ---ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL- 282
+ K ++VD+ LS +AN++ + + KRSAAKK +KT+Q ++ ALK+AE+K + L
Sbjct: 473 NKQKNDKRLEIEVDITLSPWANSRGYHEQKRSAAKKAEKTVQQSQMALKNAEQKIQAELK 532
Query: 283 KDVQTMTNI-NKARKVYWFEKFYWFISSENYL 313
K ++T + RK WFEKF WF+SS+ YL
Sbjct: 533 KGLKTEKAVLQPIRKQSWFEKFIWFVSSDGYL 564
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 40/64 (62%), Positives = 50/64 (78%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K AWWV A+QVSK+APTGE+L TGSFM+RG+KNF PP QL +GI +FK+ E
Sbjct: 630 AWDSKAGMGAWWVNANQVSKSAPTGEYLPTGSFMVRGQKNFLPPAQLLLGIGIMFKISEE 689
Query: 377 SISR 380
S +R
Sbjct: 690 SKAR 693
>gi|221059774|ref|XP_002260532.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193810606|emb|CAQ42504.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 2040
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 122/417 (29%), Positives = 186/417 (44%), Gaps = 79/417 (18%)
Query: 31 GYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS 90
G V K+E++ E + EF P+IL + I F+ F VD +FS
Sbjct: 439 GKGVVKEEEKSGEE---ITFTEFSPIILNNHKNKV---EENKLEIIHFDDFNKCVDSYFS 492
Query: 91 TAESHKIDLKAVQQE----RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQES 146
E K D QQE + +L K++ +K DHE R+ LE+ ++K LI N E
Sbjct: 493 RMELSKYDK---QQEVIKIKKSLTKMDKIKLDHERRIEQLEKEVSSLRKKISLIQMNDEL 549
Query: 147 VDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERS 206
V+ AI +R +A +WE I +K ++ N P+A I + N + LLL +E
Sbjct: 550 VEQAIQLMRAAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDDGEENE 609
Query: 207 NIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS 266
D D P + K + V ++L+ S Y N + + +++ A +K +KT S
Sbjct: 610 EGSDDSSREADEESPKRATGRESKLA-VTINLNNSVYGNVEDYQKMRKKAEEKIRKTKIS 668
Query: 267 TEKALKSAEKKTKQTLKDV-----QTMTNINKARKVY------WF--EKFYWFISSENYL 313
T A+K EKK K+ +T+ I K RKVY WF + Y I+ + L
Sbjct: 669 TNFAVKKVEKKKKEKENKQKGKHNKTVGQIQKLRKVYWFEKFHWFISSENYLVIAGRDAL 728
Query: 314 GN----------------------------------------------------VAWDAK 321
N AW+ K
Sbjct: 729 QNEILFRRYFQKNDVYVHADIHGASTCIIKNPYKDIPIPEKTLSEAGQLAICRSSAWNNK 788
Query: 322 VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
++T+AWWV QVSK+APTGE+L TGSF+IRGKKN+ P +L MG+ +F+++ +++
Sbjct: 789 IITSAWWVHYHQVSKSAPTGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAAV 845
>gi|346976277|gb|EGY19729.1| DUF814 domain-containing protein [Verticillium dahliae VdLs.17]
Length = 1086
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 160/309 (51%), Gaps = 30/309 (9%)
Query: 26 NVSSQGYLVQK-------KEKRPNS------EDYIVANMEFHPMILQP-GSTPSVMRQHQ 71
+ ++ GY++ K E++P+ ED + + FHP + Q PSV
Sbjct: 309 SATTTGYVIAKYRQKSEETEEKPDDGAETKREDLLYDD--FHPFLPQKFADDPSVK---- 362
Query: 72 GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQL 131
+ F+ F VDEFFS+ E K++ K ++E A KKLE ++DH R+ L++ Q
Sbjct: 363 ---VLTFDGFNKTVDEFFSSLEGQKLESKLTEREAAAKKKLEATRQDHAQRIEGLQEAQS 419
Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKL 190
++++KA I N E V A+ A+ + + W +I +++ Q+ +PVA IK KL
Sbjct: 420 LNEQKAAAIEANVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRRNPVAETIKLPRKL 479
Query: 191 NINHITLLLR---VHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANA 246
N +TLLL V E + D ++ L ++++L LS +ANA
Sbjct: 480 GENLMTLLLGTEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEINLGLSPWANA 539
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFY 304
+ ++D +R+AA K+QKT+Q + AL++AEKK + LK Q + RK WFEKF
Sbjct: 540 REYYDQRRTAAVKEQKTVQHSTMALRNAEKKITEDLKKGLKQEKAVLQPIRKQMWFEKFI 599
Query: 305 WFISSENYL 313
WFISS+ YL
Sbjct: 600 WFISSDGYL 608
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 36/61 (59%), Positives = 47/61 (77%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K AWWV+ADQVSK+APTGE+L GSFM+RG+KNF PP L +G+ +F++ E
Sbjct: 674 AWDSKAGMGAWWVRADQVSKSAPTGEYLPAGSFMVRGQKNFLPPAPLVLGLGIMFRISEE 733
Query: 377 S 377
S
Sbjct: 734 S 734
>gi|310791286|gb|EFQ26815.1| hypothetical protein GLRG_02635 [Glomerella graminicola M1.001]
Length = 1073
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 150/272 (55%), Gaps = 17/272 (6%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP + P + EF+ + VDEFFS+ E K++ K ++E A +K
Sbjct: 299 DFHPFL------PKKFADDPTVKVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRK 352
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L+ + D E R+ L Q ++ +KA I N E V A+ A+ + + W DI +
Sbjct: 353 LDAARSDQEKRIEGLRGAQSINVQKATAIEANVERVQEAMDAMNGLLQQGMDWVDISKLI 412
Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADP------FA 223
++ Q+ ++PVA IIK L LN+ N ITLLL ++ + + ++ D +D +
Sbjct: 413 EREQKRHNPVAEIIK-LPLNLAENTITLLLGEEEDIEDDESNYETDSDASDSEDEDNGNS 471
Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL- 282
+ ++ K VDV++ LS +AN++ + + KRSAAKK +KT+Q + ALK+AE+K + L
Sbjct: 472 NKQKSDKRLEVDVNIALSPWANSREYHEQKRSAAKKAEKTVQQSVIALKNAEQKIQAELK 531
Query: 283 KDVQTMTNI-NKARKVYWFEKFYWFISSENYL 313
K ++T + RK WFEKF WF+SS+ YL
Sbjct: 532 KGLKTEKAVLQPIRKQIWFEKFIWFVSSDGYL 563
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 41/64 (64%), Positives = 50/64 (78%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K AWWV ADQVSK+APTGE+L TGSFM+RG+KNF PP QL +GI +FK+ E
Sbjct: 629 AWDSKAGMGAWWVNADQVSKSAPTGEYLPTGSFMVRGQKNFLPPAQLLLGIGIMFKISEE 688
Query: 377 SISR 380
S +R
Sbjct: 689 SKAR 692
>gi|307109165|gb|EFN57403.1| hypothetical protein CHLNCDRAFT_57209 [Chlorella variabilis]
Length = 1158
Score = 135 bits (339), Expect = 4e-29, Method: Composition-based stats.
Identities = 95/233 (40%), Positives = 137/233 (58%), Gaps = 19/233 (8%)
Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
+E+ A+ KLE ++RDHE RL +L Q + KA LI N E+VDAA+ A+R+ +A+ +
Sbjct: 335 KEKAAVGKLEAIRRDHEKRLGSLGQEAEAAELKAALIEYNLEAVDAALNAVREALASGMD 394
Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHK---------ERSNIKDSWVL 214
W D+ VK+ +R +PVA +I L+L + +TLLLR + +W+
Sbjct: 395 WRDLARMVKEERRAGNPVAGLIDSLQLERSRVTLLLRRARVCAWGGGGVAGGVRGGNWLD 454
Query: 215 HHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS-----AAKKQQKTIQSTEK 269
D + A TK V+VDL LSA+ANA+ ++D +R A KQQKT+ + +K
Sbjct: 455 EEDGDEEAATRPATK----VEVDLGLSAHANARTYYDSRRKHQARGAGVKQQKTLDANQK 510
Query: 270 ALKSAEKKTKQTLKDVQTMTNINK-ARKVYWFEKFYWFISSENYLGNVAWDAK 321
ALK+AEKK +Q LK V++ RK +WFEKF+WFISSENYL DA+
Sbjct: 511 ALKAAEKKAQQQLKQVRSAAAAPAITRKPFWFEKFFWFISSENYLVLSGRDAQ 563
Score = 57.4 bits (137), Expect = 1e-05, Method: Composition-based stats.
Identities = 29/62 (46%), Positives = 36/62 (58%), Gaps = 17/62 (27%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWDAK+VT+AWWV +QVSKTAP+GE+L L MG ++F L E
Sbjct: 619 AWDAKIVTSAWWVHPEQVSKTAPSGEYLP-----------------LVMGYGYMFGLAEE 661
Query: 377 SI 378
SI
Sbjct: 662 SI 663
>gi|400593352|gb|EJP61303.1| DUF814 domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 1062
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 90/274 (32%), Positives = 148/274 (54%), Gaps = 19/274 (6%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP + P+ ++ I FE + VDEFFS+ E +++ + +++E A +K
Sbjct: 295 DFHPFV------PTKFEKNDDIEILRFEGYNRTVDEFFSSLEGQRLESRLMEREAAAQRK 348
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
++ ++D E R+ L+ QL + KA I N E V A+ +I + + W DI V
Sbjct: 349 IDAARQDQENRIRGLQTAQLDNFRKAAAIEANIERVQEAMDSINGLLNQGMDWVDIGKLV 408
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERS-NIKDSWVLHHDNAD-PFADSSETK 229
+ Q+ N+PVA++I L LN+ + +R+ +E +D D++D F D +T
Sbjct: 409 AREQKKNNPVATLI-CLPLNLVDNVISVRLSEEDDVASEDEEPYETDDSDVRFEDDLDTT 467
Query: 230 KPSL--------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
+ L V++ L+LS ++NA+ ++D +++A K++KT +KA+KS E K KQ
Sbjct: 468 ESGLKNSDKTIVVELTLNLSPWSNARGYYDQRKNAVVKEEKTQLQADKAIKSTEHKVKQD 527
Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
LK V Q + R WFEKFYWFISS+ YL
Sbjct: 528 LKKVLKQEKALLQPIRNPMWFEKFYWFISSDGYL 561
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 33/61 (54%), Positives = 43/61 (70%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K AWWV ++QVSK+ TG+ L G+F I G+KNF PP QL +G+S LFK+ E
Sbjct: 627 AWDSKAGIGAWWVNSNQVSKSTSTGDILQPGNFNISGEKNFLPPGQLVLGLSVLFKISEE 686
Query: 377 S 377
S
Sbjct: 687 S 687
>gi|403374308|gb|EJY87098.1| DUF3441 multi-domain protein [Oxytricha trifallax]
Length = 1126
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 171/378 (45%), Gaps = 73/378 (19%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P + Q P E+ SF VDE+FS + + K +E + KK+ +K D R+
Sbjct: 354 PIPLAQFASDPCLEYASFDQCVDEYFSQLDKQREQSKYSNKEDEIWKKMSRIKDDQAKRI 413
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L++ Q + + KA+L+ V A I ++ + +SW DI+ VK+ ++ +P+A
Sbjct: 414 QGLQKEQDLSEFKAQLLQKYIYEVQALIDILQVMQTSGISWNDIQRMVKEEKKAGNPLAD 473
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHH--DNADPFADSSETKKPSLVDVDLDLS 241
+I ++ N +TL+L E + ++ + + +N DP VDVDL +S
Sbjct: 474 LIYKMNFEKNSVTLMLDACNE-EDAENEFAVDEKFENFDPVV---------RVDVDLHIS 523
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT-KQTLKDVQTMTNINKARKVYWF 300
A N +++F++K+ + +K+ KT + + A K AE K+ +K QT I++ RKVYWF
Sbjct: 524 AQMNIRKYFEIKKKSYEKEVKTKTAADIAFKDAETNALKEIVKHRQTQ-KIDRMRKVYWF 582
Query: 301 EKFYWFISSENYL---GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLT------------ 345
EKF WFISSENYL G A +V+ + K D T G +T
Sbjct: 583 EKFDWFISSENYLCISGKNAQLNEVLVKRYMDKGDLFMHTDMPGAAVTIIKNPSGLIVPP 642
Query: 346 --------------------------------------TGSF------MIRGKKNFFPPC 361
TG + MIRGK+N P
Sbjct: 643 ITLNEAAIFELCHSKAWEGKIVTSVYWVHADQVSKTPPTGLYIPTGSFMIRGKRNIMTPS 702
Query: 362 QLAMGISFLFKLEESSIS 379
+L +G + +F L E SI+
Sbjct: 703 KLELGFTIMFTLNEESIA 720
>gi|302509578|ref|XP_003016749.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
gi|291180319|gb|EFE36104.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
Length = 1073
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 153/290 (52%), Gaps = 36/290 (12%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP PS ++ I FE+F +AVD +FS+ E+ K++ + ++E A KK
Sbjct: 254 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 307
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE+ KR+HE R++AL++ Q KA I N V+ A+ A+ +A + W +I +
Sbjct: 308 LESTKREHEKRVNALKEKQEFHVRKARAIETNLPRVEEAMNAVNGLVAQGMDWVEIARLI 367
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR------------------------VHKERS 206
+ Q +PVA IK LKL N IT+LL + +
Sbjct: 368 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEEESEEEEEEDDDGYGDDE 427
Query: 207 NIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
+ S H +A P + E K L +D+DL +S +ANA++++D K+ AA K++KT++
Sbjct: 428 YERPSQKKH--SAKPLKEKKEKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLK 485
Query: 266 STEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
++ KA+KS E+K K LK Q + + R WFEKF++FISS+ YL
Sbjct: 486 ASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 535
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 38/62 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K WWV A QVSK TG+ L G FMI+G+KN PP Q+ +G + LF++
Sbjct: 601 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 660
Query: 377 SI 378
S+
Sbjct: 661 SV 662
>gi|326471330|gb|EGD95339.1| hypothetical protein TESG_02825 [Trichophyton tonsurans CBS 112818]
Length = 1099
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/291 (32%), Positives = 150/291 (51%), Gaps = 35/291 (12%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP PS ++ I FE+F +AVD +FS+ E+ K++ + ++E A KK
Sbjct: 277 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 330
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE+ KR+HE R++AL++ Q KA I N V+ A+ A+ +A + W +I +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIEINLPRVEEAMNAVNGLVAQGMDWVEIARLI 390
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDN------------ 218
+ Q +PVA IK LKL N IT+LL + ++ +
Sbjct: 391 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEEESEEEEEEEEEDDDGYG 450
Query: 219 -------------ADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
P + E K L +D+DL +S +ANA++++D K+ AA K++KT+
Sbjct: 451 DDEYERPSQKKQLTKPLKEKKEMKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTL 510
Query: 265 QSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
+++ KA+KS E+K K LK Q + + R WFEKF++FISS+ YL
Sbjct: 511 KASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 561
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 38/62 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K WWV A QVSK TG+ L G FMI+G+KN PP Q+ +G + LF++
Sbjct: 627 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 686
Query: 377 SI 378
S+
Sbjct: 687 SV 688
>gi|320589532|gb|EFX01993.1| duf814 domain containing protein [Grosmannia clavigera kw1407]
Length = 1969
Score = 132 bits (332), Expect = 3e-28, Method: Composition-based stats.
Identities = 103/299 (34%), Positives = 154/299 (51%), Gaps = 22/299 (7%)
Query: 28 SSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGST--------PSVMRQHQGFP---IQ 76
+ Q Y+V KE +++ E I P ST P RQ + P +
Sbjct: 1183 TCQSYIVATKES-ADTDASAATTEEGGSAIPVPASTKLDYVDFHPFKPRQFEADPKCVLL 1241
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
F+SF A DEF+S + K D + QQE A KKLE +RD R+ +L++TQ ++ K
Sbjct: 1242 PFDSFNKAADEFYSHLQGLKADRQLHQQESVAFKKLEATRRDQAMRIESLQETQQLNTRK 1301
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
A I NQE V AAI A+ + WED+ ++ + N PVA++IK L + +
Sbjct: 1302 AAAIEANQEWVQAAIDAVNDQLHVGTDWEDLAHLIENSADSN-PVAALIK-LPMRLADGI 1359
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
+ L++ E + D + D + +E ++ V+V L LSA+ NA+ ++D KR A
Sbjct: 1360 ITLQLSDEPAADFDE------DFDEDEEEAEEEELLDVNVKLALSAWGNAREYYDQKRVA 1413
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQT--MTNINKARKVYWFEKFYWFISSENYL 313
A K+QKT + T AL++AEKK + LK VQ R+ WFEKF WF+SS+ +L
Sbjct: 1414 ASKEQKTKEVTSMALRNAEKKVAEELKRVQKGGKPAPQLIRRQLWFEKFLWFVSSDGHL 1472
Score = 60.5 bits (145), Expect = 1e-06, Method: Composition-based stats.
Identities = 28/60 (46%), Positives = 37/60 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K A+WV A+QV KT +G+ L GSF IRG+KN PP Q +G LF++ +
Sbjct: 1540 AWDNKAGFGAYWVHANQVFKTTASGDVLPLGSFDIRGEKNHLPPPQRVLGFGLLFQISNA 1599
>gi|326479424|gb|EGE03434.1| DUF814 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 979
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 94/291 (32%), Positives = 150/291 (51%), Gaps = 35/291 (12%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP PS ++ I FE+F +AVD +FS+ E+ K++ + ++E A KK
Sbjct: 277 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 330
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE+ KR+HE R++AL++ Q KA I N V+ A+ A+ +A + W +I +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIEINLPRVEEAMNAVNGLVAQGMDWVEIARLI 390
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDN------------ 218
+ Q +PVA IK LKL N IT+LL + ++ +
Sbjct: 391 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEEESEEEEEEEEEDDDGYG 450
Query: 219 -------------ADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
P + E K L +D+DL +S +ANA++++D K+ AA K++KT+
Sbjct: 451 DDEYERPSQKKQLTKPLKEKKEMKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTL 510
Query: 265 QSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
+++ KA+KS E+K K LK Q + + R WFEKF++FISS+ YL
Sbjct: 511 KASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 561
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 38/62 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K WWV A QVSK TG+ L G FMI+G+KN PP Q+ +G + LF++
Sbjct: 627 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 686
Query: 377 SI 378
S+
Sbjct: 687 SV 688
>gi|374109900|gb|AEY98805.1| FAGL017Wp [Ashbya gossypii FDAG1]
Length = 1006
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 121/442 (27%), Positives = 196/442 (44%), Gaps = 106/442 (23%)
Query: 31 GYLVQKK----EKRPNSEDYIVANMEFHPM--ILQPGSTPSVMRQHQGFPIQEFES-FAA 83
GY+V ++ +++ ++E + +FHP L GS + F I E + +
Sbjct: 273 GYIVARENPLYDEKKDAEGIRLTYEQFHPFRPYLPDGS-------QKKFEIVEVDGDYNR 325
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
VD+FFST +S K L+ QE++A KKLE K +++ ++ L + Q ++++ IINN
Sbjct: 326 TVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQELVEVQHTNEQRGNAIINN 385
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
E V+ A AI+ + Q+ W IE +K Q ++ +A +IK L L N I++ L +
Sbjct: 386 IELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKISVELPLS 445
Query: 203 KERSNIKD-SW----------------------------VLHHDNADPFADSSETKKPSL 233
E D SW V + + S + K
Sbjct: 446 NEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKVSNKSIT 505
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--I 291
V +DL +SAYANA +F++K++ AKKQ Q+ +KA+K+ E+K ++ LK + +
Sbjct: 506 VSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKKKLKEQHDVL 565
Query: 292 NKARKVYWFEKFYWFISSENYL-----------------------------GNVAW---- 318
R Y+FEK++WFIS+E +L G+ W
Sbjct: 566 QVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVSNGFGSQVWIKNF 625
Query: 319 ------------------------DAKVVTNAWWVKADQVSKTAPT-GEFLTTGSFMIRG 353
KV T+ WW A +SK G L +G+F ++
Sbjct: 626 ERTEIPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVGGGLLPSGAFRLKS 685
Query: 354 --KKNFFPPCQLAMGISFLFKL 373
KNF PP QL MG +F++K+
Sbjct: 686 DEAKNFLPPAQLVMGFAFMWKI 707
>gi|302917991|ref|XP_003052561.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
77-13-4]
gi|256733501|gb|EEU46848.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
77-13-4]
Length = 1072
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 107/330 (32%), Positives = 171/330 (51%), Gaps = 25/330 (7%)
Query: 2 VSNEDVSSDIHKCGPTLHVLVSNLNVSSQ--GYL-VQKKEKRPNSEDYIVANME------ 52
++NE + D+ K V L S + GY+ +++EK +ED E
Sbjct: 234 LANETLLDDLVKSLTEARKTVDELTSSGECTGYIFAKRREKLEGAEDDDKPKRENLLYED 293
Query: 53 FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
FHP + P + + + EF+ + VDEFFS+ E K++ + ++E A +KL
Sbjct: 294 FHPFV------PQKLSKDPTIEVLEFKGYNETVDEFFSSLEGQKLESRLTEREAAAKRKL 347
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ K++ R+ L++ Q ++ KA I N E V A+ A+ ++ + W D+ V+
Sbjct: 348 DAAKQEQAKRIEGLQEAQNLNFRKAAAIEANVERVQEAMDAVNGLLSQGMDWVDVGKLVE 407
Query: 173 QAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK- 229
+ ++ ++PVA IIK L LN+ N ITL L + D + D + DS+ TK
Sbjct: 408 REKKRHNPVAEIIK-LPLNLAENLITLELAEEEFEPEEDDPYETDDDESADEEDSTPTKG 466
Query: 230 ----KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
K V+++L LS ++NA+ +FD ++SAA K++KT Q +ALK+AE+K Q LK
Sbjct: 467 KHASKALSVEINLGLSPWSNAREYFDQRKSAAVKKEKTEQQASRALKNAEQKITQDLKKG 526
Query: 286 --QTMTNINKARKVYWFEKFYWFISSENYL 313
Q + RK WFEKF WFISS+ YL
Sbjct: 527 LKQEKALLQPIRKQLWFEKFIWFISSDGYL 556
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 40/61 (65%), Positives = 49/61 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K +AWWV ADQVSK+APTGEFL TGSFM+RGKKNF PP QL +G+ +F++ E
Sbjct: 622 AWDSKAGMSAWWVNADQVSKSAPTGEFLPTGSFMVRGKKNFLPPAQLLLGLGLVFRISEE 681
Query: 377 S 377
S
Sbjct: 682 S 682
>gi|315050252|ref|XP_003174500.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
gi|311339815|gb|EFQ99017.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
Length = 1093
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 98/295 (33%), Positives = 150/295 (50%), Gaps = 45/295 (15%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP PS + FESF +AVD++FS+ E+ K++ + ++E A KK
Sbjct: 277 DFHPF------EPSQTKDLPNTTTLRFESFNSAVDKYFSSIEARKLESRLTEKEDAARKK 330
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE+ KR+HE R++AL++ Q KA I N V+ A+ A+ +A + W +I +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIETNLLQVEEAMTAVNGLVAQGMDWVEIARLI 390
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR-----------------------------V 201
+ Q +PVA IK LKL N IT+LL
Sbjct: 391 EMEQGKRNPVALSIKLPLKLYENTITVLLNEEVAEEEEEEESDESDEEEDEDDDDGYGDD 450
Query: 202 HKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQ 260
ER K V +P + E K L +D+DL +S +ANA++++D K+ AA K+
Sbjct: 451 EYERPKQKKRLV------NPQREKKEKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKE 504
Query: 261 QKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
+KT++++ KA+KS E+K K LK Q + + R WFEKF++FISS+ YL
Sbjct: 505 EKTLKASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 559
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 38/62 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K WWV A QVSK TG+ L G FMI+G+KN PP Q+ +G + LF++
Sbjct: 625 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISSQ 684
Query: 377 SI 378
SI
Sbjct: 685 SI 686
>gi|190345457|gb|EDK37344.2| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
6260]
Length = 873
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 124/455 (27%), Positives = 194/455 (42%), Gaps = 122/455 (26%)
Query: 21 LVSNLNVSSQGYLVQKK----EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
LVS+L G +V KK E +++D EFHP +P + F
Sbjct: 127 LVSSLGTEVTGIIVSKKNPAFEPSDDNKDLEYLYDEFHP--FKP-----YKENLESFKFT 179
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E + VD FFST +S K +L+ QQ+ +A K+L N + + + ++ L Q ++ +K
Sbjct: 180 EIRGYNKTVDTFFSTLDSKKHELRMEQQKHNAKKRLLNAREERDKQIDNLRIQQEMNSKK 239
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
+ II + + V I +++ + Q+ W +IE+ +K Q + VA IK L LN+
Sbjct: 240 GDAIIYHADLVSECIASVQTLLDQQMDWANIESLIKLEQSRGNSVAKTIK-LPLNLTENK 298
Query: 197 LLLRVHKERSNIKDSWVLHHDNAD--------------------------------PFAD 224
+ L++ S +D AD P
Sbjct: 299 IGLKLPDTDS--------MYDPADIDSESDSETSSESETESESESESGSESEDETPPKRM 350
Query: 225 SSETKK---PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
S + K P+L V +DL LS +ANA+ +F+ K+ A KQ+K ++T+ AL++A+KK +Q
Sbjct: 351 SKKAKSKEIPALSVWIDLSLSPFANARTYFESKKQAESKQEKVEKNTDMALRNAQKKIEQ 410
Query: 281 TLKDVQTMTNIN----KARKVYWFEKFYWFISSENYL----------------------- 313
L + + N N + R YWFEKF+WF+SSE YL
Sbjct: 411 DL--AKNLKNENETLRQVRPKYWFEKFFWFVSSEGYLCIAGRDDAQVDMIYYRHFSDNDF 468
Query: 314 ---GNVAWDAKVV-------------------------TNAW--------WVKA-DQVSK 336
++ KVV + AW W A + V+K
Sbjct: 469 FVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAMSASAAWNGKITTSPWFLAGNDVTK 528
Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
G + +G+F +GKK F PP QL MG+ F F
Sbjct: 529 LDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYF 563
>gi|219109751|ref|XP_002176629.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411164|gb|EEC51092.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1238
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 169/324 (52%), Gaps = 36/324 (11%)
Query: 5 EDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKE-------KRPNSEDY-----IVANME 52
E + + + + GP + + + + + GY++ + +P+SE+ +VA
Sbjct: 309 ERLQTSLSEQGPAIMYNLHSAAIDTPGYILYQPRVEEDIVNGKPHSENLSSAVAVVAKEL 368
Query: 53 FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
H + P ++ QHQ P E++ F AAV +FF+ + K LK E +KL
Sbjct: 369 AHADKVLLEFQPHLLAQHQNCPRLEYKHFGAAVADFFAHMVAQKRLLKVQASEMAVQEKL 428
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
V++D R+ ALE+ Q + A+++ NN E+VD A+L I + + + W+ + V
Sbjct: 429 RKVQQDQADRVMALERDQQTLQAYAQVVKNNAENVDKALLVINSALDSGMDWDQLIELVS 488
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
Q + +P+A++I +L+L N I ++LR+ + DPF + S+
Sbjct: 489 VEQANRNPIANLIVRLELE-NEI-MILRLPR----------------DPFDELSDVLN-- 528
Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
V+V L SA+ANA F R++ +K QKT++S+ KAL++AE+ ++ L + Q T
Sbjct: 529 -VNVSLKDSAHANASALFAKYRASKEKTQKTLESSSKALQAAEESAQRQLIEAQRRTKQT 587
Query: 293 KA---RKVYWFEKFYWFISSENYL 313
A RK W+EKF+WF++S+NYL
Sbjct: 588 VAAVKRKPAWYEKFHWFVTSDNYL 611
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 41/65 (63%), Positives = 55/65 (84%), Gaps = 1/65 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
AW +++VT+AWWV++ QVSKTAP+GEFLT GSFM+RGKKNF PP L MG++ LF+L ++
Sbjct: 683 AWASRMVTSAWWVESHQVSKTAPSGEFLTVGSFMVRGKKNFLPPSPLEMGLAVLFRLGDD 742
Query: 376 SSISR 380
SI+R
Sbjct: 743 DSIAR 747
>gi|385304258|gb|EIF48283.1| tae2-like protein [Dekkera bruxellensis AWRI1499]
Length = 979
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 114/422 (27%), Positives = 202/422 (47%), Gaps = 53/422 (12%)
Query: 5 EDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQK------KEKRPNSEDYIVANMEFHPMIL 58
E V + + + T L+ QG++++K K +SE+ EFHP
Sbjct: 259 EKVVTSLQESENTFKSLLQTPPGKVQGWILRKINPLFDNTKEESSENLKYTYEEFHP--F 316
Query: 59 QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
+P V ++++ + + + VD FF+ E K L QQ+ A K+L+ VK +
Sbjct: 317 EP-----VHKENEDSKVDVVDGYNKTVDTFFTMIELSKASLSRQQQKAAAAKRLQLVKEE 371
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
+E +L+ L+ Q ++++K LI + ++ +I+ + Q+ W++I+ ++ +R
Sbjct: 372 NEKKLAKLDAVQELNRKKGYLITLHSSEIEDCRSSIQALLDQQMDWQNIDKLIEVERRRG 431
Query: 179 DPVASIIKQLKLNINHITLLLRVHKE-----RSNIKDSWVLHHDNADPFADSSETKKPSL 233
+P A +IK L L + T+LL +E + DS + D D +E KK ++
Sbjct: 432 NPTAKMIKSLNLLKHEFTVLLPDEQEVVDDENEDESDSDSDSDSDDDDDDDETEDKKSNI 491
Query: 234 VDVDLDL--SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK-------- 283
+ V +D+ SA+AN+ R+FD K++A +KQ+KT ++ A+K++E K + +K
Sbjct: 492 ISVSIDIRESAFANSTRYFDAKKNAQEKQEKTKENAAIAIKNSEMKIHRDMKRLENESKN 551
Query: 284 ----------------DVQTMTNINKARKVYWFEKF-------YWFISSENYLGNV--AW 318
D ++++K+ KV + F+ + Y AW
Sbjct: 552 TVDIHSIYYRYLDNNTDYLVSSDVDKSLKVVIKNPYKNKEIPPSTFVQAGIYCLTTSKAW 611
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
D+K+ + W+VK D VSK G L G I+G KNF PP QL MGI L+ +E +
Sbjct: 612 DSKMSPSPWFVKGDAVSKKDFDGSLLPPGLLNIKGDKNFLPPSQLVMGIGLLWLPDEKTK 671
Query: 379 SR 380
+R
Sbjct: 672 AR 673
>gi|407039370|gb|EKE39608.1| zinc knuckle domain containing protein [Entamoeba nuttalli P19]
Length = 959
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 164/360 (45%), Gaps = 87/360 (24%)
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
EFESF A+DEF S E + + + ++E KK++ V H+ R L K +
Sbjct: 290 EFESFEKAMDEFHSHIEKQEYEAEVEKKEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNE 349
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP--VASIIKQLKLNINH 194
AE + N + VD I I + ++ WE IE + ++ + NDP +A IK+ N
Sbjct: 350 AEAVEENIQVVDQLIQEINVFLKEKMKWEQIEGII-ESLKENDPTSIAKYIKRFDFK-NE 407
Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
+ +L L H N D + V+V L+ + + N + F+++++
Sbjct: 408 VVVL--------------ELKHTNEDKIIE---------VEVALNKNGFENIRNFYEMRK 444
Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFEKFYWFISSENYL 313
+ K +KT++S + A+K AE K ++ K+ + T+ ++ K RK +WFEKF+WF+SSEN++
Sbjct: 445 NILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRKRFWFEKFHWFLSSENFI 504
Query: 314 ---GNVAWDAKVVTNAW------WVKAD-----------------------QVSKTA--- 338
G A V+ + +V AD Q K A
Sbjct: 505 IISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKGIPGKTIGAPTLEQAGKIAVCR 564
Query: 339 ------------------------PTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
P+GE+LTTGSFMIRGKKN+ PP L GI +F +E
Sbjct: 565 SSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKNYLPPVPLVFGIGIMFAVE 624
>gi|296813237|ref|XP_002846956.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
113480]
gi|238842212|gb|EEQ31874.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
113480]
Length = 1103
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 150/289 (51%), Gaps = 33/289 (11%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP PS + + F++F +AVD++FS+ E+ K++ + ++E A KK
Sbjct: 277 DFHPF------EPSQSKDLPNTTMLRFDTFNSAVDKYFSSIEARKLESRLTEREDAARKK 330
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE KRDHE R++AL++ Q KA I N V+ AI A+ +A + W +I +
Sbjct: 331 LEATKRDHEKRVNALKEKQEFHVRKAHAIEANLPQVEDAINAVNGLVAQGMDWVEIARLI 390
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHD------------- 217
+ Q +PVA IK LKL N IT+LL + +D +
Sbjct: 391 EMEQAKGNPVALCIKLPLKLYENTITILLTEETAETEDEDEESDESEGDDEDEDNDYGDD 450
Query: 218 ----------NADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS 266
A + E K L +D+DL +S +ANA++++D K+ AA K++KT+++
Sbjct: 451 EYERPKHKKMTAKTQKEKKERKDNRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLKA 510
Query: 267 TEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
+ KA+KS EKK K LK Q + +AR WFEKF++FISS+ YL
Sbjct: 511 STKAIKSTEKKVKADLKLALKQEKPVLRRARNPAWFEKFFFFISSDGYL 559
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 38/62 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K WWV A QVSK TG+ L G FMI+G+KN PP Q+ +G + LF+L
Sbjct: 625 AWDTKAAMGGWWVHASQVSKVTSTGDILKAGHFMIKGEKNHLPPGQIVLGFAVLFQLSPQ 684
Query: 377 SI 378
S+
Sbjct: 685 SV 686
>gi|327303108|ref|XP_003236246.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
gi|326461588|gb|EGD87041.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
Length = 1098
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 152/292 (52%), Gaps = 38/292 (13%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP PS ++ I F +F +AVD +FS+ E+ K++ + ++E A KK
Sbjct: 277 DFHPF------EPSQSKEAPNTTILRFGNFNSAVDRYFSSIEAQKLESRLTEKEDAARKK 330
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE+ KR+HE R++AL++ Q KA I N V+ A+ A+ +A + W +I +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIETNLPRVEEAMNAVNGLVAQSMDWVEIARLI 390
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR--------------------------VHKE 204
+ Q +PVA IK LKL N IT+LL + +
Sbjct: 391 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEGGTEDDEEEEEEEEPEEEEEEDDDDGYGD 450
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
+ S H +A P + E K L +D+DL +S +ANA++++D K+ AA K++KT
Sbjct: 451 DEYERPSQKKH--SAKPLKEKKEKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKT 508
Query: 264 IQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
++++ KA+KS E+K K LK Q + + R WFEKF++FISS+ YL
Sbjct: 509 LKASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 560
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 38/62 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K WWV A QVSK TG+ L G FMI+G+KN PP Q+ +G + LF++
Sbjct: 626 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 685
Query: 377 SI 378
S+
Sbjct: 686 SL 687
>gi|67468480|ref|XP_650274.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56466879|gb|EAL44894.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449704977|gb|EMD45123.1| zinc knuckle domain containing protein [Entamoeba histolytica KU27]
Length = 959
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 164/360 (45%), Gaps = 87/360 (24%)
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
EFESF A+DEF S E + + + ++E KK++ V H+ R L K +
Sbjct: 290 EFESFEKAMDEFHSHIEKQEYEAEVEKKEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNE 349
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP--VASIIKQLKLNINH 194
AE + N + VD I I + ++ WE IE + ++ + NDP +A IK+ N
Sbjct: 350 AEAVEENIQVVDQLIQEINVFLKEKMKWEQIEGII-ESLKENDPTSIAKYIKRFDFK-NE 407
Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
+ +L L H N D + V++ L+ + + N + F+++++
Sbjct: 408 VVVL--------------ELKHTNEDKIIE---------VEIALNKNGFENIRNFYEMRK 444
Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFEKFYWFISSENYL 313
+ K +KT++S + A+K AE K ++ K+ + T+ ++ K RK +WFEKF+WF+SSEN++
Sbjct: 445 NILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRKRFWFEKFHWFLSSENFI 504
Query: 314 ---GNVAWDAKVVTNAW------WVKAD-----------------------QVSKTA--- 338
G A V+ + +V AD Q K A
Sbjct: 505 IISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKGIPGKTIGAPTLEQAGKIAVCR 564
Query: 339 ------------------------PTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
P+GE+LTTGSFMIRGKKN+ PP L GI +F +E
Sbjct: 565 SSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKNYLPPVPLVFGIGIMFAVE 624
>gi|340059520|emb|CCC53907.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 1048
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/249 (30%), Positives = 137/249 (55%), Gaps = 6/249 (2%)
Query: 63 TPSVMRQHQGFPIQE--FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP ++ Q+ G + + SF + D FF E KI+ + ++ + K E +RDH+
Sbjct: 305 TPVLLAQYDGENVTKSYLPSFGSVCDTFFLHTEEGKIEQQKEKKTVAVMSKKERCERDHQ 364
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R+ ALE+ +L + K EL+I N E +DAAI I +A+ + W+ + +KQ P
Sbjct: 365 RRIEALERMELENARKGELLIQNAEKIDAAIGLINGALASGIQWDALRRLLKQRHAEGHP 424
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VA ++ +L L+ N++++L+ + + I + + +++ D K P +++VDL
Sbjct: 425 VAYMVHELFLDRNNMSVLVETNDDDDCIDEGGSVSYESK---VDDC-NKPPWVIEVDLSK 480
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+A+ANA +F K++ K +T+ +T +A++ AEKK ++ QT+ +I R W+
Sbjct: 481 TAHANAAAYFSQKKANRAKLDRTVAATAQAMRGAEKKGERMAARHQTVKDIATERHRCWW 540
Query: 301 EKFYWFISS 309
EKF WF +S
Sbjct: 541 EKFNWFRTS 549
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 13/74 (17%)
Query: 301 EKFYWFISSENYLGNVAWDAKVVTNAWWVKADQ-VSKTAPTGEFLTTGSFMIRGKKNFFP 359
E W +S + AW+ K AWWV A Q + TA TG ++ G+K+
Sbjct: 632 EAAAWCVSRSS-----AWEGKFNVGAWWVYASQIIGGTA-------TGCYLFSGEKHHVL 679
Query: 360 PCQLAMGISFLFKL 373
P LA+G LF++
Sbjct: 680 PQPLALGCGLLFRV 693
>gi|255710571|ref|XP_002551569.1| KLTH0A02530p [Lachancea thermotolerans]
gi|238932946|emb|CAR21127.1| KLTH0A02530p [Lachancea thermotolerans CBS 6340]
Length = 1058
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 127/457 (27%), Positives = 197/457 (43%), Gaps = 131/457 (28%)
Query: 31 GYLVQKKEKRPNSEDYIVANMEF-----HPMILQPGSTPSVMRQHQGFP--IQEFESFAA 83
GY++ KK +SE AN+EF HP P + +G+ I+ +
Sbjct: 266 GYIIAKKNPHYDSEKGD-ANLEFVYEQFHPF------PPHLSEDEKGYTKIIEVPGQYNK 318
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
VD+FFST ES K L+ QE A +LE+ K D+E R+ AL Q ++ + II
Sbjct: 319 TVDDFFSTIESSKYALRIQNQEFQAKNRLESAKLDNEKRIQALIDVQTQNEVRGHAIIAA 378
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
+ V+ A AI+ + Q+ W+ IE + Q+ N+ +A +IK L L N TL L +
Sbjct: 379 ADLVEEAQNAIKALVEQQMDWKTIEVLISNEQKKNNRIARLIKLPLDLKNNKFTLSLPRN 438
Query: 203 KERSNIKDSWVLHHDNADPFADS------------------------------------- 225
E + DN+D D+
Sbjct: 439 DE---------IESDNSDEEEDNLTSSEDETSSSDSSDSSLSDFEADDNDEDELTSVSNI 489
Query: 226 -------SETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
+ +KPS+ +DL LSAYANA +F++K+S +KQ+K ++ +KALK+ E++
Sbjct: 490 KKDRNDNKKKEKPSIDATIDLTLSAYANASNYFNIKKSNVEKQKKVEKNAQKALKNIEQR 549
Query: 278 TKQTLKDVQTMTN--INKARKVYWFEKFYWFISSENYL--------------GNVAWDAK 321
++ LK ++ +NK RK Y+FEKF+WF+SSE +L G D
Sbjct: 550 IEKDLKKKLKESHDVLNKTRKPYFFEKFHWFVSSEGFLVLMGKSGMESDQIYGKYIHDND 609
Query: 322 V-VTNAW----WVKADQVSKTAPT------------------------------------ 340
V V+N++ W+K ++ P
Sbjct: 610 VFVSNSFDTHVWIKNPDETEVPPNTLMQAGIMCMSASPAWSKKIQSSAWWCFAKELSKFD 669
Query: 341 ---GEFLTTGSFMIRG--KKNFFPPCQLAMGISFLFK 372
GE L G+F ++ KK+F PP QL MG + L+K
Sbjct: 670 NYGGEVLPAGTFRLKDEKKKSFLPPSQLVMGFALLWK 706
>gi|46128721|ref|XP_388914.1| hypothetical protein FG08738.1 [Gibberella zeae PH-1]
Length = 1077
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 156/308 (50%), Gaps = 30/308 (9%)
Query: 28 SSQGYLVQKKEKRPNS----------EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
+ GY+ K+ +RP D ++ + +FHP I P ++ + E
Sbjct: 262 TCTGYIFAKRRERPEGTEVDEETKTKRDNLLYD-DFHPFI------PYKLKNDPAIEVLE 314
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F+ + VDEFFS+ E +++ K ++E A +KLE K + R+ L++ Q ++ KA
Sbjct: 315 FQGYNETVDEFFSSLEGQRLESKLTEREATAKRKLEAAKNEQNKRIEGLQEAQSLNFRKA 374
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
I N E V A+ A+ + + W D+ V++ ++ ++PVA IIK L LN+ N I
Sbjct: 375 AAIEANVERVQEAMDAVNGLLNQGMDWVDVGKLVEREKKRHNPVAEIIK-LPLNLAENLI 433
Query: 196 T--------LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAK 247
T ++ D L D A A ++ K V+++L LS ++NA+
Sbjct: 434 TLELAEEEFEPEEDDPYETDDDDDSALGDDEATSAAKGKQSNKALNVEINLGLSPWSNAR 493
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
+FD +++AA K++KT Q KALK+AE+K + LK Q + RK WFEKF W
Sbjct: 494 EYFDQRKTAAVKEEKTQQQASKALKNAEQKITEDLKKGLKQEKALLQPIRKQMWFEKFTW 553
Query: 306 FISSENYL 313
FISS+ YL
Sbjct: 554 FISSDGYL 561
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 40/61 (65%), Positives = 46/61 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K AWWV ADQVSK+APTGEFL GSFMIRGKKNF PP QL +G+ F++ E
Sbjct: 627 AWDSKAGMPAWWVNADQVSKSAPTGEFLQAGSFMIRGKKNFLPPAQLLLGLGLAFRISEE 686
Query: 377 S 377
S
Sbjct: 687 S 687
>gi|301106825|ref|XP_002902495.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262098369|gb|EEY56421.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1051
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 146/261 (55%), Gaps = 13/261 (4%)
Query: 63 TPSVMRQH-QGFP-IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP + QH Q + ++ F++F AVDE+FS E+ ++ + A KL +K++ +
Sbjct: 345 TPYLYAQHLQAYKKVKSFDTFDEAVDEYFSRFEAETAEVAKQSAQLAAENKLAKLKKNQQ 404
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
+L+ L + Q + A+LI NQ+ V+ +L IR +A+ + W +E V+ Q++ +P
Sbjct: 405 QQLAQLREVQEQSFQDAQLIEANQQDVENVLLVIRSALASGMDWRGLEELVRYEQKNGNP 464
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VAS+I QL L N + +LL DS +++ E KK ++ +DL L
Sbjct: 465 VASLIHQLDLEHNRVAILL---------CDSDEDDYEDGG-DGTGEEDKKAHVIWIDLSL 514
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-INKARKVYW 299
SA ANA+ + K+ A K +K ++T+KA+ AEK TK+TL+ QT N I + RK W
Sbjct: 515 SALANAREIYTKKKKAGAKVKKATEATDKAIALAEKNTKKTLEKQQTKRNVIYQRRKTLW 574
Query: 300 FEKFYWFISSENYLGNVAWDA 320
FEKF+WF+++E YL DA
Sbjct: 575 FEKFHWFLTNEKYLVVAGKDA 595
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 38/63 (60%), Positives = 51/63 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW ++V+ A+WV ADQVSKTAP GE+LTTGSFMIRGKKN+ P +L MG++ LF++++S
Sbjct: 660 AWTSQVIAGAYWVHADQVSKTAPAGEYLTTGSFMIRGKKNYIQPSRLEMGLAILFRIDDS 719
Query: 377 SIS 379
I
Sbjct: 720 CIG 722
>gi|302665563|ref|XP_003024391.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
gi|291188443|gb|EFE43780.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
Length = 1074
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 93/290 (32%), Positives = 149/290 (51%), Gaps = 35/290 (12%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP PS ++ I FE+F +AVD +FS+ E+ K++ + ++E A KK
Sbjct: 254 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 307
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
LE+ KR+HE R++AL++ Q KA I N V+ A+ A+ +A + W +I +
Sbjct: 308 LESTKREHEKRVNALKEKQEFHVRKARAIETNLPQVEEAMNAVNGLVAQGMDWVEIARLI 367
Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR-------------------------VHKER 205
+ Q +PVA IK LKL N IT+LL + +
Sbjct: 368 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEDESEEEEEDDDDDGYGDD 427
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
+ S H + + S+ D+DL +S +ANA++++D K+ AA K++KT++
Sbjct: 428 EYERPSQKKHSAKPLKEKKGKKDTRLSI-DIDLGISPWANARQYYDEKKIAAVKEEKTLK 486
Query: 266 STEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
++ KA+KS E+K K LK Q + + R WFEKF++FISS+ YL
Sbjct: 487 ASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 536
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 38/62 (61%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD K WWV A QVSK TG+ L G FMI+G+KN PP Q+ +G + LF++
Sbjct: 602 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 661
Query: 377 SI 378
S+
Sbjct: 662 SV 663
>gi|410077749|ref|XP_003956456.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
gi|372463040|emb|CCF57321.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
Length = 1038
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 124/449 (27%), Positives = 200/449 (44%), Gaps = 113/449 (25%)
Query: 30 QGYLVQKK-------EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE-SF 81
+G+++ KK + +P E Y+ N FHP +P + + + + I E E S+
Sbjct: 294 KGFILAKKNVNYDSLKDKPELE-YLYEN--FHP--FKPYISGA---EEKSVRILEIEGSY 345
Query: 82 AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
+D FFST ES K L+ QE A KKLE+ + D++ R+ +L Q++++ KA I+
Sbjct: 346 NRTLDVFFSTIESLKYSLRIQNQELQAKKKLEDARSDNQKRIQSLSDVQILNETKANAIL 405
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
NN + VD+A A++ + Q W IE + ++ + +A II +L LN+ N I + +
Sbjct: 406 NNTDLVDSAKQAVQDLLEQQTDWNMIEKLIMNEKKRRNKIAEII-ELPLNLKNNKINIKI 464
Query: 200 RVHK----ERSNIKDS---------------------------WVLHHDNADPFADSSET 228
+ E D+ V H+N + +
Sbjct: 465 PLQSPSQFEEETFSDNESVKSSLSDSDFSDESDSELSDFSMEEVVGRHENTRKIR-AKDD 523
Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQ 286
K+ V +DL LS+YANA ++F+ K+ +A+KQ+K + KA+ + E + Q LK +
Sbjct: 524 KQHVTVTIDLSLSSYANASQYFNSKKDSAEKQKKMEKHMAKAMTNIENRIDQQLKKKLRE 583
Query: 287 TMTNINKARKVYWFEKFYWFISSENYL-----------------------------GNVA 317
+ T + K RK Y+FEK+ WFISSE YL G+ A
Sbjct: 584 SHTVLKKIRKPYFFEKYNWFISSEGYLVMTGKSALENDQIYMKYIEDDDIFMSTSFGSKA 643
Query: 318 W----------------------------DAKVVTNAWWVKADQVSKTAPTGEFLT-TGS 348
W KVV + W A ++K G + TG
Sbjct: 644 WIKNPDRGEIPPNTLMQAGIFCASSSKAWSNKVVCSPKWCYARNITKFTQDGSIVAETGE 703
Query: 349 FMI--RGKKNFFPPCQLAMGISFLFKLEE 375
F++ K++ PP QL MGI FL+KL++
Sbjct: 704 FVLIDEQKQSTLPPAQLIMGIGFLWKLKQ 732
>gi|83033024|ref|XP_729296.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23486663|gb|EAA20861.1| strong similarity to unknown protein-related [Plasmodium yoelii
yoelii]
Length = 1768
Score = 128 bits (322), Expect = 4e-27, Method: Composition-based stats.
Identities = 113/393 (28%), Positives = 187/393 (47%), Gaps = 73/393 (18%)
Query: 51 MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDL-KAVQQERDAL 109
+EF P++L+ + I +F++F VD +FS E K D + + + ++AL
Sbjct: 440 VEFSPILLKNHINKI---NEKKIEIIKFDNFNMCVDTYFSKIELTKYDKHQEMNKNKNAL 496
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
K++ +K DHE R+ LE+ + K+K LI N + V AI +R I+ +WE I
Sbjct: 497 TKMDKIKLDHEKRIEGLEKEVSMLKKKILLIELNYQFVGEAIKLMRSAISTSANWEKIWD 556
Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITL-LLRVHKERSNIKDSWVLHHDNADPFADS--S 226
+K ++ N P+A +K + +N N+ + LL + ++++S ++ D + +
Sbjct: 557 HIKLFKKRNHPIA--LKIMSVNFNNCEMELLLDDNDDDDVEESGDDNNLKNDKWKEKVIE 614
Query: 227 ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV- 285
E K V ++L+ S + N + + L++ A +K +K ST A+K EKK K
Sbjct: 615 EKNKTCAVTINLNNSVFGNIEDYEKLRKKAEEKIRKIKMSTNIAVKKVEKKKKDKDIKQK 674
Query: 286 ---QTMTNINKARKVYWFEKF--------YWFISSENYLGN------------------- 315
+++ I K RK++WFEKF Y IS + L N
Sbjct: 675 GKNKSVFQIKKIRKIFWFEKFNWFISSENYLVISGRDSLQNEILFRRYFQNNDIYVHADI 734
Query: 316 ---------------------------------VAWDAKVVTNAWWVKADQVSKTAPTGE 342
AW+ K++T+AWWV QVSKTAPTGE
Sbjct: 735 HGAASCIIKNPYKDIPIPEKTLSEAGQLAMCRSSAWNNKMITSAWWVYYHQVSKTAPTGE 794
Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
++ TGSF+IRGKKN+ P +L MG+ +F++ +
Sbjct: 795 YIKTGSFVIRGKKNYLPYAKLEMGLCIIFQINK 827
>gi|302309325|ref|NP_986649.2| AGL017Wp [Ashbya gossypii ATCC 10895]
gi|299788305|gb|AAS54473.2| AGL017Wp [Ashbya gossypii ATCC 10895]
Length = 1006
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 120/442 (27%), Positives = 197/442 (44%), Gaps = 106/442 (23%)
Query: 31 GYLVQKK----EKRPNSEDYIVANMEFHPM--ILQPGSTPSVMRQHQGFPIQEFES-FAA 83
GY++ ++ +++ ++E + +FHP L GS + F I E + +
Sbjct: 273 GYILARENPLYDEKKDAEGIRLTYEQFHPFRPYLPDGS-------QKKFEIVEVDGDYNR 325
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
VD+FFST +S K L+ QE++A KKLE K +++ ++ AL + Q ++++ IINN
Sbjct: 326 TVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQALVEVQHTNEQRGNAIINN 385
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
E V+ A AI+ + Q+ W IE +K Q ++ +A +IK L L N I++ L +
Sbjct: 386 IELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKISVELPLS 445
Query: 203 KERSNIKD-SW----------------------------VLHHDNADPFADSSETKKPSL 233
E D SW V + + S + K
Sbjct: 446 NEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKVSNKSIT 505
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--I 291
V +DL +SAYANA +F++K++ AKKQ Q+ +KA+K+ E+K ++ LK + +
Sbjct: 506 VSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKKKLKEQHDVL 565
Query: 292 NKARKVYWFEKFYWFISSENYL-----------------------------GNVAW---- 318
R Y+FEK++WFIS+E +L G+ W
Sbjct: 566 QVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVSNGFGSQVWIKNF 625
Query: 319 -----------DAKVVTNA-------------WWVKADQVSKTAPT-GEFLTTGSFMIRG 353
A + N+ WW A +SK G L +G+F ++
Sbjct: 626 ERTEIPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVGGGLLPSGAFRLKS 685
Query: 354 --KKNFFPPCQLAMGISFLFKL 373
KNF PP QL MG +F++K+
Sbjct: 686 DEAKNFLPPAQLVMGFAFMWKI 707
>gi|408392777|gb|EKJ72097.1| hypothetical protein FPSE_07722 [Fusarium pseudograminearum CS3096]
Length = 1078
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 155/310 (50%), Gaps = 32/310 (10%)
Query: 28 SSQGYLVQKKEKRPNS----------EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
+ GY+ K+ +RP D ++ + +FHP I P ++ + E
Sbjct: 262 TCTGYIFAKRRERPEGTEVDEETKTKRDNLLYD-DFHPFI------PYKLKNDPAIEVLE 314
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
FE + VDEFFS+ E +++ K ++E A +KLE K + R+ L++ Q ++ KA
Sbjct: 315 FEGYNETVDEFFSSLEGQRLESKLTEREATAKRKLEAAKNEQNKRIEGLQEAQSLNFRKA 374
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
I N E V A+ A+ + + W D+ V++ ++ ++PVA IIK L LN+ N I
Sbjct: 375 AAIEANVERVQEAMDAVNGLLNQGMDWVDVGKLVEREKKRHNPVADIIK-LPLNLAENLI 433
Query: 196 T----------LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
T ++ + D L D A +T K V+++L S ++N
Sbjct: 434 TLELAEEEFEPEEDDPYETDDDDDDDSALGDDEGTSAAKGKQTNKALNVEINLGFSPWSN 493
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
A+ +FD +++AA K++KT Q +ALK+AE+K + LK Q + RK WFEKF
Sbjct: 494 AREYFDQRKTAAVKEEKTQQQASRALKNAEQKITEDLKKGLKQEKALLQPIRKQMWFEKF 553
Query: 304 YWFISSENYL 313
WFISS+ YL
Sbjct: 554 TWFISSDGYL 563
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 40/61 (65%), Positives = 46/61 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K AWWV ADQVSK+APTGEFL GSFMIRGKKNF PP QL +G+ F++ E
Sbjct: 629 AWDSKAGMPAWWVNADQVSKSAPTGEFLQAGSFMIRGKKNFLPPAQLLLGLGLAFRISEE 688
Query: 377 S 377
S
Sbjct: 689 S 689
>gi|146419620|ref|XP_001485771.1| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
6260]
Length = 873
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 124/455 (27%), Positives = 193/455 (42%), Gaps = 122/455 (26%)
Query: 21 LVSNLNVSSQGYLVQKK----EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
LVS+L G +V KK E +++D EFHP +P + F
Sbjct: 127 LVSSLGTEVTGIIVSKKNPAFEPSDDNKDLEYLYDEFHP--FKP-----YKENLELFKFT 179
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
E + VD FFST +S K +L+ QQ+ +A K+L N + + + ++ L Q ++ +K
Sbjct: 180 EIRGYNKTVDTFFSTLDSKKHELRMEQQKHNAKKRLLNAREERDKQIDNLRIQQEMNSKK 239
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
+ II + + V I +++ + Q+ W +IE+ +K Q + VA IK L LN+
Sbjct: 240 GDAIIYHADLVSECIASVQTLLDQQMDWANIESLIKLEQSRGNSVAKTIK-LPLNLTENK 298
Query: 197 LLLRVHKERSNIKDSWVLHHDNAD--------------------------------PFAD 224
+ L++ S +D AD P
Sbjct: 299 IGLKLPDTDS--------MYDPADIDSELDSETSSESETESESESESGSESEDETPPKRM 350
Query: 225 SSETKK---PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
S + K P+L V +DL LS +ANA+ +F+ K+ A KQ+K ++T+ AL++A+KK +Q
Sbjct: 351 SKKAKSKEIPALSVWIDLLLSPFANARTYFESKKQAESKQEKVEKNTDMALRNAQKKIEQ 410
Query: 281 TLKDVQTMTNIN----KARKVYWFEKFYWFISSENYL----------------------- 313
L + + N N + R YWFEKF+WF+SSE YL
Sbjct: 411 DL--AKNLKNENETLRQVRPKYWFEKFFWFVSSEGYLCIAGRDDAQVDMIYYRHFSDNDF 468
Query: 314 ---GNVAWDAKVVTN-------------------------AW--------WVKA-DQVSK 336
++ KVV AW W A + V+K
Sbjct: 469 FVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAMSASAAWNGKITTSPWFLAGNDVTK 528
Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
G + +G+F +GKK F PP QL MG+ F F
Sbjct: 529 LDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYF 563
>gi|145351275|ref|XP_001420008.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580241|gb|ABO98301.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1069
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/267 (36%), Positives = 148/267 (55%), Gaps = 22/267 (8%)
Query: 63 TPSVMRQHQGFPIQEFE---------SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
+P ++Q++G P ++FE +F AVDE+F E+ L + E AL KLE
Sbjct: 307 SPFSLKQNEGRPTRKFELPKGLDPVCAFDHAVDEYFIALEAQSQILARRKAEAQALAKLE 366
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
+D ++R+ LE+ + ++++A LI N E+VD AI A+ +A+ +SW ++EA + +
Sbjct: 367 KSLKDQKSRVEQLEREREKEEQRAVLIEYNHEAVDTAIDAVNSALASGMSWPELEAMINE 426
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
+R +PVA +IK L L N IT+ L +N H D D +S +K
Sbjct: 427 ERRLGNPVAGMIKSLDLANNQITITL------AN-------HLDEVDEVDAASGKRKRVA 473
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V VDL LSA+ANA F K+ A+K KT+ + KA+ +AE K K ++ ++I +
Sbjct: 474 VGVDLGLSAHANASMRFAAKKKHAEKFSKTVDAQSKAVAAAEAKAKAAMEKAANGSSIAR 533
Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDA 320
AR+ WFEKF WFI+SEN L A DA
Sbjct: 534 ARQPLWFEKFNWFITSENCLVLQAKDA 560
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 35/61 (57%), Gaps = 2/61 (3%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPT-GEFLTTG-SFMIRGKKNFFPPCQLAMGISFLFKLE 374
AW+++ V +AWW +++VSK +P G+ L G + + K F P QL MG +F +
Sbjct: 619 AWNSRAVKSAWWTSSERVSKISPVAGDALPPGVTHVAHADKQFLPHAQLVMGFGLMFVVS 678
Query: 375 E 375
E
Sbjct: 679 E 679
>gi|345565416|gb|EGX48366.1| hypothetical protein AOL_s00080g336 [Arthrobotrys oligospora ATCC
24927]
Length = 1207
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 172/382 (45%), Gaps = 73/382 (19%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+FHP + P+ + ++ VD FFS+ ES K++ K +++ A K+
Sbjct: 385 DFHPFL------PTQFKDKPNVHTIPITTYNKTVDSFFSSIESQKLEQKTAEKKSLAAKR 438
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
L N + +H+ ++ +L+ Q V KA+ I N E V+ I A+ IA + W +I + V
Sbjct: 439 LANARNEHKNKIESLKSAQEVHVRKAQAIEANVERVEEVIDAVNGLIAQGMDWTEIRSLV 498
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
++ + + VA +I+ +K N T+++R+++E ++ + +
Sbjct: 499 EREKSAGNGVAEMIRDVKFMEN--TVVVRLYEEEEEDDSDDDDDESGSEDGNGEEKEGRS 556
Query: 232 SL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM-- 288
L +++DL L+ YANA+ +++ KRSAA K+ KT+QS+ KALKS EKK ++ LK
Sbjct: 557 HLDIEIDLALTGYANARIYYEQKRSAAVKETKTLQSSAKALKSTEKKIQKDLKQAYKAEK 616
Query: 289 TNINKARKVYWFEKFYWFISSENYL--------------------GNVAWDAKVVTNAWW 328
+ R+ W+EKFYWF SSE YL G+V A+V +
Sbjct: 617 MELRTLRRQGWWEKFYWFRSSEGYLVLGAKDPTQADMLYKKYFKKGDVWVHAEVPGSCHV 676
Query: 329 V---KADQVSKTAPTGEFLTTGS------------------------------------- 348
V K + V+ P G GS
Sbjct: 677 VVKNKVEDVNSPIPPGTLSQAGSLAVASSDAWEKKMVISAWWAGYEQVGKIGAGGIVLGT 736
Query: 349 --FMIRGKKNFFPPCQLAMGIS 368
F+++G+K + PP L MG +
Sbjct: 737 GEFVVKGEKKWLPPAMLVMGFA 758
>gi|50312521|ref|XP_456296.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49645432|emb|CAG99004.1| KLLA0F27335p [Kluyveromyces lactis]
Length = 1027
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 124/451 (27%), Positives = 195/451 (43%), Gaps = 117/451 (25%)
Query: 31 GYLVQKKEK-------RPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAA 83
GY+V K K +P E Y +N FHP G + V G +
Sbjct: 273 GYIVAHKNKLYDPLRDKPELE-YTFSN--FHPFKPFVGDSTDVKIIEIG------GMYNN 323
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
VD FFST ES+K + Q+ A KKL+ K ++ET + +L Q ++EK ++I N
Sbjct: 324 TVDTFFSTIESNKYASRIQNQDFQAQKKLDEAKNNNETIIKSLLHAQQTNEEKGNILIAN 383
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHK 203
V+ A A++ + Q+ W+ +E + QR + +A IIK L +++ + + + + K
Sbjct: 384 ANLVEEAKNAVKSLLDQQMDWQSMETLIANEQRKGNKIARIIK-LPMDLPNNKITIELPK 442
Query: 204 ERSNIKDSWVLHHDNADPFAD--------------------------------------S 225
+ + DS H AD ++ +
Sbjct: 443 DGYSEDDSTEHHQSEADYSSNESDVNQSDSSVSSDYSDSDFEELTSSKSKQQSRRKSKIT 502
Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KD 284
SE ++ L+ VDL LSA+ANA +F+ K++ ++KQ+K ++ EKALKS ++K ++ L K
Sbjct: 503 SEKRETVLLTVDLSLSAFANASSYFNAKKATSEKQKKVEKNAEKALKSIQQKIEKDLQKK 562
Query: 285 VQTMTNINKA-RKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNA-- 326
+ +I KA R Y+FEK+YWFISSE++L V D +VTNA
Sbjct: 563 SKESHDILKAIRTPYFFEKYYWFISSESFLVLMGKSPVETDQLYAKYVNDDDIMVTNAFD 622
Query: 327 ----------------------------------------WWVKADQVSKTAPT-GEFLT 345
WW A V+K G L
Sbjct: 623 VKAWILNPQKTEVPPNTLMQAGTFANSASDAWSKKIASSPWWCFAKNVTKFDDIDGSVLP 682
Query: 346 TGSFMIR--GKKNFFPPCQLAMGISFLFKLE 374
GSF ++ KN PP QL MG+ ++K++
Sbjct: 683 VGSFRMKQPKAKNMLPPAQLVMGLGLVWKVK 713
>gi|348681953|gb|EGZ21769.1| hypothetical protein PHYSODRAFT_557667 [Phytophthora sojae]
Length = 1063
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 145/261 (55%), Gaps = 12/261 (4%)
Query: 63 TPSVMRQHQGF--PIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP + QHQ ++ F++F AVDE+FS E+ ++ + A KL +K++ +
Sbjct: 346 TPFLYAQHQQAHKKVKSFDTFDEAVDEYFSRFEADTAEVAKQSAQLAAENKLAKLKKNQQ 405
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
+L+ L + Q + A+LI NQ+ V+ +L IR +A+ + W +E V+ Q++ +P
Sbjct: 406 QQLAQLREVQEQSFQHAQLIEANQQDVENVLLVIRSALASGMDWRGLEELVRYEQKNGNP 465
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VAS+I +L L N + +LL ++ +D E K+ ++ +DL L
Sbjct: 466 VASLIHKLDLEHNRVAILLCDEEDDDEGEDGGD---------GTGEEDKQAHVIWIDLSL 516
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-INKARKVYW 299
SA ANA+ + K+ A +K +K ++T+KA+ AEK TK+TL+ QT N I + RK W
Sbjct: 517 SALANAREIYTKKKKAGEKVKKATEATDKAIALAEKNTKKTLEKQQTKRNVIYQRRKTLW 576
Query: 300 FEKFYWFISSENYLGNVAWDA 320
FEKF+WF+++E YL DA
Sbjct: 577 FEKFHWFLTNEKYLVVAGKDA 597
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 41/63 (65%), Positives = 53/63 (84%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW ++V+ A+WV ADQVSKTAP GE+LTTGSFMIRGKKN+ P +L MG++ LF+++ES
Sbjct: 662 AWTSQVIAGAYWVHADQVSKTAPAGEYLTTGSFMIRGKKNYIQPSRLEMGLAVLFRIDES 721
Query: 377 SIS 379
SIS
Sbjct: 722 SIS 724
>gi|401842736|gb|EJT44818.1| TAE2-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 1032
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 159/311 (51%), Gaps = 34/311 (10%)
Query: 30 QGYLVQKKEKRPNS----EDYIVANMEFHPMILQPGSTPSVMRQHQG-FPIQEFES-FAA 83
+GY++ KK + NS D FHP P + ++ G F I + E +
Sbjct: 281 KGYILAKKNENFNSIKDPADLEFIYDTFHPF------RPYINEKNSGSFRIADVEGPYNK 334
Query: 84 AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
+D+FFST ES K L+ QE A KK+++ + +++ ++ AL Q +++ K LII N
Sbjct: 335 TLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLNVQELNERKGHLIIEN 394
Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLLRV 201
++ LA++ + Q+ W IE +K Q+ + +A ++ L LN+ N I++ L +
Sbjct: 395 ASLIEEVKLAVQGLVDQQMDWSTIEKLIKSEQKKGNKIAQLL-NLPLNLKQNKISVKLDI 453
Query: 202 HKERSNI------------KDSWVLHHDNADPFADSSETKKPSL-----VDVDLDLSAYA 244
+E +I + F + + KK V +DL LSAYA
Sbjct: 454 SREEESITSSDEDDESEDSSSEGSSDSGDMSTFKEENSKKKGQSNNALNVTIDLGLSAYA 513
Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEK 302
NA +F++K+++A+KQ+K ++ KA+K+ E K Q LK ++ + + K R Y+FEK
Sbjct: 514 NASEYFNIKKTSAEKQKKVEKNVGKAMKNIEVKIDQQLKRKLKESHSVLKKVRTPYFFEK 573
Query: 303 FYWFISSENYL 313
+ WFISSE +L
Sbjct: 574 YNWFISSEGFL 584
>gi|367011407|ref|XP_003680204.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
gi|359747863|emb|CCE90993.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
Length = 1016
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 103/329 (31%), Positives = 166/329 (50%), Gaps = 50/329 (15%)
Query: 26 NVSSQGYLVQKKEKR----PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFP-IQEFES 80
N S+G+++ KK +SED FHP P + Q QG I E
Sbjct: 276 NKDSRGFILAKKNVNYTLGKDSEDLEFIFENFHPF------KPFIEEQDQGRSRITEVPG 329
Query: 81 -FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
+ +D FFST ES K L+ QQE+ A KK+E+ + +++ R+ AL Q +++K
Sbjct: 330 EYNKTLDTFFSTIESSKYALRIQQQEQLAKKKIEDARLENQKRIQALLDVQSSNEQKGHA 389
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITL 197
II N + V+ A +A++ I Q+ W+ IE ++ Q + +A +I L LN+ N + +
Sbjct: 390 IIANADLVEEAKIAVQGLIDQQMDWQTIEKLIRNEQLKKNKIAMVI-DLPLNLKENAVNI 448
Query: 198 LLRV-----HKERSNIKDSWVLHH----------DNADPFADSSETKKPS---------- 232
L+ V H S +S+V D+ D ET++
Sbjct: 449 LVPVSHDDEHNNESESDESFVESSSDESDSDEGTDSDDSEVSDFETEESRNESRTSKRKV 508
Query: 233 ----LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE----KKTKQTLKD 284
+ +DL LSAYANA ++F +K+++A KQ+K ++ EKA+K+ E K+ KQ LK+
Sbjct: 509 ENKLKIRIDLGLSAYANASKYFTVKKTSADKQKKVEKNVEKAMKNIEQRIDKQLKQKLKE 568
Query: 285 VQTMTNINKARKVYWFEKFYWFISSENYL 313
++ + +AR Y+FEK +WF SSE +L
Sbjct: 569 SHSV--LKRARSPYFFEKHFWFYSSEGFL 595
>gi|325185450|emb|CCA19934.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1061
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 91/314 (28%), Positives = 164/314 (52%), Gaps = 42/314 (13%)
Query: 33 LVQKKEKRPNSEDYIVANMEFHPMILQPGS--------TPSVMRQHQGFPIQEFESFAAA 84
++ + EK+P I+ N Q G+ TP + QH+ F++F
Sbjct: 299 ILSESEKKPKGGYIILGNA-------QTGTKTDTYESFTPVLYAQHREKAYVSFDTFDQT 351
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
VDE+FS E+ K + ++ A KLE ++++ +L L ++ ++ + A+LI +Q
Sbjct: 352 VDEYFSYHEARKTQTGSQAAQQAASSKLEKMRKNQIQQLDELHHSEEINLKHAQLIELHQ 411
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
++ + IR +A+ + W+ ++ VK Q + +PVAS+I + L+ N +++LL
Sbjct: 412 LDIEKVLSVIRSALASGMDWKALKDLVKYEQTNANPVASMIHEFDLSKNRVSVLL----- 466
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLKRSAAKKQQK 262
+ DP+ + +E P++ + +DL LSA NA + K+++A+K +K
Sbjct: 467 -------------SDDPYFEDAE---PAVHAIWLDLSLSALGNAAELYAKKKTSAEKAKK 510
Query: 263 TIQSTEKALKSAEKKTKQTLK-DVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAW 318
+TEKA+K A KT++ +K + T I++ RK +WFEKF+WF+SSEN L G A
Sbjct: 511 AEVATEKAIKLAASKTEKFMKTQLIKPTPIHQRRKTFWFEKFHWFLSSENILVISGKDAQ 570
Query: 319 DAKVVTNAWWVKAD 332
+++ N + K D
Sbjct: 571 QNELLVNRYVRKND 584
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 41/60 (68%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW +V+T A+WVKA+ VSK+ +GE L G+F+I GKKNF +L MG++ L+ EE
Sbjct: 631 AWKNQVITGAYWVKAECVSKSTSSGELLPPGTFLILGKKNFLQALRLEMGLAILYHTEEG 690
>gi|358339725|dbj|GAA47729.1| nuclear export mediator factor NEMF [Clonorchis sinensis]
Length = 449
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/262 (35%), Positives = 135/262 (51%), Gaps = 30/262 (11%)
Query: 21 LVSNLNVS---SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
LV NV S+GY+ KK +P E+ + +F P + R H FP
Sbjct: 205 LVGTDNVEMGMSRGYIFGKK-LQPEDEE-LSRQEDFQPFLFDQFRN----RPHVAFP--- 255
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
+F+ AVD +FS E K VQ E A KK EN+K+DHE RL+AL+ Q D KA
Sbjct: 256 --TFSKAVDTYFSKIERDKTTELLVQNENKANKKFENIKKDHELRLAALKADQEQDVHKA 313
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
+L+ N++ VD IL I ++NQL W ++ +++A+ D +AS I QL L N IT+
Sbjct: 314 QLLEKNRQLVDNIILMINHALSNQLDWGTLDTMIQEARARGDLLASHIVQLNLQQNQITV 373
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKK---------PSLVDVDLDLSAYANAKR 248
L+ S L DPF SE + P+ V + LDL+A NA++
Sbjct: 374 SLKYGF-------SLYLLIMPRDPFESESEGENCERDQTISAPTEVVISLDLNALNNARK 426
Query: 249 FFDLKRSAAKKQQKTIQSTEKA 270
++D KR+A KK++KT+ ++ K
Sbjct: 427 YYDRKRAALKKEEKTLIASRKV 448
>gi|297736754|emb|CBI25955.3| unnamed protein product [Vitis vinifera]
Length = 712
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 88/260 (33%), Positives = 134/260 (51%), Gaps = 36/260 (13%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ Q + +FE+F AA DEF+S E + + QQ+ K E R
Sbjct: 314 CPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSE----QQQ----------KAKEENR 359
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L++ + AELI N E VDAAILA+R +AN ++WED+ VK+ ++ +PVA
Sbjct: 360 VHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEKKKSGNPVA 419
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I +L L N +TLLL +N D D +T V+VDL LSA
Sbjct: 420 GLIDKLYLERNCMTLLLS----------------NNLDEMDDDEKTLHVDKVEVDLALSA 463
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ANA+++++ K+ K++KTI + EK LK +++ + + N WFEK
Sbjct: 464 HANARQWYEQKKRQENKREKTIIAHEKLLKLLKRRLASSFHSYWPLVLFN------WFEK 517
Query: 303 FYWFISSENYLGNVAWDAKV 322
F WFISS+NY DA++
Sbjct: 518 FNWFISSKNYFVISGRDAQL 537
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 40/60 (66%), Positives = 46/60 (76%)
Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
WD+K+VT+AWWV QVSKTAPTGE+LT GSFMI GKKNF PP L MG LF L+E +
Sbjct: 643 WDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIHGKKNFLPPHPLMMGFGLLFCLDERA 702
>gi|444317477|ref|XP_004179396.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
gi|387512437|emb|CCH59877.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
Length = 1053
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 125/464 (26%), Positives = 196/464 (42%), Gaps = 133/464 (28%)
Query: 30 QGYLVQKKEKRPNSEDYIVA----NME--------FHPMILQPGSTPSVMRQHQGFPIQE 77
+GY+V KK PN Y++ ++E F P I + T S + G
Sbjct: 273 RGYIVAKK--NPN---YVIGRDADDLEYVYETFNPFEPFIDETHRTNSKIIIVDG----- 322
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
+ +D+FF+T ES K LK QE A KK+E+ +++ R+ AL Q +++K
Sbjct: 323 --PYNLTLDKFFTTIESSKYALKIQTQEEQAKKKIEDAHLENKKRIDALINVQTSNEQKG 380
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ-RHNDPVASIIKQLKLNINHIT 196
II N E ++ A++ + Q+ W IE +K Q R N+ +II L L N I
Sbjct: 381 YAIIANTELIETTKYAVQGLVDQQMDWNTIEKLIKNEQVRGNEVAENIILPLNLKENTIN 440
Query: 197 LLLRVH-------KERSNIKDSWVLHHDNADPFADSSE---------------------- 227
++L + S +D + DN ++S+
Sbjct: 441 MILPLKSETSSIENSSSEEQDEYCSESDNEPANENTSDEESDISVEQDVSDFVEVTTIGN 500
Query: 228 ----TKKPS-----------LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
+KK +V +DL LSAYANA R+FD K+ A+KQ++ ++ EKA+K
Sbjct: 501 SPLISKKSKHKRLQNNENSIIVSIDLSLSAYANASRYFDTKKKTAEKQKRVEENAEKAMK 560
Query: 273 SAEKKTKQTL--KDVQTMTNINKARKVYWFEKFYWFISSEN------------------- 311
+ E+ + +L K ++ + K RK Y+FEK++WFISSE
Sbjct: 561 NIEQGIETSLQRKLKESHEVLKKIRKPYFFEKYHWFISSEKILVLMGKSSTETDQIYSKY 620
Query: 312 ------YLGNVAWDAKV------------------------VTNAW---------WVKAD 332
Y+ N ++D +V + AW W KA
Sbjct: 621 IEDDDIYMSN-SFDTQVWIKNPEKIEISPNTLMQAGVFCMSSSEAWSKKIASSPWWCKAK 679
Query: 333 QVSKTAPTGE-FLTTGSFMIR--GKKNFFPPCQLAMGISFLFKL 373
VSK G L G F+++ +K+ PP QL MGI L+K+
Sbjct: 680 NVSKFDKEGNTCLEPGKFILKNENEKHSLPPAQLVMGIGLLWKV 723
>gi|340505619|gb|EGR31934.1| hypothetical protein IMG5_099620 [Ichthyophthirius multifiliis]
Length = 1423
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 156/316 (49%), Gaps = 60/316 (18%)
Query: 30 QGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFF 89
QGYLV ++++ N EF P+IL +QG I++ ESF ++++F
Sbjct: 714 QGYLVISEKEKANY-------FEFSPLILNS---------YQGKQIEQMESFNDCINKYF 757
Query: 90 S-----TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
E K D++++ A KK N+K D E R+ L+ Q KA+LI N
Sbjct: 758 QKMSKKIEEEQKEDVESI-----AWKKYLNIKTDQENRIKKLKDEQEEFITKAQLIEENY 812
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
+ V+A ++ ++ L+W+ I + + ++ DP+A++I Q+ N +++ L
Sbjct: 813 QDVEAITNILKTMKSSGLAWDKIIKMINEGKKQGDPLANLIHQIDFENNEVSIYL----- 867
Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
F D ++ V VD+ SA+ NA+ +++ KR K++KT+
Sbjct: 868 ----------------GFIDDQMSELIP-VSVDIYQSAHQNARNYYENKRKNVLKEKKTL 910
Query: 265 QSTEKALKSAEKKTKQTLKDVQ-----TMTNINKARKVYWFEKFYWFISSENYL---GNV 316
+T+ ALK AE K LK+++ TM +N RK YWFEKFYWFI+SENYL G
Sbjct: 911 DATKTALKQAE---KTALKEIETQKHKTMQLVN-VRKQYWFEKFYWFITSENYLVISGRD 966
Query: 317 AWDAKVVTNAWWVKAD 332
+ +++ + K D
Sbjct: 967 SQQNEILVKKYMKKGD 982
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 48/64 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+AK++ +AWWV + QVSK A TGE+L +GSFMIRGKKNF P ++ M + LFKL +
Sbjct: 1025 AWEAKIIASAWWVDSHQVSKRAETGEYLPSGSFMIRGKKNFVYPSRMEMACTILFKLNDD 1084
Query: 377 SISR 380
S+ R
Sbjct: 1085 SLER 1088
>gi|254566655|ref|XP_002490438.1| hypothetical protein [Komagataella pastoris GS115]
gi|238030234|emb|CAY68157.1| hypothetical protein PAS_chr1-4_0316 [Komagataella pastoris GS115]
gi|328350832|emb|CCA37232.1| Uncharacterized protein YPL009C [Komagataella pastoris CBS 7435]
Length = 1007
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 122/446 (27%), Positives = 187/446 (41%), Gaps = 124/446 (27%)
Query: 31 GYLVQKKEK--RPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
GY+V K+ P++E +YI EFHP +P S Q I+ + + +
Sbjct: 271 GYIVSKRNPIHDPSTEGSLEYIYD--EFHPY--EPTHKRSSDTQ-----IKTIKGYNKTI 321
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
D+FF+T E K LK QQ+ +A ++L++VK ++ +++ L + QL++ +K E+I+ +
Sbjct: 322 DDFFTTIEVSKHSLKEQQQKVNAERRLQSVKSENLEKIAKLTEAQLLNIQKGEVIMVYSD 381
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLLRV-- 201
V+ A++ + Q+ W IE + ++ + +A +I L LN+ N I+L L +
Sbjct: 382 VVEQCKAAVQSLLDQQMDWNHIEKLIGVEKKRGNEIAKLI-NLPLNLLENKISLALPLVN 440
Query: 202 --------------------------HKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
+E N K S H D K V+
Sbjct: 441 FDESSEEEDESDSEDESDSEDSSSSDEQETKNKKQSSTKHSRKKD---------KTINVN 491
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK-----DVQTMTN 290
+DL LSAYANA +FD K+ A K KT +++E A+KSAE K + LK + + N
Sbjct: 492 IDLSLSAYANASTYFDAKKIAQDKLVKTEKNSELAIKSAESKINRDLKKQKKTESSQVNN 551
Query: 291 INKA----RKVYWFEKFYWFISSENYLGNVAWD--------------------------- 319
N A R +WFEK++WFISS+ +L D
Sbjct: 552 SNAALRQIRDKFWFEKYFWFISSDGFLCVAGRDDQQFDHIYFEYFDNDNDFLVSNELEGA 611
Query: 320 AKVV-------------------------TNAW--------W-VKADQVSKTAPTGEFLT 345
KV+ T AW W V VSK G L
Sbjct: 612 LKVIVKNPFLNKDVAPNTFIQAGAFSLSTTKAWENKMVSSPWIVTGSSVSKRDVDGSALA 671
Query: 346 TGSFMIRGKKNFFPPCQLAMGISFLF 371
G I +K F PPCQ+ MG L+
Sbjct: 672 PGLVNITTEKQFLPPCQMVMGFGMLW 697
>gi|68475252|ref|XP_718344.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
gi|68475451|ref|XP_718248.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
gi|46440007|gb|EAK99318.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
gi|46440107|gb|EAK99417.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
Length = 1018
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 120/459 (26%), Positives = 191/459 (41%), Gaps = 107/459 (23%)
Query: 19 HVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
++ + N ++++GY+V KK + + + +YI EFHP +P P+ +G
Sbjct: 257 YIDLINGEIATEGYIVAKKNNKVSEKSDLEYIYD--EFHP--FEP-YKPN----QEGIKF 307
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
+ +D+FFS ES K +K QQ+ +A K+LE + + + ++ +L Q ++ +
Sbjct: 308 ISVSGYNKTLDKFFSNIESTKFSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAK 367
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
K ELI + E V+ ++ I Q+ W +IE + Q+ + +A I QL LN+ N
Sbjct: 368 KGELIQYHSELVEECRSYVQSFIDQQMDWTNIETVISLEQKKKNELAQHI-QLPLNLKEN 426
Query: 194 HITLLLRVHKERSNIKDS-------------------WVLHHDNADPFADSSETKKPSLV 234
I +LL + I +S DN + T++ +
Sbjct: 427 KIKVLLEDFDDYEEITESASATETGSETETESESESSSESESDNDEDKIPVKRTQRKTNT 486
Query: 235 D-----------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
+DL SA+ANA+ +FD K++A KQ K ST ALK+AE+K Q L
Sbjct: 487 KEKPKRKTIPTWIDLSQSAFANARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLT 546
Query: 284 DVQTMTN--INKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------VKAD 332
N + + R YWFEKF+WF+SSE YL DA ++ V AD
Sbjct: 547 RSLKQENDTLKEIRPKYWFEKFFWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSAD 606
Query: 333 QVS------KTAPTGEFL-----------------------TTGSFMIRG---------- 353
K GE L TT ++++ G
Sbjct: 607 MEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDG 666
Query: 354 ------------KKNFFPPCQLAMGISFLFKLEESSISR 380
+K + PP QL MG F L++ S R
Sbjct: 667 SIVPEGEFNYLVQKEYLPPAQLIMGFGFYCLLDDESTKR 705
>gi|349581807|dbj|GAA26964.1| K7_Ypl009cp [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1027
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/441 (25%), Positives = 197/441 (44%), Gaps = 104/441 (23%)
Query: 30 QGYLVQKKEKRPNSEDYIVANMEF--------HPMILQPGSTPSVMRQHQGFPIQEFESF 81
+GY++ K+ + NSE A++EF P I S S + + +G +
Sbjct: 282 KGYILAKRNENYNSEKD-TADLEFIYDTFHPFKPYINGGDSDSSCIIEVEG-------PY 333
Query: 82 AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
+D+FFST ES K L+ QE A KK+++ + +++ ++ AL Q +++ K LII
Sbjct: 334 NRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLII 393
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
N ++ LA++ I Q+ W IE +K Q+ + +A ++ L LN+ N I++ L
Sbjct: 394 ENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKISVKL 452
Query: 200 RVHKERSNIKD----------SWVLHHDNADPFADSSETKKPSL---------VDVDLDL 240
+ + N + +++ S E S+ V +DL L
Sbjct: 453 DLSSKELNTSSDEDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGL 512
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNINKARK 296
SAYANA +F++K+++A+KQ+K ++ KA+K+ E K K+ LKD ++ + K R
Sbjct: 513 SAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV--LKKIRT 570
Query: 297 VYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WVKADQVSKT 337
Y+FEK+ WFISSE +L + D ++N++ W+K + ++
Sbjct: 571 PYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWIKNPERTEV 630
Query: 338 APT---------------------------------------GEFLTTGSFMIRGK--KN 356
P L G+F ++ + +N
Sbjct: 631 PPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQN 690
Query: 357 FFPPCQLAMGISFLFKLEESS 377
PP QL MG FL+K++ S
Sbjct: 691 HLPPAQLVMGFGFLWKVKTSG 711
>gi|406604691|emb|CCH43887.1| putative RNA-binding protein [Wickerhamomyces ciferrii]
Length = 983
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/314 (32%), Positives = 158/314 (50%), Gaps = 35/314 (11%)
Query: 25 LNVS-SQGYLVQKK----EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGF-PIQEF 78
LNV + G++V KK + + ED +FHP P +++ + PIQ
Sbjct: 253 LNVKPTNGHVVLKKNPLYDSEKDEEDLKYLYEQFHPF------EPINLKEDEELVPIQ-- 304
Query: 79 ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
+ +D+FFST ES K L+ QE A K+L+ + D + ++ L Q V+ K E
Sbjct: 305 -GYNKTLDKFFSTIESSKYALRIQNQENQAKKRLQQARDDKQQQVQRLLDVQAVNTLKGE 363
Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHIT 196
II N E V+ A A++ + Q+ W+ +E + + + VA +I L LN+ N I+
Sbjct: 364 TIIFNAEIVEEAKAAVQALLDQQMDWKTMEKLINVEKAKGNRVAKVI-NLPLNLKENKIS 422
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADS----------SETKKPSL-----VDVDLDLS 241
L L +N +D ++ S+ KK ++ V +DL LS
Sbjct: 423 LSLSTEDPYANDEDEDESSSESEPESESDSDEDEPKPVKSQAKKDNVKNTINVTIDLTLS 482
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--INKARKVYW 299
+YANA +F++K+S +KQ+K QS KALK+ E+K ++ LK N + K R Y+
Sbjct: 483 SYANASEYFNVKKSTVEKQKKVEQSATKALKNIEQKIEKDLKKNLKQENDILRKLRNPYF 542
Query: 300 FEKFYWFISSENYL 313
FEKF WFIS+ENYL
Sbjct: 543 FEKFNWFISNENYL 556
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 37/57 (64%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AW+ K+VT++WW+ A V+K G L GSF +KNF PP QL MG +FL+K+
Sbjct: 619 AWENKMVTSSWWLYASDVTKKDIDGTILNAGSFRYLKEKNFLPPSQLVMGFAFLWKV 675
>gi|3859683|emb|CAA22020.1| conserved hypothetical protein [Candida albicans]
Length = 1018
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 119/459 (25%), Positives = 191/459 (41%), Gaps = 107/459 (23%)
Query: 19 HVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
++ + N ++++GY+V KK + + + +YI EFHP +P P+ +G
Sbjct: 257 YIDLINGAIATEGYIVAKKNNKVSEKSDLEYIYD--EFHP--FEP-YKPN----QEGIKF 307
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
+ +D+FFS ES K+ +K QQ+ +A K+LE + + + ++ +L Q ++ +
Sbjct: 308 ISVSGYNKTLDKFFSNIESTKLSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAK 367
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
K ELI + E V+ ++ I Q+ W +IE + Q+ + +A I QL LN+ N
Sbjct: 368 KGELIQYHSELVEECRSYVQSFIDQQMDWTNIETVISLEQKKKNELAQHI-QLPLNLKEN 426
Query: 194 HITLLLRVHKERSNIKDS-------------------WVLHHDNADPFADSSETKKPSLV 234
I +LL + +S DN + T++ +
Sbjct: 427 KIKVLLEDFDDYEESTESASATETGSETETESESESSSESESDNDEDKIPVKRTQRKTNT 486
Query: 235 D-----------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
+DL SA+ANA+ +FD K++A KQ K ST ALK+AE+K Q L
Sbjct: 487 KEKPKRKTIPTWIDLSQSAFANARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLT 546
Query: 284 DVQTMTN--INKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------VKAD 332
N + + R YWFEKF+WF+SSE YL DA ++ V AD
Sbjct: 547 RSLKQENDTLKEIRPKYWFEKFFWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSAD 606
Query: 333 QVS------KTAPTGEFL-----------------------TTGSFMIRG---------- 353
K GE L TT ++++ G
Sbjct: 607 MEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDG 666
Query: 354 ------------KKNFFPPCQLAMGISFLFKLEESSISR 380
+K + PP QL MG F L++ S R
Sbjct: 667 SIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDDESTKR 705
>gi|392296002|gb|EIW07105.1| Tae2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1036
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 116/445 (26%), Positives = 199/445 (44%), Gaps = 112/445 (25%)
Query: 30 QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
+GY++ K+ +E+YI A++EF HP + G T S I E
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329
Query: 79 ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
E + +D+FFST ES K L+ QE A KK+++ + +++ ++ AL Q +++ K
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
LII N ++ LA++ I Q+ W IE +K Q+ + +A ++ L LN+ N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448
Query: 196 TLLLRVHKERSNIKD----------SWVLHHDNADPFADSSETKKPSL---------VDV 236
++ L + + N + +++ S E S+ V +
Sbjct: 449 SVKLDLSSKELNTSSDEDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTI 508
Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNIN 292
DL LSAYANA +F++K+++A+KQ+K ++ KA+K+ E K K+ LKD ++ +
Sbjct: 509 DLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV--LK 566
Query: 293 KARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WVKADQ 333
K R Y+FEK+ WFISSE +L + D ++N++ W+K +
Sbjct: 567 KIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWIKNPE 626
Query: 334 VSKTAPT---------------------------------------GEFLTTGSFMIRGK 354
++ P L G+F ++ +
Sbjct: 627 KTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNE 686
Query: 355 --KNFFPPCQLAMGISFLFKLEESS 377
+N PP QL MG FL+K++ S
Sbjct: 687 NDQNHLPPAQLVMGFGFLWKVKTSG 711
>gi|6325248|ref|NP_015316.1| Tae2p [Saccharomyces cerevisiae S288c]
gi|74676621|sp|Q12532.1|TAE2_YEAST RecName: Full=Translation-associated element 2
gi|683781|emb|CAA88377.1| unknown [Saccharomyces cerevisiae]
gi|965084|gb|AAB68096.1| Ypl009cp [Saccharomyces cerevisiae]
gi|1314067|emb|CAA95032.1| unknown [Saccharomyces cerevisiae]
gi|285815527|tpg|DAA11419.1| TPA: Tae2p [Saccharomyces cerevisiae S288c]
Length = 1038
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 117/449 (26%), Positives = 197/449 (43%), Gaps = 120/449 (26%)
Query: 30 QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
+GY++ K+ +E+YI A++EF HP + G T S I E
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329
Query: 79 ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
E + +D+FFST ES K L+ QE A KK+++ + +++ ++ AL Q +++ K
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
LII N ++ LA++ I Q+ W IE +K Q+ + +A ++ L LN+ N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSS--------------ETKKPSL-------- 233
++ L + + N DN + E S+
Sbjct: 449 SVKLDLSSKELNTSSD----EDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKI 504
Query: 234 -VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTM 288
V +DL LSAYANA +F++K+++A+KQ+K ++ KA+K+ E K K+ LKD ++
Sbjct: 505 NVTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV 564
Query: 289 TNINKARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WV 329
+ K R Y+FEK+ WFISSE +L + D ++N++ W+
Sbjct: 565 --LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWI 622
Query: 330 KADQVSKTAPT---------------------------------------GEFLTTGSFM 350
K + ++ P L G+F
Sbjct: 623 KNPEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFR 682
Query: 351 IRGK--KNFFPPCQLAMGISFLFKLEESS 377
++ + +N PP QL MG FL+K++ S
Sbjct: 683 LKNENDQNHLPPAQLVMGFGFLWKVKTSG 711
>gi|190407936|gb|EDV11201.1| hypothetical protein SCRG_02481 [Saccharomyces cerevisiae RM11-1a]
Length = 1030
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 117/449 (26%), Positives = 197/449 (43%), Gaps = 120/449 (26%)
Query: 30 QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
+GY++ K+ +E+YI A++EF HP + G T S I E
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329
Query: 79 ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
E + +D+FFST ES K L+ QE A KK+++ + +++ ++ AL Q +++ K
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
LII N ++ LA++ I Q+ W IE +K Q+ + +A ++ L LN+ N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSS--------------ETKKPSL-------- 233
++ L + + N DN + E S+
Sbjct: 449 SVKLDLSSKELNTSSD----EDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKI 504
Query: 234 -VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTM 288
V +DL LSAYANA +F++K+++A+KQ+K ++ KA+K+ E K K+ LKD ++
Sbjct: 505 NVTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV 564
Query: 289 TNINKARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WV 329
+ K R Y+FEK+ WFISSE +L + D ++N++ W+
Sbjct: 565 --LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWI 622
Query: 330 KADQVSKTAPT---------------------------------------GEFLTTGSFM 350
K + ++ P L G+F
Sbjct: 623 KNPEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFR 682
Query: 351 IRGK--KNFFPPCQLAMGISFLFKLEESS 377
++ + +N PP QL MG FL+K++ S
Sbjct: 683 LKNENDQNHLPPAQLVMGFGFLWKVKTSG 711
>gi|151942783|gb|EDN61129.1| conserved protein [Saccharomyces cerevisiae YJM789]
Length = 1040
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 117/449 (26%), Positives = 197/449 (43%), Gaps = 120/449 (26%)
Query: 30 QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
+GY++ K+ +E+YI A++EF HP + G T S I E
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329
Query: 79 ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
E + +D+FFST ES K L+ QE A KK+++ + +++ ++ AL Q +++ K
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
LII N ++ LA++ I Q+ W IE +K Q+ + +A ++ L LN+ N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSS--------------ETKKPSL-------- 233
++ L + + N DN + E S+
Sbjct: 449 SVKLDLSSKELNTSSD----EDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKI 504
Query: 234 -VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTM 288
V +DL LSAYANA +F++K+++A+KQ+K ++ KA+K+ E K K+ LKD ++
Sbjct: 505 NVTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV 564
Query: 289 TNINKARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WV 329
+ K R Y+FEK+ WFISSE +L + D ++N++ W+
Sbjct: 565 --LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWI 622
Query: 330 KADQVSKTAPT---------------------------------------GEFLTTGSFM 350
K + ++ P L G+F
Sbjct: 623 KNPEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFR 682
Query: 351 IRGK--KNFFPPCQLAMGISFLFKLEESS 377
++ + +N PP QL MG FL+K++ S
Sbjct: 683 LKNENDQNHLPPAQLVMGFGFLWKVKTSG 711
>gi|412992730|emb|CCO18710.1| predicted protein [Bathycoccus prasinos]
Length = 1191
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 130/253 (51%), Gaps = 21/253 (8%)
Query: 68 RQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALE 127
R G +E F +DE+++ E + ++ + E ALKK E +K+D E RL LE
Sbjct: 378 RDSSGEEKEEQNVFNDVLDEYYAKLELQQGVVQRNRAEAQALKKFEKLKKDQENRLKVLE 437
Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
+ + ++AELI N + V + A+ +A+ ++W+D+E+ + + +R +PVA I
Sbjct: 438 RDAEKEMQRAELISRNVDDVANVLNAVNDALASGMAWDDLESMIHEEKRRGNPVAKFISS 497
Query: 188 LKLNINHITLLLR----VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
L L N +T+ L V + + ++ V H KP + +DL LSA+
Sbjct: 498 LDLKNNQVTIKLAPKEDVEIDETTGEEVVVKKH-------------KPIDIPLDLSLSAF 544
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK---QTLKDVQTMTNINKARKVYWF 300
AN FDLK+ K KT + ++ + AEKK +TL++ + + AR +WF
Sbjct: 545 ANCNARFDLKKKHDTKLDKTAKHHDRVVSDAEKKALMKIETLRE-KALPGAKAARTPFWF 603
Query: 301 EKFYWFISSENYL 313
EKF W+++SEN L
Sbjct: 604 EKFIWWVTSENCL 616
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 30/61 (49%), Gaps = 6/61 (9%)
Query: 317 AWDAKVVTNAWWVKADQVSKT----APTGEFLTTGSF-MIRG-KKNFFPPCQLAMGISFL 370
W K +AWW +A +SK L +GSF +++G +K F PP + MG L
Sbjct: 697 CWAKKRSASAWWARARDISKRRWDLPEDASLLPSGSFALVKGARKKFLPPAKPEMGFGLL 756
Query: 371 F 371
F
Sbjct: 757 F 757
>gi|238879662|gb|EEQ43300.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1018
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 119/459 (25%), Positives = 190/459 (41%), Gaps = 107/459 (23%)
Query: 19 HVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
++ + N ++++GY+V KK + + + +YI EFHP +P P+ +G
Sbjct: 257 YIDLINGAIATEGYIVAKKNNKVSEKSDLEYIYD--EFHP--FEP-YKPN----QEGIKF 307
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
+ +D+FFS ES K +K QQ+ +A K+LE + + + ++ +L Q ++ +
Sbjct: 308 ISVSGYNKTLDKFFSNIESTKFSIKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAK 367
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
K ELI + E V+ ++ I Q+ W +IE + Q+ + +A I QL LN+ N
Sbjct: 368 KGELIQYHSELVEECRSYVQSFIDQQMDWTNIETVISLEQKKKNELAQHI-QLPLNLKEN 426
Query: 194 HITLLLRVHKERSNIKDS-------------------WVLHHDNADPFADSSETKKPSLV 234
I +LL + +S DN + T++ +
Sbjct: 427 KIKVLLEDFDDYEESTESASATETGSETETESESESSSESESDNDEDKIPVKRTQRKTNT 486
Query: 235 D-----------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
+DL SA+ANA+ +FD K++A KQ K ST ALK+AE+K Q L
Sbjct: 487 KEKPKRKTIPTWIDLSQSAFANARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLT 546
Query: 284 DVQTMTN--INKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------VKAD 332
N + + R YWFEKF+WF+SSE YL DA ++ V AD
Sbjct: 547 RSLKQENDTLKEIRPKYWFEKFFWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSAD 606
Query: 333 QVS------KTAPTGEFL-----------------------TTGSFMIRG---------- 353
K GE L TT ++++ G
Sbjct: 607 MEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDG 666
Query: 354 ------------KKNFFPPCQLAMGISFLFKLEESSISR 380
+K + PP QL MG F L++ S R
Sbjct: 667 SIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDDESTKR 705
>gi|297736763|emb|CBI25964.3| unnamed protein product [Vitis vinifera]
Length = 403
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 114/206 (55%), Gaps = 16/206 (7%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ Q + +FE+F AA DEF+S E + + + +E A++KL + D E R
Sbjct: 198 CPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSEQQQKAKEVTAMQKLSKICMDQENR 257
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ L++ + AELI N E VDAAILA+R +AN ++WED+ VK+ ++ +PVA
Sbjct: 258 VHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEKKKSGNPVA 317
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I +L L N +TLLL +N D D +T V+VDL LSA
Sbjct: 318 GLIDKLYLERNCMTLLL----------------SNNLDEMDDDEKTLHVDKVEVDLALSA 361
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTE 268
+ANA+R+++ K+ K++KTI + E
Sbjct: 362 HANARRWYEQKKRQENKREKTIIAHE 387
>gi|367000852|ref|XP_003685161.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
gi|357523459|emb|CCE62727.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
Length = 1016
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 121/453 (26%), Positives = 193/453 (42%), Gaps = 121/453 (26%)
Query: 26 NVSSQGYLVQKK------EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
N S+GY++ K+ EK +YI FHP P P+ + + ++
Sbjct: 267 NRYSKGYILAKRNPKFIEEKDSKDTEYIYET--FHP--FAPYVDPNEIDISKVIEVEG-- 320
Query: 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
+ +D FFST ES K L+ QE A KKL++ ++ T+++AL Q +++EK L
Sbjct: 321 PYNNTLDLFFSTIESSKYALRIQNQEFLAKKKLDDAVNENLTKINALRDIQSINEEKGVL 380
Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITL 197
II + ++ A++ I Q+ W IE ++ Q+ + +A +I L LN+ N I +
Sbjct: 381 IIEKADLIEEVKGAVQSLIDQQMDWNAIENIIRNEQKKRNNIARLI-MLPLNLKENKINI 439
Query: 198 LLRVHK--------------------------------ERSNIKDSWVLHHDNADPFADS 225
+L E++ IK + N
Sbjct: 440 ILPAEDNNSDDSDNSSSSSDSDSEYSDNSDSDSSDDDIEKNRIKRKNRKNSKNV------ 493
Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK----QT 281
K + + +DL LSA+ANA +F+ K+++A+KQ+K ++ EKALK+ E++ K +
Sbjct: 494 --KIKGTQITIDLALSAFANASEYFNKKKTSAEKQKKVEKNAEKALKNIEERIKVQLNKK 551
Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYL---------------------------- 313
LKD + + K R Y+FE+F WF SSE +L
Sbjct: 552 LKDSHDI--LKKIRAPYFFERFNWFFSSEGFLILMGKSPLDTDQIYSKYIEDDDIYMSNS 609
Query: 314 -GNVAW---------------DAKVV----TNAW---------WVKADQVSKTAPTGE-F 343
G W A V+ + AW W A VSK + G+
Sbjct: 610 FGTQVWIKNPEKTEIPPNTLMQAGVLCMSASEAWSKKIASSPWWCFAKNVSKFSSDGKSV 669
Query: 344 LTTGSFMIRGKK--NFFPPCQLAMGISFLFKLE 374
L G F ++ K NF PP QL MG FL+K++
Sbjct: 670 LEPGLFRMKNDKQQNFLPPAQLVMGFGFLWKVK 702
>gi|440301763|gb|ELP94149.1| zinc knuckle domain containing protein, partial [Entamoeba invadens
IP1]
Length = 703
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 162/371 (43%), Gaps = 104/371 (28%)
Query: 71 QGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQ 130
+G F+SF A+DEF S E + + + +++ KK++ V HE R L
Sbjct: 1 KGRLFDTFDSFCDAMDEFHSHIEKQEYEEELEKKDATMKKKIQAVIDGHEKRYKGL---- 56
Query: 131 LVDKEKAELIINNQESVDAAILAIRQDI-------ANQLSWEDIEARVKQAQRHNDP--V 181
EKAE ++ + V++ I+ + Q I + ++ WE +E + Q+ + NDP V
Sbjct: 57 ---LEKAEEMVVKAKVVESHIIIVDQLIKEINVFLSEKMQWERVE-EIIQSAKENDPTSV 112
Query: 182 ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
A IK+ D VL +NA+ + VDV L +
Sbjct: 113 AQYIKKFDF-----------------ANDVVVLSLENAN--------NQKIDVDVLLTKN 147
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQT-MTNINKAR----- 295
+ N + F++++R K KT++S E A++ A +K ++ K Q + ++ K R
Sbjct: 148 GFENVRNFYEMRRVVLAKADKTLESRETAIQQATQKQERVAKTKQIDLADLKKMRRRFWF 207
Query: 296 -KVYWF---EKFYWFISSENYLGN------------------------------------ 315
K +WF E F IS ++ L N
Sbjct: 208 EKFHWFISSENFV-IISGKDALQNDVMYRRYMKNTDIYVHADIHGAASCLIKGVKGKVIG 266
Query: 316 ---------------VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
AW K+VT+A+WV +DQVSKTAP+GE+L TGSFMIRGKKN+ PP
Sbjct: 267 AATLEQAGKVAVCRSSAWTNKIVTSAYWVYSDQVSKTAPSGEYLVTGSFMIRGKKNYLPP 326
Query: 361 CQLAMGISFLF 371
L G+ +F
Sbjct: 327 APLVFGLGIVF 337
>gi|145509741|ref|XP_001440809.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408037|emb|CAK73412.1| unnamed protein product [Paramecium tetraurelia]
Length = 1071
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 132/245 (53%), Gaps = 25/245 (10%)
Query: 70 HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
+QG + E +F AV ++F + + + +++ E A KK EN+K+D +R+ L++
Sbjct: 274 YQGKYVVETPTFNQAVHQYFLVVDRQEENKQSI--EDIAWKKFENIKQDQMSRIQKLQEE 331
Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLK 189
Q KA LI N V A I I++ I N + W+ I+ + +++ +P++++I +
Sbjct: 332 QDEYIMKAGLIQENINDVQAIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMN 391
Query: 190 LNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
L N +T+LL + D ++D + +++D+ SAY NA+++
Sbjct: 392 LKQNKVTILL----------------GNKDDEYSDLIQ------IEIDITQSAYQNARKY 429
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKT-KQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
++ K+ K+ KT ++ E+ALK AEK K+ ++ + + RK YWFEKF+WFIS
Sbjct: 430 YESKKKNRDKEIKTKEAVEQALKQAEKTALKEIEREKNKIQKVQNQRKKYWFEKFFWFIS 489
Query: 309 SENYL 313
S+ YL
Sbjct: 490 SDGYL 494
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 37/64 (57%), Positives = 48/64 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
+WDAK+V +AWWV A QVSK+APTG + GSFMI GKKNF P +L MG + L++L++
Sbjct: 557 SWDAKIVVSAWWVHASQVSKSAPTGMNIPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQD 616
Query: 377 SISR 380
SI R
Sbjct: 617 SIKR 620
>gi|255083452|ref|XP_002504712.1| predicted protein [Micromonas sp. RCC299]
gi|226519980|gb|ACO65970.1| predicted protein [Micromonas sp. RCC299]
Length = 1219
Score = 111 bits (277), Expect = 7e-22, Method: Composition-based stats.
Identities = 84/254 (33%), Positives = 134/254 (52%), Gaps = 14/254 (5%)
Query: 81 FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
F AA+D FF++ E+ + + + A KKLE V+RD E R+ ALE+ + + A LI
Sbjct: 344 FDAALDLFFASFEARRDRSRREKSANAAAKKLEKVRRDQEARVRALEKERESQELAATLI 403
Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
N VDA + A+ +A ++W+D+ +K+ R +PVA ++K L L N +T+ L+
Sbjct: 404 EYNLTQVDAVLAAVNGALAGGMAWDDLTLMIKEEARAGNPVARLVKTLDLPKNKVTVTLK 463
Query: 201 VH-----------KERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
H + + D+ + +A P + + V++DL L A+ANA+
Sbjct: 464 NHLDVDDDEGDDDGDDGDGGDADDVGEGDAKPRSRRLKRDGGVSVELDLALGAHANAREH 523
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT---LKDVQTMTNINKARKVYWFEKFYWF 306
FD K+ K KT+ ++A+ +AEKK K+ + T I +AR WFEKF+WF
Sbjct: 524 FDRKKKHDAKHGKTLAQNKRAVAAAEKKAKEAGARMASKGTGMGIARARVPEWFEKFHWF 583
Query: 307 ISSENYLGNVAWDA 320
I++EN L A DA
Sbjct: 584 ITTENCLVLSARDA 597
Score = 47.8 bits (112), Expect = 0.009, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 32/61 (52%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AWD++ V +A+W+ + V K P G+ L G G K + PP L MG +F L
Sbjct: 682 SAAWDSRHVVSAFWIPPENVRKVTPDGDPLAPGVVWHVGAKTYLPPAPLVMGFGCVFLLR 741
Query: 375 E 375
+
Sbjct: 742 D 742
>gi|193787557|dbj|BAG52763.1| unnamed protein product [Homo sapiens]
Length = 481
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 47/66 (71%), Positives = 55/66 (83%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AWDA+V+T+AWWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++
Sbjct: 6 SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 65
Query: 375 ESSISR 380
ES + R
Sbjct: 66 ESCVWR 71
>gi|34364937|emb|CAE45889.1| hypothetical protein [Homo sapiens]
Length = 505
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 47/66 (71%), Positives = 55/66 (83%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AWDA+V+T+AWWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++
Sbjct: 30 SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 89
Query: 375 ESSISR 380
ES + R
Sbjct: 90 ESCVWR 95
>gi|349602918|gb|AEP98908.1| Serologically defined colon cancer antigen 1-like protein, partial
[Equus caballus]
Length = 517
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 47/66 (71%), Positives = 55/66 (83%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AWDA+V+T+AWWV QVSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++
Sbjct: 41 SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 100
Query: 375 ESSISR 380
ES + R
Sbjct: 101 ESCVWR 106
>gi|167395586|ref|XP_001741648.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165893772|gb|EDR21907.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 960
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 159/360 (44%), Gaps = 87/360 (24%)
Query: 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
EFESF A+DEF S E + + + ++E KK++ V H+ R L K +
Sbjct: 290 EFESFEKAMDEFHSHIEKQEYEAEVEKKEMIMKKKVQAVIDGHQKRYQGLLDKAETLKNE 349
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP--VASIIKQLKLNINH 194
AE + N + VD I I + ++ WE IE +++ + NDP +A IK+
Sbjct: 350 AEAVEENIQVVDQLIQEINVFLKEKMKWEQIEGIIEEL-KENDPTSIAKYIKRFDFKNEV 408
Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
+ L LR H N D D V++ L+ + + N + F+++++
Sbjct: 409 VVLELR---------------HTNEDKIID---------VEIALNKNGFENVRNFYEMRK 444
Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFEKFYWFISSENYL 313
+ K +KTI+S + A+K AE K ++ K+ + T+ ++ K RK +WFEKF+WF+SSEN++
Sbjct: 445 NILAKAEKTIESKDLAIKQAENKQERVAKEKKITLVDVKKMRKRFWFEKFHWFLSSENFI 504
Query: 314 ---GNVAWDAKVVTNAW------WVKAD-----------------------QVSKTAP-- 339
G A ++ + +V AD Q K A
Sbjct: 505 IISGKDALQNDIIYRRYMKNTDIYVHADIHGAASCIIKGIPGKTIGAPTLEQAGKIAVCR 564
Query: 340 ----TGEFLT---------------TGSFM------IRGKKNFFPPCQLAMGISFLFKLE 374
T + +T +G ++ IRGKKN+ PP L GI +F +E
Sbjct: 565 SSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKNYLPPVPLVFGIGIMFVVE 624
>gi|313215449|emb|CBY16187.1| unnamed protein product [Oikopleura dioica]
Length = 404
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 43/66 (65%), Positives = 58/66 (87%)
Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
+ AW+AKV+T+AWWV A+QVSKTAP+GE+L+TGSFMIRGKKN+ PP QL +G FLFKL+
Sbjct: 36 SAAWNAKVLTSAWWVHANQVSKTAPSGEYLSTGSFMIRGKKNYLPPSQLVLGFGFLFKLD 95
Query: 375 ESSISR 380
++ ++R
Sbjct: 96 DACVAR 101
>gi|145494650|ref|XP_001433319.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400436|emb|CAK65922.1| unnamed protein product [Paramecium tetraurelia]
Length = 1070
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 131/245 (53%), Gaps = 25/245 (10%)
Query: 70 HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
+QG + E +F AV ++F + + + +++ E A KK EN+K+D +R+ L+
Sbjct: 274 YQGKYVIETPTFNEAVHQYFLVVDRQEDNKQSI--EDIAWKKFENIKQDQMSRIQKLQSE 331
Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLK 189
Q KA LI N V A I I++ I N + W+ I+ + +++ +P++++I +
Sbjct: 332 QDEYIMKAGLIQENINDVQAIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMN 391
Query: 190 LNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
L N +T+LL + D ++D + +++D+ SA+ NA+++
Sbjct: 392 LKQNKVTILL----------------GNKEDEYSDLIQ------IEIDITQSAHQNARKY 429
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKT-KQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
++ K+ K+ KT ++ E+ALK AEK K+ ++ + + RK YWFEKF+WFIS
Sbjct: 430 YESKKKNRDKEIKTKEAVEQALKQAEKTALKEIEREKNKIQKVQNQRKKYWFEKFFWFIS 489
Query: 309 SENYL 313
S+ YL
Sbjct: 490 SDGYL 494
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 37/64 (57%), Positives = 48/64 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
+WDAK+V +AWWV A QVSK+APTG + GSFMI GKKNF P +L MG + L++L++
Sbjct: 557 SWDAKIVVSAWWVHASQVSKSAPTGMNIPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQD 616
Query: 377 SISR 380
SI R
Sbjct: 617 SIKR 620
>gi|303290793|ref|XP_003064683.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453709|gb|EEH51017.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 807
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 153/383 (39%), Gaps = 111/383 (28%)
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
+Q +ESF A+D +F++ E+ + + E+ + +LE V++D R +ALE+ + D+
Sbjct: 313 VQAYESFDDALDAYFASFETQAATRQRERAEKAVVDRLEKVRKDQSQRAAALEREREADE 372
Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
+A LI N E VD A+ A+ +A + W D+E +++ R +PVA IK L L N
Sbjct: 373 LRATLIEYNLERVDVALAAVNNALAGGMGWGDLEIMIREETRAGNPVAGTIKSLDLANNK 432
Query: 195 ITLLLRVH------------KERSNIKDSWVLHHDNADPFADSSETKKPSL--------- 233
IT+ L H ++ + SSE K+
Sbjct: 433 ITVTLANHLDDDEDDEDEEEEDGEDEDKDGDEDDAGEGDDEKSSERKRKQQQKKLRRKRR 492
Query: 234 --VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
V V+LD LSAYANA+ F+ K+ A K KT+ TE+A
Sbjct: 493 KAVAVELDLSLSAYANARTHFEKKKKHATKHDKTLAQTERA------------------- 533
Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
KF+WF+++EN L A DA
Sbjct: 534 ------------KFWWFVTTENCLVVSARDAAQTDAMLKKYAPPGSSVVVGGGGGGGGAG 581
Query: 322 ------------------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGK 354
+NAW +VK +Q+ K P GE L G GK
Sbjct: 582 WCNGVPPASLAQAGAACLCRSNAWDSRQVISAWYVKPEQIRKETPEGEPLLNGVVWTVGK 641
Query: 355 KNFFPPCQLAMGISFLFKLEESS 377
K F PP L MG +++F L + +
Sbjct: 642 KTFLPPAPLVMGFAYMFVLGDDA 664
>gi|430813962|emb|CCJ28739.1| unnamed protein product [Pneumocystis jirovecii]
Length = 631
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 142/291 (48%), Gaps = 30/291 (10%)
Query: 28 SSQGYLVQKKEK--RPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
S GY+V+K E R N D + + P+ + P + +Q+ P +F
Sbjct: 277 SVTGYIVEKIESKTRINLNDITLESTP-DPVKIYVDFNPFIPKQYSNNPNYSVITFDDGY 335
Query: 86 DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
++ S K D+K Q+ A ++L+ K +H+ ++ L++ Q + +KA+ I NQE
Sbjct: 336 NK------SQKFDMKLKNQKDIAYRRLQITKEEHQKKIDDLQKFQNICIKKAKAIEENQE 389
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
VD I A+ + + WEDI VK + + +I +L H+
Sbjct: 390 IVDETIKAVNTCVLRSMDWEDIAKLVKTEKEYESNTITI----QLPCPHL---------- 435
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
D + +D+ + K +L +D+ L L+A+ NA+ +++ K++A+ K++KTI
Sbjct: 436 ----DDNIYENDSTTGLFNGQNDKTETLNIDIKLSLNAWTNARDYYEKKKAASVKEEKTI 491
Query: 265 QSTEKALKSAEKKTKQTLK--DVQTMTNINKARKVYWFEKFYWFISSENYL 313
++ KALK+AE+K LK Q + R + WFEKF WFISS+ YL
Sbjct: 492 AASSKALKNAERKINSDLKRNTAQEKKKLVPMRNLQWFEKFLWFISSDGYL 542
>gi|358254228|dbj|GAA54239.1| nuclear export mediator factor Nemf, partial [Clonorchis sinensis]
Length = 527
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 41/62 (66%), Positives = 52/62 (83%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+A+VVT+AWWV+ DQVSKTAP+GE+LTTG+FMIRG+KN+ PPC G LFKL+E
Sbjct: 123 AWNARVVTSAWWVRQDQVSKTAPSGEYLTTGAFMIRGRKNYLPPCHFMYGFGVLFKLDEE 182
Query: 377 SI 378
S+
Sbjct: 183 SV 184
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/44 (65%), Positives = 34/44 (77%)
Query: 270 ALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
ALKSA K +QT KD++T+ I K RK WFEKF+WFISSENYL
Sbjct: 1 ALKSAAFKAQQTRKDLRTVAQITKIRKPMWFEKFFWFISSENYL 44
>gi|403216659|emb|CCK71155.1| hypothetical protein KNAG_0G00970 [Kazachstania naganishii CBS
8797]
Length = 1006
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 150/321 (46%), Gaps = 49/321 (15%)
Query: 31 GYLVQKKEKRPNSE--------DYIVANME-FHPMILQPGSTPSVMRQHQGFPIQEFESF 81
GY+V KK PN E +Y N F P I + + + + +G ++
Sbjct: 270 GYIVAKK--NPNFEPSRDQEDLEYFFENFHPFKPFIKECERSILHVLKIEG-------TY 320
Query: 82 AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
VD+FFST ES K +K QE A KKLE + ++ R+ AL Q +++K LII
Sbjct: 321 NRTVDKFFSTLESSKYAMKIQNQETLAGKKLEEARSENGKRIQALIDVQSQNEQKGHLII 380
Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLR 200
+ E V+ A A++ + QL W IE + Q+ + +A IK LKL N I L L
Sbjct: 381 THAELVEDAKGAVQGLLDQQLDWNIIEKLIITEQKKGNKIAKAIKLPLKLKKNTIVLELP 440
Query: 201 VHKERSNIKDSWV------------------------LHHDNADPFADSSETKKPSLVDV 236
+ D+ + H + ++ KP+ V V
Sbjct: 441 LEDNNDTEDDTELSEEVDSSDISSSELSSDEESDQGSTQHQHRK--SNRIRALKPTTVSV 498
Query: 237 D--LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL--KDVQTMTNIN 292
D LDLS YANA +F +K+ +KQ+K Q+ +KA+K+ E K + L K ++ +
Sbjct: 499 DIKLDLSTYANASEYFMVKKHTVEKQKKVEQNLDKAMKNIETKVNKQLNSKLKESHKVLK 558
Query: 293 KARKVYWFEKFYWFISSENYL 313
+ R Y+FEK+ WFISSE +L
Sbjct: 559 RLRTPYFFEKYNWFISSEGHL 579
>gi|378754807|gb|EHY64836.1| hypothetical protein NERG_02239 [Nematocida sp. 1 ERTm2]
Length = 697
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 152/369 (41%), Gaps = 113/369 (30%)
Query: 72 GFPIQEFESFAAAVDEFFSTAE-----SHKIDLKAVQ-QERDALKKLENVKRDHETRLSA 125
G IQEFE F AA+D F+ E S K K + QERD KK+E
Sbjct: 263 GQKIQEFEGFGAAMDAVFNVQEITETASQKKQRKIREAQERDLHKKIE------------ 310
Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII 185
E T L K+KAEL+ NQ V I I A LS ++ E R ++ ++ +P A II
Sbjct: 311 -EMTIL--KDKAELLSENQAEVKNVISVIEAANAASLSEKEFE-RFRETEKDTNPTAQII 366
Query: 186 KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
K K+N + T+ L + K+ V +D S +
Sbjct: 367 K--KVNFGNKTVDLTLDKK----------------------------AVSIDYTKSIFEQ 396
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
+ AKK ++ ++ T KAL ++ K + V+ + I R +WFEKF W
Sbjct: 397 INMLY----QKAKKIEEKLKKTRKALDESKHKEVEIASKVEKIEKIE--RNPFWFEKFRW 450
Query: 306 FISSENYLGNVAWDAKV------------------------------------------- 322
FI+ ++ L D+K
Sbjct: 451 FITKDSDLIIAGRDSKQNEILVKKYLLDTDYYFHADIRGGSSVIVGEHATDHTKEIAASM 510
Query: 323 ---VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
++ AW V+ DQVSKTAP GE+LT GSFMI GKK F+ P +L G S +
Sbjct: 511 AMHLSKAWENNLITEVYCVRGDQVSKTAPAGEYLTHGSFMITGKKEFYHPTRLEYGFSLI 570
Query: 371 FKLEESSIS 379
+K+E+ I+
Sbjct: 571 YKIEDEEIT 579
>gi|407846065|gb|EKG02413.1| hypothetical protein TCSYLVIO_006562 [Trypanosoma cruzi]
Length = 1080
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 75/249 (30%), Positives = 126/249 (50%), Gaps = 17/249 (6%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP ++ Q+ G +SF + D FF E+ KI+ ++ + K +RDH+
Sbjct: 323 TPVLLAQYSSDGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNKFERDHQ 382
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
RL+ALE + ++ K E II N +D AI I +A + W+ + + +K+ P
Sbjct: 383 RRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLLKRRHAEGHP 442
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VA ++ L L N I++L+ +++ D + D + P +++V+L
Sbjct: 443 VAYMVHDLFLERNSISVLVESNEQ-----------EDEGEEDCDVT----PMVIEVELSK 487
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+AYANA +F +S K +KT+ +T KAL AEKK ++ +T I K R+ +W+
Sbjct: 488 TAYANATTYFSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWW 547
Query: 301 EKFYWFISS 309
EKF WF +S
Sbjct: 548 EKFSWFRTS 556
>gi|343472755|emb|CCD15168.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 559
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 137/281 (48%), Gaps = 18/281 (6%)
Query: 31 GYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS 90
GYL+ K KR E +F P++L +V QH F +F++ DEFFS
Sbjct: 254 GYLL-KSSKRGGQEANDSRYDDFSPVLLDQYKKDAVAYQH-------FPNFSSVCDEFFS 305
Query: 91 TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAA 150
+E +I+ + + + K E +R+H R+ L++++ K LI N E++D
Sbjct: 306 YSEKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRKGHLIFQNTETIDKI 365
Query: 151 ILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKD 210
I I + + ++ W+D + +KQ + P+AS+IK++ + +L+ N
Sbjct: 366 IGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVVVLM-------NEDA 418
Query: 211 SWVLHHDNADPFADSSETKKPSLVDVDLDL--SAYANAKRFFDLKRSAAKKQQKTIQSTE 268
D + E + + ++++DL +A+ NA+ +F +S A K ++TI +TE
Sbjct: 419 DDDDDEQTEDEEGEKREDRDRATYEIEIDLTKTAHTNAEEYFARAKSTAAKLKRTIAATE 478
Query: 269 KALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
KA+ AE+K + Q I + R +W+EKF WF +S
Sbjct: 479 KAMAGAERKGRTVTGKTQEKKIITE-RCRFWWEKFNWFRTS 518
>gi|71413048|ref|XP_808681.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70872935|gb|EAN86830.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1082
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 129/258 (50%), Gaps = 22/258 (8%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+F P++L S+ V+ +SF + D FF E+ KI+ ++ + K
Sbjct: 321 DFSPVLLAQYSSDGVVTSF-------LKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISK 373
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
+RDH+ RL+ALE + ++ K E II N +D AI I +A + W+ + + +
Sbjct: 374 RNKFERDHQRRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLL 433
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
K+ PVA ++ L L N I++L+ +++ D + D + P
Sbjct: 434 KRRHAEGHPVAYMVHDLFLERNSISVLVESNEQ-----------EDEGEEDCDVT----P 478
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
+++V+L +AYANA +F +S K +KT+ +T KAL AEKK ++ +T I
Sbjct: 479 MVIEVELSKTAYANATTYFAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAI 538
Query: 292 NKARKVYWFEKFYWFISS 309
K R+ +W+EKF WF +S
Sbjct: 539 VKERRRFWWEKFSWFRTS 556
>gi|74025594|ref|XP_829363.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834749|gb|EAN80251.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 1100
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/263 (31%), Positives = 131/263 (49%), Gaps = 16/263 (6%)
Query: 63 TPSVMRQHQGFPIQE--FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
+P ++ Q++G + SF + D FF E KI+ + L K E +RDH
Sbjct: 317 SPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVLSKKEKFERDHN 376
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R++ALE+++ + K ELII N E +D AI I +A + WE + +KQ P
Sbjct: 377 RRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRRLLKQRHAEGHP 436
Query: 181 VASIIKQLKLNINHITLLLRVHKE------------RSNIKDSWVLHHDNADPFADSSE- 227
VA ++ +L L+ N I++L+ + E + K + N+ D E
Sbjct: 437 VAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGGNSGEKKDRVEG 496
Query: 228 -TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
++ PS+++VDL +AYANA +F K++ K +KTI +T KA AEKK ++ Q
Sbjct: 497 CSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAEKKGERLAAKKQ 556
Query: 287 TMTNINKARKVYWFEKFYWFISS 309
T I R W+EKF WF +S
Sbjct: 557 TKKAIATERHRCWWEKFNWFRTS 579
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 8/63 (12%)
Query: 317 AWDAKVVTNAWWVKADQ-VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
AW++K AWWV A Q V TA G +++ G+KN+ P L +G LF++
Sbjct: 672 AWESKFSVGAWWVHASQIVGGTA-------AGCYLLSGEKNYLRPRPLMLGCGLLFRISS 724
Query: 376 SSI 378
+I
Sbjct: 725 RAI 727
>gi|261335340|emb|CBH18334.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 1100
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/263 (31%), Positives = 131/263 (49%), Gaps = 16/263 (6%)
Query: 63 TPSVMRQHQGFPIQE--FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
+P ++ Q++G + SF + D FF E KI+ + L K E +RDH
Sbjct: 317 SPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVLSKKEKFERDHN 376
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R++ALE+++ + K ELII N E +D AI I +A + WE + +KQ P
Sbjct: 377 RRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRRLLKQRHAEGHP 436
Query: 181 VASIIKQLKLNINHITLLLRVHKE------------RSNIKDSWVLHHDNADPFADSSE- 227
VA ++ +L L+ N I++L+ + E + K + N+ D E
Sbjct: 437 VAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGGNSGEKKDRVEG 496
Query: 228 -TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
++ PS+++VDL +AYANA +F K++ K +KTI +T KA AEKK ++ Q
Sbjct: 497 CSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAEKKGERLAAKKQ 556
Query: 287 TMTNINKARKVYWFEKFYWFISS 309
T I R W+EKF WF +S
Sbjct: 557 TKKAIATERHRCWWEKFNWFRTS 579
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 8/63 (12%)
Query: 317 AWDAKVVTNAWWVKADQ-VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
AW++K AWWV A Q V TA G +++ G+KN+ P L +G LF++
Sbjct: 672 AWESKFSVGAWWVHASQIVGGTA-------AGCYLLSGEKNYLRPRPLMLGCGLLFRISS 724
Query: 376 SSI 378
+I
Sbjct: 725 RAI 727
>gi|224014996|ref|XP_002297159.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
gi|220968134|gb|EED86484.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
Length = 968
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 147/315 (46%), Gaps = 40/315 (12%)
Query: 14 CGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGF 73
CG V+ + + S GY++ K ++ + DY +EF P +L QH+
Sbjct: 234 CGEGAKVIENLSSGESGGYILYKPKQTDDKNDYNKTLLEFQPHLLH---------QHKNQ 284
Query: 74 PIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVD 133
+ +FA A DEFFS S +I +A E A ++L ++ D + R+ L Q
Sbjct: 285 HALSYTTFATATDEFFSHLSSQRIAQRADAAEAAARERLSKIQLDQQRRVDGLVAEQEKS 344
Query: 134 KEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNIN 193
++ A L+ + E VD + I + + ++W+ +E V Q + +P+A +I +L+L +
Sbjct: 345 RDCARLVEMHAEDVDRVLGVINSALESGMNWDALEQLVLVEQGNENPIALLIFKLELCKD 404
Query: 194 HITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
+ L L + + K V V + SA+ NA+ F
Sbjct: 405 QVVLALPDIDDWDDSD---------------PDRPPKLHYVTVSIKESAHGNARNMF--- 446
Query: 254 RSAAKKQQKTIQSTEKALKSAEKKTKQTLKD-------VQTMTNINKARKVYWFEKFYWF 306
A KQ KT++++ ALK+AE K KQ L + +Q M N RK YWFEKF WF
Sbjct: 447 --ATIKQSKTLEASTTALKAAEAKAKQQLAEAQKKKQRIQVMPN----RKTYWFEKFAWF 500
Query: 307 ISSENYLGNVAWDAK 321
I+S+NYL DA+
Sbjct: 501 ITSDNYLVVAGQDAQ 515
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 39/65 (60%), Positives = 54/65 (83%), Gaps = 1/65 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
AW +K+V +A+WV++ QVSKTAPTGE+LT GSFMIRG+KNF PP L MG+ LF+L ++
Sbjct: 579 AWSSKMVCSAYWVESHQVSKTAPTGEYLTVGSFMIRGRKNFLPPSSLEMGMGVLFRLGDD 638
Query: 376 SSISR 380
+S++R
Sbjct: 639 ASVAR 643
>gi|71411706|ref|XP_808091.1| hypothetical protein Tc00.1047053507483.60 [Trypanosoma cruzi
strain CL Brener]
gi|70872222|gb|EAN86240.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1081
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 129/258 (50%), Gaps = 22/258 (8%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+F P++L S+ V+ + +SF + D FF E+ KI+ ++ + K
Sbjct: 321 DFSPVLLAQYSSDGVV-------MSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISK 373
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
+RDH RL+ALE + ++ K E II N +D AI I +A + W+ + + +
Sbjct: 374 RNKFERDHLRRLNALEMEEQENQRKGECIIQNVVKIDEAIGLINGALAAGIQWDALRSLL 433
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
K+ PVA ++ L L N I++L+ +++ D + D + P
Sbjct: 434 KRRHAEGHPVAYMVHDLFLERNSISVLVESNEQ-----------EDEGEEDCDVT----P 478
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
+++V+L +AYANA +F +S K +KT+ +T KAL AEKK ++ +T I
Sbjct: 479 MVIEVELSKTAYANATTYFAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAI 538
Query: 292 NKARKVYWFEKFYWFISS 309
K R+ +W+EKF WF +S
Sbjct: 539 VKERRRFWWEKFSWFRTS 556
>gi|170582502|ref|XP_001896158.1| hypothetical protein [Brugia malayi]
gi|158596691|gb|EDP34993.1| conserved hypothetical protein [Brugia malayi]
Length = 643
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 40/64 (62%), Positives = 51/64 (79%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+AK+ ++AWWV QVS+TAPTGE+LT GSFMIRGKKN+ P CQL MG +F+L+E
Sbjct: 40 AWEAKITSSAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEE 99
Query: 377 SISR 380
S+ R
Sbjct: 100 SLER 103
>gi|407406699|gb|EKF30889.1| hypothetical protein MOQ_005283 [Trypanosoma cruzi marinkellei]
Length = 1098
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 74/258 (28%), Positives = 129/258 (50%), Gaps = 22/258 (8%)
Query: 52 EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+F P++L S+ V+ +SF + D FF E+ KI+ ++ + K
Sbjct: 333 DFSPVLLAQYSSEGVVTSF-------LKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISK 385
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
+RDH+ RL+ LE + ++ K E II + +D AI I +A + W+ + + +
Sbjct: 386 RNKFERDHQRRLNTLEMEEQENQRKGECIIQHAVKIDEAIGLINGALAAGIQWDALRSLL 445
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
K+ PVA ++ +L L N I++L+ +++ D + D + P
Sbjct: 446 KRRHAEGHPVAYMVHELFLERNSISVLVESNEQ-----------EDEGEEDCDVT----P 490
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
+++V+L +AYANA +F +S K +KT+ +T KAL AEKK ++ +T I
Sbjct: 491 MVIEVELSKTAYANATTYFSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAI 550
Query: 292 NKARKVYWFEKFYWFISS 309
K R+ +W+EKF WF +S
Sbjct: 551 VKERRRFWWEKFSWFRTS 568
>gi|422293271|gb|EKU20571.1| hypothetical protein NGA_2069500, partial [Nannochloropsis gaditana
CCMP526]
Length = 107
Score = 100 bits (249), Expect = 1e-18, Method: Composition-based stats.
Identities = 44/64 (68%), Positives = 50/64 (78%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW AK+VT+AWWV A QVSKTAP GEFL TGSFM+RGKKNF P L MG+ LFKL+E
Sbjct: 28 AWKAKMVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFLAPQPLEMGLGLLFKLDEG 87
Query: 377 SISR 380
S+ R
Sbjct: 88 SVGR 91
>gi|387595331|gb|EIJ92956.1| hypothetical protein NEPG_02355 [Nematocida parisii ERTm1]
Length = 700
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 151/368 (41%), Gaps = 113/368 (30%)
Query: 72 GFPIQEFESFAAAVDEFFSTAE-----SHKIDLKAVQ-QERDALKKLENVKRDHETRLSA 125
G I+EF+ F +A+D F+ E S K K + QERD KK++
Sbjct: 263 GQKIKEFDGFGSAMDAAFAVQEITETVSQKKHRKIREAQERDLHKKID------------ 310
Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII 185
E T L K KAEL+ NQ V I I A LS ++ E R K++++ +P A II
Sbjct: 311 -EMTIL--KTKAELLSENQAEVKNVISVIEAAHAASLSEKEFE-RFKESEKDKNPTAKII 366
Query: 186 KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
K K N T+ L + K+ LV +D S +
Sbjct: 367 K--KANFGKKTVDLIIDKQ----------------------------LVTIDYTASIFEQ 396
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF-- 303
+ AKK ++ ++ T AL+ + K + K ++ + I+ R V+WFEKF
Sbjct: 397 INALY----QKAKKIEEKLKKTRVALEESRTKEIEVTKRIEKIEKID--RNVFWFEKFRW 450
Query: 304 ------------------------------YWF----------ISSENYLGNV------- 316
Y+F I EN +
Sbjct: 451 LITKDSDLILAGRDSKQNEILVKKHLLDTDYYFHADVRGGSSVIVGENATVHTKEVAAAM 510
Query: 317 ------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW+ +T + V+ +QVSKTAP GE+LT GSFMI GKK F+ P +L G S +
Sbjct: 511 ALHLSKAWENSTITEVYCVRGEQVSKTAPAGEYLTHGSFMITGKKEFYHPTKLEYGFSIM 570
Query: 371 FKLEESSI 378
+KL++ I
Sbjct: 571 YKLKDKEI 578
>gi|387592702|gb|EIJ87726.1| hypothetical protein NEQG_02273 [Nematocida parisii ERTm3]
Length = 700
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 151/368 (41%), Gaps = 113/368 (30%)
Query: 72 GFPIQEFESFAAAVDEFFSTAE-----SHKIDLKAVQ-QERDALKKLENVKRDHETRLSA 125
G I+EF+ F +A+D F+ E S K K + QERD KK++
Sbjct: 263 GQKIKEFDGFGSAMDAAFAVQEITETASQKKHRKIREAQERDLHKKID------------ 310
Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII 185
E T L K KAEL+ NQ V I I A LS ++ E R K++++ +P A II
Sbjct: 311 -EMTIL--KTKAELLSENQAEVKNVISVIEAAHAASLSEKEFE-RFKESEKDKNPTAKII 366
Query: 186 KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
K K N T+ L + K+ LV +D S +
Sbjct: 367 K--KANFGKKTVDLIIDKQ----------------------------LVTIDYTASIFEQ 396
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF-- 303
+ AKK ++ ++ T AL+ + K + K ++ + I+ R V+WFEKF
Sbjct: 397 INALY----QKAKKIEEKLKKTRVALEESRTKEIEVTKRIEKIEKID--RNVFWFEKFRW 450
Query: 304 ------------------------------YWF----------ISSENYLGNV------- 316
Y+F I EN +
Sbjct: 451 LITKDSDLILAGRDSKQNEILVKKHLLDTDYYFHADVRGGSSVIVGENATVHTKEVAAAM 510
Query: 317 ------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW+ +T + V+ +QVSKTAP GE+LT GSFMI GKK F+ P +L G S +
Sbjct: 511 ALHLSKAWENSTITEVYCVRGEQVSKTAPAGEYLTHGSFMITGKKEFYHPTKLEYGFSIM 570
Query: 371 FKLEESSI 378
+KL++ I
Sbjct: 571 YKLKDKEI 578
>gi|221502557|gb|EEE28284.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 1859
Score = 99.0 bits (245), Expect = 3e-18, Method: Composition-based stats.
Identities = 73/249 (29%), Positives = 121/249 (48%), Gaps = 20/249 (8%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F VDE+FS+ + K + Q ++AL ++E +K D E R+ LE+ ++A
Sbjct: 454 FRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAANLLQQA 513
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
+ + N V+ I +R +A + W+++ ++K + P+A + +LKL L
Sbjct: 514 QAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLEKQRAML 573
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
LL + + E + LV VD+ LSA+ NA+ +
Sbjct: 574 LLEAPRREEA---------------EEPGEASETILVPVDVALSAHGNAQLLHSQVKQLK 618
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKD-----VQTMTNINKARKVYWFEKFYWFISSENY 312
K QKT +T AL +A++K ++TLK +Q + K RK +WFEKF+WFISS++Y
Sbjct: 619 AKTQKTSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 678
Query: 313 LGNVAWDAK 321
L DA+
Sbjct: 679 LVLAGRDAQ 687
Score = 71.6 bits (174), Expect = 5e-10, Method: Composition-based stats.
Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
AW K + AWWV QVSK+AP+G +L+TGSFMIRG++NF +L MG LF+L +E
Sbjct: 750 AWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLLFRLADE 809
Query: 376 SSISR 380
+S++R
Sbjct: 810 ASVAR 814
>gi|221482059|gb|EEE20420.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 1859
Score = 99.0 bits (245), Expect = 3e-18, Method: Composition-based stats.
Identities = 73/249 (29%), Positives = 121/249 (48%), Gaps = 20/249 (8%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F VDE+FS+ + K + Q ++AL ++E +K D E R+ LE+ ++A
Sbjct: 454 FRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAANLLQQA 513
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
+ + N V+ I +R +A + W+++ ++K + P+A + +LKL L
Sbjct: 514 QAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLEKQRAML 573
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
LL + + E + LV VD+ LSA+ NA+ +
Sbjct: 574 LLEAPRREEA---------------EEPGEASETILVPVDVALSAHGNAQLLHSQVKQLK 618
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKD-----VQTMTNINKARKVYWFEKFYWFISSENY 312
K QKT +T AL +A++K ++TLK +Q + K RK +WFEKF+WFISS++Y
Sbjct: 619 AKTQKTSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 678
Query: 313 LGNVAWDAK 321
L DA+
Sbjct: 679 LVLAGRDAQ 687
Score = 71.6 bits (174), Expect = 5e-10, Method: Composition-based stats.
Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
AW K + AWWV QVSK+AP+G +L+TGSFMIRG++NF +L MG LF+L +E
Sbjct: 750 AWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLLFRLADE 809
Query: 376 SSISR 380
+S++R
Sbjct: 810 ASVAR 814
>gi|159477991|ref|XP_001697091.1| hypothetical protein CHLREDRAFT_181058 [Chlamydomonas reinhardtii]
gi|158269999|gb|EDO96040.1| predicted protein [Chlamydomonas reinhardtii]
Length = 246
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 42/59 (71%), Positives = 48/59 (81%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
AWD+K+VT+AWWV QVSKTAP+GE+L TGSFMIRGKKNF PP L MG FLFK +E
Sbjct: 128 AWDSKIVTSAWWVHHHQVSKTAPSGEYLVTGSFMIRGKKNFLPPQPLVMGFGFLFKWDE 186
>gi|237842889|ref|XP_002370742.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
gi|211968406|gb|EEB03602.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
Length = 1859
Score = 99.0 bits (245), Expect = 4e-18, Method: Composition-based stats.
Identities = 73/249 (29%), Positives = 121/249 (48%), Gaps = 20/249 (8%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F VDE+FS+ + K + Q ++AL ++E +K D E R+ LE+ ++A
Sbjct: 454 FRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAANLLQQA 513
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
+ + N V+ I +R +A + W+++ ++K + P+A + +LKL L
Sbjct: 514 QAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLEKQRAML 573
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
LL + + E + LV VD+ LSA+ NA+ +
Sbjct: 574 LLEAPRREEA---------------EEPGEASETILVPVDVALSAHGNAQLLHSQVKQLK 618
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKD-----VQTMTNINKARKVYWFEKFYWFISSENY 312
K QKT +T AL +A++K ++TLK +Q + K RK +WFEKF+WFISS++Y
Sbjct: 619 AKTQKTSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 678
Query: 313 LGNVAWDAK 321
L DA+
Sbjct: 679 LVLAGRDAQ 687
Score = 71.6 bits (174), Expect = 6e-10, Method: Composition-based stats.
Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
AW K + AWWV QVSK+AP+G +L+TGSFMIRG++NF +L MG LF+L +E
Sbjct: 750 AWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLLFRLADE 809
Query: 376 SSISR 380
+S++R
Sbjct: 810 ASVAR 814
>gi|156083749|ref|XP_001609358.1| hypothetical protein [Babesia bovis T2Bo]
gi|154796609|gb|EDO05790.1| conserved hypothetical protein [Babesia bovis]
Length = 1006
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 160/366 (43%), Gaps = 66/366 (18%)
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
+ F+ F A+D +F+ +E KI+ K +Q + +K L+ +K D R E+
Sbjct: 315 ERFDDFNMALDAYFTKSELRKIERK--EQPKKPIK-LQKIKDDQNRRELEREREVHRLGV 371
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
L+ ++++ D + +R +A+ SW++I ++ + + +A I+ + + +
Sbjct: 372 SIALVEGHRDTFDTVLDLMRSLVASGASWQEITDQLSRQRDSGHLLARHIRSVNIPDRRV 431
Query: 196 TLLLRVHKER--SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
+ L +N+ + + SS+ + V +D L+ + N + + K
Sbjct: 432 DVCLPNDDPGYYTNVTSMGDKRNKRGSKKSQSSDQFDDTSVTLDYGLTCFQNLEIMYSQK 491
Query: 254 RSAAKKQQKTIQSTEKALKSAEK-KTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
+ A+K ++T + ALK ++ K KQ ++ K RK WFEKF+WFI+S+ +
Sbjct: 492 KRMAEKLERTRAGHQFALKRVDREKEKQVKSRGDRNVSLVKVRKRMWFEKFHWFITSDGF 551
Query: 313 L--------------------GNVAWDAKV------------------------------ 322
L G++ + A V
Sbjct: 552 LVLGGRDSTQNELLVKRYLTKGDLYFHADVHGAASCILKNPSGNAESFPNTIDEAACFSL 611
Query: 323 -VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
+++AW WV QVS++AP+GE+L GSFMIRGKKN+ P +L M I +F
Sbjct: 612 CLSSAWSQKMVVPAWWVHHHQVSRSAPSGEYLPHGSFMIRGKKNYVQPQRLEMAIGVVFH 671
Query: 373 LEESSI 378
+E I
Sbjct: 672 IEVPDI 677
>gi|297736765|emb|CBI25966.3| unnamed protein product [Vitis vinifera]
Length = 369
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/63 (66%), Positives = 48/63 (76%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTAPT E+LT GSFMIRGKKNF PP L MG L L+ES
Sbjct: 145 AWDSKIVTSAWWVYPHQVSKTAPTVEYLTVGSFMIRGKKNFLPPHPLMMGFGLLLCLDES 204
Query: 377 SIS 379
S+
Sbjct: 205 SLG 207
>gi|257215816|emb|CAX83060.1| Serologically defined colon cancer antigen 1 [Schistosoma
japonicum]
Length = 521
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 38/62 (61%), Positives = 47/62 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW + V+T AWWV DQVSKTAP+GE+LT+GSF+IRGKKN+ PPC G +FKL E
Sbjct: 115 AWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGIMFKLHED 174
Query: 377 SI 378
S+
Sbjct: 175 SV 176
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%)
Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK 321
+K +T+ I + RK WFEKF+WFISSENYL D++
Sbjct: 1 MKTAKTIAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQ 40
>gi|56753953|gb|AAW25169.1| SJCHGC08981 protein [Schistosoma japonicum]
Length = 414
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 38/62 (61%), Positives = 47/62 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW + V+T AWWV DQVSKTAP+GE+LT+GSF+IRGKKN+ PPC G +FKL E
Sbjct: 8 AWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGIMFKLHED 67
Query: 377 SI 378
S+
Sbjct: 68 SV 69
>gi|254583608|ref|XP_002497372.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
gi|238940265|emb|CAR28439.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
Length = 1024
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 92/202 (45%), Gaps = 62/202 (30%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNI 291
V +DL LSAYANA +F++K++ A+KQ+K ++ EKA K+ E+K + LK +T +
Sbjct: 519 VSIDLGLSAYANASYYFNIKKNNAEKQKKVEKNVEKAFKNIEEKVGRQLKQKLKETHNVL 578
Query: 292 NKARKVYWFEKFYWFISSEN-------------------------YLGNV---------- 316
K R Y+FEK +WFISSE YL N
Sbjct: 579 RKVRTPYFFEKHHWFISSEGFLVLMGKSDSETDLIYSKYIEDDDVYLFNTFGTQVWIKNP 638
Query: 317 ----------------------AWDAKVVTNAWWVKADQVSKTAPT-GEFLTTGSFMIRG 353
AW K+ ++ WW A +SK P+ L G F+++
Sbjct: 639 DSTEVPPNTLMQAGILCMSASEAWSKKISSSPWWCFAKNISKFEPSDNSVLPPGRFLLKN 698
Query: 354 K--KNFFPPCQLAMGISFLFKL 373
+ KNF PP QL MG FL+K+
Sbjct: 699 ENNKNFMPPAQLVMGFGFLWKV 720
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 87/175 (49%), Gaps = 9/175 (5%)
Query: 31 GYLVQKKEKRPNSE----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVD 86
GY++ K+ NSE D FHP +P P + + I+ + +D
Sbjct: 279 GYILAKRNVNFNSERDEKDLEFVYETFHP--FEPFVAPQNVGDTRTIKIEG--GYNKVLD 334
Query: 87 EFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQES 146
FFST ES K L+ QQE+ A K+LE + D++ ++ AL Q ++EK II N +
Sbjct: 335 SFFSTIESSKYALRIQQQEQQATKRLEAARLDNQKKIQALVDAQSFNEEKGHSIIANADL 394
Query: 147 VDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRV 201
V+ A++ + Q+ W IE ++ Q+ + +A +I QL LN+ + +R+
Sbjct: 395 VEQTKSAVQGYVDQQMDWSTIEKLIQVEQKRGNKIAQLI-QLPLNLQENKIAIRL 448
>gi|294875379|ref|XP_002767293.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239868856|gb|EER00011.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 1087
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/62 (66%), Positives = 50/62 (80%), Gaps = 1/62 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC-QLAMGISFLFKLEE 375
AWD K+VT+AWWV A QVSKTAPTGEFL+TGSFMIRGKKNF PP +L MG+ ++ + +
Sbjct: 659 AWDNKIVTSAWWVWAKQVSKTAPTGEFLSTGSFMIRGKKNFLPPTGRLEMGLGVMWTVTD 718
Query: 376 SS 377
S
Sbjct: 719 DS 720
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 100/205 (48%), Gaps = 17/205 (8%)
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
+ E+ SF VD++++ +++ + VQ++ + K+EN+K D R+ LE+ Q
Sbjct: 365 VVEYPSFTECVDDYYTRLMRAQLEGQLVQKQSQMISKVENIKSDQRRRMGELEKEQQSLW 424
Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
E+A + N DAAI + +A +L W+++ VKQ QR P+A I+QL L+ N
Sbjct: 425 EQAVALEANTTLADAAIQMVNALLAAKLRWDELTIAVKQQQRAGHPLAMHIRQLALDKNR 484
Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
I+++L + A D + V +DL +A AN + ++
Sbjct: 485 ISIVL-----------------EKAASTDDDDDGATTVEVWLDLGRTAQANVALLHEKRK 527
Query: 255 SAAKKQQKTIQSTEKALKSAEKKTK 279
+K KT + +A+K AEK+ K
Sbjct: 528 GMQEKMGKTEEQMARAVKMAEKRLK 552
>gi|297736751|emb|CBI25952.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/63 (66%), Positives = 48/63 (76%), Gaps = 1/63 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K+VT+AWWV QVSKTA TGE+LT GSFMIRG KNF PP L MG LF L+ES
Sbjct: 149 AWDSKIVTSAWWVYPHQVSKTASTGEYLTVGSFMIRG-KNFLPPHPLMMGFGLLFCLDES 207
Query: 377 SIS 379
S+
Sbjct: 208 SLG 210
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 76/141 (53%), Gaps = 12/141 (8%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V+VDL LSA+ANA+R+++ K+ KQ+KTI + EKA K+AEKK+ L V
Sbjct: 11 VEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQV-------G 63
Query: 294 ARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFM 350
++WFEKF WFISSENYL G A K++ + K D T + S
Sbjct: 64 EHYIHWFEKFNWFISSENYLVISGRDAQQNKMIVKRYMSKGDLFIHFKSTNN--NSSSTF 121
Query: 351 IRGKKNFFPPCQLAMGISFLF 371
+ +++ C++ + S LF
Sbjct: 122 LFFQRHLNTCCRIPLNYSSLF 142
>gi|397618049|gb|EJK64734.1| hypothetical protein THAOC_14501 [Thalassiosira oceanica]
Length = 1217
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 39/65 (60%), Positives = 53/65 (81%), Gaps = 1/65 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
AW +K+VT+A+WV++ QVSKTAPTGE+LT GSFMIRG+KNF P L MG+ LF+L ++
Sbjct: 707 AWSSKMVTSAYWVESHQVSKTAPTGEYLTVGSFMIRGRKNFLPASTLEMGVGVLFRLGDD 766
Query: 376 SSISR 380
S++R
Sbjct: 767 VSVAR 771
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 73/267 (27%), Positives = 131/267 (49%), Gaps = 26/267 (9%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P ++ QH+ P + +FA A DEFFS S K +A E A ++L + D R+
Sbjct: 394 PHLLIQHKNQPHLTYSTFATATDEFFSNLSSQKAAARADAAESAARERLAKIHADQARRV 453
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
L + Q ++ A L+ + + VD A+ I + + + W+ +E V Q + +P+A
Sbjct: 454 DGLVREQDKFRDAARLVELHADDVDRALGVINGALQSGMDWDQLEQLVTVEQGNENPIAL 513
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
+I +L L+ + I L L + D + D SE +V V++ SA+
Sbjct: 514 LIHKLVLDKDEIMLAL-----------------PDIDNWEDESEAPPIVIVTVNIKESAH 556
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKS---------AEKKTKQTLKDVQTMTNINKA 294
NA+ + + R++ +K++KTI+++E ALK+ AE + ++ K + + + +
Sbjct: 557 GNARAKYAVYRASKEKERKTIEASETALKAAEAKAKQQLAEAQKRKARKQLSVNSQVYQG 616
Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAK 321
+ KF WFI+S+NYL DA+
Sbjct: 617 NLQFCLNKFAWFITSDNYLVVAGKDAQ 643
>gi|387220185|gb|AFJ69801.1| hypothetical protein NGATSA_2069500, partial [Nannochloropsis
gaditana CCMP526]
Length = 75
Score = 91.3 bits (225), Expect = 8e-16, Method: Composition-based stats.
Identities = 40/59 (67%), Positives = 46/59 (77%)
Query: 322 VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+VT+AWWV A QVSKTAP GEFL TGSFM+RGKKNF P L MG+ LFKL+E S+ R
Sbjct: 1 MVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFLAPQPLEMGLGLLFKLDEGSVGR 59
>gi|154332902|ref|XP_001562713.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134059716|emb|CAM41838.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 1198
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 122/253 (48%), Gaps = 17/253 (6%)
Query: 63 TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP ++ Q+ + F SF DEFF E+ +ID +++ + K + DH
Sbjct: 359 TPILLAQYTEDGVVSFYRASFGRVCDEFFLITETARIDASNEKRKNTSKNKEDKFAADHA 418
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R++ALE ++ K + +I N + VD AI I +A +SWE + +K+ P
Sbjct: 419 RRINALETDIAANQLKGQQLILNADRVDEAIQLINGALATGISWEALRILLKRRHAEGHP 478
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VA +I L L N I++LL VL + + D P +V+V L
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------VLDEEAGEEDCDVP----PMVVEVALSK 523
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+A+ANA +F ++ K ++TI +T++A A +K ++ + + I K R+ W+
Sbjct: 524 TAHANAADYFGRQKQHRSKLERTIAATDRAAAGAARKGERKAAEQKERKVIVKERQRSWW 583
Query: 301 EKFYWFISSENYL 313
EKF+WF +S L
Sbjct: 584 EKFFWFRTSAGDL 596
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 6/55 (10%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW+ K T +WWV A QV+ TG +L T G+++ PP +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGTATGTYLYT------GERHHLPPQSMSLGCALLF 727
>gi|154304164|ref|XP_001552487.1| hypothetical protein BC1G_08352 [Botryotinia fuckeliana B05.10]
Length = 484
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 38/64 (59%), Positives = 46/64 (71%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AWD+K +AWWV ADQVSK+APTGEFL GSF GKKNF PP QL +G LF++ +
Sbjct: 48 AWDSKAGMSAWWVTADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDE 107
Query: 377 SISR 380
S +R
Sbjct: 108 SKAR 111
>gi|67624075|ref|XP_668320.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54659500|gb|EAL38073.1| hypothetical protein Chro.50204 [Cryptosporidium hominis]
Length = 1375
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/251 (29%), Positives = 119/251 (47%), Gaps = 24/251 (9%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
++F VDEF+S+ + K A Q+ + K++ VK D E RL L + +A
Sbjct: 367 LDNFCKCVDEFYSSIDIVKESKFATQEHKTIYSKVDKVKIDQERRLEGLSSEKEACIVRA 426
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
+ + ++QE ++ + IR IA W+DI ++Q +++N P+A IK L L + + +
Sbjct: 427 KFMESHQEILEKILQLIRHLIATGAQWQDIWNEIQQQKKNNHPLARHIKSLNLKDDKVKI 486
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
L S I P D + K D+ + S +N + + ++ A
Sbjct: 487 LFSQRDLGSEIT-----------PVVD--QIGKSIEFDLIISKSIQSNIRFQYMESKALA 533
Query: 258 KKQQKTIQSTEKAL--------KSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
+K +KT + + AL K AEK +K + +V I K R YWFEKFYWFISS
Sbjct: 534 EKFEKTQHAYKIALKKVTNIAKKDAEKASKGLVSNV---PRIKKLRAQYWFEKFYWFISS 590
Query: 310 ENYLGNVAWDA 320
+ YL DA
Sbjct: 591 DGYLIIGGHDA 601
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 35/55 (63%), Positives = 42/55 (76%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW K V +AWWV DQVSK AP+GE+L+TGSF+IRGKKNF PP +L MG + F
Sbjct: 658 AWVNKTVISAWWVYPDQVSKNAPSGEYLSTGSFVIRGKKNFLPPLKLEMGCALYF 712
>gi|70949333|ref|XP_744087.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56523889|emb|CAH79538.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 1345
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 124/268 (46%), Gaps = 33/268 (12%)
Query: 51 MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDL-KAVQQERDAL 109
+EF P++L+ + + +F F VD +FS E K D + + + ++AL
Sbjct: 406 VEFSPILLKNHINKI---DEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKNAL 462
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
K++ +K DHE R+ ALE+ + K+K LI N E V AI +R I+ +WE I
Sbjct: 463 TKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKIWD 522
Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
VK ++ N PVA I + N I LLL N D+ +++ E
Sbjct: 523 HVKLFKKRNHPVALKIMSVNFNNCEIELLL-------NEGDTEESSSEDSSKEKGMEEKN 575
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV---- 285
K L++ A +K +K ST A+K EKK K
Sbjct: 576 KACT------------------LRKKAEEKIRKIKMSTNVAIKKVEKKKKDKDTKQKGKH 617
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL 313
+++ I K RK++WFEKF WF+SSENYL
Sbjct: 618 KSVFQIQKLRKIFWFEKFNWFLSSENYL 645
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 35/60 (58%), Positives = 49/60 (81%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+ KV+T+AWWV QVSKTAPTGE++ TGSF+IRGKKN+ P +L MG+S +F++ ++
Sbjct: 709 AWNNKVITSAWWVYYHQVSKTAPTGEYIKTGSFVIRGKKNYLPYAKLEMGLSIIFQVNKN 768
>gi|300706574|ref|XP_002995542.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
gi|239604689|gb|EEQ81871.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
Length = 644
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 156/384 (40%), Gaps = 108/384 (28%)
Query: 52 EFHPMILQPGSTPSV-MRQHQGFPI----------QEFESFAAAVDEFFSTAESHKIDLK 100
EF+ I G +V M++ + F + F+SF AV+ FF KI
Sbjct: 202 EFNEFISNIGGFGAVQMKKGKPFSFFPFDIEIDNKKTFQSFNEAVEFFFMDRRKKKI--- 258
Query: 101 AVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN 160
++ D L+K+ N + +H L + + + KA+LI+ N + V+ + I N
Sbjct: 259 ---EKVDKLQKIRNKQYEHIKELENMVKDMTM---KADLILKNADIVENVLDIHNYVIKN 312
Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNAD 220
+L+W D + + + +A II +S+ K N
Sbjct: 313 KLNWNDFLKFKEDEKSKGNEIADII------------------VKSDFK--------NKS 346
Query: 221 PFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
D + + +++ D S ++NA+ +F+ KR KK ++ I TEKA+ + + KT
Sbjct: 347 CIIDLKDNEDSHFIEISFDKSLHSNAQNYFE-KR---KKFEEKILKTEKAIDTIKIKTYT 402
Query: 281 TLKDVQTMTNINKARKVYWFEKFYWFISSENYL--------------------------- 313
+ ++ R V+WFEKF + +++ L
Sbjct: 403 KEEKIKI------QRSVFWFEKFNFCFTTDKKLVIGGKNAQQNEIIVKKHLTPNHLYFHT 456
Query: 314 -------------------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGS 348
+ W+ VV+ ++VK+DQVSKT PTG+FL GS
Sbjct: 457 ESSGGSSVISEADVNIDEVALVALCNSACWEVNVVSPVFYVKSDQVSKTPPTGQFLPKGS 516
Query: 349 FMIRGKKNFFPPCQLAMGISFLFK 372
F+IRG K + +L G+ LFK
Sbjct: 517 FLIRGTKTYVNVYKLEYGVGLLFK 540
>gi|428671810|gb|EKX72725.1| hypothetical protein BEWA_012840 [Babesia equi]
Length = 842
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 125/297 (42%), Gaps = 88/297 (29%)
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
E++ N + VD + IR IA SW+DI+ +++ + P+A I ++ + +T
Sbjct: 229 EILQENIDLVDKCLNLIRTLIATGASWKDIQDQIEIQKTLKHPIAQRIAKIDIPNQIVTF 288
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
D ++ P +++ LS + N ++ + K+
Sbjct: 289 A------------------------GDPNDKSGP---EIEYGLSCHQNLEKMYSKKKKLE 321
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-INKARKVYWFEKFYWFISSENYL--- 313
K ++T E ALK K+ ++ V+ T+ I RK +WFEKFYWFI+S+ YL
Sbjct: 322 SKLERTKIGREFALKKVNKEKEKEQTQVKGPTHKITTLRKRFWFEKFYWFITSDGYLVLG 381
Query: 314 -----------------GNVAWDAKV-------------------------------VTN 325
G++ + A + +N
Sbjct: 382 GRDALQNEILVKKYLTKGDLYFHADIHGASSCILKNPGQVAEIPQTSIDQGGCFAVCHSN 441
Query: 326 AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AW WV QVSKT PTGEF+ GSF+IRGKKN+ P +L MG++ +F +
Sbjct: 442 AWNQKFMVPSWWVYHHQVSKTPPTGEFVPQGSFVIRGKKNYIQPQKLEMGLTVVFHI 498
>gi|398011164|ref|XP_003858778.1| hypothetical protein, conserved [Leishmania donovani]
gi|322496988|emb|CBZ32058.1| hypothetical protein, conserved [Leishmania donovani]
Length = 1228
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 17/253 (6%)
Query: 63 TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP ++ Q+ ++ SF DEFF E+ +ID ++ + A K + DH
Sbjct: 359 TPILLAQYTNDGVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHA 418
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R++ALE ++ K E +I N + VD AI I +A +SW+ + +K+ P
Sbjct: 419 RRINALETDIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 478
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VA +I L L N I++LL VL + + D P +V+V L
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------VLDEEKGEEDCDV----PPLVVEVTLSK 523
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+A+ANA +F ++ K ++T+ +TEKA A +K + + I K R+ W+
Sbjct: 524 TAHANAADYFSKQKQHRSKLERTVAATEKAAAGAARKGARKAAAQKEKKVIVKERQRNWW 583
Query: 301 EKFYWFISSENYL 313
EKF+WF ++ L
Sbjct: 584 EKFFWFRTTAGDL 596
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW+ K T +WWV A QV+ TG+++ G+++ PP +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 727
>gi|302761992|ref|XP_002964418.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
gi|300168147|gb|EFJ34751.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
Length = 382
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 7/96 (7%)
Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGE 342
KDV T+++ ++ + +++ + G ++T+AWWV QVSK APTGE
Sbjct: 142 KDVATISHTTGSKSLTVHKQYVCYA-----FGPAG--PTIITSAWWVYDHQVSKNAPTGE 194
Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
+LT GS MIRGKKNF PP L MG F+L++SSI
Sbjct: 195 YLTVGSLMIRGKKNFLPPYPLVMGFGLFFRLDKSSI 230
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
++F++F A +DEFFS E ++D + QE A KLE ++ D +R+ +L++
Sbjct: 63 EKFDTFDAVLDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRVDQRSRVESLKREVDQAVH 122
Query: 136 KAELIINNQESVDAAILAIRQDIAN 160
AELI N VD AI + +D+A
Sbjct: 123 TAELIEYNLADVDLAI-DVAKDVAT 146
>gi|146078492|ref|XP_001463556.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134067642|emb|CAM65921.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 1228
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 17/253 (6%)
Query: 63 TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP ++ Q+ ++ SF DEFF E+ +ID ++ + A K + DH
Sbjct: 359 TPILLAQYTNDGVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHA 418
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R++ALE ++ K E +I N + VD AI I +A +SW+ + +K+ P
Sbjct: 419 RRINALETDIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 478
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VA +I L L N I++LL VL + + D P +V+V L
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------VLDEEKGEEDCDV----PPLVVEVTLSK 523
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+A+ANA +F ++ K ++T+ +TEKA A +K + + I K R+ W+
Sbjct: 524 TAHANAADYFSKQKQHRSKLERTVAATEKAAAGAARKGARKAAAQKEKKVIVKERQRNWW 583
Query: 301 EKFYWFISSENYL 313
EKF+WF ++ L
Sbjct: 584 EKFFWFRTTAGDL 596
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW+ K T +WWV A QV+ TG+++ G+++ PP +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 727
>gi|157865120|ref|XP_001681268.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68124563|emb|CAJ02783.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 1224
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 17/253 (6%)
Query: 63 TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP ++ Q+ ++ SF DEFF E+ +ID +++ A K + DH
Sbjct: 359 TPILLAQYANDGVEALYRTSFGRVCDEFFLLTETERIDASNAKRKNTAKSKEDKFAADHA 418
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R++ALE ++ K E +I N + VD AI I +A +SW+ + +K+ P
Sbjct: 419 RRINALETDIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 478
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VA +I L L N I++LL L +N + D P +V+V L
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------ALDEENGEEDCDVP----PLVVEVALSK 523
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+A+ANA +F ++ K ++T+ +TEKA A +K + + I K R+ W+
Sbjct: 524 TAHANAADYFSKQKQYRSKLERTVAATEKAAAGAARKGARKAAGQKEKKVIVKERQRNWW 583
Query: 301 EKFYWFISSENYL 313
EKF+WF ++ L
Sbjct: 584 EKFFWFRTTAGDL 596
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW+ K T +WWV A QV+ TG+++ G+++ PP +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 727
>gi|124805420|ref|XP_001350435.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
gi|23496557|gb|AAN36115.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
Length = 2158
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 34/60 (56%), Positives = 49/60 (81%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+ K++T+AWWV +QVSK+AP+GE+L TGSF+IRGKKN+ P +L MG LF++E++
Sbjct: 753 AWNNKIITSAWWVYYNQVSKSAPSGEYLKTGSFVIRGKKNYLPHVKLEMGFCVLFQIEKN 812
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 130/277 (46%), Gaps = 26/277 (9%)
Query: 52 EFHPMILQPGSTPSVMRQHQG-FPIQEFESFAAAVDEFFSTAESHKID-LKAVQQERDAL 109
EF P+IL+ M+ ++G F+ + VD +FS E K D + + + ++A+
Sbjct: 438 EFSPIILKNHE----MKLNEGKIKYISFDDYNLCVDTYFSKLELSKYDKQQEITKSKNAI 493
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
K++ +K DHE R+ LE+ L+ K+K LI N ++ I +R ++ +WE I
Sbjct: 494 TKVDKIKLDHERRIEQLEKEVLLLKKKITLIQLNDVLIEEGIKLMRSALSTSANWEKIWE 553
Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
+K ++ P+A IK + + LL +R K D ++
Sbjct: 554 HIKIFKKQEHPIAVRIKSVNFKNCEMDYLLSDCDDRKGNK---------MGDDGDDNDDD 604
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV---- 285
D + + + K F +A +K +KT +T+ A+K EKK K +
Sbjct: 605 DDGDDDNNNNNKSCVKPKTF-----AAEEKIRKTKMATDFAVKKVEKKKKNKDNNKQKGK 659
Query: 286 --QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
++ I K RKVYWFEKF+WFISSENYL DA
Sbjct: 660 AKSSVGQIQKLRKVYWFEKFHWFISSENYLVIAGRDA 696
>gi|396081612|gb|AFN83228.1| putative RNA-binding protein [Encephalitozoon romaleae SJ-2008]
Length = 648
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 148/352 (42%), Gaps = 102/352 (28%)
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
++F +F A + FF + K + +R++ K++ V++ E + +EQ + ++
Sbjct: 243 RDFSTFNDAAEFFFQNRK------KFGRNDRES--KVDKVRKRQENYMKEMEQERQSYRK 294
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
KAEL+ N + V+ + + N++ W D E +Q + + ++ I
Sbjct: 295 KAELLEENADFVNKILDIFKIVKKNKVRWTDFEKFREQENKKGNEISKAI---------- 344
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
+K ++ H A E + +D + S ++N RF+ +
Sbjct: 345 ------------VKTDFISH---TCTIALEGEE-----IQIDFETSLFSNISRFYQKNKK 384
Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF-YWF-------I 307
+K +KT S E+ LK K + + +T R +YWFEKF ++F I
Sbjct: 385 LEEKIKKTRDSLEEVLKKVAPKV-----ETKKVT-----RALYWFEKFHFFFSSDGILVI 434
Query: 308 SSEN------------------YLGNV----------------------------AWDAK 321
+N + G++ W+A
Sbjct: 435 GGKNAQQNEILVKKHLEPTDLYFHGDMHGSSSIIVKKPTQKTIEEAASMALCMSKCWEAN 494
Query: 322 VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
VV+ W+V +QVSKTAP+GE+LT GSFMI+GKKN+ ++ G+ LFK+
Sbjct: 495 VVSPVWYVYGEQVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546
>gi|209875685|ref|XP_002139285.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209554891|gb|EEA04936.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 1427
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 159/369 (43%), Gaps = 76/369 (20%)
Query: 72 GFPIQEFES-FAAAVDEFFSTAESHKIDLKAVQQERDALK-KLENVKRDHETRLSALEQT 129
G + F S F+ VDEF+S+ + +K + K +QQE A+ K++ ++ D E RL L +
Sbjct: 379 GLIVNRFTSKFSDCVDEFYSSIDINK-ETKEIQQEEKAINSKIDKLRIDQERRLKELVEE 437
Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLK 189
+ ++A + + ++ +L R IA W+DI V+Q ++ P+A IK L
Sbjct: 438 KEACIKRANFMECCELLLEKILLLTRHLIATGAQWKDICNEVRQQRKIGHPIAKYIKSLD 497
Query: 190 LNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL--SAYANAK 247
L + + + + + S + ++ +S+ K +++ L++ S AN +
Sbjct: 498 LEHDRVVVYFGADEFPEDFDYSRYGYGES------NSKLKSQEGIEIYLNISKSMQANIR 551
Query: 248 RFFDLKRSAAKKQQKTIQSTEKAL----KSAEKKTKQTLKDVQTMTN-INKARKVYWFEK 302
++ + + K ++T + ++AL K+ + T++ + T N I+K R+ YWFEK
Sbjct: 552 SEYEESKHISAKLERTKSAYKRALNKVTKTVNRNTEKLTGPLNTGVNRIHKIRQSYWFEK 611
Query: 303 FYWFISSENYL---GN------VAWDAKVVTNAWWVKADQVSKTA------------PTG 341
F+WFISS+ +L GN + + + N ++ AD T P
Sbjct: 612 FHWFISSDGFLVIGGNDSSQNELLYRRYLEKNDRYIHADTHGATTCIVKNPKNLADIPMN 671
Query: 342 EFLTTGSFMI---------------------------------------RGKKNFFPPCQ 362
G I RGKKNF PP +
Sbjct: 672 TLCEAGQMSICYSRSWANKTVISAWWVYPDQVSKTAPSGEYLTTGSFVIRGKKNFLPPLK 731
Query: 363 LAMGISFLF 371
L MGI+ +F
Sbjct: 732 LEMGIALVF 740
>gi|76156132|gb|AAX27365.2| SJCHGC07862 protein [Schistosoma japonicum]
Length = 241
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 94/161 (58%), Gaps = 11/161 (6%)
Query: 28 SSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDE 87
S +GY+ K + ++E + + EFHP + + Q++ P F+SF AVD
Sbjct: 82 SLKGYIFGKPHQ--STEKGLQSYEEFHPFMFE---------QYREKPHLAFDSFNRAVDA 130
Query: 88 FFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESV 147
FFS ES K + + E+ A +K+EN+K+D E R+ L+ Q +D +KA LI N++ V
Sbjct: 131 FFSKIESQKTLGQISRNEQKANRKVENIKKDQERRIMLLKTEQELDMQKAYLIEANRQLV 190
Query: 148 DAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
D I+ I ++NQ+ W+++E +++A++ NDP++ I +L
Sbjct: 191 DNIIILINHALSNQIDWKELELIIEEAKQRNDPLSCHIVEL 231
>gi|300176454|emb|CBK23765.2| unnamed protein product [Blastocystis hominis]
Length = 767
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 153/359 (42%), Gaps = 80/359 (22%)
Query: 102 VQQERDAL-KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN 160
+++ R+A+ K++E + R+ ALE + + + + V+ AI+ + Q ++
Sbjct: 1 MRRSREAINKRVETISAKQSARIRALESEIAACRRRIRALQSYPTVVNTAIVVLNQYLSQ 60
Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI---------------TLLLRVHKER 205
+ W ++ +VK ++ V IKQL L N + + E
Sbjct: 61 GVQWSVLQEQVKLLKQRPYNVFHHIKQLDLEHNRVKLEFDDDFDDDDDDDDDDDNLEDED 120
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS--AYANAKRFFDLKRSAAKKQQKT 263
N++D A A SS ++ P ++VD++LS N F K+ K KT
Sbjct: 121 DNLEDDDDNLEGEAKMAAASSRSESPHKINVDVELSLNCNQNISLLFSQKKDLQDKLDKT 180
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTM--TNINKARKVYWFEKFYWFISSENYL-------- 313
+Q+ + A+ A K+ + L+ + I + R+ WFEKF W ++++ ++
Sbjct: 181 VQAAQAAVAEASKQRQTELRVAEAAHPAEIARQREKRWFEKFDWCVTTDGFIVLAGKSGE 240
Query: 314 ------------------GNVAWDAKVV-------------------------TNAW--- 327
+V A VV ++AW
Sbjct: 241 QNEILVRRYLRPGDLFLHADVHGAATVVLRNYRAPELPGEAALLQAAAFALCHSSAWDAQ 300
Query: 328 ------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
WV A QVSKTAP+GE+L TGSFMIRGKKNF P ++ MG++ LF++ + R
Sbjct: 301 LLCKVYWVPARQVSKTAPSGEYLPTGSFMIRGKKNFLAPYRMEMGLTVLFEVRPEDVQR 359
>gi|223478404|ref|YP_002582764.1| fibronectin-binding protein A domain-containing protein
[Thermococcus sp. AM4]
gi|214033630|gb|EEB74457.1| Fibronectin-binding protein A domain protein [Thermococcus sp. AM4]
Length = 650
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/365 (21%), Positives = 147/365 (40%), Gaps = 89/365 (24%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ ++G F++F+ A+DE+F K ++ ++ + ++L R E
Sbjct: 242 VPIELKIYEGLEKHYFKTFSEALDEYFGKLTIEKAKIERTRKLENKKRQLLATLRKQEEM 301
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L E+ ++E +LI N ++ + R+ +L WE+ + R++ ++ + VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYALIERLLEEFRK-ATEKLGWEEFKKRIEAGKKEGNRVA 360
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
++K + +T+ L K + + S +NA+ +
Sbjct: 361 LMVKSIDPKEKAVTIELEGKKVKLYLNKSI---GENAELY-------------------- 397
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
Y AK+F R + K + T++ L EK ++ ++ + I + RK WFEK
Sbjct: 398 YEKAKKF----RHKYEGALKAYEDTKRKLDEVEKLIEEEMRKELNVKRIER-RKKKWFEK 452
Query: 303 FYWFISSENYLGNVAWDAKV---------------------------------------- 322
F WF+SSE +L DA
Sbjct: 453 FRWFVSSEGFLVLAGKDASTNEVLIKKHMTENDLYCHADVYGAPHVVIKDGQKAGERTIF 512
Query: 323 --------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQ 362
++ AW W +QV+K AP+GE+L G+FM+ GK+N+ P +
Sbjct: 513 EACQFAVSMSRAWSQGLYGADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLK 572
Query: 363 LAMGI 367
LA+G+
Sbjct: 573 LAVGV 577
>gi|66357888|ref|XP_626122.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
II]
gi|46227289|gb|EAK88239.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
II]
Length = 1378
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 35/55 (63%), Positives = 42/55 (76%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW K V +AWWV DQVSK AP+GE+L+TGSF+IRGKKNF PP +L MG + F
Sbjct: 661 AWVNKTVISAWWVYPDQVSKNAPSGEYLSTGSFVIRGKKNFLPPLKLEMGCALYF 715
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 118/251 (47%), Gaps = 24/251 (9%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
++F VDEF+S+ + K A Q+ + K++ VK D E RL L + +A
Sbjct: 370 LDNFCKCVDEFYSSIDIVKESKFATQEHKTIYSKVDKVKIDQERRLEGLSSEKEACIVRA 429
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
+ + ++QE ++ + IR IA W+DI ++Q +++N P+A IK L L + + +
Sbjct: 430 KFMESHQEILEKILQLIRHLIATGAQWQDIWNEIQQQKKNNHPLARHIKSLNLKDDKVKI 489
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
L S P D + K D+ + S +N + + ++ A
Sbjct: 490 LFSQRDLGS-----------ETTPVVD--QIGKSIEFDLIISKSIQSNIRFQYMESKALA 536
Query: 258 KKQQKTIQSTEKAL--------KSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
+K +KT + + AL K AEK +K + +V I K R YWFEKFYWFISS
Sbjct: 537 EKFEKTQLAYKIALKKVTNIAKKDAEKASKGLVSNV---PRIKKLRAQYWFEKFYWFISS 593
Query: 310 ENYLGNVAWDA 320
+ YL DA
Sbjct: 594 DGYLIIGGHDA 604
>gi|341581973|ref|YP_004762465.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
gi|340809631|gb|AEK72788.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
Length = 650
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/394 (21%), Positives = 155/394 (39%), Gaps = 106/394 (26%)
Query: 38 EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKI 97
E RPN ++ + H ++ P +R ++GF + F +F+ A+DE+F K
Sbjct: 226 EARPN---IVLKDGNMHDVV------PIELRIYEGFEKRYFTTFSEALDEYFGKITMEKA 276
Query: 98 DLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD 157
++ ++ ++L R E L E+ ++E +LI N ++ + R+
Sbjct: 277 RVEQTKRLEAKKRQLLMTLRKQEEMLKGFEEGAKANQEIGDLIYANYALIERLLEEFRK- 335
Query: 158 IANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER----SNIKDSWV 213
L W++ + R+++ +R + VA ++K +T+ L K R +I ++
Sbjct: 336 ATETLGWDEFKKRIEEGKRAGNRVALMVKGTDPKEKAVTIELEGKKVRLYLNRSIGENAE 395
Query: 214 LHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKS 273
L+++ A F E L AY + KR D +++ K E ++K
Sbjct: 396 LYYEKAKKFRHKHEGA----------LKAYEDTKRKLDEIERLIEEELK----KELSVKR 441
Query: 274 AEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKV----------- 322
E++ K+ WFEKF WF+SSE +L DA
Sbjct: 442 IERRKKK------------------WFEKFRWFVSSEGFLVLAGKDAGTNEILIKRHMDD 483
Query: 323 -------------------------------------VTNAW----------WVKADQVS 335
++ AW W +QV+
Sbjct: 484 NDLYCHADVYGAPHVVIKDGQKAGEKTIFEACQFAVSMSKAWSRGVYSEDAYWAHPNQVT 543
Query: 336 KTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
K P+GE+L G+FM+ GK+N+ P +LA+G+
Sbjct: 544 KQTPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGV 577
>gi|342186351|emb|CCC95837.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1015
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 128/279 (45%), Gaps = 15/279 (5%)
Query: 31 GYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS 90
GYL+ K KR E I P + S+ + + I F+ A FF
Sbjct: 254 GYLL-KSSKRGGQEAMI-------PGTMISALFCSISTRRMLWLINIFQISVAFAMNFFH 305
Query: 91 TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAA 150
+ +I+ + + + K E +R+H R+ L++++ K LI N E++D
Sbjct: 306 IRKKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRKGHLIFQNTETIDKI 365
Query: 151 ILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKD 210
I I + + ++ W+D + +KQ + P+AS+IK++ + +L+ + + +
Sbjct: 366 IGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVVVLMNEDADDDDEQT 425
Query: 211 SWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKA 270
D + E +++DL +A+ NA+ +F +S A K ++TI +TEKA
Sbjct: 426 EDEEGEKREDRDRATYE------IEIDLTKTAHTNAEEYFARAKSTAAKLKRTIAATEKA 479
Query: 271 LKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
+ AE+K + Q I + R +W+EKF WF +S
Sbjct: 480 MAGAERKGRTVTGKTQEKKIITE-RCRFWWEKFNWFRTS 517
>gi|302419577|ref|XP_003007619.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
gi|261353270|gb|EEY15698.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
Length = 1107
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 136/308 (44%), Gaps = 70/308 (22%)
Query: 26 NVSSQGYLVQK-------KEKRPNS------EDYIVANMEFHPMILQP-GSTPSVMRQHQ 71
+ ++ GY++ K E++P+ ED + + FHP + Q PSV
Sbjct: 677 SATTTGYVIAKYRQKSEETEEKPDDGAETKREDLLYDD--FHPFLPQKFADDPSVK---- 730
Query: 72 GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQL 131
+ F+ F VDEFF A R ETR + Q
Sbjct: 731 ---VLTFDGFNKTVDEFFFLA------------------------RGPETR-----EAQS 758
Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
++++KA I N E V A+ A+ + + W +I +++ Q+ ++P
Sbjct: 759 LNEQKAAAIEANVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRHNP----------- 807
Query: 192 INHITLLLR---VHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAK 247
N +TLLL V E + D ++ L ++++L LS +ANA+
Sbjct: 808 -NLMTLLLGTEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEINLGLSPWANAR 866
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
++D +R+AA K+ KT+Q + ALK+AEKK + LK Q + RK WFEKF W
Sbjct: 867 EYYDQRRTAAVKELKTVQHSTMALKNAEKKITEDLKKGLKQEKAVLQPIRKQMWFEKFIW 926
Query: 306 FISSENYL 313
F+SS+ YL
Sbjct: 927 FLSSDGYL 934
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 33/45 (73%), Gaps = 1/45 (2%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGK-KNFFPP 360
AWD+K AWWV+ADQVSK+APTGE+L +FM G +NF PP
Sbjct: 1000 AWDSKAGMGAWWVRADQVSKSAPTGEYLPAAAFMGAGPGRNFLPP 1044
>gi|401416565|ref|XP_003872777.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322489002|emb|CBZ24251.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 1189
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 117/253 (46%), Gaps = 17/253 (6%)
Query: 63 TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
TP ++ Q+ ++ SF DEFF E+ +ID +++ A K + DH
Sbjct: 333 TPILLAQYTNDGVEALYRSSFGRVCDEFFLITETERIDASNAKRKNTAKSKEDKFATDHA 392
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
R++ALE ++ K E +I N + VD AI I +A +SW+ + +K+ P
Sbjct: 393 RRINALEADIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 452
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
VA +I L L N I++LL VL + + D P +V+V L
Sbjct: 453 VAYMIHDLFLERNSISVLLEA-----------VLDEEKGEEDCDV----PPLVVEVALSK 497
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
+A+ANA +F ++ K ++T+ +T KA A K + + I K R+ W+
Sbjct: 498 TAHANAADYFSKQKHHRSKLERTVAATAKAAAGAALKGARKAAAQKERKVIVKERQRQWW 557
Query: 301 EKFYWFISSENYL 313
EKF WF ++ L
Sbjct: 558 EKFLWFRTTAGDL 570
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW+ K T +WWV A QV+ TG+++ G+++ PP +++G + LF
Sbjct: 653 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 701
>gi|390960715|ref|YP_006424549.1| hypothetical protein containing fibronectin-binding protein
[Thermococcus sp. CL1]
gi|390519023|gb|AFL94755.1| hypothetical protein containing fibronectin-binding protein
[Thermococcus sp. CL1]
Length = 649
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/390 (23%), Positives = 157/390 (40%), Gaps = 108/390 (27%)
Query: 50 NMEFHP-MILQPGS----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQ 104
N E P ++ + G+ P ++ ++G + F +F+ A+DE+F +I ++ +
Sbjct: 223 NDELRPNIVFKEGNMHDVVPIELKIYEGLEKKYFNTFSEALDEYFG-----RITIEKAKI 277
Query: 105 ERDALKKLENVKRD-------HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD 157
ER +KLEN KR E L E ++E +LI N ++ + R+
Sbjct: 278 ERT--RKLENKKRQLLMTLRKQEEMLKGFEGAMRENQEIGDLIYANYALIERLLDEFRK- 334
Query: 158 IANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHD 217
+L WE+ R+++ ++ + VA ++K + +T+ L K + + S +
Sbjct: 335 ATEKLGWEEFRKRIEEGKKAGNRVAMMVKGINPKEKAVTIELDGKKVKLYLNRSI---GE 391
Query: 218 NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
NA+ + Y AK+F R + K + T++ L EK
Sbjct: 392 NAELY--------------------YEKAKKF----RHKHEGALKAYEDTKRKLNEVEKL 427
Query: 278 TKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKV--------------- 322
++ +K + I + RK WFEKF WFISSE +L DA
Sbjct: 428 IEEEMKKELNVKRIER-RKKKWFEKFRWFISSEGFLVLAGKDASTNEILIKRHMGENDLY 486
Query: 323 ---------------------------------VTNAW----------WVKADQVSKTAP 339
++ AW W +QV+K P
Sbjct: 487 CHADVYGAPHVVIKDGQKAGERTIFEACQFAVSMSKAWSRGVYSEDAYWAYPNQVTKQTP 546
Query: 340 TGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+GE+L G+FM+ GK+N+ P +LA+G+
Sbjct: 547 SGEYLGKGAFMVYGKRNWLHGLPLKLAVGV 576
>gi|389852774|ref|YP_006355008.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
gi|388250080|gb|AFK22933.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
Length = 642
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 102/394 (25%), Positives = 174/394 (44%), Gaps = 115/394 (29%)
Query: 48 VANMEFHP-MILQPGST----PSVMRQHQGFPIQEFESFAAAVDEFFS--TAESHKID-L 99
V N+E +P ++ + GS P + + + ++SF+ A+DE+F T E K +
Sbjct: 217 VLNLEKNPNIVYKNGSMVDVLPVDLVWYSNYEKVFYDSFSKALDEYFGKLTIEKAKRERT 276
Query: 100 KAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
KA++++R AL+ ++KR E ++ E+ ++E+ +L+ N V + IR+ I
Sbjct: 277 KALEEKRKALEI--SLKRIEE-QIRGFEKEAQENQERGDLLYANYTLVKEILETIRRGIK 333
Query: 160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
L E++ R+++A++ P A+II ++ KDS V+
Sbjct: 334 T-LGVEEVVKRIEEAKKKGYPWANIISKVS-------------------KDSLVIE---- 369
Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
E KK + +D++ + NA+ F++ AKK ++ ++ KA + +KK +
Sbjct: 370 ------LEGKK---IKLDINKTLEENAEIFYE----KAKKARQKLEGARKAYEETKKKIE 416
Query: 280 QTLKDVQ------TMTNINKARKVYWFEKFYWFISSENYL---GNVA------------- 317
+++ + + K RK WFEKF WFISSE +L G A
Sbjct: 417 NIEQEIMEEEKKIAVKKLEKRRK-KWFEKFRWFISSEGFLVIGGKDATTNEIVVKRHMSE 475
Query: 318 ---------WDAKVV-----------------------TNAW----------WVKADQVS 335
W A V + AW WV +QVS
Sbjct: 476 NDLYCHADIWGAPHVVIKEGRKASEKTIFEACQFAVSMSRAWSEGLASADAYWVYPEQVS 535
Query: 336 KTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
K AP GE+L G+FM+ GK+N+ P +LA+GI
Sbjct: 536 KQAPAGEYLPKGAFMVYGKRNWLHGIPLKLAVGI 569
>gi|401410580|ref|XP_003884738.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
gi|325119156|emb|CBZ54708.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
Length = 1853
Score = 85.9 bits (211), Expect = 3e-14, Method: Composition-based stats.
Identities = 67/249 (26%), Positives = 118/249 (47%), Gaps = 26/249 (10%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
F VDE+FS+ + K + +AL ++E ++ D R+ LE+ E+A
Sbjct: 441 FRDINVCVDEYFSSVDVQKGERAEALARHEALSRVEKIRSDQAQRMQQLEEEAASLLEEA 500
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
+ + N V+ I +R +A + W+++ ++KQ + P+A +++L+L L
Sbjct: 501 QAVEANVGLVEQIIQLLRAALATGVDWDELGRQMKQQAKEGHPLAVHVQELRLEKQRALL 560
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
LL E + + ++V +D+ LSA+ NA+ +
Sbjct: 561 LLEAPGEEA---------------------SGATTVVSIDITLSAHGNAQLLHSQVKQLK 599
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQ-----TMTNINKARKVYWFEKFYWFISSENY 312
K KT +T AL +A++K ++TLK + + K RK +WFEKF+WFISS++Y
Sbjct: 600 AKTLKTSSATAAALAAADRKAQRTLKQKEQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 659
Query: 313 LGNVAWDAK 321
L DA+
Sbjct: 660 LVLAGRDAQ 668
Score = 73.2 bits (178), Expect = 2e-10, Method: Composition-based stats.
Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
AW+ K AWWV QVSK+AP+G +L+TGSFMIRG++NF +L MG LF+L +E
Sbjct: 731 AWNTKTPAAAWWVYGHQVSKSAPSGLYLSTGSFMIRGRRNFIQIHRLEMGFGLLFRLADE 790
Query: 376 SSISR 380
+S++R
Sbjct: 791 ASVAR 795
>gi|448089209|ref|XP_004196743.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
gi|448093427|ref|XP_004197774.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
gi|359378165|emb|CCE84424.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
gi|359379196|emb|CCE83393.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
Length = 1056
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 100/212 (47%), Gaps = 68/212 (32%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-----KDVQTM 288
V +DL LS++AN++ +FD K++A KQ K ++TE AL++AEKK + L K+ +T+
Sbjct: 513 VSIDLSLSSFANSRIYFDNKKNAETKQAKVEKNTEIALRNAEKKINRDLSSNLKKESETL 572
Query: 289 TNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVV------TNAWWVKAD------- 332
I R +WFEKFYWF+S+E YL GN ++ N ++V +D
Sbjct: 573 KQI---RPKFWFEKFYWFVSNEGYLCLAGNDDTQTDMIYYRHFNDNDYFVTSDIEGSLKV 629
Query: 333 ------QVSKTAPT-----GEF-----------LTT----------------------GS 348
Q + +P+ G F +TT G+
Sbjct: 630 FVKNPYQGKEVSPSTLTQAGIFSMSASKAWDNKITTSAWYLKGSEVSKKDFDGSLVSFGN 689
Query: 349 FMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
F +G+K F PP QL MG++F F +E + R
Sbjct: 690 FNYKGEKQFLPPSQLVMGLAFYFLGDEETTQR 721
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 82/161 (50%), Gaps = 10/161 (6%)
Query: 41 PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLK 100
P+S+D EFHP +P + E E + +D +FST ES + L+
Sbjct: 286 PDSKDLKYIMDEFHPY--KPHK-----ENEDSYQFMEVEGYNKTLDTYFSTIESSRYALR 338
Query: 101 AVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN 160
QQ+ A K+LE + + + ++ +L + ++ +K E II + + ++ ++ Q I
Sbjct: 339 IEQQKEQARKRLEKARNERDKQIQSLLDQKNLNIKKGEAIIYHADVIEECKESVLQLIRQ 398
Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
Q+ WE+IE ++ Q + +A +IK L LN+ N I +LL
Sbjct: 399 QMDWENIEKLIQLEQTRGNKLAQMIK-LPLNLVQNKINVLL 438
>gi|448508289|ref|XP_003865916.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
gi|380350254|emb|CCG20475.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
Length = 654
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 149/379 (39%), Gaps = 104/379 (27%)
Query: 102 VQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQ 161
+ Q+ A ++L K + + +L+ Q + +K ELI + VD +++ + Q
Sbjct: 26 IHQKNLAAERLNKAKTRRDQEIQSLKDEQASNAKKGELIQLHSGLVDECRQYVQRFVDQQ 85
Query: 162 LSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVL------- 214
+ W ++E +K ++ +A I +L L++ ++R+ + I+ + +
Sbjct: 86 MDWTNMELAIKLEKKRPGSIARYI-ELPLHLKENRFVIRLKDPENGIESNGKITSDSEPD 144
Query: 215 -------------------------HHDNADPFADSSETKKP--------SLVDVDLDLS 241
++ +D S+ P + V +D LS
Sbjct: 145 SESDSESDSESDSDSDSDNSSDGDEENERSDSKGTSTSESHPKDKYTNEGTPVSIDYTLS 204
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDV--QTMTNINKARKVY 298
+YANA +F+ K++A KQ K + E A K+AEKK Q L K++ + T+ R+ +
Sbjct: 205 SYANASVYFENKKAAEAKQTKVEKGAEIAYKNAEKKINQDLVKNLRRENGTSSKSEREKF 264
Query: 299 WFEKFY-------------------------------WFISSE----------NYL---- 313
WFE FY +F+SSE N L
Sbjct: 265 WFESFYWFVSSEGYLCLAGRTKSQIDLLYFKYFSDDDFFVSSEIEGSLKVFVKNPLKGES 324
Query: 314 ---------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
+ AW+ K+ T AW + +VSK +G L G F +K+F
Sbjct: 325 VPPSTILQAGIFAMSASQAWNGKINTAAWVLHGSEVSKYNQSGALLPPGEFEYLARKHFL 384
Query: 359 PPCQLAMGISFLFKLEESS 377
PP QL MG F ++E S
Sbjct: 385 PPAQLVMGFGLYFLVDEGS 403
>gi|50293495|ref|XP_449159.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528472|emb|CAG62129.1| unnamed protein product [Candida glabrata]
Length = 1031
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 93/203 (45%), Gaps = 62/203 (30%)
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL--KDVQTMT 289
++V +DL LSAYANA +F++K+ A+KQ+K ++ EKA+K+ E K + L K ++
Sbjct: 514 TVVAIDLGLSAYANASTYFNMKKDHAEKQKKVEKNIEKAMKNIEDKIGKQLQKKLKESHD 573
Query: 290 NINKARKVYWFEKFYWFISSENYL-----GNV-------------------AWDAKV--- 322
+ K RK Y+FEK++WF S+E +L NV ++D KV
Sbjct: 574 VLKKIRKPYFFEKYFWFYSTEGFLVMLGKSNVETDQIYSRYIEDDDIFMSNSFDTKVWIK 633
Query: 323 ------------------------------VTNAWWVKADQVSKTAPT-GEFLTTGSFMI 351
++ WW A V+K G L G F +
Sbjct: 634 NPERVEVPPNTLMQAGILCMSASPAWQKKIASSPWWCFAKNVTKFDDVDGSVLAPGVFRL 693
Query: 352 RGKK--NFFPPCQLAMGISFLFK 372
R +K N PP QL MG+ F++K
Sbjct: 694 RNEKQINMLPPAQLVMGVGFMWK 716
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 86/176 (48%), Gaps = 13/176 (7%)
Query: 31 GYLVQKKE-----KRPNSE-DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
GY++ KK P + +YI N FHP I +T + + I+ +
Sbjct: 273 GYIIAKKNPLYKIDTPGYDLEYIYEN--FHPFIPHIPATD----EDKSKVIKIEGDYNKT 326
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
+D+FFST ES K LK QE+ A +K+E +++++ R+ AL + Q ++ K L+I N
Sbjct: 327 LDDFFSTIESSKYALKIQNQEQQAKQKIEAARQENKKRIDALREQQASNETKGNLLIANV 386
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
+ V+ A+ + Q+ W IE ++ Q + +A + L L N I +LL
Sbjct: 387 DLVEEVKSAVLGLVNQQMDWNTIEKLIQSEQNKGNKIAKHVSLPLDLKNNKIKILL 442
>gi|401826788|ref|XP_003887487.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
50504]
gi|395460005|gb|AFM98506.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
50504]
Length = 648
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/352 (22%), Positives = 144/352 (40%), Gaps = 102/352 (28%)
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
++F +F A + FF + + K + +R++ K++ V++ E + +EQ ++
Sbjct: 243 RDFPTFNDAAEFFFQSRK------KFGKNDRES--KVDKVRKRQENYMKEMEQEGESYRK 294
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
KAEL+ N + V+ + + N++ W D E +Q R + ++ I + +
Sbjct: 295 KAELLEANADFVNKILDIFKVVKKNKVKWTDFEKFREQENRKGNEISKAIVKTDFISHTC 354
Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
T++L + + +D +++ + N RF+ +
Sbjct: 355 TIVLEGEE------------------------------IQIDFEVTLFNNVSRFYQKSKK 384
Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-- 313
+K KT S E+ LK K + + +T R +YWFEKF++F SS+ L
Sbjct: 385 LEEKIMKTRDSLEEVLKKIAPKV-----ETKKIT-----RALYWFEKFHFFFSSDGVLVI 434
Query: 314 ------------------------GNVAWDAKVVTN------------------------ 325
G++ + ++
Sbjct: 435 GGRNAQQNEILVKKHLEPNDLYFHGDMHGSSSIIVKKPTPKTIEEAASMALCMSKCWEAN 494
Query: 326 ----AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
W+V +QVSKTAP+GE+LT GSFMI+GKKN+ ++ G+ LFK+
Sbjct: 495 VVSPVWYVYGEQVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546
>gi|68071251|ref|XP_677539.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497695|emb|CAH96713.1| conserved hypothetical protein [Plasmodium berghei]
Length = 1012
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 32/57 (56%), Positives = 46/57 (80%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
+W+ K++T+AWWV QVSKTAPTGE++ TGSF+IRGKKN+ P +L MG+ +F++
Sbjct: 282 SWNNKIITSAWWVYYHQVSKTAPTGEYIKTGSFVIRGKKNYLPYAKLEMGLCIIFQV 338
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/232 (31%), Positives = 118/232 (50%), Gaps = 26/232 (11%)
Query: 88 FFSTAESHKIDL-KAVQQERDALKKLENVKRDHETRLSALEQTQL-VDKEKAELIINNQE 145
+ + ES K D + + + ++AL K++ +K DHE R+ + Q+ + K+K LI N E
Sbjct: 7 YLTKMESTKYDKHQEMNKRKNALTKIDKIKLDHERRIEGSTKKQVSILKKKISLIQLNDE 66
Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
SV AI +R I+ +WE I +K ++ + P+A +K + +N N+ + L
Sbjct: 67 SVGEAIKLMRSAISTSANWEQIWDHIKLFKKRDHPIA--LKIMSVNFNNCEMEL------ 118
Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
+L+ D+ + D + K S + D ++ + L++ A +K +K
Sbjct: 119 -------LLNDDDIEENGDDNNLKNNSWKEKIADKNS-----KTCTLRKKAEEKIRKIKM 166
Query: 266 STEKALKSAEKKTKQTLKDV----QTMTNINKARKVYWFEKFYWFISSENYL 313
ST A+K EKK K +++ I K RKV+WFEKF WFISSENYL
Sbjct: 167 STNMAVKKVEKKKKDKDTKQKGKNKSVFQIKKLRKVFWFEKFNWFISSENYL 218
>gi|399216143|emb|CCF72831.1| unnamed protein product [Babesia microti strain RI]
Length = 933
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 33/55 (60%), Positives = 45/55 (81%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
AW AK +T+AWWV++ QVSKT TGE+L +GSF+IRGKKNF PP +L MG++ ++
Sbjct: 582 AWKAKFMTSAWWVESSQVSKTPETGEYLPSGSFVIRGKKNFLPPSKLEMGLAVIY 636
>gi|18977764|ref|NP_579121.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
gi|397651884|ref|YP_006492465.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
gi|18893505|gb|AAL81516.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
gi|393189475|gb|AFN04173.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
Length = 649
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 97/393 (24%), Positives = 164/393 (41%), Gaps = 106/393 (26%)
Query: 38 EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKI 97
EK+PN IV N E + P ++ ++ + + +ESF+ A+DE+F K
Sbjct: 227 EKKPN----IVYNKEG----VMVDVVPIDLQWYREYTKRYYESFSEALDEYFGKLTIEKA 278
Query: 98 DLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD 157
L+ +Q + K LE R E ++ E+ + ++EK + + + V+ IL +
Sbjct: 279 RLEKTKQLEERRKALEISLRRIEEQIKGFEKEAMTNQEKGDALYAHYSIVN-EILRVISS 337
Query: 158 IANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHD 217
Q E+++ R+++ ++ P A +I + + N +TL L IK
Sbjct: 338 ALKQYGVEEVKKRIEEGKKAGYPWAKMI--IDVTDNKVTLNL------DGIK-------- 381
Query: 218 NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSA 274
V +D++ S NA+ +++ + A KK + + T++ L
Sbjct: 382 ----------------VSLDVEKSLEENAELYYERAKKAKKKLEGAKIAYEETKRKLIEL 425
Query: 275 EKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVA-------------- 317
EK+ ++ K++ K +K WFEKF WFISSE +L G A
Sbjct: 426 EKEIERESKEINIKKITRKKKK--WFEKFRWFISSEGFLVIGGKDATTNEIVVKKHMDEN 483
Query: 318 --------WDAK-----------------------VVTNAW----------WVKADQVSK 336
W A ++ AW WV +QVSK
Sbjct: 484 DIYCHADIWGAPHVIIKNGRNASEKTIREACQFAVAMSRAWSEGLASADAYWVYPEQVSK 543
Query: 337 TAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
AP GE+L G+FM+ GK+N+ P +LA+GI
Sbjct: 544 QAPAGEYLPKGAFMVYGKRNWIHGIPLKLAVGI 576
>gi|154418675|ref|XP_001582355.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121916590|gb|EAY21369.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 875
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 115/251 (45%), Gaps = 27/251 (10%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
+P + Q+ Q F++F A DEF+S E + + + E KK+++VK++ + +
Sbjct: 283 SPFPLAQYDPSQSQVFDTFDKACDEFWSVRELERAQKEHKENEAAPDKKVQSVKKNFDKK 342
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ + LI N ++ I IAN++ W++I ++ Q + +A
Sbjct: 343 RKQFQDELDLLNRTGHLIQANATQIEQCRNVINSFIANRVRWDEIRMSIRAYQECGNELA 402
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
S+I ++ E+S + L +D E K + ++L +A
Sbjct: 403 SMIDKVDF-------------EKSGF---YCLVND---------EEGKTERIFIELKKTA 437
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
YANA +FD + KK + E+ LK EK K V + I + RK +WFE+
Sbjct: 438 YANASAYFDKRAVLVKKLEGANAKEEEVLKKVEKDAIAAKKKV--TSTIQERRKTWWFER 495
Query: 303 FYWFISSENYL 313
F+WFI++ENYL
Sbjct: 496 FHWFITTENYL 506
>gi|212223298|ref|YP_002306534.1| fibronectin-binding protein [Thermococcus onnurineus NA1]
gi|212008255|gb|ACJ15637.1| predicted fibronectin-binding protein [Thermococcus onnurineus NA1]
Length = 649
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/365 (21%), Positives = 144/365 (39%), Gaps = 89/365 (24%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ ++ F + F +F+ A+DE+F K ++ ++ ++L + E
Sbjct: 241 VPVELKVYENFEKRYFSTFSEALDEYFGKVTLEKAKIEQTKKLEAKKRQLLMTLKKQEEL 300
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L E+ ++E +LI N V+ + R+ +L WE+ + R+ + ++ + A
Sbjct: 301 LKGFEEQAKANQEIGDLIYANFTMVERLLDEFRK-ATERLGWEEFKKRIDEGKKAGNKAA 359
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
++K + +T+ L K R + S +NA+ + + ++ K L L A
Sbjct: 360 LMVKSIDPKEKAVTIELEGKKVRLYLNKSI---GENAELYYEKAKKAKHKLEGA---LKA 413
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
Y + KR D EK ++ +K + I + RK WFEK
Sbjct: 414 YEDTKRKLD---------------------EIEKLIEEEMKKELAVKRIER-RKKKWFEK 451
Query: 303 FYWFISSENYLGNVAWDAKV---------------------------------------- 322
F WF+SSE +L DA
Sbjct: 452 FRWFVSSEGFLVLAGKDASTNENLIKKHMDENDLYCHADVYGAPHVVIKDGQKAGEKTIF 511
Query: 323 --------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQ 362
++ AW W +QV+K AP+GE+L G+FM+ GK+N+ P +
Sbjct: 512 EACQFAVSMSKAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLK 571
Query: 363 LAMGI 367
LA+G+
Sbjct: 572 LAVGV 576
>gi|78395025|gb|AAI07765.1| SDCCAG1 protein, partial [Homo sapiens]
Length = 458
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/47 (76%), Positives = 40/47 (85%)
Query: 334 VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
VSKTAPTGE+LTTGSFMIRGKKNF PP L MG SFLFK++ES + R
Sbjct: 2 VSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 48
>gi|76156824|gb|AAX27946.2| SJCHGC07203 protein [Schistosoma japonicum]
Length = 184
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 39/52 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW V+T AWWV DQVSKTAP+GE+LT+GSF+IRGKKN+ PPC G
Sbjct: 133 AWQNHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFG 184
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 31/43 (72%)
Query: 271 LKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
LKSA K + T+K +T+ I + RK WFEKF+WFISSENYL
Sbjct: 8 LKSAIHKAEATMKTAKTIAQITEVRKPMWFEKFFWFISSENYL 50
>gi|332157694|ref|YP_004422973.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
gi|331033157|gb|AEC50969.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
Length = 650
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 89/370 (24%), Positives = 158/370 (42%), Gaps = 100/370 (27%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P +R + G+ F++F+ A+DE+F K + ++ + K+L + E
Sbjct: 243 VPIDLRWYDGYEKVYFDTFSKALDEYFGKLTIEKAREEKTKKLEEKKKQLIATLKRQENM 302
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ ++ ++E A+LI N + VD + + + + +L WE++ RV++ ++ + +A
Sbjct: 303 IKGFKEEMRRNQEIADLIYANYQLVDNLLKELSKAV-ERLGWEELIRRVEEGKKKGNRIA 361
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+IK + N +T+ + E KK L +D D++
Sbjct: 362 MMIKSINPQENSVTIEI---------------------------EDKKVRLY-IDRDIN- 392
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV----- 297
NA+ +++ AKK + ++ +KA + +KK +Q K+++ + +K+
Sbjct: 393 -ENAEIYYE----KAKKAKHKLEGAKKAYEELKKKLEQVEKEIEEEEKKVQVKKIERRKK 447
Query: 298 YWFEKFYWFISSENYL---GNVA----------------------WDAKVV--------- 323
WFEKF WFISSE +L G A W A V
Sbjct: 448 KWFEKFRWFISSEGFLVIGGKDATTNEIVVRKYMGENDIYCHADIWGAPHVIIKDGRRAS 507
Query: 324 --------------TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
+ AW WV +QV K AP+GEFL G+FM+ GK+N+
Sbjct: 508 EKTIFEACQFAVSMSRAWSEGLYSADAYWVYPEQVKKQAPSGEFLPKGAFMVYGKRNWMH 567
Query: 360 --PCQLAMGI 367
P +LA+GI
Sbjct: 568 GIPLKLAVGI 577
>gi|150865765|ref|XP_001385110.2| highly conserved hypothetical protein Predicted RNA-binding
[Scheffersomyces stipitis CBS 6054]
gi|149387021|gb|ABN67081.2| conserved hypothetical protein Predicted RNA-binding protein
[Scheffersomyces stipitis CBS 6054]
Length = 1038
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 94/183 (51%), Gaps = 18/183 (9%)
Query: 26 NVSSQGYLVQKK--------EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
N + G++V KK E N +Y+ EFHP +P + +G+ E
Sbjct: 270 NEQTNGFIVSKKNPLYNPSEEHSDNDLEYVYD--EFHP--FKP-----FKKNLEGYKFTE 320
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
E + +D FFS ES K LK QQ+++A K+LEN + + ++ +L Q Q + +K
Sbjct: 321 IEGYNKTLDTFFSALESTKFALKIEQQKQNANKRLENARSERNKQIQSLIQQQETNSKKG 380
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
+ II + + V + I AI++ + Q+ W +IEA VK Q + + S IK L LN N I
Sbjct: 381 DTIIYHADLVASCISAIQKMLDKQMDWGNIEAIVKHEQSSGNEIMSTIKLPLNLNENKIN 440
Query: 197 LLL 199
L+L
Sbjct: 441 LVL 443
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 86/207 (41%), Gaps = 67/207 (32%)
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN----I 291
+DL LS YANA+ +F+ K+SA K++K ++TE ALK+AE+K KQ L + N +
Sbjct: 523 IDLSLSPYANARLYFESKKSAESKKEKVEKNTEMALKNAERKIKQDL--AHNLKNEHDTL 580
Query: 292 NKARKVYWFEKF------------------------YWFISSENYL-------------- 313
+ R YWFEKF Y F + ++
Sbjct: 581 KQLRPKYWFEKFYWFVSSEGYLCLAGRDPSQTDMIYYRFFNDNDFFVSAEMEGSLKVFVK 640
Query: 314 ----------------GN------VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMI 351
GN AW KV T+AW + VSK G L G F
Sbjct: 641 NPFKGESVPPYTLMQAGNFAKSTSTAWSGKVSTSAWVLHGSDVSKKDFDGSLLAGGEFNY 700
Query: 352 RGKKNFFPPCQLAMGIS-FLFKLEESS 377
+ KK F PP QL MG +L EE++
Sbjct: 701 KSKKEFLPPTQLTMGFGLYLLGDEETA 727
>gi|409095360|ref|ZP_11215384.1| Fibronectin-binding protein A (FbpA) [Thermococcus zilligii AN1]
Length = 650
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 149/365 (40%), Gaps = 89/365 (24%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ + G + F +F+ A+DE+F K ++ Q+ K+L E
Sbjct: 242 VPIELKVYGGLEKKYFSTFSEALDEYFGRITVEKARIEQTQKLEAKKKQLLTTLMMQEEM 301
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L E+ ++E +LI N V+ + ++ +L WE+ + R+++ ++ + VA
Sbjct: 302 LRGFEKAMKENQELGDLIYANYPVVERLLEEFKR-ATEKLGWEEFKKRIEEGKKAGNRVA 360
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
++K++ +T+ L E +K LH D + +++E
Sbjct: 361 LMVKEIDPKEKAVTVEL----EGKEVK----LHVDRS--LGENAEL-------------Y 397
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
Y NAK+F R + K + T + ++ EK ++ +K + I + RK WFEK
Sbjct: 398 YENAKKF----RHKYEGALKAYEDTRRKIEEIEKLIEEEMKKELNVRRI-EGRKKRWFEK 452
Query: 303 FYWFISSENYLGNVAWDAKV---------------------------------------- 322
F WF+SSE +L DA
Sbjct: 453 FRWFVSSEGFLVLAGKDANTNETLVKKHMDKNDLYCHADVYGAPHVVIKDGQKAGEKTIF 512
Query: 323 --------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQ 362
++ AW W +QV+K AP+GE+L G+FM+ GK+N+ P +
Sbjct: 513 EACQFAVSMSRAWSQGLYSADAYWAYPEQVTKQAPSGEYLGKGAFMVYGKRNWLHGLPLK 572
Query: 363 LAMGI 367
LA+G+
Sbjct: 573 LAVGV 577
>gi|255722283|ref|XP_002546076.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240136565|gb|EER36118.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 857
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 97/194 (50%), Gaps = 16/194 (8%)
Query: 11 IHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVM 67
+H C L+S +QGY+V +K K + + DYI EFHP +P +
Sbjct: 96 LHVCEDKYMDLISG-KTETQGYIVSRKNKNASEDSEFDYICD--EFHP--FKPYKSNVT- 149
Query: 68 RQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALE 127
E + VD+FFST ES K LK QQ+ +A K+LE K + + ++ +L
Sbjct: 150 ----DLKFTEVSGYNKTVDQFFSTLESSKFSLKIEQQKENASKRLEKAKSERDKQIESLV 205
Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
Q ++ +K ELI + E V+ ++Q + Q+ W +IE + Q+ N+P + I Q
Sbjct: 206 AQQQLNSKKGELIQYHSELVEECRRYVQQYLDQQMDWTNIETVIALEQKKNNPTSKSI-Q 264
Query: 188 LKLNI--NHITLLL 199
L LN+ N I +LL
Sbjct: 265 LPLNLKDNKIKVLL 278
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 35/67 (52%)
Query: 314 GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
+ AW+ KV T+AW + ++SK G + G F KK + PP QL MG+ F +
Sbjct: 482 ASTAWNGKVTTSAWVLHGTEISKRDFDGSIVPDGEFKYLAKKEYLPPAQLVMGLGFYCLV 541
Query: 374 EESSISR 380
+E S +
Sbjct: 542 DEESTKK 548
>gi|85000891|ref|XP_955164.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65303310|emb|CAI75688.1| hypothetical protein, conserved [Theileria annulata]
Length = 1185
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 32/59 (54%), Positives = 44/59 (74%)
Query: 316 VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
AW+ K +WWV QVSKT PTGE++ GSF+IRGKKN+ PP +L MGI++LF+++
Sbjct: 613 TAWNEKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLPPQKLEMGITYLFQVQ 671
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 123/258 (47%), Gaps = 25/258 (9%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR-LSALEQTQLVDKEK 136
FE F AVD FF+ E K + K+V + + K +K D R L+ +E Q +D K
Sbjct: 274 FEDFNDAVDTFFTKHELAKQEKKSVDKRPTKINK---IKIDQNKRELNLMEDIQKID-SK 329
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
+L+ + + + + + IA+ SW DI +++ ++ N P+ IK+ ++I T
Sbjct: 330 IKLLEEHVDVAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKE--IHIPTQT 387
Query: 197 LLLRV------HKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF 250
L+ H E++ + + T+ +V++D L+++ N K+ +
Sbjct: 388 LIFYSNQNQDQHNEQNKQNQFQQNIQQKNENKQNKKNTRDEVVVELDYRLNSHQNLKKLY 447
Query: 251 DLKRSAAKKQQKTIQSTEKALKSA--------EKKTKQTLKDVQTMTNINKARKVYWFEK 302
+ ++ K ++T E ALK KKT + +DV+ I+ R+ +WFEK
Sbjct: 448 NERKRLENKLERTRIGKEYALKKVTKSLKKEENKKTDKKGRDVK----ISSVRRRFWFEK 503
Query: 303 FYWFISSENYLGNVAWDA 320
FYWFI+S+ YL DA
Sbjct: 504 FYWFITSQGYLVLAGRDA 521
>gi|159906014|ref|YP_001549676.1| hypothetical protein MmarC6_1632 [Methanococcus maripaludis C6]
gi|159887507|gb|ABX02444.1| protein of unknown function DUF814 [Methanococcus maripaludis C6]
Length = 680
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 152/373 (40%), Gaps = 113/373 (30%)
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
I+ +ESF A+DE+FS I K ++Q L+KL VK+ S LE + +K
Sbjct: 276 IKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQERILKSQLETKEKYEK 329
Query: 135 E------KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
+ + +LI N VD + I+ ++ WE I+ +K+ + H PV S K +
Sbjct: 330 QSRSNHKRGDLIYANYSFVDEIVSTIKL-AREKMGWEGIKNVIKENKTH--PVLS--KII 384
Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKR 248
+N + L+L++ + N L DN V VDL +A+ NA
Sbjct: 385 NVNEKNAELMLKLSADYGN-----GLIEDN---------------VPVDLRKNAFENA-- 422
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----------INKARKV 297
D+ +KK + +Q +ALK +EKK + LKD + + + + + +
Sbjct: 423 --DIVYQKSKKFKNKVQGVIEALKISEKKLAE-LKDKEKLDSEVLKEKEENIKKKERKVL 479
Query: 298 YWFEKFYWF--------------------------------------------------I 307
W+EK W I
Sbjct: 480 KWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEI 539
Query: 308 SSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
EN L VA A + AW WV+ DQ+SKTA +GE+L G+F+IRGK+NF
Sbjct: 540 PDENILFEVAKFASSHSRAWKLGVGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNF 599
Query: 358 FPPCQLAMGISFL 370
L +GI L
Sbjct: 600 IRSAALELGIGML 612
>gi|347828081|emb|CCD43778.1| hypothetical protein [Botryotinia fuckeliana]
Length = 430
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/56 (60%), Positives = 41/56 (73%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+AWWV ADQVSK+APTGEFL GSF GKKNF PP QL +G LF++ + S +R
Sbjct: 2 SAWWVTADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDESKAR 57
>gi|71027701|ref|XP_763494.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350447|gb|EAN31211.1| hypothetical protein, conserved [Theileria parva]
Length = 1249
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 43/58 (74%)
Query: 316 VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AW+ K +WWV QVSKT PTGE++ GSF+IRGKKN+ PP +L MGI++LF++
Sbjct: 598 TAWNEKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLPPQKLEMGITYLFQV 655
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/256 (30%), Positives = 124/256 (48%), Gaps = 26/256 (10%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
FE F AVD FF+ E K + K Q ++ KL +K D + R EQ + D K
Sbjct: 275 FEDFNDAVDAFFTKHELAKQE-KKTQDKKPT--KLNKIKIDQDKR----EQKLVEDIRKL 327
Query: 138 ELIINN-QESVDAA---ILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNIN 193
+L I +E+VD A + + IA+ SW DI +++ ++ N P+ IK+ IN
Sbjct: 328 DLEIKLLEENVDIAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKE----IN 383
Query: 194 HITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP----SLVDVDLDLSAYANAKRF 249
T L H S D + P +++K + V +D L+++ N K+
Sbjct: 384 IPTQTLIFHNPISG-SDQLSQGGQSGKPGKSGTQSKLSKDLTASVSLDYRLNSHQNLKKL 442
Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEK-----KTKQTLKDVQTMTNINKARKVYWFEKFY 304
++ ++ K ++T E ALK K +TK+T K+ + + I+ RK +WFEKFY
Sbjct: 443 YNERKRLENKLERTKIGKEYALKKVTKSLKKQETKKTDKNKRDV-RISSVRKRFWFEKFY 501
Query: 305 WFISSENYLGNVAWDA 320
WFI+S+ YL DA
Sbjct: 502 WFITSQGYLVLAGRDA 517
>gi|134045609|ref|YP_001097095.1| hypothetical protein MmarC5_0566 [Methanococcus maripaludis C5]
gi|132663234|gb|ABO34880.1| protein of unknown function DUF814 [Methanococcus maripaludis C5]
Length = 680
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 155/388 (39%), Gaps = 116/388 (29%)
Query: 63 TPSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
+P ++ H+ I+ +ESF A+DE+FS I K ++Q L+KL VK+
Sbjct: 261 SPIDLKMHENLKENEIKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQE 314
Query: 120 ETRLSALEQTQLVDKE------KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
S LE + +K+ + +LI N VD + I+ D ++ W I+ +K+
Sbjct: 315 RILKSQLETKEKYEKQSLSNHKRGDLIYANYSLVDEIVGTIK-DAREKMDWNGIKKIIKE 373
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
+ H P+ S I +N+N N + + L D + + +
Sbjct: 374 NKTH--PILSKI----INVNE-----------KNAELTLKLSADYGNGLIEDT------- 409
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V VDL +A+ NA D+ +KK + +Q +ALK +EKK + LKD + + +
Sbjct: 410 VPVDLRKNAFENA----DIVYQKSKKFKHKVQGVIEALKISEKKLAE-LKDKEKLDSEIL 464
Query: 294 ARK-----------VYWFEKFYWF------------------------------------ 306
K + W+EK W
Sbjct: 465 KEKEEKIKKKERKVLKWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLME 524
Query: 307 --------------ISSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGE 342
I EN L VA A + AW WV+ DQ+SKTA +GE
Sbjct: 525 GAPFTIIRTEGSEEIPDENVLFEVAKFASSHSRAWKLGIGSADVYWVRPDQISKTAESGE 584
Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+L G+F+IRGK+NF L +GI L
Sbjct: 585 YLKKGAFVIRGKRNFIRSAALELGIGML 612
>gi|147920849|ref|YP_685344.1| hypothetical protein RCIX612 [Methanocella arvoryzae MRE50]
gi|110620740|emb|CAJ36018.1| conserved hypothetical protein [Methanocella arvoryzae MRE50]
Length = 670
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 148/365 (40%), Gaps = 95/365 (26%)
Query: 69 QHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQ 128
+ +G+ FE+F AVD +F + V+++ + L E R + ++ E+
Sbjct: 259 EGEGYEKVYFETFNKAVDAYFGARIKTEAKAAIVEKKAEKLGVFERRLRQQQDAIAKFER 318
Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
+ + K E+I + V+ I I+ SW+DI +K A++ + A+ I+ +
Sbjct: 319 EEQENARKGEVIYAEYQKVEEIIKVIKGARDRGYSWDDIRKILKDAKKAGNQAAAAIQAI 378
Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKR 248
IT++L + V++D+ L+ NA+
Sbjct: 379 DSATGLITVVL------------------------------PEATVNIDVKLTVPQNAQA 408
Query: 249 FFD-LKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
++D +K+ AKK+ K I+ T KA+ A+ K + K VQ + K RK W+++F W
Sbjct: 409 YYDKVKKVQAKKEGALKAIEETRKAMAKAQPKVAEPGKPVQKKVS-AKPRKPKWYDRFRW 467
Query: 306 FIS-------------------------------------------------SENYLGNV 316
F + +E L V
Sbjct: 468 FFTSDGFLVVAGRDADTNEEIVKKYMEKNDVFFHAQAHGAPITVLKTAGKPVTEQALAEV 527
Query: 317 A---------WDA-KVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL--A 364
A W A + + +WVK +QVSKT GE++ G+F++RG++N+ Q+ A
Sbjct: 528 AQFAVSYSSVWKAGQFSGDCYWVKPEQVSKTPEPGEYVAKGAFIVRGERNYVKDVQVRAA 587
Query: 365 MGISF 369
+GI F
Sbjct: 588 IGIRF 592
>gi|403222989|dbj|BAM41120.1| uncharacterized protein TOT_030000383 [Theileria orientalis strain
Shintoku]
Length = 1119
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/279 (27%), Positives = 130/279 (46%), Gaps = 27/279 (9%)
Query: 65 SVMRQHQGFPIQE---FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
VM F +Q E + A+D FF+ E K + K E KL +K D +
Sbjct: 255 GVMDDFGPFELQNAEYHEDYNYALDAFFTKNELVKQEKKT---ESKKPTKLTKIKADQDK 311
Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
R S L + + ++ ++ N + VD + + IA+ SW DI +++ ++ N P+
Sbjct: 312 RESKLMEEIMGYDKQIRVLEENIDIVDNCLNLTKALIASGASWNDIYEQLQIQRKQNHPL 371
Query: 182 ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
IK++ NI + TL+ V N +D +P + E + +V +D L+
Sbjct: 372 VCYIKEI--NIPNQTLVF-VSNPEGNERDD--------EP--ERKELVEEQVVVLDYRLT 418
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALK----SAEKKTKQTLKDVQTM-TNINKARK 296
Y N K+F+ ++ A K ++T E ALK S K+ + D +T I+ RK
Sbjct: 419 GYQNLKKFYINRKKAENKLERTKIGKEYALKKVAKSLSKQPEVKKGDRRTREVKISSLRK 478
Query: 297 VYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
+WFEKFYWFI+S+ YL G + +++ + K D
Sbjct: 479 RFWFEKFYWFITSQGYLVLAGRDSLQNELLVKKYLTKGD 517
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 42/57 (73%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AW+ K +WWV QVSKT PTGE++ GSF+IRGKKN+ P +L MGI++LF++
Sbjct: 584 AWNDKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLQPQKLEMGITYLFQV 640
>gi|57641373|ref|YP_183851.1| fibronectin-binding protein [Thermococcus kodakarensis KOD1]
gi|57159697|dbj|BAD85627.1| predicted fibronectin-binding protein [Thermococcus kodakarensis
KOD1]
Length = 650
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/367 (22%), Positives = 146/367 (39%), Gaps = 93/367 (25%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ ++GF + F +F+ A+DE+F K ++ ++ + + L R E
Sbjct: 242 VPIELKIYEGFEKRYFPTFSEALDEYFGKITLEKAKIEQTKKLEEKKRGLMATLRKQEEM 301
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L E+ ++E +LI N ++ + R+ L W++ R+ + ++ + VA
Sbjct: 302 LKGFEKAMRENQEIGDLIYANYTLIERLLEEFRK-ATETLGWDEFRRRIDEGKKTGNKVA 360
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
++K + +T+ L K + ++ S +NA+ +
Sbjct: 361 LMVKGIDPKEKAVTIELDGKKVKLYLEKSL---GENAEIY-------------------- 397
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA--RKVYWF 300
Y AK+F A K + T + E+ K E++ K+ L N+ K RK WF
Sbjct: 398 YEKAKKFRHKYEGALKAYEDTKRKLEEIEKLIEEEQKKEL-------NVKKLERRKRKWF 450
Query: 301 EKFYWFISSENYLGNVAWDAKV-------------------------------------- 322
EKF WF+SSE +L DA
Sbjct: 451 EKFRWFVSSEGFLVLAGKDASTNEVLVKKHMEDNDLYCHADVYGAPHVVIKDGQKAGEKT 510
Query: 323 ----------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--P 360
++ AW W +QV+K AP+GE+L G+FM+ GK+N+ P
Sbjct: 511 IFEACQFAVSMSRAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWMHGLP 570
Query: 361 CQLAMGI 367
+LA+G+
Sbjct: 571 LKLAVGV 577
>gi|260942807|ref|XP_002615702.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
gi|238850992|gb|EEQ40456.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
Length = 605
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 92/213 (43%), Gaps = 63/213 (29%)
Query: 231 PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDVQTM 288
P+L VD+DL LSA+ANA +F+ K+ A KQ + ++T+ ALK+AE+K + L K+++
Sbjct: 71 PTLTVDIDLALSAFANASVYFESKKVAVTKQTRVEKNTKIALKNAERKIQSDLNKNLKNE 130
Query: 289 TNINKA-RKVYWFEKFYWFISSENYLGNVAWD--------------------------AK 321
T +A R +WFEK++WF +S+ YL D AK
Sbjct: 131 TESLRAFRHKFWFEKYFWFTTSDGYLCLAGRDDLQTDMIYYRHFSDGDYFVSSDLDGAAK 190
Query: 322 V-------------------------VTNAW---------WVKADQVSKTAPTGEFLTTG 347
V + AW W+ V+K G L G
Sbjct: 191 VFILNPYKAQNVSPSALFQAGIFALSTSTAWSAKISSSAWWMSGADVTKREFDGSLLGPG 250
Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
+ KKN+ PP Q+ MG F + +E + +
Sbjct: 251 ILKYKAKKNYMPPAQMVMGFGFYWLCDEETTQK 283
>gi|19074389|ref|NP_585895.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi GB-M1]
gi|19069031|emb|CAD25499.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
gi|449329389|gb|AGE95661.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi]
Length = 648
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/377 (22%), Positives = 152/377 (40%), Gaps = 103/377 (27%)
Query: 53 FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
+ +I++ G PS + + ++FE+F A + +F + + K + +R++ K+
Sbjct: 221 YGGVIMEKGK-PSSLVPFKTAGARDFETFNEAAEFYFQSRK------KFGKNDRES--KV 271
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ V++ E + +EQ + + KAEL+ N + V+ + + N++ W D E
Sbjct: 272 DKVRKRQEEYVKEMEQQGELLRRKAELLERNSKLVNRILDIFKVVKKNRIKWTDFEKFWG 331
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
Q + + V+ I + HK W++
Sbjct: 332 QENKKGNEVSKAI---------VKTDFMAHK-------CWIVLEGEE------------- 362
Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
+++D D S ++N + + +K ++T S E+ LK K + + +T
Sbjct: 363 -IEIDFDSSLFSNISGLYQKSKKLEEKIRRTRDSLEEVLKRIAPKIES-----KKIT--- 413
Query: 293 KARKVYWFEKFYWFISSENYL--------------------GNVAWD------------- 319
R YWFEKF++F SS+ L G++ +
Sbjct: 414 --RAPYWFEKFHFFFSSDGVLVIGGKNAQQNEILVKKHLEPGDLYFHSDMHGSSSIIVKK 471
Query: 320 ------------AKVVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
A ++ W +V DQVSKTAP+GE+L GSFMI GKKN+
Sbjct: 472 ATQKTIEEAASMALCMSKCWEANVVSPVWYVYGDQVSKTAPSGEYLKKGSFMITGKKNYV 531
Query: 359 PPCQLAMGISFLFKLEE 375
++ G+ LF++ E
Sbjct: 532 ECHRIEYGLGLLFRVSE 548
>gi|303389736|ref|XP_003073100.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
50506]
gi|303302244|gb|ADM11740.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
50506]
Length = 648
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 83/377 (22%), Positives = 148/377 (39%), Gaps = 103/377 (27%)
Query: 53 FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
+ +I++ G PS + + ++F +F A + FF + K + +R+ K+
Sbjct: 221 YGGVIMEKGK-PSNLIPFKADGAKDFNTFNDAAEYFFQGRK------KFGKNDRET--KV 271
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ V++ E + +EQ ++KAEL+ N + V+ + + N++ W D E +
Sbjct: 272 DKVRKRQENYMKEMEQQGECYRKKAELLEKNADLVNRILEIFKVVRKNKVKWTDFEKFRE 331
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
Q + V+ I + ++H W+ P
Sbjct: 332 QENKKGSEVSKAIVKTDF-VSHTC---------------WITLEGEEIP----------- 364
Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
+D ++S + N F+ + +K +KT S + LK K + + +T
Sbjct: 365 ---IDFNISLFNNVSEFYQKSKKLEEKIRKTRDSLGEVLKKIAPKV-----ETKKIT--- 413
Query: 293 KARKVYWFEKFYWFISSENYL---GNVAWD------------------------------ 319
R +YWFEKF++F SS+ L G A
Sbjct: 414 --RTLYWFEKFHFFFSSDGVLVIGGKTAQQNEILVKKHLEPTDLYFHSDVHGASSIIVKK 471
Query: 320 ------------AKVVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
A ++ W +V +QVSKTAP+GE+L GSFMI+GKKN+
Sbjct: 472 PTEKTIVETASMALCMSRCWETNVVSPVWYVYGEQVSKTAPSGEYLGKGSFMIKGKKNYV 531
Query: 359 PPCQLAMGISFLFKLEE 375
++ G+ LF++ E
Sbjct: 532 DCHKIEYGLGLLFRVFE 548
>gi|294658357|ref|XP_002770767.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
gi|202953070|emb|CAR66294.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
Length = 1064
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 88/207 (42%), Gaps = 62/207 (29%)
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--INK 293
+D+ LS +ANA+ +F+ K+SA KQ K +STE ALK+A+KK +Q L + N + +
Sbjct: 516 IDISLSPFANARVYFESKKSAESKQIKVEKSTEFALKNAKKKIEQDLNNKLKNENDSLKQ 575
Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAKVV---------TNAWWVKAD-----QV----- 334
R YWFEKF WF+SSE YL D + N +++ +D +V
Sbjct: 576 IRPKYWFEKFLWFVSSEGYLCLAGRDNSQIDMIYYRHFNDNDYFISSDIEGSLKVFIKNP 635
Query: 335 --SKTAPTGEFLTTGSFMI---------------------------------------RG 353
++ P + G F I +
Sbjct: 636 FKGESIPPSTLMQAGIFAISASSAWNGKVTTSAWLLHGADISKKDFDGTLISSGNFNYKA 695
Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
KK + PPCQL MG F + +E + +
Sbjct: 696 KKTYLPPCQLIMGFGFYWLGDEETTKK 722
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 88/179 (49%), Gaps = 20/179 (11%)
Query: 31 GYLVQKK--EKRPNSED------YIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
GY+V KK P+++D YI+ EF+P +P + + E + +
Sbjct: 273 GYIVSKKNLSYNPDNDDSTNDLEYIMD--EFYPY--KPYKS-----DMDNYRFTEIQGYN 323
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
+D FFST ES K L+ QQ++ A K+L+ + + + ++ +L Q + +K + I+
Sbjct: 324 RTMDSFFSTIESTKYALRIDQQKQQATKRLDYAREERDKQIQSLLAQQESNIKKGDAIMY 383
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
+ VD ++ + I Q+ W +IE+ ++ Q + +A I L LN+ N I L L
Sbjct: 384 YADLVDQCKDSVVKLIDQQMDWTNIESLIELEQSRGNKIARFI-NLPLNLKENKINLHL 441
>gi|354544800|emb|CCE41525.1| hypothetical protein CPAR2_800770 [Candida parapsilosis]
Length = 661
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 86/207 (41%), Gaps = 63/207 (30%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDV--QTMTN 290
V +D LS+YANA +F+ K++A KQ K + E A K+AEKK Q L K++ + T+
Sbjct: 200 VSIDYTLSSYANASIYFESKKAAESKQAKIEKGAEIAYKNAEKKINQDLVKNLRRENGTS 259
Query: 291 INKARKVYWFEK------------------------FYWFISSENYL------------- 313
N R+ +WFE ++ + S +++L
Sbjct: 260 SNAEREKFWFESFYWFVSSEGYLCLAGRSKSQTDLLYFKYFSDDDFLVSSEIEGSLKVFV 319
Query: 314 -----------------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFM 350
+ AW+ K+ T AW + ++SK +G L G F
Sbjct: 320 KNPLKGESVPPTTILQAGIFAMAASQAWNGKINTAAWVLHGSEISKYNSSGALLPAGEFE 379
Query: 351 IRGKKNFFPPCQLAMGISFLFKLEESS 377
KK+F PP QL MG F ++E S
Sbjct: 380 YLAKKHFLPPAQLVMGFGLYFLVDEGS 406
>gi|344304197|gb|EGW34446.1| hypothetical protein SPAPADRAFT_70556 [Spathaspora passalidarum
NRRL Y-27907]
Length = 865
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 89/207 (42%), Gaps = 62/207 (29%)
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT--------KQTLKDVQT 287
+DL L+++ANA+ +FD K++AA KQ+K +ST+ ALK+AE+K KQ ++
Sbjct: 355 IDLSLTSFANARAYFDTKKTAATKQEKVEKSTKIALKNAERKITHDLTKNLKQENDSLKQ 414
Query: 288 MTNINKARKVYWF-----------------EKFYW--------FISSE----------NY 312
M K +WF + Y+ F+SS+ N
Sbjct: 415 MRPKYWFEKFFWFVTNEGYLCLAGRDNAQVDMIYYRHFSDNDCFVSSDLEGSLKVFVKNP 474
Query: 313 L-------------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRG 353
L + W+ KV T+AW + D++SK G + +G F G
Sbjct: 475 LKGETVPPSSIMQAGIFAMSASSGWNGKVTTSAWVLHGDEISKRDFDGSLIASGEFRYTG 534
Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
KK + PP QL MG ++E S R
Sbjct: 535 KKVYLPPAQLVMGFGLYSLVDEESTKR 561
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 95/195 (48%), Gaps = 10/195 (5%)
Query: 7 VSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNS-EDYIVANMEFHPMILQPGSTPS 65
V++ + C LV + N ++ G++V K+ K ++ + EFHP P
Sbjct: 110 VANALGACEDAYLSLVDSKNENT-GFIVAKRNKASDTNSSFEFIYDEFHPF------KPY 162
Query: 66 VMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSA 125
Q + + E + +D FFST ES K +LK Q ++ A K+L+ K + + ++ +
Sbjct: 163 KANQ-EDYQYTEVSGYNKTLDRFFSTLESSKFELKVEQLKQTAAKRLDKAKSERDKQIQS 221
Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK-QAQRHNDPVASI 184
L + Q ++ +K ELI + + VD I+ + + W +IE ++ + +++N +I
Sbjct: 222 LLEQQDLNAKKGELIQYHADLVDDCRAYIQSFLDQSMDWTNIETVLELEKKKNNKLAKAI 281
Query: 185 IKQLKLNINHITLLL 199
L L N I +LL
Sbjct: 282 GLPLNLKDNKIKVLL 296
>gi|14591254|ref|NP_143331.1| hypothetical protein PH1465 [Pyrococcus horikoshii OT3]
gi|3257889|dbj|BAA30572.1| 650aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length = 650
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 158/373 (42%), Gaps = 106/373 (28%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDALKKLENVKRDH 119
P ++ ++G+ FE+F+ A+DE+F + ++ K ++++R L L +KR
Sbjct: 243 VPIDLKWYEGYEKVYFETFSQALDEYFGKLTIEKAKAEKTKKLEEKRKQL--LATLKRQE 300
Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
E + E+ ++E LI N +D + + + N L W++ + R+++ ++ +
Sbjct: 301 EM-IKGFEKELKKNQEIGNLIYANYTLIDGLLREFSKAVKN-LGWDEFKKRIEEGKKKGN 358
Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD 239
+A ++K ++ N IT+ + E K+ L +D D
Sbjct: 359 KIALMVKGIEPESNSITVEI---------------------------EGKRVKLY-LDKD 390
Query: 240 LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA----- 294
L+ NA+ +++ AKK + ++ KA + ++K + ++++ +
Sbjct: 391 LN--ENAEIYYE----KAKKAKHKLEGARKAYEDLKRKLESIEREIEEEEKKIQVKKIEK 444
Query: 295 RKVYWFEKFYWFISSENYL---GNVA----------------------WDAKVV------ 323
RK WFEKF WFISSE +L G A W A V
Sbjct: 445 RKKKWFEKFRWFISSEGFLVIGGKDATTNEIVVRKYLEENDLYCHADIWGAPHVVIKDGQ 504
Query: 324 -----------------TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKN 356
+ AW WV +QV K AP+GEFL G+FM+ GK+N
Sbjct: 505 KAGEKTIFEACQFAVSMSRAWSEGLYSADAYWVYPNQVKKQAPSGEFLPKGAFMVYGKRN 564
Query: 357 --FFPPCQLAMGI 367
+ P +LA+GI
Sbjct: 565 WMYGIPLKLAVGI 577
>gi|383318475|ref|YP_005379316.1| RNA-binding protein, eukaryotic snRNP-like protein [Methanocella
conradii HZ254]
gi|379319845|gb|AFC98797.1| putative RNA-binding protein, eukaryotic snRNP-like protein
[Methanocella conradii HZ254]
Length = 662
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 109/237 (45%), Gaps = 35/237 (14%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
FESF A+DE+FS + + + V+++ + L E R E ++ E+ + + K
Sbjct: 266 FESFNQALDEYFSRHVAAEAKAEVVERKAEKLGVYERRLRQQEEAIAKFEREEAENVRKG 325
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
E I ++ I IR A SW+DI ++ A++ + AS+I+ +
Sbjct: 326 EAIYAEYNTISEVIGVIRGARAKGYSWDDIRKILRDARKAGNKAASLIQSV--------- 376
Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSA 256
DP A++ K S+ V+V++DL+ NA+ ++D + A
Sbjct: 377 ----------------------DPAANTVNVKLSSVSVNVNIDLTVPQNAQAYYDKAKKA 414
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
K++ +++ E+ K+ K+T ++ + K R W+EK+ WF +S+ +L
Sbjct: 415 RLKKEGALKAIEETKKAMAKETPAPPREPSAKAHPRKPR---WYEKYRWFYTSDGFL 468
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL--AMGISF 369
+ +WV+ +QVSKT GE++ G+F+IRG +N+ ++ A+GI F
Sbjct: 539 DCFWVRPEQVSKTPEHGEYVAKGAFIIRGDRNYVKNVEVRAAVGIRF 585
>gi|149246271|ref|XP_001527605.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146447559|gb|EDK41947.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 701
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 85/209 (40%), Gaps = 65/209 (31%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ----TLKDVQTMT 289
V +DL S++ANA+ +FD K++A + Q K + E A ++AEKK Q +K T
Sbjct: 210 VKIDLTQSSFANARIYFDSKKAAEQLQLKVEKGAEIAYRNAEKKISQDFVRNVKKELGST 269
Query: 290 NINKARKVYWFEKFYW------------------------FISSENYL------GNV--- 316
+ + R WFEKFYW ++ ++YL G++
Sbjct: 270 DSSALRSKLWFEKFYWFVSSEGYLCLAGRDKTQVDMIYFKYVGDDDYLVSSEIEGSLKVF 329
Query: 317 ---------------------------AWDAKVVTNAWWVKADQVSK-TAPTGEFLTTGS 348
AW KV T AW ++A VSK + G L G
Sbjct: 330 IKNPIKDEAIPPSTILQAGIFAMSASHAWSGKVNTAAWVMQASDVSKYDSAAGNLLPPGE 389
Query: 349 FMIRGKKNFFPPCQLAMGISFLFKLEESS 377
F KK+ PP QL MG F ++E S
Sbjct: 390 FEYFAKKDLLPPAQLVMGFGFYCDVDEES 418
>gi|429961918|gb|ELA41462.1| hypothetical protein VICG_01446 [Vittaforma corneae ATCC 50505]
Length = 351
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 32/60 (53%), Positives = 43/60 (71%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
WD V+ ++V+ DQVSK+AP+GE++T GSFMI+GK+N P +L GI LFKLE S
Sbjct: 200 CWDEGVIKPVFYVEPDQVSKSAPSGEYITKGSFMIKGKRNIMNPYRLEYGIGLLFKLEGS 259
>gi|269863550|ref|XP_002651263.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220064852|gb|EED42793.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 335
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 71/170 (41%), Gaps = 60/170 (35%)
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKA------RKVYWFEKFYWF----------- 306
I T + ++ +KTK ++D+Q K R YWFEKF++F
Sbjct: 135 IYKTRRRMREKAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGG 194
Query: 307 ----------------------------------------ISSENYLGNV---AWDAKVV 323
I Y V AWD +V+
Sbjct: 195 KNAQQNDQIVNKYMEDRDLYFHCDVKGASSVICKGSADRNIEDATYFALVYSKAWDEQVI 254
Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
+ ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 255 KDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 304
>gi|282165250|ref|YP_003357635.1| hypothetical protein MCP_2580 [Methanocella paludicola SANAE]
gi|282157564|dbj|BAI62652.1| conserved hypothetical protein [Methanocella paludicola SANAE]
Length = 666
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/261 (23%), Positives = 121/261 (46%), Gaps = 37/261 (14%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P + ++ G+ FESF A+DE++S + + V+++ + L LE + E +
Sbjct: 252 PIELSRYAGYQKVYFESFNKALDEYYSKHIVAEAKAEVVEKKAEKLGVLERRLKQQEDAI 311
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
+ E+ + K ELI +VD I I+ + +SW+DI +K A++ +P AS
Sbjct: 312 AKFEKEEKEYVRKGELIYAEYGAVDDIIKVIKGARSRGISWDDIRKILKDAKKAGNPAAS 371
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP-SLVDVDLDLSA 242
+I+ + DP A++ K P + +++++DL+
Sbjct: 372 MIQSV-------------------------------DPAANTVAVKFPEATININVDLTV 400
Query: 243 YANAKRFFDLKRSAAKKQQ---KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW 299
N++ ++D + K+ K I+ T++A+ A++ ++ + + K RK W
Sbjct: 401 PQNSQTYYDKAKKVQSKKDGALKAIEDTKRAM--AKEMPREKPAEPKKPAVKMKPRKPKW 458
Query: 300 FEKFYWFISSENYLGNVAWDA 320
+EK+ WF +S+ +L DA
Sbjct: 459 YEKYRWFFTSDGFLVIAGRDA 479
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
+ +WV+ +QVSKT +GE++ G+F+IRG +N+ + A+GI F
Sbjct: 543 DCFWVRPEQVSKTPESGEYVAKGAFIIRGDRNYVKNVEARAAVGIRF 589
>gi|150402208|ref|YP_001329502.1| hypothetical protein MmarC7_0281 [Methanococcus maripaludis C7]
gi|150033238|gb|ABR65351.1| protein of unknown function DUF814 [Methanococcus maripaludis C7]
Length = 680
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 96/386 (24%), Positives = 152/386 (39%), Gaps = 114/386 (29%)
Query: 64 PSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
P ++ H+ I+ +ESF A+DE+FS I K ++Q L+KL VK+
Sbjct: 262 PIDLKMHENLKENEIKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQER 315
Query: 121 TRLSALEQTQ------LVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
S LE + +++ ++ +LI N VD + I+ ++ W I+ +K+
Sbjct: 316 ILKSQLETKEKYEKQSILNHKRGDLIYANYSLVDEIVSTIKL-AREKMDWNGIKNVIKEN 374
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
+ H PV S I +N+N N + + L D + + + V
Sbjct: 375 KTH--PVLSKI----INVNE-----------KNAELTLNLSADYGNGLIEDT-------V 410
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----------KQTLKD 284
VDL +A+ NA D+ +KK + + +ALK +EKK + LK+
Sbjct: 411 PVDLRKNAFENA----DIVYQKSKKFKNKVHGVIEALKISEKKLAELKEKEKLDSEVLKE 466
Query: 285 VQTMTNINKARKVYWFEKFYWF-------------------------------------- 306
+ + + + W+EK W
Sbjct: 467 KEENIKKKERKVLKWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLMEGA 526
Query: 307 ------------ISSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFL 344
I EN L VA A + AW WV+ DQ+SKTA +GEFL
Sbjct: 527 PFTIIRTEGSEEIPDENILFEVAKFAASHSRAWKLGIGSADVYWVRPDQISKTAESGEFL 586
Query: 345 TTGSFMIRGKKNFFPPCQLAMGISFL 370
G+F+IRGK+NF L +GI L
Sbjct: 587 KKGAFVIRGKRNFIRSAALELGIGML 612
>gi|269862592|ref|XP_002650899.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220065446|gb|EED43157.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 480
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 344 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 400
>gi|269862824|ref|XP_002650989.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220065304|gb|EED43067.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 506
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 423 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 479
>gi|14520906|ref|NP_126381.1| hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
gi|5458123|emb|CAB49612.1| Hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
gi|380741455|tpe|CCE70089.1| TPA: hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
Length = 649
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 83/377 (22%), Positives = 153/377 (40%), Gaps = 114/377 (30%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD---- 118
P ++ ++G+ FE+F+ A+DE+F K+ ++ ++ER +KLE K+
Sbjct: 242 VPIELKWYEGYERVYFETFSQALDEYFG-----KLTIEKAKEERT--RKLEEKKKQLMAT 294
Query: 119 ---HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
E + E+ ++E +LI N +D + + + + W + + R+++ +
Sbjct: 295 LERQERMIKGFEEEARKNQEIGDLIYANYTIIDGILREFSKAV-EKFGWNEFKKRLEEGK 353
Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
+ + +A ++K + + IT+ L K +
Sbjct: 354 KQGNKIALLVKNVNPEEDSITIELEGRK------------------------------IK 383
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK-----TKQTLKDVQTMTN 290
+ L+ S NA+ +++ AKK + ++ +KA + ++K + ++ +
Sbjct: 384 LYLNRSINDNAELYYE----KAKKAKHKLEGAKKAYEELKRKLEQIEKEIEEEEKKIQVK 439
Query: 291 INKARKVYWFEKFYWFISSENYL---GNVA----------------------WDAKVV-- 323
+ RK WFEKF WFISSE +L G A W A V
Sbjct: 440 KIEKRKKKWFEKFRWFISSEGFLVIGGKDATTNEIVVRKYMQENDIYCHADIWGAPHVII 499
Query: 324 ---------------------TNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
+ AW WV +QV K AP+GEFL G+FM+
Sbjct: 500 KDGQKASERTIFEACQFAVSMSRAWSEGLYSGDAYWVYPEQVKKQAPSGEFLPKGAFMVY 559
Query: 353 GKKNFFP--PCQLAMGI 367
GK+N+ P +LA+G+
Sbjct: 560 GKRNWMHGIPLKLAVGV 576
>gi|269867209|ref|XP_002652521.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220062310|gb|EED41535.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 265
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 127 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 183
>gi|45358591|ref|NP_988148.1| hypothetical protein MMP1028 [Methanococcus maripaludis S2]
gi|44921349|emb|CAF30584.1| unnamed protein product [Methanococcus maripaludis S2]
Length = 680
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 160/403 (39%), Gaps = 126/403 (31%)
Query: 58 LQPGST----------PSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQ 104
L+P ST P ++ H+ I+ +ESF A+DE+FS I K ++Q
Sbjct: 246 LKPKSTLKNGEFVNIDPIDLKIHENLKENEIKHYESFLTALDEYFSRF----IMKKEIKQ 301
Query: 105 ERDALKKLENVKRDHETRLSAL------EQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
L+KL VK+ S L E+ + + ++ +LI N VD + I+ D
Sbjct: 302 AESKLQKL--VKKQERILKSQLDTKDKYEKQSVSNHKRGDLIYANYSLVDEIVSTIK-DA 358
Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDN 218
++ W I+ +K+ + H P+ S I +N+N N + + L D
Sbjct: 359 REKMDWNGIKNVIKENKTH--PILSKI----INVNE-----------KNAELTLKLSADY 401
Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
+ + S V VDL +A+ NA D+ +KK + +Q +ALK +EKK
Sbjct: 402 GNGLIEDS-------VPVDLRKNAFENA----DIVYQKSKKFKNKVQGVIEALKISEKKL 450
Query: 279 KQTLKDVQTMTN-----------INKARKVYWFEKFYWFI-------------------- 307
+ LK+ + + + + + + W+EK W +
Sbjct: 451 AE-LKEKEKLDSEVFKEKEEKIKKKERKVLKWYEKLKWTVIGGYLIVAGKDATTNEMLIK 509
Query: 308 ------------------------------SSENYLGNVAWDAKVVTNAW---------- 327
EN L VA A + AW
Sbjct: 510 RYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFAASHSRAWKLGIGSADVY 569
Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
WV+ DQ+SKTA +GE+L G+F+IRGK+NF L +GI +
Sbjct: 570 WVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGII 612
>gi|240103770|ref|YP_002960079.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
EJ3]
gi|239911324|gb|ACS34215.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
EJ3]
Length = 650
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 79/342 (23%), Positives = 150/342 (43%), Gaps = 43/342 (12%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ ++G + F++F+ A+DE+F K ++ ++ K+L R E
Sbjct: 242 VPIELKIYEGLEKRYFKTFSEALDEYFGKLTIEKAKIEKTRKLESKKKQLLATLRKQEEM 301
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L E+ ++E +LI N V+ + R+ +L WE+ + R++ ++ + VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYAMVERLLDEFRK-ATEKLGWEEFKRRIEAGKKEGNKVA 360
Query: 183 SIIKQLKLNINHITLLLRVHKER----SNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
++K + +T+ L K + +I ++ L+++ A F E + D
Sbjct: 361 LMVKAIDPKEKTVTIELEGRKVKLYLNKSIGENAELYYEKAKKFRHKYEGALKAYEDTRR 420
Query: 239 DLSAYAN-----AKRFFDLKRSAAKKQQ-----KTIQSTEKALKSAEK--KTKQTL---- 282
L K+ ++KR +K++ + S+E L A K T +TL
Sbjct: 421 KLDEVEKLIEEEMKKELNVKRIERRKKKWFEKFRWFISSEGFLVLAGKDASTNETLIKKH 480
Query: 283 ---------KDVQTMTNI-----NKARKVYWFEKFYWFISSENYLGNVAWDAKVV-TNAW 327
DV ++ KA + FE + +S AW + +A+
Sbjct: 481 MSDNDLYCHADVYGAPHVVIKDGQKAGEKTIFEACQFAVSMSR-----AWSQGLYGADAY 535
Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
W +QV+K AP+GE+L G+FM+ GK+N+ P +LA+G+
Sbjct: 536 WAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGV 577
>gi|224108804|ref|XP_002314973.1| predicted protein [Populus trichocarpa]
gi|222864013|gb|EEF01144.1| predicted protein [Populus trichocarpa]
Length = 235
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 94/177 (53%), Gaps = 23/177 (12%)
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
AE I +N + VD+AILA+ +A + WED+ VK ++ +P+A +I +L N +
Sbjct: 2 AEFIEHNLQGVDSAILAVPVALAKGIGWEDLARMVKDEKKAGNPIAGLIDKLHFEKNCMA 61
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
LL+ + I W++ + + S++ANA+R+++LK+
Sbjct: 62 LLIAI------ISMKWMMMKRHFQCIS-----------------SSHANAQRWYELKKKQ 98
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
KQ+KT + +KA K+AEKK L +++ I+ KV+W EKF WFI + NYL
Sbjct: 99 ECKQEKTFTAHKKAFKAAEKKIHLQLSQEKSVATISHMHKVHWLEKFNWFIGTWNYL 155
>gi|269865041|ref|XP_002651784.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220063882|gb|EED42272.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 243
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163
>gi|344230527|gb|EGV62412.1| hypothetical protein CANTEDRAFT_126343 [Candida tenuis ATCC 10573]
Length = 969
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 84/207 (40%), Gaps = 62/207 (29%)
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDVQTMTN-INK 293
+DL LS YANA FFD K+SA KQ K ++T AL++AE+K L K +Q + +NK
Sbjct: 459 IDLSLSPYANASTFFDSKKSAEVKQLKVEKNTGIALQNAERKITHDLTKALQNESEALNK 518
Query: 294 ARKVYWFEKF--------YWFISSENYLGN------------------------------ 315
R+ +WFEKF Y +S ++ L N
Sbjct: 519 VREKFWFEKFYWFVTSDGYLCLSGKDDLQNDMIYYRYFNDDDFFVYSDIEGALKVFIKNP 578
Query: 316 ---------VAWDAKVVT-------------NAWWVKADQVSKTAPTGEFLTTGSFMIRG 353
W A + + +AW++ VSK G L G F +G
Sbjct: 579 YKGETVPPSTIWQAGMFSLSNSESWSNKSSSSAWYLPGPGVSKKDIDGSLLRPGKFNFKG 638
Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
KK PP QL MG F ++ + R
Sbjct: 639 KKEHMPPVQLVMGFGIYFVGDDETTKR 665
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 85/177 (48%), Gaps = 18/177 (10%)
Query: 30 QGYLVQKKEKRPNSEDYIVANM-----EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
+G +V KK N+ + N+ EFHP +P S+ + + +
Sbjct: 227 EGIIVSKK--NTNATEETAENLQYLFDEFHP--FRPFKDGSL------YKFTSIQGYNKT 276
Query: 85 VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
+D+FFST ES K ++K Q++ A+K+L+ K + ++ +L + + +K +LII N
Sbjct: 277 LDQFFSTLESLKNEIKIENQKQLAMKRLDKAKNERVKQIESLINEKNANIKKGDLIILNA 336
Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
V I I + Q+ W DIE ++ + D + + I QL LN+ N I L L
Sbjct: 337 NLVSGCIDFINGMLEKQMDWHDIEKYIELQKSSGDDITNAI-QLPLNLLENKIKLNL 392
>gi|269864556|ref|XP_002651614.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220064197|gb|EED42442.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 320
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 170 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 226
>gi|340624350|ref|YP_004742803.1| fibronectin-binding A domain-containing protein [Methanococcus
maripaludis X1]
gi|339904618|gb|AEK20060.1| Fibronectin-binding A domain protein [Methanococcus maripaludis X1]
Length = 680
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 150/373 (40%), Gaps = 113/373 (30%)
Query: 75 IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSAL------EQ 128
I+ +ESF A+DE+FS I K ++Q L+KL VK+ S L E+
Sbjct: 276 IKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQERILKSQLDTKDKYEK 329
Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
+ + ++ +LI N VD + I+ D ++ W I+ +K+ + H P+ S I
Sbjct: 330 QSISNHKRGDLIYANYSLVDEIVSTIK-DAREKMDWNGIKNVIKENKTH--PILSKI--- 383
Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKR 248
+N+N N + + L D + + S V VDL +A+ NA
Sbjct: 384 -INVNE-----------KNAELTLKLSADYGNGLIEDS-------VPVDLRKNAFENA-- 422
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----------INKARKV 297
D+ +KK + +Q +ALK +EKK + LK+ + + + + + +
Sbjct: 423 --DIVYQKSKKFKNKVQGVIEALKISEKKLAE-LKEKEKLDSEVFKEKEEKIKKKERKVL 479
Query: 298 YWFEKFYWFI-------------------------------------------------- 307
W+EK W +
Sbjct: 480 KWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEI 539
Query: 308 SSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
EN + VA A + AW WV+ DQ+SKTA +GE+L G+F+IRGK+NF
Sbjct: 540 PDENIMFEVAKFAASHSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNF 599
Query: 358 FPPCQLAMGISFL 370
L +GI +
Sbjct: 600 IRSAALELGIGII 612
>gi|159111661|ref|XP_001706061.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
50803]
gi|157434154|gb|EDO78387.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
50803]
Length = 1063
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 45/61 (73%)
Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
W KV T ++WV ADQVSKTAP G ++ TGSF+IRGK+NF P L +G++ L++ + ++
Sbjct: 691 WAQKVGTQSYWVYADQVSKTAPAGMYIGTGSFVIRGKRNFIPQQPLELGVALLWRYDTAN 750
Query: 378 I 378
+
Sbjct: 751 V 751
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 11/107 (10%)
Query: 226 SETKKPSLVDVDLDLSAYAN------AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
S+ +P ++ VD+ A AK F+ ++A +K ++T+ + EKK
Sbjct: 516 SDPTEPIVISVDVPFKGAAGTTAHIIAKTLFEAAKAAEEKCKRTLGHSSAYFDKVEKKAT 575
Query: 280 QTLKDVQTMTN-----INKARKVYWFEKFYWFISSENYLGNVAWDAK 321
+ V T+ + R WFEKF+WF S++ YL DA+
Sbjct: 576 ADIDSVMKETDAELIALQHQRSPLWFEKFHWFFSTDGYLVLSGRDAQ 622
>gi|308160802|gb|EFO63274.1| Serologically defined colon cancer antigen 1 [Giardia lamblia P15]
Length = 1063
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 44/61 (72%)
Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
W KV T ++WV ADQVSKTAP G ++ TGSF+IRGK+NF P L +G++ L++ + +
Sbjct: 691 WTQKVGTQSYWVYADQVSKTAPAGMYIGTGSFVIRGKRNFIPQQPLELGVALLWRYDTPN 750
Query: 378 I 378
+
Sbjct: 751 V 751
Score = 44.7 bits (104), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 46/103 (44%), Gaps = 11/103 (10%)
Query: 230 KPSLVDVDLDLSAYAN------AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
+P ++ VD+ A AK F+ ++A +K ++T+ + EKK +
Sbjct: 520 EPIIISVDVPFKGTAGTNAHTIAKTLFEAAKAAEEKCKRTLGHSSAYFDKVEKKATADID 579
Query: 284 DVQTMTN-----INKARKVYWFEKFYWFISSENYLGNVAWDAK 321
V T+ + R WFEKF+WF S+ YL DA+
Sbjct: 580 SVMKETDAELIALQHQRSPLWFEKFHWFFSTNGYLVLSGRDAQ 622
>gi|428179079|gb|EKX47951.1| hypothetical protein GUITHDRAFT_106038 [Guillardia theta CCMP2712]
Length = 841
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 102/396 (25%), Positives = 168/396 (42%), Gaps = 100/396 (25%)
Query: 30 QGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFF 89
+GY++ + +++ +++ EF P++L +Q + + E+ S A+DEFF
Sbjct: 380 KGYVILDRISDSSNQTLVIS--EFSPILL---------KQQEDKMVLEYPSIDVAMDEFF 428
Query: 90 STAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDA 149
ST + ++ A + KK+ K+D ++ L+Q +L++ +KA L+ N +D
Sbjct: 429 STIDFNRDQKDANEAVETVSKKVNKAKKDIKSHTEGLKQEELLNHKKATLLELNSFHIDE 488
Query: 150 AILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIK 209
A+ IR + K++ N +L +H+ S
Sbjct: 489 ALDKIRGLV------------------------------KIHRNAANVLHEIHEMNSTA- 517
Query: 210 DSWVLHHDNADPFADSSETKKPSLVDV----DLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
S+ L + +S K D+ D +S+ ANA+ FF K+ A KQQ+ +
Sbjct: 518 -SFRLPQEG---IVESEAVKSRGATDITLVLDYSISSLANARNFFQKKKKVAAKQQRAEE 573
Query: 266 STEKALK------SAEKKTKQTLKDVQTMTNINK---ARKVYWFEKFYWFISSENYL--- 313
+ +LK S K TK + D Q+ ++ R+ +WFEKF+WFISS+ L
Sbjct: 574 MADISLKNTQIKASQRKNTKASKNDFQSKSSSIGISSVRRKFWFEKFFWFISSDQILVIA 633
Query: 314 ------------------------------------GNVAWDAKVV--TNAWWVKADQVS 335
+ AW +K T A+WV DQVS
Sbjct: 634 GKDAQQNELLVKRNELKERKVLPENTILQAAEFAVCRSSAWKSKTASGTAAYWVYPDQVS 693
Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
K +GE+L+ G F+IRGKKNF L MG F
Sbjct: 694 KAPQSGEYLSKGGFVIRGKKNFVSISTLCMGFGIFF 729
>gi|269863594|ref|XP_002651278.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220064823|gb|EED42778.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 262
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163
>gi|269864527|ref|XP_002651604.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220064216|gb|EED42452.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 257
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163
>gi|269862884|ref|XP_002651013.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220065270|gb|EED43045.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 191
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 41 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 97
>gi|269864419|ref|XP_002651566.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220064286|gb|EED42490.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 290
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163
>gi|269862032|ref|XP_002650678.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220065783|gb|EED43376.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 166
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 43/57 (75%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
AWD +V+ + ++V +DQVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 16 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 72
>gi|366991987|ref|XP_003675759.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
gi|342301624|emb|CCC69395.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
Length = 1020
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 89/179 (49%), Gaps = 15/179 (8%)
Query: 30 QGYLVQKKEKRPNSE------DYIVANME-FHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
+G+++ KK + NSE +Y+ F P + S + + + G +
Sbjct: 276 EGFILAKKNENYNSERDNSDLEYVYETFHPFRPYVDDKNSANTHIIEIDG-------QYN 328
Query: 83 AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
+D FFST ES K L+ QE A KK+E+ + ++E R+ AL Q ++EK LII
Sbjct: 329 KTLDNFFSTIESSKYALRIQAQELQAKKKIEDARLENERRIQALVDVQASNEEKGHLIIA 388
Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRV 201
N + V+ A LAI+ + Q+ W IE + QR + +A +I QL LN+ +R+
Sbjct: 389 NADLVEEAKLAIQGLLDQQMDWNTIEKLILSEQRKGNMIAQLI-QLPLNLKENKFTVRL 446
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 95/214 (44%), Gaps = 69/214 (32%)
Query: 229 KKPSLVD-------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
KKP+ VD +DL L+AYANA +F++K+++ +KQ++ ++ EKA+K+ E++ +
Sbjct: 507 KKPATVDKNTVKVTIDLGLTAYANASEYFNIKKTSVEKQKRMEKNIEKAMKNIEERVDKQ 566
Query: 282 LKDVQTMTN--INKARKVYWFEKFYWFISSENYLGNVA---------------------- 317
LK ++ + + R Y+FEK+ WFISSE +L +
Sbjct: 567 LKKKLKDSHDVLKRIRTPYFFEKYNWFISSEGFLVMMGKNGVETDQIYSKFIEDNDIYMS 626
Query: 318 -------W----DAKVV---------------TNAW---------WVKADQVSKTAPTGE 342
W D K V + AW W A VSK +
Sbjct: 627 NHFNTQVWIKNPDGKEVPPNTLMQAGILCMSSSEAWSKKIAASAWWCFAKNVSKFDDVDQ 686
Query: 343 -FLTTGSFMIRGKK--NFFPPCQLAMGISFLFKL 373
L+ G F ++ +K N P QL MG F++K+
Sbjct: 687 SVLSPGVFRLKNEKDRNELPASQLVMGFGFMWKV 720
>gi|242399100|ref|YP_002994524.1| fibronectin-binding protein [Thermococcus sibiricus MM 739]
gi|242265493|gb|ACS90175.1| Predicted fibronectin-binding protein [Thermococcus sibiricus MM
739]
Length = 650
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 89/377 (23%), Positives = 156/377 (41%), Gaps = 113/377 (29%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ ++ + + F +F+ A+DE+F KI L++ + ER KKL+N KR E
Sbjct: 242 VPIELKWYESYKKKFFTTFSEALDEYFG-----KILLESAKIERT--KKLQNKKRQLEAT 294
Query: 123 LSALEQTQLV---------DKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
L +Q +++ ++E +LI N ++ + + + + +L W++ + RV+
Sbjct: 295 LR--KQEEMINGFKNQIQENQEIGDLIYTNFAFIENLLKELSKAV-EKLGWKEFKERVEN 351
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
++ + +A IIK + +T+ L K
Sbjct: 352 GKKSGNKIAQIIKNIDAKEKAVTIELDGKK------------------------------ 381
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKK---QQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
V + L+ S NA+ +++ + A K QK + T K +K EK ++ K ++
Sbjct: 382 VKLYLNKSVGENAEIYYEKAKKAKHKLEGAQKAHKETLKKIKEIEKLIEEEEKKELSVRK 441
Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
+ K RK WFEKF WF+SSE +L DA
Sbjct: 442 LEK-RKKKWFEKFRWFLSSEGFLIIAGKDATTNEIVVKRYMSENDLYCHADIYGAPHVVI 500
Query: 323 --------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
++ AW W +QV+K AP+GE+L G+FM+
Sbjct: 501 KDGKKAGEKTLFEACQFAVSMSRAWKEGLYSGDAYWTDPNQVTKKAPSGEYLGKGAFMVY 560
Query: 353 GKKNFFP--PCQLAMGI 367
GK+N+ P +LA+GI
Sbjct: 561 GKRNWMHGLPVKLAVGI 577
>gi|385773877|ref|YP_005646444.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|323477992|gb|ADX83230.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 609
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 64/206 (31%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
+++D LSA NA RFFD + +K ++ ++S ++ + EK KQ ++ + +
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV-- 392
Query: 294 ARKVYWFEKFYWFIS------------SEN------YL--------GNVA---------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQ 452
Query: 318 ----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI G
Sbjct: 453 ENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYG 512
Query: 354 KKNFFPPCQLAMGISFLFKLEESSIS 379
KKNF +L + I + L E+S+S
Sbjct: 513 KKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|238620391|ref|YP_002915217.1| hypothetical protein M164_1946 [Sulfolobus islandicus M.16.4]
gi|238381461|gb|ACR42549.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.4]
Length = 609
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 64/206 (31%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
+++D LSA NA RFFD + +K ++ ++S ++ + EK KQ ++ + +
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV-- 392
Query: 294 ARKVYWFEKFYWFIS------------SEN------YL--------GNVA---------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQ 452
Query: 318 ----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI G
Sbjct: 453 ENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYG 512
Query: 354 KKNFFPPCQLAMGISFLFKLEESSIS 379
KKNF +L + I + L E+S+S
Sbjct: 513 KKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|227828200|ref|YP_002829980.1| hypothetical protein M1425_1938 [Sulfolobus islandicus M.14.25]
gi|229585429|ref|YP_002843931.1| hypothetical protein M1627_2016 [Sulfolobus islandicus M.16.27]
gi|227459996|gb|ACP38682.1| protein of unknown function DUF814 [Sulfolobus islandicus M.14.25]
gi|228020479|gb|ACP55886.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.27]
Length = 609
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 64/206 (31%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
+++D LSA NA RFFD + +K ++ ++S ++ + EK KQ ++ + +
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV-- 392
Query: 294 ARKVYWFEKFYWFIS------------SEN------YL--------GNVA---------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQ 452
Query: 318 ----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI G
Sbjct: 453 ENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYG 512
Query: 354 KKNFFPPCQLAMGISFLFKLEESSIS 379
KKNF +L + I + L E+S+S
Sbjct: 513 KKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|284998447|ref|YP_003420215.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284446343|gb|ADB87845.1| protein of unknown function DUF814 [Sulfolobus islandicus L.D.8.5]
Length = 609
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)
Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
V+++LD LSA NA RFFD + +K ++ ++S ++ + EK KQ ++ + +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392
Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450
Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510
Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
GKKNF +L + I + L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|333910763|ref|YP_004484496.1| fibronectin-binding A domain-containing protein [Methanotorris
igneus Kol 5]
gi|333751352|gb|AEF96431.1| Fibronectin-binding A domain protein [Methanotorris igneus Kol 5]
Length = 675
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 83/376 (22%), Positives = 149/376 (39%), Gaps = 98/376 (26%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P +++++ F +E+ F A+D++F+ + K + + +KK E + +
Sbjct: 261 VPINLKKYENFEKKEYGEFLEALDDYFAQFMAKVETKKEESKLQKLIKKQERILKTQLET 320
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L E+ ++EK +LI N VD IL ++ ++ W I+ V + + H P+
Sbjct: 321 LEKYEKQMQENQEKGDLIYANYTLVD-EILNTLRNAREKMEWYKIKKIVNEHKDH--PIL 377
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I+ + I + L AD + K V +D+ +A
Sbjct: 378 GLIQNINEKNGEIVIKLS----------------------ADYGDKKIEKNVSLDIRKNA 415
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD----------VQTMTNIN 292
+ NA+ ++ + +KK + I+ ++A+K +EKK + LK+ +
Sbjct: 416 FENAETYY----TKSKKLKSKIEGIKEAIKLSEKKLAE-LKEKGEIELKELKEKEKIKKK 470
Query: 293 KARKVYWFEKFYWF---------------------------------------------- 306
+ ++ W+EKF W
Sbjct: 471 ERKERKWYEKFKWTVINGFLVIAGKDAVTNELLIKKYTEDDDIVFHAQIEGAPFTVIKTN 530
Query: 307 --ISSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGK 354
I E L VA + + AW WVK +QVSKTA +GE+L G+F+IRGK
Sbjct: 531 KRIVDEETLNEVAKFSVAHSRAWKLGWGALDTYWVKPEQVSKTAESGEYLKKGAFVIRGK 590
Query: 355 KNFFPPCQLAMGISFL 370
+NF L +GI +
Sbjct: 591 RNFIRNVPLELGIGII 606
>gi|385776519|ref|YP_005649087.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323475267|gb|ADX85873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 609
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)
Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
V+++LD LSA NA RFFD + +K ++ ++S ++ + EK KQ ++ + +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392
Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450
Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510
Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
GKKNF +L + I + L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|229579837|ref|YP_002838236.1| hypothetical protein YG5714_2060 [Sulfolobus islandicus Y.G.57.14]
gi|228010552|gb|ACP46314.1| protein of unknown function DUF814 [Sulfolobus islandicus
Y.G.57.14]
Length = 609
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)
Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
V+++LD LSA NA RFFD + +K ++ ++S ++ + EK KQ ++ + +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392
Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450
Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510
Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
GKKNF +L + I + L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|227830959|ref|YP_002832739.1| hypothetical protein LS215_2101 [Sulfolobus islandicus L.S.2.15]
gi|227457407|gb|ACP36094.1| protein of unknown function DUF814 [Sulfolobus islandicus L.S.2.15]
Length = 609
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)
Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
V+++LD LSA NA RFFD + +K ++ ++S ++ + EK KQ ++ + +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392
Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450
Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510
Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
GKKNF +L + I + L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|253745574|gb|EET01418.1| Serologically defined colon cancer antigen 1 [Giardia intestinalis
ATCC 50581]
Length = 1065
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 44/61 (72%)
Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
W K+ ++WV ADQVSKTAP G ++ TGSF+IRGK+NF P L +G++ L++ + ++
Sbjct: 693 WAQKIGAQSYWVYADQVSKTAPPGMYIGTGSFVIRGKRNFIPQQPLELGVALLWRYDAAN 752
Query: 378 I 378
+
Sbjct: 753 V 753
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 49/112 (43%), Gaps = 10/112 (8%)
Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANA----KRFFDLKRSAAKKQQKTIQSTEKALKSA 274
A P ++ +E S VDV SA NA F+ + A +K ++T+ +
Sbjct: 514 AKPISEPTEAIIIS-VDVPFKGSAGTNAHTIANTLFEAAKEAEQKCERTLGHSSAYFNKV 572
Query: 275 EKKTKQTLKDVQTMTN-----INKARKVYWFEKFYWFISSENYLGNVAWDAK 321
EKK + T+ + R WFEKF+WF S++ YL DA+
Sbjct: 573 EKKATAEIDSAIKETDAKLIALQHQRPPLWFEKFHWFFSTDGYLVLSGRDAQ 624
>gi|402470262|gb|EJW04606.1| hypothetical protein EDEG_01190 [Edhazardia aedis USNM 41457]
Length = 393
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 39/56 (69%)
Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
W KV N ++VK+DQVSK A +GE+L GSFMIRGKKN+ +L GI +FK+
Sbjct: 147 WKEKVTGNVYYVKSDQVSKKAQSGEYLKAGSFMIRGKKNYVDVYRLEYGIGIVFKI 202
>gi|15897146|ref|NP_341751.1| hypothetical protein SSO0195 [Sulfolobus solfataricus P2]
gi|13813331|gb|AAK40541.1| Membrane conserved hypothetical protein [Sulfolobus solfataricus
P2]
Length = 609
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 87/207 (42%), Gaps = 66/207 (31%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
+++D LSA NA RFFD + +K ++ ++S E+ + K KQ +++ I
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALKSLEELKEKLAKIEKQ---EIEKQNEIKL 391
Query: 293 KARKVYWFEKFYWFISSENY---LGNVA-------------------------------- 317
RK W+EK+ W IS Y LG A
Sbjct: 392 TLRKKEWYEKYRWSISRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIIT 451
Query: 318 -----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 452 QDNKTISEEDIYDAAVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIY 511
Query: 353 GKKNFFPPCQLAMGISFLFKLEESSIS 379
GKKNF +L + I + L E+ +S
Sbjct: 512 GKKNFIKNVKLQLAIGLI--LSENGVS 536
>gi|15679889|ref|NP_277007.1| hypothetical protein MTH1907 [Methanothermobacter
thermautotrophicus str. Delta H]
gi|2623041|gb|AAB86367.1| conserved protein [Methanothermobacter thermautotrophicus str.
Delta H]
Length = 655
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 75/288 (26%), Positives = 130/288 (45%), Gaps = 46/288 (15%)
Query: 51 MEFHPMILQPGS--TPSVMRQHQGFPIQEFESFAAAVDEFFSTA---ESHKIDLKAVQQE 105
++ P I+ G P +R + G + FESF A DEFFS+ E + + ++E
Sbjct: 236 LDLKPHIIGDGEDVLPIELRVYSGRERRYFESFNDAADEFFSSIFREEIRRAHEEEWERE 295
Query: 106 RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWE 165
D +K ++R+ L ++T V + +L+ N V+ + IR+ + SW+
Sbjct: 296 VDRFRKRLRIQRET---LEKFKKTIEVSTRRGDLLYANYSLVEEVLATIRR-AREKYSWD 351
Query: 166 DIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
+I+ + A++ P AS I ++ + +IT+ L + +P
Sbjct: 352 EIKNIIADARKRGLPEASNITEID-RMGNITIFL------------------DGEP---- 388
Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT---IQSTEKALKSAEKKTKQTL 282
V +D L NA+ +++ + A +K + I+ TEK ++ EKK L
Sbjct: 389 --------VRIDSKLGVPENAEVYYEKAKKAKRKIKGVMTAIEKTEKEIERIEKKRDDAL 440
Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
+++ K RK+ WFEKF WF+SS+ +L DA TN VK
Sbjct: 441 RNIMVPRRRVK-RKLRWFEKFRWFVSSDGFLVIGGRDAG--TNEMVVK 485
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 32/45 (71%), Gaps = 2/45 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ +WV +QVSKT +GEF+ G+F+IRG +N+ P ++A+G+
Sbjct: 541 DVYWVHPEQVSKTPRSGEFVARGAFIIRGSRNYLRGVPLKIAIGV 585
>gi|284174391|ref|ZP_06388360.1| hypothetical protein Ssol98_07002 [Sulfolobus solfataricus 98/2]
gi|384433658|ref|YP_005643016.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|261601812|gb|ACX91415.1| protein of unknown function DUF814 [Sulfolobus solfataricus 98/2]
Length = 609
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 87/207 (42%), Gaps = 66/207 (31%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
+++D LSA NA RFFD + +K ++ ++S E+ + K KQ +++ I
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALKSLEELKEKLAKIEKQ---EIEKQNEIKL 391
Query: 293 KARKVYWFEKFYWFISSENY---LGNVA-------------------------------- 317
RK W+EK+ W IS Y LG A
Sbjct: 392 TLRKKEWYEKYRWSISRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIIT 451
Query: 318 -----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 452 QDNKTISEEDIYDAAVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIY 511
Query: 353 GKKNFFPPCQLAMGISFLFKLEESSIS 379
GKKNF +L + I + L E+ +S
Sbjct: 512 GKKNFIKNVKLQLAIGLI--LSENGVS 536
>gi|440491782|gb|ELQ74392.1| putative RNA-binding protein [Trachipleistophora hominis]
Length = 886
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 44/60 (73%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
W ++V+++++V DQVSK+AP+GEFL+ G FM++GKKN+ +L I+ LF LE++
Sbjct: 730 CWKDRIVSDSYYVNEDQVSKSAPSGEFLSKGGFMVKGKKNYVHNVRLEYAIALLFALEKN 789
>gi|147771936|emb|CAN75697.1| hypothetical protein VITISV_035984 [Vitis vinifera]
Length = 431
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 61/105 (58%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ Q + +FE+F AA DEF+S E + + + +E A++KL + D E R
Sbjct: 245 CPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSEQQQKAKEVXAMQKLSKICMDQENR 304
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
+ L++ + AELI N E VDAAILA+R +AN ++WED+
Sbjct: 305 VHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDL 349
>gi|449066809|ref|YP_007433891.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
gi|449069082|ref|YP_007436163.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
Ron12/I]
gi|449035317|gb|AGE70743.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
gi|449037590|gb|AGE73015.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
Ron12/I]
Length = 588
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 81/199 (40%), Gaps = 67/199 (33%)
Query: 234 VDVDLDLSAYANAKRFFDLKRS---AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
+D+D L+ Y NA +++DL + AKK + ++ K L + K + ++++
Sbjct: 320 IDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERKEEIRI--- 376
Query: 291 INKARKVYWFEKFYWFIS------------------------------------------ 308
RK W+EK++W I+
Sbjct: 377 --SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIFLHADIQGAAATVL 434
Query: 309 -------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
SE+ + + A+ A + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 435 KANSGQVSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSKSPPSGEYLAKGSFMI 494
Query: 352 RGKKNFFPPCQLAMGISFL 370
G+KNF +L + I +
Sbjct: 495 YGRKNFIKNVKLELAIGIM 513
>gi|70606588|ref|YP_255458.1| hypothetical protein Saci_0795 [Sulfolobus acidocaldarius DSM 639]
gi|68567236|gb|AAY80165.1| conserved Prokaryal protein [Sulfolobus acidocaldarius DSM 639]
Length = 594
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 81/199 (40%), Gaps = 67/199 (33%)
Query: 234 VDVDLDLSAYANAKRFFDLKRS---AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
+D+D L+ Y NA +++DL + AKK + ++ K L + K + ++++
Sbjct: 326 IDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERKEEIRI--- 382
Query: 291 INKARKVYWFEKFYWFIS------------------------------------------ 308
RK W+EK++W I+
Sbjct: 383 --SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIFLHADIQGAAATVL 440
Query: 309 -------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
SE+ + + A+ A + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 441 KANSGQVSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSKSPPSGEYLAKGSFMI 500
Query: 352 RGKKNFFPPCQLAMGISFL 370
G+KNF +L + I +
Sbjct: 501 YGRKNFIKNVKLELAIGIM 519
>gi|429964304|gb|ELA46302.1| hypothetical protein VCUG_02190 [Vavraia culicis 'floridensis']
Length = 943
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 41/60 (68%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
W ++V ++++V +QVSK+AP+GEFLT G FM++GKKN+ +L + LF LE+
Sbjct: 742 CWKDRIVCDSYYVNEEQVSKSAPSGEFLTKGGFMVKGKKNYVHNVRLEYAVGLLFALEKG 801
>gi|333987711|ref|YP_004520318.1| fibronectin-binding A domain-containing protein [Methanobacterium
sp. SWAN-1]
gi|333825855|gb|AEG18517.1| Fibronectin-binding A domain protein [Methanobacterium sp. SWAN-1]
Length = 663
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 94/385 (24%), Positives = 159/385 (41%), Gaps = 104/385 (27%)
Query: 52 EFHPMIL---QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA 108
+FHP I+ + P + +++ + + FE+F A DEF+S+ D+K V ++ A
Sbjct: 234 QFHPQIISSEKDDVLPLNLTKYEKYEKKTFETFNQAADEFYSSIVGD--DIKQVHEDVWA 291
Query: 109 --LKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
+ K E + L + T + K K E I +N +++ IL I + SW D
Sbjct: 292 AEVGKFEKRLKIQMETLEKFKDTIVKTKIKGEAIYSNYQNIQN-ILDIIHNARETYSWLD 350
Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
I +K+ ++ II+ L K+ + +TL N D
Sbjct: 351 IIDIIKKGKKEKVSGLDIIESLDKMGV--LTL--------------------NLDG---- 384
Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEK-KTKQTLKD 284
++V+VD ++S NA+ +++ + A +K + EK +K E+ K K+ +
Sbjct: 385 ------TIVNVDSNMSIPENAEIYYNKGKKAKRKISGVNIAIEKTMKEVERAKNKREIAM 438
Query: 285 VQTMTNINKARK-VYWFEKFYWFISSENYL--------------------------GNVA 317
+ + + RK + WFEK WF+SS+ L ++
Sbjct: 439 EKVLVPQKRVRKELKWFEKLRWFLSSDGLLVIGGRDATTNEMIVKKHMENRDIYFHSDIH 498
Query: 318 WDAKVV-----------------------TNAW----------WVKADQVSKTAPTGEFL 344
A VV ++AW WV DQVSKT +GEF+
Sbjct: 499 GAASVVVKAGEGEVPESTLNETASFAGSFSSAWSAGFGSTDVYWVHPDQVSKTPQSGEFV 558
Query: 345 TTGSFMIRGKKNFF--PPCQLAMGI 367
G+F+IRG +NF P +A+GI
Sbjct: 559 GKGAFIIRGSRNFIRNAPLLVAVGI 583
>gi|386003039|ref|YP_005921338.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
gi|357211095|gb|AET65715.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
Length = 668
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/270 (21%), Positives = 121/270 (44%), Gaps = 30/270 (11%)
Query: 52 EFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLK---AVQ 103
E +P ++ G P+ + + G +EF SF+ A+DEFFS+ E+ + K A++
Sbjct: 233 EAYPHLVAEGERIVDVVPAPLAVYDGLERREFGSFSEALDEFFSSKEAEAEEAKPKTALE 292
Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
+ R+ ++ R+ E L+ L E + V+A + AI + +
Sbjct: 293 RRREMQERSIQEFRERERELARL----------GEKVYERYGEVEAVLAAISKGFERGFT 342
Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
+ +I A++K + P+A I L + L L + + D
Sbjct: 343 YSEILAKIKTS---GLPIAEKILALDYQ-GELRLRLDDPGDGDGGEGKGGTVGDTG---- 394
Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
+ ++++++ +L+ NA+R++DL + AKK++ ++ E+ ++ +K
Sbjct: 395 -GKGEARGAVLELNSNLTVPQNAQRYYDLAKEQAKKREGAEKALEETIRLIARKAGPEKA 453
Query: 284 DVQTMTNINKARKVYWFEKFYWFISSENYL 313
+ + + RK W+E+F WF SS+ +L
Sbjct: 454 KTRA---VYRRRKPKWYERFRWFTSSDGFL 480
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 3/54 (5%)
Query: 318 WDAKVV-TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGIS 368
W A + + + V ADQV+KT GEFL G+F++RG++ ++ P LA+GI+
Sbjct: 543 WKAGLFEGDCYLVAADQVTKTPEPGEFLKKGAFVVRGERRYYRDVPLGLALGIA 596
>gi|70918391|ref|XP_733179.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56504739|emb|CAH85243.1| hypothetical protein PC301461.00.0 [Plasmodium chabaudi chabaudi]
Length = 169
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 76/150 (50%), Gaps = 4/150 (2%)
Query: 51 MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDL-KAVQQERDAL 109
+EF P++L+ + + +F F VD +FS E K D + + + ++AL
Sbjct: 19 VEFIPILLKNHINKI---DEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKNAL 75
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
K++ +K DHE R+ ALE+ + K+K LI N E V AI +R I+ +WE I
Sbjct: 76 TKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKIWD 135
Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLL 199
VK ++ N PVA I + N I LLL
Sbjct: 136 HVKLFKKRNHPVALKIMSVNFNNCEIELLL 165
>gi|15920412|ref|NP_376081.1| hypothetical protein ST0231 [Sulfolobus tokodaii str. 7]
gi|15621194|dbj|BAB65190.1| hypothetical protein STK_02310 [Sulfolobus tokodaii str. 7]
Length = 595
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 90/204 (44%), Gaps = 69/204 (33%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQTMTN 290
+++D LS Y NA ++FD+ + A+K++K T+ + ++ LK +K+ ++ ++++
Sbjct: 326 IELDPKLSVYKNASKYFDIAKEYAEKRKKAEETLNNLKQKLKELDKQIEERTEEIRI--- 382
Query: 291 INKARKVYWFEKF--------YWFIS---------------------------------- 308
RK W+EK+ Y I+
Sbjct: 383 --SLRKREWYEKYRWSFTRNGYLVIAGRDIDQNESLVRKLLEPKDIFLHADIQGAPATII 440
Query: 309 -------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
+E+ + + A A + AW WV DQVSK+ P+GE+L GSFMI
Sbjct: 441 KTQGNNVTEDDIRDAAVIAACYSKAWKVGMGAIDVFWVNGDQVSKSPPSGEYLKKGSFMI 500
Query: 352 RGKKNFF--PPCQLAMGISFLFKL 373
GKKNF QL +G++ FK+
Sbjct: 501 YGKKNFINNVKMQLFLGLTEDFKI 524
>gi|337284225|ref|YP_004623699.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
gi|334900159|gb|AEH24427.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
Length = 648
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 141/317 (44%), Gaps = 48/317 (15%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFS--TAESHKID-LKAVQQERDALK-KLENVKRD 118
P ++ + G+ + FE+F+ A+DE+F T E K + + ++++R AL+ LE ++
Sbjct: 241 VPIELKWYDGYERKYFETFSEALDEYFGKLTVEKAKAEKTRKLEEKRKALEISLERIR-- 298
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
++ A E+ ++E +LI N V+ + +R + +L WE++E RV++ ++
Sbjct: 299 --EQMMAFEEEAKKNQELGDLIYANYSLVERLLEELRAAV-KKLGWEELERRVEEGKKTG 355
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
+ A +IK + + N +T+ + D K + L
Sbjct: 356 NKAAEVIKGIHPSENAVTVEI------------------------DGKAIK------LYL 385
Query: 239 DLSAYANAKRFFD-LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
+ S NA+ +++ KR+ AK + E +K E + + + + RK
Sbjct: 386 NRSLGENAELYYERAKRAKAKLEGARKAYEETKIKIEELERLIEEEGKKVGVKKLERRKK 445
Query: 298 YWFEKFYWFISSENYLGNVAWDAKVVTNAWWVKAD-QVSKTAPTGEFLTTGSFMIR---- 352
WFEKF WFISSE +L DA TN VK + + + +I+
Sbjct: 446 KWFEKFRWFISSEGFLVIGGKDA--TTNEMVVKRHMEENDIYCHADVYGAPHVVIKDGRK 503
Query: 353 -GKKNFFPPCQLAMGIS 368
G++ F CQ A+ +S
Sbjct: 504 AGERTIFEACQFAVSMS 520
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 38/54 (70%), Gaps = 3/54 (5%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
AW + + +A+WV +QVSK +P GE+L G+FM+ GK+N+F P +LA+G+
Sbjct: 522 AWGQGLYSADAYWVYPEQVSKKSPAGEYLPKGAFMVYGKRNWFHGIPLKLAVGV 575
>gi|222445070|ref|ZP_03607585.1| hypothetical protein METSMIALI_00687 [Methanobrevibacter smithii
DSM 2375]
gi|222434635|gb|EEE41800.1| fibronectin-binding protein A domain protein [Methanobrevibacter
smithii DSM 2375]
Length = 668
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/384 (21%), Positives = 155/384 (40%), Gaps = 98/384 (25%)
Query: 53 FHPMILQPGSTPSV----MRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--R 106
F P I++ S V + ++ GF F++F A DEF+S + D+K +++
Sbjct: 239 FKPQIVKKDSKEDVVAFDLVKYDGFEKTYFDNFNEACDEFYSKKVN--TDIKNIKEAAWN 296
Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
+ K E + E L +T + K E+I +N +++ + + I+ S+++
Sbjct: 297 KKVNKFEKRLKLQEETLDNFHKTIETSQHKGEVIYSNYTTIENLVKVVNNAISKDYSYKE 356
Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
I +K+A+++ A I + + K+ + L L++++ NI + +NA+ + +
Sbjct: 357 IGKTLKEAKKNGLKEAEIFESIDKMGV----LTLKLNETSININPKLTI-PENAEIYYEK 411
Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
++ K + A N K+ L++ AKK+ + + +K+ K+ LK
Sbjct: 412 AKKAKKKTKGATI---AIENTKK--QLEKIKAKKEVAM-----EHISVPKKRVKKNLK-- 459
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL-------------------------------- 313
W+EK WF++S+N L
Sbjct: 460 -------------WYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKYMDNNDIYLHADIHGA 506
Query: 314 --------GNVAWD---------AKVVTNAW----------WVKADQVSKTAPTGEFLTT 346
GN D A + AW WV +QV+KT GEFL
Sbjct: 507 TSTVIKLEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVNPEQVTKTPEAGEFLPK 566
Query: 347 GSFMIRGKKNFFPPCQLAMGISFL 370
GSF+IRG +N+ ++ + I +
Sbjct: 567 GSFVIRGNRNYIRSAKVRIAIGIV 590
>gi|375084281|ref|ZP_09731287.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
gi|374741041|gb|EHR77473.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
Length = 650
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 138/316 (43%), Gaps = 47/316 (14%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P ++ ++G+ + FE+F+ A+DE+F ++ ++ +D + LE R E +
Sbjct: 243 PIELKWYEGYEKKFFETFSEALDEYFGKILIESAKIERTKKLQDKKRGLEVTLRKQEEMI 302
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
E+ ++E +LI N V+ + + + + +L WE+ + R+++ ++ + VA
Sbjct: 303 KGFERQMQENQEIGDLIYANFTFVENLLKELSKAV-EKLGWEEFKKRIEEGRKSGNKVAQ 361
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
IIK + +T+ L E KK V + L+ S
Sbjct: 362 IIKGIDPKEKAVTVEL---------------------------EGKK---VKLYLNKSIG 391
Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI--NKARKVYWFE 301
NA+ +++ + A K + ++ E LK ++ K ++ + ++ + RK WFE
Sbjct: 392 ENAEIYYEKAKKAKHKLEGARKAYEDTLKKIQEIEKLIEEEEKKELSVKKLEKRKKKWFE 451
Query: 302 KFYWFISSENYL---GNVAWDAKVVT------NAWWVKADQVSKTAPTGEFLTTGSFMIR 352
KF WF+SSE +L G A ++V N + AD AP +
Sbjct: 452 KFRWFVSSEGFLVIGGKDATTNEIVVKRHMSENDLYCHADIYG--AP---HVVIKDGKKA 506
Query: 353 GKKNFFPPCQLAMGIS 368
G+K F CQ A+ +S
Sbjct: 507 GEKTIFEACQFAVSMS 522
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 35/54 (64%), Gaps = 3/54 (5%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
AW D +A+W QV+K AP+GE+L G+FM+ GK+N+ P +LA+GI
Sbjct: 524 AWKDGIYSGDAYWADPSQVTKKAPSGEYLGKGAFMVYGKRNWMHGLPVKLAIGI 577
>gi|261350362|ref|ZP_05975779.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
gi|288861145|gb|EFC93443.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
Length = 668
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/384 (21%), Positives = 155/384 (40%), Gaps = 98/384 (25%)
Query: 53 FHPMILQPGSTPSV----MRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--R 106
F P I++ S V + ++ GF F++F A DEF+S + D+K +++
Sbjct: 239 FKPQIVKKDSKEDVVAFDLVKYDGFEKTYFDNFNEACDEFYSKKVN--TDIKNIKEAAWN 296
Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
+ K E + E L +T + K E+I +N +++ + + I+ S+++
Sbjct: 297 KKVNKFEKRLKLQEETLDNFHKTIETSQHKGEVIYSNYTTIENLVKVVNNAISKDYSYKE 356
Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
I +K+A+++ A I + + K+ + L L++++ NI + +NA+ + +
Sbjct: 357 IGKTLKEAKKNGLKEAEIFESIDKMGV----LTLKLNETSININPKLTI-PENAEIYYEK 411
Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
++ K + A N K+ L++ AKK+ + + +K+ K+ LK
Sbjct: 412 AKKAKKKTKGATI---AIENTKK--QLEKIKAKKEVAM-----EHISVPKKRVKKNLK-- 459
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL-------------------------------- 313
W+EK WF++S+N L
Sbjct: 460 -------------WYEKLRWFVTSDNVLVIGGRDAGTNETVVKKYMDNNDIYLHADIHGA 506
Query: 314 --------GNVAWD---------AKVVTNAW----------WVKADQVSKTAPTGEFLTT 346
GN D A + AW WV +QV+KT GEFL
Sbjct: 507 TSTVIKLEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVNPEQVTKTPEAGEFLPK 566
Query: 347 GSFMIRGKKNFFPPCQLAMGISFL 370
GSF+IRG +N+ ++ + I +
Sbjct: 567 GSFVIRGNRNYIRSAKVRIAIGIV 590
>gi|156844590|ref|XP_001645357.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
70294]
gi|156116018|gb|EDO17499.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
70294]
Length = 1019
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 84/174 (48%), Gaps = 9/174 (5%)
Query: 31 GYLVQKKEKRPNSE----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVD 86
GY++ KK + N E D FHP +P + + + I + +D
Sbjct: 277 GYILAKKNENYNPEKHSPDTEFIYETFHP--FEPFVESKDLEKTKIIEIPG--DYNKTLD 332
Query: 87 EFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQES 146
+FFST ES K L+ QE A KKL++ K ++E R+ AL Q +++K LII +
Sbjct: 333 QFFSTIESSKYSLRIQNQELQAKKKLDDAKLENERRIQALVDVQTSNEQKGHLIIAHSNL 392
Query: 147 VDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
++ A++ I Q+ W IE + Q+ + +A +K LKL N I ++L
Sbjct: 393 IEEVKFAVQGLIDQQMDWNTIENLIGSEQKKGNKIAQKVKLPLKLKNNKIDVIL 446
Score = 41.2 bits (95), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 317 AWDAKVVTNAWWVKADQVSK-TAPTGEFLTTGSFMIRGKKNF--FPPCQLAMGISFLFKL 373
AW K+ + W A +SK + T L G F ++ +K+ PP QL MG F +K+
Sbjct: 661 AWSKKIAASPRWCYARNISKFDSTTNTLLPRGRFALKDEKSMIHLPPAQLVMGFGFAWKV 720
Query: 374 EESS 377
+ S
Sbjct: 721 KTES 724
>gi|410730361|ref|XP_003671360.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
gi|401780178|emb|CCD26117.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
Length = 1037
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)
Query: 53 FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
F P + + S + + G+ + +D+FFST ES K L+ QE A KKL
Sbjct: 310 FEPYVEEKDKASSHISEIPGY-------YNKTLDKFFSTIESSKYALRIQNQELQAKKKL 362
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ K D++ +L AL Q ++EK LI+ N + V+ A AI+ + Q+ W IE +K
Sbjct: 363 DEAKLDNQKKLQALIDVQSSNEEKGHLIVANADLVEEAKSAIQGLVDQQMDWNTIEKLIK 422
Query: 173 QAQRHNDPVASIIKQLKLNI 192
Q+ + +A +I L LN+
Sbjct: 423 SEQKKHVKIAELI-VLPLNL 441
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 93/218 (42%), Gaps = 66/218 (30%)
Query: 225 SSETKKPS----LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE----- 275
SS T +P+ V +DL SAYANA +F+ K+++A+KQ++ ++ EKA+K+ E
Sbjct: 516 SSATSQPNEKMITVTIDLGFSAYANASEYFNAKKTSAEKQKRVEKNIEKAMKNIEEKVNT 575
Query: 276 ------KKTKQTLKDVQT------------------MTNINKARKVYWFEKFYW---FIS 308
K++ + LK ++T M N A + K+
Sbjct: 576 QLKKKLKESHEVLKKIRTPYFFEKYHWFISSEGYLVMMGKNDAETDQIYSKYIEDDDVFM 635
Query: 309 SENYLGNV---------------------------AWDAKVVTNAWWVKADQVSKTAPTG 341
S N+ V AW K+ ++AWW A V+K
Sbjct: 636 SNNFGTKVWIKNPMKHEVPPNTLMQAGILCMSSSEAWSKKIASSAWWCNAKNVTKFDKFD 695
Query: 342 E-FLTTGSFMIRGKK--NFFPPCQLAMGISFLFKLEES 376
+ L G F+++ +K N P QL MG+ FL+K++ S
Sbjct: 696 KSVLPPGVFVLKDEKDQNTLPASQLVMGLGFLWKVKTS 733
>gi|229581503|ref|YP_002839902.1| hypothetical protein YN1551_0858 [Sulfolobus islandicus Y.N.15.51]
gi|228012219|gb|ACP47980.1| protein of unknown function DUF814 [Sulfolobus islandicus
Y.N.15.51]
Length = 609
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)
Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
V+++LD LSA NA RFFD + +K ++ ++S ++ + +K KQ ++ + +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLKKIEKQEIEKQNEIKLV 392
Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
RK W+EK+ W IS S+N YL ++A
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450
Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
+DA V+ + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510
Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
GKKNF +L + I + L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536
>gi|448677723|ref|ZP_21688913.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
12282]
gi|445773398|gb|EMA24431.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
12282]
Length = 717
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 117/257 (45%), Gaps = 19/257 (7%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKR---DH 119
TP + +++G + F F +A+D++F + + Q D ++E KR
Sbjct: 275 TPIPLSEYEGLYSESFTEFNSALDDYFFNFQREEEVEGGETQRPDFEVEIEKQKRIIQQQ 334
Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
E + E V++EKAEL+ N + VD + +R + +SW+DIEA+ +
Sbjct: 335 EQAIEDFEADAEVEREKAELLYANYDLVDDVLSTVRAAREDDVSWDDIEAKFDEGADRG- 393
Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDV 236
+A+ + L+ + T+ L + R + D++ NAD A E KK
Sbjct: 394 -IAAAEAVVSLDGSEGTVTLDIGGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA--- 448
Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARK 296
L+A N + DL+ A K+++ ++ + + AE + + D +M +I
Sbjct: 449 ---LAAIENTRE--DLE--AVKERRDEWEADDGEDEVAEDEGEDEPTDWLSMQSIPTRST 501
Query: 297 VYWFEKFYWFISSENYL 313
W+E+F WF +S+ +L
Sbjct: 502 ERWYEQFRWFHTSDGFL 518
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F IRG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVAVGIT 641
>gi|322368861|ref|ZP_08043428.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
DX253]
gi|320551592|gb|EFW93239.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
DX253]
Length = 711
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 63/268 (23%), Positives = 111/268 (41%), Gaps = 46/268 (17%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFS---------TAESHKIDLKAVQQERDALKKLE 113
TP + +++G + F+SF AAVD++F+ E K D +A ++K +
Sbjct: 275 TPFPLEEYEGLDSEAFDSFNAAVDDYFTNLDVSENEDAGEPQKPDFQA------QIEKQQ 328
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
+ E + E+ ++EKAEL+ N VD + +R A W DIEAR ++
Sbjct: 329 RIIEQQEGAIEGFERKADAEREKAELLYANYGFVDEILATVRNARAEDTPWADIEARFEE 388
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
A ++ + + +T+ I D+ + F D K +
Sbjct: 389 GAERGIEAAEAVQGIDPSEGTVTV---------EIDDTKITL------FPDDGVEKNANR 433
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
+ Y AKR + K A + T + E+ K AE+ ++ ++ NI+
Sbjct: 434 L--------YQEAKRIEEKKEGALAAIEDTREELEEVKKRAEQWEEEPEEERTEPENIDW 485
Query: 293 ------KARK-VYWFEKFYWFISSENYL 313
RK W+E+F WF +S+ +L
Sbjct: 486 LSRASIPVRKQEQWYERFRWFRTSDGFL 513
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D + +A+ V DQVSKT +GE+L G F IRG + +F P +A+GI+
Sbjct: 585 DGRFAGDAYMVTPDQVSKTPESGEYLEKGGFAIRGDRTYFEDTPVGVAVGIT 636
>gi|148642838|ref|YP_001273351.1| RNA-binding protein snRNP-like protein [Methanobrevibacter smithii
ATCC 35061]
gi|148551855|gb|ABQ86983.1| predicted RNA-binding protein, eukaryotic snRNP-like protein
[Methanobrevibacter smithii ATCC 35061]
Length = 668
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/384 (21%), Positives = 155/384 (40%), Gaps = 98/384 (25%)
Query: 53 FHPMILQPGSTPSV----MRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--R 106
F P I++ S V + ++ GF F++F A DEF+S + D+K +++
Sbjct: 239 FKPQIVKKDSKEDVVAFDLVKYDGFEKTYFDNFNEACDEFYSKKVN--TDIKNIKEAAWN 296
Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
+ K E + E L +T + K E+I +N +++ + + I+ S+++
Sbjct: 297 KKVNKFEKRLKLQEETLDNFHKTIETSQHKGEVIYSNYTTIENLVKVVNNAISKDYSYKE 356
Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
I +K+A+++ A I + + K+ + L L++++ NI + +NA+ + +
Sbjct: 357 IGKTLKEAKKNCLKEAEIFESIDKMGV----LTLKLNETSININPKLTI-PENAEIYYEK 411
Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
++ K + A N K+ L++ AKK+ + + +K+ K+ LK
Sbjct: 412 AKKAKKKTKGATI---AIENTKK--QLEKIKAKKEVAM-----EHISVPKKRVKKNLK-- 459
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL-------------------------------- 313
W+EK WF++S+N L
Sbjct: 460 -------------WYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKYMDNNDIYLHADIHGA 506
Query: 314 --------GNVAWD---------AKVVTNAW----------WVKADQVSKTAPTGEFLTT 346
GN D A + AW WV +QV+KT GEFL
Sbjct: 507 TSTVIKLEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVNPEQVTKTPEAGEFLPK 566
Query: 347 GSFMIRGKKNFFPPCQLAMGISFL 370
GSF+IRG +N+ ++ + I +
Sbjct: 567 GSFVIRGNRNYIRSAKVRIAIGIV 590
>gi|10640760|emb|CAC12538.1| conserved hypothetical protein [Thermoplasma acidophilum]
Length = 588
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 52/88 (59%), Gaps = 8/88 (9%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V++D +SA NA R+F S AK ++ I+ KA++ AEK Q L ++Q K
Sbjct: 331 VEIDYTVSAGENANRYF----SQAKDYRRKIEGAMKAIEEAEK---QRLTEMQKAEK-KK 382
Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAK 321
RKV+WFE ++WFISSE YL DAK
Sbjct: 383 RRKVFWFETYHWFISSEGYLVIAGRDAK 410
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
AW A + + A+WV QVSKT +GE++ TGS++IRGK+N+ +L + I
Sbjct: 465 AWPAGIASGTAYWVYPSQVSKTPESGEYVATGSWIIRGKRNYITDLKLELCIGM 518
>gi|16082623|ref|NP_394872.1| RNA-binding protein snRNP [Thermoplasma acidophilum DSM 1728]
Length = 601
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 52/88 (59%), Gaps = 8/88 (9%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V++D +SA NA R+F S AK ++ I+ KA++ AEK Q L ++Q K
Sbjct: 344 VEIDYTVSAGENANRYF----SQAKDYRRKIEGAMKAIEEAEK---QRLTEMQKAEK-KK 395
Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAK 321
RKV+WFE ++WFISSE YL DAK
Sbjct: 396 RRKVFWFETYHWFISSEGYLVIAGRDAK 423
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
AW A + + A+WV QVSKT +GE++ TGS++IRGK+N+ +L + I
Sbjct: 478 AWPAGIASGTAYWVYPSQVSKTPESGEYVATGSWIIRGKRNYITDLKLELCIGM 531
>gi|410722235|ref|ZP_11361543.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
sp. Maddingley MBC34]
gi|410597380|gb|EKQ52002.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
sp. Maddingley MBC34]
Length = 742
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/339 (23%), Positives = 150/339 (44%), Gaps = 49/339 (14%)
Query: 70 HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--RDALKKLENVKRDHETRLSALE 127
+Q F + F++F A DEF+S D+K VQ++ + K E R E L +
Sbjct: 341 YQNFHKERFDTFNQAADEFYSGKVG--ADIKKVQEDIWAKEVGKYEKRLRIQEETLEKFQ 398
Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
+T + K+K L+ ++ + +L I + SW +I ++ K+A++ A II+
Sbjct: 399 KTIVETKKKGNLLYSHYSEIQD-LLDIIHQAREKFSWMEIASKFKKARKEGMKEAQIIES 457
Query: 188 L-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL-------D 239
+ K+ + L L + ER + D+ + +NA+ + + + K + V++ D
Sbjct: 458 MDKMGV----LTLNLEGERVTV-DANLEIPENAEKYYNKGKKAKRKIRGVNIAIERTKKD 512
Query: 240 LSAYANAKRF-FDLKRSAAKKQQKTIQSTEK-----------ALKSAEKKTKQTL----- 282
+ N + + R K+ +K ++ EK + + T + +
Sbjct: 513 VERKRNKREMALERVRVPQKRVRKELKWFEKLRWFLSSDGYLVIGGRDAGTNEMVVKRHL 572
Query: 283 --KDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAW----------WVK 330
+D+ ++I+ A V + E+ + A ++AW WV
Sbjct: 573 DNQDIYLHSDIHGAPSVVIKKGEVEGEIPESTVQEAGTLAASFSSAWSKGYGSQDVYWVH 632
Query: 331 ADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
DQVSKT +GEF+ G+F+IRG +N+ P ++A+GI
Sbjct: 633 PDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIAVGI 671
>gi|429217609|ref|YP_007175599.1| RNA-binding protein [Caldisphaera lagunensis DSM 15908]
gi|429134138|gb|AFZ71150.1| putative RNA-binding protein, snRNP like protein [Caldisphaera
lagunensis DSM 15908]
Length = 669
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 38/55 (69%), Gaps = 3/55 (5%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC--QLAMGIS 368
AW A + + A+WV +QVSK+ P+GE+L GSFMI GKKNF P +L +GIS
Sbjct: 524 AWKAGMGSVGAYWVLGNQVSKSPPSGEYLAKGSFMIYGKKNFLKPINMELYLGIS 578
>gi|241958102|ref|XP_002421770.1| conserved hypothetical protein [Candida dubliniensis CD36]
gi|223645115|emb|CAX39711.1| conserved hypothetical protein [Candida dubliniensis CD36]
Length = 1012
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 95/186 (51%), Gaps = 15/186 (8%)
Query: 19 HVLVSNLNVSSQGYLVQKKEKRP--NSE-DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
++ + N + ++GY+V KK + NSE +YI EF P +P P+ +G
Sbjct: 257 YIDLINGAIDNEGYIVAKKNNKASENSELEYIYD--EFDP--FEP-YKPN----QEGLKF 307
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
+ +D+FFS ES K +K QQ+ +A K+LE + + + ++ +L Q ++ +
Sbjct: 308 IPVSGYNKTLDKFFSNIESTKFSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQKLNAK 367
Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
K ELI + E V+ ++ I Q+ W +IE + Q+ + +A I QL LN+ N
Sbjct: 368 KGELIQYHSELVEECRNYVQSFIDQQMDWTNIETVISLEQKKKNDLAKHI-QLPLNLKEN 426
Query: 194 HITLLL 199
I +LL
Sbjct: 427 KIKVLL 432
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%)
Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
AW+ KV T+AW + ++SK G + G F +K + PP QL MG F L+E
Sbjct: 642 AWNGKVTTSAWVLHGTEISKRDYDGSIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDEE 701
Query: 377 S 377
S
Sbjct: 702 S 702
>gi|304314240|ref|YP_003849387.1| hypothetical protein MTBMA_c04780 [Methanothermobacter marburgensis
str. Marburg]
gi|302587699|gb|ADL58074.1| conserved hypothetical protein [Methanothermobacter marburgensis
str. Marburg]
Length = 653
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 132/290 (45%), Gaps = 47/290 (16%)
Query: 50 NMEFHPMILQPGST---PSVMRQHQGFPIQEFESFAAAVDEFFSTA---ESHKIDLKAVQ 103
+++F+P I++ G P + ++ + FE+F A DEFFS+ E K+ +
Sbjct: 232 DLKFNPHIIKNGEGDVLPIELMVYRDREREYFETFNEAADEFFSSIFREELRKVHEAEWE 291
Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
+E + +K ++R+ L + T K +L+ + +V+ + IR D + S
Sbjct: 292 KEVEKFRKRLRIQRET---LQKFQDTIDTSTRKGDLLYAHYAAVEDVLRTIR-DAREKYS 347
Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
W++I + A+ A +I+++ + ++TLL + ER
Sbjct: 348 WKEIRKIIADARSKGMVEAQMIQEID-GMGNMTLL--IDGER------------------ 386
Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS---TEKALKSAEKKTKQ 280
+ +D L NA+ +++ + A +K + +Q+ TE+ ++ EK+
Sbjct: 387 ----------IRIDPTLGVPENAEVYYEKAKKAKRKIKGVLQAIEKTEREIEKVEKRRDD 436
Query: 281 TLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
L+++ K RK+ WFEKF WFISS+++L DA TN VK
Sbjct: 437 ALRNIMVPQKRVK-RKLRWFEKFRWFISSDDFLVIGGRDAG--TNEMVVK 483
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 32/45 (71%), Gaps = 2/45 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ +WV +QVSKT +GEF+ G+F+IRG +N+ P ++A+G+
Sbjct: 539 DVYWVHPEQVSKTPRSGEFVARGAFIIRGTRNYIRGVPLKVAVGV 583
>gi|13542268|ref|NP_111956.1| RNA-binding protein snRNP [Thermoplasma volcanium GSS1]
gi|14325702|dbj|BAB60605.1| hypothetical protein [Thermoplasma volcanium GSS1]
Length = 604
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 10/89 (11%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEK-KTKQTLKDVQTMTNIN 292
+D+D SA NA R+FDL +K +K I+ +KA++ AE+ + K K V+++
Sbjct: 345 IDIDYTKSAGENANRYFDL----SKDYRKKIEGAKKAIEEAEQERIKLQEKKVKSVN--- 397
Query: 293 KARKVYWFEKFYWFISSENYLGNVAWDAK 321
R+++WFE ++WFISSE YL DAK
Sbjct: 398 --RRIFWFETYHWFISSEGYLVIAGRDAK 424
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 36/54 (66%), Gaps = 1/54 (1%)
Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
AW A + + A+WV QVSKT +GE+++TGS++IRGK+N+ +L + I
Sbjct: 479 AWPAGIASGTAYWVYPSQVSKTPESGEYVSTGSWIIRGKRNYVTNLKLELCIGL 532
>gi|424813826|ref|ZP_18239004.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
Nanosalina sp. J07AB43]
gi|339757442|gb|EGQ42699.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
Nanosalina sp. J07AB43]
Length = 632
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 157/346 (45%), Gaps = 68/346 (19%)
Query: 6 DVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQP 60
+++S + G ++ N+ + ++ Q ++R + +ED + + P++
Sbjct: 167 EIASKLSFGGTYAEEVLKRANIDKETHVEQLDDERKDIISEKTEDMLNSAENPDPILYVD 226
Query: 61 GSTPSVMRQHQGFPIQ--------EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
G P + FP++ +F++F+ A+DE++ ++ K + + ++ + +
Sbjct: 227 GEEPV---RSSPFPLERYADDESIDFDTFSEAIDEYYYRKKALKEKKEKEEAYQEKKQGI 283
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQL---SWEDIEA 169
E K+ E ++ LE++ ++EKAE I N + + I++ I N L WE
Sbjct: 284 ERQKQQQERKIQGLEKSAEQNREKAERIYENYQ----LLQRIKRQIENSLDEDGWEQTRQ 339
Query: 170 RVKQAQRHN-DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSET 228
++++++ + D VAS+ KQ + I++ D+ E
Sbjct: 340 KLEKSESEDADKVASLNKQE----DFISV--------------------------DTGEE 369
Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK----TKQTLKD 284
+ DL+ A A +++D AK ++ I+S ++ALK +K+ K+ +
Sbjct: 370 NLKVYLFQDLE----ATASQYYD----KAKNSEEKIESAKEALKETKKELEDLKKEEINT 421
Query: 285 VQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
+ + + + RK WFEK+ WF SSE+YL DA+ TN VK
Sbjct: 422 DEVLEDKTQKRKKKWFEKYRWFYSSEDYLVLCGRDAQ--TNDMLVK 465
>gi|146302942|ref|YP_001190258.1| hypothetical protein Msed_0157 [Metallosphaera sedula DSM 5348]
gi|145701192|gb|ABP94334.1| protein of unknown function DUF814 [Metallosphaera sedula DSM 5348]
Length = 601
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 82/196 (41%), Gaps = 66/196 (33%)
Query: 234 VDVDLDLSAYANAKRFFDLKR---SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
+++D +SA NA ++F+ + + ++ ++TI+ EK + + K+K+T++ + +
Sbjct: 323 IEIDPKISASKNASQYFEKAKELDAKIRRTRETIEELEKKKQEIKAKSKETIEGSKILV- 381
Query: 291 INKARKVYWFEKFYWFIS------------------------------------------ 308
RK W+E+++W I+
Sbjct: 382 ----RKKEWYERYHWTITSNGFIVIAGRDIDQNESIVRKMLEDKDIFLHADIQGAPATVI 437
Query: 309 ------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
E L + A A + AW WV +QVSK+ P+GE+L GSFMI
Sbjct: 438 KNPVGIGEQDLMDAAVLAGCYSKAWKLGLASIDVFWVYGEQVSKSPPSGEYLPKGSFMIY 497
Query: 353 GKKNFFPPCQLAMGIS 368
GKKN+ +L + I
Sbjct: 498 GKKNYIKNVKLELTIG 513
>gi|150400994|ref|YP_001324760.1| hypothetical protein Maeo_0563 [Methanococcus aeolicus Nankai-3]
gi|150013697|gb|ABR56148.1| protein of unknown function DUF814 [Methanococcus aeolicus
Nankai-3]
Length = 686
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 34/46 (73%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ +WVK DQ+SKTA +GE+L+ G+FM+RGK+NF L +GI +
Sbjct: 575 DVYWVKPDQISKTAESGEYLSKGAFMVRGKRNFIRSAILDLGIGII 620
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 120/276 (43%), Gaps = 54/276 (19%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFST--------AESHKIDLKAVQQERDALKKLEN 114
+P + ++ + + +++F A+D++FS + KI +QER ++E+
Sbjct: 261 SPIELLKYANYDKKYYDNFLTAMDDYFSIFILKTEIKKQETKIQKMVNRQERILNSQIES 320
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDA---AILAIRQDIANQLSWEDIEARV 171
+K+ E+ + +K K +LI N VD I++ R+ +L W++I+ V
Sbjct: 321 LKK--------YEKQDIENKLKGDLIYANYAMVDEILNTIISARE----KLEWKEIKKIV 368
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
KQ + ++P+ I + I L L V + D + K
Sbjct: 369 KQNK--DNPILGKIVSINEKNGEIILNLTVD-------------------YGDGAPITKN 407
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQ---QKTIQSTEKALKSA---EKKTKQTLKDV 285
++D+ + A+ NA ++ + K I+ +EK LK E+ +TLK+
Sbjct: 408 VILDIRKN--AFENADNYYGKSKKFKHKIKGVHTAIEISEKKLKKLKIQEESEMETLKEK 465
Query: 286 QTMTNINKARKVY-WFEKFYWFISSENYLGNVAWDA 320
+ T + K RK W+EKF W + + +YL DA
Sbjct: 466 EETTMVKKERKKRKWYEKFKWTVIN-DYLVIAGKDA 500
>gi|448502987|ref|ZP_21612851.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
10284]
gi|445693389|gb|ELZ45541.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
10284]
Length = 730
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/286 (22%), Positives = 116/286 (40%), Gaps = 57/286 (19%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
TP + +H+G P F+SF AAVDE+F + D + A + K E
Sbjct: 269 TPFPLAEHEGLPSVGFDSFNAAVDEYFYRLGNEATDDGEAPADATASRPDFEAEIAKQER 328
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + E+ ++E+AEL+ N + VD + +R+ N++ W++I A +
Sbjct: 329 IVEQQRGAIEGFEEQAQAERERAELLYANYDLVDEVLSTVREARENEVPWDEIAATLDAG 388
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A+ + + +T+ L + +D + D D+SE V
Sbjct: 389 AERGIPAAAAVVDVDGGEGTVTVAL-------DDEDGGSVRID-----LDASEG-----V 431
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK----------- 283
+V+ D Y AKR + K++ AK I+ST + L++ ++ + +
Sbjct: 432 EVNAD-RLYQEAKR-VEEKKAGAK---AAIESTREELEAVNERKAEWEEQEAAADESAGA 486
Query: 284 ----------------DVQTMTNINKARKVYWFEKFYWFISSENYL 313
D + ++I WFE+F WF +S YL
Sbjct: 487 DGDGEDGEDGDEAYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 532
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 604 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 654
>gi|385805336|ref|YP_005841734.1| putative RNA-binding protein, eukaryotic snRNP-like protein
[Fervidicoccus fontis Kam940]
gi|383795199|gb|AFH42282.1| putative RNA-binding protein, eukaryotic snRNP-like protein
[Fervidicoccus fontis Kam940]
Length = 629
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
AW A + V + +WV +DQVSKT P GE+L GSFMI G+KN+ L +GI
Sbjct: 500 AWKASLKVVDVFWVHSDQVSKTPPAGEYLEKGSFMIYGEKNYVRNVPLKLGIGL 553
>gi|307354208|ref|YP_003895259.1| Fibronectin-binding A domain-containing protein [Methanoplanus
petrolearius DSM 11571]
gi|307157441|gb|ADN36821.1| Fibronectin-binding A domain protein [Methanoplanus petrolearius
DSM 11571]
Length = 636
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 149/379 (39%), Gaps = 106/379 (27%)
Query: 56 MILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENV 115
+I + G P + + P+ F SF A+ +F +S D K ++ L K E +
Sbjct: 232 VITKSGCWPYPLPGEE--PVSSFSSFNDALSAYFPLPQSAAKDAK-----KEKLPKSEII 284
Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
+R + + E+ +EK + I N + + I +R D +++LSW++IE +K +
Sbjct: 285 RRRQQEAIVNFEKKIAELQEKVDAIYENYQDISGIIDTLR-DASSKLSWQEIEETLKNSS 343
Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
P A I +R++ S + D A + K
Sbjct: 344 L---PAAKSI-------------VRIYPSESAV-----------DVMAGGKKVK------ 370
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL-KSAEKKTKQTLKDVQTMTNINKA 294
+ ++ + ANA R++ + KK+ + + EK + K + K +Q K
Sbjct: 371 IFINENPEANANRYYGEIKKYKKKKAGALVAMEKFMPKEKQAKKRQDY----------KP 420
Query: 295 RKVYWFEKFYWFISSENYL-------------------------------GNVA------ 317
+K W+ K+ WF++S+ L G+V
Sbjct: 421 QKKKWYHKYRWFVTSDGVLVIGGQDAGSNEDIGKKYLEGRDYFVHADVHGGSVVVVKGET 480
Query: 318 --WD-----AKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
W+ A +NAW K +QVSKTA +GEF+ G+F+IRG++ +F
Sbjct: 481 ENWEEVAEFAASYSNAWKAGHFNCDVYAAKPEQVSKTAESGEFVKRGAFIIRGERRYFRN 540
Query: 361 CQLAMGISFLFKLEESSIS 379
L + I + E + I
Sbjct: 541 IGLKVAIGLQLEPELAVIG 559
>gi|284161856|ref|YP_003400479.1| fibronectin-binding A domain-containing protein [Archaeoglobus
profundus DSM 5631]
gi|284011853|gb|ADB57806.1| Fibronectin-binding A domain protein [Archaeoglobus profundus DSM
5631]
Length = 626
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/374 (22%), Positives = 150/374 (40%), Gaps = 121/374 (32%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P + ++ + + FESF A+DEF+S K + ++E + LKKL ++ E +L
Sbjct: 231 PIELEYYRDYEKKYFESFNKALDEFYS-----KTIAETEEEESEELKKL---RKRLEIQL 282
Query: 124 SA---LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
+ LE+ K + I N +++ A+ A RQ ++S+E+ +A+ K + D
Sbjct: 283 ESKRKLEEEAEKFKSLGDFIYENYATIEKALNAFRQ-AKEKMSFEEFKAKAKSLKFVKD- 340
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
+ +++ ++L N K+ + +DLD
Sbjct: 341 ---------VGKDYVVIVL-------NGKE-----------------------IRLDLDK 361
Query: 241 SAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
+ A+ +++ + A +K + I+ T+K ++ AE+K K ++ I RK
Sbjct: 362 DIHGIAESYYEKAKKAREKLEGLLIAIEKTKKEIEEAERKEK-----LKYTAPIRIVRKR 416
Query: 298 YWFEKFYWFISSENYLG------------------------------------------- 314
WFE+F WFI+S+ +L
Sbjct: 417 EWFERFRWFITSDGFLAIGGRNAQMNEEIVSKYLEPKDLFFHTQTPGAPAVVLKKGLEAP 476
Query: 315 -----NVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
A A + ++ W +V ADQV K+A GE+L GSF I GK+N+
Sbjct: 477 EISIVETAQFAAIYSSLWKQGLHSGEVYYVTADQVKKSAKAGEYLPKGSFYIVGKRNYIT 536
Query: 360 ---PCQLAMGISFL 370
C + + +S L
Sbjct: 537 VELKCAVGVDLSNL 550
>gi|374630447|ref|ZP_09702832.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
2279]
gi|373908560|gb|EHQ36664.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
2279]
Length = 629
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 147/368 (39%), Gaps = 107/368 (29%)
Query: 55 PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
P I + G P + P + FE+++ A+D +F E + ++K L K E
Sbjct: 234 PAITKSGCWPLIFEGE--IPEETFETYSQALDSYFGLPEVSEAEVK------KKLSKAEI 285
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+++ + + E+ + EK E+I N +++ A I+ D + ++SW++IE +K A
Sbjct: 286 IRKRQQEAIVKFEEKITLASEKVEIIYANYQTI-ADIVKTLSDASLKMSWQEIEDILKNA 344
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
++P+A +IK++ + + +LL D K +
Sbjct: 345 ---DNPMAKMIKRVYPSEAAVDILL------------------------DGKTIKLYASE 377
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
V+ NA R++ + KK+ + + E+ + + K+T D++ +
Sbjct: 378 GVE------GNAGRYYSEIKKFKKKKAGALVAMERFKVTERPERKRT--DIKFI------ 423
Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAKV-------------------------------- 322
K W+ KF WF +S++ L DA
Sbjct: 424 -KPKWYHKFRWFYTSDDVLVIGGRDAGTNEDIVRKYLEGKDTFLHADIHGGSAVAVKGET 482
Query: 323 ------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
+NAW V DQVSKTA +GE L G+F+IRG++ ++
Sbjct: 483 ECMDEAAVFAVSYSNAWKSGFYSADVYAVPRDQVSKTAESGESLKRGAFVIRGERKYYR- 541
Query: 361 CQLAMGIS 368
+A GIS
Sbjct: 542 -NVAPGIS 548
>gi|312136934|ref|YP_004004271.1| fibronectin-binding a domain-containing protein [Methanothermus
fervidus DSM 2088]
gi|311224653|gb|ADP77509.1| Fibronectin-binding A domain protein [Methanothermus fervidus DSM
2088]
Length = 645
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 90/198 (45%), Gaps = 66/198 (33%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQTMTN 290
V++D++ S NA+ +++ + KK + ++ TEK L + +KK +K++ +
Sbjct: 382 VNIDINKSLEKNAEIYYEKAKKIRKKIKGAIKAMEETEKKLNNLKKKRDIEIKNI--LIP 439
Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------VV 323
I K RK+ WFEKF WFISS+ +L DA+ V+
Sbjct: 440 IKKRRKLKWFEKFRWFISSDGFLVIGGRDAQTNEIIVKKYMEENDIYLHADIHGAPSVVI 499
Query: 324 TN----------------------AW----------WVKADQVSKTAPTGEFLTTGSFMI 351
N AW WV QV+K+ P+GE+++ G+F+I
Sbjct: 500 KNKNKKIPENTINEAAIFAASFSKAWTYGLGSADVYWVYPQQVTKSPPSGEYISKGAFVI 559
Query: 352 RGKKNFF--PPCQLAMGI 367
RGK+N+ P +LA+GI
Sbjct: 560 RGKRNYIRNVPIELAVGI 577
>gi|448474105|ref|ZP_21602073.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
13560]
gi|445818385|gb|EMA68244.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
13560]
Length = 731
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/282 (21%), Positives = 104/282 (36%), Gaps = 56/282 (19%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
TP + +H+G P F+SF AAVDE+F E D + A + K E
Sbjct: 277 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEHEDTDAGEAPADASASRPDFEEEIAKQER 336
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + ++ ++E+AEL+ + VD + +R AN + W++I +
Sbjct: 337 IIEQQRGAIEGFDEQAAQERERAELLYAEYDLVDEVLSTVRDARANDVPWDEIADTLAAG 396
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A + + + +T+ L R I D+ NAD
Sbjct: 397 AERGIPAAEAVVDVDGSDGTVTVELGDDGTRVEI-DTGAGVEVNADRL------------ 443
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-------- 286
Y AKR D K A ++ I+ST L++ +++ +
Sbjct: 444 --------YQEAKRIEDKKAGA----EQAIESTRAELEAVKERKAEWAAQQAAADDDQSD 491
Query: 287 ---------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++I R W+E+F WF ++ YL
Sbjct: 492 SEEDDDEEEHEIDWLSRSSIPIRRPEDWYERFRWFHTASGYL 533
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 605 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRVAVGV 655
>gi|389860344|ref|YP_006362583.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
gi|388525247|gb|AFK50445.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
Length = 644
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQLAMGI 367
AW A + + +WV+ QVSK+ P+GE+L+ GSFM+ GK+N+ P +LA+G+
Sbjct: 510 AWKAGLASIEVFWVRGGQVSKSPPSGEYLSKGSFMVYGKRNYLSIPLELALGV 562
>gi|336476370|ref|YP_004615511.1| fibronectin-binding A domain-containing protein [Methanosalsum
zhilinae DSM 4017]
gi|335929751|gb|AEH60292.1| Fibronectin-binding A domain protein [Methanosalsum zhilinae DSM
4017]
Length = 660
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 132/320 (41%), Gaps = 36/320 (11%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
+ESF A+DE+F A K+D K ++D E + +T +S E+ + A
Sbjct: 273 YESFNEALDEYFGKASLEKVDEKVETVKKDKAGVFERRLQQQKTAISKFEKQAEKYVQAA 332
Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN---------DPVASII--- 185
E I + + ++ A+ + SW +I++ +K ++ DP II
Sbjct: 333 EKIYSYYQDIEHITDALNNARSKGYSWSEIKSIIKSSKDSTQAAKSIINIDPGKGIIVLD 392
Query: 186 ---KQLKLNIN-----HITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
+++NIN + + K+ + +D + + E K+PS +
Sbjct: 393 LDGTNVEININKSIPQNAEMYYEKAKKVTRKRDGALKALEETKASMQKKEKKEPSKRKII 452
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK-----TKQTLKDVQTMTNIN 292
S Y + F + + E K EK+ T+ V +
Sbjct: 453 RKPSWYERFRWFISSDGFLVVGGRDADTNEEIVKKYMEKRDLFFHTQAPGAPVTIIKTEG 512
Query: 293 KARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSF 349
K E+ F+ S + L G+ A D + VK +QVSKT +GE+L GSF
Sbjct: 513 KEVPSTTIEEASRFVVSYSSLWKLGHFAGDC------YMVKPEQVSKTPESGEYLKKGSF 566
Query: 350 MIRGKKNFFP--PCQLAMGI 367
+IRG++N+F P ++A+GI
Sbjct: 567 VIRGERNYFKNVPMRVAVGI 586
>gi|310752298|gb|ADP09459.1| FbpA and DUF814 domain protein [uncultured marine crenarchaeote
E48-1C]
Length = 608
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 79/377 (20%), Positives = 146/377 (38%), Gaps = 101/377 (26%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK-LENVKR--DHE 120
P ++ + F + +ESF +DEF+ + + L +E +LK+ +E +KR + +
Sbjct: 180 PFRLKCYADFEHKCYESFNETLDEFYVRVGAIEKALTVATEEVGSLKQEMERLKRIIEMQ 239
Query: 121 TRLSALEQTQLVDKEKAELII----NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
A +T + + ++ II E++ LA R++ +W++I + V ++
Sbjct: 240 EEACATAKTNMQENKRMGDIIHVHAGELEALLHRFLAGREE---GKAWDEIVSEVLAEKK 296
Query: 177 HNDPVASII-----KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
+ + K L +++ L + RS DNA F
Sbjct: 297 TGVKSSGFLVSFDDKHLVVDVCLDGLQFGLSLRRS--------LFDNAARF--------- 339
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
Y AK+ D + A ++ + ++ E L+ AE + +V
Sbjct: 340 --------YRRYKRAKQKLDGAKIAMEESHRKLEEVEARLEKAEAAGSVSPVEVIEEVAE 391
Query: 292 NKARKVYWFEKFYWFISSENYL--------------------GNVAWDAKVV-------- 323
K + WFEKF WF+SS+ L G++ + A VV
Sbjct: 392 RKIERKKWFEKFRWFVSSDGVLVVAGKDAVSNEVLVNKYATDGDIVFHADVVGAPFVVVK 451
Query: 324 -------------------------------TNAWWVKADQVSKTAPTGEFLTTGSFMIR 352
+ +WVK DQ+ K+A +G+++ G F++R
Sbjct: 452 MNGEKPSEECLRQAGVFAASFSRGWREGFASVDVYWVKPDQLDKSAKSGQYVPKGGFVVR 511
Query: 353 GKKNFF--PPCQLAMGI 367
GK+N+ P +LA+GI
Sbjct: 512 GKRNWMRGSPLRLAVGI 528
>gi|448435995|ref|ZP_21587011.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
14210]
gi|445683155|gb|ELZ35558.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
14210]
Length = 743
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 113/289 (39%), Gaps = 53/289 (18%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
TP + +H+ P F+SF AVDE+F S D +A DA + K
Sbjct: 272 TPFPLAEHENLPSVGFDSFNDAVDEYFYRLGSE--DTEAGDAPADASASRPDFEGEIAKQ 329
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + E + E+ ++E+AEL+ N + VD I +R+ +++ W++IE +
Sbjct: 330 ERIIEQQEGAIEGFEEQAQAERERAELLYANYDLVDEVISTVREARESEVPWDEIEETLD 389
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
P A + + +T+ L + D V D A S T +
Sbjct: 390 AGAERGIPAAEAVVDVDGGEGTVTVELAEEPD-----DDAVDGEDGA-----SGGTTR-- 437
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQK--TIQSTEKALKS-----AEKKTKQTLKD 284
+++D NA R + + KR KK+ I+ST L++ AE + +Q D
Sbjct: 438 -IELDASEGVEVNADRLYQEAKRVEEKKEGAVAAIESTRAELEAVKERKAEWEEQQAADD 496
Query: 285 VQ--------------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ +I W+++F WF +S YL
Sbjct: 497 GSAQGGDGDDEDDDEEYETDWLSRASIPIRSPDDWYDRFRWFHTSTGYL 545
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 617 DGRGAGDAYMVDPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 667
>gi|448491980|ref|ZP_21608648.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
19288]
gi|445692198|gb|ELZ44379.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
19288]
Length = 729
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 63/288 (21%), Positives = 112/288 (38%), Gaps = 63/288 (21%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
TP + +H+G P F++F AAVDE+F + + D + A + K E
Sbjct: 270 TPFPLAEHEGLPSVGFDTFNAAVDEYFYRLGNEETDEGEAPADAGASRPDFEEEIAKQER 329
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + E+ ++E+AEL+ + + VD + +R+ N++ W++I A +
Sbjct: 330 IIEQQLGAIEGFEEQAQAERERAELLYAHYDLVDEVLSTVREARENEVPWDEIAATLDAG 389
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A+ + + +T+ L E V
Sbjct: 390 AERGIPAAAAVVDVDGGEGTVTVEL-------------------------DEEGDGEGTV 424
Query: 235 DVDLDLS--AYANAKRFF-DLKRSAAKK--QQKTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V+LD S NA R + + KR KK ++ I+ST + L++ K+ K ++ Q
Sbjct: 425 TVELDASEGVEVNADRLYREAKRVEEKKAGAKEAIESTREELEAV-KERKAEWEEQQAAD 483
Query: 287 ---------------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++I WFE+F WF +S YL
Sbjct: 484 DGSGGDDGGEDDEEEYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 531
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 603 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 653
>gi|448546430|ref|ZP_21626594.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
gi|448548417|ref|ZP_21627684.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
gi|448557611|ref|ZP_21632800.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
gi|445702883|gb|ELZ54823.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
gi|445714168|gb|ELZ65935.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
gi|445714512|gb|ELZ66274.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
Length = 702
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/281 (22%), Positives = 113/281 (40%), Gaps = 65/281 (23%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +R+HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + E + EQ ++E+AEL+ N + VD + +R + W+DI A ++
Sbjct: 313 ERIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRDAREEGVPWDDIGATLE 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +D+ + NA R + + KR KK+ I+ T + L++ +K+ + D
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELEAVKKRRDEWEADDDEDD 462
Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +WFE+F WF +S YL
Sbjct: 463 EEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF++RG + +F P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 625
>gi|395645660|ref|ZP_10433520.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
4140]
gi|395442400|gb|EJG07157.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
4140]
Length = 635
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 83/399 (20%), Positives = 156/399 (39%), Gaps = 126/399 (31%)
Query: 47 IVANMEFHP--MILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQ 104
++ ++E HP ++ + G P + G Q F+++ AA++ F+ + + ++
Sbjct: 225 LIRDVETHPEPVVTESGCWPV---RGLGTVQQAFDTYNAALESFYPEVPA---SVTKEEE 278
Query: 105 ERDALKKLENVKRDHETRLSALEQTQLVDKEKA-ELIINNQESVDAAILAIRQDIANQLS 163
+R L + E ++ ET + E +++ EKA E I N V I + Q + +S
Sbjct: 279 KRPKLTREEVIRLQQETAIKKFE-SKIARAEKAVEAIYTNYPLVQEVITTL-QRASRSMS 336
Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
W++IE +K + + P A + + VH A
Sbjct: 337 WQEIEKILKSS---DLPAAKAV-------------VSVHP-------------------A 361
Query: 224 DSSETKKPSLVDVDLDL--------SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE 275
D++ VDVD+ + S AN +R++D + KK++ + + E+ + +
Sbjct: 362 DAA-------VDVDVGMQVTIHVHESVEANVERYYDQIKKFRKKKEGALAAMERGVPKQK 414
Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN--YLGN------------------ 315
+K K+TL + K WF +F WF +++ LG
Sbjct: 415 EKPKETLHLL----------KKKWFHRFRWFYTTDGTLVLGGRDASQNEELVKRYMEGKD 464
Query: 316 -------------------------VAWDAKVVTNAW----------WVKADQVSKTAPT 340
VA A +NAW + DQVSKT +
Sbjct: 465 TFVHADVHGGSVVIVKGPTEHLEDEVACFAASYSNAWKAGHFAADVYIARPDQVSKTPES 524
Query: 341 GEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
GE+++ G+F++RG++ + L + I K + + I
Sbjct: 525 GEYVSRGAFIVRGERQYVRDVPLGVAIGVQLKPDVTVIG 563
>gi|320100405|ref|YP_004175997.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
gi|319752757|gb|ADV64515.1| protein of unknown function DUF814 [Desulfurococcus mucosus DSM
2162]
Length = 665
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 69/176 (39%), Gaps = 63/176 (35%)
Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT----NINKARKVYWFEKFYWFIS----- 308
KK+ + + E A+K E+ +Q + TM+ + + W+E+F+W +
Sbjct: 398 KKRGELVGKIESAVKVLEEMRRQLNEASSTMSIEDKTVRRLSPTLWYERFHWLFTRNGFL 457
Query: 309 -------SEN------YLGN------------------------------VAWDAKVVTN 325
S+N YLG+ ++ A +
Sbjct: 458 AIGGRDQSQNEMVVRKYLGDNDVFIHADIHGGSAVVLKSRGLHSVEDVVDASYLAACYSR 517
Query: 326 AW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQLAMGISFL 370
AW WV QVSKT P+GE+L G+FMI G KNF P +LA+G F
Sbjct: 518 AWRAGFSFIEVFWVPGSQVSKTPPSGEYLPRGAFMIYGSKNFLSIPLRLAVGARFF 573
>gi|14601515|ref|NP_148055.1| hypothetical protein APE_1611 [Aeropyrum pernix K1]
gi|5105298|dbj|BAA80611.1| conserved hypothetical protein [Aeropyrum pernix K1]
Length = 650
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 81/204 (39%), Gaps = 70/204 (34%)
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--INKARKVYWFEKF 303
K L R A + + K + EKA A + ++ ++ + + I + RK +WFEK+
Sbjct: 363 GKLILRLYREAGELEAKA-ERAEKAFAEARSRLEEAVRRARLRSLRRIIEGRKRFWFEKY 421
Query: 304 YWFISS------------------ENYLGNV---------------------------AW 318
+W I+ + YLG+ +
Sbjct: 422 HWTITRNGFLAIGGRDAGQNESVVKRYLGDDDIFLHADIHGAPATVLLTRRLQPGDDDIY 481
Query: 319 DAKVVTNAW--------------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQ 362
DA V+ A+ WV QVSK+ P GE+L G+FM+ GK+N+ P +
Sbjct: 482 DAAVLAAAYSRAWKAGAGGVSVYWVYGSQVSKSPPAGEYLARGAFMVYGKRNYIHHVPLK 541
Query: 363 LAMGI------SFLFKLEESSISR 380
LA+GI ++ E ++SR
Sbjct: 542 LALGIVMHKDVPYIVAGSEEAVSR 565
>gi|432328279|ref|YP_007246423.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
sp. MAR08-339]
gi|432134988|gb|AGB04257.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
sp. MAR08-339]
Length = 596
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/195 (24%), Positives = 81/195 (41%), Gaps = 66/195 (33%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL-KSAEKKTKQTLKDVQTMTNIN 292
V++ +D S NA +++ AKK ++ I+ ++AL K+ E+ + + I
Sbjct: 330 VELRVDKSVGENASLYYE----KAKKMREKIKGAQQALEKAKEELKSVKKMEEKKKREIR 385
Query: 293 KARKVYWFEKFYWFISSENYLGNVAWDAKV------------------------------ 322
K+R+ +WFEK+ WFISSE+ L DAK
Sbjct: 386 KSRRRFWFEKYRWFISSEDILVIAGRDAKTNEEVVKKHLGDKDLYMHADIHGAPSVVIKS 445
Query: 323 -------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
++ AW WV QVSK +GE++ G++++ G
Sbjct: 446 EGKEIGEKTLYEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVVHG 505
Query: 354 KKNFF--PPCQLAMG 366
++N+ P +LA+G
Sbjct: 506 RRNYIHKVPLRLAVG 520
>gi|448424081|ref|ZP_21582207.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
10247]
gi|448478971|ref|ZP_21603977.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
gi|445682746|gb|ELZ35159.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
10247]
gi|445822801|gb|EMA72563.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
Length = 735
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 64/289 (22%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF-------STAESHKIDLKAVQQE-RDALKKLEN 114
TP + +H+ P F+SF AAVDE+F + D A + + + + K E
Sbjct: 275 TPFPLAEHEDLPSVGFDSFNAAVDEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQER 334
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + + E+ ++E+AEL+ + + VD I +R+ N++ W++IE +
Sbjct: 335 IIEQQKGAIEGFEEQAQAERERAELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAG 394
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A + + +T+ L E V
Sbjct: 395 AERGIPAAEAVAGVDGGEGTVTVEL-------------------------EEEGDDGGTV 429
Query: 235 DVDLDLS--AYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLKDVQTM- 288
V+LD S NA R + + KR KK+ ++ I+ST + L++ K+ K+ ++ Q
Sbjct: 430 TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREELEAV-KERKREWEEQQAAD 488
Query: 289 ------------------------TNINKARKVYWFEKFYWFISSENYL 313
++I WFE+F WF +S YL
Sbjct: 489 DGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWFHTSTGYL 537
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F C++A+G+
Sbjct: 609 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVACRIAVGV 659
>gi|219852170|ref|YP_002466602.1| hypothetical protein Mpal_1566 [Methanosphaerula palustris E1-9c]
gi|219546429|gb|ACL16879.1| protein of unknown function DUF814 [Methanosphaerula palustris
E1-9c]
Length = 629
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 81/370 (21%), Positives = 147/370 (39%), Gaps = 108/370 (29%)
Query: 55 PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
P++ G+ P V+ PI +F+ A++ + H+ Q++ + + E
Sbjct: 234 PVVTSGGAWPIVLTGMT--PISHHPTFSEALEAIYPLVTRHE-----GPQKKAPIPREER 286
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQ-LSWEDIEARVKQ 173
++ E L + ++ +++K +LI N V I + D A++ LSW++I + +K+
Sbjct: 287 IRLQQEAALKSFDKKIVLNKAIVDLIYENYTLVTDVIKTL--DAASKTLSWQEIGSMLKE 344
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
+ ND VA I + + LLL K ++ +S
Sbjct: 345 SD--ND-VARQIAGVHPAEAAVDLLLDGKKVLIHVHES---------------------- 379
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
++V+L+ YA K+F KK+ + + E+ + A+K T + +T + K
Sbjct: 380 IEVNLE-RYYAQVKKF-------KKKRDGAVSAMERPV--AKKATSKV-----HLTPLKK 424
Query: 294 ARKVYWFEKFYWFISSEN--YLGN------------------------------------ 315
W+ +F WF +S+N LG
Sbjct: 425 R----WYHRFRWFFTSDNCLVLGGRDAGQNEELVKRYMEGGDTFVHADVHGASVVIVKGK 480
Query: 316 ------VAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
VA A + AW V+ DQVSKT GEF++ GSF++RG++ +F
Sbjct: 481 TEQMDEVAQFAASYSGAWRSGHFSADVYAVRPDQVSKTPEAGEFVSRGSFIVRGERTYFK 540
Query: 360 PCQLAMGISF 369
L + I +
Sbjct: 541 SVPLGVAIGY 550
>gi|448583074|ref|ZP_21646543.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
gi|445730031|gb|ELZ81623.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
Length = 702
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 61/281 (21%), Positives = 113/281 (40%), Gaps = 65/281 (23%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +R+HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSNRPDFEEEIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ + E + EQ ++E+AEL+ N + VD + +R + W+DI AR++
Sbjct: 313 QRIIDQQEGAIEGFEQQAEDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAARLE 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTIEL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +++ + NA R + + KR KK+ I+ T + L + +K+ + D
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDEDD 462
Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +WFE+F WF +S YL
Sbjct: 463 EEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYL 503
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF+IRG + +F ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVAAKVAVGI 625
>gi|374635672|ref|ZP_09707266.1| Fibronectin-binding A domain protein [Methanotorris formicicus
Mc-S-70]
gi|373561525|gb|EHP87758.1| Fibronectin-binding A domain protein [Methanotorris formicicus
Mc-S-70]
Length = 673
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 36/55 (65%), Gaps = 3/55 (5%)
Query: 316 VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ W A + +WVK +Q+SKTA +GE+L G+F+IRGK+NF L +GI +
Sbjct: 554 LGWGA---LDTYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRNVPLELGIGVI 605
>gi|297619525|ref|YP_003707630.1| Fibronectin-binding A domain-containing protein [Methanococcus
voltae A3]
gi|297378502|gb|ADI36657.1| Fibronectin-binding A domain protein [Methanococcus voltae A3]
Length = 722
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 139/364 (38%), Gaps = 100/364 (27%)
Query: 78 FESFAAAVDEFFSTAESHKIDLKAVQQERDAL-KKLENVKRDHETRLSALEQTQLVDKEK 136
+E++ A+DE+FS K D+K + + D L +K E + + E+ + +K
Sbjct: 311 YENYLNALDEYFSQFILQK-DIKKEETKLDKLIRKQERIVNSQIETKAKYEKQSAKNHQK 369
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII-----KQLKLN 191
+LI N +D I IR ++ W+ I+ V ++ ++P+ S I K +LN
Sbjct: 370 GDLIYANFTEIDEIINTIRS-AREKMEWKQIKKIV--SENKDNPILSKIESINEKNAELN 426
Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFD 251
+ L+ E IK + V +D+ SA+ NA ++
Sbjct: 427 LK---LIAEYGGELGTIKGN----------------------VAIDIRESAFENANSYYT 461
Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN------KARKVYWFEKF-- 303
+ K I + E + K EK +QT D + + + R + W+EK
Sbjct: 462 KAKKFKNKVSGVIVALEISQKKLEKIRQQTELDAELLKQKQQNIKKKERRVLKWYEKLKW 521
Query: 304 -----YWFIS------------------------------------------SENYLGNV 316
Y I+ SE L V
Sbjct: 522 TIIDNYLIIAGKDATTNEIIVKKYLEKNDVVFHTLMEGAPFTVIKNTSEETPSEETLLEV 581
Query: 317 AWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
A A + AW WV +Q+SKTA +GEFL G+F+IRGK+NF L +G
Sbjct: 582 AKFAVSHSKAWKLGLGSADVYWVLPEQISKTAESGEFLKKGAFVIRGKRNFIRSAPLDLG 641
Query: 367 ISFL 370
+ +
Sbjct: 642 VGIV 645
>gi|298675852|ref|YP_003727602.1| fibronectin-binding A domain-containing protein [Methanohalobium
evestigatum Z-7303]
gi|298288840|gb|ADI74806.1| Fibronectin-binding A domain protein [Methanohalobium evestigatum
Z-7303]
Length = 670
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 119/275 (43%), Gaps = 72/275 (26%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFS------------TAESHKIDL--KAVQQERDAL 109
P + Q++ + Q F+SF A+DEFF +A+ K+D+ + +QQ++ A+
Sbjct: 262 PFELEQYKEYEKQYFDSFNKALDEFFGKEVVEAERKIQESAKKEKVDIYQRRLQQQQGAI 321
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
+K E + + AE I ++ V+ I + + SW+DI++
Sbjct: 322 EKFEKEANKYNS--------------IAEAIYSHYPFVEEVITVLTNARKSGYSWDDIKS 367
Query: 170 RVKQAQRHND-PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSET 228
++K+A ND P A +I+ + I + D T
Sbjct: 368 KLKEA---NDIPSAKLIQSIDPKSGTIVM--------------------------DLDGT 398
Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQ---KTIQSTEKALKSAEKKTKQTLKDV 285
K +D+ S NA+ +++ + KK++ + I+ T++ +++ +K +QT K
Sbjct: 399 K----ATLDIRYSVPQNAQTYYEKAKRVMKKREGALRAIEETKRIIENRDKPQQQTRKR- 453
Query: 286 QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
RK +W+ +F WFISS+ +L DA
Sbjct: 454 ------KVIRKKHWYSRFRWFISSDGFLVVGGRDA 482
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
+ +WV +QVSKT +GEFL GSF+IRG +N+F P +A+G+
Sbjct: 546 DCYWVYPNQVSKTPESGEFLKKGSFIIRGDRNYFKNVPVSVAIGLEL 592
>gi|315231919|ref|YP_004072355.1| RNA-binding protein [Thermococcus barophilus MP]
gi|315184947|gb|ADT85132.1| RNA-binding protein [Thermococcus barophilus MP]
Length = 650
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/312 (21%), Positives = 131/312 (41%), Gaps = 37/312 (11%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ ++ + + FE+F+ A+DE+F K ++ ++ + +++ R E +
Sbjct: 242 VPIELKWYENYEKKYFETFSEALDEYFGKITVEKAKIERTKRLEEKKRQILATLRRQEEQ 301
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ E ++E +LI N +D + + + +L WE+ + R+++ ++ + +A
Sbjct: 302 MKGFEAEMKKNQELGDLIYANFTFIDNLLREFSKAV-EKLGWEEFKKRIEEGKKAGNKIA 360
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
++K + +T+ + K + + S +NA+ + + ++ K L
Sbjct: 361 LMVKSIDPKEKAVTIEIEGRKIKLYLNKSI---GENAEIYYEKAKKAKHKL--------- 408
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
AKR A + +K +Q EK ++ KK + K + WFEK
Sbjct: 409 -EGAKR-------AYEDTKKKLQEIEKLIEEEMKKELKVKKLEKRKKK--------WFEK 452
Query: 303 FYWFISSENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMI------RGKKN 356
F WFISSE +L DA TN VK + ++ G+K
Sbjct: 453 FRWFISSEGFLVIGGKDA--TTNEMVVKRHMGDNDLYCHADVHGAPHVVIKDGQKAGEKT 510
Query: 357 FFPPCQLAMGIS 368
F CQ A+ +S
Sbjct: 511 IFEACQFAVSMS 522
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 38/54 (70%), Gaps = 3/54 (5%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
AW V + +A+W +QV+K AP+GE+L G+FM+ GK+N++ P +LA+GI
Sbjct: 524 AWSEGVYSADAYWAYPNQVTKKAPSGEYLGKGAFMVYGKRNWYHGIPLKLAVGI 577
>gi|150399105|ref|YP_001322872.1| hypothetical protein Mevan_0351 [Methanococcus vannielii SB]
gi|150011808|gb|ABR54260.1| protein of unknown function DUF814 [Methanococcus vannielii SB]
Length = 680
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW V + + +WV DQ+SKTA +GE+L G+F+IRGK+NF L +G+
Sbjct: 558 AWKLGVGSADVYWVMPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALDLGVGIF 612
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 110/257 (42%), Gaps = 46/257 (17%)
Query: 64 PSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
P ++ H+ P I+ +ESF A+DE+FS I K ++Q + KL VK+
Sbjct: 262 PISLKIHEKIPENEIKNYESFLVALDEYFSRF----IIKKEIKQAETKINKL--VKKQER 315
Query: 121 TRLSALEQTQ------LVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
S LE + ++++EK +LI N VD + IR ++ W I+ + +
Sbjct: 316 ILNSQLETKEKYEKQSVLNQEKGDLIYANYMDVDEILSTIRS-AREKMDWNAIKEVINKN 374
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
+ H I+ ++ +++N N + S L D + + + V
Sbjct: 375 KDH-----QILGKI-ISVNE-----------KNAEISLKLSLDYGNGIIEKN-------V 410
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS---TEKA---LKSAEKKTKQTLKDVQTM 288
+DL +A+ +A F+ + K I++ +EK LK EK + L++ +
Sbjct: 411 VLDLRKNAFESADDFYQKSKKFKNKVSGVIEALKISEKKLNELKEKEKTDSEVLREKEEN 470
Query: 289 TNINKARKVYWFEKFYW 305
+ + + W+EK W
Sbjct: 471 IKKKEKKLLKWYEKLKW 487
>gi|336122066|ref|YP_004576841.1| Fibronectin-binding A domain-containing protein
[Methanothermococcus okinawensis IH1]
gi|334856587|gb|AEH07063.1| Fibronectin-binding A domain protein [Methanothermococcus
okinawensis IH1]
Length = 684
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 33/46 (71%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ +WVK +Q+SKTA +GE+L G+F+IRGK+NF L +GI +
Sbjct: 572 DVYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRSVPLELGIGIV 617
>gi|448512226|ref|ZP_21616340.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
9100]
gi|448520849|ref|ZP_21618182.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
10118]
gi|445694546|gb|ELZ46671.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
9100]
gi|445702985|gb|ELZ54924.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
10118]
Length = 735
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 64/289 (22%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF-------STAESHKIDLKAVQQE-RDALKKLEN 114
TP + +H+ P F+SF AAVDE+F + D A + + + + K E
Sbjct: 275 TPFPLAEHEDLPSVGFDSFNAAVDEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQER 334
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + + E+ ++E+AEL+ + + VD I +R+ N++ W++IE +
Sbjct: 335 IIEQQKGAIEGFEEQAQAERERAELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAG 394
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A + + +T+ L E V
Sbjct: 395 AERGIPAAEAVVGVDGGEGTVTVEL-------------------------EEEGDDGGTV 429
Query: 235 DVDLDLS--AYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLKDVQTM- 288
V+LD S NA R + + KR KK+ ++ I+ST + L++ K+ K+ ++ Q
Sbjct: 430 TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREELEAV-KERKREWEEQQAAD 488
Query: 289 ------------------------TNINKARKVYWFEKFYWFISSENYL 313
++I WFE+F WF +S YL
Sbjct: 489 DGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWFHTSTGYL 537
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F C++A+G+
Sbjct: 609 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVACRIAVGV 659
>gi|408381973|ref|ZP_11179520.1| fibronectin-binding A domain-containing protein [Methanobacterium
formicicum DSM 3637]
gi|407815421|gb|EKF86006.1| fibronectin-binding A domain-containing protein [Methanobacterium
formicicum DSM 3637]
Length = 711
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 118/266 (44%), Gaps = 42/266 (15%)
Query: 70 HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--RDALKKLENVKRDHETRLSALE 127
+Q F + FE+F A DEF+S D+K VQ++ + K E R E L +
Sbjct: 310 YQNFHKERFETFNQAADEFYSGKVG--ADIKKVQEDIWAKEVGKYEKRLRIQEETLEKFQ 367
Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
+T + K K LI ++ + +L I + SW +I +++K+A++ A II+
Sbjct: 368 KTIVETKRKGNLIYSHYSEIQ-NLLDIIHQAREKFSWMEIASKLKKARKEGMVQAQIIQS 426
Query: 188 L-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
+ K+ + +TL L ET V VD +L NA
Sbjct: 427 MDKMGV--LTLNL-------------------------EGET-----VTVDANLEIPENA 454
Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAE-KKTKQTLKDVQTMTNINKARK-VYWFEKFY 304
+++++ + A +K + + E+ K E K+ K+ L + + RK + WFEK
Sbjct: 455 EKYYNKGKKAKRKIKGVNMAIERTKKDVERKRNKRELALERVRVPQKRVRKELKWFEKLR 514
Query: 305 WFISSENYLGNVAWDAKVVTNAWWVK 330
WF+SS+ +L DA TN VK
Sbjct: 515 WFLSSDGFLVIGGRDAG--TNEMVVK 538
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 32/45 (71%), Gaps = 2/45 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ +WV DQVSKT +GEF+ G+F+IRG +N+ P ++A+GI
Sbjct: 595 DVYWVHPDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIAVGI 639
>gi|269865201|ref|XP_002651841.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
gi|220063780|gb|EED42216.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
Length = 142
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 31/41 (75%)
Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
QVSKTAP+GEFL GSFMI+GKKN P +L G+ +F++
Sbjct: 8 QVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 48
>gi|84489327|ref|YP_447559.1| RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
gi|84372646|gb|ABC56916.1| predicted RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
Length = 666
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+A+WV DQVSKT +GEFL G+F+IRGKKNF P +A+G+
Sbjct: 541 DAYWVTLDQVSKTPQSGEFLKKGAFVIRGKKNFIRNVPVLIAIGV 585
Score = 45.1 bits (105), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 53/263 (20%), Positives = 108/263 (41%), Gaps = 38/263 (14%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P + ++ F + FE+F A DEF+S I K + + K E + E
Sbjct: 247 VPITLNKYAQFKSKSFETFNMAADEFYSKKIVSDIKNKEEKLWAKRIGKFEKRLKMQEET 306
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PV 181
L +T + K + I + + I I Q N SW++I + +K++++ P
Sbjct: 307 LEGFYKTIEDKQHKGDTIYAHYNEIQQIINVIHQAREN-YSWKEIGSIIKKSKKEGKIPE 365
Query: 182 ASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
+I+ + K+ + ++ L + V +D ++
Sbjct: 366 LEMIESIDKMGVINLKL--------------------------------DDTHVQIDSNI 393
Query: 241 SAYANAKRFFDLKRSAAKK---QQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
+ +++++ + A +K K I++T+ +K E K + ++ ++ + R++
Sbjct: 394 GIPESTEKYYNKGKKAKRKIDGVNKAIENTKSEIKKLEDKKEVAIELLRQKQEKREKREL 453
Query: 298 YWFEKFYWFISSENYLGNVAWDA 320
W+EK WFIS + YL DA
Sbjct: 454 KWYEKLRWFISRDGYLVIGGRDA 476
>gi|296109018|ref|YP_003615967.1| Fibronectin-binding A domain protein [methanocaldococcus infernus
ME]
gi|295433832|gb|ADG13003.1| Fibronectin-binding A domain protein [Methanocaldococcus infernus
ME]
Length = 666
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 32/46 (69%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ +WVK DQ+SK A +GE+L G+F+IRGK+N+ L +GI L
Sbjct: 553 DTYWVKPDQISKRAESGEYLKRGAFVIRGKRNYIRNVPLELGIGVL 598
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 59/274 (21%), Positives = 124/274 (45%), Gaps = 39/274 (14%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P +R+++ + + F SF A+DE+F+ + K + ++K E++ R
Sbjct: 257 VPIELRKYKDYEKRYFNSFYEALDEYFAKFLTSVEIKKEKSKLEKEIEKQESILRRQLET 316
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L A E+ ++ K +LI +N + V+ + AIR ++ WE+++ +++ + H P+
Sbjct: 317 LKAYEEEVRKNQIKGDLIYSNYQLVEEILNAIRV-AKDKKGWEEVKRVIRENKEH--PII 373
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I E N K ++ ++D + K V +D+ S
Sbjct: 374 KLI------------------EGVNEKKGEIIVRLSSD-----LDGKIEERVVLDIRKST 410
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSA--EKKTKQTLKDV----QTMTNINKARK 296
+ NA+ +++ AKK + I+ +KA++ + + + + +DV + ++
Sbjct: 411 FENAESYYN----KAKKFKSKIEGIKKAIEMSKKKLEELKKKRDVEIEEKKALKKKVKKE 466
Query: 297 VYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
W+EKF W + + N+L DA +TN +K
Sbjct: 467 RKWYEKFKWTVIN-NFLVIAGKDA--ITNEIIIK 497
>gi|374632982|ref|ZP_09705349.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
yellowstonensis MK1]
gi|373524466|gb|EHP69343.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
yellowstonensis MK1]
Length = 602
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 84/198 (42%), Gaps = 68/198 (34%)
Query: 234 VDVDLDLSAYANAKRFFDLKR---SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
V++D +LS A +F+ + S A++ ++TI +K ++ + K ++T ++ +++
Sbjct: 324 VEIDPNLSLTRVASSYFERAKELESKARRAEETIAELKKKVEELKLKLRET-EESKSLV- 381
Query: 291 INKARKVYWFEKFYWFISSENYL------------------------------------- 313
RK W+EK+ W + NYL
Sbjct: 382 ---IRKKEWYEKYRWSFTRNNYLVIAGRDVDQNESLVKKMLGEEEIFLHADIQGAPATII 438
Query: 314 --------GNVAWDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
G++ +DA VV + AW WV QVSK+ P+GE+L GSFMI
Sbjct: 439 KDSKGVQEGDI-YDAAVVAACYSKAWKLGLGSVDVFWVYGSQVSKSPPSGEYLPKGSFMI 497
Query: 352 RGKKNFFPPCQLAMGISF 369
GKKNF +L + I
Sbjct: 498 YGKKNFIKNVRLELAIGL 515
>gi|222479900|ref|YP_002566137.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
49239]
gi|222452802|gb|ACM57067.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
49239]
Length = 733
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 607 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 657
Score = 38.1 bits (87), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 54/289 (18%), Positives = 109/289 (37%), Gaps = 58/289 (20%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF-----STAESHKIDLKAVQQERD---ALKKLEN 114
TP + +H+G P F+SF AAVDE+F +E + A D + K E
Sbjct: 267 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLDRDGSEEGEAPADASPSRPDFEEEIGKQER 326
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + + E+ ++E+AEL+ + VD + +++ ++ W++I +
Sbjct: 327 IVEQQQGAIEGFEEQAEAERERAELLYAEYDLVDEVLSTVQEAREAEVPWDEIAETLDAG 386
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A + + +T+ LR + ++ + +
Sbjct: 387 AEQGIPAAETVVDVDGGEGTVTVELRGGDGEDDDGET--------------------TRI 426
Query: 235 DVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLK-------- 283
++D NA R + + KR KK+ + I+ST L++ +++ +
Sbjct: 427 ELDASAGVEVNADRLYQEAKRIEGKKEGAMEAIKSTRAELEAVKERKAEWEAKEAAADET 486
Query: 284 -------------------DVQTMTNINKARKVYWFEKFYWFISSENYL 313
D + ++I W+++F WF +S YL
Sbjct: 487 AGDGADDGEEEEDGEEYQTDWLSRSSIPIRSPDDWYDRFRWFYTSTGYL 535
>gi|292656996|ref|YP_003536893.1| hypothetical protein HVO_2883 [Haloferax volcanii DS2]
gi|448293595|ref|ZP_21483700.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
gi|291371020|gb|ADE03247.1| conserved protein [Haloferax volcanii DS2]
gi|445570456|gb|ELY25019.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
Length = 702
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/281 (21%), Positives = 112/281 (39%), Gaps = 65/281 (23%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +R+HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ + E + EQ ++E+AEL+ N + VD + +R + W+DI A ++
Sbjct: 313 QRIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLE 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +D+ + NA R + + KR KK+ I+ T + L + +K+ + D
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDGDED 462
Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +WFE+F WF +S YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF++RG + +F P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 625
>gi|433431126|ref|ZP_20407596.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
gi|448568141|ref|ZP_21637718.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
gi|448601017|ref|ZP_21656300.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
gi|432194170|gb|ELK50822.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
gi|445727091|gb|ELZ78705.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
gi|445734620|gb|ELZ86178.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
Length = 702
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/281 (21%), Positives = 112/281 (39%), Gaps = 65/281 (23%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +R+HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ + E + EQ ++E+AEL+ N + VD + +R + W+DI A ++
Sbjct: 313 QRIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLE 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +D+ + NA R + + KR KK+ I+ T + L + +K+ + D
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDGDDD 462
Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +WFE+F WF +S YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF++RG + +F P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 625
>gi|448454957|ref|ZP_21594359.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
21995]
gi|445814337|gb|EMA64302.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
21995]
Length = 736
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 610 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 660
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 63/308 (20%), Positives = 121/308 (39%), Gaps = 60/308 (19%)
Query: 42 NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKA 101
SED A+ + P ++ TP + +H+G P F+SF AAVDE+F E D
Sbjct: 255 GSED---ADGDGDPRVVD--VTPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEREDGDAGE 309
Query: 102 VQQE--------RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILA 153
+ + + K E + + E+ ++E+AEL+ + VD +
Sbjct: 310 APADASPSRPEFEEEIAKQERIIEQQRGAIEGFEEQAEAERERAELLYARYDLVDEVLST 369
Query: 154 IRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWV 213
+R+ N++ W++I ++ P A + + +T+ L +R +D
Sbjct: 370 VREARENEVPWDEIAETLEAGAERGIPAAEAVADVDGGEGTVTVEL----DREGGED--- 422
Query: 214 LHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKA 270
E+ V++D NA R + + KR KK+ + I+ST +
Sbjct: 423 ------------GESGDSVRVELDASTGVEVNADRLYQEAKRIEGKKEGAMEAIESTRRE 470
Query: 271 LKSAEKKTK--QTLKDVQ-----------------------TMTNINKARKVYWFEKFYW 305
L++ E++ + ++ + ++I W+++F W
Sbjct: 471 LEAVEERKAEWEAMEAADDGDGDGGDSEDEDDEEEYETDWLSRSSIPIRSPDDWYDRFRW 530
Query: 306 FISSENYL 313
F +S YL
Sbjct: 531 FHTSTGYL 538
>gi|448528898|ref|ZP_21620278.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
700873]
gi|445710346|gb|ELZ62165.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
700873]
Length = 740
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 614 DGRGAGDAYMVDPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 664
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 62/292 (21%), Positives = 112/292 (38%), Gaps = 59/292 (20%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
TP + +H+ P F+SF AAVDE+F S D +A DA + K
Sbjct: 269 TPFPLAEHEDLPSVGFDSFNAAVDEYFYRLGSE--DAEAGDAPADASASRPDFEGEIAKQ 326
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ + E + E+ ++E+AEL+ N + VD + +R+ +++ W++IE +
Sbjct: 327 QRIIEQQEGAIEGFEEQAQAERERAELLYANYDLVDEVLSTVREARESEVPWDEIEETLD 386
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
P A + + +T+ L D + AD
Sbjct: 387 AGAERGIPAAEAVVDVDGGEGTVTVEL----------------ADESGDDADDEGGANGG 430
Query: 233 LVDVDLDLS--AYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKK---------- 277
++LD S NA R + + KR KK+ I+ST + L++ +++
Sbjct: 431 TTRIELDASEGVEVNADRLYQEAKRVEEKKEGAMAAIESTREELEAVKERKAEWEEQQAA 490
Query: 278 ----------------TKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
++ D + +I W+++F WF +S YL
Sbjct: 491 NDGSGQGDDGDDGADDEEEYETDWLSRASIPIRSPDNWYDRFRWFHTSTGYL 542
>gi|448448413|ref|ZP_21591226.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
13561]
gi|445814829|gb|EMA64787.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
13561]
Length = 736
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/288 (21%), Positives = 117/288 (40%), Gaps = 61/288 (21%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF-------STAESHKIDLKAVQQE-RDALKKLEN 114
TP + +H+ P F+SF AAVDE+F + D A + + + + K E
Sbjct: 275 TPFPLAEHEDLPSVGFDSFNAAVDEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQER 334
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + + E+ ++E+AEL+ + + VD I +R+ N++ W++IE +
Sbjct: 335 IIEQQKGAIEGFEEQAQAERERAELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAG 394
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A + + ++ T+ + + +E D+ A V
Sbjct: 395 AERGIPAAEAV--VGVDGGEGTVTVELEEE-----------GDDGGTMA----------V 431
Query: 235 DVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLKDVQTM--- 288
++D NA R + + KR KK+ ++ I+ST + L++ K+ K+ ++ Q
Sbjct: 432 ELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREELEAV-KERKREWEEQQAADDG 490
Query: 289 -----------------------TNINKARKVYWFEKFYWFISSENYL 313
++I WFE+F WF +S YL
Sbjct: 491 SGGDGGDNEGEEDEEYETDWLARSSIPIRSPDDWFERFRWFHTSTGYL 538
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F C++A+G+
Sbjct: 610 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVACRIAVGV 660
>gi|302347972|ref|YP_003815610.1| fibronectin-binding protein [Acidilobus saccharovorans 345-15]
gi|302328384|gb|ADL18579.1| Predicted fibronectin-binding protein [Acidilobus saccharovorans
345-15]
Length = 647
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 3/56 (5%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGISF 369
AW + + +W QVSK+ P+GE+LT GSFM+ GKKN+ P +L +GI+
Sbjct: 501 AWKEGMASVSVYWAYGSQVSKSPPSGEYLTKGSFMVYGKKNYLRPLRLELYLGIAL 556
>gi|448470211|ref|ZP_21600408.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
gi|445808289|gb|EMA58361.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
Length = 735
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 609 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRVAVGV 659
Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 64/292 (21%), Positives = 113/292 (38%), Gaps = 68/292 (23%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
TP + +H+G P F+SF AAVDE+F E+ D + A + K E
Sbjct: 273 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEAEDTDAGEAPADASASRPDFEEEIAKQER 332
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + E+ ++E+AEL+ + VD + +++ ++ W++I +
Sbjct: 333 IIEQQRGAIEGFEEQAEAERERAELLYAEYDLVDEVLSTVQEAREAEVPWDEIAETLDAG 392
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A + + +T+ L D+ D DS V
Sbjct: 393 ADRGIPAAEAVVDVDGGEGTVTVEL-----------------DDED--GDS--------V 425
Query: 235 DVDLDLSAY--ANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKK------------ 277
V+LD SA NA R + + KR KK+ + I+ST + L++ +++
Sbjct: 426 RVELDASAGVEVNADRLYQEAKRIEGKKEGAMEAIESTRRELEAVKERKAEWEAKEAAAD 485
Query: 278 ----------------TKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
++ D + ++I WFE+F WF +S YL
Sbjct: 486 ETPGGGGDGDGDDDADDEEYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 537
>gi|332796292|ref|YP_004457792.1| hypothetical protein Ahos_0606 [Acidianus hospitalis W1]
gi|332694027|gb|AEE93494.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 566
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 317 AWDAKVV-TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
AW + + +WV QVSK+ P GE+L GSFMI GKKN+ +L + I F
Sbjct: 433 AWKLGLAAVDVFWVYGSQVSKSPPAGEYLPKGSFMIYGKKNYIKSVKLNLAIGF 486
>gi|448439536|ref|ZP_21588100.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
1137]
gi|445691070|gb|ELZ43265.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
1137]
Length = 733
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
D + +A+ V+ DQVSKT +GE++ GSF+IRG + +F PC++A+G+
Sbjct: 607 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRVAVGV 657
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 62/289 (21%), Positives = 108/289 (37%), Gaps = 66/289 (22%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
TP + +H+G P F+SF AAVDE+F E + D + A + K E
Sbjct: 275 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEHEESDAGEAPTDASASRPDFEEEIAKQER 334
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + + E+ ++E+AEL+ + VD + +++ N + W++I +
Sbjct: 335 IIEQQKGAIEGFEEQAEAERERAELLYAEYDLVDEVLSTVQEARENDVPWDEIAETLDAG 394
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A + + +T+ L R V
Sbjct: 395 AERGIPAAEAVVDVDGGEGTVTVELGEDDTR----------------------------V 426
Query: 235 DVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKS-----AEKKTKQTLKDVQ 286
++D NA R + + KR KK+ + I+ST + L++ AE K K+ D +
Sbjct: 427 ELDASAGVEVNADRLYQEAKRIEGKKEGAMEAIESTRQDLEAVKERKAEWKAKEAADDEE 486
Query: 287 ----------------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++I WFE+F WF +S YL
Sbjct: 487 GGSDAGGGEGDEGEEEYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 535
>gi|435850617|ref|YP_007312203.1| putative RNA-binding protein, snRNP like protein
[Methanomethylovorans hollandica DSM 15978]
gi|433661247|gb|AGB48673.1| putative RNA-binding protein, snRNP like protein
[Methanomethylovorans hollandica DSM 15978]
Length = 664
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 120/282 (42%), Gaps = 53/282 (18%)
Query: 52 EFHPMILQPGS---------TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAV 102
E P I+ G P + ++ + + SF A+DE+F E+ I +A
Sbjct: 239 ELDPCIIMKGEGDTAQSIDVVPFELEVYRELTKERYPSFNKALDEYFGKREAASITEQAF 298
Query: 103 QQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAA--ILAIRQDIAN 160
+++ + LE R E + + AE I N ++V+ +LAI +D
Sbjct: 299 SVKKEKVDLLERRLRQQEEAVEKYGKESEKHTSIAETIYANYQAVEDVLKVLAIARD--K 356
Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT--LLLRVHKERSNIKDSWVLHHDN 218
SW+ I++ +K A + + P A I L+I+ T ++L + ++NI
Sbjct: 357 GYSWDQIKSTIKAA-KDSVPAAKSI----LSIDSATGIVVLDLMGMKTNI---------- 401
Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
D+ + NA+ +++ + AKKQ+ I+S E+ + +KK
Sbjct: 402 ------------------DVTKTVPQNAQVYYERSKKLAKKQEGAIRSIEQTKLAMQKKE 443
Query: 279 KQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
K + T+ I K W+++F WF+SS+ +L DA
Sbjct: 444 KTATRKRGTV-RIKKQ----WYDRFRWFVSSDGFLVIGGRDA 480
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 31/45 (68%), Gaps = 2/45 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ +WV QVSKT +GE++ GSF+IRG++N+ P +A+GI
Sbjct: 544 DCYWVNPTQVSKTPESGEYVKKGSFIIRGERNYLKDVPVGVAVGI 588
>gi|399576519|ref|ZP_10770274.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
gi|399237963|gb|EJN58892.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
Length = 706
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 58/294 (19%), Positives = 114/294 (38%), Gaps = 62/294 (21%)
Query: 52 EFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFFS----TAESHKIDLKAV 102
EF P + + TP + +++G F+SF AA+D++F + E+ K A
Sbjct: 241 EFDPRVYEKDGNVVDVTPFPLEEYEGLDSAAFDSFNAALDDYFFRLDLSDEAEKGGGGAE 300
Query: 103 QQERDALKKLENVKR---DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
D +++E KR E + E+ ++EKAEL+ N E D + +R
Sbjct: 301 ANRPDFQEEIEKQKRIIQQQEGAIEGFEEQAQEEREKAELLYANYELADEVLSTVRGARE 360
Query: 160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
+ W DI + + P A ++ + + +T+ + +
Sbjct: 361 ENIPWADIADTLAEGAEQGIPAAEAVEDVDGSTGTVTITIDGQR---------------- 404
Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEK 276
+D+D+ + NA R + + KR KK + I++T + L++ EK
Sbjct: 405 --------------IDLDVSMGVEKNADRIYTEAKRVEEKKAGALEAIENTREKLEAVEK 450
Query: 277 KTKQTLKDVQ-----------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ + + +I + W+++F WF +S+ +L
Sbjct: 451 RRDEWEASDDEPDEDEDDEEKPDIDWLSRNSIPIRNQDKWYDRFRWFETSDGFL 504
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V ADQVSKT +GE++ GSF++RG + ++ ++A+G+
Sbjct: 576 EGRFADDAYMVSADQVSKTPESGEYVEKGSFVVRGDRTYYEDVAAEVAVGL 626
>gi|448659123|ref|ZP_21683091.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
gi|445760625|gb|EMA11882.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
Length = 717
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 60/263 (22%), Positives = 116/263 (44%), Gaps = 31/263 (11%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---------STAESHKIDLKAVQQERDALKKLE 113
TP+ + +++ + F F A+D++F E+ + D +A ++K E
Sbjct: 275 TPTPLAEYEELYSESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEA------EIEKQE 328
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
+ + E + E V++EKAEL+ N + VD + ++ + +SW+DIEA+ +
Sbjct: 329 RIIQQQEQAIEDFEADAEVEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDE 388
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKK 230
+A+ + L+ + T+ L + R + D++ NAD A E KK
Sbjct: 389 GADRG--IAAAEAVVSLDGSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKK 445
Query: 231 PSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
L+A N + DL+ ++++ E + E + + T D +M +
Sbjct: 446 EGA------LAAIENTRE--DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQS 495
Query: 291 INKARKVYWFEKFYWFISSENYL 313
I W+E+F WF +S+ +L
Sbjct: 496 IPTRSTERWYEQFRWFHTSDGFL 518
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F +RG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641
>gi|330835774|ref|YP_004410502.1| hypothetical protein Mcup_1916 [Metallosphaera cuprina Ar-4]
gi|329567913|gb|AEB96018.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
Length = 508
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 60/255 (23%), Positives = 103/255 (40%), Gaps = 77/255 (30%)
Query: 185 IKQLKLNINHITLLLRVHKERSN---IKDSWVLHHDNADPFADSSETKKPS-------LV 234
IK+L++ I+ + +L K+ ++ +K ++ + P+ ++ + PS +
Sbjct: 181 IKKLEVTISKLYEILNQTKQEADLNRVKGRLIMENY---PYVETMMSNGPSEIEINGVKI 237
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
++D S NA +FD AK+ ++ I+ TE+ + E+K + L + +K
Sbjct: 238 EIDPSKSIAKNAALYFD----KAKELEEKIKKTEETIVELERKKQDLLSKTKEEIESSKV 293
Query: 295 --RKVYWFEKF--------YWFISSEN----------YLGN------------------- 315
RK WFEK+ Y I+ + +LG+
Sbjct: 294 LIRKREWFEKYHWTITKNGYIVIAGRDIDQNESLVKKFLGDDDIFLHADIQGAPATVIKS 353
Query: 316 -----------VAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGK 354
A A + AW WV QVSK+ P+GE+L GSFMI GK
Sbjct: 354 PNSISDEDLLDAATLAASYSKAWKLGLGSIDVFWVYGKQVSKSPPSGEYLPKGSFMIYGK 413
Query: 355 KNFFPPCQLAMGISF 369
KNF +L + +
Sbjct: 414 KNFIKNVKLELTVGI 428
>gi|389848295|ref|YP_006350534.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
gi|448618500|ref|ZP_21666737.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
gi|388245601|gb|AFK20547.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
gi|445746871|gb|ELZ98329.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
Length = 701
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 59/280 (21%), Positives = 115/280 (41%), Gaps = 64/280 (22%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +++HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLQEHQNAGLDEEAYDTFNEALDEYF-----FRLDLTADEQEATSNRPDFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + + + ++ ++E+AEL+ N + VD + +R+ + W+DI +
Sbjct: 313 ERIIEQQKQAIEGFDEQANEERERAELLYANYDLVDDVLSTVREAREQGVPWDDIAVTLD 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPAAEAVTNVDGANGTVTIKL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +D+ + NA R + + KR KKQ I+ T + L++A+++ + D Q
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIQEKKQGALAAIEDTREELEAAKRRRDEWEADDQEDE 462
Query: 287 -------------TMTNINKARKVYWFEKFYWFISSENYL 313
++ ++ +WFE+F WF +S YL
Sbjct: 463 SDEDEEPEETDWLSLDSVPVKSTEHWFERFRWFHTSSGYL 502
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF+IRG + +F P ++A+GI
Sbjct: 574 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRRYFEDVPAKVAVGI 624
>gi|126466189|ref|YP_001041298.1| hypothetical protein Smar_1299 [Staphylothermus marinus F1]
gi|126015012|gb|ABN70390.1| protein of unknown function DUF814 [Staphylothermus marinus F1]
Length = 663
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 2/53 (3%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
AW+ + +WV A QVSKT P+GE+L+ G+FMI GKKN+ +LA+GI
Sbjct: 521 AWNEGFGAIDVFWVHASQVSKTPPSGEYLSKGAFMIYGKKNYVKTKLELAIGI 573
Score = 38.1 bits (87), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 8/90 (8%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM---TN 290
+ +D+ L + N R+ L KK ++ E+AL+ EK ++ + Q + T
Sbjct: 382 IPIDIRLDTWNNILRYKKLSGELLKKAKRA----EEALRELEKSLEEAVNKKQLIEKKTE 437
Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDA 320
I ++ W+E+F+W I+SE +L DA
Sbjct: 438 IGIKPRL-WYERFHWMITSEGFLVIAGRDA 466
>gi|408405775|ref|YP_006863758.1| hypothetical protein Ngar_c31850 [Candidatus Nitrososphaera
gargensis Ga9.2]
gi|408366371|gb|AFU60101.1| hypothetical protein with domain of unknown function DUF814 and
fibronectin-binding A protein [Candidatus Nitrososphaera
gargensis Ga9.2]
Length = 661
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW D +A+WV +QV K APTG+FL GSF+I GK+N+ ++ + I +
Sbjct: 516 AWKDGLSSADAYWVMPEQVKKGAPTGQFLPKGSFVIEGKRNYLKGVEIRLAIGIV 570
>gi|448688255|ref|ZP_21694088.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
gi|445779316|gb|EMA30246.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
Length = 717
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 56/263 (21%), Positives = 115/263 (43%), Gaps = 31/263 (11%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---------STAESHKIDLKAVQQERDALKKLE 113
TP + +++ + F F +A+D++F E+ + D +A ++K +
Sbjct: 275 TPIPLSEYEELYSESFTEFNSALDDYFFNFQREEEVEGGETQRPDFEA------EIEKQK 328
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
+ + E + E V++EKAEL+ N + VD + ++ + +SW+DIEA+ +
Sbjct: 329 RIIQQQEQAIEDFEADAEVEREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFDE 388
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKK 230
+A+ + L+ + T+ L + R + D++ NAD A E KK
Sbjct: 389 GADRG--IAAAEAVVSLDGSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKK 445
Query: 231 PSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
L+A N + DL+ ++ + + + + + T D +M +
Sbjct: 446 EGA------LAAIENTRE--DLEEVKERRDEWEADDGDDETDEDQSEDEPT--DWLSMQS 495
Query: 291 INKARKVYWFEKFYWFISSENYL 313
+ +W+E+F WF +S+ +L
Sbjct: 496 VPTRSTEHWYEQFRWFHTSDGFL 518
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F IRG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVAVGIT 641
>gi|161527567|ref|YP_001581393.1| hypothetical protein Nmar_0055 [Nitrosopumilus maritimus SCM1]
gi|160338868|gb|ABX11955.1| protein of unknown function DUF814 [Nitrosopumilus maritimus SCM1]
Length = 652
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW + +A+WV DQV K+AP+GEFL GSF I G++NF L + + +
Sbjct: 507 AWREGMYGVSAYWVNPDQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGII 561
>gi|73669087|ref|YP_305102.1| hypothetical protein Mbar_A1575 [Methanosarcina barkeri str.
Fusaro]
gi|72396249|gb|AAZ70522.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 797
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 35/55 (63%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
+ +WVKA+QVSKT +GE++ G+F+IRG++N+F L + + K E I
Sbjct: 599 DCYWVKAEQVSKTPESGEYVKKGAFIIRGERNYFKDIPLGVAVGLELKGETRVIG 653
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 62/287 (21%), Positives = 119/287 (41%), Gaps = 37/287 (12%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERD-ALKKLENVKRDHETR 122
P + ++ GF + F+SF A+DEFF +I+ ++++ L E E
Sbjct: 311 PFDLTRYSGFEKEYFDSFNTALDEFFGKKALEQIEEVKAAKKKEKTLGVYERRLLQQEGS 370
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L E+ + AE + N + ++ + + + SW++I + +KQA++ P A
Sbjct: 371 LKKFEKEIEKNNTLAETVYANYQGIEELLSVLNGARSTGYSWDEIRSILKQAKK-TVPAA 429
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
I NI+ T + V+ F S + +D+ +
Sbjct: 430 QKI----TNIDPRTGTVTVN-------------------FDGKS-------ISLDIRKTV 459
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
NA+ +++ + KK+ +++ E K+ EKK + + + +RK +W+++
Sbjct: 460 PQNAQEYYEKVKKFNKKKDGALKAIEDTRKAMEKKAVAKVA--KAGRKLRASRKKHWYDR 517
Query: 303 FYWFISSENYLGNVAWDAKV---VTNAWWVKADQVSKTAPTGEFLTT 346
F WF+SS+ + DA + + K D V T G LT
Sbjct: 518 FRWFVSSDGFFIVGGRDADTNEEIFKKYLEKRDLVFHTQTPGAPLTV 564
>gi|327401161|ref|YP_004342000.1| fibronectin-binding A domain-containing protein [Archaeoglobus
veneficus SNP6]
gi|327316669|gb|AEA47285.1| Fibronectin-binding A domain protein [Archaeoglobus veneficus SNP6]
Length = 637
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 150/361 (41%), Gaps = 117/361 (32%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P ++ + G + F +F A+DE+++ S + Q+E + LKKL+ RL
Sbjct: 241 PIELQIYDGLERKYFPTFNEALDEYYARRIS-----EVKQEESEELKKLK-------ARL 288
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA-RVKQAQRHNDPVA 182
+ QL K++ E N E AA A+ ++ QL + +EA R + Q+ D +
Sbjct: 289 ----EKQLETKKEFE---NEMERYRAAGDAVYENY--QLLEQILEAFRQARQQKSWDEIK 339
Query: 183 SIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
I++ KL+ L++ +H E K+S V+ N P ++L+
Sbjct: 340 KIVRAHPKLS----KLVVEIHPE----KNSVVV---NIGP---------------KIELA 373
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM------TNINKAR 295
N + D+ AKK ++ ++ KA++ KTK+ ++ V+ + + AR
Sbjct: 374 LDKNLPQIADVYYERAKKVRQKLEGLLKAIE----KTKEEMQRVEELEAKKYVKGLRVAR 429
Query: 296 KVYWFEKFYWFISSENYL----GNVAWDAKVVT--------------------------- 324
K WFE+F WFI+S+ +L N A + ++V+
Sbjct: 430 KREWFERFRWFITSDGFLVIGGRNAAMNEEIVSKYMEPKDLFFHTQTPGAPATVLKLGQE 489
Query: 325 ----------------NAWW-----------VKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
+A W VK +QV + A GE+L GSF I GK+N+
Sbjct: 490 APETSIIEAAQFAATYSALWKEGKYSGEVYYVKPEQVKRAAKHGEYLARGSFYIEGKRNY 549
Query: 358 F 358
Sbjct: 550 L 550
>gi|20089538|ref|NP_615613.1| hypothetical protein MA0651 [Methanosarcina acetivorans C2A]
gi|19914450|gb|AAM04093.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length = 788
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 35/55 (63%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
+ +W+KA+QV+KT +GE+L G+F+IRG++N+F L + + K E I
Sbjct: 575 DCYWIKAEQVTKTPESGEYLKKGAFVIRGERNYFKDIPLGVAVGLELKGETRVIG 629
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 63/288 (21%), Positives = 121/288 (42%), Gaps = 37/288 (12%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKI-DLKAVQQERDALKKLENVKRDHET 121
P + ++ F + F+SF A+DEFF ++ ++KA +++ L E E
Sbjct: 286 VPFDLIRYSEFEKEYFDSFNTALDEFFGKKALEQVAEVKAAEKKEKTLGVYERRLLQQEE 345
Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
L+ + + AE++ N + ++ + A SW++I + +KQA++ P
Sbjct: 346 SLAKFGKEIEKNNTLAEIVYANYQLIEELFSVLNGARAKGYSWDEIRSILKQAKK-TVPA 404
Query: 182 ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
A I NI+ T + V + N V++D+ +
Sbjct: 405 AQKI----TNIDQKTGTVTVDLDGRN--------------------------VNLDIRKT 434
Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFE 301
NA+ +++ + +KK+ +++ E+ K+ EKK + + RK +W++
Sbjct: 435 VPQNAQEYYEKVKKFSKKRDGALKAIEETKKAMEKKAASKAA--KAGRKLQAFRKKHWYD 492
Query: 302 KFYWFISSENYLGNVAWDAKV---VTNAWWVKADQVSKTAPTGEFLTT 346
+F WF+SS+ +L DA + + K D V T G LT
Sbjct: 493 RFRWFVSSDGFLVVGGRDADTNEEIFKKYLEKRDIVFHTQTPGAPLTV 540
>gi|297527127|ref|YP_003669151.1| hypothetical protein Shell_1151 [Staphylothermus hellenicus DSM
12710]
gi|297256043|gb|ADI32252.1| protein of unknown function DUF814 [Staphylothermus hellenicus DSM
12710]
Length = 663
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 33/44 (75%), Gaps = 1/44 (2%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
+ +WV A+QVSKT P+GE+L G+FMI GKKN+ +LA+GI
Sbjct: 530 DVFWVYANQVSKTPPSGEYLPKGAFMIYGKKNYVKTKLELAIGI 573
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 45/83 (54%), Gaps = 8/83 (9%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD---VQTMTN 290
+ +D+ L+A+ N R+ L KK +K E+AL+ E+ ++ + +Q T
Sbjct: 382 IPIDIRLNAWNNILRYKKLSGELLKKAKKA----EEALRELERSLEEAVNKKQLIQRRTE 437
Query: 291 INKARKVYWFEKFYWFISSENYL 313
I ++ W+E+F+W I+SE +L
Sbjct: 438 IGIKPRL-WYERFHWMITSEGFL 459
>gi|261403479|ref|YP_003247703.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
vulcanius M7]
gi|261370472|gb|ACX73221.1| Fibronectin-binding A domain protein [Methanocaldococcus vulcanius
M7]
Length = 670
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 34/46 (73%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ +WVK +Q+SKTA +GE+L G+F+IRG+++++ L +GI L
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVL 603
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 132/283 (46%), Gaps = 37/283 (13%)
Query: 56 MILQPGS----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
+IL+ G P +++++ + +ESF AVD++F+ ++ K + +++
Sbjct: 249 IILKDGEHYDVVPVNLKKYEDLEKKYYESFLDAVDDYFAKFLTNVEVKKKKSKIEKEIER 308
Query: 112 LENV-KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170
EN+ KR ET L ++ ++ K +LI N + V+ + AI+Q ++ W I+
Sbjct: 309 QENILKRQLET-LERYKKDAEKNQIKGDLIYANYQIVENLLSAIKQ-AREKMDWARIKKI 366
Query: 171 VKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKK 230
VK+ + H P+ +++ ++ NI I + L+ AD +
Sbjct: 367 VKENKDH--PILDLVEDIRENIGEIIVRLK----------------------ADVGDKTI 402
Query: 231 PSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQT 287
+ +D+ +A NA+RF++ + K + I+ T+K ++ +KK ++TL +
Sbjct: 403 EERIPLDIRKNASENAERFYEKAKKLKHKVEGIKTAIELTKKKIEELKKKEEKTLGEEIP 462
Query: 288 MTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
K ++ W+EKF W + + +L DA +TN +K
Sbjct: 463 EMKKKKRKERKWYEKFKWTVIN-GFLVIAGKDA--ITNEILIK 502
>gi|407461558|ref|YP_006772875.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
koreensis AR1]
gi|407045180|gb|AFS79933.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
koreensis AR1]
Length = 651
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 66/176 (37%), Gaps = 63/176 (35%)
Query: 257 AKKQQKTIQSTE----KALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
AKKQ I S E K LK EK +T + ++ +++ RK W+E++ WF +S+ +
Sbjct: 387 AKKQSGAISSIEEIKSKTLKKLEKLQNKTDSEKDSVL-VSEIRKKNWYERYRWFYTSDGF 445
Query: 313 L------------------------------------------------GNVAWDAKVVT 324
L VA +
Sbjct: 446 LVIGGRDAASNSAVVRKHLAKNDKIFHGDIFGSPFFIIKDAQNAPDTSMNEVAHATVCFS 505
Query: 325 NAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW WV +QV K+AP+GEFL GSF I G++NF L + + +
Sbjct: 506 RAWREGMYGVSAYWVNPEQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGII 561
>gi|268323401|emb|CBH36989.1| conserved hypothetical protein containing fibronectin-binding
protein A N-terminal domain, DUF814 family [uncultured
archaeon]
gi|268324037|emb|CBH37625.1| conserved hypothetical protein containing fibronectin-binding
protein A N-terminal (FbpA) domain and DUF814 domain
[uncultured archaeon]
Length = 631
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/197 (21%), Positives = 80/197 (40%), Gaps = 63/197 (31%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKT---IQSTEKALKSAEKKTKQTLKDVQTMTN 290
+++D +S + NA ++D + KK++ I+ T++ +++ ++K + +++
Sbjct: 360 LEIDTSVSLHKNAGAYYDKAKVFRKKREGVEPAIEMTKEKIRTEKEKEVRIEEELIPTKK 419
Query: 291 INKARKVYWFEKFYWFISSENY-------------------------------------- 312
+ K W+EKF WF +S+ +
Sbjct: 420 EVRTEKEEWYEKFRWFETSDGFLVVGGKDATTNEILAKKHMEPNDLFFHTQAEGAPVVIA 479
Query: 313 -----------LGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
L +A A +N W V +QVSKT P+GE++ GSFM+
Sbjct: 480 KAGGKEISESGLKEIAQFAASYSNLWKYGFYEGECYCVVGEQVSKTPPSGEYIKKGSFMV 539
Query: 352 RGKKNFF-PPCQLAMGI 367
RGK+ +F L +GI
Sbjct: 540 RGKRKYFKAELGLCIGI 556
>gi|448666601|ref|ZP_21685246.1| fibronectin-binding A domain-containing protein [Haloarcula
amylolytica JCM 13557]
gi|445771732|gb|EMA22788.1| fibronectin-binding A domain-containing protein [Haloarcula
amylolytica JCM 13557]
Length = 717
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 113/248 (45%), Gaps = 31/248 (12%)
Query: 78 FESFAAAVDEFF---------STAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQ 128
F F A+D++F E+ + D +A ++K + + + E + E
Sbjct: 290 FTEFNPALDDYFFNFQREEEVEGGETQRPDFEA------EIEKQKRIIQQQEQAIEDFEA 343
Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
V++EKAEL+ N + VD + ++ A+ +SW+DIEA+ + +A+ +
Sbjct: 344 DAEVEREKAELLYANYDLVDDVLSTVQAARADDVSWDDIEAKFNEGADRG--IAAAEAVV 401
Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYAN 245
L+ + T+ L + R + D++ NAD A E KK L+A N
Sbjct: 402 SLDGSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIEN 454
Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
+ DL+ A K+++ ++ + ++ E + + D +M +I +W+E+F W
Sbjct: 455 TRE--DLE--AVKERRDEWEADDGDDEADEDEGEDEPTDWLSMQSIPTRSTEHWYEQFRW 510
Query: 306 FISSENYL 313
F +S+ +L
Sbjct: 511 FHTSDGFL 518
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F IRG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPAGIAVGIT 641
>gi|118577090|ref|YP_876833.1| RNA-binding protein [Cenarchaeum symbiosum A]
gi|118195611|gb|ABK78529.1| RNA-binding protein [Cenarchaeum symbiosum A]
Length = 631
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 37/54 (68%), Gaps = 3/54 (5%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF--FPPCQLAMGI 367
AW +A +A+WV +QV K AP+G+FL GSF+I G++NF P +LA+G+
Sbjct: 496 AWREAMYGLSAYWVLPEQVKKAAPSGQFLPKGSFVIEGRRNFVKIPTLRLAVGL 549
>gi|354610742|ref|ZP_09028698.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
gi|353195562|gb|EHB61064.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
Length = 745
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 117/271 (43%), Gaps = 48/271 (17%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER-DALKKLENVKRDHET 121
TP +R+ + P F+ F A+D++F+ ++ + + R D ++E KR E
Sbjct: 305 TPVPLREREELPADAFDRFNDALDDYFTNLDTTEEEESGEAVSRPDFEAEIEKQKRIIEQ 364
Query: 122 RLSAL---EQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
+ A+ EQ ++EKAEL+ N + VD I + W+DI R ++A +
Sbjct: 365 QQQAIDDFEQQAEAEREKAELLYGNYDLVDELIGVVADARGAGHGWQDIAERFEEAA-GD 423
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
P A + + +N + T+ +R+ H DP +S K +
Sbjct: 424 VPGADVF--VGVNESEGTVRVRIDD-----------HTIELDP--ESGVEKNADRI---- 464
Query: 239 DLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ--------------TLKD 284
Y AKR + + A + I++T L+SA+++ ++ L D
Sbjct: 465 ----YTEAKRIEEKQEGA----RAAIENTRGDLESAKQRREEWEAEPDEQESEADDELAD 516
Query: 285 VQ--TMTNINKARKVYWFEKFYWFISSENYL 313
V + ++I + W+E+F WF +SE +L
Sbjct: 517 VDWLSRSSIPIRNQEQWYERFRWFRTSEGFL 547
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGIS 368
D + +A+ V DQVSKT +GE+L G F IRG + +F P + A+GI+
Sbjct: 619 DGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAIRGDRTYFRDLPAEWAVGIA 670
>gi|218883339|ref|YP_002427721.1| hypothetical protein DKAM_0025 [Desulfurococcus kamchatkensis
1221n]
gi|218764955|gb|ACL10354.1| protein of unknown function DUF814 [Desulfurococcus kamchatkensis
1221n]
Length = 659
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 69/184 (37%), Gaps = 68/184 (36%)
Query: 251 DLKRSAA---KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----INKARKVYWFEK 302
DL R +K + ++ TE A K E+ Q L + N I K +W+E+
Sbjct: 392 DLSRQVIELERKHGELVRKTESAKKVLEE-MHQQLNTISISMNTEEKTIRKPSPTFWYER 450
Query: 303 FYWFIS------------SEN------YLG------------------------------ 314
F+W + S+N YLG
Sbjct: 451 FHWLFTRNGFLAIGGRDQSQNELVVRKYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV 510
Query: 315 NVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQL 363
+ ++ A + AW WV QVSKT P GE+L G+FM+ G KN+ P +L
Sbjct: 511 DASYLAACYSKAWKAGFSYIEVYWVPGRQVSKTPPPGEYLPRGAFMVYGSKNYLQVPLRL 570
Query: 364 AMGI 367
+GI
Sbjct: 571 GIGI 574
>gi|256811227|ref|YP_003128596.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
fervens AG86]
gi|256794427|gb|ACV25096.1| Fibronectin-binding A domain protein [Methanocaldococcus fervens
AG86]
Length = 671
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 34/46 (73%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ +WVK +Q+SKTA +GE+L G+F+IRG+++++ L +GI +
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVI 603
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 44/209 (21%), Positives = 95/209 (45%), Gaps = 29/209 (13%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++++ G + + SF AVD++F+ ++ + K + ++K EN+ +
Sbjct: 259 VPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLTNIVVKKEKSKIEREIEKQENILKRQMDT 318
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L ++ ++ K +LI N + V+ + AIRQ ++ W I+ V++ + H P+
Sbjct: 319 LKKYKEDAEKNQIKGDLIYANYQIVEELLSAIRQ-AREKMDWARIKKIVRENKEH--PIL 375
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
+I+ + N+ I + L +S + D + V +D+ +A
Sbjct: 376 GLIENINENVGEIVIRL-----KSEVDDKVIEER-----------------VSLDIRKNA 413
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
+ NA+ +++ AKK + I+ E A+
Sbjct: 414 FENAENYYE----KAKKLKNKIEGIENAI 438
>gi|170291097|ref|YP_001737913.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
cryptofilum OPF8]
gi|170175177|gb|ACB08230.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
cryptofilum OPF8]
Length = 624
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 114/284 (40%), Gaps = 80/284 (28%)
Query: 148 DAAILAIRQDIANQLSW-EDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERS 206
D I ++R++I Q+S E+ E Q +R D + S NI+ + L + RS
Sbjct: 277 DERISSLRREIERQISLKEEYERTYAQLRRIGDTILS-------NIHEVEEAL--GRARS 327
Query: 207 NIKDSWVLHHD-NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
+ + V D + S E ++ + +D+ SA NA ++D AKK ++
Sbjct: 328 GEEHALVKRVDWKSGKVIISLEGEE---IQLDIRRSASENASEYYD----KAKKAREKAL 380
Query: 266 STEKALKSAEKKTKQ---TLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKV 322
+KAL + ++ KQ +L++ + + ++ W+EKF WF +S L DA+
Sbjct: 381 RIDKALSNIMERLKQIESSLEERKLELSPKPRKRERWYEKFRWFYTSSGNLVICGRDAQT 440
Query: 323 ------------------------------------------------VTNAW------- 327
+ AW
Sbjct: 441 NSEIVSKYMDDKDLFFHVDMPGGAVVVLKVEREVDQRSIEQAAVAAASFSRAWKEGLSYA 500
Query: 328 ---WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
+VK +QVSK AP G +L GSF I GK+N+ +L++GI
Sbjct: 501 DVYYVKGEQVSKHAPPGMYLPKGSFYITGKRNYLRAKLELSVGI 544
>gi|126178886|ref|YP_001046851.1| hypothetical protein Memar_0936 [Methanoculleus marisnigri JR1]
gi|125861680|gb|ABN56869.1| protein of unknown function DUF814 [Methanoculleus marisnigri JR1]
Length = 632
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/381 (20%), Positives = 145/381 (38%), Gaps = 129/381 (33%)
Query: 55 PMILQPGSTPSVMRQHQGFPIQE-FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
P+I G P V+ G ++E F +F+ A+D F+ K + A + L + E
Sbjct: 234 PVITASGCWPVVL---AGDEVRERFATFSEALDAFYPKTVGGKEEAAAGKPR---LSQAE 287
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
++R + E+ ++ E+I N +V A I+A + + SW++IE +K+
Sbjct: 288 VIRRRQAEAIKGFEKKIERNQRIVEVIYENYTAV-AGIIATLDEASKNRSWQEIEKILKE 346
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
++P A +++ + P+
Sbjct: 347 --NGDNPAAKMVRAIH-----------------------------------------PAD 363
Query: 234 VDVDLDLSA-----YA------NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL 282
VD+DLS Y N R++D + KK+ + + E+ + + +TK+ L
Sbjct: 364 AAVDVDLSGERVKIYVHETIEQNLGRYYDQIKKFKKKKTGALAAMERTVPE-KPRTKRNL 422
Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYL----------------------------- 313
+ K R W+ +F WF +S+ L
Sbjct: 423 P-------LQKKR---WYHRFRWFTTSDGTLVIGGRDASQNEELVKKYMEGGDLFVHADV 472
Query: 314 --GNV----------------------AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGS 348
G+V AW A T + + + DQVSKTA +GE++ G+
Sbjct: 473 HGGSVVIVKGTTEHMDEAVRFAASYSNAWKAGHFTADVYAARPDQVSKTAESGEYVARGA 532
Query: 349 FMIRGKKNFF--PPCQLAMGI 367
F++RG++ +F P +A+G+
Sbjct: 533 FIVRGERQYFRNAPLGVAIGL 553
>gi|407465827|ref|YP_006776709.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
gi|407049015|gb|AFS83767.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
Length = 648
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 35/54 (64%), Gaps = 3/54 (5%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
AW + +A+WV DQ+ K+AP+GEFL GSF I G++NF +LA+GI
Sbjct: 504 AWREGMYGVSAYWVYPDQIKKSAPSGEFLPKGSFTIEGQRNFIKSDTLRLAVGI 557
>gi|452210388|ref|YP_007490502.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
gi|452100290|gb|AGF97230.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
Length = 775
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 35/55 (63%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
+ +W+K++QV+KT +GE+L G+F+IRG++N+F L + + K E I
Sbjct: 573 DCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKDVPLGIAVGLELKGETRIIG 627
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 62/273 (22%), Positives = 113/273 (41%), Gaps = 37/273 (13%)
Query: 78 FESFAAAVDEFFST-AESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
F+SF A+DEFF A ++K +++ L E E L+ E+ +
Sbjct: 299 FDSFNTALDEFFGKKALEQVAEVKEAEKKEKTLGVFERRLMQQEESLAKFEKEIEKNNAL 358
Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
AE + N + ++ + A SW++I + +KQA++ P A I NI+ T
Sbjct: 359 AETVYANYQIIEELFSVLNGARAKGYSWDEIRSILKQAKK-TVPAAQTI----TNIDQKT 413
Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
+ V+ + +I ++D+ + NA+ +++ +
Sbjct: 414 GTVTVNLDGKSI--------------------------NLDIRKTVPQNAQEYYEKVKKF 447
Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNV 316
KK+ I++ E K+ EKK + + +RK +W+++F WF+SS+ +L
Sbjct: 448 TKKKDGAIRAIEDTKKAMEKKAATKSA--KAGRKLQASRKKHWYDRFRWFVSSDGFLVVG 505
Query: 317 AWDAKV---VTNAWWVKADQVSKTAPTGEFLTT 346
DA + + K D V T G LT
Sbjct: 506 GRDADTNEEIFKKYMEKRDIVFHTQTPGAPLTV 538
>gi|387219995|gb|AFJ69706.1| hypothetical protein NGATSA_2054800, partial [Nannochloropsis
gaditana CCMP526]
Length = 94
Score = 53.1 bits (126), Expect = 2e-04, Method: Composition-based stats.
Identities = 24/54 (44%), Positives = 36/54 (66%)
Query: 268 EKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK 321
EKA+K+AE++ +L Q ++ RK YWFEKF+WFI+S+N+L DA+
Sbjct: 2 EKAVKAAERQAAASLSKQQRKRTLSVVRKPYWFEKFHWFITSDNHLVVSGRDAQ 55
>gi|289192132|ref|YP_003458073.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
FS406-22]
gi|288938582|gb|ADC69337.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
FS406-22]
Length = 671
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 34/46 (73%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ +WVK +Q+SKTA +GE+L G+F+IRG+++++ L +GI +
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVI 603
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 54/255 (21%), Positives = 110/255 (43%), Gaps = 45/255 (17%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++++ G + + SF AVD++F+ K + +++ EN+ +
Sbjct: 259 VPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLVKVEVKKEKSKFEREIERQENILKRQLGT 318
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND--P 180
L ++ ++ K +LI N + V+ + AIRQ ++ W AR+K+ R N P
Sbjct: 319 LKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQ-AREKMDW----ARIKKIIRENKEHP 373
Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
+ +I+ + N+ I + L+ S + D+ + V +D+
Sbjct: 374 ILGLIENINENVGEIVVRLK-----SEVDDNVIEER-----------------VSLDIRK 411
Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALK----------SAEKKTKQTLKDVQTMTN 290
+A+ NA+ +++ AKK + I+ E A++ ++ + + +Q
Sbjct: 412 NAFENAESYYE----KAKKLRNKIEGIENAIELTKKKIDELKKKGEEELKEKESIQMKKK 467
Query: 291 INKARKVYWFEKFYW 305
+ K RK W+EKF W
Sbjct: 468 VRKERK--WYEKFKW 480
>gi|390937875|ref|YP_006401613.1| putative RNA-binding protein [Desulfurococcus fermentans DSM 16532]
gi|390190982|gb|AFL66038.1| putative RNA-binding protein, snRNP like protein [Desulfurococcus
fermentans DSM 16532]
Length = 659
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 70/184 (38%), Gaps = 68/184 (36%)
Query: 251 DLKRSAA---KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----INKARKVYWFEK 302
DL R +K+ + ++ TE A K E+ Q L + N I K +W+E+
Sbjct: 392 DLSRQVIELERKRGELVRKTESAKKVLEE-MHQQLNTISISMNTEEKTIRKPSPTFWYER 450
Query: 303 FYWFIS------------SEN------YLG------------------------------ 314
F+W + S+N YLG
Sbjct: 451 FHWLFTRNGFLAIGGRDQSQNELVVRKYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV 510
Query: 315 NVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQL 363
+ ++ A + AW WV QVSKT P GE+L G+FM+ G KN+ P +L
Sbjct: 511 DASYLAACYSKAWKAGFSYIEVYWVSGRQVSKTPPPGEYLPRGAFMVYGSKNYLQVPLRL 570
Query: 364 AMGI 367
+G+
Sbjct: 571 GIGV 574
>gi|91773364|ref|YP_566056.1| hypothetical protein Mbur_1391 [Methanococcoides burtonii DSM 6242]
gi|91712379|gb|ABE52306.1| FbpA, DUF814 containing protein [Methanococcoides burtonii DSM
6242]
Length = 663
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 34/47 (72%), Gaps = 2/47 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
+ +W+K +QVSKT +GE+L GSF+IRG++N++ P +A+G+
Sbjct: 543 DCYWIKPEQVSKTPESGEYLKKGSFIIRGERNYYKDVPVGVAIGLDL 589
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 55/278 (19%), Positives = 108/278 (38%), Gaps = 52/278 (18%)
Query: 48 VANMEFHPMILQP---------GSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKID 98
+ N E +P I++ P + Q+ + + SF A+DEFF S ++
Sbjct: 235 IVNHELNPCIIKKTVKDELQAVDVLPLELTQYSDAEKEFYPSFNKALDEFFGKKASEEVI 294
Query: 99 LKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
+ V ++++ E R + + E AE I N ++V+ + +
Sbjct: 295 EQVVAKKKEKEDVFERRLRKQQEAILKFETDSTRYTLIAESIYGNYQTVEEVLSVLEAAR 354
Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDN 218
SW+DI +K+A + P A I + + + L V N++ +
Sbjct: 355 DKGYSWKDIWDTLKKA-KDTLPAAKAIVSIDPAEGSVVVDLDVVNANINVRKTI------ 407
Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAE 275
P NA+ +++ + +KK+ I+ T++A++ E
Sbjct: 408 ------------PQ------------NAQMYYNKAKKISKKRDGALIAIEDTKRAMQKRE 443
Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
+K + K V K +W+++F WF SS+ +L
Sbjct: 444 QKVSKRRKAV---------FKKHWYDRFRWFFSSDGFL 472
>gi|167044451|gb|ABZ09127.1| putative domain of unknown function (DUF814) [uncultured marine
crenarchaeote HF4000_APKG6D9]
Length = 648
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 72/178 (40%), Gaps = 71/178 (39%)
Query: 257 AKKQQKTIQS-------TEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
AKKQ + I S TEK L+ + KT ++ +D+ +T I RK W+E++ WF ++
Sbjct: 384 AKKQSRAISSIEEIKSKTEKKLEKFQNKT-ESEQDIMLVTEI---RKKSWYERYRWFYTT 439
Query: 310 ENYLGNVAWDAK------------------------------------------------ 321
+ YL DA
Sbjct: 440 DGYLAVGGRDAASNSAVVRKHLVKNDKIFHADIFGSPFFIIKDAEHAPATSMDEVAHATV 499
Query: 322 VVTNAW-----WVKA-----DQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGI 367
+ AW VKA +QV K+AP+GEFL GSF I G++NF +LA+GI
Sbjct: 500 CFSRAWREGLYGVKAYWIHPEQVKKSAPSGEFLPKGSFTIEGQRNFINSKNLKLAVGI 557
>gi|15669822|ref|NP_248636.1| hypothetical protein MJ_1625 [Methanocaldococcus jannaschii DSM
2661]
gi|42559938|sp|Q59020.1|Y1625_METJA RecName: Full=Uncharacterized protein MJ1625
gi|1592339|gb|AAB99643.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
2661]
Length = 671
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 35/45 (77%), Gaps = 2/45 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
+ +WVK +Q+SKTA +GE+L G+F+IRG+++++ P +L +G+
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGVGV 602
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 54/253 (21%), Positives = 106/253 (41%), Gaps = 41/253 (16%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P +++++G + + SF AVD++F+ + + K + +++ EN+ R
Sbjct: 259 VPIDLKKYKGLEKKYYNSFLEAVDDYFAKFLTKVVVKKEKSKIEKEIERQENILRRQLET 318
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
L ++ ++ K +LI N + V+ + AIRQ ++ W AR+K+ R N
Sbjct: 319 LKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQ-AREKMDW----ARIKKIIRENK--- 370
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
H L L + + + L + D + V +D+ +A
Sbjct: 371 ----------EHPILGLIENINENIGEIIIRLKSEVDDKVIEER-------VSLDIRKNA 413
Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALK----------SAEKKTKQTLKDVQTMTNIN 292
+ NA+ +++ AKK + I+ E A++ ++ + + +Q I
Sbjct: 414 FENAESYYE----KAKKLRNKIEGIENAIELTKKKIEELKKKGEEELKEKESMQMKKKIR 469
Query: 293 KARKVYWFEKFYW 305
K RK W+EKF W
Sbjct: 470 KERK--WYEKFKW 480
>gi|448612034|ref|ZP_21662464.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
gi|445742795|gb|ELZ94289.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
Length = 701
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/280 (20%), Positives = 110/280 (39%), Gaps = 64/280 (22%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +++HQ G + +E+F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLQEHQNAGLDEEAYETFNEALDEYF-----FRLDLTADEQEATSNRPDFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + E + +Q ++E+AEL+ N + D + +R + W++I +
Sbjct: 313 ERIIEQQEQAIEGFDQQADEERERAELLYANYDLADDVLSTVRDAREQGVPWDEIAVTLD 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L +
Sbjct: 373 EGADQGIPAAEAVTNVDSANGTVTVEL------------------------------DGT 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +D+ + NA R + + KR KK+ I+ T + L++A+++ + D
Sbjct: 403 SVTLDVSMGVEKNADRLYTEAKRIQEKKEGALAAIEDTREELEAAKRRRDEWEADDGGGD 462
Query: 287 -------------TMTNINKARKVYWFEKFYWFISSENYL 313
++ ++ +WFE+F WF +S YL
Sbjct: 463 ADEDDEPEETDWLSLESVPVKSTEHWFERFRWFYTSSGYL 502
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF+IRG + +F P ++A+GI
Sbjct: 574 EGRFADDAYMVEPSQVSKTPESGEYIDKGSFVIRGDRRYFEDVPAKVAVGI 624
>gi|124027973|ref|YP_001013293.1| hypothetical protein Hbut_1105 [Hyperthermus butylicus DSM 5456]
gi|123978667|gb|ABM80948.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
Length = 672
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
AW A + + + +WV QVSK AP GE+L G+FM+ GK+N+ ++ + I + E
Sbjct: 521 AWKAGIGSVDVYWVWGSQVSKAAPAGEYLAKGAFMVYGKRNYIRNVEVKIAIGIGLEDEY 580
Query: 376 SSI 378
S +
Sbjct: 581 SPV 583
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 100/262 (38%), Gaps = 53/262 (20%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK-----LENVKRD 118
P + GF + F + A DE+F T V E +A +K LE +R+
Sbjct: 253 PRGLAARYGFEYRAFNDPSTAYDEYFLTVAREAAGASTVAAEIEAERKKLLASLEAARRN 312
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
E L + + E AE++ N V A+ R+ + WE I
Sbjct: 313 LEHLRKKLRELE----ELAEIVSTNIADVYDAVECARK-MRETAGWEQIPGNCPGVV-DV 366
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
+P IIK + + N + + +R+ +P + VDL
Sbjct: 367 EPNKGIIK-ISIVGNIVPIDIRM----------------------------EPGRLVVDL 397
Query: 239 DLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY 298
KR ++ R+ ++ +K ++ E L E+K +Q L + M R+
Sbjct: 398 -------YKRIGEV-RAKIERGEKAVKDIEARLAELEEKVRQRLLRARAMV-----RRKE 444
Query: 299 WFEKFYWFISSENYLGNVAWDA 320
W+EK++W I+S YL DA
Sbjct: 445 WYEKYHWVITSHGYLAIGGRDA 466
>gi|288932692|ref|YP_003436752.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
10642]
gi|288894940|gb|ADC66477.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
10642]
Length = 646
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 161/389 (41%), Gaps = 107/389 (27%)
Query: 48 VANMEFHP-MILQPGST----PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAV 102
V N F+P +IL+ G P +++++G+ + FESF AVDEF++ + +I++K
Sbjct: 220 VENCNFNPHIILKDGDYVDYQPIDLKKYEGYEKKYFESFNKAVDEFYTRSALKEIEVK-- 277
Query: 103 QQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQL 162
+++ + ++KLEN RL + QL KE+ E +ES + I I +
Sbjct: 278 EKKSEVIEKLEN-------RL----KIQLETKERYE-----RES--EKLRRIGDLIYEKY 319
Query: 163 SW-EDIEARVKQA--QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
E I + +K+A + D V I+ + K +L + KE+ + VL D+
Sbjct: 320 PIVERIHSALKKAVELKGFDEVKKILAEQKKAGKLKEILDIIPKEK-----AVVLSIDD- 373
Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
K +D +L + NA+ ++D + +K +++ EK + + +
Sbjct: 374 --------VKFKLFLDKNL----HENAEYYYDQAKKLKEKVNGIVKAIEKTREEIRRAEE 421
Query: 280 QTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-------------------------- 313
K + ++ R+ W+EK+ W+I+SE +L
Sbjct: 422 IEAKKI--LSEFRVVRRREWYEKYRWYITSEGFLVIGGRNAEMNEEIVSKHFESKDLFFH 479
Query: 314 -----GNV----------------AWDAKVVTNAWW-----------VKADQVSKTAPTG 341
G V A + + +A W V +QV + A G
Sbjct: 480 TQTPGGAVTILKRGLEAGEKSIKEAAEFAAIYSALWKHGMHSGEVYYVTYEQVKRAAKPG 539
Query: 342 EFLTTGSFMIRGKKNFFP-PCQLAMGISF 369
E+L GSF I GK+N+ + A+G+
Sbjct: 540 EYLPKGSFYIVGKRNYLSVELKAAIGVDL 568
>gi|257053989|ref|YP_003131822.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
12940]
gi|256692752|gb|ACV13089.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
12940]
Length = 707
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/278 (21%), Positives = 115/278 (41%), Gaps = 57/278 (20%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKR---DH 119
TP + +++ P + F+SF A++ +F E + + D ++E KR
Sbjct: 260 TPVPLVEYEDTPGESFDSFNDALEAYFLGLEQEPDEEETGSNRPDFEAEIEKQKRIIQQQ 319
Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
E + E+ ++EKAEL+ N + VD + ++ A + W++IEA + +
Sbjct: 320 EGAIEDFEEDAEAEREKAELLYANYDLVDEVLSTVQDARAAETPWDEIEATLSAGKDQGI 379
Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD 239
P A ++ + + +T+ I D HH +++D D
Sbjct: 380 PAAEAVRDVDGSEGTVTV---------QIDD----HH-----------------IELDAD 409
Query: 240 LSAYANAKRFF-DLKRSAAKK--QQKTIQSTEKALKSAEKKTK---------------QT 281
NA R + + KR KK ++ I +T + L++ +++ + +T
Sbjct: 410 TGVEKNADRLYQEAKRIEGKKAGAEEAIANTREQLEAVKQRREAWEASDGDDGGDGSGET 469
Query: 282 LKDVQ------TMTNINKARKVYWFEKFYWFISSENYL 313
+D Q T +I W+E+F WF +S+ +L
Sbjct: 470 HEDDQEDVDWLTRESIPIRTSEEWYERFRWFTTSDGFL 507
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
+ K + + V DQV+KT +GE+L GSF IRG + ++ P +A+GI+
Sbjct: 580 EGKYAGDVYCVGPDQVTKTPESGEYLEKGSFAIRGDRTYYDDTPVGVAVGIT 631
>gi|21227916|ref|NP_633838.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
gi|20906336|gb|AAM31510.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
Length = 343
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 35/55 (63%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
+ +W+K++QV+KT +GE+L G+F+IRG++N+F L + + K E I
Sbjct: 141 DCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKDVPLGIAVGLELKGETRIIG 195
>gi|386874769|ref|ZP_10116995.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
BD31]
gi|386807392|gb|EIJ66785.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
BD31]
Length = 539
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 35/54 (64%), Gaps = 3/54 (5%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGI 367
AW + +A+WV +QV K+AP+GEFL GSF I G++NF +LA+GI
Sbjct: 395 AWREGMYGVSAYWVNPEQVKKSAPSGEFLPKGSFTIEGQRNFIKSDTLRLAVGI 448
>gi|410670434|ref|YP_006922805.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
gi|409169562|gb|AFV23437.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
Length = 664
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 34/55 (61%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
+ +WV +QVSKT +GE++ G+F+IRG++N+F Q+ M ++ E I
Sbjct: 544 DCYWVLPEQVSKTPESGEYVKKGAFIIRGERNYFRDVQVGMAVALELGAETRVIG 598
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/289 (20%), Positives = 112/289 (38%), Gaps = 45/289 (15%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
P ++ H G + F SF A+D FF + ++ +++ + E R E
Sbjct: 259 APFELKTHAGLEKEVFPSFNKALDGFFGKRSAEEVTEVVEAVKKEKVDVFERRLRKQEEA 318
Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
+ + + AE I + + ++ I + + N SW++I++ +K A + P A
Sbjct: 319 IENFGREAERHVDVAEKIYAHYQVIEDVIGVLEKARQNGYSWDEIKSILKGA-KETVPAA 377
Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
I + I L L K +D+ L+
Sbjct: 378 KSISSIDSATGRIVLDLEGTK------------------------------ATIDIKLTI 407
Query: 243 YANAKRFFDLKRSAAKKQQ---KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW 299
NA+ +++ + +K++ + I+ T A++ EKK + V K +W
Sbjct: 408 PQNAQSYYEKAKKLTRKKEGAIRAIEDTRVAMQKKEKKVSGNKRKVHM--------KKHW 459
Query: 300 FEKFYWFISSENYLGNVAWDAKV---VTNAWWVKADQVSKTAPTGEFLT 345
+++F WF SSE +L DA+ + + K+D V T G +T
Sbjct: 460 YDRFRWFYSSEGFLVVGGRDAETNEELVKKYMDKSDVVFHTQDPGAPMT 508
>gi|429192346|ref|YP_007178024.1| RNA-binding protein [Natronobacterium gregoryi SP2]
gi|448325749|ref|ZP_21515133.1| Fibronectin-binding A domain-containing protein [Natronobacterium
gregoryi SP2]
gi|429136564|gb|AFZ73575.1| putative RNA-binding protein, snRNP like protein [Natronobacterium
gregoryi SP2]
gi|445614570|gb|ELY68242.1| Fibronectin-binding A domain-containing protein [Natronobacterium
gregoryi SP2]
Length = 710
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 54/257 (21%), Positives = 109/257 (42%), Gaps = 16/257 (6%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
TP + +H+ G + ++SF A +D++F E + D ++R + + K E +
Sbjct: 264 TPFPLEEHEAEGLAAEAYDSFLAVLDDYFFRLELEEEDDSDPTEQRPDFEEEIAKYERII 323
Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
E + EQ +EKAEL+ VD + +R+ W++IE R ++ +
Sbjct: 324 EQQEGAIEGFEQQAEQLREKAELLYAEYGLVDEVLSTVREAREQDRPWDEIEERFEEGKE 383
Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDV 236
A + + + +T+ L + D + D A E KK
Sbjct: 384 RGIEAAKAVVDVDGSEGTVTVTLDGEHVELAVHDGVEQNADRLYKEAKDIEGKKEGA--- 440
Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARK 296
L+A + + DL+ AK+++ + ++ ++ + KD +M ++
Sbjct: 441 ---LAAIEDTRE--DLEE--AKRRRDQWEVDDEDDGDDDEIDEADSKDWLSMPSVPIREN 493
Query: 297 VYWFEKFYWFISSENYL 313
W+++F WF +S++YL
Sbjct: 494 EPWYDRFRWFYTSDDYL 510
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P +A+GI
Sbjct: 583 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 633
>gi|167043365|gb|ABZ08068.1| putative domain of unknown function (DUF814) [uncultured marine
crenarchaeote HF4000_ANIW141O9]
Length = 632
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
++A+WV DQV K AP+G+ + GSFMI G++NF L M ++ + K EES
Sbjct: 509 SSAYWVNPDQVKKGAPSGQSMAKGSFMIEGQRNFVKISTLKMCVA-IIKHEES 560
Score = 38.5 bits (88), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 6/89 (6%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
+ +DL+ S A F+ +KKQ+ I S EK L E + ++ ++ ++ +++
Sbjct: 363 IKIDLNSSLPTTASTLFN----ESKKQKAAIGSIEKLLIKTENELEKVIEKGESAKSVSF 418
Query: 293 -KARKVYWFEKFYWFISSENYLGNVAWDA 320
+ RK WFE++ WF +++ L D+
Sbjct: 419 TQVRKKNWFERYRWFYTTDGVLAVGGRDS 447
>gi|448565126|ref|ZP_21636097.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
gi|445715785|gb|ELZ67538.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
Length = 702
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 59/281 (20%), Positives = 110/281 (39%), Gaps = 65/281 (23%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +R+HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ + E + EQ ++E+AEL+ N + VD + +R + W+DI +
Sbjct: 313 QRIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLA 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +++ + NA R + + KR KK+ I+ T + L + +K+ + D
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDEDD 462
Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +WFE+F WF +S YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYL 503
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF+IRG + +F ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVAAKVAVGI 625
>gi|288560094|ref|YP_003423580.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
gi|288542804|gb|ADC46688.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
Length = 669
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 35/55 (63%), Gaps = 1/55 (1%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW T + +WV+ +QVSKT +GEF+ G+F+IRGK+N+ +L + I +
Sbjct: 548 AWSRNYGTQDVYWVEPEQVSKTPVSGEFVPKGAFIIRGKRNYIRGAKLEIAIGIV 602
>gi|374724028|gb|EHR76108.1| putative RNA-binding protein [uncultured marine group II
euryarchaeote]
Length = 723
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 80/345 (23%), Positives = 142/345 (41%), Gaps = 64/345 (18%)
Query: 1 NVSNEDVS-SDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE--DYIVANMEFHPM- 56
N++ +D + D+H L L+ +L QG L+ K E + A+ME + +
Sbjct: 206 NMATQDANVEDVHTA---LQTLLGHLGEQRQGILLMKPSDEMGVEALEQKAASMEANGLR 262
Query: 57 -----ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF--------FSTAESHKIDLKAVQ 103
+TP+++ H G +F + AVD + + E+ K+D+ A
Sbjct: 263 DRFFEAHASEATPTILPSHAGMAQAKFATLCEAVDAWKGAHDAGALARREAEKLDIAAPG 322
Query: 104 QERDA-LKKLENVKRDHETRLSA----LEQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
+ +++LE K E LS +E+ Q++ I NN V++ ++ + + I
Sbjct: 323 RGHSTDVERLERRKVQQEKALSGFSKKIEKQQMI----GHTIQNNWTHVESLLIQVTEAI 378
Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDN 218
+ W+++++ K SI + LN ERS + VL +N
Sbjct: 379 EAK-GWKEVKSMAK----------SIPWIVSLN----------PAERSFLS---VLPDEN 414
Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
+P K P + +D S + NA+RFF R K + + + E L ++
Sbjct: 415 GEP-------KGPQ-ATLSIDESVHQNAQRFFTAARKQKDKTKGAVDALEDTLLQLQRAQ 466
Query: 279 KQTLKDVQTMTNINKAR--KVYWFEKFYWFISSENYLGNVAWDAK 321
K+ K Q +NK + K WFE W + + +L DAK
Sbjct: 467 KKEAKQ-QATGKLNKIKRSKRLWFEHHRWSMITGGHLLVGGKDAK 510
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 30/44 (68%)
Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
+ VK QVSKTA TGEF+ GSF++RG++ +F + +GI +
Sbjct: 602 YSVKPAQVSKTAQTGEFVGKGSFIVRGQRQWFKDLDVQIGIGIV 645
>gi|307595006|ref|YP_003901323.1| hypothetical protein Vdis_0882 [Vulcanisaeta distributa DSM 14429]
gi|307550207|gb|ADN50272.1| protein of unknown function DUF814 [Vulcanisaeta distributa DSM
14429]
Length = 668
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 38/59 (64%), Gaps = 1/59 (1%)
Query: 323 VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF-LFKLEESSISR 380
V +A++V+ +QV+K AP+GE+L GSFMI G + + +L +GI + E+SI R
Sbjct: 528 VLDAFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWVRNAELGLGIGVRVDSFGEASIIR 586
>gi|48478297|ref|YP_024003.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
gi|48430945|gb|AAT43810.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
Length = 611
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW A + + A+WV QVSKT +GEF++ GS++IRGK+N+ L + I+ +
Sbjct: 489 AWPAGIGSGTAYWVYPSQVSKTPESGEFISKGSWVIRGKRNYIFDLPLKLKITVI 543
>gi|325958497|ref|YP_004289963.1| fibronectin-binding A domain-containing protein [Methanobacterium
sp. AL-21]
gi|325329929|gb|ADZ08991.1| Fibronectin-binding A domain protein [Methanobacterium sp. AL-21]
Length = 661
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
T+ +WV +QVSKT +GEF+ G+F+IRG +N+ P L++GI
Sbjct: 538 TDVYWVHPEQVSKTPQSGEFVAKGAFIIRGSRNYMRGLPLTLSLGI 583
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 120/284 (42%), Gaps = 41/284 (14%)
Query: 53 FHPMILQPGS---TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--RD 107
F+P I++ P + ++ F + FESF A DEF+S+ D+ V +E
Sbjct: 235 FNPQIVKGKKEDVLPLDLLMYKDFEKESFESFNDAADEFYSSIVGE--DIVNVNEEVWSG 292
Query: 108 ALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
+ K E L E+T K K E I ++ ++++ IL I SW +I
Sbjct: 293 EVGKFEKRLNIQLETLEKFEKTVKDSKIKGEAIYSDYQAIE-NILNIIHSARETNSWLEI 351
Query: 168 EARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS 226
A VK+A++ P II+ + K+ + L L + R NI DS + +NA+ + +
Sbjct: 352 IATVKKAKKDKVPGLEIIESIDKMGV----LTLNLDGVRVNI-DSSMGIPENAEIYYNKG 406
Query: 227 ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
+ K + V + + ++ K + K EK L +K+ K+ LK
Sbjct: 407 KKAKRKIKGVHIAIEK---------TRKEIDKAKNKREIEMEKVL-VPQKRVKKDLK--- 453
Query: 287 TMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
W+EK WF++S+ L DA TN VK
Sbjct: 454 ------------WYEKLRWFVTSDGLLAIGGRDA--TTNEMVVK 483
>gi|347524253|ref|YP_004781823.1| hypothetical protein Pyrfu_1716 [Pyrolobus fumarii 1A]
gi|343461135|gb|AEM39571.1| protein of unknown function DUF814 [Pyrolobus fumarii 1A]
Length = 668
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
AW + + +WV QVSK AP GE+LT G+FM+ G++N+ +L + I
Sbjct: 516 AWKEGLGAIDVYWVWGRQVSKAAPPGEYLTKGAFMVYGERNYIRGVELKLAIGL 569
>gi|11499620|ref|NP_070862.1| hypothetical protein AF2038 [Archaeoglobus fulgidus DSM 4304]
gi|2648497|gb|AAB89216.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
Length = 627
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 64/270 (23%), Positives = 119/270 (44%), Gaps = 46/270 (17%)
Query: 51 MEFHP---MILQPGS----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQ 103
M F P +IL+ G P + + + + FESF A+D++FS + +L++++
Sbjct: 205 MNFTPKPQIILKDGDYLDVVPMDLLYYSNYEKKYFESFNDALDDYFSKKLAEMDELESMK 264
Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
E L+KL+ + L E ++ + I N + V+ I A R + S
Sbjct: 265 SEE--LEKLKKRLEIQKESLRKFEDEAESFRKIGDAIYENYQMVEKIIEAFRA-ARERKS 321
Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
W++I K+ ++ + ++K +K N I + + IK S H+NAD +
Sbjct: 322 WDEI----KEIVARDEKLKKLVKAIKPEKNAIVVKVGDFDVELEIKKSI---HENADLY- 373
Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
Y AK+ R A+ ++ I++T + ++ E+K ++ L
Sbjct: 374 -------------------YEKAKK----AREKAEGVKRAIEATLREMERVEEKLEKKL- 409
Query: 284 DVQTMTNINKARKVYWFEKFYWFISSENYL 313
+T+I RK W+E + WF +SE +L
Sbjct: 410 ----VTSIKVRRKKEWYENYRWFFTSEGFL 435
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQLAMGI 367
+ K ++V +QVSK+A GE+L GSF I GK+N+ A+G+
Sbjct: 499 EGKHAGEVYYVLPEQVSKSAKAGEYLPKGSFYITGKRNYLTVELNCAVGV 548
>gi|430813961|emb|CCJ28738.1| unnamed protein product [Pneumocystis jirovecii]
Length = 441
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 28/39 (71%)
Query: 342 EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
E+ T G+FMI+GKKNF PP QL +G L+ ++E S +R
Sbjct: 3 EYSTVGTFMIQGKKNFLPPSQLILGYGILWTIDEVSKAR 41
>gi|330506586|ref|YP_004383014.1| hypothetical protein MCON_0325 [Methanosaeta concilii GP6]
gi|328927394|gb|AEB67196.1| protein of unknown function (DUF814) [Methanosaeta concilii GP6]
Length = 641
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + VK DQVSKT +GEFL G+F+IRG++ +F P +A+GI
Sbjct: 523 ADCYLVKGDQVSKTPESGEFLKKGAFVIRGERRYFRDVPLGIALGI 568
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 61/264 (23%), Positives = 106/264 (40%), Gaps = 74/264 (28%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAE---------SHKIDLKAVQQERDALKKLEN 114
P ++ + G + F +F+ A+D FF E H+I+L +R A+++ +
Sbjct: 250 PRPLKLYSGLEKKRFVTFSEALDAFFVEREKETTRQDPLEHRIEL-----QRKAIEEFRS 304
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
++ +LV K ELI SV+ + + A S+ I R+ +
Sbjct: 305 ------------QEAELV--RKGELIYQLYGSVEQILTLMNDARARGFSYNQIWERISGS 350
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
P A I L +RV F D E +
Sbjct: 351 ---GLPQAKTILSLDGRGE-----MRV--------------------FLDGEE------L 376
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQ---QKTIQSTE--KALKSAEKKTKQTLKDVQTMT 289
+++ +L+ NA+R++D + +K Q + TE KA K A KKT+ ++
Sbjct: 377 ELNAELAVPQNAQRYYDKAKDMVRKARGAQSALAITEELKAGKVAPKKTR-------AVS 429
Query: 290 NINKARKVYWFEKFYWFISSENYL 313
+ + RK W+E+F WF SS+ +L
Sbjct: 430 SYYRRRKPKWYERFRWFYSSDGFL 453
>gi|345005767|ref|YP_004808620.1| fibronectin-binding A domain-containing protein [halophilic
archaeon DL31]
gi|344321393|gb|AEN06247.1| Fibronectin-binding A domain protein [halophilic archaeon DL31]
Length = 717
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGIS 368
D + +A+ V DQVSKT +GE+L G F IRG + +F C++A+GI+
Sbjct: 589 DGRGAGDAYMVDPDQVSKTPESGEYLEKGGFAIRGDREYFNGINCEVAVGIT 640
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 55/261 (21%), Positives = 103/261 (39%), Gaps = 55/261 (21%)
Query: 78 FESFAAAVDEFFS--TAESHKIDLKAVQQERDALKKLENVKR---DHETRLSALEQTQLV 132
F+SF AA+DE+F T + + + ++ D +E KR E + E+
Sbjct: 287 FDSFNAALDEYFYRLTHQERREEEGGGRKRPDFEADIEKEKRIIQQQEGAIEGFEEEAEQ 346
Query: 133 DKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI 192
+ +AEL E VD + I+Q + W++I+ + Q P A + +
Sbjct: 347 RRREAELCYERYELVDEVLSTIQQARQQEHGWDEIQETLAQGAEQGIPAAEAVVDVNSAK 406
Query: 193 NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF-D 251
+T+ L H+ + +D + NA R + +
Sbjct: 407 GMVTIELDDHR------------------------------ITLDASMGVEKNADRLYRE 436
Query: 252 LKRSAAKKQ--QKTIQSTEKALKSAEKKTKQ-----------------TLKDVQTMTNIN 292
KR KK+ ++ I+ T K L++A+++ ++ D T +I
Sbjct: 437 AKRVEGKKEGAREAIEDTRKRLEAAKQRREEWEAEDDPEPEPDPDEEQEEVDWLTREDIP 496
Query: 293 KARKVYWFEKFYWFISSENYL 313
+ +W+E+F WF +S+ YL
Sbjct: 497 IRQPEHWYEEFRWFRTSDGYL 517
>gi|305663918|ref|YP_003860206.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
gi|304378487|gb|ADM28326.1| protein of unknown function DUF814 [Ignisphaera aggregans DSM
17230]
Length = 667
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 36/56 (64%), Gaps = 3/56 (5%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
AW + ++ + +WV Q+S + PTGE+L GS+M+ GKKN+ +LA+GI+
Sbjct: 524 AWKSGLLAIDVFWVYGSQISLSPPTGEYLPKGSYMVYGKKNYIKNVSLKLALGIAI 579
>gi|119719655|ref|YP_920150.1| hypothetical protein Tpen_0745 [Thermofilum pendens Hrk 5]
gi|119524775|gb|ABL78147.1| protein of unknown function DUF814 [Thermofilum pendens Hrk 5]
Length = 610
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
AW + + +WV A+QVSK P+GE+L GSFM+ GKKN+
Sbjct: 482 AWKGGLYAVDVFWVPAEQVSKKPPSGEYLAKGSFMVYGKKNY 523
>gi|448622787|ref|ZP_21669436.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
35960]
gi|445753295|gb|EMA04712.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
35960]
Length = 701
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/280 (20%), Positives = 110/280 (39%), Gaps = 64/280 (22%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +R+HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ + E + E+ ++E+AEL+ N + VD + +R + W+DI +
Sbjct: 313 QRIIDQQEGAIEGFEKQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLA 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +++ + NA R + + KR KK+ I+ T + L + +K+ + D
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDDDE 462
Query: 287 -------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +WFE+F WF +S YL
Sbjct: 463 EDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYL 502
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF++RG + +F P ++A+GI
Sbjct: 574 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 624
>gi|257077022|ref|ZP_05571383.1| hypothetical protein Faci_08161 [Ferroplasma acidarmanus fer1]
Length = 615
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 1/55 (1%)
Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW A + + A+WV QVSKT +GEF++ GS+++RGK+N+ L + +S +
Sbjct: 489 AWPAGIGSGTAYWVYPSQVSKTPESGEFVSKGSWIVRGKRNYVLNIPLELKVSIV 543
>gi|448633897|ref|ZP_21674396.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
29715]
gi|445750588|gb|EMA02026.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
29715]
Length = 717
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/264 (21%), Positives = 118/264 (44%), Gaps = 33/264 (12%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---------STAESHKIDLKA-VQQERDALKKL 112
TP + +++ + F F A+D++F E+ + D +A +++++ +++
Sbjct: 275 TPIPLAEYEELYSESFTEFNTALDDYFFNFQREEEVEGGETQRPDFEAEIEKQKRIIQQQ 334
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E D E A ++EKAEL+ N + VD + ++ + +SW+DIEA+
Sbjct: 335 EQAIEDFEADAEA-------EREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFD 387
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
+ A + + L+ + T+ L + R + D++ NAD A E K
Sbjct: 388 EGADRGIEAAEAV--VSLDGSEGTVTLDIEGTRVTV-DAFTGVEKNADELYKEAKRIEEK 444
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
K L+A N + DL+ A K+++ ++ + + E ++ D ++
Sbjct: 445 KEGA------LAAIENTRE--DLE--AVKERRDEWEADDGEDEVDEDGSEDEPTDWLSIQ 494
Query: 290 NINKARKVYWFEKFYWFISSENYL 313
+I W+E+F WF +S+ +L
Sbjct: 495 SIPTRSTERWYEQFRWFHTSDGFL 518
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F +RG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPVGVAVGIT 641
>gi|340345857|ref|ZP_08668989.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
gi|339520998|gb|EGP94721.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
Length = 638
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 35/54 (64%), Gaps = 3/54 (5%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF--FPPCQLAMGI 367
AW + +A+WV +QV K+AP+G+FL GSF I G++NF +LA+GI
Sbjct: 504 AWREGMYGVSAFWVNPEQVKKSAPSGQFLPKGSFTIEGQRNFVKISTLKLAVGI 557
>gi|448725341|ref|ZP_21707802.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
gi|445798677|gb|EMA49073.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
Length = 695
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/291 (19%), Positives = 116/291 (39%), Gaps = 55/291 (18%)
Query: 48 VANMEFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAV 102
+++ +F P I + TP + + F +EF+SF AA+D +F ++ + +
Sbjct: 237 LSSGDFDPRIYRDDGEPVDVTPFPLDERAEFDSEEFDSFTAALDAYFVELDTTEDEESGE 296
Query: 103 QQERDALKKLENVKR---DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
++ D +++E +R E + E ++E AE + N E VD + +R
Sbjct: 297 RERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYANYELVDEILTTVRNARE 356
Query: 160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
+ WE IE R + + A + ++ + +T+ + D+
Sbjct: 357 EGIGWEAIEERFAEGEERGIAAAGAVTGIEPSEGTVTIEI-----------------DDR 399
Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEK 276
D V++D NA R + + KR KK+ ++ + T + L++ E+
Sbjct: 400 D-------------VELDPQEGVEQNADRLYREAKRVVEKKEGAEEAVVETREELEAIER 446
Query: 277 KTKQTLKDVQ--------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ + + +I + W+E+F WF +S+ YL
Sbjct: 447 QRDEWEAGDVDDDPDEESEDVDWLSRRSIPTRKNEQWYERFRWFHTSDGYL 497
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
+ + +A+ + DQVSKT +GE+L G F IRG + +F +A+GI+
Sbjct: 569 NGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGIT 620
>gi|448602394|ref|ZP_21656450.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
BAA-897]
gi|445747909|gb|ELZ99363.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
BAA-897]
Length = 702
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/281 (20%), Positives = 110/281 (39%), Gaps = 65/281 (23%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
TP +R+HQ G + +++F A+DE+F ++DL A +QE + + K
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPNFEEQIAKQ 312
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
+ + E + E+ ++E+AEL+ N + VD + +R + W+DI +
Sbjct: 313 QRIIDQQEGAIEGFEKQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLA 372
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
+ P A + + +T+ L D+A
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402
Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
V +++ + NA R + + KR KK+ I+ T + L + +K+ + D
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDDED 462
Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +WFE+F WF +S YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V+ QVSKT +GE++ GSF+IRG + +F P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVPAKVAVGI 625
>gi|257388236|ref|YP_003178009.1| fibronectin-binding A domain-containing protein [Halomicrobium
mukohataei DSM 12286]
gi|257170543|gb|ACV48302.1| Fibronectin-binding A domain protein [Halomicrobium mukohataei DSM
12286]
Length = 708
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ--LAMGIS 368
D K +A+ V DQVSKT +GE+L G F IRG + +F + +A+GIS
Sbjct: 582 DGKYAGDAYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFRDLEVDVAVGIS 633
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/276 (21%), Positives = 114/276 (41%), Gaps = 55/276 (19%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF------STAE---SHKIDLKAVQQERDALKKLE 113
TP + ++ + FE+F A+DE+F TAE +D + E ++K E
Sbjct: 265 TPIPLEEYDDVESRAFETFTEALDEYFYEVEREDTAEEIADAGVDRPDFESE---IEKYE 321
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
+ + ++ + E ++EKAEL+ + VD + I+ W++IEA ++
Sbjct: 322 RIIQQQQSAIEDFESDAEAEREKAELLYARYDLVDEILSTIQGARTQDTPWDEIEATFEE 381
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
+ A ++ L+ + T+ L + R I D+ + NAD
Sbjct: 382 GKEQGIAAAEAVE--GLDGSEGTVTLSIDDVRVTI-DATMGVEKNADQL----------- 427
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK-----------TKQTL 282
Y AKR + K A Q I+ T + L++ E++ T++
Sbjct: 428 ---------YQAAKRIEEKKEGA----QAAIEDTREDLEAVERRRENWEAEDTTETQEQT 474
Query: 283 KDVQTMTNINKA-----RKVYWFEKFYWFISSENYL 313
+ + +++A R+ W+++F WF +S +L
Sbjct: 475 AEADDVDWLSRASIPVRRQEPWYDRFRWFRTSNGFL 510
>gi|393796641|ref|ZP_10380005.1| hypothetical protein CNitlB_10052 [Candidatus Nitrosoarchaeum
limnia BG20]
Length = 638
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW + +A+WV +QV K+AP+G+FL GSF I G++NF L + + +
Sbjct: 504 AWREGMYGVSAFWVNPEQVKKSAPSGQFLPKGSFTIEGQRNFVKISTLKLAVGLM 558
>gi|329766254|ref|ZP_08257812.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
SFB1]
gi|329137313|gb|EGG41591.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
SFB1]
Length = 590
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
AW + +A+WV +QV K+AP+G+FL GSF I G++NF L + + +
Sbjct: 456 AWREGMYGVSAFWVNPEQVKKSAPSGQFLPKGSFTIEGQRNFVKISTLKLAVGLM 510
>gi|448639710|ref|ZP_21676858.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
gi|445762237|gb|EMA13458.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
Length = 717
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F +RG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 85/185 (45%), Gaps = 16/185 (8%)
Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
V++EKAEL+ N + VD + ++ + +SW+DIEA+ + +A+ + L+
Sbjct: 347 VEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRG--IAAAEAVVSLD 404
Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYANAKR 248
+ T+ L + R + D++ NAD A E KK L+A N +
Sbjct: 405 GSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIENTRE 457
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
DL+ ++++ E + E + + T D +M +I W+E+F WF +
Sbjct: 458 --DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQSIPTRSTERWYEQFRWFHT 513
Query: 309 SENYL 313
S+ +L
Sbjct: 514 SDGFL 518
>gi|344211873|ref|YP_004796193.1| fibronectin-binding A domain-containing protein [Haloarcula
hispanica ATCC 33960]
gi|343783228|gb|AEM57205.1| fibronectin-binding A domain protein [Haloarcula hispanica ATCC
33960]
Length = 717
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F +RG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 85/185 (45%), Gaps = 16/185 (8%)
Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
V++EKAEL+ N + VD + ++ + +SW+DIEA+ + +A+ + L+
Sbjct: 347 VEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRG--IAAAEAVVSLD 404
Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYANAKR 248
+ T+ L + R + D++ NAD A E KK L+A N +
Sbjct: 405 GSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIENTRE 457
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
DL+ ++++ E + E + + T D +M +I W+E+F WF +
Sbjct: 458 --DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQSIPTRSTERWYEQFRWFHT 513
Query: 309 SENYL 313
S+ +L
Sbjct: 514 SDGFL 518
>gi|55377795|ref|YP_135645.1| hypothetical protein rrnAC0969 [Haloarcula marismortui ATCC 43049]
gi|55230520|gb|AAV45939.1| unknown [Haloarcula marismortui ATCC 43049]
Length = 717
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V DQVSKT +GE+L G F +RG + +F P +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 85/185 (45%), Gaps = 16/185 (8%)
Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
V++EKAEL+ N + VD + ++ + +SW+DIEA+ + +A+ + L+
Sbjct: 347 VEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRG--IAAAEAVVSLD 404
Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYANAKR 248
+ T+ L + R + D++ NAD A E KK L+A N +
Sbjct: 405 GSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIENTRE 457
Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
DL+ ++++ E + E + + T D +M +I W+E+F WF +
Sbjct: 458 --DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQSIPTRSTERWYEQFRWFHT 513
Query: 309 SENYL 313
S+ +L
Sbjct: 514 SDGFL 518
>gi|335438854|ref|ZP_08561586.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
gi|334890357|gb|EGM28628.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
Length = 707
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
D K + + V+ DQV+KT +GE+L GSF IRG++ ++ P +A+GI+
Sbjct: 580 DGKYAGDVYCVEHDQVTKTPESGEYLEKGSFAIRGERTYYDDTPVGVAVGIT 631
>gi|254167318|ref|ZP_04874170.1| conserved domain protein [Aciduliprofundum boonei T469]
gi|197623581|gb|EDY36144.1| conserved domain protein [Aciduliprofundum boonei T469]
Length = 589
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 51/137 (37%), Gaps = 61/137 (44%)
Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
I K R+ +WFEK+ WFISSE L DAK
Sbjct: 383 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 442
Query: 323 ---------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
++ AW WV QVSK +GE++ G++++
Sbjct: 443 KSEGKEIGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVV 502
Query: 352 RGKKNFF--PPCQLAMG 366
GK+N+ P QLA+G
Sbjct: 503 HGKRNYIHKVPLQLAVG 519
>gi|289596339|ref|YP_003483035.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
gi|289534126|gb|ADD08473.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
Length = 589
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 51/137 (37%), Gaps = 61/137 (44%)
Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
I K R+ +WFEK+ WFISSE L DAK
Sbjct: 383 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 442
Query: 323 ---------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
++ AW WV QVSK +GE++ G++++
Sbjct: 443 KSEGKEIGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVV 502
Query: 352 RGKKNFF--PPCQLAMG 366
GK+N+ P QLA+G
Sbjct: 503 HGKRNYIHKVPLQLAVG 519
>gi|254166596|ref|ZP_04873450.1| conserved domain protein [Aciduliprofundum boonei T469]
gi|197624206|gb|EDY36767.1| conserved domain protein [Aciduliprofundum boonei T469]
Length = 593
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 51/137 (37%), Gaps = 61/137 (44%)
Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
I K R+ +WFEK+ WFISSE L DAK
Sbjct: 387 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 446
Query: 323 ---------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
++ AW WV QVSK +GE++ G++++
Sbjct: 447 KSEGKEIGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVV 506
Query: 352 RGKKNFF--PPCQLAMG 366
GK+N+ P QLA+G
Sbjct: 507 HGKRNYIHKVPLQLAVG 523
>gi|325969240|ref|YP_004245432.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
gi|323708443|gb|ADY01930.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
Length = 668
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 31/47 (65%)
Query: 323 VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
V + ++V+ +QV+K AP+GE+L GSFMI G + + +L +GI
Sbjct: 528 VLDVFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWIRNVELRLGIGL 574
>gi|414878086|tpg|DAA55217.1| TPA: hypothetical protein ZEAMMB73_507954 [Zea mays]
Length = 522
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 350 MIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
MIRGKKNF PP L MG LF+L+ESS++
Sbjct: 1 MIRGKKNFLPPHPLVMGFGILFRLDESSLA 30
>gi|448303302|ref|ZP_21493251.1| fibronectin-binding A domain-containing protein [Natronorubrum
sulfidifaciens JCM 14089]
gi|445593087|gb|ELY47265.1| fibronectin-binding A domain-containing protein [Natronorubrum
sulfidifaciens JCM 14089]
Length = 716
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 57/268 (21%), Positives = 107/268 (39%), Gaps = 39/268 (14%)
Query: 62 STPSVMRQHQGFPIQEFESFAAAVDEFFSTAE-SHKIDLKAVQQERD---ALKKLENVKR 117
+TP + +H + ++SF AA+D++F E + + Q+ D + K E +
Sbjct: 272 ATPFPLEEHAELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIID 331
Query: 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRH 177
+ + EQ +E+AEL+ VD + I+ A W++IEAR ++
Sbjct: 332 QQQGAIEGFEQQADDLREQAELLYAEYGLVDDILSTIQDARAQDRPWDEIEARFEEGAER 391
Query: 178 NDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
A + + ++ + + + + +R ++ + HD + AD
Sbjct: 392 GIEAAEAV--VGIDSSEGIVTVDIDGDRIDL-----VAHDGVEQNAD------------- 431
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK-----SAEKKTKQTLKDVQT----- 287
Y AKR + K A + T + E A + A+ + + D T
Sbjct: 432 ---RLYTEAKRVAEKKEGALAAIEDTREDLEDAKRRRDEWDADDEGDEQADDEDTEETNW 488
Query: 288 --MTNINKARKVYWFEKFYWFISSENYL 313
M +I WF++F WF +S+ YL
Sbjct: 489 LEMPSIPIRENEPWFDRFRWFHTSDGYL 516
Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG++ + P +A+GI
Sbjct: 589 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGERTYHRDTPVGVAVGI 639
>gi|313126151|ref|YP_004036421.1| RNA-binding protein, snrnp like protein [Halogeometricum
borinquense DSM 11551]
gi|448285991|ref|ZP_21477228.1| RNA-binding protein, snrnp like protein [Halogeometricum
borinquense DSM 11551]
gi|312292516|gb|ADQ66976.1| predicted RNA-binding protein, snRNP like protein [Halogeometricum
borinquense DSM 11551]
gi|445575584|gb|ELY30057.1| RNA-binding protein, snrnp like protein [Halogeometricum
borinquense DSM 11551]
Length = 702
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/292 (18%), Positives = 116/292 (39%), Gaps = 84/292 (28%)
Query: 62 STPSVMRQHQ--GFPIQEFESFAAAVDEFF----STAE--------SHKIDLKAVQQERD 107
+TP +++H+ G + +++F A+DE+F +AE S++ D +A
Sbjct: 256 ATPFPLKEHEAEGLNSESYDTFNEALDEYFFAFDRSAEDEPEEEPGSNRPDFEA------ 309
Query: 108 ALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
++K + + E + EQ ++E+AEL+ N E VD + +R + W++I
Sbjct: 310 EIEKKKRIIEQQEGAIEGFEQQAERERERAELLYANYELVDEVLSTVRSARDESVPWDEI 369
Query: 168 EARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSE 227
++ P A + + ++ T+ + + R
Sbjct: 370 RQTLEDGAERGIPAAEAV--VDVDGAEGTVTIEIDGTR---------------------- 405
Query: 228 TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ- 286
++V++D+ NA R + K+ K ++ ++ +A + T++ L +V+
Sbjct: 406 ------IEVEVDMGVEKNADRLY--------KEAKRVEGKKEGAMAAIEDTREELAEVKA 451
Query: 287 -------------------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++I + W+E+F WF +S+ YL
Sbjct: 452 RRDAWEEDDEDDDEEPEEPEDIDWLSRSSIPLKTEEQWYEQFRWFHTSDGYL 503
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 27/40 (67%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
+ + +A+ V DQVSKT +GE++ GSF+IRG + +F
Sbjct: 575 EGRYADDAYMVTPDQVSKTPESGEYIEKGSFVIRGDRTYF 614
>gi|156938202|ref|YP_001435998.1| hypothetical protein Igni_1415 [Ignicoccus hospitalis KIN4/I]
gi|156567186|gb|ABU82591.1| protein of unknown function DUF814 [Ignicoccus hospitalis KIN4/I]
Length = 644
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 37/57 (64%), Gaps = 3/57 (5%)
Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISFL 370
AW + + + ++VK QVSK+ P G+++ G+F+I+GK+ + P +LA+G+ +
Sbjct: 492 AWKEGRGSVDVFYVKGSQVSKSPPPGQYVAKGAFIIKGKREYVRDVPLRLALGVEMV 548
>gi|294496348|ref|YP_003542841.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
5219]
gi|292667347|gb|ADE37196.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
5219]
Length = 662
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 33/47 (70%), Gaps = 2/47 (4%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGISF 369
+ +W+ +QV+KT +GE+L GSF+IRG++N++ P + A+G+
Sbjct: 542 DCYWIYPEQVTKTPESGEYLKKGSFIIRGERNYYRDVPVRAAVGLEL 588
>gi|327311796|ref|YP_004338693.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
gi|326948275|gb|AEA13381.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
Length = 623
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW A + + ++V QVSK P+GE+L GSFMI G KN+ +L + +
Sbjct: 485 AWRAGIHAVDVFYVPGRQVSKQPPSGEYLARGSFMIYGSKNYIRHVRLELAVG 537
>gi|387209294|gb|AFJ69115.1| hypothetical protein NGATSA_3044600, partial [Nannochloropsis
gaditana CCMP526]
Length = 106
Score = 48.1 bits (113), Expect = 0.007, Method: Composition-based stats.
Identities = 29/101 (28%), Positives = 57/101 (56%), Gaps = 1/101 (0%)
Query: 100 KAVQQERDALKKLE-NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
+AV+ + +A++ V+R++E RL LE T+ + A L+ + ++VD +L +R I
Sbjct: 3 QAVRAQEEAVRSRPLRVQRENEARLKELEATEARLLDAARLVECHSDAVDKVLLVLRSAI 62
Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
A W+ ++ +++ Q +P+A +I K + N +TL L
Sbjct: 63 ATGADWQTLDEYIRKEQAGGNPLARMITGSKWSDNKVTLSL 103
>gi|448385151|ref|ZP_21563730.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
DSM 11522]
gi|445657436|gb|ELZ10264.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
DSM 11522]
Length = 719
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V ADQVSKT +GE+L G F IRG + ++ P A+GI
Sbjct: 592 DGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 642
Score = 44.7 bits (104), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 56/270 (20%), Positives = 107/270 (39%), Gaps = 43/270 (15%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
TP + +H P + ++SF +A+D++F AE + D + + ++ + K E +
Sbjct: 274 TPFPLEEHADLPAEGYDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 333
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
+ + EQ +E+AEL+ VD + ++Q +W++I R ++
Sbjct: 334 QQGAIEGFEQEAEQLRERAELLYAEYGLVDEILSTVQQAREQDRAWDEIRERFEEGADRG 393
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSN-IKDSWVLHHDNADPFADSSETKKPSLVDVD 237
+A+ + ++ + T+ + + ER + D V NAD
Sbjct: 394 --IAAAEAVIDVDGSEGTVTVDLDGERIELVADRGV--EQNADRL--------------- 434
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL--------------K 283
Y AKR D K A + T + E A + ++ Q +
Sbjct: 435 -----YTEAKRVEDKKEGALAAIENTREDLEDAKRRRDEWEAQDAASDDEDEADDEGPKR 489
Query: 284 DVQTMTNINKARKVYWFEKFYWFISSENYL 313
D +I WF++F WF +S++YL
Sbjct: 490 DWLADPSIPIRENEPWFDRFRWFHTSDDYL 519
>gi|433590765|ref|YP_007280261.1| putative RNA-binding protein, snRNP like protein [Natrinema
pellirubrum DSM 15624]
gi|448331831|ref|ZP_21521081.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
15624]
gi|433305545|gb|AGB31357.1| putative RNA-binding protein, snRNP like protein [Natrinema
pellirubrum DSM 15624]
gi|445628400|gb|ELY81707.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
15624]
Length = 721
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V ADQVSKT +GE+L G F IRG + ++ P A+GI
Sbjct: 594 DGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 644
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 58/260 (22%), Positives = 110/260 (42%), Gaps = 23/260 (8%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
TP + +H P + ++SF +A+D++F AE + D + + ++ + K E +
Sbjct: 276 TPFPLEEHADLPAEGYDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 335
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
+ + EQ +E+AEL+ VD + ++ +W++I R ++
Sbjct: 336 QQGAIEGFEQEAEQLRERAELLYAEYGLVDEILSTVQGAREQDRAWDEIRERFEEGADRG 395
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSN-IKDSWVLHHDNADPF---ADSSETKKPSLV 234
+A+ + ++ + T+ + + ER + D V NAD A E KK
Sbjct: 396 --IAAAEAVIDVDGSEGTVTVDLDGERIELVADRGV--EQNADRLYTEAKRVEDKKEGA- 450
Query: 235 DVDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
L+A N + D KR + + K S ++ E + L D N+
Sbjct: 451 -----LAAIENTREDLEDAKRRRDEWEAKDAASDDEDEADDEGPNRDWLADPSIPIRENE 505
Query: 294 ARKVYWFEKFYWFISSENYL 313
WF++F WF +S++YL
Sbjct: 506 P----WFDRFRWFHTSDDYL 521
>gi|448409564|ref|ZP_21574778.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
gi|445672910|gb|ELZ25479.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
Length = 729
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 32/61 (52%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
D K + + V +DQVSKT +GEFL G F IRG + ++ ++ + + + E I
Sbjct: 602 DGKFAGDVYEVGSDQVSKTPESGEFLEKGGFAIRGDRTYYESTEVGVAVGITCEPETRVI 661
Query: 379 S 379
Sbjct: 662 G 662
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 53/267 (19%), Positives = 104/267 (38%), Gaps = 27/267 (10%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKLEN 114
TP + ++ + F+SF A+D++F+ A + + K E
Sbjct: 274 TPIPLAEYDDRYGEPFDSFNEALDDYFTFASDEDDEGGGDAAGGDRGRPDFESEIAKHER 333
Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
+ + + E ++ AE + N + VD + +++ A SW+DIE R +
Sbjct: 334 IIEQQQGAIEDFEAQAERERANAEALYANYDLVDDILSTVQEARAEDRSWDDIEERFAEG 393
Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKP 231
R P A + + L+ + T+ + + ER + S + NAD A E KK
Sbjct: 394 ARQGIPAAEAV--VSLDGSEGTVTIDIDGERVTLAASEGVEK-NADRLYREAKRIEGKKE 450
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM--- 288
+ A A+ +L+ +K + +A ++ D + +
Sbjct: 451 G--------AEEAIAQTRSELEAVEERKAEWEAADAGEAGSGGDESEGSDEDDDEPVDWL 502
Query: 289 --TNINKARKVYWFEKFYWFISSENYL 313
+I + +WFE + WF +S+ +L
Sbjct: 503 AEPSIPVRQSDHWFEDYRWFHTSDGFL 529
>gi|452206612|ref|YP_007486734.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
gi|452082712|emb|CCQ35980.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
Length = 703
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 35/56 (62%), Gaps = 2/56 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
D K + + V ADQVSKT +GE++ GSF+IRG + ++ +++G+S K E
Sbjct: 576 DGKFEGDVYEVDADQVSKTPESGEYVEKGSFVIRGDREYY--HDVSVGVSVGIKCE 629
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 52/278 (18%), Positives = 103/278 (37%), Gaps = 56/278 (20%)
Query: 62 STPSVMRQHQ--GFPIQEFESFAAAVDEFFS--TAESHKIDLKAVQQER----DALKKLE 113
+TP +R+H+ G+ FESF A+DE+F ES + +R ++K E
Sbjct: 257 ATPLPLREHETEGYDATAFESFNGAIDEYFYRLETESETEEQAGAGTDRPDFESEIEKYE 316
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
+ E + + ++ ++ KAE + N + +D +R + + W +IE +
Sbjct: 317 RIIEQQEGAIESYDEQADEEQRKAESLYGNYDLIDEICSTVRAAREDGVPWAEIEETFEA 376
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
A + + +T+ L
Sbjct: 377 GAERGIEAAEAVVSVDGAEGTVTVDL------------------------------GDGP 406
Query: 234 VDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTK----------- 279
+++D + NA R + + KR KK+ Q I+ T + L + E++ +
Sbjct: 407 IELDPTVGVERNADRLYTEAKRVRGKKEGAQAAIEDTREDLAAVERRREAWEAEDADEGE 466
Query: 280 ----QTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
D ++++ W E+F WF +S+ +L
Sbjct: 467 DEDDDAETDYLAVSSVPVRYDEKWHERFRWFRTSDGFL 504
>gi|448399812|ref|ZP_21571045.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
13563]
gi|445668265|gb|ELZ20895.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
13563]
Length = 722
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V ADQV+KT +GE+L G F IRG + ++ P A+GI
Sbjct: 595 DGRYAGDVYAVDADQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 645
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 54/259 (20%), Positives = 105/259 (40%), Gaps = 19/259 (7%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAE-SHKIDLKAVQQERD---ALKKLENVKRD 118
TP + +H + +E+F A+D++F E + + +Q D + K E +
Sbjct: 275 TPFPLEEHTDLDSEPYETFLEALDDYFFQLELGEDEEPEPTEQRPDFDSEIAKYERIIEQ 334
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
+ + EQ +E+AEL+ VD + I+ W++I R +
Sbjct: 335 QQGAIEGFEQEADALREQAELLYAEYGLVDEILSTIQDARVQDRPWDEIRERFEAGAEQG 394
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS----ETKKPSLV 234
A + + ++ + T+ + + ER ++ + NAD + E K+ +L
Sbjct: 395 IEAAEAV--VDVDGSEGTVTVDLDGERIDLVVEQGVEQ-NADRLYTEAKRVEEKKEGALA 451
Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
++ +AKR D A +++ T + E AE++ D +I
Sbjct: 452 AIEDTREDLEDAKRRRD--EWEATEREDTSEDGEDEADEAEQR------DWLAEPSIPIR 503
Query: 295 RKVYWFEKFYWFISSENYL 313
WF++F WF +S+ YL
Sbjct: 504 ENEPWFDRFRWFHTSDGYL 522
>gi|76801680|ref|YP_326688.1| hypothetical protein NP2070A [Natronomonas pharaonis DSM 2160]
gi|76557545|emb|CAI49126.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160]
Length = 699
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 56/282 (19%), Positives = 112/282 (39%), Gaps = 66/282 (23%)
Query: 62 STPSVMRQH--QGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------L 109
+TP + +H +G+ FE F A+D +F H++ +A + E D +
Sbjct: 256 ATPLPLEEHTAEGYDATAFEHFNGAIDAYF-----HRLQAEA-ETETDTGDDKPDFESEI 309
Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
K E + + + E+ V+++KAEL+ N + VD ++ + W++IE
Sbjct: 310 AKFERIIEQQQGAIEEYEKQAEVEQQKAELLYGNYDLVDEICSTVQSAREEGVPWDEIET 369
Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
++ A+ + + +T+ L D E
Sbjct: 370 TFEEGAERGIDAAAAVVGVDAAEGTVTIDL------------------------DDKE-- 403
Query: 230 KPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQ------ 280
+D+D + NA R + + KR KK+ Q I+ T + L+ +++ +Q
Sbjct: 404 ----IDLDPTMGVEKNADRLYQEAKRVRGKKEGAQAAIEDTREDLEDVKERRRQWEADDD 459
Query: 281 ---------TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
+D +M ++ W+E+F WF +S+++L
Sbjct: 460 EDDDADEESPDRDYLSMASVPVRYDEKWYEQFRWFRTSDDFL 501
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
D K + + V DQVSKT +GE++ G F+IRG +N++ Q+ + +
Sbjct: 573 DGKFEGDVYEVDPDQVSKTPESGEYIEKGGFVIRGDRNYYRDMQVGVAVG 622
>gi|448391228|ref|ZP_21566471.1| fibronectin-binding A domain-containing protein [Haloterrigena
salina JCM 13891]
gi|445666097|gb|ELZ18766.1| fibronectin-binding A domain-containing protein [Haloterrigena
salina JCM 13891]
Length = 723
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P +A+GI
Sbjct: 596 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVDVAVGI 646
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 69/327 (21%), Positives = 136/327 (41%), Gaps = 37/327 (11%)
Query: 3 SNEDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE-DYIVANMEFHPMILQPG 61
++EDV I+ L + + N N + YL + + SE D + + P +
Sbjct: 218 ADEDVYDRIYGAIERLALDLRNGNFDPRLYLADEGDDDNESESDENGGDGDSSPDRVV-D 276
Query: 62 STPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKK 111
+TP + +H + ++SF AA+D++F ++++L ++E D + K
Sbjct: 277 ATPFPLEEHVELASEPYDSFLAALDDYF-----YRLELAEDEEETDPTTQRPDFEEEIAK 331
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
E + + + EQ +E+AEL+ VD + +++ A WE+IE R
Sbjct: 332 YERIIEQQQGAIEGFEQEADALREQAELLYAEYGLVDDILSTVQEARAQDRPWEEIEERF 391
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSET 228
+ +A+ + ++ + T+ + + ER ++ + NAD A E
Sbjct: 392 AEGADRG--IAAAEAVVDVDGSEGTVTVELDGERIDLVAKQGVEQ-NADRLYTEAKRVEE 448
Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ 286
KK L+A + + DL + A++ + + + E E + L +
Sbjct: 449 KKEGA------LAAIEDTRE--DLAEAKARRDRWEEEDAAAEGDDDEDEDDDRDWLSEPS 500
Query: 287 TMTNINKARKVYWFEKFYWFISSENYL 313
N+ WF++F WF +S+ YL
Sbjct: 501 VPIRENEP----WFDRFRWFHTSDGYL 523
>gi|448329966|ref|ZP_21519260.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
10478]
gi|445613154|gb|ELY66864.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
10478]
Length = 720
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQVSKT +GE+L G F IRG + ++ P A+GI
Sbjct: 593 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 643
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 67/325 (20%), Positives = 139/325 (42%), Gaps = 36/325 (11%)
Query: 3 SNEDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE------DYIVANMEFHPM 56
++EDV +++ L + + N N + YL + E+ + E D V ++
Sbjct: 218 ADEDVYERLYETIERLALDIRNGNFDPRLYLERDDEEADDGEGESEDADANVVDV----- 272
Query: 57 ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKL 112
TP + +H + ++SF +A+D++F AE + D + + ++ + K
Sbjct: 273 ------TPFPLEEHDDLDGEAYDSFLSALDDYFFRLELAEEEESDPTDQRPDFESEIAKQ 326
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + + + EQ +E+AEL+ VD + I+ SW++I R +
Sbjct: 327 ERIIEQQQGAIEGFEQEAEELREQAELLYAEYGLVDDILSTIQGAREQDRSWDEIRERFE 386
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
+ A + + ++ + T+ + + ER + + NAD A E K
Sbjct: 387 EGAEQGIDAAEAV--VDVDGSDGTVTVDIDGERIGLVAGRGVEQ-NADRLYTEAKRVEEK 443
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQ-KTIQSTEKALKSAEKKTKQTLKDVQTM 288
K L+A N + DL+ + ++ + + +S A +++ ++T +D +
Sbjct: 444 KEGA------LAAIENTRE--DLEEAKRRRDEWEADESGPAAETESDEDEEETQRDWLSE 495
Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
+I WF++F WF +S++YL
Sbjct: 496 PSIPIRENEPWFDRFRWFQTSDDYL 520
>gi|352682802|ref|YP_004893326.1| putative RNA-binding protein [Thermoproteus tenax Kra 1]
gi|350275601|emb|CCC82248.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
[Thermoproteus tenax Kra 1]
Length = 624
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW A + + ++V QVSK P+GE+L GSFMI G +N+ +L + I
Sbjct: 486 AWKAGIHSIDVFYVFGSQVSKQPPSGEYLARGSFMIYGTRNYIRHVRLELAIG 538
>gi|448313587|ref|ZP_21503301.1| fibronectin-binding A domain-containing protein [Natronolimnobius
innermongolicus JCM 12255]
gi|445597955|gb|ELY52026.1| fibronectin-binding A domain-containing protein [Natronolimnobius
innermongolicus JCM 12255]
Length = 723
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P +A+GI
Sbjct: 596 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 646
>gi|284166116|ref|YP_003404395.1| fibronectin-binding A domain-containing protein [Haloterrigena
turkmenica DSM 5511]
gi|284015771|gb|ADB61722.1| Fibronectin-binding A domain protein [Haloterrigena turkmenica DSM
5511]
Length = 723
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P +A+GI
Sbjct: 596 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVDVAVGI 646
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 53/280 (18%), Positives = 107/280 (38%), Gaps = 60/280 (21%)
Query: 62 STPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKK 111
+TP + +H + ++SF AA+D++F ++++L ++E D + K
Sbjct: 276 ATPFPLEEHVELASEPYDSFLAALDDYF-----YRLELADDEEETDPTTQRPDFEEEIAK 330
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
E + + EQ +E+AEL+ VD + +++ A W++IE R
Sbjct: 331 YERIIEQQRGAIEGFEQEADALREQAELLYAEYGLVDDILSTVQEARAQDRPWDEIEERF 390
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
+ +A+ + ++ + T+ + + ER ++ + NAD
Sbjct: 391 AEGADRG--IAAAEAVVNVDGSEGTVTVELDGERIDLVAKQGVEQ-NADRL--------- 438
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ----------- 280
Y AKR + K A I+ T + L A+ + +
Sbjct: 439 -----------YTEAKRVGEKKEGAL----AAIEDTREDLGEAKARRDRWEEADAADEGE 483
Query: 281 -------TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
+D + ++ WF++F WF +S+ YL
Sbjct: 484 DDEDDEGEERDWLSEPSVPIRENEPWFDRFRWFHTSDGYL 523
>gi|336253827|ref|YP_004596934.1| Fibronectin-binding A domain-containing protein [Halopiger
xanaduensis SH-6]
gi|335337816|gb|AEH37055.1| Fibronectin-binding A domain protein [Halopiger xanaduensis SH-6]
Length = 718
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P +A+GI
Sbjct: 591 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 641
Score = 44.7 bits (104), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 55/274 (20%), Positives = 109/274 (39%), Gaps = 50/274 (18%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
TP + +H+ G + ++SF A+D++F E + ++R + + K E +
Sbjct: 272 TPFPLEEHEAEGLVGEAYDSFLTALDDYFFRLELEDEEEPDPTEQRPDFEEEIAKHERII 331
Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
+ + EQ +EKAEL+ VD + IR W++IE R ++ +
Sbjct: 332 EQQQGAIEGFEQEAEQLREKAELLYARYGLVDDILSTIRNAREQDRPWDEIEERFEEGKE 391
Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDV 236
A + + ++ + + + + ER +++ + NAD
Sbjct: 392 RGIEAAEAV--VGIDGSEGIVTVDIDGERIDLEARQGVEQ-NADRL-------------- 434
Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ---------------- 280
Y AKR + K A I+ T + L+ A+++ +Q
Sbjct: 435 ------YTEAKRVEEKKEGAL----AAIEDTREDLEEAKRRREQWEAEDAGEDDADDEDE 484
Query: 281 -TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
KD +M ++ W+++F WF +S++YL
Sbjct: 485 GEDKDWLSMPSVPIRENEPWYDRFRWFHTSDDYL 518
>gi|124485365|ref|YP_001029981.1| hypothetical protein Mlab_0540 [Methanocorpusculum labreanum Z]
gi|124362906|gb|ABN06714.1| protein of unknown function DUF814 [Methanocorpusculum labreanum Z]
Length = 642
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 77/364 (21%), Positives = 137/364 (37%), Gaps = 116/364 (31%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
PS++R+ P + F +F+ A++ F+ + K V +++ L K E +++ E +
Sbjct: 248 PSLVRE----PKEHFATFSQALEAFYPKPVAEK-----VIEQKIKLSKEERIRKQQEAAV 298
Query: 124 SALEQTQLVDKEKAELIINN----QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
++ E +E+I ++ QE++D A + +LSW+DI A +K++ +
Sbjct: 299 VNFDKKIAEATEISEIIYSHYGEVQETIDVLAAA-----SQKLSWQDIAAVIKKS---DL 350
Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD 239
P A I + NA D E K V + +
Sbjct: 351 PAAKRIISV--------------------------DPKNASVVIDLQEKHK---VTIFVH 381
Query: 240 LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW 299
S AN R+F + + K+ +++ E + AEKK + K W
Sbjct: 382 ESLEANVGRYFAVVKKFRAKKAGALRAMEAGIVHAEKKKAAGPGRL----------KPKW 431
Query: 300 FEKFYWFISSENYL------------------------------GNVAWDAKVVT----- 324
+ +F W +S+ L G A K VT
Sbjct: 432 YHRFRWMETSDGVLVIGGRNADQNEELVKKYMEGKDTFLHADVFGASAVIVKGVTERMDQ 491
Query: 325 ---------NAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQL 363
AW +QVSKT +GE++ GSF+IRG++ + P ++
Sbjct: 492 AVQFAASYSRAWAGGGASVDVIAASPNQVSKTPESGEYVAHGSFVIRGERKIYKDVPLEI 551
Query: 364 AMGI 367
A+G+
Sbjct: 552 AIGV 555
>gi|269986196|gb|EEZ92508.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
acidiphilum ARMAN-4]
Length = 587
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW + +++K +QV+KT P+GE L G+F I GK+++ QL + IS
Sbjct: 463 AWRVGATNLDVYFIKPEQVTKTPPSGESLKKGAFYIEGKRDYIKNAQLGIYIS 515
>gi|383621605|ref|ZP_09948011.1| Fibronectin-binding A domain-containing protein [Halobiforma
lacisalsi AJ5]
gi|448702236|ref|ZP_21699890.1| Fibronectin-binding A domain-containing protein [Halobiforma
lacisalsi AJ5]
gi|445777606|gb|EMA28567.1| Fibronectin-binding A domain-containing protein [Halobiforma
lacisalsi AJ5]
Length = 718
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P +A+GI
Sbjct: 591 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGVAVGI 641
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 55/278 (19%), Positives = 110/278 (39%), Gaps = 57/278 (20%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
TP + +H+ G + ++SF A+D++F E + + ++R + + K + +
Sbjct: 271 TPFPLEEHEAEGLSAEAYDSFLTALDDYFFRLELDEEEEPDPTEQRPDFEEEIAKHQRII 330
Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
E + EQ +E+AE + VD + IRQ W++IE R ++ +
Sbjct: 331 EQQEGAIEGFEQQADELREQAESLYAEYGLVDEVLSTIRQARKQDRPWDEIEQRFEEGKE 390
Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDV 236
A + + L+ + T+ + V ER +D+
Sbjct: 391 RGIEAAETV--VDLDGSEGTVTVEVDGER----------------------------IDL 420
Query: 237 DLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQ------------- 280
+D NA R + + KR KK+ I+ T + L+ A+++ Q
Sbjct: 421 VVDDGVEQNADRLYTEAKRVEEKKEGALAAIEDTREDLEDAKRRRDQWEAEDAAEDDEDD 480
Query: 281 -----TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
++ +M ++ W+++F WF +S+ YL
Sbjct: 481 DDEDEEERNWLSMPSVPIRENEPWYDRFRWFHTSDGYL 518
>gi|448300325|ref|ZP_21490327.1| fibronectin-binding A domain-containing protein [Natronorubrum
tibetense GA33]
gi|445586054|gb|ELY40340.1| fibronectin-binding A domain-containing protein [Natronorubrum
tibetense GA33]
Length = 726
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P +A+GI
Sbjct: 599 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 649
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 64/324 (19%), Positives = 136/324 (41%), Gaps = 28/324 (8%)
Query: 3 SNEDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDY----IVANMEFHPMIL 58
+ EDV +++ L + + N N + Y + E +SED N + P +
Sbjct: 218 AGEDVYDRLYEAIERLALDLRNSNFDPRLYFADEGEDGDDSEDGGENETGDNGDSSPERV 277
Query: 59 QPGSTPSVMRQHQGFPIQEFESFAAAVDEFF------STAESHKIDLKAVQQERDALKKL 112
+TP + +H P + ++++ +A+D++F E D + +E + K
Sbjct: 278 V-DATPFPLEEHVELPSEPYDTYLSALDDYFFRLELEEEGEPDPTDQRPDFEEE--IAKQ 334
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + + + EQ + +E+AE + VD + I++ A W++IE R +
Sbjct: 335 ERIIEQQQGAIEGFEQEADMLREQAESLYAEYGLVDDILSTIQEARAQDRPWDEIEERFE 394
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
A + + ++ + + + V E +++ + + NAD A + E K
Sbjct: 395 AGAEQGIEAAEAV--IDVDGSEGVVTVDVDGEYIDLETTQGVEQ-NADRLYTEAKAVEDK 451
Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
K LSA N ++ DL+ A +++ + + + ++ +D +M
Sbjct: 452 KEGA------LSAIENTRK--DLQ-EAKRRRDQWEADDGEDEGDDADEEEREDRDWLSMP 502
Query: 290 NINKARKVYWFEKFYWFISSENYL 313
++ W+++F WF +S+ YL
Sbjct: 503 SVPVRENEPWYDRFRWFYTSDGYL 526
>gi|119872023|ref|YP_930030.1| hypothetical protein Pisl_0509 [Pyrobaculum islandicum DSM 4184]
gi|119673431|gb|ABL87687.1| protein of unknown function DUF814 [Pyrobaculum islandicum DSM
4184]
Length = 613
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 79/203 (38%), Gaps = 75/203 (36%)
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKA----------LKSAEKKTKQT 281
++V+++ D+ R F+ + +K K Q EK + AE+K K +
Sbjct: 335 TVVELEKDIPVGRQISRMFEHAKELEEKANKAAQVVEKLRRDLAKLEEEIHRAEEKVKSS 394
Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISS-----------------------ENYL---GN 315
+K V + +A WFEKF W I++ E+YL +
Sbjct: 395 VKIV-----VKRA----WFEKFRWSITTGKRPIIGGRDASQNETIVRKYLREHYLFFHAD 445
Query: 316 VAWDAKVV--------------------TNAW----------WVKADQVSKTAPTGEFLT 345
+ + VV + AW +V+ +QVSK AP G++L
Sbjct: 446 IPGASVVVMPPSEDPLELLQTAQFAAAYSKAWKIGIHSIDVYYVRGEQVSKHAPAGQYLA 505
Query: 346 TGSFMIRGKKNFFPPCQLAMGIS 368
GSFMI GK+ + +L + +
Sbjct: 506 RGSFMIYGKREYIRHVRLELAVG 528
>gi|88601740|ref|YP_501918.1| hypothetical protein Mhun_0437 [Methanospirillum hungatei JF-1]
gi|88187202|gb|ABD40199.1| protein of unknown function DUF814 [Methanospirillum hungatei JF-1]
Length = 627
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 37/54 (68%), Gaps = 3/54 (5%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
AW A + + + + DQVSKTA +GE+L+ GSF++RG++ +F P ++ +G+
Sbjct: 495 AWRAGFASADVYAARPDQVSKTAESGEYLSRGSFVVRGERQWFHDVPLEVVIGL 548
>gi|448578556|ref|ZP_21643976.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
gi|445725734|gb|ELZ77354.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
Length = 702
Score = 45.8 bits (107), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 55/289 (19%), Positives = 109/289 (37%), Gaps = 81/289 (28%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE---------RDALKK 111
TP +R+HQ G + ++SF A+DE+F ++DL A +Q+ + K
Sbjct: 258 TPFPLREHQNDGLDEEAYDSFNEALDEYF-----FRLDLTADEQQDVGSNRPDFEAQIAK 312
Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
E + E + ++ ++E+AEL+ N + VD + +R + W+DI ++
Sbjct: 313 QERIIEQQEGAIEGFDERAAAERERAELLYANYDLVDDVLSTVRDAREEGVPWDDIAEKL 372
Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
P A + + +T+ L
Sbjct: 373 DAGAEQGIPAAEAVTNVDGAEGTVTIEL------------------------------DD 402
Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ----- 286
S + +D+ + NA R + + K IQ ++ +A + T++ L++V+
Sbjct: 403 STITLDVSMGVEKNADRLY--------TEAKRIQEKKEGALAAIEDTREELEEVKRRRDE 454
Query: 287 ----------------------TMTNINKARKVYWFEKFYWFISSENYL 313
++ ++ YW+E+F WF +S+ YL
Sbjct: 455 WEADDDEDDAEDEEEQEETDWLSLQSVPVKSTDYWYEQFRWFHTSDGYL 503
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + +A+ V+ QVSKT +GE++ GSF+IRG + ++ ++A+GI
Sbjct: 575 DGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRTYYNDVAAEVAVGI 625
>gi|159040762|ref|YP_001540014.1| hypothetical protein Cmaq_0175 [Caldivirga maquilingensis IC-167]
gi|157919597|gb|ABW01024.1| protein of unknown function DUF814 [Caldivirga maquilingensis
IC-167]
Length = 650
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 30/45 (66%)
Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
+ ++VK +QVSK AP+GE+L+ G FMI G++ + +L + +
Sbjct: 513 DVYYVKGEQVSKKAPSGEYLSKGGFMIYGQRGWVRGVELIISVGL 557
>gi|448414286|ref|ZP_21577425.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
14848]
gi|445682579|gb|ELZ34996.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
14848]
Length = 701
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ--LAMGI 367
+ + A+ V DQVSKT +GE+L GSF+IRG + +F + +A+G+
Sbjct: 575 EGRYAGEAYMVTPDQVSKTPESGEYLEKGSFVIRGDRTYFRDVEAKVAVGV 625
Score = 38.5 bits (88), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 55/284 (19%), Positives = 118/284 (41%), Gaps = 68/284 (23%)
Query: 62 STPSVMRQHQ--GFPIQEFESFAAAVDEFF----STAE--------SHKIDLKAVQQERD 107
+TP + +H+ G + ++F A+DE+F TAE S++ D +A
Sbjct: 256 ATPFPLEEHEAEGLNSESHDTFNDALDEYFFRLDRTAEDEPDEEPGSNRPDFEA------ 309
Query: 108 ALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
++K + + + E + E+ ++E+AEL+ + + VD + +R + W+DI
Sbjct: 310 EIEKKKRIIQQQEGAIEGFEEQAQEERERAELLYAHYDLVDEVLTTVRDAREENVPWDDI 369
Query: 168 EARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSE 227
R+++ P A + + +T+ L
Sbjct: 370 RQRLEEGAERGIPAAESVVDVDGAEGTVTVEL---------------------------- 401
Query: 228 TKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKD 284
+ + ++V +D NA R + + KR KK+ ++ T + L A+++ ++ ++
Sbjct: 402 --EDTRIEVVVDTGVEKNADRLYTEAKRVEGKKEGALAAVEDTREELAEAKRRREEWEEE 459
Query: 285 VQ---------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ + ++I + +WFE+F WF +S+ YL
Sbjct: 460 DEDDEEEDEEPEDIDWLSRSSIPLRTEEHWFERFRWFHTSDGYL 503
>gi|448377770|ref|ZP_21560466.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
14624]
gi|445655714|gb|ELZ08559.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
14624]
Length = 736
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V ADQV+KT +GE+L G F IRG + + P +A+GI
Sbjct: 609 DGRYAGDVYAVDADQVTKTPESGEYLEKGGFAIRGDRTYHRDTPVDVAVGI 659
>gi|289580546|ref|YP_003479012.1| fibronectin-binding A domain-containing protein [Natrialba magadii
ATCC 43099]
gi|448284209|ref|ZP_21475471.1| fibronectin-binding A domain-containing protein [Natrialba magadii
ATCC 43099]
gi|289530099|gb|ADD04450.1| Fibronectin-binding A domain protein [Natrialba magadii ATCC 43099]
gi|445571291|gb|ELY25845.1| fibronectin-binding A domain-containing protein [Natrialba magadii
ATCC 43099]
Length = 727
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P A+GI
Sbjct: 600 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 650
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 53/258 (20%), Positives = 111/258 (43%), Gaps = 16/258 (6%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
TP + +H GF + +++F A+D++F E + +R + + K E +
Sbjct: 279 TPFPLEEHDEAGFDGEPYDTFLDALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHERII 338
Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
+ + EQ +E AEL+ N VD + I++ A W++IEAR +
Sbjct: 339 EQQQGAIEGFEQEAENLRENAELLYANYGLVDDILSTIQEARAQDRPWDEIEARFEDGAE 398
Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD- 235
A + ++++ ++ V + I+ ++ HD + AD T+ + +
Sbjct: 399 QGIEAAEAV----IDVDGSEGIVTVDVDGEYIE---LVAHDGVEQNADRLYTEAKRVAEK 451
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKAR 295
+ L+A + + DLK + ++ + Q + ++ + +D ++I
Sbjct: 452 KEGALAAIEDTRE--DLKDAKRRRDEWEEQDGKPGAGDEDEDDEDDDRDWLAESSIPIRE 509
Query: 296 KVYWFEKFYWFISSENYL 313
WF++F WF +S+ YL
Sbjct: 510 NEPWFDRFRWFHTSDGYL 527
>gi|448317278|ref|ZP_21506835.1| fibronectin-binding A domain-containing protein [Natronococcus
jeotgali DSM 18795]
gi|445604315|gb|ELY58265.1| fibronectin-binding A domain-containing protein [Natronococcus
jeotgali DSM 18795]
Length = 717
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P A+GI
Sbjct: 590 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 640
>gi|435848081|ref|YP_007310331.1| putative RNA-binding protein, snRNP like protein [Natronococcus
occultus SP4]
gi|433674349|gb|AGB38541.1| putative RNA-binding protein, snRNP like protein [Natronococcus
occultus SP4]
Length = 712
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F IRG + ++ P A+GI
Sbjct: 585 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 635
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 55/273 (20%), Positives = 104/273 (38%), Gaps = 48/273 (17%)
Query: 62 STPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENV 115
+TP + +H+ G + +SF A+D++F E + ++R + + K E +
Sbjct: 267 ATPFPLEEHEDEGLTAEAADSFLGALDDYFFELELEDEEEPDPTEQRPDFEEEIAKHERI 326
Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
+ + EQ +E AEL+ E VD + I++ W++IE R ++ +
Sbjct: 327 IEQQQGAIEGFEQQAQAQRENAELLYARYELVDDILSTIQEARTQDRPWDEIEERFEEGK 386
Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
A + + +T+ L +E + D V NAD
Sbjct: 387 ERGIEAAEAVVGVDGTEGIVTVELD-GEEIDLVADDGV--EQNADRL------------- 430
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ--------------- 280
Y AKR + K A I+ T + L+ A+++ +
Sbjct: 431 -------YTEAKRIEEKKEGAL----AAIEDTREDLEDAKRRRDEWEATDDHEDDDDEED 479
Query: 281 TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
K+ M ++ W+++F WF +S+ YL
Sbjct: 480 EEKNWLEMASVPIRENEPWYDRFRWFHTSDGYL 512
>gi|154150873|ref|YP_001404491.1| hypothetical protein Mboo_1330 [Methanoregula boonei 6A8]
gi|153999425|gb|ABS55848.1| protein of unknown function DUF814 [Methanoregula boonei 6A8]
Length = 631
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 69/373 (18%), Positives = 133/373 (35%), Gaps = 104/373 (27%)
Query: 54 HPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
HP+I + P V+ ++ +F F+ A++ F+ ++ K+ + A R L + E
Sbjct: 232 HPVISKSSCEPVVLAENAPQDENQFAGFSDALEVFYPMTKAEKVKVAA----RPKLSEGE 287
Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
+++ E + ++ +E I N + I ++ + +LSW++IE +K
Sbjct: 288 RIRKYQEAAIKKFDEKVAKAEEVVAAIYENYPFISQVITSLAA-ASKRLSWQEIEHHLKD 346
Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
+ IT P + E
Sbjct: 347 TS-------------STDAKRITAFF---------------------PGEAAVEVDIGKK 372
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
V + + + NA ++D + KK++ + A+K+ + + K D+ M +
Sbjct: 373 VKIFVHETVEQNAGHYYDQIKKFKKKKEGALL----AMKTVKPRKKVIRHDIVPMKKL-- 426
Query: 294 ARKVYWFEKFYWFISSENY--LGN------------------------------------ 315
W+ +F WFI+S+ LG
Sbjct: 427 -----WYHRFRWFITSDGVVVLGGRDAGQNEELVKKYMTGGDLFVHADVHGASVVIVKGK 481
Query: 316 ------VAWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
VA A + AW + QVSKT GEF+ GSF++RG++ ++
Sbjct: 482 TEKMDEVAQFAASYSGAWRSGHFTADVFSAQPTQVSKTPQAGEFVARGSFIVRGERTYYR 541
Query: 360 PCQLAMGISFLFK 372
L++GI + +
Sbjct: 542 DVPLSVGIGLVLE 554
>gi|116754828|ref|YP_843946.1| hypothetical protein Mthe_1534 [Methanosaeta thermophila PT]
gi|116666279|gb|ABK15306.1| protein of unknown function DUF814 [Methanosaeta thermophila PT]
Length = 641
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 51/260 (19%), Positives = 103/260 (39%), Gaps = 59/260 (22%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
P + ++G + FE F+ A+DEFF AE L A+++ + + + R ET+L
Sbjct: 249 PFPLEVYKGLEARSFERFSDALDEFF-VAEPEMPKLSALERRLELQRAAIDELRAKETQL 307
Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
+++ + I +D+ + AI LS+ DI R++ + +
Sbjct: 308 ASM----------GDFIYQRYSEIDSILKAIAGARERGLSYTDIWERIQSSGKSAVKSLD 357
Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
++ + I+ +TL +++ L+
Sbjct: 358 YSGEMIVEIDGVTL-------------------------------------ELNAGLTVP 380
Query: 244 ANAKRFFD---LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
NA R+++ A ++ ++ TE L+ E++ + + + + K WF
Sbjct: 381 QNAGRYYERAKEAAKKAAGAEEALRRTEDLLQRGEERRRSPV--------LKRRHKPRWF 432
Query: 301 EKFYWFISSENYLGNVAWDA 320
E+F WF SS+++L DA
Sbjct: 433 ERFRWFYSSDDFLVIGGRDA 452
>gi|448361523|ref|ZP_21550140.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
DSM 12278]
gi|445650542|gb|ELZ03465.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
DSM 12278]
Length = 720
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F +RG + ++ P A+GI
Sbjct: 593 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 643
>gi|448348947|ref|ZP_21537792.1| fibronectin-binding A domain-containing protein [Natrialba
taiwanensis DSM 12281]
gi|445641664|gb|ELY94739.1| fibronectin-binding A domain-containing protein [Natrialba
taiwanensis DSM 12281]
Length = 720
Score = 45.4 bits (106), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F +RG + ++ P A+GI
Sbjct: 593 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 643
>gi|171186042|ref|YP_001794961.1| hypothetical protein Tneu_1592 [Pyrobaculum neutrophilum V24Sta]
gi|170935254|gb|ACB40515.1| protein of unknown function DUF814 [Pyrobaculum neutrophilum
V24Sta]
Length = 613
Score = 45.4 bits (106), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW + + ++V+ +QV+K AP G++L GSFMI GK+ + +L + +
Sbjct: 476 AWKIGIHAVDVYYVRGEQVTKQAPAGQYLARGSFMIYGKREYVRNVRLELAVG 528
>gi|15790499|ref|NP_280323.1| hypothetical protein VNG1508C [Halobacterium sp. NRC-1]
gi|169236235|ref|YP_001689435.1| hypothetical protein OE3153R [Halobacterium salinarum R1]
gi|10580999|gb|AAG19803.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
gi|167727301|emb|CAP14087.1| conserved hypothetical protein [Halobacterium salinarum R1]
Length = 703
Score = 45.1 bits (105), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGIS 368
D + +A+ V DQVSKT +GE+L G F +RG + +F + A+GI+
Sbjct: 576 DGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAVRGDRTYFRDLSTEWAVGIT 627
>gi|448353444|ref|ZP_21542220.1| fibronectin-binding A domain-containing protein [Natrialba
hulunbeirensis JCM 10989]
gi|445640304|gb|ELY93393.1| fibronectin-binding A domain-containing protein [Natrialba
hulunbeirensis JCM 10989]
Length = 736
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F +RG + ++ P A+GI
Sbjct: 609 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 659
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 55/259 (21%), Positives = 111/259 (42%), Gaps = 15/259 (5%)
Query: 63 TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
TP + +H GF + +++F A+D++F E + +R + + K E +
Sbjct: 285 TPFPLEEHDEAGFDGEPYDTFLDALDDYFFHLELEDEEEPDPTSQRPDFEEEIAKHERII 344
Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
+ + EQ +E AEL+ N VD + I++ A WE IEAR ++
Sbjct: 345 EQQQGAIEGFEQEAENLRENAELLYANYGLVDDILSTIQEARAQDRPWEAIEARFEEGAE 404
Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD- 235
A + ++++ ++ V + I+ ++ HD + AD T+ + +
Sbjct: 405 QGIEAAEAV----IDVDGSEGIVTVDVDGEYIE---LVAHDGVEQNADRLYTEAKRVAEK 457
Query: 236 VDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
+ L+A + + D KR + ++ +S + E + + +D ++I
Sbjct: 458 KEGALAAIEDTREDLEDAKRRRDEWEESDGESGAGSGGGDEDEGEDEDRDWLAESSIPIR 517
Query: 295 RKVYWFEKFYWFISSENYL 313
WF++F WF +S+ YL
Sbjct: 518 ENEPWFDRFRWFHTSDGYL 536
>gi|448368844|ref|ZP_21555611.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
DSM 13077]
gi|445651387|gb|ELZ04295.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
DSM 13077]
Length = 722
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F +RG + ++ P A+GI
Sbjct: 595 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 645
>gi|448359396|ref|ZP_21548054.1| fibronectin-binding A domain-containing protein [Natrialba
chahannaoensis JCM 10990]
gi|445643534|gb|ELY96581.1| fibronectin-binding A domain-containing protein [Natrialba
chahannaoensis JCM 10990]
Length = 727
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V +DQV+KT +GE+L G F +RG + ++ P A+GI
Sbjct: 600 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 650
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 55/266 (20%), Positives = 110/266 (41%), Gaps = 30/266 (11%)
Query: 62 STPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENV 115
+TP + +H G + +++F A+D++F E + +R + + K E +
Sbjct: 278 ATPFPLEEHDESGLDGEPYDTFLNALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHERI 337
Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
+ + EQ +E AEL+ N VD + I++ A W+DIEAR ++
Sbjct: 338 IEQQQGAIEGFEQEAENLRENAELLYANYGLVDDILSTIQEARAQDRPWDDIEARFEEGA 397
Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS--------SE 227
A + ++++ ++ V I+ ++ HD + AD +E
Sbjct: 398 EQGIEAAEAV----IDVDGSEGIVTVDVNGEYIE---LVAHDGVEQNADRLYTEAKRVAE 450
Query: 228 TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQT 287
K+ +LV ++ +AKR ++ + Q E ++ + +D
Sbjct: 451 KKEGALVAIEDTREDLEDAKR---------RRDEWEEQDGEPGAGEEDEDDEDDDRDWLA 501
Query: 288 MTNINKARKVYWFEKFYWFISSENYL 313
++I WF++F WF +S+ YL
Sbjct: 502 ESSIPIRENEPWFDRFRWFHTSDGYL 527
>gi|448306550|ref|ZP_21496454.1| fibronectin-binding A domain-containing protein [Natronorubrum
bangense JCM 10635]
gi|445597848|gb|ELY51920.1| fibronectin-binding A domain-containing protein [Natronorubrum
bangense JCM 10635]
Length = 710
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 56/272 (20%), Positives = 108/272 (39%), Gaps = 47/272 (17%)
Query: 62 STPSVMRQHQGFPIQEFESFAAAVDEFFSTAE-SHKIDLKAVQQERD---ALKKLENVKR 117
+TP + +H + ++SF AA+D++F E + + Q+ D + K E +
Sbjct: 266 ATPFPLEEHAELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIID 325
Query: 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRH 177
+ + EQ +E+AEL+ VD + I+ A W++IEAR ++
Sbjct: 326 QQQGAIEGFEQQADELREQAELLYAEYGLVDDILSTIQDARAQDRPWDEIEARFEEGAER 385
Query: 178 NDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
A + + ++ + + + + +R ++ + HD + AD
Sbjct: 386 GIEAAEAV--VGIDSSEGIVTVDIDGDRIDL-----VAHDGVEQNAD------------- 425
Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK----------------TKQT 281
Y AKR + K A I+ T + L+ A+++ +
Sbjct: 426 ---RLYTEAKRVAEKKAGAL----AAIEDTREDLEDAKRRRDEWDADDEGDEEADDEEAE 478
Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
K+ M +I WF++F WF +S+ YL
Sbjct: 479 EKNWLEMPSIPIRENEPWFDRFRWFHTSDGYL 510
>gi|433638964|ref|YP_007284724.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
XH-70]
gi|433290768|gb|AGB16591.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
XH-70]
Length = 847
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
D + + + V ADQV+KT +GE+L G F IRG + + P +A+GI
Sbjct: 720 DGRYAGDVYSVDADQVTKTPESGEYLEKGGFAIRGDRTYHRDTPVGVAVGI 770
>gi|18313944|ref|NP_560611.1| hypothetical protein PAE3259 [Pyrobaculum aerophilum str. IM2]
gi|18161516|gb|AAL64793.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
Length = 614
Score = 45.1 bits (105), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 42/178 (23%), Positives = 68/178 (38%), Gaps = 57/178 (32%)
Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEK-KTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
+ FD + +K QK Q EK K K +Q + + +++ A K WFEKF+W
Sbjct: 352 QLFDEAKELEEKAQKAAQVLEKLKKDLSKLDEEQRRAEEKLKSSVKIATKKSWFEKFHWT 411
Query: 307 ISS-----------------------ENYLG-----------------------NVAWDA 320
+++ E+YL +A A
Sbjct: 412 VTTGRKPVIGGRDASQNEVVVKKYLKEHYLFFHADIPGASAVVAPPSEDPLELLQIAQFA 471
Query: 321 KVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
+ AW +VK QV+K P+G++L GSFMI GK+ + +L + +
Sbjct: 472 AAYSKAWKIGIHAVDVYYVKGVQVTKQPPSGQYLARGSFMIYGKREYVRNVRLELAVG 529
>gi|290559894|gb|EFD93216.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
acidophilus ARMAN-5]
Length = 587
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
AW A + ++V +QV+KT P+GE L G+F I GK+ + L + +SF
Sbjct: 463 AWRAGATNLDVYFVNPEQVTKTPPSGESLKKGAFYIEGKRKYIKNSSLGIYLSF 516
>gi|300711181|ref|YP_003736995.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
gi|448296718|ref|ZP_21486771.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
gi|299124864|gb|ADJ15203.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
gi|445580850|gb|ELY35220.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
Length = 694
Score = 44.7 bits (104), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 112/295 (37%), Gaps = 65/295 (22%)
Query: 48 VANMEFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDL 99
V F P I + TP + +H ++ F A+D++F T+E + D
Sbjct: 237 VTEKRFDPRIYRDDGQIVDVTPIALDEHAALEGDSYDRFNEALDDYFFELDTSEDEETDT 296
Query: 100 KAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
E +++ + + E + EQ ++E+AEL+ N ++VD + +R +
Sbjct: 297 SPEFDEE--IERKKRIIDQQEGAIEGFEQEATEERERAELVYANYDTVDEVLTTVRGALE 354
Query: 160 NQLSWEDIEARVKQAQRHNDPVASII-------KQLKLNINHITLLLRVHKERSNIKDSW 212
WE+IEA +Q A + + +++ T+ L V RS ++
Sbjct: 355 EGRGWEEIEATFEQGAEQGIDAAERVTGFDPENGMVSVDLGEATVSLEV---RSGVE--- 408
Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
NAD Y AKR + K A ++ I T + L
Sbjct: 409 ----KNAD--------------------RIYTEAKRIEEKKAGA----EEAIADTREELD 440
Query: 273 SAEKKTKQ--TLKDVQ------------TMTNINKARKVYWFEKFYWFISSENYL 313
+ ++ +Q T + Q + +I + W+E F WF +S+ YL
Sbjct: 441 ALRERKRQWETRDETQDDGGEPEEIDWLSRASIPVRKSEEWYEDFRWFHTSDGYL 495
>gi|448346455|ref|ZP_21535340.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
12890]
gi|445632658|gb|ELY85869.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
12890]
Length = 715
Score = 44.3 bits (103), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 55/260 (21%), Positives = 114/260 (43%), Gaps = 21/260 (8%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
TP + +H + ++SF +A+D +F AE + D + + ++ + K E +
Sbjct: 268 TPFPLEEHDDLEGEPYDSFLSALDAYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 327
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ-AQRH 177
+ + EQ +E+AEL+ VD + I+ + SW+DI R ++ A++
Sbjct: 328 QQGAIEGFEQEAASLREQAELLYAEYGLVDDILSTIQGARERERSWDDIRERFEEGAEQG 387
Query: 178 NDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS----ETKKPSL 233
D A+I+ ++ + T+ + + ER ++ D+ NAD + E K +L
Sbjct: 388 IDAAAAIV---DIDGSDGTVTVEIDDERIDL-DAQQGVEQNADRLYTEAKRVEEKKDGAL 443
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
++ A+AKR + + + +S ++ +D + ++I
Sbjct: 444 AAIEDTRQDLADAKR--------RRDEWEADESGGGDDDETDEDGDDLPRDWLSESSIPI 495
Query: 294 ARKVYWFEKFYWFISSENYL 313
WF++F WF +S+ +L
Sbjct: 496 RENEPWFDRFRWFNTSDGFL 515
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 25/40 (62%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
D + + + V +DQVSKT +GE+L G F IRG + ++
Sbjct: 588 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYY 627
>gi|374326819|ref|YP_005085019.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
gi|356642088|gb|AET32767.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
Length = 621
Score = 44.3 bits (103), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW + + ++V+ +QVSK P+G++L GSFMI GK+ + ++ + +
Sbjct: 484 AWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMIYGKREYVRNVRMELAVG 536
>gi|145591891|ref|YP_001153893.1| hypothetical protein Pars_1690 [Pyrobaculum arsenaticum DSM 13514]
gi|145283659|gb|ABP51241.1| protein of unknown function DUF814 [Pyrobaculum arsenaticum DSM
13514]
Length = 614
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW + + ++V+ +QVSK P+G++L GSFM+ GK+ + +L + +
Sbjct: 477 AWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYVRHIRLELAVG 529
>gi|379003409|ref|YP_005259081.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
gi|375158862|gb|AFA38474.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
Length = 614
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW + + ++V+ +QVSK P+G++L GSFM+ GK+ + +L + +
Sbjct: 477 AWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYVRHIRLELAVG 529
>gi|41615287|ref|NP_963785.1| hypothetical protein NEQ506 [Nanoarchaeum equitans Kin4-M]
gi|40069011|gb|AAR39346.1| NEQ506 [Nanoarchaeum equitans Kin4-M]
Length = 255
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 29/42 (69%), Gaps = 1/42 (2%)
Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
AW + + + ++V Q+SK AP+GE+L GSFM+ GKK++
Sbjct: 142 AWQEGLASIDVFYVLGKQISKKAPSGEYLKHGSFMVYGKKHY 183
>gi|448730186|ref|ZP_21712496.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
5350]
gi|445793917|gb|EMA44482.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
5350]
Length = 699
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 31/61 (50%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
+ + +A+ DQVSKT +GE+L G F IRG + +F + + + + + E I
Sbjct: 572 NGRFAGDAYMASPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGIVCEPETRVI 631
Query: 379 S 379
Sbjct: 632 G 632
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 50/268 (18%), Positives = 101/268 (37%), Gaps = 41/268 (15%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVKRD 118
TP + + + F+SF AA+D +F ++ + D + +ER D +++ + +
Sbjct: 257 TPFPLEERADLDSEGFDSFTAALDAYFVALDTTE-DEEGGGRERPDFEDDIERQQRIIEQ 315
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
E + E ++ KAE + + + VD + +R W+DIE R +
Sbjct: 316 QEGAIEDFEDQADAERAKAESLYAHYDLVDEILSTVRNAREQGTGWDDIEERFAEGADRG 375
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
A + + + +T+ + + +D NAD
Sbjct: 376 IAAAEAVDGVTPSEGTVTVDIDGRSVELDPRDGV---EQNADRL---------------- 416
Query: 239 DLSAYANAKRFFDLKRSA------AKKQQKTIQSTEKALKSAEKKTKQTLKDVQ------ 286
Y AKR K A + + + +Q +SA++ ++ +
Sbjct: 417 ----YKEAKRVVGKKEGAEEAVAETRAELEALQRRRDEWESADENETESTDTDEDEDIDW 472
Query: 287 -TMTNINKARKVYWFEKFYWFISSENYL 313
T +I + W+E+F WF +S+ +L
Sbjct: 473 LTRRSIPVRQNEQWYERFRWFRTSDGFL 500
>gi|448737510|ref|ZP_21719550.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
13552]
gi|445803654|gb|EMA53937.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
13552]
Length = 695
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
+ + +A+ + DQVSKT +GE+L G F IRG + +F +A+GI+
Sbjct: 569 NGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGIT 620
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 58/296 (19%), Positives = 115/296 (38%), Gaps = 65/296 (21%)
Query: 48 VANMEFHPMILQPGSTPSVMRQHQGFPIQE--------FESFAAAVDEFFSTAESHKIDL 99
+++ +F+P I + P + FP+ E F+SF AA+D +F ++ + +
Sbjct: 237 LSSGDFNPRIYRDDGDPVDVTP---FPLDERAELDSEGFDSFTAALDAYFVELDTTEDEE 293
Query: 100 KAVQQERDALKKLENVKR---DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQ 156
++ D +++E +R E + E ++E AE + N E VD + +R
Sbjct: 294 SGGRERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYANYELVDEILTTVRN 353
Query: 157 DIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHH 216
+ WE IE R + + A + ++ + +T+ +I D
Sbjct: 354 AREEGIGWEAIEERFAEGEERGIAAAEAVSGIEPSEGTVTV---------DIDDR----- 399
Query: 217 DNADPFADSSETKKPSLVDVDLDL--SAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKAL 271
DV+LD NA R + + KR KK+ ++ + T + L
Sbjct: 400 ------------------DVELDPQEGVEQNADRLYREAKRVVEKKEGAEEAVAETREEL 441
Query: 272 KSAEKKTKQTLKDVQ--------------TMTNINKARKVYWFEKFYWFISSENYL 313
++ E++ + + +I W+E+F WF +S+ +L
Sbjct: 442 EAIERQRDEWEAGDVDDDPDEESEDVDWLSQRSIPVRTDEQWYERFRWFHTSDGFL 497
>gi|448339346|ref|ZP_21528374.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
gi|445620575|gb|ELY74071.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
Length = 721
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 25/40 (62%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
D + + + V +DQVSKT +GE+L G F IRG + ++
Sbjct: 594 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYY 633
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 56/259 (21%), Positives = 113/259 (43%), Gaps = 19/259 (7%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
TP + +H+ + ++SF +A+D++F AE + D + + ++ + K E +
Sbjct: 274 TPFPLEEHEDLDGEPYDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 333
Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
+ + EQ +E+AEL+ VD + I SW+DI R ++
Sbjct: 334 QQGAIEGFEQEAESLREQAELLYAEYGLVDDILSTILGARKRDRSWDDIRDRFEEGAEQG 393
Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVD 235
A + + ++ + T+ + + ER ++ D+ NAD A E KK
Sbjct: 394 IDAAEAV--VDVDGSDGTVTVDIDDERISL-DAQQGVEQNADRLYTEAKRVEEKKEGA-- 448
Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQ-KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
L+A N + DL+ + ++ + + +S A++ + + +D + +I
Sbjct: 449 ----LAAIENTRD--DLEDAKRRRDEWEDDESGGADEAEADEDEEDSQRDWLSEPSIPIR 502
Query: 295 RKVYWFEKFYWFISSENYL 313
WF++F WF +S+ YL
Sbjct: 503 ENEPWFDRFRWFHTSDGYL 521
>gi|448340269|ref|ZP_21529242.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
gi|445630575|gb|ELY83836.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
Length = 722
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 25/40 (62%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
D + + + V +DQVSKT +GE+L G F IRG + ++
Sbjct: 595 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYY 634
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 57/265 (21%), Positives = 113/265 (42%), Gaps = 31/265 (11%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
TP + +H + ++SF +A+D++F +++L A ++E D + K
Sbjct: 275 TPFPLEEHDDLEGEPYDSFLSALDDYF-----FRLEL-AAEEEPDPTDQRPDFESEIAKH 328
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + + + EQ +E+AEL+ VD + I+ + SW++I R +
Sbjct: 329 ERIIEQQQGAIEGFEQEAASLREQAELLYAEYGLVDEILSTIQGARERERSWDEIRERFE 388
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
+ A I + ++ + T+ + + ER ++ D+ NAD A E K
Sbjct: 389 EGAEQGIDAAEAI--VDIDGSDGTVTVEIDDERIDL-DAQQGVEQNADRLYTEAKRVEEK 445
Query: 230 KPSLVDVDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM 288
K D L+A N ++ D KR + + + +S + + +D +
Sbjct: 446 K------DGALAAIENTRQDLADAKRR--RDEWEADESGGEDDDETDADGDDLPRDWLSE 497
Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
++I WF++F WF +S+ +L
Sbjct: 498 SSIPIRENEPWFDRFRWFHTSDGFL 522
>gi|110667755|ref|YP_657566.1| hypothetical protein HQ1801A [Haloquadratum walsbyi DSM 16790]
gi|109625502|emb|CAJ51929.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
Length = 719
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V DQVSKT +GE++ GSF+IRG + + ++A+GI
Sbjct: 592 EGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGDRTYIRDVAAEIAVGI 642
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 53/265 (20%), Positives = 112/265 (42%), Gaps = 19/265 (7%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERD---ALKKLENVKRDHE 120
P R+ Q + ++SF A+D +F + + ++ Q D + K + + E
Sbjct: 260 PLEERKQQNLDVTTYDSFNGALDVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQE 319
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI-EARVKQAQRHND 179
+ EQ ++ +AEL+ N E V+ I I+ A SW++I E A+R D
Sbjct: 320 GAIDDFEQRAEAERSRAELLYANYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGID 379
Query: 180 PVASIIK------QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
A+++ + + I+ + + + V ++ + + A + E ++
Sbjct: 380 AAAAVVSVDGAEAMVTIEIDDMRVPVNVD---VGVEKNADQRYTEAKRIEEKKEGALTAI 436
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKA---LKSAEKKTKQTLKDVQ--TM 288
+ +L+A + +D + A + +TE + + + ++ D + +M
Sbjct: 437 ENTREELNAVKQRRDAWD-REDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTDDEWLSM 495
Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
T+I + W+E+F WF +S YL
Sbjct: 496 TSIPLQKNDDWYEQFRWFHTSTGYL 520
>gi|385803199|ref|YP_005839599.1| hypothetical protein Hqrw_1937 [Haloquadratum walsbyi C23]
gi|339728691|emb|CCC39852.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
Length = 719
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
+ + +A+ V DQVSKT +GE++ GSF+IRG + + ++A+GI
Sbjct: 592 EGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGDRTYIRDVAAEIAVGI 642
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 54/265 (20%), Positives = 109/265 (41%), Gaps = 19/265 (7%)
Query: 64 PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERD---ALKKLENVKRDHE 120
P R+ Q + ++SF A+D +F + + ++ Q D + K + + E
Sbjct: 260 PLEERKQQNLDVTAYDSFNGALDVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQE 319
Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI-EARVKQAQRHND 179
+ EQ ++ +AEL+ N E V+ I I+ A SW++I E A+R D
Sbjct: 320 GAIDDFEQRAEAERSRAELLYANYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGID 379
Query: 180 PVASIIK------QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
A+++ + + I+ + + + V ++ + + A + E ++
Sbjct: 380 AAAAVVSVDGAEAMVTIEIDDVRVPVNVD---VGVEKNADQRYTEAKRIEEKKEGALTAI 436
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK-----QTLKDVQTM 288
+ +L+A + +D + A + +TE T+ T + +M
Sbjct: 437 ENTREELNAVKQRRDAWD-REDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTNDEWLSM 495
Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
T+I + W+E+F WF +S YL
Sbjct: 496 TSIPLQKNDDWYEQFRWFHTSTGYL 520
>gi|374850433|dbj|BAL53422.1| hypothetical conserved protein [uncultured crenarchaeote]
Length = 530
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
+ + VKA+QVS T P+G++L GSFM+ G K F+
Sbjct: 411 VSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKFY 445
>gi|397781041|ref|YP_006545514.1| hypothetical protein BN140_1875 [Methanoculleus bourgensis MS2]
gi|396939543|emb|CCJ36798.1| putative protein MJ1625 [Methanoculleus bourgensis MS2]
Length = 659
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 62/280 (22%), Positives = 111/280 (39%), Gaps = 78/280 (27%)
Query: 155 RQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVL 214
R+++ Q E I+ K+ +R+ V + + IT L ++RS + +L
Sbjct: 312 REEVIRQRQAEAIKGFEKKIRRYERVVEVLYENYTAVTGVITTLDAASRDRSWQEIEQIL 371
Query: 215 HHDNADPFADSSETKKPSLVDVDLDLSA-----YA------NAKRFFDLKRSAAKKQQKT 263
++ + A P+ V+LDL+ Y N R++D + KK+
Sbjct: 372 KSNSDNAAAKMIRAVHPAEAAVELDLAGERVKVYVHETIEQNIGRYYDQIKKFKKKKAGA 431
Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---------- 313
+ + E+A+ + + + KQ L K R W+ +F WF +S+ L
Sbjct: 432 LAAMERAI-TVKPRRKQHLV-------FQKKR---WYHRFRWFSTSDGVLVIGGRDASQN 480
Query: 314 ---------------------GNV-------------AWDAKVVTNAW----------WV 329
G+V A A +NAW
Sbjct: 481 EELVKKYMEGGDLFIHADVHGGSVVIVKGATEHLDEAAQFAASYSNAWKAGHFSADVYAA 540
Query: 330 KADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
+ DQVSKTA +GE++ G+F++RG++ +F P +A+G+
Sbjct: 541 RPDQVSKTAESGEYVARGAFIVRGERQYFRNVPVGVAIGL 580
>gi|397772651|ref|YP_006540197.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
gi|397681744|gb|AFO56121.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
Length = 722
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 57/265 (21%), Positives = 113/265 (42%), Gaps = 31/265 (11%)
Query: 63 TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
TP + +H + ++SF +A+D++F +++L A ++E D + K
Sbjct: 275 TPFPLEEHDDLEGEPYDSFLSALDDYF-----FRLEL-AAEEEPDPTDQRPDFESEIAKH 328
Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
E + + + EQ +E+AEL+ VD + I+ + SW++I R +
Sbjct: 329 ERIIEQQQGAIEGFEQEAASLREQAELLYAEYGLVDEILSTIQGARERERSWDEIRERFE 388
Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
+ A I + ++ + T+ + + ER ++ D+ NAD A E K
Sbjct: 389 EGAEQGIDAAEAI--VDIDGSDGTVTVEIDDERIDL-DAQQGVEQNADRLYTEAKRVEEK 445
Query: 230 KPSLVDVDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM 288
K D L+A N ++ D KR + + + +S + + +D +
Sbjct: 446 K------DGALAAIENTRQDLADAKRR--RDEWEADESGGEDDDETDADGDDLPRDWLSE 497
Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
++I WF++F WF +S+ +L
Sbjct: 498 SSIPIRENEPWFDRFRWFHTSDGFL 522
>gi|355571923|ref|ZP_09043131.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
gi|354825019|gb|EHF09254.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
Length = 633
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 81/204 (39%), Gaps = 79/204 (38%)
Query: 231 PSLVDVDLDL----------SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSA-EKKTK 279
P VD+DL S ANA +++ KK ++ I + A++ A +KK +
Sbjct: 362 PGRGAVDIDLGERVTLTVGESIEANAAAYYE----EIKKYRRKIAGAQAAMERAVQKKER 417
Query: 280 QTLKDVQTMTNINKARKVYWFEKFYWFI------------SSEN------YLGN------ 315
+T++ A K W+ +F WFI +S+N Y+
Sbjct: 418 RTVRAA--------AGKKRWYHRFRWFITSDGVLVVGGRDASQNEELVKKYMEGSDLFVH 469
Query: 316 --------------------VAWDAKVVTNAW----------WVKADQVSKTAPTGEFLT 345
VA A + AW V QVSKT +GE+++
Sbjct: 470 ADVHGASVVIVKGKTGKMDEVATFAASYSGAWKSGHLAADVYCVAPSQVSKTPESGEYVS 529
Query: 346 TGSFMIRGKKNFFP--PCQLAMGI 367
GSF++RG++ +F P +A+G+
Sbjct: 530 RGSFIVRGERRYFRNVPLGIAIGL 553
>gi|296241940|ref|YP_003649427.1| hypothetical protein Tagg_0195 [Thermosphaera aggregans DSM 11486]
gi|296094524|gb|ADG90475.1| protein of unknown function DUF814 [Thermosphaera aggregans DSM
11486]
Length = 666
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
AW + +W VS + P+GE+L G+ M+ GKKN+ P +L +G+
Sbjct: 521 AWKTGMGFIEVFWTMGSSVSLSPPSGEYLPKGAIMVYGKKNYLKTPLRLGLGL 573
>gi|315427275|dbj|BAJ48887.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
gi|343485854|dbj|BAJ51508.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
Length = 628
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
+ + VKA+QVS T P+G++L GSFM+ G K F+
Sbjct: 509 VSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKFY 543
>gi|424812620|ref|ZP_18237860.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
Nanosalinarum sp. J07AB56]
gi|339756842|gb|EGQ40425.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
Nanosalinarum sp. J07AB56]
Length = 628
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 79/386 (20%), Positives = 136/386 (35%), Gaps = 136/386 (35%)
Query: 57 ILQPGSTPSVMRQHQG-------FPIQ--------EFESFAAAVDEFFSTAESHKIDLKA 101
+L+ G P + +G FP+Q FE+F+ A+DE F K++ K
Sbjct: 210 VLELGDDPVLYTDPEGEPVRAAPFPLQTYSEHEEERFETFSRALDELFHRRRQQKLESKR 269
Query: 102 VQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN-------QESVDAAILAI 154
+ + R+ + +E E + LEQ ++ AE I N ++ VD+ I
Sbjct: 270 MDKYRERREGIERQLHQQEQKAEGLEQAARQRRQAAETIYENYQVFHDLKQKVDSVIHEE 329
Query: 155 RQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVL 214
+ A QL D+E+ +NH RV
Sbjct: 330 GWESAEQLEVSDLES----------------------VNHQERFYRVA------------ 355
Query: 215 HHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSA 274
D +E K + D S A A R +D AK++++ ++T +AL++
Sbjct: 356 --------IDGAEVK------LSPDESLEAAASRMYD----EAKEREQKAENTREALQNT 397
Query: 275 EKKTKQTLKDVQTMTNI----NKARKVYWFEKFYWFISSENYL----------------- 313
K ++ +D + +++R WFEK+ WF + E L
Sbjct: 398 RGKLEELEEDEFEVEEDSMERDESRSKRWFEKYRWFHTPEGRLVICGRGPQTNESLVKNH 457
Query: 314 ----------------------GNVAWDAKV---------VTNAW--WVKADQVSKTAP- 339
G A + ++ + AW + AD V P
Sbjct: 458 LEGDDLYLHADFDGAPSVALKDGQDASEEEIRQAAKAAVTFSKAWKSGIGADDVYYVEPS 517
Query: 340 -------TGEFLTTGSFMIRGKKNFF 358
+GE+L G+F+IRG + +
Sbjct: 518 QVTKNPESGEYLEKGAFVIRGDRTYL 543
>gi|359415829|ref|ZP_09208221.1| hypothetical protein HRED_04719, partial [Candidatus Haloredivivus
sp. G17]
gi|358033813|gb|EHK02326.1| hypothetical protein HRED_04719 [Candidatus Haloredivivus sp. G17]
Length = 194
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 50/93 (53%), Gaps = 8/93 (8%)
Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK----TKQTLKDVQTMT 289
+ +DL A A +++D AK+ + +++ EKAL+ E + ++ ++ + M
Sbjct: 52 IKIDLHQDLEATASQYYD----KAKESESKMENAEKALEKTEDEIESLGEEDIELEEVME 107
Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAKV 322
+ ++ R WFEK+ WF SS+ YL + DA+
Sbjct: 108 DKSEKRSKKWFEKYRWFYSSDGYLVCLGRDAQT 140
>gi|126460385|ref|YP_001056663.1| hypothetical protein Pcal_1780 [Pyrobaculum calidifontis JCM 11548]
gi|126250106|gb|ABO09197.1| protein of unknown function DUF814 [Pyrobaculum calidifontis JCM
11548]
Length = 616
Score = 42.0 bits (97), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 16/53 (30%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
AW + + ++ + +QVSK P G++L GSFM+ GK+ + +L + +
Sbjct: 479 AWKIGIHAIDVYYARGEQVSKQPPAGQYLARGSFMVYGKREYVRNVRLELAVG 531
>gi|432330923|ref|YP_007249066.1| putative RNA-binding protein, snRNP like protein [Methanoregula
formicicum SMSP]
gi|432137632|gb|AGB02559.1| putative RNA-binding protein, snRNP like protein [Methanoregula
formicicum SMSP]
Length = 630
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
QVSKT GEF++ GSF++RG++ ++ L+ GI +
Sbjct: 512 QVSKTPEAGEFVSRGSFIVRGERTYYRNIPLSTGIGLML 550
>gi|152997342|ref|YP_001342177.1| hypothetical protein Mmwyl1_3337 [Marinomonas sp. MWYL1]
gi|150838266|gb|ABR72242.1| conserved hypothetical protein [Marinomonas sp. MWYL1]
Length = 232
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 66/129 (51%), Gaps = 11/129 (8%)
Query: 17 TLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI- 75
T+ + +L V+ G K+E RPN +D+++ + I+ PG T SV+ ++ G P+
Sbjct: 50 TVEIFALDLLVNKNG----KEELRPNDDDFLIIPLT---TIIPPGKTQSVIIRYIGEPVL 102
Query: 76 QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
E +S+ A+++ T + +ID + + K L NV ++ ++ + VDKE
Sbjct: 103 SESKSYRIAINQV--TVDLGEIDGSGIGMAL-SFKTLLNVVPNNAKSNLVIKNKEQVDKE 159
Query: 136 KAELIINNQ 144
+++ N+
Sbjct: 160 TWSVLLENK 168
>gi|290474141|ref|YP_003467018.1| toxin RtxA protein [Xenorhabdus bovienii SS-2004]
gi|289173451|emb|CBJ80230.1| putative toxin RtxA protein [Xenorhabdus bovienii SS-2004]
Length = 4716
Score = 38.9 bits (89), Expect = 4.0, Method: Composition-based stats.
Identities = 48/181 (26%), Positives = 73/181 (40%), Gaps = 19/181 (10%)
Query: 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRH 177
DH + A Q L DKEKAE E LA QL D+EA + Q
Sbjct: 1615 DH-VQQDAAAQNALQDKEKAEADRQRLEQERDTQLAAVSGSQAQLEATDLEALNRNGQDQ 1673
Query: 178 NDPV----ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
D + +I K+L ++ L K + W A+ +T+ S
Sbjct: 1674 RDAINHEAQAITKELTEQAKGLSALDSHSKHHGKSGEQWRAQF--AEGLLGQVQTQLNSA 1731
Query: 234 VDV---DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-QTMT 289
+ LD S +A ++RF DL+ + AK +E + E+ +Q+ K++ QT T
Sbjct: 1732 DETAQQQLDRSQHAASERFGDLRDAVAK--------SEAGVSKGEQNQQQSQKEIEQTRT 1783
Query: 290 N 290
N
Sbjct: 1784 N 1784
>gi|348664654|gb|EGZ04498.1| hypothetical protein PHYSODRAFT_536255 [Phytophthora sojae]
Length = 580
Score = 38.1 bits (87), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 9/84 (10%)
Query: 98 DLKAVQQERDALKK---LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAI 154
D + QQE +A ++ +RD RL+ L TQL+ + ++ +N Q+ DA A+
Sbjct: 202 DRRHFQQELNASRRDVQTAKCQRDKVERLAQLRHTQLLKESARKVRVNEQKRRDAQ--AV 259
Query: 155 RQDIANQLSWED----IEARVKQA 174
RQ IA+ L + D +E++VKQA
Sbjct: 260 RQLIADTLEYRDELSIMESKVKQA 283
>gi|301111582|ref|XP_002904870.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095200|gb|EEY53252.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 583
Score = 37.7 bits (86), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 6/61 (9%)
Query: 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED----IEARVKQ 173
D +L+ L TQLV + +L IN Q+ DA A+RQ IA+ L + D +E+R+KQ
Sbjct: 224 DKAEKLANLRHTQLVKESARKLRINEQKRRDAQ--AVRQTIADTLEYRDELNAMESRLKQ 281
Query: 174 A 174
A
Sbjct: 282 A 282
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.128 0.364
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,382,614,316
Number of Sequences: 23463169
Number of extensions: 207395567
Number of successful extensions: 799899
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 648
Number of HSP's successfully gapped in prelim test: 1298
Number of HSP's that attempted gapping in prelim test: 795570
Number of HSP's gapped (non-prelim): 4509
length of query: 380
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 236
effective length of database: 8,980,499,031
effective search space: 2119397771316
effective search space used: 2119397771316
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 78 (34.7 bits)