BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy4691
         (380 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|340713692|ref|XP_003395373.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
           terrestris]
          Length = 971

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 211/416 (50%), Positives = 263/416 (63%), Gaps = 92/416 (22%)

Query: 29  SQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P +   E++I  N+EFHP + +         Q+  +P +EF+SF  AV
Sbjct: 242 SKGYIIQKKESKPTTDGKENFIYTNIEFHPFLFE---------QYADYPYKEFDSFDVAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQERDALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 293 DEYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDKQKAELISRNQA 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLETNHISLLLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ DS E    KP L+D+DL  +A+ NA ++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNATKYYNQKRSAAKKQQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---------- 313
           I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL          
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513

Query: 314 ------------------------------GNVAWDAKVVTN----------AW------ 327
                                         GN +   K +            AW      
Sbjct: 514 ELIVKRYLKSGDIYVHADLTGASSVVIKNPGNDSVPPKTLAEAGTMAVAYSIAWDAKVVA 573

Query: 328 ---WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
              WV  DQVSKTAPTGE+LTTGSFMIRGKKN+ PPCQL MG+ FLF+LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIER 629


>gi|350409527|ref|XP_003488770.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
           impatiens]
          Length = 971

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 213/416 (51%), Positives = 265/416 (63%), Gaps = 92/416 (22%)

Query: 29  SQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P +   ED+I  N+EFHP + +         Q+  +P +EF+SF  AV
Sbjct: 242 SKGYIIQKKESKPTADGKEDFIYTNIEFHPFLFE---------QYTNYPYKEFDSFDVAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQERDALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 293 DEYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDKQKAELISRNQT 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKVLLKEAESRGDPVASAIKQLKLETNHISLLLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ DS E    KP L+D+DL  +A+ NA ++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNATKYYNQKRSAAKKQQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---------- 313
           I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL          
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513

Query: 314 ----------GNVAWDAK-------VVTN-----------------------AW------ 327
                     G++   A        V+ N                       AW      
Sbjct: 514 ELIVKRYLKSGDIYVHADLTGASSVVIKNPGSDSVPPKTLAEAGTMAVAYSIAWDAKVVA 573

Query: 328 --W-VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
             W V  DQVSKTAPTGE+LTTGSFMIRGKKN+ PPCQL MG+ FLF+LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIER 629


>gi|345495372|ref|XP_001603770.2| PREDICTED: nuclear export mediator factor NEMF homolog [Nasonia
           vitripennis]
          Length = 972

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 218/444 (49%), Positives = 274/444 (61%), Gaps = 97/444 (21%)

Query: 6   DVSSDIHKCGPTLHVLVSNLNVS----SQGYLVQKKEKRP---NSEDYIVANMEFHPMIL 58
           DV+ D+ K    L      LN +    S+GY++QKKE++P     E+++ AN+EFHP + 
Sbjct: 215 DVAKDVDKLYSALENAEKMLNNAKKDVSKGYIIQKKEEKPIKDGEEEFMYANIEFHPFLF 274

Query: 59  QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
           +         QH     +E+E+F  AVDE+FST E  K+DLK +QQERDALKKL+NVK+D
Sbjct: 275 E-----QCKNQH----YKEYETFDKAVDEYFSTMEGQKLDLKVLQQERDALKKLDNVKKD 325

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
           H+ RL  L +TQ  DK+KAELI  NQE VD AILA++  +ANQ+SW+DI+  +K+AQ   
Sbjct: 326 HDQRLVTLGKTQEADKQKAELITRNQELVDNAILAMQSALANQMSWQDIQTLLKEAQAKG 385

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK---KPSLVD 235
           DPVAS IK LKL  NHIT+LL                   +DP+ DS + +   KP  VD
Sbjct: 386 DPVASAIKHLKLESNHITMLL-------------------SDPYEDSDDDEPELKPMTVD 426

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKAR 295
           +DL  SA++NA R++D KRSAAKKQQKTI+S  KALKSAE+KTKQTLK+VQ + +INKAR
Sbjct: 427 IDLAHSAFSNATRYYDQKRSAAKKQQKTIESQGKALKSAERKTKQTLKEVQAIHSINKAR 486

Query: 296 KVYWFEKFYWFISSENYL--------------------GNVAWDAK-------VVTN--- 325
           KVYWFEKFYWFI+SENYL                    G+V   A        VV N   
Sbjct: 487 KVYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLRSGDVYVHADLTGASSVVVKNPNG 546

Query: 326 --------------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKN 356
                               AW         WV +DQVSKTAPTGE+LTTGSFMIRGKKN
Sbjct: 547 GPVPPKSLAEAGTMAVAYSIAWEAKVIAGSYWVNSDQVSKTAPTGEYLTTGSFMIRGKKN 606

Query: 357 FFPPCQLAMGISFLFKLEESSISR 380
           + PPCQL MG+ FLF+LE+SSI R
Sbjct: 607 YLPPCQLIMGLGFLFRLEDSSIER 630


>gi|403277934|ref|XP_003930597.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 1056

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 208/414 (50%), Positives = 256/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKRETKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|390469065|ref|XP_003734045.1| PREDICTED: nuclear export mediator factor NEMF isoform 2
           [Callithrix jacchus]
          Length = 1056

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 207/414 (50%), Positives = 256/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA +SAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|345804334|ref|XP_863447.2| PREDICTED: nuclear export mediator factor NEMF isoform 6 [Canis
           lupus familiaris]
          Length = 1056

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 204/414 (49%), Positives = 255/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREVKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           +  E++  +          +      +  +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTTGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|426376842|ref|XP_004055191.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Gorilla
           gorilla gorilla]
          Length = 1056

 Score =  369 bits (948), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 207/414 (50%), Positives = 256/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|397523544|ref|XP_003831789.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
           paniscus]
          Length = 1055

 Score =  369 bits (948), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 207/414 (50%), Positives = 255/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|395838618|ref|XP_003792209.1| PREDICTED: nuclear export mediator factor NEMF [Otolemur garnettii]
          Length = 1056

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 206/414 (49%), Positives = 256/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N + +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFNGKGYIIQKREIKPSLEADKPAEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVATAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  + S        +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVVDDVSVEKNETEPSKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|194388162|dbj|BAG65465.1| unnamed protein product [Homo sapiens]
          Length = 1055

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 207/414 (50%), Positives = 255/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|332842178|ref|XP_003314363.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
           troglodytes]
          Length = 1055

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 207/414 (50%), Positives = 255/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I    EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDIFTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 591

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 645


>gi|426233098|ref|XP_004010554.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Ovis
           aries]
          Length = 1055

 Score =  368 bits (945), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 202/412 (49%), Positives = 254/412 (61%), Gaps = 68/412 (16%)

Query: 28  SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           + +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFESF 
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR  
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413

Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
                         +  E++  +          +      +  +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 307 ISSENYL--------------------------------GNV------AWDAKVVTNAWW 328
           ISSENYL                                G +      AWDA+V+T+AWW
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSAWW 593

Query: 329 VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           V   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 594 VYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFSFLFKVDESCVWR 645


>gi|338717943|ref|XP_001496390.3| PREDICTED: nuclear export mediator factor NEMF [Equus caballus]
          Length = 827

 Score =  367 bits (943), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 203/414 (49%), Positives = 254/414 (61%), Gaps = 68/414 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N+  +GY++QK+E +P+ E       I+   EFHP +           QH   P  EFES
Sbjct: 12  NLKVKGYIIQKREMKPSLEVDKPTQDILTYEEFHPFLFS---------QHSQCPYIEFES 62

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 63  FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELI 122

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 123 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 182

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           +  E++  +          +      +  +P LVDVDL LSAYA
Sbjct: 183 NPYLLSEEEDDDVDGDISVEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 242

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 243 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 302

Query: 305 WFISSENYL--------------------------------GNV------AWDAKVVTNA 326
           WFISSENYL                                G +      AWDA+V+T+A
Sbjct: 303 WFISSENYLIIGGRDQQQNEIIVKRYLTPGEPIPPRTLTEAGTMALCYSAAWDARVITSA 362

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 363 WWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 416


>gi|328781799|ref|XP_395865.4| PREDICTED: serologically defined colon cancer antigen 1 homolog
           isoform 1 [Apis mellifera]
          Length = 970

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 199/416 (47%), Positives = 256/416 (61%), Gaps = 92/416 (22%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P ++F SF  AV
Sbjct: 242 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKKFASFDVAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 293 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAAKKQQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
           I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL     D +  
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513

Query: 322 -------VVTNAWWVKADQVSKTA-----PTG---------------------------- 341
                  + T   +V AD    ++     P G                            
Sbjct: 514 ELIVKRYLKTGDIYVHADLTGASSVIIKNPGGSTVPPKTLAEAGTMAVAYSIAWDAKVVA 573

Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                            E+LTTGSFMIRGKKN+ PPCQL MG+ FLF+LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIER 629


>gi|380024993|ref|XP_003696268.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           homolog [Apis florea]
          Length = 970

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 200/416 (48%), Positives = 255/416 (61%), Gaps = 92/416 (22%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P +EF SF  AV
Sbjct: 242 SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 293 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQT 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAAKKQQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
           I+S +KALKSAEKKTKQTLK+VQT+ +INK RK+YWFEKFYWFISSENYL     D +  
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQN 513

Query: 322 -------VVTNAWWVKADQVSKTA-----PTG---------------------------- 341
                  + T   +V AD    ++     P G                            
Sbjct: 514 ELIVKRYLKTGDIYVHADLTGASSVIIKNPGGGSVPPKTLAEAGTMAVAYSIAWDAKVVA 573

Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                            E+LTTGSFMIRGKKN+ PPCQL MG+ FLF LEESSI R
Sbjct: 574 GAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFXLEESSIER 629


>gi|307209071|gb|EFN86238.1| Serologically defined colon cancer antigen 1-like protein
           [Harpegnathos saltator]
          Length = 989

 Score =  351 bits (900), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 198/417 (47%), Positives = 257/417 (61%), Gaps = 92/417 (22%)

Query: 28  SSQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           +S+GY++QKKE RP     E++I AN+EFHP++ +         Q++  P +EF+SF A 
Sbjct: 241 ASKGYIIQKKEARPTQDGKEEFIFANIEFHPLLFE---------QYKDQPYKEFDSFDAT 291

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDE+FST E  K+DLKA+QQER+ALKKLENV++DH+ RL  LE+TQ VDK+KAELI  NQ
Sbjct: 292 VDEYFSTMEGQKLDLKALQQEREALKKLENVRKDHDQRLITLEKTQEVDKQKAELISRNQ 351

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
             VD AILAI+  +ANQ+SW DI+  +K+AQ  +DPVAS IKQLKL  NHI+LLL     
Sbjct: 352 TLVDNAILAIQSALANQMSWPDIQVLLKEAQARSDPVASAIKQLKLETNHISLLLH---- 407

Query: 205 RSNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK 262
                          DP+ +S E    KP ++DVDL  +A+ NA++++  KRSAAKKQQK
Sbjct: 408 ---------------DPYEESDEESELKPMIIDVDLAHTAFGNARKYYSQKRSAAKKQQK 452

Query: 263 TIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWD 319
           TI+S  KALKSAEKKTKQTLK+VQT+ +I K RKVYWFEKFYWFI+SENYL   G     
Sbjct: 453 TIESHGKALKSAEKKTKQTLKEVQTIHSIIKLRKVYWFEKFYWFITSENYLVIGGRDQQQ 512

Query: 320 AKVVTNAW------WVKADQVSKTA-----PTG--------------------------- 341
            +++   +      +V AD    ++     PTG                           
Sbjct: 513 NELIVKRYLRAGDLYVHADLTGASSVVIKNPTGGFVPPKSLAEAGTMAIAYSVAWDAKVV 572

Query: 342 ------------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                             E+LTTGSFMIRGKKN+ PP QL MG+  +F+LEE+SI R
Sbjct: 573 ANAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLPPSQLIMGLGIMFRLEENSIER 629


>gi|332237024|ref|XP_003267700.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Nomascus leucogenys]
          Length = 1058

 Score =  348 bits (893), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 199/416 (47%), Positives = 250/416 (60%), Gaps = 70/416 (16%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW--FEK 302
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYW  F K
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWXVFSK 531

Query: 303 FYWFISSENYLG--------------------------------------NVAWDAKVVT 324
               +S EN+L                                       + AWDA+V+T
Sbjct: 532 LLGRLSQENHLNPGGEDLQRTEVLILCIVIPGEPIPPRTLTEAGTMALCYSAAWDARVIT 591

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           +AWWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 592 SAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 647


>gi|195451571|ref|XP_002072981.1| GK13887 [Drosophila willistoni]
 gi|194169066|gb|EDW83967.1| GK13887 [Drosophila willistoni]
          Length = 1004

 Score =  343 bits (879), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 200/430 (46%), Positives = 255/430 (59%), Gaps = 97/430 (22%)

Query: 20  VLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
           +L       S+GY++Q KE++P      DY   N+EFHP +           Q +   ++
Sbjct: 274 ILSDGRKGPSKGYIMQVKEEKPTDGGDVDYFFRNVEFHPFLFS---------QLKHLEVE 324

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E E+F  AVDEFFS  ES +ID+K + QERDALKKL N+K DH  RL  L + Q VDK K
Sbjct: 325 EHETFMTAVDEFFSKQESQRIDMKTLGQERDALKKLSNIKNDHAQRLEDLNKVQSVDKRK 384

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
           AELI  NQ  VD AILA++  IA+QL W DI   VK+AQ + D VA+ IKQLKL  NHI+
Sbjct: 385 AELITCNQSLVDKAILAVQSAIASQLPWPDIRQLVKEAQANGDIVANSIKQLKLETNHIS 444

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFA-------DSSETKKPSLVDVDLDLSAYANAKRF 249
           L+L                   +DP++       D  E+++P +VDVDL LSA+ANA+R+
Sbjct: 445 LIL-------------------SDPYSANDSDEDDDEESEEPMIVDVDLALSAWANARRY 485

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           +DLKRSAAKK+QKT+ ++EKALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISS
Sbjct: 486 YDLKRSAAKKEQKTVDASEKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFYWFISS 545

Query: 310 ENYLGNVAWDAK---------------------------VVTN----------------- 325
           ENYL     DA+                           ++ N                 
Sbjct: 546 ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIRNPNADEIPPKTLLEAGTM 605

Query: 326 ------AWWVK---------ADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
                 AW  K         +DQVSKTAPTGE+L TGSFMIRGKKNF P C L MG+SFL
Sbjct: 606 AISYSVAWDAKVITNAYWVTSDQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSFL 665

Query: 371 FKLEESSISR 380
           FKLE+S + R
Sbjct: 666 FKLEDSFVQR 675


>gi|195151655|ref|XP_002016754.1| GL21904 [Drosophila persimilis]
 gi|194111811|gb|EDW33854.1| GL21904 [Drosophila persimilis]
          Length = 966

 Score =  343 bits (879), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 198/427 (46%), Positives = 259/427 (60%), Gaps = 92/427 (21%)

Query: 18  LHVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFP 74
           L+++    N  S+GY++  KE++P  +   +Y + N+EF P +           Q +   
Sbjct: 249 LNLIKDGNNGESKGYIIHVKEEKPIEDGKIEYFLRNIEFQPFLFA---------QFKDNE 299

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
              FESF  AVDEF+ST ES KID+K +QQER+ALKKL NVK+DH  RL  L + Q  DK
Sbjct: 300 FSMFESFLEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVKKDHAKRLEELTKVQDDDK 359

Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
           +KAELI +NQ  VD AI A++  IA+QL+W DI   VK+AQ + D VAS IKQLKL INH
Sbjct: 360 KKAELITSNQSLVDNAIRAVQSAIASQLTWPDIHELVKEAQTNGDVVASSIKQLKLEINH 419

Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSE--TKKPSLVDVDLDLSAYANAKRFFDL 252
           I+L+L                   +DP+   +E   +  ++VDVDL LSA+ANA+R++DL
Sbjct: 420 ISLIL-------------------SDPYVSQNEKDCEDLTVVDVDLALSAWANARRYYDL 460

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
           KRSAA+K+QKT+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSEN+
Sbjct: 461 KRSAAQKEQKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENF 520

Query: 313 LGNVAWDAK---------------------------VVTN-------------------- 325
           L     DA+                           V+ N                    
Sbjct: 521 LVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVVIRNTTGEDIPPKTLVEAGSMAIS 580

Query: 326 ---AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
              AW         WV +DQVSKTAPTGE+L TGSFMIRGKKNF P C L MG+S LFKL
Sbjct: 581 YSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKL 640

Query: 374 EESSISR 380
           EES ++R
Sbjct: 641 EESFVAR 647


>gi|242018711|ref|XP_002429817.1| Serologically defined colon cancer antigen, putative [Pediculus
           humanus corporis]
 gi|212514835|gb|EEB17079.1| Serologically defined colon cancer antigen, putative [Pediculus
           humanus corporis]
          Length = 1024

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 202/427 (47%), Positives = 260/427 (60%), Gaps = 81/427 (18%)

Query: 21  LVSNLNVS-SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
           +V N ++S S+GY++QK EKRP      D+ V N EFHP +          RQH   P  
Sbjct: 234 IVENASLSVSKGYIIQKVEKRPTESGMSDFHV-NTEFHPFLF---------RQHVKNPFN 283

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E E+F  AVD FFS+ ES KID+KA+ QE++ALKK+ENV+RDH  RL  L +TQ +D+ K
Sbjct: 284 ECETFLKAVDSFFSSLESQKIDMKAINQEKEALKKIENVRRDHNQRLQQLFETQELDRIK 343

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
           AELI  N   VD A+LAIR  IANQ+SW DI+  VK+ +   DPVAS IK+LKL+INHIT
Sbjct: 344 AELITTNLTLVDQAVLAIRTAIANQISWPDIDILVKEGKNAGDPVASSIKKLKLDINHIT 403

Query: 197 LLLR-VHKERSNIKDSWVLHHDNAD-PFADSSETKKPSLVDVD--LDLSAYANAKRFFDL 252
           L L   ++  S+  +       N D P     + K P ++DVD  LDL+A+ANA++++D+
Sbjct: 404 LQLSDPYRSDSSSSEEEEEEETNDDKPI----KIKVPKIIDVDIDLDLTAFANARKYYDM 459

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF--------Y 304
           KRSAAKKQQKTI+S +KALKSAEKKTKQ LK+++T+ NI K RK +WFEKF        Y
Sbjct: 460 KRSAAKKQQKTIESQDKALKSAEKKTKQALKEMKTIVNITKVRKTFWFEKFFWFISSENY 519

Query: 305 WFISSENYLGN------------------------------------------------- 315
             I+  + + N                                                 
Sbjct: 520 LVIAGRDMMQNELLVKRYMKSGDLYVHADIHGASSVIIKNPSNEPVPPKTLNEAGVMAIS 579

Query: 316 --VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
              AW+AKVVT+AWWV   QVSKTAPTGE+L TGSFMIRGKKN+ PP  L MG SFLFKL
Sbjct: 580 YSQAWEAKVVTSAWWVHNTQVSKTAPTGEYLGTGSFMIRGKKNYLPPANLIMGFSFLFKL 639

Query: 374 EESSISR 380
           +++S+SR
Sbjct: 640 DDNSLSR 646


>gi|195388566|ref|XP_002052950.1| GJ23608 [Drosophila virilis]
 gi|194151036|gb|EDW66470.1| GJ23608 [Drosophila virilis]
          Length = 966

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 198/415 (47%), Positives = 257/415 (61%), Gaps = 81/415 (19%)

Query: 28  SSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           + +G+++Q KE++P+S    D+   N+EFHP +           QH+  P +E+ +F  A
Sbjct: 251 NCKGFIIQIKEEKPSSTESTDHFYRNVEFHPYLFS---------QHKKMPFKEYNTFMEA 301

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDEFFST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q +DK+KAELI  NQ
Sbjct: 302 VDEFFSTQESQKIDMKTLQQEREALKKLSNVKNDHTRRLEELNKVQDLDKKKAELITCNQ 361

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
             VD AILAI+  IA+QLSW DI+  VK+AQ + D VA  IK+LKL INHI+LLL    +
Sbjct: 362 SLVDKAILAIQSAIASQLSWPDIQELVKEAQANGDIVARSIKKLKLEINHISLLLTDPYK 421

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
             N        + N +  AD+ ++    L+DVDL LSA+ANA R++DLKRSAA K++KTI
Sbjct: 422 CGN-------EYLNDENGADNDDS---LLIDVDLALSAWANACRYYDLKRSAALKEKKTI 471

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK--- 321
            +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKF+WF+SSENYL     DA+   
Sbjct: 472 DASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFFWFVSSENYLIIGGRDAQQNE 531

Query: 322 ------------------------VVTN-----------------------AWWVK---- 330
                                   ++ N                       AW  K    
Sbjct: 532 LIVKRYMRPKDVYVHADIQGASSVIIRNSTGGDIPPKTLLEAGTMAISYSVAWDAKVVTN 591

Query: 331 -----ADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                +DQVSKTAPTGE+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 592 SYWVYSDQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSILFKLEDSFIQR 646


>gi|357620683|gb|EHJ72794.1| hypothetical protein KGM_20428 [Danaus plexippus]
          Length = 1001

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 195/421 (46%), Positives = 257/421 (61%), Gaps = 84/421 (19%)

Query: 26  NVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           N  ++GY++QK+E RPN +   D+ + N EFHP++           Q++     E+E+F 
Sbjct: 244 NQMAKGYIIQKREDRPNQDGGPDFFLTNQEFHPLL---------YLQNKDQVYVEYETFD 294

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLK +Q ER+A+KKL+N++ DHE RLS LE+ QL D+  AE+I  
Sbjct: 295 RAVDEFYSALEGQKIDLKTIQVEREAMKKLQNIRTDHEKRLSNLEKVQLEDRRAAEMIAR 354

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVH 202
           N+  V+ A LAI+  IANQ+SW+DI+  VK AQ + DPVAS IKQLKLN NHITLLL   
Sbjct: 355 NEPLVEQARLAIQTAIANQMSWDDIKLLVKAAQDNKDPVASAIKQLKLNTNHITLLL--- 411

Query: 203 KERSNIKDSWVLHHDNADPFADSS-----ETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                 KD +    D+ D   D+      E  +P +VD+DL L+A+ANA+R++D KRSAA
Sbjct: 412 ------KDPYDDDDDDDDDDDDNDGGGDKERLEPMMVDIDLSLTAFANARRYYDQKRSAA 465

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---- 313
           KKQQKT++S +KALKSAEKKTKQTLK+ Q +++I+KAR+ YWFEKFYWFISS+NYL    
Sbjct: 466 KKQQKTLESADKALKSAEKKTKQTLKEAQAISSISKARRNYWFEKFYWFISSDNYLVIAG 525

Query: 314 ----------------------GNVAWDAKVVTN-----------------------AW- 327
                                  +V+  + VV                         AW 
Sbjct: 526 RDQQQNELLVKRYMRSTDVYVHADVSGASSVVIKCPSGPPPPRTLSEAGQAAVAYSVAWE 585

Query: 328 --------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
                   WV   QVSK+APTGE+L+TGSFMIRGKKN+  P  L  G SF+F+LE+SSI 
Sbjct: 586 AKVLTRAWWVHGHQVSKSAPTGEYLSTGSFMIRGKKNYLLPEHLQFGFSFMFRLEDSSID 645

Query: 380 R 380
           R
Sbjct: 646 R 646


>gi|348544245|ref|XP_003459592.1| PREDICTED: nuclear export mediator factor Nemf-like [Oreochromis
           niloticus]
          Length = 1074

 Score =  333 bits (855), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 197/430 (45%), Positives = 249/430 (57%), Gaps = 85/430 (19%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N S +GY++QK EK+P+      SE+ +  + EFHP +           QH   P  EF+
Sbjct: 242 NFSGKGYIIQKTEKKPSLTPGKPSEELLTYD-EFHPFLFA---------QHAKSPYLEFD 291

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           +F  AVDEFFS  ES KIDLKA+QQE+ ALKKLENVK+DHE RL AL Q Q VD+ K EL
Sbjct: 292 TFDKAVDEFFSKMESQKIDLKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDRIKGEL 351

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           +  N   VD A+  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NHIT+LL
Sbjct: 352 VEMNLPVVDRALQVVRSALANQVDWTEIGVLVKEAQAAGDPVACAIKELKLQTNHITMLL 411

Query: 200 R----------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
           +            +++  ++     + +         +  KP LVDVDL LSAYANAK++
Sbjct: 412 KNPYISEEDQEEEEKKEIVETKGKKNKNKEKGQNKKLQRNKPMLVDVDLGLSAYANAKKY 471

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           +D KRSA KK+QKTI++ +KA+KSAEKKT+QTLK+VQT+T I KARKVYWFEKF WFISS
Sbjct: 472 YDSKRSAEKKEQKTIEAADKAMKSAEKKTQQTLKEVQTVTTIQKARKVYWFEKFLWFISS 531

Query: 310 ENYL--------------------GNV-----------------------------AWDA 320
           ENYL                    G++                             A   
Sbjct: 532 ENYLVIAGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSGNPIPPRTLTEAGTM 591

Query: 321 KVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
            V  +A W          V   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG  FL
Sbjct: 592 AVCYSAAWDAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFGFL 651

Query: 371 FKLEESSISR 380
           FK+++ S+ R
Sbjct: 652 FKVDDQSVFR 661


>gi|334310399|ref|XP_001370312.2| PREDICTED: nuclear export mediator factor NEMF isoform 1
           [Monodelphis domestica]
          Length = 1094

 Score =  333 bits (853), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 201/433 (46%), Positives = 249/433 (57%), Gaps = 88/433 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N S +GY++QKKEK+P+      SED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFSGKGYIIQKKEKKPSLEPDKQSED-ILTYEEFHPFLFS---------QHSKCPYLEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKLEGQKIDLKALQQEKQALKKLDNVRKDHEHRLEALHQAQEIDKIKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N  +VD AI  +R  +ANQ+ W +I   VK+AQ   D VA+ I++LKL  NH+T+LL
Sbjct: 351 IEMNLHTVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDTVANAIRELKLQTNHVTMLL 410

Query: 200 R---VHKERSNIKDSWVLHHDNADP----------FADSSETKKPSLVDVDLDLSAYANA 246
           +   +  +     +  +   +  +P               +  KP LVDVDL LSAYANA
Sbjct: 411 KNPYLVSDEEEEDEINIEKEETEEPKGKKKKQKNKQLQKLQKNKPLLVDVDLSLSAYANA 470

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AA+K QKT+++ EKA KSAEKKTKQTLK+VQ +T I KARKVYWFEKF WF
Sbjct: 471 KKYYDHKRHAARKTQKTVEAAEKAFKSAEKKTKQTLKEVQMVTTIQKARKVYWFEKFLWF 530

Query: 307 ISSENYL--------------------GNVAWDAK-------VVTN-------------- 325
           ISSENYL                    G+V   A        V+ N              
Sbjct: 531 ISSENYLIIGGRDQQQNEMIVKRYLTPGDVYVHADLHGATSCVIKNPTGEPIPPRTLTEA 590

Query: 326 ---------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
                    AW         WV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG 
Sbjct: 591 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 650

Query: 368 SFLFKLEESSISR 380
           SFLFK++E  + R
Sbjct: 651 SFLFKVDEPCVWR 663


>gi|307173031|gb|EFN64173.1| Serologically defined colon cancer antigen 1-like protein
           [Camponotus floridanus]
          Length = 988

 Score =  330 bits (846), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 191/416 (45%), Positives = 248/416 (59%), Gaps = 92/416 (22%)

Query: 29  SQGYLVQKKEKR---PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +      ED+I AN+EFHP + +         Q++  P +EF+SF AAV
Sbjct: 242 SKGYIIQKKEAKMTQDGKEDFIFANIEFHPFLFE---------QYKNQPYKEFDSFDAAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLK +QQER+AL+KLE VK+DH+ RL  LE++Q +DK+KAELI  NQ 
Sbjct: 293 DEYFSTMEGQKLDLKVLQQEREALQKLERVKKDHDQRLVTLEKSQELDKQKAELISRNQI 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ+SW DI+  +K+AQ   DPVAS IKQLKL  NHITLLL      
Sbjct: 353 LVDNAILAIQSALANQMSWPDIQILLKEAQVIGDPVASAIKQLKLETNHITLLLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ DS E    KP L+D+DL  +A++NAK ++  K+SAA+K QKT
Sbjct: 408 --------------DPYEDSDEESELKPMLIDIDLAHTAFSNAKNYYSQKKSAARKHQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDA 320
           I+S  KALKSAEKKTKQTLK+VQT+  INK RK YWFEKFYWFI+SENYL   G      
Sbjct: 454 IESQGKALKSAEKKTKQTLKEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQN 513

Query: 321 KVVTNAW------WVKADQVSKTA-----PTGE--------------------------- 342
           +++   +      +V AD    ++     P+G+                           
Sbjct: 514 ELIVKRYLKAGDLYVHADLTGASSVVIKNPSGDPVPPKSLAEAGTMAVAYSIAWDSKVIA 573

Query: 343 ------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                             +LTTGSFMIRGKKN+    QL MG+  +F+LEESSI R
Sbjct: 574 SAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLTQSQLIMGLGVMFRLEESSIER 629


>gi|432938285|ref|XP_004082515.1| PREDICTED: nuclear export mediator factor Nemf-like [Oryzias
           latipes]
          Length = 1089

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 197/431 (45%), Positives = 251/431 (58%), Gaps = 87/431 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +G+++QK EK+P+      +E+ +  + EFHP +           QH   P  E +
Sbjct: 242 NFNGKGFIIQKSEKKPSVAPGKPAEELLTYD-EFHPFLFV---------QHAKSPFLELD 291

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEFFS  E  KID+KA+QQE+ ALKKLENVK+DHE RL AL Q Q VD+ K EL
Sbjct: 292 SFNKAVDEFFSKMEGQKIDMKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDRLKGEL 351

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           +  N   V+ A+  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL+ NHIT+LL
Sbjct: 352 VEINLAVVERALQVVRSALANQVDWAEIGHIVKEAQAAGDPVACAIKELKLHSNHITMLL 411

Query: 200 R------VHKERSNIKDSWVLHHDNADPFADSSETK-----KPSLVDVDLDLSAYANAKR 248
           +        +E   +KD+ V      +   D  + K     KP LVDVDL LSAYANAK+
Sbjct: 412 KNPYISEEEQEDEEMKDA-VEEKGKKNKNRDKGQKKKLQRNKPMLVDVDLGLSAYANAKK 470

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
           ++D KRSA KKQQKT+++ +KA+KSAEKKT++TLK+VQT+T I KARKVYWFEKF WFIS
Sbjct: 471 YYDHKRSAEKKQQKTLEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFIS 530

Query: 309 SENYL--------------------GNV-----------------------------AWD 319
           +ENYL                    G++                             A  
Sbjct: 531 AENYLVIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNPSGDPIPPRTLTEAGT 590

Query: 320 AKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
             V  +A W          V   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG  F
Sbjct: 591 MAVCYSAAWDAKIITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFGF 650

Query: 370 LFKLEESSISR 380
           LFK+EE S+ R
Sbjct: 651 LFKVEEQSVFR 661


>gi|149051344|gb|EDM03517.1| rCG61611 [Rattus norvegicus]
          Length = 899

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 198/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 75  NFNGKGYIIQKREVKPSLDANKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 124

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 125 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 184

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NHIT+LL
Sbjct: 185 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLL 244

Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
           R    + +E     D  + + D   P     + K          KP LVDVDL LSAYAN
Sbjct: 245 RNPYLLSEEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 304

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 305 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 364

Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
           FISSENYL   G      +++   +      +V AD    T+     PTG          
Sbjct: 365 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGDPIPPRTLTE 424

Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
                                              E+LTTGSFMIRGKKNF PP  L MG
Sbjct: 425 AGTMALCYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 484

Query: 367 ISFLFKLEESSISR 380
            SFLFK++ES + R
Sbjct: 485 FSFLFKVDESCVWR 498


>gi|281604208|ref|NP_001164057.1| serologically defined colon cancer antigen 1 [Rattus norvegicus]
          Length = 1065

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 198/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREVKPSLDANKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NHIT+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLL 410

Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
           R    + +E     D  + + D   P     + K          KP LVDVDL LSAYAN
Sbjct: 411 RNPYLLSEEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 470

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 471 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 530

Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
           FISSENYL   G      +++   +      +V AD    T+     PTG          
Sbjct: 531 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGDPIPPRTLTE 590

Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
                                              E+LTTGSFMIRGKKNF PP  L MG
Sbjct: 591 AGTMALCYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 650

Query: 367 ISFLFKLEESSISR 380
            SFLFK++ES + R
Sbjct: 651 FSFLFKVDESCVWR 664


>gi|148704665|gb|EDL36612.1| mCG3169, isoform CRA_a [Mus musculus]
          Length = 1083

 Score =  327 bits (838), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 196/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 260 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 309

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 310 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 369

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 370 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 429

Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
           R    + +E     D+ + + D   P     + K          KP LVDVDL LSAYAN
Sbjct: 430 RNPYLLSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 489

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 490 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 549

Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
           FISSENYL   G      +++   +      +V AD    T+     PTG          
Sbjct: 550 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 609

Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
                                              E+LTTGSFMIRGKKNF PP  L MG
Sbjct: 610 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 669

Query: 367 ISFLFKLEESSISR 380
            SFLFK++ES + R
Sbjct: 670 FSFLFKVDESCVWR 683


>gi|32130521|ref|NP_079717.2| nuclear export mediator factor Nemf [Mus musculus]
 gi|47606756|sp|Q8CCP0.2|NEMF_MOUSE RecName: Full=Nuclear export mediator factor Nemf; AltName:
           Full=Serologically defined colon cancer antigen 1
           homolog
          Length = 1064

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 196/434 (45%), Positives = 249/434 (57%), Gaps = 89/434 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410

Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
           R    + +E     D+ + + D   P     + K          KP LVDVDL LSAYAN
Sbjct: 411 RNPYLLSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 470

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 471 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 530

Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
           FISSENYL   G      +++   +      +V AD    T+     PTG          
Sbjct: 531 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 590

Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
                                              E+LTTGSFMIRGKKNF PP  L MG
Sbjct: 591 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 650

Query: 367 ISFLFKLEESSISR 380
            SFLFK++ES + R
Sbjct: 651 FSFLFKVDESCVWR 664


>gi|157116544|ref|XP_001658543.1| hypothetical protein AaeL_AAEL007639 [Aedes aegypti]
 gi|108876416|gb|EAT40641.1| AAEL007639-PA [Aedes aegypti]
          Length = 995

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 193/439 (43%), Positives = 257/439 (58%), Gaps = 75/439 (17%)

Query: 5   EDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRP----NSEDYIVANMEFHPMILQP 60
           ED+++ +        +L   +   S+G+++QKKE +P      E++   N+E+HP +   
Sbjct: 246 EDMTALMCAINDAETMLRKAMKEPSRGFIIQKKELKPAKDKEQEEFYFTNLEYHPFLYN- 304

Query: 61  GSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
                   Q++  P++EF+SF AAVDEF+ST E  KIDLKA  QER+ALKKL NV+ DH 
Sbjct: 305 --------QYKEDPVKEFDSFTAAVDEFYSTLEGQKIDLKAFAQEREALKKLSNVRTDHA 356

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            RL  L + QL D++KAELI  NQ  VD+AILA++  +A+Q+SW DI+  VK AQ +NDP
Sbjct: 357 KRLEDLTKAQLEDRKKAELITRNQNLVDSAILAVQSALASQMSWSDIQDLVKAAQANNDP 416

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VAS IKQLKL INHI+L+L   K+     D      D+ +   D     +P +VDVDL +
Sbjct: 417 VASCIKQLKLEINHISLML---KDPYGALDEDFEDDDDEEEREDGEGKLEPMVVDVDLAM 473

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +A+ANA+R++D +R AA+K+QKTI+S+ KALK+AEKKT QTLKDV+T T I+KARKVYWF
Sbjct: 474 TAFANARRYYDQRRFAARKEQKTIESSSKALKNAEKKTMQTLKDVRTQTTISKARKVYWF 533

Query: 301 EKFYWFISSENYLGNVAWD--------------AKVVTNAWWVKADQVSKTAPTGE---- 342
           EKFYWFISSENYL     D              + +  +A    A  V    P+GE    
Sbjct: 534 EKFYWFISSENYLVIGGRDQQQNELIVKRYMRPSDIYVHAEIQGASSVIIKNPSGEDIPP 593

Query: 343 --FLTTGS---------------------------------------FMIRGKKNFFPPC 361
              L  G+                                       FMIRGKKNF PPC
Sbjct: 594 KTLLEAGTMAISYSVAWDAKVVTSAYWVKSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPC 653

Query: 362 QLAMGISFLFKLEESSISR 380
            L +G+SF+FKLEESSI R
Sbjct: 654 HLVLGLSFMFKLEESSIER 672


>gi|403277932|ref|XP_003930596.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 1077

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 196/435 (45%), Positives = 246/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKRETKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|354506443|ref|XP_003515270.1| PREDICTED: nuclear export mediator factor Nemf, partial [Cricetulus
           griseus]
          Length = 699

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 197/434 (45%), Positives = 248/434 (57%), Gaps = 89/434 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N   +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 70  NFHGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 119

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 120 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 179

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LL
Sbjct: 180 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLL 239

Query: 200 R----VHKERSNIKDSWVLHHD----------NADPFADSSETKKPSLVDVDLDLSAYAN 245
           R    + +E  +  D+ V   D            +      +  KP LVDVDL LSAYAN
Sbjct: 240 RNPYLLSEEEDDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 299

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 300 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 359

Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
           FISSENYL   G      +++   +      +V AD    T+     PTG          
Sbjct: 360 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 419

Query: 342 -----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMG 366
                                              E+LTTGSFMIRGKKNF PP  L MG
Sbjct: 420 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 479

Query: 367 ISFLFKLEESSISR 380
            SFLFK++ES I R
Sbjct: 480 FSFLFKVDESCIWR 493


>gi|125858778|gb|AAI29514.1| LOC733300 protein [Xenopus laevis]
          Length = 906

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 199/439 (45%), Positives = 248/439 (56%), Gaps = 103/439 (23%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +G+++QK+EK+P+      SED I  N EFHP +           QH      E +
Sbjct: 241 NFNGKGFIIQKREKKPSLEPDKASED-IFTNEEFHPFLFA---------QHANSTYIELD 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF   VDEFFS  E  KID+KA+QQE+ ALKKL+NV++DHE RL +L+  Q  DK K EL
Sbjct: 291 SFNKTVDEFFSKLEGQKIDIKALQQEKQALKKLDNVRKDHEHRLESLQYAQDADKAKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NHIT++L
Sbjct: 351 IEMNLDIVDRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMML 410

Query: 200 RVHKERSNIKDSWVLHHDNADPFADSS-------------------ETKKPSLVDVDLDL 240
                    K+ +VL  + ++   D                     +  KP LVDVDL L
Sbjct: 411 ---------KNPYVLSEEESEDEEDEKEEEPKGKKKKAKNKQPKKVQKNKPVLVDVDLSL 461

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           SAYANAK+++D KR AAKK QKTI++ EKA KSAEKKTKQTLK+VQT++ I KARKVYWF
Sbjct: 462 SAYANAKKYYDHKRHAAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWF 521

Query: 301 EKFYWFISSENYL--------------------GNVAWDAK-------VVTN-------- 325
           EKF WFISSENYL                    G+V   A        V+ N        
Sbjct: 522 EKFLWFISSENYLIIAGRDQQQNELIVKRYLNPGDVYVHADLHGATSCVIKNPTGEPVPP 581

Query: 326 ---------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
                          AW         WV  +QVSKTAPTGE+LTTGSFMIRGKKNF PP 
Sbjct: 582 RTLTEAGTMAVCYSAAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPS 641

Query: 362 QLAMGISFLFKLEESSISR 380
            L MG  FLFK++E+ + R
Sbjct: 642 YLMMGFGFLFKVDETCVWR 660


>gi|326921280|ref|XP_003206889.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Meleagris gallopavo]
          Length = 1080

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 200/442 (45%), Positives = 247/442 (55%), Gaps = 105/442 (23%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           + + +GY++QKKEK+P+ E     + I    EFHP +           QH   P  EF+S
Sbjct: 253 DFNGKGYIIQKKEKKPSLEPDKPAEDIYTYEEFHPFLFS---------QHSKCPYLEFDS 303

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  A DEF+S  E  KIDLKA+QQE+ ALKKLENV+RDHE RL AL+Q Q VDK K ELI
Sbjct: 304 FNKAADEFYSKLEGQKIDLKALQQEKQALKKLENVRRDHEQRLEALQQAQEVDKIKGELI 363

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N E V+ AI  +R  +ANQ+ W +I A VK+AQ   DPVA+ IK+LKL  NHIT+LLR
Sbjct: 364 EMNLEIVNRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLR 423

Query: 201 VHKERSNIKDSWVL-----HHDNAD------------------PFADSSETKKPSLVDVD 237
                    + +VL       ++AD                        +  KPSLVDVD
Sbjct: 424 ---------NPYVLSEEEEEGEDADLEKEETEEPKGKKKKNKNKQLKKPQKNKPSLVDVD 474

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
           L LSAYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T I KARKV
Sbjct: 475 LSLSAYANAKKYYDHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKV 534

Query: 298 YWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG-- 341
           YWFEKF WFISSENYL   G      +++   +      +V AD    T+     PTG  
Sbjct: 535 YWFEKFLWFISSENYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPTGEP 594

Query: 342 -------------------------------------------EFLTTGSFMIRGKKNFF 358
                                                      E+LTTGSFMIRGKKNF 
Sbjct: 595 IPPRTLTEAGTMALCYSAAWDARVVTSAWWVSHNQVSKTAPTGEYLTTGSFMIRGKKNFL 654

Query: 359 PPCQLAMGISFLFKLEESSISR 380
           PP  L MG SFLFK++ES + R
Sbjct: 655 PPSYLMMGFSFLFKVDESCVWR 676


>gi|348572143|ref|XP_003471853.1| PREDICTED: nuclear export mediator factor NEMF-like [Cavia
           porcellus]
          Length = 1076

 Score =  324 bits (831), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 196/435 (45%), Positives = 246/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPAEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHETRL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHETRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETELPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEVVPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|73962860|ref|XP_851229.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Canis
           lupus familiaris]
          Length = 1077

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 193/435 (44%), Positives = 244/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREVKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           +  E++  +          +      +  +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT---------- 340
           WFISSENYL     D +         + T   +V AD    T+     PT          
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 341 -----------------------------------GEFLTTGSFMIRGKKNFFPPCQLAM 365
                                              GE+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|417405795|gb|JAA49597.1| Putative rna-binding protein [Desmodus rotundus]
          Length = 1081

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 194/435 (44%), Positives = 243/435 (55%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREVKPSLEVDKPAEEILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+N ++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNFRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N   VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 352 EMNLPVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           ++ E+S  +          +      +  +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGEINVEKSETEPPKGKKKKQKNKQLQRPQKNRPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +V+   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEVIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARIITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK+EES   R
Sbjct: 652 GFSFLFKVEESCAWR 666


>gi|296214948|ref|XP_002753922.1| PREDICTED: nuclear export mediator factor NEMF isoform 1
           [Callithrix jacchus]
          Length = 1077

 Score =  323 bits (828), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+QTQ +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA +SAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|449504623|ref|XP_002200475.2| PREDICTED: nuclear export mediator factor Nemf [Taeniopygia
           guttata]
          Length = 1213

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 196/433 (45%), Positives = 243/433 (56%), Gaps = 87/433 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+EK+P+ E     + I    EFHP +           QH   P  EF+S
Sbjct: 379 NFSGKGYVIQKREKKPSLEPDKPAEDIYTYEEFHPFLFS---------QHSKCPYLEFDS 429

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  A DEF+S  E  KIDLKA+QQE+ ALKKLENV+RDHE RL AL+Q Q  DK K ELI
Sbjct: 430 FNKATDEFYSKLEGQKIDLKALQQEKQALKKLENVRRDHEHRLEALQQAQEADKLKGELI 489

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N   VD AI  +R  +ANQ+ W +I A VK+AQ   DPVA+ IK+LKL  NHIT+LLR
Sbjct: 490 EMNLAVVDRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVATAIKELKLQTNHITMLLR 549

Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
                            E+   ++       N        +  KPSLVDVDL+LSAYANA
Sbjct: 550 NPYVLSEEEEEEDDADIEKEETEEPKGKKKKNKTKQLKKPQKNKPSLVDVDLNLSAYANA 609

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AAKK QKT+++ EKA KSAEKKTKQTL++VQT+T I KARKVYWFEKF WF
Sbjct: 610 KKYYDHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLREVQTVTTIQKARKVYWFEKFLWF 669

Query: 307 ISSENYLGNVAWDAK---VVTNAW------WVKADQVSKTA-----PTGE---------- 342
           ISSENYL     D +   ++   +      +V AD    T+     P+GE          
Sbjct: 670 ISSENYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPSGEPIPPRTLTEA 729

Query: 343 -----------------------------------FLTTGSFMIRGKKNFFPPCQLAMGI 367
                                              +LTTGSFMIRGKKNF PP  L MG 
Sbjct: 730 GTMALCYSAAWDARVVTSAWWVSHSQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 789

Query: 368 SFLFKLEESSISR 380
           SFLFK++ES + R
Sbjct: 790 SFLFKVDESCVWR 802


>gi|431893718|gb|ELK03539.1| Serologically defined colon cancer antigen 1 [Pteropus alecto]
          Length = 1077

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 192/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           ++ E+   +          +      +  +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDINVEKIETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|301773240|ref|XP_002922036.1| PREDICTED: serologically defined colon cancer antigen 1-like
           [Ailuropoda melanoleuca]
          Length = 1077

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 193/435 (44%), Positives = 243/435 (55%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I    EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPTEDIFTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           +  E++  +          +      +  +P LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT---------- 340
           WFISSENYL     D +         + T   +V AD    T+     PT          
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 341 -----------------------------------GEFLTTGSFMIRGKKNFFPPCQLAM 365
                                              GE+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|291403822|ref|XP_002718277.1| PREDICTED: serologically defined colon cancer antigen 1
           [Oryctolagus cuniculus]
          Length = 1076

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 197/436 (45%), Positives = 248/436 (56%), Gaps = 91/436 (20%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N   +GY++QK+E +P+      SED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFRGKGYIIQKREIKPSLEVDKPSED-ILTYEEFHPFLFS---------QHSQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL+ NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLHTNHVTMLL 410

Query: 200 R-----VHKERSNIKDSWVLHHDNADPFADSSETK-----------KPSLVDVDLDLSAY 243
           R       +E  ++     +  +  +P     + +           KP LVDVDL LSAY
Sbjct: 411 RNPYLLSEEEDDDVDGDVTVEKNENEPLKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAY 470

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
           ANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 471 ANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKF 530

Query: 304 YWFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT--------- 340
            WFISSENYL     D +         + T   +V AD    T+     PT         
Sbjct: 531 LWFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTL 590

Query: 341 ------------------------------------GEFLTTGSFMIRGKKNFFPPCQLA 364
                                               GE+LTTGSFMIRGKKNF PP  L 
Sbjct: 591 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 650

Query: 365 MGISFLFKLEESSISR 380
           MG SFLFK++ES + R
Sbjct: 651 MGFSFLFKVDESCVWR 666


>gi|281343421|gb|EFB19005.1| hypothetical protein PANDA_010972 [Ailuropoda melanoleuca]
          Length = 1058

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 193/435 (44%), Positives = 243/435 (55%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I    EFHP +           QH   P  EFES
Sbjct: 222 NFSGKGYIIQKREIKPSLEVDKPTEDIFTYEEFHPFLFS---------QHSQCPYIEFES 272

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 273 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 332

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 333 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 392

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           +  E++  +          +      +  +P LVDVDL LSAYA
Sbjct: 393 NPYLLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 452

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 453 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 512

Query: 305 WFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT---------- 340
           WFISSENYL     D +         + T   +V AD    T+     PT          
Sbjct: 513 WFISSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 572

Query: 341 -----------------------------------GEFLTTGSFMIRGKKNFFPPCQLAM 365
                                              GE+LTTGSFMIRGKKNF PP  L M
Sbjct: 573 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 632

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 633 GFSFLFKVDESCVWR 647


>gi|269849764|sp|O60524.4|NEMF_HUMAN RecName: Full=Nuclear export mediator factor NEMF; AltName:
           Full=Antigen NY-CO-1; AltName: Full=Serologically
           defined colon cancer antigen 1
          Length = 1076

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|426376840|ref|XP_004055190.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Gorilla
           gorilla gorilla]
          Length = 1077

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|397523542|ref|XP_003831788.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
           paniscus]
          Length = 1076

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|355693257|gb|EHH27860.1| hypothetical protein EGK_18167 [Macaca mulatta]
          Length = 1077

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|395745874|ref|XP_002824790.2| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Pongo abelii]
          Length = 1061

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 220 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 270

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 271 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 330

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 331 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 390

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +                 +  KP LVDVDL LSAYA
Sbjct: 391 NPYLLSEEEDDDVDDDVNVEKNETEPPKGKKKKQKSKQLQKPQKNKPLLVDVDLSLSAYA 450

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 451 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 510

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 511 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 570

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 571 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 630

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 631 GFSFLFKVDESCVWR 645


>gi|355778566|gb|EHH63602.1| hypothetical protein EGM_16603 [Macaca fascicularis]
 gi|380817886|gb|AFE80817.1| nuclear export mediator factor NEMF [Macaca mulatta]
 gi|383422753|gb|AFH34590.1| nuclear export mediator factor NEMF [Macaca mulatta]
 gi|384950256|gb|AFI38733.1| nuclear export mediator factor NEMF [Macaca mulatta]
          Length = 1077

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 246/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|344273431|ref|XP_003408525.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor
           NEMF-like [Loxodonta africana]
          Length = 1000

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 193/436 (44%), Positives = 246/436 (56%), Gaps = 89/436 (20%)

Query: 25  LNVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           LN S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFE
Sbjct: 240 LNFSGKGYIIQKREVKPSLEIDKPTEDILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I + VK+AQ   DPVA+ IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGSIVKEAQAQGDPVANAIKELKLQTNHVTMLL 410

Query: 200 R----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           R                +  E++  +          +      +  KP  VDVDL LSAY
Sbjct: 411 RNPYLLSEEEDDDVDGDISIEKNETEPLKGKKKKQKNKQLQKPQKNKPLPVDVDLSLSAY 470

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
           ANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF
Sbjct: 471 ANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKF 530

Query: 304 YWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG-------- 341
            WFISSENYL   G      +++   +      +V AD    T+     PTG        
Sbjct: 531 LWFISSENYLIIGGRDQQQNEIIVKRYLTAGDIYVHADLHGATSCVIKNPTGEPIPPRTL 590

Query: 342 -------------------------------------EFLTTGSFMIRGKKNFFPPCQLA 364
                                                E+LTTGSFMIRGKKNF PP  L 
Sbjct: 591 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 650

Query: 365 MGISFLFKLEESSISR 380
           MG SFLFK++ES + R
Sbjct: 651 MGFSFLFKVDESCVWR 666


>gi|32130516|ref|NP_004704.2| nuclear export mediator factor NEMF [Homo sapiens]
 gi|119586148|gb|EAW65744.1| serologically defined colon cancer antigen 1, isoform CRA_d [Homo
           sapiens]
 gi|148922399|gb|AAI46282.1| Serologically defined colon cancer antigen 1 [synthetic construct]
 gi|151556560|gb|AAI48733.1| Serologically defined colon cancer antigen 1 [synthetic construct]
          Length = 1076

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|55640675|ref|XP_509934.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
           troglodytes]
 gi|410223614|gb|JAA09026.1| nuclear export mediator factor [Pan troglodytes]
 gi|410263654|gb|JAA19793.1| nuclear export mediator factor [Pan troglodytes]
 gi|410263656|gb|JAA19794.1| nuclear export mediator factor [Pan troglodytes]
 gi|410299008|gb|JAA28104.1| nuclear export mediator factor [Pan troglodytes]
 gi|410354861|gb|JAA44034.1| nuclear export mediator factor [Pan troglodytes]
          Length = 1076

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I    EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDIFTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|119586150|gb|EAW65746.1| serologically defined colon cancer antigen 1, isoform CRA_f [Homo
           sapiens]
          Length = 1010

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|402876104|ref|XP_003901819.1| PREDICTED: nuclear export mediator factor NEMF [Papio anubis]
          Length = 1048

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 194/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 212 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 262

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 263 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 322

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T++LR
Sbjct: 323 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMMLR 382

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V  E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 383 NPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 442

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 443 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 502

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 503 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 562

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 563 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 622

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 623 GFSFLFKVDESCVWR 637


>gi|426233096|ref|XP_004010553.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Ovis
           aries]
          Length = 1076

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           + +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFESF 
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR  
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413

Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
                         +  E++  +          +      +  +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
           ISSENYL   G      +++   +      +V AD    T+     PTG           
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593

Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
                                             E+LTTGSFMIRGKKNF PP  L MG 
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653

Query: 368 SFLFKLEESSISR 380
           SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666


>gi|345306303|ref|XP_001515044.2| PREDICTED: nuclear export mediator factor NEMF [Ornithorhynchus
           anatinus]
          Length = 1076

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 193/432 (44%), Positives = 244/432 (56%), Gaps = 86/432 (19%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+EK+P+ E     + I+   EFHP +           QH  +P  EFES
Sbjct: 243 NFSGKGYIIQKREKKPSLEPDKPAEDILTYEEFHPFLFS---------QHSKYPYVEFES 293

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKLENV++DHE RL AL Q Q +DK K ELI
Sbjct: 294 FDKAVDEFYSKLEGQKIDLKALQQEKQALKKLENVRKDHEHRLEALHQAQEIDKVKGELI 353

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LL+
Sbjct: 354 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLK 413

Query: 201 -------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAK 247
                           E+   ++         D      +  KP +VDVDL LSAYANAK
Sbjct: 414 NPYVMSEEEDDDGEDIEKEETEEPKGKKKKQKDKQLKKPQKNKPLVVDVDLSLSAYANAK 473

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFI 307
           +++D KR AA+K QKT+++ EKA KSAEKKTKQTLK+VQT+T I KARKVYWFEKF WFI
Sbjct: 474 KYYDHKRHAARKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFI 533

Query: 308 SSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG------------ 341
           SSENYL   G      +++   +      +V AD    T+     PTG            
Sbjct: 534 SSENYLIIGGRDQQQNEMIVKRYLNSGDIYVHADLHGATSCVIKNPTGEAIPPRTLTEAG 593

Query: 342 ---------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGIS 368
                                            E+LTTGSFMIRGKKNF PP  L MG S
Sbjct: 594 TMALCYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 653

Query: 369 FLFKLEESSISR 380
           FLFK+EE+ + R
Sbjct: 654 FLFKVEETCVWR 665


>gi|119586145|gb|EAW65741.1| serologically defined colon cancer antigen 1, isoform CRA_a [Homo
           sapiens]
          Length = 828

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 195/435 (44%), Positives = 245/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 591

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 592 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 651

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 652 GFSFLFKVDESCVWR 666


>gi|440907236|gb|ELR57405.1| Serologically defined colon cancer antigen 1 [Bos grunniens mutus]
          Length = 1077

 Score =  320 bits (821), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           + +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFESF 
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR  
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413

Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
                         +  E++  +          +      +  +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
           ISSENYL   G      +++   +      +V AD    T+     PTG           
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593

Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
                                             E+LTTGSFMIRGKKNF PP  L MG 
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653

Query: 368 SFLFKLEESSISR 380
           SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666


>gi|329664770|ref|NP_001192434.1| nuclear export mediator factor NEMF [Bos taurus]
          Length = 1076

 Score =  320 bits (821), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           + +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFESF 
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR  
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413

Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
                         +  E++  +          +      +  +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
           ISSENYL   G      +++   +      +V AD    T+     PTG           
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593

Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
                                             E+LTTGSFMIRGKKNF PP  L MG 
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653

Query: 368 SFLFKLEESSISR 380
           SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666


>gi|356640194|ref|NP_001239258.1| serologically defined colon cancer antigen 1 [Gallus gallus]
          Length = 1071

 Score =  320 bits (821), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 199/442 (45%), Positives = 245/442 (55%), Gaps = 105/442 (23%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           + + +GY++QKKEK+P+ E     + I    EFHP +           QH   P  EF+S
Sbjct: 242 DFNGKGYIIQKKEKKPSLEPDKPAEDIYTYEEFHPFLFS---------QHSKCPYLEFDS 292

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  A DEF+S  E  KIDLKA+QQE+ ALKKLENV+RDHE RL AL+Q Q VDK K ELI
Sbjct: 293 FNKAADEFYSKLEGQKIDLKALQQEKQALKKLENVRRDHEQRLEALQQAQEVDKIKGELI 352

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N E V  AI  +R  +ANQ+ W +I A VK+AQ   DPVA+ IK+LKL  NHIT+LLR
Sbjct: 353 EMNLEIVSRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLR 412

Query: 201 VHKERSNIKDSWVL-----HHDNAD------------------PFADSSETKKPSLVDVD 237
                    + +VL       ++AD                        +  KPSLVDVD
Sbjct: 413 ---------NPYVLSEEEEEGEDADLEKEETEEPKGKKKKNKSKQLKKPQKNKPSLVDVD 463

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
           L LSAYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T I KARKV
Sbjct: 464 LSLSAYANAKKYYDHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKV 523

Query: 298 YWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG-- 341
           YWFEKF WFISSENYL   G      +++   +      +V AD    T+     PTG  
Sbjct: 524 YWFEKFLWFISSENYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPTGEP 583

Query: 342 -------------------------------------------EFLTTGSFMIRGKKNFF 358
                                                      E+LTTGSFMIRGKKNF 
Sbjct: 584 IPPRTLTEAGTMALCYSAAWDARVVTSAWWVSHNQVSKTAPTGEYLTTGSFMIRGKKNFL 643

Query: 359 PPCQLAMGISFLFKLEESSISR 380
            P  L MG SFLFK++ES + R
Sbjct: 644 QPSYLMMGFSFLFKVDESCVWR 665


>gi|296483277|tpg|DAA25392.1| TPA: hypothetical protein BOS_10863 [Bos taurus]
          Length = 1076

 Score =  320 bits (820), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           + +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFESF 
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR  
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413

Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
                         +  E++  +          +      +  +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
           ISSENYL   G      +++   +      +V AD    T+     PTG           
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593

Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
                                             E+LTTGSFMIRGKKNF PP  L MG 
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGF 653

Query: 368 SFLFKLEESSISR 380
           SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666


>gi|194375658|dbj|BAG56774.1| unnamed protein product [Homo sapiens]
          Length = 999

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 194/435 (44%), Positives = 244/435 (56%), Gaps = 89/435 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 199 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 249

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 250 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 309

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 310 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 369

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 370 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 429

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++  KA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 430 NAKKYYDHKRYAAKKTQKTVEAAGKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 489

Query: 305 WFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG--------- 341
           WFISSENYL   G      +++   +      +V AD    T+     PTG         
Sbjct: 490 WFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLT 549

Query: 342 ------------------------------------EFLTTGSFMIRGKKNFFPPCQLAM 365
                                               E+LTTGSFMIRGKKNF PP  L M
Sbjct: 550 EAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 609

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++ES + R
Sbjct: 610 GFSFLFKVDESCVWR 624


>gi|410898599|ref|XP_003962785.1| PREDICTED: nuclear export mediator factor Nemf-like [Takifugu
           rubripes]
          Length = 1029

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 193/424 (45%), Positives = 246/424 (58%), Gaps = 79/424 (18%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK EK+P+      SE+ +  + EFHP +           QH   P  EF+
Sbjct: 242 NFTGKGYIIQKSEKKPSVTPGKPSEELLTYD-EFHPFLFA---------QHSKSPYLEFD 291

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEFFS  ES KID+KA+Q E+ A+KKLENVK+DHE RL AL Q Q +D+ K EL
Sbjct: 292 SFDKAVDEFFSKMESQKIDMKALQLEKHAMKKLENVKKDHEQRLEALHQAQEIDRIKGEL 351

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N   V+ A+  +   +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NHITLLL
Sbjct: 352 IEMNLAIVERALQVVCSALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHITLLL 411

Query: 200 R--VHKERSNIKDSWV--LHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
           +     E    +D  V      N +  +   +  KP LVDVDL LSAYANAK+++D KRS
Sbjct: 412 KNPYVSEDDEQEDDVVEETGRKNKNKKSKKFQKNKPMLVDVDLSLSAYANAKKYYDNKRS 471

Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-- 313
           A +K+ KTI++ +KA+KSAEKKT++TLK+VQT+T I KARKVYWFEKF WFIS+ENYL  
Sbjct: 472 AKRKEFKTIEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFISAENYLVI 531

Query: 314 ------------------GNV-----------------------------AWDAKVVTNA 326
                             G++                             A    V  +A
Sbjct: 532 AGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSGDPVPPRTLTEAGTMAVCYSA 591

Query: 327 WW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
            W          V   QVSKTAPTGE+LTTGSFMIRGKKN+ PP  L MG  FLFK++E 
Sbjct: 592 AWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFGFLFKVDEH 651

Query: 377 SISR 380
           S+ R
Sbjct: 652 SVFR 655


>gi|194379038|dbj|BAG58070.1| unnamed protein product [Homo sapiens]
          Length = 782

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 194/436 (44%), Positives = 245/436 (56%), Gaps = 90/436 (20%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N+  +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 12  NLKLKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 62

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 63  FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 122

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 123 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 182

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 183 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 242

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 243 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 302

Query: 305 WFISSENYL---GNVAWDAKVVTNAW-------WVKADQVSKTA-----PTG-------- 341
           WFISSENYL   G      +++   +       +V AD    T+     PTG        
Sbjct: 303 WFISSENYLIIGGRDQQQNEIIVKRYLTPVGDIYVHADLHGATSCVIKNPTGEPIPPRTL 362

Query: 342 -------------------------------------EFLTTGSFMIRGKKNFFPPCQLA 364
                                                E+LTTGSFMIRGKKNF PP  L 
Sbjct: 363 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 422

Query: 365 MGISFLFKLEESSISR 380
           MG SFLFK++ES + R
Sbjct: 423 MGFSFLFKVDESCVWR 438


>gi|311245467|ref|XP_001924665.2| PREDICTED: nuclear export mediator factor NEMF [Sus scrofa]
          Length = 1076

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 244/433 (56%), Gaps = 89/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSEDY-----IVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           + +GY++QK+E +P+ E       I+   EFHP +           QH   P  EFESF 
Sbjct: 243 NGKGYIIQKREVKPSLEVDKPTVDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR  
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNP 413

Query: 201 --------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
                         ++ E++  +          +      +  +P LVDVDL LSAYANA
Sbjct: 414 YLLSEEEDDDVDGDINTEKNESEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANA 473

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           K+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 307 ISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG----------- 341
           ISSENYL   G      +++   +      +V AD    T+     PTG           
Sbjct: 534 ISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEA 593

Query: 342 ----------------------------------EFLTTGSFMIRGKKNFFPPCQLAMGI 367
                                             E+LTTGSFMIRGKKNF PP  L MG 
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 368 SFLFKLEESSISR 380
           SFLFK++ES + R
Sbjct: 654 SFLFKVDESCVWR 666


>gi|301617501|ref|XP_002938173.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Xenopus (Silurana) tropicalis]
          Length = 951

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 194/427 (45%), Positives = 242/427 (56%), Gaps = 85/427 (19%)

Query: 29  SQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
            +G+++QK+EK+P+      SED I  N EFHP +           QH      E +SF 
Sbjct: 244 GKGFIIQKREKKPSLEPDKPSED-IFTNEEFHPFLFA---------QHCNNTYIELDSFN 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEFFS  E  +IDLKA+QQE+ ALKKLENV++DHE RL +L+  Q  DK K ELI  
Sbjct: 294 KAVDEFFSKMEGQRIDLKALQQEKQALKKLENVRKDHEERLESLQHAQDADKAKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W++I   VK+AQ   D VA  IK+LKL  NHIT+LL+  
Sbjct: 354 NLDIVDRAIQVVRSALANQIDWKEIGLIVKEAQIQGDSVALAIKELKLQTNHITMLLKNP 413

Query: 201 --------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDL 252
                     +E    ++       + +      +  KP LVDVDL LSAYANAK+++D 
Sbjct: 414 YTLSEEGSEDEEEEKEEEPKGKKKKSKNKQPKKVQKNKPVLVDVDLSLSAYANAKKYYDH 473

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
           KR AAKK QKTI++ EKA KSAEKKTKQTLK+VQT++ I KARKVYWFEKF WFISSENY
Sbjct: 474 KRHAAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWFEKFLWFISSENY 533

Query: 313 L--------------------GNV-----------------------------AWDAKVV 323
           L                    G++                             A    V 
Sbjct: 534 LVIAGRDQQQNELIVKRYLNPGDLYVHADLHGATSCVIKNPTGEPVPPRTLTEAGTMAVC 593

Query: 324 TNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
            +A W          V  +QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG  FLFK+
Sbjct: 594 YSAAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFGFLFKV 653

Query: 374 EESSISR 380
           +E  + R
Sbjct: 654 DEPCVWR 660


>gi|410962212|ref|XP_003987668.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Felis catus]
          Length = 1080

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 193/438 (44%), Positives = 243/438 (55%), Gaps = 92/438 (21%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQ---ERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F  AVDEF+S  E  KIDLKA+QQ    + ALKKL+NV++DHE RL AL+Q Q +DK K 
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQVWYXKQALKKLDNVRKDHENRLEALQQAQEIDKLKG 351

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           ELI  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+
Sbjct: 352 ELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTM 411

Query: 198 LLR----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
           LLR                +  E++  +          +      +  +P LVDVDL LS
Sbjct: 412 LLRNPYLLSEEEDDDVDGDITVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLS 471

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFE 301
           AYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFE
Sbjct: 472 AYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFE 531

Query: 302 KFYWFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PT------- 340
           KF WF+SSENYL     D +         + T   +V AD    T+     PT       
Sbjct: 532 KFLWFVSSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCVIKNPTGEPIPPR 591

Query: 341 --------------------------------------GEFLTTGSFMIRGKKNFFPPCQ 362
                                                 GE+LTTGSFMIRGKKNF PP  
Sbjct: 592 TLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSY 651

Query: 363 LAMGISFLFKLEESSISR 380
           L MG SFLFK++ES I R
Sbjct: 652 LMMGFSFLFKVDESCIWR 669


>gi|194742419|ref|XP_001953700.1| GF17891 [Drosophila ananassae]
 gi|190626737|gb|EDV42261.1| GF17891 [Drosophila ananassae]
          Length = 999

 Score =  317 bits (813), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 188/427 (44%), Positives = 247/427 (57%), Gaps = 94/427 (22%)

Query: 20  VLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
           +L    +  ++GY+VQ KE++P      ++   N+EFHP             Q + F   
Sbjct: 269 LLTEGASGKTKGYIVQVKEEKPTENGKVEFFFRNIEFHPY---------QFVQFKDFECA 319

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
            FESF  AVDEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q  D++K
Sbjct: 320 TFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDEDRKK 379

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
           AELI +NQ  VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+
Sbjct: 380 AELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHIS 439

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSS----ETKKPSLVDVDLDLSAYANAKRFFDL 252
           L+L                   +DP+ ++     +T + ++VDVDL LSA+ANA+R++DL
Sbjct: 440 LIL-------------------SDPYGENEDEDLDTPEVTVVDVDLALSAWANARRYYDL 480

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
           KRSAA+K++KT+ +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENY
Sbjct: 481 KRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENY 540

Query: 313 LGNVAWDAK--------------VVTNAWWVKADQVSKTAPTG----------------- 341
           L     DA+              +  +A    A  V    PTG                 
Sbjct: 541 LVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIRNPTGEEIPPKTLLEAGSMAIS 600

Query: 342 ----------------------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
                                       E+L TGSFMIRGKKNF P C L MG+S LFKL
Sbjct: 601 YSVAWDAKVVTNSYWVTSEQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLIMGLSLLFKL 660

Query: 374 EESSISR 380
           E+S I+R
Sbjct: 661 EDSFIAR 667


>gi|189239405|ref|XP_001813943.1| PREDICTED: similar to CG11847 CG11847-PA [Tribolium castaneum]
 gi|270010510|gb|EFA06958.1| hypothetical protein TcasGA2_TC009916 [Tribolium castaneum]
          Length = 972

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 191/424 (45%), Positives = 244/424 (57%), Gaps = 101/424 (23%)

Query: 30  QGYLVQKKEKRPNSED-----YIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           +GY++QKKE+R  S D     Y  +N EFHP++ +         QH+    +EF SF +A
Sbjct: 243 KGYIIQKKEERVVSADNPEKEYYYSNQEFHPVLYE---------QHKSSISKEFPSFNSA 293

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDEFFS+ ES K++LKA+QQER+ALKKLENVK+DH  RL ALE+TQ +DK+KAELI  NQ
Sbjct: 294 VDEFFSSLESQKLELKALQQEREALKKLENVKKDHSQRLLALEKTQEIDKQKAELITRNQ 353

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           E VD AILA++  +A Q+SW D+   +K+A    D +A  IK+LKL  NHI+L L     
Sbjct: 354 ELVDKAILAVQTALATQISWSDLADLIKEAASQGDEIAQRIKELKLETNHISLYL----- 408

Query: 205 RSNIKDSWVLHHDNADPFADSSETKK---------PSLVDVDLDLSAYANAKRFFDLKRS 255
                          DP+A+               P +VDVDLDLSA+AN +R++D KR+
Sbjct: 409 --------------TDPYAEDDSESDDEDNDDKIPPMVVDVDLDLSAFANGRRYYDQKRN 454

Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGN 315
           AAKKQQKTI+S  KA KSAEKKTKQTLKDVQT+TNINKARKVYWFEKF+WFISSENYL  
Sbjct: 455 AAKKQQKTIESQSKAFKSAEKKTKQTLKDVQTITNINKARKVYWFEKFFWFISSENYLVI 514

Query: 316 VAWDAK---------VVTNAWWVKAD-----QVSKTAPTGEFL----------------- 344
              D +         + +   +V AD      V    P+G+ +                 
Sbjct: 515 AGRDQQQNELIVKRYMKSTDVYVHADVHGASSVVIKNPSGQAVPPKTLNEAGTMAICYSV 574

Query: 345 ----------------------TTGSFM------IRGKKNFFPPCQLAMGISFLFKLEES 376
                                  TG ++      IRGKKNF P   L +G+SFLFKLEES
Sbjct: 575 AWDAKVVTNAYWVWGEQVSKTAPTGEYLSTGSFMIRGKKNFLPLSHLILGLSFLFKLEES 634

Query: 377 SISR 380
            I +
Sbjct: 635 CIEK 638


>gi|170055538|ref|XP_001863626.1| serologically defined colon cancer antigen 1 [Culex
           quinquefasciatus]
 gi|167875449|gb|EDS38832.1| serologically defined colon cancer antigen 1 [Culex
           quinquefasciatus]
          Length = 995

 Score =  316 bits (810), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 186/429 (43%), Positives = 246/429 (57%), Gaps = 86/429 (20%)

Query: 20  VLVSNLNVSSQGYLVQKKEKRP----NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           +L   +   S+G+++QKKE RP      E+Y + N+E+ P +           Q++  P 
Sbjct: 260 MLRQGMREPSKGFIIQKKELRPAKEGEPEEYYLTNLEYQPYLYN---------QYKDEPY 310

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
           QEF SF AAVDEF+ST E  KIDLK+  QER+ALKKL NV+ DH  RL  L + QL D++
Sbjct: 311 QEFASFTAAVDEFYSTLEGQKIDLKSFAQEREALKKLSNVRTDHAKRLDDLIKAQLEDRK 370

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
           KAELI  NQ  VD+A+LA++  +A+Q++W DI+  VK AQ +NDP+AS I+QLKL INHI
Sbjct: 371 KAELITRNQNLVDSALLAVQSALASQMAWSDIQDLVKAAQANNDPIASCIRQLKLEINHI 430

Query: 196 TLLLRVHKERSNIKDSWVL-----HHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF 250
           +LLL         KD + +       +      D  +  +P +VDVDL LSA+ANA++++
Sbjct: 431 SLLL---------KDPYAVLDEEEEEEEDSDREDDEQKLEPMVVDVDLALSAFANARKYY 481

Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSE 310
           D +R AA+K+QKTI+S+ KALK+AEKKT QTLKDV+T T I+KARKVYWFEKFYWFISSE
Sbjct: 482 DQRRFAARKEQKTIESSSKALKNAEKKTLQTLKDVRTQTTISKARKVYWFEKFYWFISSE 541

Query: 311 NYLGNVAWD--------------AKVVTNAWWVKADQVSKTAPTG------EFLTTGSFM 350
           NYL     D              A +  +A    A  V    P+G        L  G+  
Sbjct: 542 NYLVIGGRDQQQNELLVKRYMRPADIYVHAEIQGASSVVIKNPSGAEIPPKTLLEAGTMA 601

Query: 351 I---------------------------------------RGKKNFFPPCQLAMGISFLF 371
           I                                       RGKKNF PPC L +G+SF+F
Sbjct: 602 ISYSVAWDAKVVTSAYWVRSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPCHLVLGLSFMF 661

Query: 372 KLEESSISR 380
           KLEESS+ R
Sbjct: 662 KLEESSVER 670


>gi|195504496|ref|XP_002099104.1| GE23561 [Drosophila yakuba]
 gi|194185205|gb|EDW98816.1| GE23561 [Drosophila yakuba]
          Length = 996

 Score =  316 bits (809), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 188/416 (45%), Positives = 240/416 (57%), Gaps = 88/416 (21%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++Q KE++P      ++   N+EFHP +           Q + F    FESF  AV
Sbjct: 276 SKGYIIQVKEEKPTENGKVEFFFRNIEFHPYLFT---------QFKNFETATFESFMEAV 326

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q VD++KAELI +NQ 
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+L+L    + 
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
               D  +               K P L  VDVDL LSA+ANA+R++D+KRSAA+K++KT
Sbjct: 447 DEDDDDDL---------------KAPELTVVDVDLALSAWANARRYYDMKRSAAQKEKKT 491

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
           + +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL     DA+  
Sbjct: 492 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 551

Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
                       +  +A    A  V    PTG                            
Sbjct: 552 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 611

Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                            E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 612 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 667


>gi|195107152|ref|XP_001998180.1| GI23827 [Drosophila mojavensis]
 gi|193914774|gb|EDW13641.1| GI23827 [Drosophila mojavensis]
          Length = 962

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 189/423 (44%), Positives = 251/423 (59%), Gaps = 88/423 (20%)

Query: 20  VLVSNLNVSSQGYLVQ-KKEKRPNSE--DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
           ++    N SS+G+++Q K+EK  N+E  ++   N+EFHP +           Q++  P +
Sbjct: 247 IMAIGRNGSSKGFIIQVKEEKLTNAENTEHFYRNIEFHPYLFS---------QYKKLPFK 297

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E+E+F  AVDEFFS+ ES KID+K +QQER+ALKKL NVK+DH  RL  L + Q  DK+K
Sbjct: 298 EYETFMEAVDEFFSSQESQKIDIKTLQQEREALKKLSNVKKDHTKRLEELNRVQDDDKKK 357

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
           AELI +NQ  VD AILAI+  IA+QLSW DI+  VK+AQ + D VAS IKQLKL INHI+
Sbjct: 358 AELITSNQCLVDKAILAIQSAIASQLSWPDIQELVKEAQANGDIVASSIKQLKLEINHIS 417

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
           LLL    +  N        +DNAD            +VD+DL LSA+ANA+R++DLKRSA
Sbjct: 418 LLLSDPYKNEN-------ENDNADSV----------IVDIDLALSAWANARRYYDLKRSA 460

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL--- 313
           A K++KTI +++KALKSAE+KT+QTLK+V+T++NI KARK++WFEKF+WFISSENYL   
Sbjct: 461 ALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKIFWFEKFFWFISSENYLVIG 520

Query: 314 GNVAWDAKVVTNAW------WVKAD-----------QVSKTAPTGEFLTTG--------- 347
           G  A   +++   +      +V AD              +  P    L  G         
Sbjct: 521 GRDAQQNELIVKRYMRPKDIYVHADIQGASSVIIRNTTGEEIPPKTLLEAGTMAISYSVA 580

Query: 348 ------------------------------SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
                                         SFMIRGKKNF P C L MG+S LFKLE+S 
Sbjct: 581 WDAKVVTNSYWVYSHQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSLLFKLEDSF 640

Query: 378 ISR 380
           + R
Sbjct: 641 LQR 643


>gi|291230458|ref|XP_002735180.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 834

 Score =  314 bits (804), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 190/445 (42%), Positives = 253/445 (56%), Gaps = 89/445 (20%)

Query: 4   NEDVSSDIHKCGPTL----HVLVSNLNVSSQGYLVQKKEKRPNS-----EDYIVANMEFH 54
           N D+S+D+ +    L    +VL   +   S+GY++QKKEK+ +       + ++   E+H
Sbjct: 223 NVDISTDLPRIMAALQDMENVLKKIVESPSKGYVIQKKEKKTSKLSGDVPEELITYAEYH 282

Query: 55  PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
           PM+ +         QHQ     E ESF  AVDEFFS   + K+D+KA+QQE+ A+KKLEN
Sbjct: 283 PMLFE---------QHQKSLYIELESFGKAVDEFFSQMGTQKLDIKALQQEKSAIKKLEN 333

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           VK+DHE R+  L+ +Q VD  KA+LI  N   VD AI  ++  IANQ+ W +I   VK+A
Sbjct: 334 VKKDHEKRIQQLQASQNVDMVKAQLIEINLPLVDRAIQVVQSAIANQIDWAEIWDIVKEA 393

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
           Q   D VA  IK LKL+ NHITLLLR     S++ D             D      P  +
Sbjct: 394 QTQGDEVAKSIKSLKLDKNHITLLLRDPFVSSDVDDE------------DKHSGIGPLKI 441

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
           D+DLDLSAYANA+++++ K+ +A K+QKT+ +++KALKSAE KTKQTLKDV T+T+INKA
Sbjct: 442 DIDLDLSAYANARKYYEAKKHSAVKEQKTLAASQKALKSAEIKTKQTLKDVATVTSINKA 501

Query: 295 RKVYWFEKFYWFISSENYL--------------------GNV------------------ 316
           RK YWFEKF WFISSENYL                    G++                  
Sbjct: 502 RKTYWFEKFIWFISSENYLIIGGRDQQQNEIVVRKYLNKGDIYVHADLHGASSVIIKNPT 561

Query: 317 -----------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKK 355
                      A    +  +A W          V  +QVSKTAPTGE+LTTGSFM+RGKK
Sbjct: 562 GADIPPKTLNEAGSMAICYSAAWQARVVTSAWWVYHNQVSKTAPTGEYLTTGSFMVRGKK 621

Query: 356 NFFPPCQLAMGISFLFKLEESSISR 380
           N+ PP  L MG  FLFK++E S+ R
Sbjct: 622 NYLPPSYLVMGFGFLFKVDEDSLWR 646


>gi|195354790|ref|XP_002043879.1| GM17806 [Drosophila sechellia]
 gi|194129117|gb|EDW51160.1| GM17806 [Drosophila sechellia]
          Length = 972

 Score =  313 bits (803), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 186/416 (44%), Positives = 241/416 (57%), Gaps = 90/416 (21%)

Query: 29  SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++Q KE++P    + ++   N+EFHP +           Q + F    FESF  AV
Sbjct: 256 SKGYIIQVKEEKPAENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 306

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q VD++KAELI +NQ 
Sbjct: 307 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 366

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+L+L    + 
Sbjct: 367 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 426

Query: 206 SNIKDSWVLHHDNADPFADSSETKKP--SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                                + K P  ++VDVDL LSA+ANA+R++D+KRSAA+K++KT
Sbjct: 427 DE-----------------DDDLKAPEVTVVDVDLALSAWANARRYYDMKRSAAQKEKKT 469

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
           + +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL     DA+  
Sbjct: 470 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 529

Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
                       +  +A    A  V    PTG                            
Sbjct: 530 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 589

Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                            E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 590 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 645


>gi|194908933|ref|XP_001981863.1| GG11364 [Drosophila erecta]
 gi|190656501|gb|EDV53733.1| GG11364 [Drosophila erecta]
          Length = 994

 Score =  313 bits (802), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 187/416 (44%), Positives = 239/416 (57%), Gaps = 90/416 (21%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++Q KE++P      ++   N+EFHP +           Q + F    FESF  AV
Sbjct: 276 SKGYIIQVKEEKPTENGKVEFFFKNIEFHPYLFI---------QFKNFEKATFESFMDAV 326

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q VD++KAELI +NQ 
Sbjct: 327 DEFYSTQESQKIDIKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+L+L    + 
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                                + K P L  VDVDL LSA+ANA+R++D+KRSAA+K++KT
Sbjct: 447 DE-----------------DDDLKAPELTVVDVDLALSAWANARRYYDMKRSAAQKEKKT 489

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
           + +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL     DA+  
Sbjct: 490 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 549

Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
                       +  +A    A  V    PTG                            
Sbjct: 550 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 609

Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                            E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 610 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 665


>gi|281362528|ref|NP_001163721.1| caliban, isoform B [Drosophila melanogaster]
 gi|281362530|ref|NP_651341.2| caliban, isoform C [Drosophila melanogaster]
 gi|332319785|sp|Q9VBX1.2|NEMF_DROME RecName: Full=Nuclear export mediator factor NEMF homolog; AltName:
           Full=Protein Caliban
 gi|157816462|gb|ABV82224.1| IP12923p [Drosophila melanogaster]
 gi|272477156|gb|ACZ95015.1| caliban, isoform B [Drosophila melanogaster]
 gi|272477157|gb|AAF56406.2| caliban, isoform C [Drosophila melanogaster]
          Length = 992

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 185/414 (44%), Positives = 241/414 (58%), Gaps = 86/414 (20%)

Query: 29  SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++Q KE++P    + ++   N+EFHP +           Q + F    FESF  AV
Sbjct: 276 SKGYIIQVKEEKPTENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 326

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q VD++KAELI +NQ 
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+L+L    + 
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
                             D  +  + ++VDVDL LSA+ANA+R++D+KRSAA+K++KT+ 
Sbjct: 447 DE---------------DDDLKDPEVTVVDVDLALSAWANARRYYDMKRSAAQKEKKTVD 491

Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---- 321
           +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL     DA+    
Sbjct: 492 ASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQNEL 551

Query: 322 ----------VVTNAWWVKADQVSKTAPTG------------------------------ 341
                     +  +A    A  V    PTG                              
Sbjct: 552 IVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVTNS 611

Query: 342 ---------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                          E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 612 YWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 665


>gi|284005983|gb|ADB57053.1| MIP15468p [Drosophila melanogaster]
          Length = 939

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 185/414 (44%), Positives = 241/414 (58%), Gaps = 86/414 (20%)

Query: 29  SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++Q KE++P    + ++   N+EFHP +           Q + F    FESF  AV
Sbjct: 276 SKGYIIQVKEEKPTENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 326

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q VD++KAELI +NQ 
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+L+L    + 
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
                             D  +  + ++VDVDL LSA+ANA+R++D+KRSAA+K++KT+ 
Sbjct: 447 DE---------------DDDLKDPEVTVVDVDLALSAWANARRYYDMKRSAAQKEKKTVD 491

Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---- 321
           +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL     DA+    
Sbjct: 492 ASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQNEL 551

Query: 322 ----------VVTNAWWVKADQVSKTAPTG------------------------------ 341
                     +  +A    A  V    PTG                              
Sbjct: 552 IVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVTNS 611

Query: 342 ---------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                          E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 612 YWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 665


>gi|240978882|ref|XP_002403060.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215491284|gb|EEC00925.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 651

 Score =  312 bits (800), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 181/415 (43%), Positives = 236/415 (56%), Gaps = 94/415 (22%)

Query: 29  SQGYLVQKKEKRPNSED----YIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           S+GY++ + EKR    D     I +  EFHP +          RQH+   + E  SF+AA
Sbjct: 243 SKGYILVRVEKRVTPADDGSTEITSYQEFHPFL---------WRQHEKERVVELASFSAA 293

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VD+FFS+ E  +I LKA Q+E++ALKKLEN++ DHE R+ ALEQ Q  DK KAELI  N 
Sbjct: 294 VDQFFSSLEMQRISLKAHQKEKEALKKLENIRMDHEKRIVALEQVQREDKHKAELIEINL 353

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           + V+ A+L +R  +ANQ+ W +I   +++AQ   DPVA  IKQLKL+ NH  +LLR    
Sbjct: 354 DLVERALLVLRSALANQIGWAEITELLREAQEQGDPVAQSIKQLKLDTNHFAMLLR---- 409

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                          DP+    E  + +LVD+DLDLSAYANA+R++D KR AA KQQKT+
Sbjct: 410 ---------------DPY---EEDARDTLVDIDLDLSAYANARRYYDQKRHAAGKQQKTL 451

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK--- 321
           +S+ KA KSAEKKTK+ LK V   +NI +ARK +WFEKF+WFISSE+YL     DA+   
Sbjct: 452 ESSTKAYKSAEKKTKEALKQVALTSNIARARKAFWFEKFFWFISSEDYLVIGGRDAQQNE 511

Query: 322 ------------------------VVTN-----------------------AW------- 327
                                   V+ N                       AW       
Sbjct: 512 MIVKRHLNPGDVYVHADLHGASSIVIKNPGGGSVPPKTLNEAGTMAICYSAAWDAKVVTS 571

Query: 328 --WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
             WV   QVSKTAPTG++LT G+FMIRGKKN+ PP  L MG  FL+KL+E S+ R
Sbjct: 572 AWWVHHHQVSKTAPTGQYLTPGAFMIRGKKNYLPPSYLIMGFGFLYKLDEDSVER 626


>gi|195573753|ref|XP_002104856.1| GD21177 [Drosophila simulans]
 gi|194200783|gb|EDX14359.1| GD21177 [Drosophila simulans]
          Length = 972

 Score =  311 bits (798), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 184/416 (44%), Positives = 240/416 (57%), Gaps = 90/416 (21%)

Query: 29  SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
            +GY++Q KE++P    + ++   N+EFHP +           Q + F    FESF  AV
Sbjct: 256 GKGYIIQVKEEKPAENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 306

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q VD++KAELI +NQ 
Sbjct: 307 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 366

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+L+L    + 
Sbjct: 367 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 426

Query: 206 SNIKDSWVLHHDNADPFADSSETKKP--SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                                + K P  ++VDVDL +SA+ANA+R++D+KRSAA+K++KT
Sbjct: 427 DE-----------------DDDLKAPEVTVVDVDLAMSAWANARRYYDMKRSAAQKEKKT 469

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-- 321
           + +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWFISSENYL     DA+  
Sbjct: 470 VDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISSENYLVIGGRDAQQN 529

Query: 322 ------------VVTNAWWVKADQVSKTAPTG---------------------------- 341
                       +  +A    A  V    PTG                            
Sbjct: 530 ELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEEIPPKTLLEAGSMAISYSVAWDAKVVT 589

Query: 342 -----------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                            E+L TGSFMIRGKKNF P C L MG+S LFKLE+S I R
Sbjct: 590 NSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSLLFKLEDSFIER 645


>gi|291190355|ref|NP_001167106.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
 gi|223648156|gb|ACN10836.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
          Length = 1069

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 182/426 (42%), Positives = 239/426 (56%), Gaps = 83/426 (19%)

Query: 29  SQGYLVQKKEKRPN----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
            +GY++QK +K+P+      + ++   EFHP +           QH      EF++F  A
Sbjct: 246 GKGYIIQKCDKKPSLAPEKPEELLTYEEFHPFLFA---------QHANSHYVEFDTFDKA 296

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDE++S  ES +ID+KA+QQE+ ALKKL+NVKRDH  RL AL Q Q VD+ + EL+  N 
Sbjct: 297 VDEYYSKMESQRIDVKALQQEKQALKKLDNVKRDHVQRLEALHQLQEVDRLRGELVEMNL 356

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR---- 200
             V+ A+  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NHIT+LL+    
Sbjct: 357 PIVERALQVVRSALANQVDWAEIGLIVKEAQAAGDPVACAIKELKLQTNHITMLLKNPYI 416

Query: 201 -------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
                          +     + +         +  +P LVDVDL LSAYANAK+++D K
Sbjct: 417 VPDEVEEEDVAEVAEEKKGKKNKNKDKGQKGKPKKDQPMLVDVDLSLSAYANAKKYYDHK 476

Query: 254 RSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           R+AAKK+QKT+++ +KA KSAEKKTKQTLK+VQT+T I KARKVYWFEKF WFISSENYL
Sbjct: 477 RTAAKKEQKTVEAAQKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSENYL 536

Query: 314 --------------------GNV-----------------------------AWDAKVVT 324
                               G++                             A    V  
Sbjct: 537 IIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNASGVPIPPRTLTEAGTMAVCY 596

Query: 325 NAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           +A W          V   QV+K+APTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++
Sbjct: 597 SAAWDAKVITSAWWVHHHQVTKSAPTGEYLTTGSFMIRGKKNFMPPSYLMMGFSFLFKVD 656

Query: 375 ESSISR 380
           E  + R
Sbjct: 657 EQCVFR 662


>gi|383852746|ref|XP_003701886.1| PREDICTED: nuclear export mediator factor NEMF homolog [Megachile
           rotundata]
          Length = 970

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 157/290 (54%), Positives = 211/290 (72%), Gaps = 33/290 (11%)

Query: 29  SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P    +E++I  N+EFHP + +         Q++ +P +EF+SF A+V
Sbjct: 242 SKGYIIQKKEVKPVVDGTEEFIYTNIEFHPYLFE---------QYKDYPFKEFDSFDASV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+D+K +QQER+ALKKL+NVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 293 DEYFSTMEGQKLDMKVLQQEREALKKLDNVKKDHDQRLITLEKTQELDKQKAELISRNQM 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL+ NHI+LLL      
Sbjct: 353 LVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLDTNHISLLLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ +S E    KP L+D+DL  +A+ NA+++++ KRSAAKKQQKT
Sbjct: 408 --------------DPYEESDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAAKKQQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           I+S +KALKSAEKKTKQTLK+VQ + +INK RK+YWFEKFYWFISSENYL
Sbjct: 454 IESQDKALKSAEKKTKQTLKEVQAIHSINKLRKIYWFEKFYWFISSENYL 503



 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 51/66 (77%), Positives = 57/66 (86%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           ++AWDAKVV  AWWV  DQVSKTAPTGE+LTTGSFMIRGKKN+  PCQL MG+ FLF+LE
Sbjct: 564 SIAWDAKVVAGAWWVNNDQVSKTAPTGEYLTTGSFMIRGKKNYLSPCQLVMGLGFLFRLE 623

Query: 375 ESSISR 380
           ESSI R
Sbjct: 624 ESSIER 629


>gi|395504204|ref|XP_003756446.1| PREDICTED: nuclear export mediator factor NEMF [Sarcophilus
           harrisii]
          Length = 996

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 180/375 (48%), Positives = 229/375 (61%), Gaps = 52/375 (13%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N S +GY++QKKEK+P+      SED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFSGKGYIIQKKEKKPSLEPDKQSED-ILTYEEFHPFLFS---------QHSKCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKLEGQKIDLKALQQEKQALKKLDNVRKDHEHRLEALHQAQEIDKIKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   D VA+ I++LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDIVANAIRELKLQTNHVTMLL 410

Query: 200 R--------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
           +              ++ E+   ++         +      +  KP LVDVDL LSAYAN
Sbjct: 411 KNPYLISDEEEEDDEINIEKEETEEPKGKKKKQKNKQLQKLQKNKPLLVDVDLSLSAYAN 470

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           AK+++D KR AA+K QKT+++ EKA +SAEKKTKQTLK+VQ +T I KARKVY       
Sbjct: 471 AKKYYDHKRHAARKTQKTVEAAEKAFRSAEKKTKQTLKEVQMVTTIQKARKVYCIS---L 527

Query: 306 FISSENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
            I S     N+ +              QVSKTAPTGE+LTTGSFMIRGKKNF PP  L M
Sbjct: 528 VIES-----NIIF--------------QVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMM 568

Query: 366 GISFLFKLEESSISR 380
           G SFLFK++E  + R
Sbjct: 569 GFSFLFKVDEPCVWR 583


>gi|119586147|gb|EAW65743.1| serologically defined colon cancer antigen 1, isoform CRA_c [Homo
           sapiens]
          Length = 1001

 Score =  307 bits (786), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 185/404 (45%), Positives = 228/404 (56%), Gaps = 84/404 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           EFHP +           QH   P  EFESF  AVDEF+S  E  KIDLKA+QQE+ ALKK
Sbjct: 263 EFHPFLFS---------QHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKK 313

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+NV++DHE RL AL+Q Q +DK K ELI  N + VD AI  +R  +ANQ+ W +I   V
Sbjct: 314 LDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIV 373

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLR----------------VHKERSNIKDSWVLH 215
           K+AQ   DPVAS IK+LKL  NH+T+LLR                V+ E++  +      
Sbjct: 374 KEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNETEPPKGKK 433

Query: 216 HDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE 275
               +      +  KP LVDVDL LSAYANAK+++D KR AAKK QKT+++ EKA KSAE
Sbjct: 434 KKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAE 493

Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW----- 327
           KKTKQTLK+VQT+T+I KARKVYWFEKF WFISSENYL   G      +++   +     
Sbjct: 494 KKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGD 553

Query: 328 -WVKADQVSKTA-----PTG---------------------------------------- 341
            +V AD    T+     PTG                                        
Sbjct: 554 IYVHADLHGATSCVIKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSK 613

Query: 342 -----EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                E+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 614 TAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 657


>gi|119586146|gb|EAW65742.1| serologically defined colon cancer antigen 1, isoform CRA_b [Homo
           sapiens]
          Length = 1067

 Score =  307 bits (786), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 185/404 (45%), Positives = 228/404 (56%), Gaps = 84/404 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           EFHP +           QH   P  EFESF  AVDEF+S  E  KIDLKA+QQE+ ALKK
Sbjct: 263 EFHPFLFS---------QHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKK 313

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+NV++DHE RL AL+Q Q +DK K ELI  N + VD AI  +R  +ANQ+ W +I   V
Sbjct: 314 LDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIV 373

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLR----------------VHKERSNIKDSWVLH 215
           K+AQ   DPVAS IK+LKL  NH+T+LLR                V+ E++  +      
Sbjct: 374 KEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNETEPPKGKK 433

Query: 216 HDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE 275
               +      +  KP LVDVDL LSAYANAK+++D KR AAKK QKT+++ EKA KSAE
Sbjct: 434 KKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAE 493

Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW----- 327
           KKTKQTLK+VQT+T+I KARKVYWFEKF WFISSENYL   G      +++   +     
Sbjct: 494 KKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGD 553

Query: 328 -WVKADQVSKTA-----PTG---------------------------------------- 341
            +V AD    T+     PTG                                        
Sbjct: 554 IYVHADLHGATSCVIKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSK 613

Query: 342 -----EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                E+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 614 TAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 657


>gi|159155700|gb|AAI54741.1| Zgc:153813 protein [Danio rerio]
          Length = 883

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 183/428 (42%), Positives = 239/428 (55%), Gaps = 97/428 (22%)

Query: 26  NVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S QG+++QK E++PN     + + ++   EFHP +           QH      EFES
Sbjct: 242 NFSGQGFIIQKSEQKPNVCAGDAAEELLTYEEFHPFLFC---------QHVKSRYVEFES 292

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEFFS  ES K+D++A+QQE+ ALKKLENV++DH+ RL AL Q Q V++ K EL+
Sbjct: 293 FNKAVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELV 352

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N   V  A+  +R  +ANQ+ W +I   V +AQ   DPVA  IK+LKL  NHITLLLR
Sbjct: 353 ELNLPVVQRALQVVRSALANQVDWVEIGQMVTEAQAAGDPVACAIKELKLQSNHITLLLR 412

Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK---------KPSLVDVDLDLSAYANAKRFFD 251
                          +  A P   ++E +         K  LVD+D++LSA+ANAKR++D
Sbjct: 413 ---------------NPEACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYD 457

Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN 311
            KRSAAKK+QKT+++ +KA KSAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSEN
Sbjct: 458 SKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSEN 517

Query: 312 YLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PTGE--------------- 342
           YL     D +         +     +V AD    T+     P+GE               
Sbjct: 518 YLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAV 577

Query: 343 ------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
                                         +LTTGSFMIRGKKNF PP  L MG  FLFK
Sbjct: 578 CYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFK 637

Query: 373 LEESSISR 380
           +++ S+ R
Sbjct: 638 VDDQSVFR 645


>gi|71679669|gb|AAI00005.1| Zgc:153813 protein [Danio rerio]
          Length = 881

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 183/428 (42%), Positives = 239/428 (55%), Gaps = 97/428 (22%)

Query: 26  NVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S QG+++QK E++PN     + + ++   EFHP +           QH      EFES
Sbjct: 242 NFSGQGFIIQKSEQKPNVCAGDAAEELLTYEEFHPFLFC---------QHVKSRYVEFES 292

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEFFS  ES K+D++A+QQE+ ALKKLENV++DH+ RL AL Q Q V++ K EL+
Sbjct: 293 FNKAVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELV 352

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N   V  A+  +R  +ANQ+ W +I   V +AQ   DPVA  IK+LKL  NHITLLLR
Sbjct: 353 ELNLPVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLR 412

Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK---------KPSLVDVDLDLSAYANAKRFFD 251
                          +  A P   ++E +         K  LVD+D++LSA+ANAKR++D
Sbjct: 413 ---------------NPEACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYD 457

Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN 311
            KRSAAKK+QKT+++ +KA KSAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSEN
Sbjct: 458 SKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSEN 517

Query: 312 YLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PTGE--------------- 342
           YL     D +         +     +V AD    T+     P+GE               
Sbjct: 518 YLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAV 577

Query: 343 ------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
                                         +LTTGSFMIRGKKNF PP  L MG  FLFK
Sbjct: 578 CYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFK 637

Query: 373 LEESSISR 380
           +++ S+ R
Sbjct: 638 VDDQSVFR 645


>gi|115529351|ref|NP_001070202.1| uncharacterized protein LOC767767 [Danio rerio]
 gi|115313121|gb|AAI24465.1| Zgc:153813 [Danio rerio]
          Length = 694

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 183/428 (42%), Positives = 239/428 (55%), Gaps = 97/428 (22%)

Query: 26  NVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S QG+++QK E++PN     + + ++   EFHP +           QH      EFES
Sbjct: 242 NFSGQGFIIQKSEQKPNVCAGDAAEELLTYEEFHPFLFC---------QHVKSRYVEFES 292

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEFFS  ES K+D++A+QQE+ ALKKLENV++DH+ RL AL Q Q V++ K EL+
Sbjct: 293 FNKAVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELV 352

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N   V  A+  +R  +ANQ+ W +I   V +AQ   DPVA  IK+LKL  NHITLLLR
Sbjct: 353 ELNLPVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLR 412

Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK---------KPSLVDVDLDLSAYANAKRFFD 251
                          +  A P   ++E +         K  LVD+D++LSA+ANAKR++D
Sbjct: 413 ---------------NPEACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYD 457

Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN 311
            KRSAAKK+QKT+++ +KA KSAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSEN
Sbjct: 458 SKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSEN 517

Query: 312 YLGNVAWDAK---------VVTNAWWVKADQVSKTA-----PTGE--------------- 342
           YL     D +         +     +V AD    T+     P+GE               
Sbjct: 518 YLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAV 577

Query: 343 ------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
                                         +LTTGSFMIRGKKNF PP  L MG  FLFK
Sbjct: 578 CYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFK 637

Query: 373 LEESSISR 380
           +++ S+ R
Sbjct: 638 VDDQSVFR 645


>gi|322784867|gb|EFZ11647.1| hypothetical protein SINV_03144 [Solenopsis invicta]
          Length = 985

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 187/416 (44%), Positives = 244/416 (58%), Gaps = 92/416 (22%)

Query: 29  SQGYLVQKKEKR---PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +      ED+I AN+EFHP +           Q+   P +EF+SF AAV
Sbjct: 242 SKGYILQKKESKLTQDGKEDFIFANIEFHPFLFD---------QYNNQPYKEFDSFDAAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE++ST E  KIDLKA+QQER+AL+KLE V++DH  RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 293 DEYYSTMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDKQKAELISRNQA 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ+SW DI+  +K+AQ   DPVAS IKQLKL  NHI LLL      
Sbjct: 353 LVDNAILAIQSALANQMSWPDIQVLLKEAQARGDPVASAIKQLKLETNHIALLLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ DS E    KP ++D+DL  +A++NAK+++  K+SAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESELKPMIIDIDLAHTAFSNAKKYYSQKKSAAKKQQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDA 320
           I+S  KALKSAEKKTKQTLK+VQT+  INK RK+YWFEKFYWFI+SENYL   G      
Sbjct: 454 IESHGKALKSAEKKTKQTLKEVQTIHTINKLRKMYWFEKFYWFITSENYLVIGGRDQQQN 513

Query: 321 KVVTNAW------WVKADQVSKTA-----PTGEFL------------------------- 344
           +++   +      +V AD    ++     P+G  +                         
Sbjct: 514 ELIVKRYLKAGDLYVHADLTGASSVVIKNPSGNPVPPKSLAEAGTMAVAYSIAWDSKVIA 573

Query: 345 --------------TTGSF------MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                          TG +      MIRGKKN+    QL MG+  +F+LE+SSI R
Sbjct: 574 SAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLTQSQLIMGLGLMFRLEDSSIER 629


>gi|332016223|gb|EGI57136.1| Serologically defined colon cancer antigen 1 [Acromyrmex
           echinatior]
          Length = 990

 Score =  298 bits (762), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 186/416 (44%), Positives = 242/416 (58%), Gaps = 92/416 (22%)

Query: 29  SQGYLVQKKEKR---PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +      ED+I AN+EFHP + +         Q+   P +EF SF AAV
Sbjct: 242 SKGYIIQKKESKLTQDGKEDFIFANIEFHPFLFE---------QYNNQPYKEFNSFDAAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FS  E  KIDLKA+QQER+AL+KLE V++DH  RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 293 DEYFSMMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDKQKAELISRNQV 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ+SW DI+  +K+AQ   DPVAS IKQLKL  NHI L+L      
Sbjct: 353 LVDNAILAIQSALANQMSWPDIQVLLKEAQTRGDPVASAIKQLKLETNHIALMLH----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
                         DP+ DS E    KP ++D+DL  +A++NAK+++  K+SAAKKQQKT
Sbjct: 408 --------------DPYEDSDEESKLKPMMIDIDLAHTAFSNAKKYYSQKKSAAKKQQKT 453

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDA 320
           I+S  KALKSAEKKTKQTLK+VQT+  INK RK YWFEKFYWFI+SENYL   G      
Sbjct: 454 IESQGKALKSAEKKTKQTLKEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQN 513

Query: 321 KVVTNAW------WVKADQVSKTA-----PTGEFL------------------------- 344
           +++   +      +V AD    ++     P+G  +                         
Sbjct: 514 ELIVKRYLKAGDLYVHADLTGASSVVIKNPSGNPVPPKSLAEAGTMAVAYSIAWDSKVIA 573

Query: 345 --------------TTGSF------MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                          TG +      MIRGKKN+    QL MG+  +F+LE+SSI R
Sbjct: 574 SAWWVHHDQVSKSAPTGEYLTTGSFMIRGKKNYLTHSQLIMGLGIMFRLEDSSIER 629


>gi|390331684|ref|XP_003723334.1| PREDICTED: nuclear export mediator factor Nemf-like
           [Strongylocentrotus purpuratus]
          Length = 1116

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 175/411 (42%), Positives = 241/411 (58%), Gaps = 83/411 (20%)

Query: 30  QGYLVQKKEKRPN-SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
           +GY+VQK+EK+P  S+D   A +     IL   S P +          EF++F  A DEF
Sbjct: 244 KGYIVQKREKKPAASQDGEDAEL-LTEFILYTNSQPYL----------EFDTFDQAADEF 292

Query: 89  FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
           FS  ES K+D+K +QQER ALKKL+NVK+DHE R+S+L+Q Q ++++K  LI  N   V+
Sbjct: 293 FSKMESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLVE 352

Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
            A+  +R  +ANQ+ W++I++ +K+AQ   DPVA  IK L+L+ NH  +LLR        
Sbjct: 353 QALRVVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIKSLRLDTNHFQMLLR-------- 404

Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTE 268
            D +   +D+AD      +  +P LVD+D+  SAYANA+++F  K+++ KK+QKT++S+ 
Sbjct: 405 -DPYK-QYDDAD--EGEEDVARPMLVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSS 460

Query: 269 KALKSAEKKTKQTLKDVQTMTNINKARKV------YWF--EKFYWFISSEN--------- 311
           KA+KSAEKKT Q LKDV T+ +INK+RK       YW    + Y  I+  +         
Sbjct: 461 KAIKSAEKKTMQALKDVATVASINKSRKTYWFEKYYWCISSENYIIIAGRDQQQNEIVVK 520

Query: 312 -YLG-----------------------------------------NVAWDAKVVTNAWWV 329
            YL                                          +VAWDAKV+T+AWWV
Sbjct: 521 KYLSPGDIYVHADIHGASSVIIKNPKGGPVPPKTLQEAGTMAVCYSVAWDAKVITSAWWV 580

Query: 330 KADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           + DQVSKTAPTGEFLTTGSFM+RGKKNF PP QL MG  FL K++ES   R
Sbjct: 581 RHDQVSKTAPTGEFLTTGSFMVRGKKNFLPPTQLVMGFGFLMKIDESCAWR 631


>gi|195038845|ref|XP_001990823.1| GH19576 [Drosophila grimshawi]
 gi|193895019|gb|EDV93885.1| GH19576 [Drosophila grimshawi]
          Length = 983

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 182/415 (43%), Positives = 237/415 (57%), Gaps = 92/415 (22%)

Query: 28  SSQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           +S GY++Q KE++P    + ++   N+EFHP +           Q++  P +++E+F  A
Sbjct: 275 NSNGYIIQVKEEKPLDTENTEHFYRNVEFHPYLFV---------QNKDQPFKKYETFMEA 325

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDEFFST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q +DK KAELI +NQ
Sbjct: 326 VDEFFSTQESQKIDIKTLQQEREALKKLSNVKNDHTKRLDELNKLQDIDKRKAELITSNQ 385

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
             VD AILAI+  IA+QLSW DI+  VK+AQ + D VAS IKQLKL INHI+LLL     
Sbjct: 386 SLVDKAILAIQSAIASQLSWPDIQELVKEAQTNGDVVASSIKQLKLEINHISLLL----- 440

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                          DP+          +VDVDL LSA+ANA+R++D KRSAA K++KTI
Sbjct: 441 --------------TDPY--ECNDDDSIIVDVDLALSAWANARRYYDQKRSAALKEKKTI 484

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAK 321
            +++KALKSAE+KT+QTLK+V+T++NI KARKV+WFEKFYWF+SSENYL   G  A   +
Sbjct: 485 DASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFYWFVSSENYLVIGGRDAQQNE 544

Query: 322 VVTNAW------WVKAD-----------QVSKTAPTGEFLTTGSF--------------- 349
           ++   +      +V AD                 P    L  G+                
Sbjct: 545 LIVKRYMRPKDIYVHADIQGASSVIIRNATGGDIPPKTLLEAGTMAISYSVAWDAKVVTN 604

Query: 350 ------------------------MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                                   MIRGKKNF P C L MG+S LFKLEE  + R
Sbjct: 605 SYWVYSNQVSKTAPSGEYLGTGSFMIRGKKNFLPSCHLIMGLSLLFKLEEGFVQR 659


>gi|198422494|ref|XP_002122733.1| PREDICTED: similar to serologically defined colon cancer antigen 1
           [Ciona intestinalis]
          Length = 1103

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 172/412 (41%), Positives = 232/412 (56%), Gaps = 92/412 (22%)

Query: 26  NVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           ++  +GY+VQK E + + E  +  N+EFHP          V  QH+   +QEFESF  AV
Sbjct: 251 SLQCKGYIVQKIETKSDGE--LKTNVEFHPF---------VFNQHKHRNLQEFESFNKAV 299

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEFF + ES K D+K++Q+ER A++KLENV++DHE+RLS L   Q  D+ KA LI  N  
Sbjct: 300 DEFFGSLESQKNDMKSLQRERAAMRKLENVRKDHESRLSGLRSEQESDEMKAALIETNLH 359

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD +IL +R  IANQ+ W++I+  VK+AQ   DPVAS IK LKL  N + + LR H + 
Sbjct: 360 LVDQSILVVRSAIANQVDWDEIKLLVKEAQGRGDPVASCIKTLKLETNSMVMALRSHDD- 418

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
                                + +KP+ +++DL LSAYANA++++  KR+AAKK+QKTI 
Sbjct: 419 ---------------------DDQKPTKIEIDLSLSAYANARKYYGRKRNAAKKEQKTID 457

Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKV 322
           ++ KA KSAEKKTKQTLK+   + NI KARKVYWFEKF WFISSENYL   G  A   +V
Sbjct: 458 ASTKAFKSAEKKTKQTLKEAAAVRNILKARKVYWFEKFLWFISSENYLVIGGREAQQNEV 517

Query: 323 VTNAW------WVKADQVSKTA-----PTG------------------------------ 341
           +   +      +V AD    T+     P+G                              
Sbjct: 518 LVKKYLNQGDIYVHADLHGATSCIIKNPSGQPIPPKTLNEAGTMATCHSAAWDAKVVTSA 577

Query: 342 ---------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
                          E+LTTGSF+IRGKKN+ PP  L  G  FLFK++E+ +
Sbjct: 578 WWVHHDQVSKTAPSGEYLTTGSFLIRGKKNYLPPSYLVYGFGFLFKVDETCV 629


>gi|443707183|gb|ELU02895.1| hypothetical protein CAPTEDRAFT_151175 [Capitella teleta]
          Length = 1023

 Score =  290 bits (742), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 178/413 (43%), Positives = 228/413 (55%), Gaps = 88/413 (21%)

Query: 28  SSQGYLVQKKEKRPNSE--DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           SS+GY++QKKEK+ +S   D ++   EFHP +          +QH+  P  EFESF+ AV
Sbjct: 242 SSKGYIIQKKEKKASSTEGDELITYNEFHPYL---------YKQHESCPHLEFESFSKAV 292

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEFFS  ES K+D+K +QQE+  L+KLENV++DH  RL AL   Q  D  K +LI  N  
Sbjct: 293 DEFFSKIESQKLDMKTLQQEKSVLRKLENVRKDHAQRLQALANEQEKDNIKGQLIEMNLP 352

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            V+ AIL ++  +ANQL W DI   VK+AQ   DPVAS I  L+L  NH T++LR     
Sbjct: 353 LVERAILVVQSALANQLDWADINQLVKEAQAQGDPVASSISSLQLQSNHFTMMLR----- 407

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                         D +    E   P+  V +DL LSAYANA++++D K+ AA+K+QKT+
Sbjct: 408 --------------DCYEGDEEDMLPAQKVQIDLGLSAYANARKYYDKKKHAAQKEQKTV 453

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISSENYLGN- 315
            ++ KALKSAEKKTKQTLK+VQ    I K RK +      WF   + Y  I   +   N 
Sbjct: 454 AASTKALKSAEKKTKQTLKEVQVAATIRKQRKTHWFEKFLWFISSENYLVIGGRDQQQNE 513

Query: 316 ------------------------------------------------VAWDAKVVTNAW 327
                                                            AWDAKVVT+AW
Sbjct: 514 LLVKRHLRPGDLYVHADLHGASSVIIKNPSGVPPKTLNEAGTMALCHSAAWDAKVVTSAW 573

Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           WV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L  G  FLFK++++SI R
Sbjct: 574 WVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIYGFGFLFKVDDTSIFR 626


>gi|26333303|dbj|BAC30369.1| unnamed protein product [Mus musculus]
          Length = 641

 Score =  288 bits (738), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 179/407 (43%), Positives = 229/407 (56%), Gaps = 89/407 (21%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410

Query: 200 R----VHKERSNIKDSWVLHHDNADPFADSSETK----------KPSLVDVDLDLSAYAN 245
           R    + +E     D+ + + D   P     + K          KP LVDVDL LSAYAN
Sbjct: 411 RNPYLLSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYAN 470

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           AK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF W
Sbjct: 471 AKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLW 530

Query: 306 FISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTG---------- 341
           FISSENYL   G      +++   +      +V AD    T+     PTG          
Sbjct: 531 FISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTE 590

Query: 342 -----------------------------------EFLTTGSFMIRG 353
                                              E+LTTGSFMIRG
Sbjct: 591 AGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 637


>gi|340374096|ref|XP_003385574.1| PREDICTED: nuclear export mediator factor Nemf-like [Amphimedon
           queenslandica]
          Length = 1137

 Score =  285 bits (728), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 176/423 (41%), Positives = 236/423 (55%), Gaps = 99/423 (23%)

Query: 26  NVSSQGYLVQKKEKRPN----------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           ++ S+GYL+QKK+  P           +   +  N+EFHP++          RQH     
Sbjct: 242 SMPSKGYLIQKKDVAPTVGVATSDTPTTSTEVTTNIEFHPLL---------YRQHLSSCY 292

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
           +E E+F  AVDEFFS+  S K D+K +Q ++ A+KKLEN+K+DHE R+ AL ++Q  D+ 
Sbjct: 293 KEVETFDRAVDEFFSSKSSQKQDVKVIQLQKSAVKKLENIKQDHEKRIEALRKSQDEDRY 352

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
           KAELI  N + V+ A L IR  +A+ + W DIE  V  AQ   DPVA+ I+ LKL+ N I
Sbjct: 353 KAELIEWNTDLVERACLVIRSAVASSMDWGDIELLVHDAQGRGDPVANSIQGLKLHSNLI 412

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
           TL L+   E            D+ D             VD+DL LS YANA+R++D+K+ 
Sbjct: 413 TLWLKAPYEE-----------DDDDSIK----------VDIDLGLSVYANARRYYDMKKQ 451

Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY--- 312
           AAKK+QKT +S+ KALKSAE+KTKQTLK+   ++ I KARKV+WFEKFYWFISSEN+   
Sbjct: 452 AAKKEQKTSESSNKALKSAERKTKQTLKEAAVISRITKARKVHWFEKFYWFISSENFVVI 511

Query: 313 -----------------------------------------------LGNVAWDAKVVTN 325
                                                          L      A   ++
Sbjct: 512 GGRDQQQNELLVKKYLNEHDVYVHADLHGATSVIVKNHSGGPVPPKTLNEAGVMAVCYSS 571

Query: 326 AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW         WV A+QVSKTAP+GE+LTTGSFMIRGKKNF PPC L +G S +FK++ES
Sbjct: 572 AWEAKIVTSAWWVYANQVSKTAPSGEYLTTGSFMIRGKKNFLPPCHLVLGFSIMFKVDES 631

Query: 377 SIS 379
           S++
Sbjct: 632 SLA 634


>gi|119586149|gb|EAW65745.1| serologically defined colon cancer antigen 1, isoform CRA_e [Homo
           sapiens]
          Length = 628

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 159/309 (51%), Positives = 199/309 (64%), Gaps = 30/309 (9%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 305 WFISSENYL 313
           WFISSENYL
Sbjct: 532 WFISSENYL 540


>gi|321467512|gb|EFX78502.1| hypothetical protein DAPPUDRAFT_305191 [Daphnia pulex]
          Length = 997

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 153/315 (48%), Positives = 206/315 (65%), Gaps = 34/315 (10%)

Query: 10  DIHKCGPTLHV-------LVSNLNVSSQGYLVQKKEKRPN----SEDYIVANMEFHPMIL 58
           DI +  P LH+       ++ N+   S+G +VQK+E RP     ++D+   N EFHPM+ 
Sbjct: 217 DITRDLPKLHLALKSADSIMQNIGSISKGIVVQKRESRPTPSGENQDFFT-NQEFHPML- 274

Query: 59  QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
                    +QH+  P  E  SF  AVDEFFS  ES K+DLK VQQERDA+KKL N+++D
Sbjct: 275 --------YKQHESHPFIELPSFNQAVDEFFSKMESQKLDLKVVQQERDAMKKLANIRQD 326

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
           HE RL+ L   Q +D+ KA LI  NQ  +D AI  +R  +ANQ+SW++I+  V++A R  
Sbjct: 327 HEKRLANLHHVQEIDELKARLIEMNQPLIDHAIQVVRSALANQVSWKEIDELVEEATRKG 386

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
           DPVA IIK+LKL+ NHI+L+L  H       DS            +S E+ KP LVD+DL
Sbjct: 387 DPVAKIIKKLKLSTNHISLMLS-HPYAEQDSDS------------ESDESYKPQLVDIDL 433

Query: 239 DLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY 298
           DL+A+ANA+++F  K++A+KK+QKTI+S+ KA KSAEKK KQTLK+   +  I KARKV 
Sbjct: 434 DLTAFANARKYFGEKKNASKKEQKTIESSHKAFKSAEKKAKQTLKESAAIATIRKARKVL 493

Query: 299 WFEKFYWFISSENYL 313
           WFEKFYWFISS+NY+
Sbjct: 494 WFEKFYWFISSDNYI 508



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 45/66 (68%), Positives = 56/66 (84%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AW+AKV+T AWWV++ QVSKTAP+GE+LTTGSFMIRGKKNF PP  + +G   LF+LE
Sbjct: 570 SAAWEAKVMTTAWWVESSQVSKTAPSGEYLTTGSFMIRGKKNFLPPLPIVLGFGLLFRLE 629

Query: 375 ESSISR 380
           ESSI+R
Sbjct: 630 ESSIAR 635


>gi|347968346|ref|XP_312244.5| AGAP002680-PA [Anopheles gambiae str. PEST]
 gi|333468048|gb|EAA08148.6| AGAP002680-PA [Anopheles gambiae str. PEST]
          Length = 1053

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 150/289 (51%), Positives = 197/289 (68%), Gaps = 13/289 (4%)

Query: 29  SQGYLVQKKEKRP----NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           S GY++QKKE +P    + E+Y   N+E+ P +           Q+QG P + F+SF  A
Sbjct: 268 SPGYIIQKKEVKPTKQGDEEEYYFTNLEYQPYMYN---------QYQGEPFKAFDSFTTA 318

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDEF+S+ ES KIDLKA  QER+ALKKL NVK DH  R+  L + QL D+++AELI  NQ
Sbjct: 319 VDEFYSSLESQKIDLKAFAQEREALKKLSNVKTDHAKRIEELTKAQLEDRKRAELITRNQ 378

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           + VD A+LA++  +A Q+SW DI+  VK AQ + DPVAS I+QLKL INHI+L L     
Sbjct: 379 DLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDPVASCIRQLKLEINHISLHLTDPYA 438

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
             + + S     +      D      P +VDVDL LSA+ANA+R++D +R AA+K+QKTI
Sbjct: 439 SLDEQASDEEEEEEDSEREDDEAKLVPMVVDVDLALSAFANARRYYDQRRFAARKEQKTI 498

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           +S+ KALK+AE+KT QTLKDV+T T I+K RKVYWFEKFYWF+SSENYL
Sbjct: 499 ESSSKALKNAERKTIQTLKDVRTQTTISKVRKVYWFEKFYWFVSSENYL 547



 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 50/66 (75%), Positives = 61/66 (92%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           +VAWDAKVVT+A+WV ++QVSKTAPTGE+LTTGSFMIRG+KNF PPC L +G+SFLFKLE
Sbjct: 608 SVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFLPPCHLVLGLSFLFKLE 667

Query: 375 ESSISR 380
           +SS+ R
Sbjct: 668 DSSVER 673


>gi|312384850|gb|EFR29482.1| hypothetical protein AND_01485 [Anopheles darlingi]
          Length = 1109

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 150/305 (49%), Positives = 198/305 (64%), Gaps = 55/305 (18%)

Query: 29  SQGYLVQKKEKRP----NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           S GY++QKKE RP      E+Y   N+E+ P +           Q+QG P +EF+SF +A
Sbjct: 268 SPGYIIQKKEVRPAKEGEKEEYYFTNLEYQPYMYS---------QYQGEPCKEFDSFTSA 318

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDEF+S+       L+ + QER+ALKKL NVK DH  R+  L + QL D++KAELI  NQ
Sbjct: 319 VDEFYSS-------LETLAQEREALKKLSNVKTDHAKRIEELTKAQLGDRKKAELITRNQ 371

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           + VD A+LA++  +A Q+SW DI+  VK AQ + DPVAS I+QLKL INHI+L L     
Sbjct: 372 DLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDPVASCIRQLKLEINHISLYL----- 426

Query: 205 RSNIKDSWVLHHDNADPFA--DSSETK--------------KPSLVDVDLDLSAYANAKR 248
                         +DP+A  D +E+               +P +VDVDL LSA+ANA+R
Sbjct: 427 --------------SDPYAFLDENESDNEEDSDREEDEEKLEPMVVDVDLALSAFANARR 472

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
           ++D +R AA+K+QKTI+S+ KALK+AE+KT QTLKDV+T T I+K RKVYWFEKFYWFIS
Sbjct: 473 YYDQRRFAARKEQKTIESSSKALKNAERKTIQTLKDVRTQTTISKVRKVYWFEKFYWFIS 532

Query: 309 SENYL 313
           SENYL
Sbjct: 533 SENYL 537



 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 50/66 (75%), Positives = 61/66 (92%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           +VAWDAKVVT+A+WV ++QVSKTAPTGE+LTTGSFMIRG+KNF PPC L +G+SFLFKLE
Sbjct: 598 SVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFLPPCHLVLGLSFLFKLE 657

Query: 375 ESSISR 380
           +SS+ R
Sbjct: 658 DSSVER 663


>gi|351702906|gb|EHB05825.1| Serologically defined colon cancer antigen 1, partial
           [Heterocephalus glaber]
          Length = 762

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 158/352 (44%), Positives = 198/352 (56%), Gaps = 75/352 (21%)

Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
           +E+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  N + VD AI  +R  +ANQ+ 
Sbjct: 1   KEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQVVDRAIQVVRSALANQID 60

Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR----------------VHKERSN 207
           W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR                V  E++ 
Sbjct: 61  WTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDADGDVSVEKNE 120

Query: 208 IKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQST 267
            +          +      +  KP LVDVDL LSAYANAK+++D KR AAKK QKT+++ 
Sbjct: 121 TEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAA 180

Query: 268 EKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVT 324
           EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WFISSENYL   G      +++ 
Sbjct: 181 EKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIV 240

Query: 325 NAW------WVKADQVSKTA-----PTG-------------------------------- 341
             +      +V AD    T+     PTG                                
Sbjct: 241 KRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEVGTMALCYSAAWDARVITSAWW 300

Query: 342 -------------EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                        E+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 301 VYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 352


>gi|405952718|gb|EKC20496.1| Serologically defined colon cancer antigen 1-like protein
           [Crassostrea gigas]
          Length = 1084

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 170/452 (37%), Positives = 235/452 (51%), Gaps = 108/452 (23%)

Query: 7   VSSDIHKCGPTLHVLVSNLNVSS----QGYLVQKKEKRP------NSEDYIVANMEFHPM 56
           V+ D+ K    + +  S L   S    QG +VQKKEKR       N+E  ++   EFHPM
Sbjct: 217 VTEDMDKLTSAIELAESLLKTLSEEPCQGVIVQKKEKRAAVKEGENAE--LLTYEEFHPM 274

Query: 57  ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVK 116
           +          +Q +  P   F++F  +VDEFFS  ES K+D+KA+QQE+ ALKKL+N+K
Sbjct: 275 LF---------KQFENKPHSIFDNFNKSVDEFFSQIESQKLDMKALQQEKSALKKLDNIK 325

Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
           +DHE R+  L++ Q  D  K  LI  N   VD A+L +R  +ANQ+ W +IE  V +AQ 
Sbjct: 326 KDHEKRIEGLQKEQETDINKGRLIELNLPLVDQALLIVRSALANQIDWTEIENLVHEAQL 385

Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK------- 229
             DPVAS I  LKL+ N ITLLLR                   DP+  S +         
Sbjct: 386 QGDPVASCITGLKLDSNMITLLLR-------------------DPYRYSDDEYDDDDDDD 426

Query: 230 --KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQT 287
             KP+ +D+D+ +SAY N++++FD K++AAKK+QKTI ++ KALKSAE+KTK+TLK+V T
Sbjct: 427 VLKPTKIDIDISMSAYGNSRKYFDKKKTAAKKEQKTIDASAKALKSAERKTKETLKEVAT 486

Query: 288 MTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA 338
              INKARK YWFEKF WFI+SENYL   G      +++   +      +V AD    ++
Sbjct: 487 AATINKARKTYWFEKFLWFITSENYLVIGGRDQQQNEMIVKRYLRPGDLYVHADLHGASS 546

Query: 339 -----PTGEFL---------------------------------------TTGSFMIRG- 353
                P+GE +                                        +G ++  G 
Sbjct: 547 CVLKNPSGEPVPPKSLNEAGTMAICNSVAWDAKVVTSAWWVYHDQVSKTAPSGEYLTTGS 606

Query: 354 -----KKNFFPPCQLAMGISFLFKLEESSISR 380
                KKN+ PP  L  G   LFKLE+ SI R
Sbjct: 607 FMIRGKKNYLPPTHLVYGFGLLFKLEDDSIER 638


>gi|47230001|emb|CAG10415.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 582

 Score =  257 bits (656), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 157/370 (42%), Positives = 206/370 (55%), Gaps = 51/370 (13%)

Query: 28  SSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
           + +GY++QK EK+P+       E+ +  + EFHP +           QH   P  EF+SF
Sbjct: 244 TGKGYIIQKSEKKPSVTPGKPCEELLTYD-EFHPFLFA---------QHSKSPYLEFDSF 293

Query: 82  AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
             AVDEFFS  ES KID+KA+Q E+ ALKKLENVK+DHE RL AL Q Q +D+ K ELI 
Sbjct: 294 DKAVDEFFSKMESQKIDMKALQLEKHALKKLENVKKDHEQRLEALHQAQEIDRIKGELIE 353

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR- 200
            N   V+ A+  +   +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NHITLLL+ 
Sbjct: 354 MNLAIVERALQVVCGALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHITLLLKN 413

Query: 201 VHKERSNIKDSWVLHHDN---ADPFADSSETKKPSLVDVDLDLSAYANAKRF-------- 249
            +    + ++  VL        +         KP LVDVDL LSAYANAK+F        
Sbjct: 414 PYISEDDEQEDDVLEETGRKNKNKKNKKFHKNKPVLVDVDLSLSAYANAKKFEKFLWFIS 473

Query: 250 -FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
             +    A + QQ+     ++ L++ E    +TL +  TM     A              
Sbjct: 474 AENYLVIAGRDQQQNEMIVKRYLRAGEPIPPRTLTEAGTMAVCYSA-------------- 519

Query: 309 SENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
                   AW+AK+VT+AWWV   QVSKTAPTGE+LTTGSFMIRGKKN+ PP  L MG  
Sbjct: 520 --------AWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFG 571

Query: 369 FLFKLEESSI 378
           FLFK+  + +
Sbjct: 572 FLFKVTLTGV 581


>gi|324502310|gb|ADY41017.1| Serologically defined colon cancer antigen 1 [Ascaris suum]
          Length = 958

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 160/412 (38%), Positives = 217/412 (52%), Gaps = 92/412 (22%)

Query: 29  SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
           S G+++ K + R +   +  +  EFHP            +Q +   ++EFE+F+  VDE+
Sbjct: 245 SHGFIIYKLDTRADGHSF-ESYQEFHPYRF---------KQFESENLREFENFSECVDEY 294

Query: 89  FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
           FS  ES + D +A+  ER+ALKKLENVKRD + R+ +LE  Q+  ++ AELI  N + VD
Sbjct: 295 FSKIESQRADQRALNAEREALKKLENVKRDQQERIESLELAQVEKRQMAELIELNSDLVD 354

Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
            A+L IR  IANQLSWE IE    +A    DP+AS I  L LN N +TL LR        
Sbjct: 355 KALLIIRSAIANQLSWEMIEEMRIKASEAGDPIASSIVGLNLNSNEMTLSLR-------- 406

Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTE 268
                      DP+ D S  KK   V +D+ LSAY N+++F   K++A  K+QKTI S+ 
Sbjct: 407 -----------DPYHDDSSPKK---VPIDIALSAYQNSRKFHSEKKAAVDKKQKTISSSA 452

Query: 269 KALKSAEKKTKQTLKDVQTMTNINKAR------KVYWF--EKFYWFISSEN--------- 311
           KALKSA+ K K+TL  V+   ++ K+R      K +WF   + Y  I   +         
Sbjct: 453 KALKSAQLKAKETLATVRAKADVVKSRRQMWFEKFFWFVSSENYLVIGGRDAQQNELLVK 512

Query: 312 -YL------------------------------------GNVA------WDAKVVTNAWW 328
            YL                                    G++A      W+AKV+  AWW
Sbjct: 513 RYLRTGDIYVHADVRGASSVVIRNKVNGGEIPPKTLNEAGSMAVCYSSSWEAKVIAAAWW 572

Query: 329 VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           V   QVS+TAPTGE+LT GSFMIRGKKNF P CQL MG   +FKL+E S+ R
Sbjct: 573 VYHHQVSRTAPTGEYLTPGSFMIRGKKNFLPSCQLQMGFGLMFKLDEDSVER 624


>gi|313211850|emb|CBY15998.1| unnamed protein product [Oikopleura dioica]
          Length = 699

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 157/446 (35%), Positives = 236/446 (52%), Gaps = 82/446 (18%)

Query: 2   VSNEDVSSD---IHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMIL 58
           V+   VS D   I +C    +  +++     QG++  K      + D ++  +++ P + 
Sbjct: 220 VATASVSQDADEILECLQNCYEFLNSGKTKFQGFISTK------TNDNVLQYVDYQPFLF 273

Query: 59  QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
                 S +         E E F+ AVD+F+   +S K + K +Q E+ A+KKLENVK D
Sbjct: 274 NQSQLDSTI---------ELEKFSLAVDKFYGEIQSQKAEQKMMQAEKSAMKKLENVKLD 324

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
           H  RL +L+  Q  +  KA+LI  N + VD+A+  +R  +A+Q+ WE+IE  +++ Q   
Sbjct: 325 HMKRLESLKLAQADNVRKAQLIEMNLDLVDSALNQVRSAVASQIGWEEIEDFLEEGQDEG 384

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS----LV 234
           DPV+  I++LKL  N I ++L       +   S    + +   +  S+   + S     +
Sbjct: 385 DPVSIAIRELKLKTNQIVMMLSEPMYDDSDSSSEEEENPSESEYTKSARVTEGSEIIIYI 444

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
            +DL LSA+ NAK F+D KR+AA K+ KTI +++KALKSAEKKT ++LK++QT+  + K 
Sbjct: 445 FLDLSLSAFGNAKAFYDSKRAAADKESKTIDASKKALKSAEKKTNESLKNIQTVRQVTKV 504

Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAK---------------------------VVTN-- 325
           RK  WFEKF+WFISSENYL     DA+                           +V N  
Sbjct: 505 RKQMWFEKFFWFISSENYLVIAGKDAQQNETIVKKYLKNGDVYVHADIHGASSCIVKNID 564

Query: 326 ----------------------AW--------W-VKADQVSKTAPTGEFLTTGSFMIRGK 354
                                 AW        W V A+QVSKTAP+GE+L+TGSFMIRGK
Sbjct: 565 PSKPVSPVTLHEVGHAAVCHSAAWNAKVLTSAWWVHANQVSKTAPSGEYLSTGSFMIRGK 624

Query: 355 KNFFPPCQLAMGISFLFKLEESSISR 380
           KN+ PP QL +G  FLFKL+++ ++R
Sbjct: 625 KNYLPPSQLVLGFGFLFKLDDACVAR 650


>gi|391330989|ref|XP_003739933.1| PREDICTED: nuclear export mediator factor NEMF homolog [Metaseiulus
           occidentalis]
          Length = 956

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 138/290 (47%), Positives = 181/290 (62%), Gaps = 36/290 (12%)

Query: 47  IVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER 106
           I +  EFHP +          +Q +G  ++ F +F  AVD FFS  E  KIDL+A   E+
Sbjct: 267 IASYNEFHPFLF---------KQLEGSRVEHFATFGQAVDAFFSMQEQQKIDLRAHNLEK 317

Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
           +A+KKLENVK DHE RL+ALE TQ  D EKA LI NN E V+ A+ A+R  +A+Q SW++
Sbjct: 318 EAVKKLENVKLDHEKRLNALEGTQRTDLEKAMLIENNLELVEKALYAVRSFVASQYSWDE 377

Query: 167 IEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS 226
           I   +K+AQ   DPVA  IK L L+ N   +LL                       ++S 
Sbjct: 378 IGHMIKEAQHMGDPVACTIKALHLDRNQFGMLL-----------------------SNSF 414

Query: 227 ETK-KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
           E    PS+VD+D+DLSAYANA+R+FD+K+ AA+KQQKTI+S+ KALKSA+KKTK+ LK V
Sbjct: 415 ENDLSPSVVDIDIDLSAYANARRYFDMKKHAARKQQKTIESSAKALKSAQKKTKEILKQV 474

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
           +  TNI + RK YWFEKF+WFISSENYL   G  A   +V+   +  K D
Sbjct: 475 ELTTNIARTRKSYWFEKFFWFISSENYLVIGGRDAQQNEVIVKKYMTKGD 524



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 41/64 (64%), Positives = 52/64 (81%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AW+AKVVT+AWWV   QV+KTAP+GE+LT GSFMIRGKKN+ PP  L MG  F+F+L+
Sbjct: 572 SAAWEAKVVTSAWWVHHHQVTKTAPSGEYLTAGSFMIRGKKNYLPPLYLIMGFGFMFRLD 631

Query: 375 ESSI 378
           E S+
Sbjct: 632 EESV 635


>gi|330841435|ref|XP_003292703.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
 gi|325077022|gb|EGC30763.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
          Length = 1084

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 142/376 (37%), Positives = 209/376 (55%), Gaps = 73/376 (19%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P + +Q++     E++SF  AVD+FFS  ES K++ + +QQE+  LKKL+ VK D + R+
Sbjct: 284 PFLYKQYESKKYIEYDSFDGAVDQFFSEIESQKLEQQRIQQEQTVLKKLDKVKEDQQRRI 343

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PVA 182
            +L   +  +  KAELI  N + VD  IL IR  +AN ++W+ +   +K+ ++ N   VA
Sbjct: 344 DSLFANEAENVRKAELIEANLQEVDQCILIIRSGVANSMNWDTLNQLLKEEKKKNPYSVA 403

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
           + I++LKL  N ITL L              L+ D+ +    +   KKP+L+DVD+ LSA
Sbjct: 404 TKIQRLKLESNQITLALT----------DGFLYDDDEEV---NKTNKKPTLIDVDISLSA 450

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           +ANA++++D K+ + +K QKTI   E ALK+AE KT+Q L +V++  ++ + RKV+WFEK
Sbjct: 451 FANARKYYDTKKQSHEKAQKTISQAEFALKAAESKTRQQLSEVKSKHSMIQMRKVFWFEK 510

Query: 303 FYWFISSENYLGNVAWDAK----------------------------------------- 321
           F+WFISS+NY+     DA+                                         
Sbjct: 511 FHWFISSDNYIVVSGRDAQQNELLFKKYLEKDDVYVHADIFGSTSCVIKNPNGGEIPPNT 570

Query: 322 ---------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
                      +NAW         WV + QVSKTAP+GE+LTTGSFMIRGKKN+ P  QL
Sbjct: 571 LIQAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGEYLTTGSFMIRGKKNYLPHSQL 630

Query: 364 AMGISFLFKLEESSIS 379
            MG  F+FK++ES I 
Sbjct: 631 VMGFGFMFKIDESCIG 646


>gi|341901167|gb|EGT57102.1| hypothetical protein CAEBREN_19463 [Caenorhabditis brenneri]
          Length = 920

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 147/393 (37%), Positives = 204/393 (51%), Gaps = 94/393 (23%)

Query: 60  PGSTPSVMRQHQGF-PI---------QEFESFAAAVDEFFSTAESHKIDLKAVQQERDAL 109
           P  T + ++ +Q F PI         +E  SF  +VDEF+S  E+ K + KA+  E+ AL
Sbjct: 261 PSPTSAPIQLYQEFNPIPMPLTSKFTKELPSFCESVDEFYSRIETQKQEQKAINMEKQAL 320

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
           KKLENV++D + R+ AL+ TQ   +  A  II NQE V+ A+L IR  +ANQ SW+ IE 
Sbjct: 321 KKLENVEKDQKDRIEALQMTQEQREHMANRIILNQELVEKALLLIRSALANQFSWQTIEE 380

Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
             K A ++ DPVA  I   K   N   + L                    DP+ D +E  
Sbjct: 381 MKKTAAKNGDPVAKSIDSFKFESNEFVMTL-------------------GDPYDDEAEIL 421

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
           K   V +D+ ++A  NA+R F  K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+ +T
Sbjct: 422 K---VPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVKIVT 478

Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
            + K+RK  WFEKF WFISSE Y+     DA+                            
Sbjct: 479 EVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVV 538

Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
                                      +NAW         WV+ +QVS+TAPTGE+L +G
Sbjct: 539 IRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVRPEQVSRTAPTGEYLPSG 598

Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           SFMIRGKKNF PP QL MG+  LF+++E SI R
Sbjct: 599 SFMIRGKKNFMPPSQLVMGLGVLFRMDEESIER 631


>gi|32565397|ref|NP_497411.2| Protein Y82E9BR.18 [Caenorhabditis elegans]
 gi|373220360|emb|CCD73050.1| Protein Y82E9BR.18 [Caenorhabditis elegans]
          Length = 921

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 153/394 (38%), Positives = 209/394 (53%), Gaps = 94/394 (23%)

Query: 57  ILQPGSTPSVMRQHQGF-PI---------QEFESFAAAVDEFFSTAESHKIDLKAVQQER 106
           +L P STP  ++ +Q F PI         +E  SF  AVDEF+S  E+ K + KAV  E+
Sbjct: 262 VLSPISTP--IQIYQDFNPISMEFTAKLSKELSSFCEAVDEFYSRIETQKQEQKAVNMEK 319

Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
            ALKKLENV++D + R+ AL+ TQ   ++ A  II N E V+ A+L IR  +ANQ SW+ 
Sbjct: 320 QALKKLENVEKDQKDRIEALQLTQSQREQMANRIILNTELVEKALLLIRSALANQFSWQT 379

Query: 167 IEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS 226
           IE   K A  + DPVA  I   K   N   + L                   ADP+ D +
Sbjct: 380 IEEMRKTAAGNGDPVAKSIDSFKFENNEFMMSL-------------------ADPYDDEA 420

Query: 227 ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
           E  K   V +D+ L+A  NA+R F  K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+
Sbjct: 421 EVLK---VPIDISLNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVK 477

Query: 287 TMTNINKARKVYWFEKFYWFISSENY---------------------------------- 312
            +  + K+RK  WFEKF WFISSE +                                  
Sbjct: 478 IVVEVKKSRKSMWFEKFRWFISSEGFIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGAS 537

Query: 313 ---LGNVAWDAKV--------------VTNAW---------WVKADQVSKTAPTGEFLTT 346
              + N ++DA++               +NAW         WV  DQVS+TAPTGE+L +
Sbjct: 538 SVVIRNKSFDAEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPDQVSRTAPTGEYLPS 597

Query: 347 GSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           GSFMIRGKKNF PP QL MG+  LF+++E SI R
Sbjct: 598 GSFMIRGKKNFMPPSQLVMGLGILFRMDEESIER 631


>gi|281200297|gb|EFA74518.1| DUF814 family protein [Polysphondylium pallidum PN500]
          Length = 1134

 Score =  233 bits (595), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 149/418 (35%), Positives = 221/418 (52%), Gaps = 73/418 (17%)

Query: 22  VSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
           + N+ + S+GY+V +  K P   D  V   EF P +          +Q+   P QE++SF
Sbjct: 225 IFNVTMQSRGYIVLRPPKTPT--DTTVIYEEFVPFLY---------KQYHSKPNQEYDSF 273

Query: 82  AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
             AVD+FFS  E+ +++ + + QE+  LKKL+ V+ D + R+ +L   +  +  KA+LI 
Sbjct: 274 DQAVDQFFSEIEAQRVEQQRIAQEQTVLKKLDKVREDQQRRIDSLFAAEADNVRKAQLIE 333

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PVASIIKQLKLNINHITLLLR 200
            N + VD  I  I+  +   + W  +   +K+ ++ N   VA+II +LKL  N I L L 
Sbjct: 334 ANLQEVDQCITIIKSGVNASMDWTALSQLLKEEKKKNPYSVANIIHKLKLESNQIQLALN 393

Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQ 260
            + +    +D      +         + KKP+LVDV++ L+AYANA+ ++D K+ A +K 
Sbjct: 394 DNYDDDYDEDEDDDEEEEKK--QQKKDKKKPTLVDVNIALTAYANAREYYDSKKHANEKA 451

Query: 261 QKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
            KTIQ  E A+K+AEKKT+Q L +V+  + + + RKV+WFEKF+WF+SS+NYL     DA
Sbjct: 452 NKTIQQAEFAMKAAEKKTRQQLSEVKAKSAMIQMRKVFWFEKFHWFLSSDNYLVISGKDA 511

Query: 321 K--------------------------------------------------VVTNAW--- 327
           +                                                    +NAW   
Sbjct: 512 QQNEMLFKKYLEKDDIYVHADIFGSTSCVIKNHGGGAIPPNTLIQAGTMTMCYSNAWSAK 571

Query: 328 ------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
                 WV A+QVSKTAP+GEFLTTGSFMIRGKKN+ P  QL MG  F+FKL+ES I+
Sbjct: 572 VVTSAYWVYANQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVMGFGFMFKLDESCIA 629


>gi|440797731|gb|ELR18808.1| isoform 2 of serologically defined colon cancer antigen 1 family
           protein [Acanthamoeba castellanii str. Neff]
          Length = 1138

 Score =  233 bits (595), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 139/369 (37%), Positives = 195/369 (52%), Gaps = 81/369 (21%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P V +Q +G P  EF SF  AVD FFS A+  ++++K  QQE+        VK+DHETR+
Sbjct: 330 PYVYKQFEGRPRLEFPSFDEAVDIFFSKAQEQQVEVKKEQQEK-------TVKKDHETRI 382

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
           +AL + +    +KA LI  N   VDAAI     ++A  + W  +   VK+A++  DP+A+
Sbjct: 383 AALTKAEEECIKKAHLIETNVSDVDAAIKVTCSELARGMDWAQLTRVVKEAKKAGDPIAN 442

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDL 240
           +I  L    N ITLLL                    DP    AD+S   +   V+VD+  
Sbjct: 443 LIHSLDFANNRITLLL-------------------VDPLEAAADASGAMQK--VEVDIGQ 481

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +AYANA+ F+   +  A K  KT+ S++ A+K+AEKK ++ +KDV     I K RK YWF
Sbjct: 482 TAYANAQEFYAEAKRRAHKHAKTVASSQMAVKAAEKKARREIKDVGVKAAIQKVRKAYWF 541

Query: 301 EKFYWFISSENYLGNVAWDAK--------------------------------------- 321
           EKF+WFISSENY+     DA+                                       
Sbjct: 542 EKFHWFISSENYVVISGRDAQQNELIVKRYLRKGDAYVHADLHGAATCVVKNPHPDKPIP 601

Query: 322 -----------VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
                      + T+AWWV  +QVSKTAP+GE+L TGSFMIRGKKNF PP QL MG +++
Sbjct: 602 ALTLAEAGSMTIPTSAWWVHPEQVSKTAPSGEYLVTGSFMIRGKKNFLPPSQLVMGFAYM 661

Query: 371 FKLEESSIS 379
           FK++ +S++
Sbjct: 662 FKVDPTSVA 670


>gi|393907053|gb|EJD74501.1| serologically defined colon cancer antigen 1 [Loa loa]
          Length = 1568

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 153/416 (36%), Positives = 213/416 (51%), Gaps = 98/416 (23%)

Query: 28  SSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           ++QG+LV +++ R +    E Y     EFHP +           Q      + F+SF+  
Sbjct: 244 AAQGFLVYREDARMDGVIVETY----QEFHPFMFS---------QFSDMQTKHFDSFSEC 290

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDEFFS  E  K D+KA+  E++A+KKL NV +D + R++AL+  QL  +E AELI  N 
Sbjct: 291 VDEFFSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNS 350

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           + VD A+L IR  IANQLSWE IE     A    +P+A+ I  L LN N +TLLLR    
Sbjct: 351 DLVDKALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR---- 406

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                          DP+    + KK   V +D+ LS+Y NA++    K++A +K+QKTI
Sbjct: 407 ---------------DPYRPEIDPKK---VTIDIALSSYQNARKLHTEKKAAQQKEQKTI 448

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKV------YWF--EKFYWFISSEN----- 311
            ++ KALKS + K K+TL  V +   + K R+V      +WF   + Y  I   +     
Sbjct: 449 CASSKALKSTKVKIKETLNVVHSKAEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNE 508

Query: 312 -----YL--GNV----------------------------------------AWDAKVVT 324
                YL  G++                                        AW+AKV +
Sbjct: 509 LLVKRYLRPGDIYMHADTRGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTS 568

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
            AWWV   QVS+TAPTGE+LT GSFMIRGKKN+ P CQL MG   +F+L+E S+ R
Sbjct: 569 AAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLER 624


>gi|449681046|ref|XP_002157080.2| PREDICTED: nuclear export mediator factor NEMF-like, partial [Hydra
           magnipapillata]
          Length = 1467

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 134/297 (45%), Positives = 184/297 (61%), Gaps = 35/297 (11%)

Query: 26  NVSS---QGYLVQKKEKRPNS--EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N+SS   +GYL+QKKE   N+  E  ++   EFHP +           QH+  P   F S
Sbjct: 699 NISSKEGEGYLIQKKESNANAVGEKDLLIYTEFHPFLYH---------QHKSLPFIHFHS 749

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F   VDEFFS  ES KIDLKA+QQE+ ALK+LENV+ DHE R+ +L++TQ  +  +A+LI
Sbjct: 750 FNKCVDEFFSKLESQKIDLKALQQEKAALKRLENVREDHEKRIHSLKETQDKEARRAKLI 809

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N   ++ AI+ +   IANQL WE+IE  +K+A+   DPVA+IIK L+L  N IT+ + 
Sbjct: 810 ELNLPLIERAIIIVNSAIANQLDWEEIEDLLKEAKLKGDPVANIIKSLQLKTNQITISVN 869

Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF----DLKRSA 256
             +E                      +      + +DL L+A+ NA+R++    D +R+A
Sbjct: 870 DEEET-----------------ESDEDDLDEVDIIIDLGLTAFGNARRYYYILHDKRRNA 912

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           A K++KTIQ+++KALKSAE KTK+TLK+VQ    INK RK +WFEKFYWFISSENYL
Sbjct: 913 ATKEEKTIQASKKALKSAEYKTKETLKEVQNAKIINKTRKTFWFEKFYWFISSENYL 969



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 47/64 (73%), Positives = 55/64 (85%)

Query: 317  AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
            AW+A+V+T+AWWV  +QVSKTAP+GE+LTTGSFMIRGKKNF PP  L MG S LFKL+ES
Sbjct: 1032 AWEARVITSAWWVYHNQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFSVLFKLDES 1091

Query: 377  SISR 380
             ISR
Sbjct: 1092 CISR 1095


>gi|312082754|ref|XP_003143575.1| serologically defined colon cancer antigen 1 [Loa loa]
          Length = 899

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 153/416 (36%), Positives = 213/416 (51%), Gaps = 98/416 (23%)

Query: 28  SSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           ++QG+LV +++ R +    E Y     EFHP +           Q      + F+SF+  
Sbjct: 244 AAQGFLVYREDARMDGVIVETY----QEFHPFMFS---------QFSDMQTKHFDSFSEC 290

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDEFFS  E  K D+KA+  E++A+KKL NV +D + R++AL+  QL  +E AELI  N 
Sbjct: 291 VDEFFSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNS 350

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           + VD A+L IR  IANQLSWE IE     A    +P+A+ I  L LN N +TLLLR    
Sbjct: 351 DLVDKALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR---- 406

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                          DP+    + KK   V +D+ LS+Y NA++    K++A +K+QKTI
Sbjct: 407 ---------------DPYRPEIDPKK---VTIDIALSSYQNARKLHTEKKAAQQKEQKTI 448

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKV------YWF--EKFYWFISSEN----- 311
            ++ KALKS + K K+TL  V +   + K R+V      +WF   + Y  I   +     
Sbjct: 449 CASSKALKSTKVKIKETLNVVHSKAEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNE 508

Query: 312 -----YL--GNV----------------------------------------AWDAKVVT 324
                YL  G++                                        AW+AKV +
Sbjct: 509 LLVKRYLRPGDIYMHADTRGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTS 568

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
            AWWV   QVS+TAPTGE+LT GSFMIRGKKN+ P CQL MG   +F+L+E S+ R
Sbjct: 569 AAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLER 624


>gi|328723949|ref|XP_001945685.2| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Acyrthosiphon pisum]
          Length = 987

 Score =  231 bits (588), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 134/323 (41%), Positives = 193/323 (59%), Gaps = 36/323 (11%)

Query: 6   DVSSDIHK---CGPTLHVLVSNLNVSSQGYLVQKKEKR--PNS-EDYIVANMEFHPMILQ 59
           ++ +DI K   C       + N+    +G+++QK +++  P+     +  N E+HP +  
Sbjct: 220 NIDTDIQKLMNCFCIAEKFLDNITTLKEGFIIQKIDQQLLPDGIMKELCTNQEYHPFLFA 279

Query: 60  PGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
                    QHQ  P + +ESF  AVDEF+S  ES K D+K +QQE+ A+KKL+N+ +DH
Sbjct: 280 ---------QHQKLPSKTYESFNEAVDEFYSNLESQKYDVKCMQQEKGAVKKLQNIVKDH 330

Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
           E RL  L+ TQ   K KAELI NN + VD  I  +RQ +A QL W++I   ++Q    +D
Sbjct: 331 EERLKKLQDTQDEHKFKAELITNNLDLVDNTIQFVRQAVAKQLHWDEIWDMIRQLNFEDD 390

Query: 180 PVA-SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS-LVDVD 237
               +I+K LKL++NHITL L                    DP+ + ++ ++ S L+D+D
Sbjct: 391 GCTYAIVKNLKLSVNHITLQL-------------------FDPYNEENKNEENSQLIDID 431

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
           L  SA+ NA+R++  K+ +A K++KTI S+   LK AEKKTKQTLKD+Q + +INK RK 
Sbjct: 432 LGQSAFGNAERYYGSKKQSAIKEKKTIDSSSTVLKMAEKKTKQTLKDMQVVASINKVRKT 491

Query: 298 YWFEKFYWFISSENYLGNVAWDA 320
           YWFEKFYWFISSENYL     DA
Sbjct: 492 YWFEKFYWFISSENYLVIAGRDA 514



 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/66 (68%), Positives = 55/66 (83%), Gaps = 1/66 (1%)

Query: 316 VAWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           V+W  K+ + NA+WVK +QVSKTAPTGE+LTTGSFMIRGKKN+ P   L +G+SFLFKLE
Sbjct: 569 VSWTMKINLQNAFWVKPEQVSKTAPTGEYLTTGSFMIRGKKNYLPATHLILGLSFLFKLE 628

Query: 375 ESSISR 380
           +SSI R
Sbjct: 629 DSSIPR 634


>gi|268571229|ref|XP_002640975.1| Hypothetical protein CBG11722 [Caenorhabditis briggsae]
          Length = 894

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 145/391 (37%), Positives = 200/391 (51%), Gaps = 94/391 (24%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           EF+P+     S P  ++  +     E  SF  +VDEF+S  E+ K + KAV  E+ ALKK
Sbjct: 248 EFNPI-----SMPLTLKLQK-----ELRSFCESVDEFYSRIETQKQEQKAVNMEKQALKK 297

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LENV++D + R+ AL+ TQ   +  A  II NQE V+ A+L IR  +ANQ SW+ IE   
Sbjct: 298 LENVEKDQKERIEALQLTQEQREHMANRIILNQELVEKALLLIRSALANQFSWQTIEEMR 357

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           K A  + DPVA  I   K   N   + L                    DP+ + +E  K 
Sbjct: 358 KSAAANGDPVAKSIDSFKFENNEFFMKL-------------------GDPYDEEAELLK- 397

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
             V +D+ ++A  NA+R F  K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+ +T +
Sbjct: 398 --VPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKCTLEQVKIVTEV 455

Query: 292 NKARKVYWFEKFYWFISSENYLGNVAWDAK------------------------------ 321
            K+RK  WFEKF WFISSE Y+     DA+                              
Sbjct: 456 KKSRKTMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVIIR 515

Query: 322 -----------------------VVTNAW---------WVKADQVSKTAPTGEFLTTGSF 349
                                    +NAW         WV   QV++TAPTGE+L +GSF
Sbjct: 516 NKSFEESMEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPSQVTRTAPTGEYLPSGSF 575

Query: 350 MIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           MIRGKKNF PP QL MG+  LF+++E SI R
Sbjct: 576 MIRGKKNFMPPSQLVMGLGILFRMDEESIER 606


>gi|66804841|ref|XP_636153.1| DUF814 family protein [Dictyostelium discoideum AX4]
 gi|60464500|gb|EAL62645.1| DUF814 family protein [Dictyostelium discoideum AX4]
          Length = 1268

 Score =  230 bits (587), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 141/397 (35%), Positives = 210/397 (52%), Gaps = 73/397 (18%)

Query: 56  MILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENV 115
           M++     P + +Q++     EFESF AAVD+FFS  ES K++ + + QE+  LKKL+ V
Sbjct: 300 MVIYEEFVPYLYKQYELKKFIEFESFDAAVDQFFSEIESQKVEQQRIAQEQVVLKKLDKV 359

Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
           K D + R+ +L   ++ +  KA+LI  N + VD  IL IR  +A+ + WE +   +K+ +
Sbjct: 360 KEDQQRRIDSLFANEVENIRKAQLIEANLQEVDQCILIIRSGVASSMDWETLNQLLKEEK 419

Query: 176 RHND-PVASIIKQLKLNINHITLLL-------------RVHKERSNIKDSWVLHHDNADP 221
           + N   VA+ I +LKL  N ITL L                 E S+ +      +     
Sbjct: 420 KKNPYSVATKIHRLKLESNQITLSLTDNFLYDDNDGDDDDDDEESDEESDEEDQNTKKSI 479

Query: 222 FADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
               +  +KP+L+DVD+ LSA+ANA++++D K+ + +K QKTI   E ALK+AEKKT+Q 
Sbjct: 480 KKSKTSNQKPNLIDVDISLSAFANARKYYDTKKQSHEKAQKTISQAEFALKAAEKKTRQQ 539

Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK-------------------- 321
           L + ++  ++   RK++WFEKF+WFISS+NY+     DA+                    
Sbjct: 540 LSETKSKNSMIAMRKIFWFEKFHWFISSDNYIVVSGRDAQQNELLYKKYLEKDDIYVHAD 599

Query: 322 ------------------------------VVTNAW---------WVKADQVSKTAPTGE 342
                                           +NAW         WV + QVSKTAP+GE
Sbjct: 600 IFGSTSCVIKNPNGGEIPPNTLIQAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGE 659

Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           FLTTGSFMIRGKKN+ P  QL MG  F+FK+++S + 
Sbjct: 660 FLTTGSFMIRGKKNYLPHSQLVMGFGFMFKIDDSCLG 696


>gi|225432064|ref|XP_002273922.1| PREDICTED: nuclear export mediator factor Nemf-like [Vitis
           vinifera]
          Length = 1110

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 144/377 (38%), Positives = 199/377 (52%), Gaps = 76/377 (20%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P ++ Q +     +FE+F AA+DEF+S  ES + + +   +E  A++KL  ++ D E R
Sbjct: 333 CPILLNQFKSREFVKFETFDAALDEFYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENR 392

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L++      + AELI  N E VDAAILA+R  +AN ++WED+   VK+ ++  +PVA
Sbjct: 393 VHTLKKEVDHCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEEKKSGNPVA 452

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I +L L  N +TLLL                 +N D   D  +T     V+VDL LSA
Sbjct: 453 GLIDKLYLERNCMTLLLS----------------NNLDEMDDDEKTLPVDKVEVDLALSA 496

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           +ANA+R+++ K+    KQ+KT+ + EKA K+AEKKT+  L   +T+  I+  RKV+WFEK
Sbjct: 497 HANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQLSQEKTVATISHMRKVHWFEK 556

Query: 303 FYWFISSENYLGNVAWDAK---------------------------VVTN---------- 325
           F WFISSENYL     DA+                           V+ N          
Sbjct: 557 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLHGASSTVIKNHKPEHPVPPL 616

Query: 326 --------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
                         AW         WV   QVSKTAPTGE+LT GSFMIRGKKNF PP  
Sbjct: 617 TLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 676

Query: 363 LAMGISFLFKLEESSIS 379
           L MG   LF+L+ESS+ 
Sbjct: 677 LIMGFGLLFRLDESSLG 693


>gi|168034467|ref|XP_001769734.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679083|gb|EDQ65535.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1100

 Score =  227 bits (579), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 147/388 (37%), Positives = 200/388 (51%), Gaps = 85/388 (21%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           EF P+ L+     +VMR          E++ AA+DEFFS  E  + + +   QE  A  K
Sbjct: 315 EFSPLHLKQFDDRTVMR---------METYDAALDEFFSKIEGQRAEQQRKAQEDSAFSK 365

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+ ++ D   R+  L+Q        AELI  N E VD AILA+R  +A+ + W+D+   +
Sbjct: 366 LDKIRADQTQRVEVLKQEVDQTVRMAELIEYNLEDVDNAILAVRSTVASGMDWKDLARMI 425

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           K+ ++  +PVA +I  L+L  N ITLLL                 +N D   D  +T+  
Sbjct: 426 KEEKKAGNPVAGLIHSLQLEKNQITLLL----------------SNNLDDMDDDEKTQPV 469

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
           S VDVD+ LSA+ANA+R+F+ K+  A KQ KT  + EKA K+AEKKT Q L   +++  I
Sbjct: 470 SKVDVDIGLSAHANARRWFEQKKKHAVKQDKTKAAHEKAFKAAEKKTLQQLAQAKSVAAI 529

Query: 292 NKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------VVT 324
           +  RKV+WFEKF WF+SSENYL     DA+                           V+ 
Sbjct: 530 SHMRKVHWFEKFNWFVSSENYLIISGRDAQQNELVVKRYMRKGDLYVHADLHGASSTVIQ 589

Query: 325 N------------------------AW--------W-VKADQVSKTAPTGEFLTTGSFMI 351
           N                        AW        W V+A QVSKTAPTGE+LT GSFM+
Sbjct: 590 NHNPLYPIPPLTINQAGVFTVCRSQAWDSKIVTSAWWVEAHQVSKTAPTGEYLTVGSFMV 649

Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
           RGKKNF PP  L MG   LF+L++SSI+
Sbjct: 650 RGKKNFLPPNPLVMGFGVLFRLDDSSIA 677


>gi|328864957|gb|EGG13343.1| DUF814 family protein [Dictyostelium fasciculatum]
          Length = 1244

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 140/384 (36%), Positives = 209/384 (54%), Gaps = 67/384 (17%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P + +Q+Q     E++SF  AVD+FFS  ES K++ + + QE+  LKKL+ V+ D + R
Sbjct: 348 VPYLYKQYQDKKYLEYDSFDLAVDQFFSEIESQKVEQQRMSQEQTVLKKLDKVREDQQRR 407

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PV 181
           + +L  ++  + +KA+LI +N + VD  IL IR  +A  + W ++   +K+ ++ N   V
Sbjct: 408 IDSLYASEGENIKKAQLIESNLQDVDQCILIIRSGVAASMDWGNLNQLLKEEKKKNPYSV 467

Query: 182 ASIIKQLKLNINHITL-LLRVH------KERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
           A+ I +LKL+ N ITL L  +H      +E  +         +           KKP  +
Sbjct: 468 ANKIHKLKLDTNQITLSLTDLHLDDDEDEEDEDENSDDDSEDEEKKKKNQKKNAKKPVFI 527

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
           DVD+ LSAYANA+ F+D K+ + +K +KTIQ  + ALK+AEKK +Q L +V+T +++ + 
Sbjct: 528 DVDISLSAYANARNFYDSKKQSHEKAEKTIQQADFALKAAEKKARQQLSEVKTKSSMQQM 587

Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAK--------------------------------- 321
           RKV+WFEKF+WFISS+NY+     DA+                                 
Sbjct: 588 RKVFWFEKFHWFISSDNYIVISGKDAQQNELLFKKYLDKDDVYVHADIFGSTSCVIKNPK 647

Query: 322 -----------------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKK 355
                              +NAW         WV + QVSKTAP+GEFLTTGSFMIRGKK
Sbjct: 648 GGEIPPNTLIQAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGEFLTTGSFMIRGKK 707

Query: 356 NFFPPCQLAMGISFLFKLEESSIS 379
           N+ P  QL MG  F+FK+++S I+
Sbjct: 708 NYLPHSQLVMGFGFMFKIDDSCIA 731


>gi|167516076|ref|XP_001742379.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779003|gb|EDQ92617.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1051

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 149/401 (37%), Positives = 205/401 (51%), Gaps = 75/401 (18%)

Query: 55  PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
           P+++     P +M+Q +  P     SF  A+D +FS  E+ K+ ++A+QQE  ALKKLE 
Sbjct: 268 PVLVLDDVIPFMMKQFEDRPHIHLPSFDRAIDRYFSELETQKLQMRAMQQEAAALKKLEA 327

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           VK  HE  +      Q  ++ KA+++  N E VD AI  IR  +AN+L W +I   VK+A
Sbjct: 328 VKASHEKHVEGYRLAQEANERKAQVLEANLEQVDRAIEIIRSMVANKLDWVEIAELVKEA 387

Query: 175 QRHNDPVASIIKQLKLNINHITLLL---RVHKERSNIKDSWVLHHDN------------A 219
           Q+  DP A II  LKL+ NH+T+ L     H E S    S     +             A
Sbjct: 388 QQQGDPDARIIDGLKLDKNHMTIRLPNPEAHAESSESDSSSASDSEEEEEEEEQKAIAAA 447

Query: 220 DPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
                +S    P L +D+DL L+AYANA   +  K+ +A K+QK   +TE A++SAE+KT
Sbjct: 448 SKKRGTSSATDPFLTIDLDLALTAYANACNMYQHKKISAVKEQKARDATELAIQSAERKT 507

Query: 279 KQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL--------------------GNV-- 316
           +Q L+     T +NK RK+YWFEKF WFISSENYL                    G+V  
Sbjct: 508 QQQLQQNNVTTAVNKQRKIYWFEKFLWFISSENYLVIGGRDRQQNEILVRRYLKKGDVYV 567

Query: 317 ---------------------------AWDAKVVTNAWW----------VKADQVSKTAP 339
                                      A    V+ +  W          V  DQVSKTAP
Sbjct: 568 HADLHGAASVIVKNPRGGDVPPITLQEAGHMAVIYSGSWEARMPTSAWWVHHDQVSKTAP 627

Query: 340 TGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
            GE+L+TGSFMIRGKKN+ P  +L MG + LFK++E S++R
Sbjct: 628 AGEYLSTGSFMIRGKKNYLPKVELVMGFAILFKVDEGSVAR 668


>gi|308480173|ref|XP_003102294.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
 gi|308262220|gb|EFP06173.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
          Length = 917

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 143/394 (36%), Positives = 200/394 (50%), Gaps = 92/394 (23%)

Query: 57  ILQPGSTP-SVMRQHQGFPI-------QEFESFAAAVDEFFSTAESHKIDLKAVQQERDA 108
           IL P S P  +  +    P+       +E  SF  +VDEF+S  E+ K + KAV  E+ A
Sbjct: 256 ILSPTSQPIQLYNEFNPIPMPFTSKLQKELPSFCESVDEFYSRIETQKQEQKAVNMEKQA 315

Query: 109 LKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIE 168
           LKKLENV++D + R+ AL+ TQ   +  A  II NQ+ V+ A+L IR  +ANQ SW+ IE
Sbjct: 316 LKKLENVEKDQKERIEALQLTQEQREHMANRIILNQDLVEKALLLIRSALANQFSWQTIE 375

Query: 169 ARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSET 228
              K A  + D VA  I   +   N   + L                    DP+ + +E 
Sbjct: 376 EMRKNAAMNGDLVAKSIDSFRFENNEFFMNL-------------------GDPYDEEAEL 416

Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM 288
            K   V +D+ ++A  NA+R F  K+SAA+K +KT+ S+EKA+K+A++K K TL+ V+ +
Sbjct: 417 LK---VPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVKIV 473

Query: 289 TNINKARKVYWFEKFYWFISSENYLGNVAWDAK--------------------------- 321
           T + K+RK  WFEKF WFISSE Y+     DA+                           
Sbjct: 474 TEVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSV 533

Query: 322 --------------------------VVTNAW---------WVKADQVSKTAPTGEFLTT 346
                                       +NAW         WV  +QVS+TAPTGE+L +
Sbjct: 534 IIRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPNQVSRTAPTGEYLPS 593

Query: 347 GSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           GSFMIRGKKNF PP QL MG+  LF++++ SI R
Sbjct: 594 GSFMIRGKKNFMPPSQLVMGLGVLFRMDDESIER 627


>gi|256080624|ref|XP_002576579.1| hypothetical protein [Schistosoma mansoni]
 gi|353229334|emb|CCD75505.1| hypothetical protein Smp_052790 [Schistosoma mansoni]
          Length = 1009

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 139/424 (32%), Positives = 217/424 (51%), Gaps = 81/424 (19%)

Query: 18  LHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
           L   + +   +  GY+  K  +  +S++ +    EFHP + +         Q++  P   
Sbjct: 268 LEPFLCDRQATPHGYIFGKSYQ--SSDEGLRNYEEFHPFMFE---------QYRDKPHLA 316

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F+SF  AVD +FS  ES K   +  + E+ A +K+EN+K+D E RL  L+  Q +D  KA
Sbjct: 317 FDSFNKAVDAYFSKIESQKTLEQISRNEQKASRKVENIKKDQERRLMLLKTEQELDMRKA 376

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
            L+  N+  VD  I+ I   ++NQ+ W+++E  V+ A++ +DP+A  I +LKL  +   +
Sbjct: 377 YLLEANRRLVDNIIIMINHALSNQIDWKELELIVEDAKQRDDPLACHIVELKLQTSQAVI 436

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
            L+   E S+  D  ++   N D + +         V VD+D++A  NA++++D KR+A+
Sbjct: 437 RLKDPFESSSDVDETLVRSGNKDEYTE---------VVVDIDVNALTNARKYYDKKRAAS 487

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVA 317
           KK++KTI  + K LKSA    +  +K  +T+  I + RK  WFEKF+WFISSENYL    
Sbjct: 488 KKEEKTINVSRKVLKSAIHNAEIKMKTAKTVAQITEVRKPMWFEKFFWFISSENYLVVAG 547

Query: 318 WDAK----------------------------------------------------VVTN 325
            D++                                                    V+++
Sbjct: 548 HDSQQNEVLVKRYLKPGDLFVHADIHGASTVIIKARHLTSEEVDSPNHQEAGNMAVVLSS 607

Query: 326 AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW         WV  DQVSKTAP+GE+LT+G+FMIRGKKN+ PPC    G   +FKL E 
Sbjct: 608 AWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGAFMIRGKKNYLPPCPFDYGFGIMFKLHED 667

Query: 377 SISR 380
           SI++
Sbjct: 668 SIAK 671


>gi|344257308|gb|EGW13412.1| Serologically defined colon cancer antigen 1-like [Cricetulus
           griseus]
          Length = 554

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 137/311 (44%), Positives = 171/311 (54%), Gaps = 73/311 (23%)

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR-- 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR  
Sbjct: 2   NLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNP 61

Query: 201 --VHKERSNIKDSWVLHHD----------NADPFADSSETKKPSLVDVDLDLSAYANAKR 248
             + +E  +  D+ V   D            +      +  KP LVDVDL LSAYANAK+
Sbjct: 62  YLLSEEEDDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 121

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
           ++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T+I KARKVYWFEKF WFIS
Sbjct: 122 YYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 181

Query: 309 SENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTGE------------ 342
           SENYL   G      +++   +      +V AD    T+     PTGE            
Sbjct: 182 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGEPIPPRTLTEAGT 241

Query: 343 ---------------------------------FLTTGSFMIRGKKNFFPPCQLAMGISF 369
                                            +LTTGSFMIRGKKNF PP  L MG SF
Sbjct: 242 MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 301

Query: 370 LFKLEESSISR 380
           LFK++ES I R
Sbjct: 302 LFKVDESCIWR 312


>gi|356529076|ref|XP_003533123.1| PREDICTED: nuclear export mediator factor Nemf-like [Glycine max]
          Length = 1131

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 147/407 (36%), Positives = 211/407 (51%), Gaps = 87/407 (21%)

Query: 44  EDYIVA---NMEFHPMILQPGS--------TPSVMRQHQGFPIQEFESFAAAVDEFFSTA 92
           E YI+    NM     I QPGS         P ++ Q +     +FE+F AA+DEF+S  
Sbjct: 288 EGYILMQNKNMGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDEFYSKI 347

Query: 93  ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
           ES + + +   +E  A +KL  +++D E R+ AL +      + AELI  N E VDAAIL
Sbjct: 348 ESQRSEQQQKAKENSASQKLNRIRQDQENRVHALRKEADHCVKMAELIEYNLEDVDAAIL 407

Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSW 212
           A+R  +A  ++W+D+   VK+ ++  +PVA +I +L L+ N +TLLL             
Sbjct: 408 AVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLDRNCMTLLL------------- 454

Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
               +N D   D  +T     V+VDL LSA+ANA+R+++ K+    KQ KT+ + EKA K
Sbjct: 455 ---SNNLDEMDDDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQGKTVTAHEKAFK 511

Query: 273 SAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISSEN----------YL- 313
           +AE+KT+  L   +T+ +I+  RKV+      WF   + Y  IS  +          Y+ 
Sbjct: 512 AAERKTRLQLNQEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 571

Query: 314 -GNV----------------------------------------AWDAKVVTNAWWVKAD 332
            G++                                        AWD+K+VT+AWWV   
Sbjct: 572 KGDLYIHADLHGASSTVIKNHKPAQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPH 631

Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ESS+ 
Sbjct: 632 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 678


>gi|302768961|ref|XP_002967900.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
 gi|300164638|gb|EFJ31247.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
          Length = 1083

 Score =  220 bits (561), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 149/421 (35%), Positives = 218/421 (51%), Gaps = 89/421 (21%)

Query: 22  VSNLNVSSQGYLV---QKKEKRPNSEDYIVANM-EFHPMILQPGSTPSVMRQHQGFPIQE 77
           V+  + + +GY+     K  K+ N+E        EF P++L+     S  R++      +
Sbjct: 281 VTTGDFTPEGYITFHPNKTAKKKNAESAEEKMFDEFSPLLLKQ----SAHREYV-----K 331

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F++F AA+DEFFS  E  ++D +   QE  A  KLE ++ D  +R+ +L++        A
Sbjct: 332 FDTFDAALDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRADQRSRVESLKREVDQAVHTA 391

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           ELI  N   VD AI A+R  +AN + W+D+   +K+ ++  +PVA +I  L+L  NHITL
Sbjct: 392 ELIEYNLADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHITL 451

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           LL                 +N D   D  +TK    V+VDL LSA+ANA+++FD+K+  A
Sbjct: 452 LL----------------SNNLDDMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQA 495

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISS 309
            KQ+KT+ + EKA K+AE+KT+Q L   + +  I+  RKV+      WF   + Y  IS 
Sbjct: 496 LKQEKTVAAHEKAFKAAERKTQQQLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISG 555

Query: 310 EN----------YL--GNV----------------------------------------A 317
            +          Y+  G++                                        A
Sbjct: 556 RDAQQNEQIVKRYMKKGDLYVHADLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQA 615

Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           WD+K++T+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG    F+L+ESS
Sbjct: 616 WDSKIITSAWWVYDHQVSKTAPTGEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESS 675

Query: 378 I 378
           I
Sbjct: 676 I 676


>gi|302761200|ref|XP_002964022.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
 gi|300167751|gb|EFJ34355.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
          Length = 1052

 Score =  220 bits (561), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 147/421 (34%), Positives = 216/421 (51%), Gaps = 89/421 (21%)

Query: 22  VSNLNVSSQGYLV---QKKEKRPNSEDYIVANM-EFHPMILQPGSTPSVMRQHQGFPIQE 77
           V+  + + +GY+     K  K+ N+E        EF P++L+  +    ++         
Sbjct: 267 VTTGDFTPEGYITFHPNKTAKKKNAESAEEKMFDEFSPLLLKQSAHREYIK--------- 317

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F++F AA+DEFFS  E  ++D +   QE  A  KLE ++ D  +R+ +L++        A
Sbjct: 318 FDTFDAALDEFFSKIEGQRLDQQRKTQEDSAFSKLEKIRADQRSRVESLKREVDQAVHTA 377

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           ELI  N   VD AI A+R  +AN + W+D+   +K+ ++  +PVA +I  L+L  NHITL
Sbjct: 378 ELIEYNLADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHITL 437

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           LL                 +N D   D  +TK    V+VDL LSA+ANA+++FD+K+  A
Sbjct: 438 LL----------------SNNLDDMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQA 481

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY------WF--EKFYWFISS 309
            KQ+KT+ + EKA K+AE+KT+Q L   + +  I+  RKV+      WF   + Y  IS 
Sbjct: 482 LKQEKTVAAHEKAFKAAERKTQQQLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISG 541

Query: 310 EN----------YL--GNV----------------------------------------A 317
            +          Y+  G++                                        A
Sbjct: 542 RDAQQNEQIVKRYMKKGDLYVHADLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQA 601

Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           WD+K++T+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG    F+L+ESS
Sbjct: 602 WDSKIITSAWWVYDHQVSKTAPTGEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESS 661

Query: 378 I 378
           I
Sbjct: 662 I 662


>gi|260803888|ref|XP_002596821.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
 gi|229282081|gb|EEN52833.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
          Length = 834

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 133/288 (46%), Positives = 173/288 (60%), Gaps = 38/288 (13%)

Query: 30  QGYLVQKKEKRPNSE----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           QGY+VQK+EK+P  +    + ++   EFHP            +QH+  P  EF SF  AV
Sbjct: 88  QGYIVQKREKKPKQDGGPAEELLTYAEFHPFQF---------KQHEKSPCVEFPSFNKAV 138

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEFFS  ES ++DLKA+QQE+ A+KKLENVK+DHE RL  L++ Q  DK KA+LI  N +
Sbjct: 139 DEFFSQLESQRLDLKALQQEKVAIKKLENVKKDHERRLETLQKVQDEDKHKAQLIELNLD 198

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AIL +R  IANQ+ W +I   VK+AQ   DPVAS IK LKL+ NHITL+LR     
Sbjct: 199 LVDKAILVVRSAIANQIDWTEIWDIVKEAQAQGDPVASTIKSLKLDSNHITLVLR--NPF 256

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
           S  +       D A    ++S + +P  +D+DL LSAYANAK+++D KR AAKK+QKTI 
Sbjct: 257 SGYESDSEGDDDKAGVGREAS-SDRPMKIDIDLALSAYANAKKYYDQKRHAAKKEQKTID 315

Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           ++EK  +                        V  FEKF WFI+SENYL
Sbjct: 316 ASEKCHEFV----------------------VERFEKFLWFITSENYL 341


>gi|410695644|gb|AFV74962.1| serologically defined colon cancer antigen1-like protein, partial
           [Apis cerana]
          Length = 273

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 116/234 (49%), Positives = 157/234 (67%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P +EF SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+A +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKALLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|410695646|gb|AFV74963.1| serologically defined colon cancer antigen1-like protein, partial
           [Apis florea]
          Length = 273

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 115/234 (49%), Positives = 156/234 (66%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P +EF SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|387175444|gb|AFJ66839.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175452|gb|AFJ66843.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175454|gb|AFJ66844.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175462|gb|AFJ66848.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175464|gb|AFJ66849.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175466|gb|AFJ66850.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175470|gb|AFJ66852.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175474|gb|AFJ66854.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175476|gb|AFJ66855.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175478|gb|AFJ66856.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175480|gb|AFJ66857.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175482|gb|AFJ66858.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175484|gb|AFJ66859.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175486|gb|AFJ66860.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175494|gb|AFJ66864.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175498|gb|AFJ66866.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175506|gb|AFJ66870.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175508|gb|AFJ66871.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175510|gb|AFJ66872.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175514|gb|AFJ66874.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175516|gb|AFJ66875.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175518|gb|AFJ66876.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175520|gb|AFJ66877.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175522|gb|AFJ66878.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175528|gb|AFJ66881.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 115/234 (49%), Positives = 156/234 (66%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P +EF SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|387175512|gb|AFJ66873.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175524|gb|AFJ66879.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175526|gb|AFJ66880.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 115/234 (49%), Positives = 156/234 (66%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P +EF SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|387175502|gb|AFJ66868.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 115/234 (49%), Positives = 155/234 (66%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P +EF SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKXKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|390356696|ref|XP_001200483.2| PREDICTED: nuclear export mediator factor Nemf-like
           [Strongylocentrotus purpuratus]
          Length = 334

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 108/221 (48%), Positives = 158/221 (71%), Gaps = 12/221 (5%)

Query: 93  ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
           ES K+D+K +QQER ALKKL+NVK+DHE R+S+L+Q Q ++++K  LI  N   V+ A+ 
Sbjct: 2   ESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLVEQALR 61

Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSW 212
            +R  +ANQ+ W++I++ +K+AQ   DPVA  I+ L+L+ NH  +LLR         D +
Sbjct: 62  VVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIRSLRLDTNHFQMLLR---------DPY 112

Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
              +D+AD      +  +P LVD+D+  SAYANA+++F  K+++ KK+QKT++S+ KA+K
Sbjct: 113 -KQYDDAD--EGEEDGARPMLVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSSKAIK 169

Query: 273 SAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           SAEKKT Q LKDV T+ +INK+RK YWFEK+YW ISSENY+
Sbjct: 170 SAEKKTMQALKDVATVASINKSRKTYWFEKYYWCISSENYI 210



 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 46/59 (77%), Positives = 52/59 (88%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           +VAWDAKV+T+AWWV+ DQVSKTAPTGEFLTTGSFM+RGKKNF PP QL MG  FL K+
Sbjct: 271 SVAWDAKVITSAWWVRHDQVSKTAPTGEFLTTGSFMVRGKKNFLPPTQLVMGFGFLMKV 329


>gi|387175434|gb|AFJ66834.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175436|gb|AFJ66835.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175438|gb|AFJ66836.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175440|gb|AFJ66837.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175442|gb|AFJ66838.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175448|gb|AFJ66841.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175450|gb|AFJ66842.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175488|gb|AFJ66861.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175496|gb|AFJ66865.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 114/234 (48%), Positives = 156/234 (66%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P ++F SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKKFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|387175446|gb|AFJ66840.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175458|gb|AFJ66846.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175460|gb|AFJ66847.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175468|gb|AFJ66851.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175472|gb|AFJ66853.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175490|gb|AFJ66862.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175492|gb|AFJ66863.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175500|gb|AFJ66867.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175504|gb|AFJ66869.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 114/234 (48%), Positives = 155/234 (66%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P + F SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKXFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+ALKKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|387175456|gb|AFJ66845.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 114/234 (48%), Positives = 155/234 (66%), Gaps = 33/234 (14%)

Query: 29  SQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++QKKE +P ++   D+I  N+EFHP + +         Q++  P +EF SF  AV
Sbjct: 68  SKGYIIQKKEIKPTTDGQKDFIYTNIEFHPFLFE---------QYKDHPYKEFASFDVAV 118

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DE+FST E  K+DLKA+QQER+A KKLENVK+DH+ RL  LE+TQ +DK+KAELI  NQ 
Sbjct: 119 DEYFSTMEGQKLDLKALQQEREAXKKLENVKKDHDQRLITLEKTQELDKQKAELISRNQS 178

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AILAI+  +ANQ++W DI+  +K+A+   DPVAS IKQLKL  NHI+LLL      
Sbjct: 179 LVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLH----- 233

Query: 206 SNIKDSWVLHHDNADPFADSSETK--KPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                         DP+ DS E    KP L+D+DL  +A+ NA+++++ KRSAA
Sbjct: 234 --------------DPYEDSDEESELKPMLIDIDLAHTAFGNARKYYNQKRSAA 273


>gi|242085896|ref|XP_002443373.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
 gi|241944066|gb|EES17211.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
          Length = 1158

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 138/377 (36%), Positives = 198/377 (52%), Gaps = 76/377 (20%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P ++ Q +     EF +F AA+DEF+S  ES K++ +   +E  A ++L  +K D E R
Sbjct: 352 CPILLNQFKSREYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENR 411

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L +      + AELI  N E VDAAILA+R  +AN++SWE +   +K+ ++  +PVA
Sbjct: 412 VHTLRKEVDHCVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVA 471

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I +L    N ITLLL                 +N D   +  +T     V+VD+ LSA
Sbjct: 472 GLIDKLNFERNCITLLL----------------SNNLDDMDEDEKTAPVEKVEVDIALSA 515

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY---- 298
           +ANA+R++++K+    KQ+KTI + EKA K+AEKKT+  L   +T+  I   RKV+    
Sbjct: 516 HANARRWYEMKKKQESKQEKTITAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEK 575

Query: 299 --WF--EKFYWFISSEN----------YL--GNV-------------------------- 316
             WF   + Y  IS  +          Y+  G++                          
Sbjct: 576 FNWFISSENYLIISGRDAQQNELIVKRYMSKGDLYVHAELHGASSTIIKNHKPDTPIPPL 635

Query: 317 --------------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
                         AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  
Sbjct: 636 TLNQAGCFTVCHSKAWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 695

Query: 363 LAMGISFLFKLEESSIS 379
           L MG   LF+L+ESS++
Sbjct: 696 LVMGFGILFRLDESSLA 712


>gi|115489110|ref|NP_001067042.1| Os12g0564600 [Oryza sativa Japonica Group]
 gi|108862839|gb|ABA98970.2| zinc knuckle family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113649549|dbj|BAF30061.1| Os12g0564600 [Oryza sativa Japonica Group]
          Length = 1159

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 137/376 (36%), Positives = 198/376 (52%), Gaps = 76/376 (20%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P ++ Q +     EFE+F AA+DEF+S  ES +++ +   +E  A ++L  +K D E R+
Sbjct: 354 PVLLNQFKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRV 413

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L +      + AELI  N E VDAAI+A+R  +AN +SW+ +   +K+ ++  +PVA 
Sbjct: 414 HTLRKEVDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAG 473

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           +I +L    N ITLLL                 +N D   +  +T     V+VDL LSA+
Sbjct: 474 LIDKLSFERNCITLLL----------------SNNLDDMDEEEKTAPVEKVEVDLSLSAH 517

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY----- 298
           ANA+R+++LK+    KQ+KT+ + EKA K+AEKKT+  L   +T+  I   RKV+     
Sbjct: 518 ANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKF 577

Query: 299 -WF--EKFYWFISSEN----------YL--GNV--------------------------- 316
            WF   + Y  IS  +          Y+  G++                           
Sbjct: 578 NWFISSENYLIISGRDAQQNELIVKRYMSKGDLYVHAELHGASSTIIKNHKPDNPIPPLT 637

Query: 317 -------------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
                        AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L
Sbjct: 638 LNQAGSFTVCHSKAWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 697

Query: 364 AMGISFLFKLEESSIS 379
            MG   LF+L+ESS++
Sbjct: 698 VMGFGILFRLDESSLA 713


>gi|453084374|gb|EMF12418.1| hypothetical protein SEPMUDRAFT_149103 [Mycosphaerella populorum
           SO2202]
          Length = 1130

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 141/400 (35%), Positives = 211/400 (52%), Gaps = 84/400 (21%)

Query: 52  EFHPM-ILQPGSTPS-VMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDAL 109
           +FHP    Q    P+ V  +H+GF I        AVDEFFS+ E  K++ K  +++  A 
Sbjct: 311 DFHPFKPAQLAEDPANVFLEHEGFNI--------AVDEFFSSIEGQKLESKLAEKQESAR 362

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
           K++E+ K++ E R++ L+Q Q +   KA+ I  N E V+ A  A+   IA  + WEDI  
Sbjct: 363 KRIEHAKKEQEQRINGLQQVQELHVRKAQAIEANVERVEEATAAVNGLIAQGMDWEDIGR 422

Query: 170 RVKQAQRHNDPVASIIK-QLKLNINHITLLLRV-----HKERSNIKDSWVLHHDNADPFA 223
            ++Q Q+ ++PVA +IK  LKL+ N +TLLL         E +N  DS     D+ D   
Sbjct: 423 LIEQEQKRHNPVAELIKLPLKLHENTMTLLLSELGADDEDEEANETDSE--PSDSEDEGT 480

Query: 224 DSSETK---KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
           ++++ K   K   +D+DL  SA+ NA++++D KR+AA KQ+KT+ +++KA+KS E+K   
Sbjct: 481 NAAQVKHDAKRLTIDIDLAGSAWVNARQYYDQKRTAAVKQEKTVLASKKAIKSTEQKVMA 540

Query: 281 TL-KDVQTMTNI-NKARKVYWFEKFYWFISSENYL--------------------GNV-- 316
           TL KD++   ++    RK +WFEKF +F+SS+ YL                    G+V  
Sbjct: 541 TLKKDLKQEKDVLRPVRKQFWFEKFIYFVSSDGYLVLAGKDAQQNEILYRRYLKKGDVYI 600

Query: 317 -----------------------------AWDAKVVT-NAW---------WVKADQVSKT 337
                                          D  V T +AW         WV ADQVSKT
Sbjct: 601 HADLDGAASVIIKNKLNPEDPIPPSTLAQGGDLAVCTSSAWDSKAVMSAWWVNADQVSKT 660

Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           APTGE+L  G F++RGKKNF PP +L +G   +F++ E S
Sbjct: 661 APTGEYLAAGGFIVRGKKNFLPPAKLLLGFGVMFQISEES 700


>gi|156338807|ref|XP_001620041.1| hypothetical protein NEMVEDRAFT_v1g149359 [Nematostella vectensis]
 gi|156204309|gb|EDO27941.1| predicted protein [Nematostella vectensis]
          Length = 287

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 133/349 (38%), Positives = 166/349 (47%), Gaps = 113/349 (32%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P +M Q++  P  EF SF   VD+FFS+  S K+D+KA+ QE+ ALKKLENVK+DHE R+
Sbjct: 13  PFLMTQYKDHPYLEFPSFDKTVDDFFSSIGSQKLDVKALNQEKSALKKLENVKKDHEKRI 72

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L+  Q  D  KA+LI  N + VD AIL +   IANQ+ W +I   VK+AQ   DPVAS
Sbjct: 73  QQLQSAQEADVRKAQLIEINLDLVDRAILVVNSAIANQIDWSEILNLVKEAQIQGDPVAS 132

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
            I++LKL  NHIT+LLR             L   N  P            V VD+DL A+
Sbjct: 133 AIRELKLQTNHITMLLRYVS----------LASINGRP------------VRVDIDLLAH 170

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
            NA+RF                                                   EKF
Sbjct: 171 LNARRF---------------------------------------------------EKF 179

Query: 304 YWFISSENYL--------------------GNVAWDAKVVTNAWWVKADQ---------- 333
            WFISSENY+                    GN   +  + + A  + + Q          
Sbjct: 180 LWFISSENYVVIGGRDQQQNELVVKRHLQPGNATCNT-IFSQATLICSYQSQLSTTAINH 238

Query: 334 ---------VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
                    VSKTAPTGE+LTTGSFMIRGKKNF PPC L MG SFLF++
Sbjct: 239 SYQSQLSITVSKTAPTGEYLTTGSFMIRGKKNFLPPCHLIMGFSFLFRV 287


>gi|339236819|ref|XP_003379964.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
 gi|316977305|gb|EFV60421.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
          Length = 789

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 144/414 (34%), Positives = 199/414 (48%), Gaps = 90/414 (21%)

Query: 29  SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
           S GYL    E+    E   + + EFHP             Q      ++F++F  AVDE+
Sbjct: 246 SCGYLKMTVEENAAGERIEIFH-EFHPYFFS---------QFANSECKQFDTFNGAVDEY 295

Query: 89  FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
           FS  +S K   K +QQER ALK+LENV++DHE RL+ L+  Q++ +  A  +  N E+V+
Sbjct: 296 FSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERMAVAVELNSETVE 355

Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
            A+  +R  IA +L W  I   ++ A+   DPVA  I  L L  N   + L V       
Sbjct: 356 QALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFVMRLPV------- 408

Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQS 266
                      D F +  E      VDV++D  LS++ N++R+F   + +A KQ+KTI +
Sbjct: 409 -----------DVFDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAA 457

Query: 267 TEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK----- 321
             KALKSAE +TK+ LK  +  TNI K RK++WFEKF+WF SS+  L     DAK     
Sbjct: 458 GGKALKSAELRTKEQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEIL 517

Query: 322 ---------------------------------------------VVTNAWW-------- 328
                                                        V  +A W        
Sbjct: 518 VKRYLKPGDLYVHADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSA 577

Query: 329 --VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
             VK DQVSK+AP+GE+L TG FMIRGKKN+    QL MG   LF+L+  S +R
Sbjct: 578 WWVKHDQVSKSAPSGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAAR 631


>gi|339260826|ref|XP_003368211.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
 gi|316964832|gb|EFV49764.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
          Length = 749

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 144/414 (34%), Positives = 199/414 (48%), Gaps = 90/414 (21%)

Query: 29  SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF 88
           S+GYL    E+    E   + + EFHP             Q      ++F++F  AVDE+
Sbjct: 206 SRGYLKMTVEENAAGERIEIFH-EFHPYFFS---------QFASSECKQFDTFNGAVDEY 255

Query: 89  FSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148
           FS  +S K   K +QQER ALK+LENV++DHE RL+ L+  Q++ +  A  +  N E+V+
Sbjct: 256 FSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERMAVAVELNSETVE 315

Query: 149 AAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNI 208
            A+  +R  IA +L W  I   ++ A+   DPVA  I  L L  N   + L V       
Sbjct: 316 QALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFVMRLPV------- 368

Query: 209 KDSWVLHHDNADPFADSSETKKPSLVDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQS 266
                      D F +  E      VDV++D  LS++ N++R+F   + +A KQ+KTI +
Sbjct: 369 -----------DVFDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAA 417

Query: 267 TEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK----- 321
             KALKSAE  TK+ LK  +  TNI K RK++WFEKF+WF SS+  L     DAK     
Sbjct: 418 GGKALKSAELHTKEQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEIL 477

Query: 322 ---------------------------------------------VVTNAWW-------- 328
                                                        V  +A W        
Sbjct: 478 VKRYLKPGDLYVHADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSA 537

Query: 329 --VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
             VK DQVSK+AP+GE+L TG FMIRGKKN+    QL MG   LF+L+  S +R
Sbjct: 538 WWVKHDQVSKSAPSGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAAR 591


>gi|452981583|gb|EME81343.1| hypothetical protein MYCFIDRAFT_114319, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 1087

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 140/430 (32%), Positives = 218/430 (50%), Gaps = 84/430 (19%)

Query: 29  SQGYLVQKKEKRPNSEDYI--------VANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           ++GY++ K+++    ED          V    FHP        P+ + + Q F   E++ 
Sbjct: 264 AKGYILGKRKQSSKQEDTDGTADEGKDVMYEHFHPF------KPAQLAEDQSFVFLEYDG 317

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEFFS+ E  K++ +  ++E +A K++E+ +++ E R+  L+Q Q +   KA+ I
Sbjct: 318 FNVAVDEFFSSIEGQKLESRLQEREDNAKKRIEHARKEQEQRIEGLQQVQELHVRKAQAI 377

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N E V+ A  A+   IA  + W DI + ++  Q  ++PVA +IK  LKL+ N ITLLL
Sbjct: 378 EANVERVEEATAAVNGLIAQGMDWADIGSLIENEQARHNPVAELIKLPLKLHENTITLLL 437

Query: 200 ----RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKR 254
               R   E  ++ DS     ++ D     +  +   L VD+DL  SA++NA++++D KR
Sbjct: 438 SEIGRDADEEMDVTDSEPSDSEDGDAETAPARAEDKRLTVDIDLAASAWSNARQYYDQKR 497

Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTL-KDVQTMTNI-NKARKVYWFEKFYWFISSENY 312
           +AA KQ++T  +++KALKS E+     L KD++   ++    RK +WFEKF +FISS+ Y
Sbjct: 498 TAASKQERTEAASKKALKSTEQNVMAKLKKDLKQEKDVLRPVRKQFWFEKFIYFISSDGY 557

Query: 313 L--------------------GNV--------------------------------AWDA 320
           L                    G+V                                A D 
Sbjct: 558 LVLAGRDDLQNEMLYRRHLRKGDVYVHADLNGASSVVIKNSPHTPCAPIPPSTLAQAGDL 617

Query: 321 KVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
            V  ++ W          V A+QVSKTA TGE+L  GSF+IRGKKNF PP +L +G   +
Sbjct: 618 VVCRSSAWDSKAVMSAWWVNAEQVSKTADTGEYLAVGSFIIRGKKNFLPPARLLLGFGVM 677

Query: 371 FKLEESSISR 380
           F++ E S +R
Sbjct: 678 FQISEESKAR 687


>gi|452822547|gb|EME29565.1| RNA-binding protein [Galdieria sulphuraria]
          Length = 1067

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 142/436 (32%), Positives = 219/436 (50%), Gaps = 100/436 (22%)

Query: 30  QGYLVQKKEKRPNSEDYIVANME------FHPMILQPGSTPSVMRQHQGFPIQEFESFAA 83
           +GYL  +K+   ++ D  ++N E      F P +L   S  S +         EF +F  
Sbjct: 291 RGYLFYRKD--AHTTDVSMSNSERLLYEDFSPFLLCHLSNTSHI---------EFPTFNE 339

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
           AVD +F+  E  +  + A +QE    KK++++++D E R+  LE+ +  + + AE I  N
Sbjct: 340 AVDIYFANLEKERAQIVASKQESVVSKKVDSLRKDLERRIDELERAKEENFKIAEAIELN 399

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHK 203
            + VD AI  +R  IAN ++W++++  +++ +   +PVA  I  L L+ N ITL+L +  
Sbjct: 400 ADEVDKAIWVVRAMIANGVAWDELDKMLEEEKEKGNPVAETIHSLHLDRNEITLMLPIDP 459

Query: 204 --ERSNIKDSWVLHHDNADPFADSSETK---------------KP-SLVDVDLDLSAYAN 245
             E   + +++    ++   + D+ ET+               KP  L DVDL LSA+AN
Sbjct: 460 ILEDEFVNENFQYQSEDITYYDDTDETEEHFQTERMVAELNASKPVVLADVDLSLSAFAN 519

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-----KDVQTMTNINKARKVYWF 300
           A R+F+ ++ A +K++KT+++T++AL  AEKK  + +     + ++    I + RK  WF
Sbjct: 520 AARYFESRKRAQEKKEKTMEATKRALNVAEKKASKQMERSQQRSLKPAVAIREIRKPAWF 579

Query: 301 EKFYWFISSENYLGNVAWDAK---------------------------VVTN-------- 325
           EKF WFISSEN+L     DA+                           VV N        
Sbjct: 580 EKFDWFISSENFLVIAGKDAQQNEQVVKRYMKTFDVYVHADIHGASSVVVKNRFRDKPVP 639

Query: 326 ----------------AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
                           AW         WV A QVSKTAP+GE+LTTGSFMIRGKKN+ PP
Sbjct: 640 LQTLIEAGAFAMCHSSAWSSKIVSSAWWVHASQVSKTAPSGEYLTTGSFMIRGKKNYLPP 699

Query: 361 CQLAMGISFLFKLEES 376
            QL MG   LFK++ S
Sbjct: 700 SQLVMGYGILFKMDPS 715


>gi|89130574|gb|AAI14230.1| Zgc:153813 protein [Danio rerio]
          Length = 556

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 121/287 (42%), Positives = 153/287 (53%), Gaps = 83/287 (28%)

Query: 162 LSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADP 221
           + W +I   V +AQ   DPVA  IK+LKL  NHITLLLR               +  A P
Sbjct: 99  VDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLR---------------NPEACP 143

Query: 222 FADSSETK---------KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
              ++E +         K  LVD+D++LSA+ANAKR++D KRSAAKK+QKT+++ +KA K
Sbjct: 144 EGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKAFK 203

Query: 273 SAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---------VV 323
           SAEKKTKQTLKDVQT+T+I KARKVYWFEKF WF+SSENYL     D +         + 
Sbjct: 204 SAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRYLR 263

Query: 324 TNAWWVKADQVSKTA-----PTGE------------------------------------ 342
               +V AD    T+     P+GE                                    
Sbjct: 264 AGDLYVHADLHGATSCVIKNPSGEAVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQHDQ 323

Query: 343 ---------FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                    +LTTGSFMIRGKKNF PP  L MG  FLFK+++ S+ R
Sbjct: 324 VSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFR 370


>gi|429858117|gb|ELA32948.1| duf814 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 1040

 Score =  197 bits (500), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 133/399 (33%), Positives = 202/399 (50%), Gaps = 77/399 (19%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP +      P+         + EF+ F   VDEFFS+ E  K++ K  ++E  A +K
Sbjct: 294 DFHPFL------PNKFANDPTIKVLEFDGFNKTVDEFFSSLEGQKLESKLSEREAAAKRK 347

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  + D   R+  L++ Q ++ +KA  I  N E V  A+ A+   +   + W DI   +
Sbjct: 348 LEAARNDQAKRIEGLQEVQSLNVQKATAIEANVERVQEAMDAVNGLLQQGMDWIDISKLI 407

Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
           ++ Q+  +PVA IIK L LN+  N ITLLL   ++  +   ++    D +D   +++  K
Sbjct: 408 EREQKRGNPVAEIIK-LPLNLADNTITLLLGEEEDIEDEDSNYETDSDASDSEDEAASNK 466

Query: 230 KPSL----VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
           + +     VDV++ L+ YANA+ +++ KRSAAKK++KT+Q TE ALK+AE+K +  L+  
Sbjct: 467 QKTAKHLEVDVNIGLTPYANAREYYEQKRSAAKKEEKTVQQTEIALKNAEQKIQAELRKG 526

Query: 286 --QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAWDAKV- 322
             Q    +   RK  WFEKF WFIS++ YL                    G+V   A + 
Sbjct: 527 LKQEKAVLAPIRKQIWFEKFIWFISTDGYLVLGGKDAQQNEMLYKRYLRKGDVYIHADIH 586

Query: 323 --------------------------------VTNAWWVKA---------DQVSKTAPTG 341
                                            ++AW  KA         DQVSK+APTG
Sbjct: 587 GAATVIIKNTPSDPDAPIPPSTLAQAGTLAVCSSSAWDSKAGMGAWWVKADQVSKSAPTG 646

Query: 342 EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           E+L TGSFM+RG+KNF PP QL +G   ++K+ E S +R
Sbjct: 647 EYLPTGSFMVRGQKNFLPPAQLLLGFGIMWKISEESKAR 685


>gi|452840445|gb|EME42383.1| hypothetical protein DOTSEDRAFT_73267 [Dothistroma septosporum
           NZE10]
          Length = 1122

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 144/453 (31%), Positives = 220/453 (48%), Gaps = 121/453 (26%)

Query: 22  VSNLNVSSQGYLVQKKEKRPNSEDYIVANME--------FHPMILQPGSTPSVMRQHQGF 73
           +++L V+ +GY++ K+ K P ++    A  +        FHP   +P        Q +G 
Sbjct: 258 ITSLPVA-RGYILAKR-KVPKADANTTATEDNQNVMYEDFHP--FKPA-------QLEGD 306

Query: 74  PIQEF---ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQ 130
           P   F   E F  AVD+FFS+ E  K++ +  ++E +A +++E  +++ E R++ L+Q Q
Sbjct: 307 PANVFIEHEGFNKAVDDFFSSIEGQKLESRLQEREENAKRRIEQARQEQEKRITGLQQVQ 366

Query: 131 LVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLK 189
            ++  KA+ I  N E V+ A+ A+   IA  + W DI   ++  Q+ ++PVA +IK  LK
Sbjct: 367 ELNIRKAQAIEANVERVEEAVAAVNGLIAQGMDWVDIGRLIENEQKRHNPVAEMIKLPLK 426

Query: 190 LNINHITLLL---------------------RVHKERSNIKDSWVLHHDNADPFADSSET 228
           L+ N  TLLL                          ++NIK S+V   +           
Sbjct: 427 LHENTATLLLSELADADDEDMDETDSEPSDSEDEDHQANIKKSFVPEDERL--------- 477

Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--Q 286
                VD+DL  S ++NA++++D +R+AA KQ+KT Q+ +KALKS E+K    LK    Q
Sbjct: 478 ----TVDIDLAASGWSNARQYYDQRRTAATKQEKTAQAAQKALKSTEQKVMADLKKGLKQ 533

Query: 287 TMTNINKARKVYWFEKFYWFISSENYL--------------------GNV---------- 316
               +   RK +WFEKF +FISS+ YL                    G+V          
Sbjct: 534 EKEVLRPVRKQFWFEKFIYFISSDGYLVLAGKDAQQNEMLYRRHLRKGDVYVHADMHGAA 593

Query: 317 ----------------------AWDAKVVTNAWW----------VKADQVSKTAPTGEFL 344
                                 A +  V T++ W          V ADQVSKTAPTGE+L
Sbjct: 594 SVIIKNNPATPQAPIPPSSLSQAGNLSVCTSSAWDSKAVMSAWWVNADQVSKTAPTGEYL 653

Query: 345 TTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           TTG FM+RGKKNF PP QL +G + +F++ E S
Sbjct: 654 TTGGFMVRGKKNFLPPAQLLLGFALVFQISEDS 686


>gi|384249421|gb|EIE22903.1| hypothetical protein COCSUDRAFT_16391 [Coccomyxa subellipsoidea
           C-169]
          Length = 1029

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 137/407 (33%), Positives = 201/407 (49%), Gaps = 97/407 (23%)

Query: 43  SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTA---------- 92
           S D +V +  F P+ILQ  S   V+R         F ++ AA+DEF++ A          
Sbjct: 267 SGDCLVYD-SFDPLILQQNSGQEVLR---------FPTYNAALDEFYAKARPAPLCLTMS 316

Query: 93  -ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAI 151
            E  K +   +Q E+ AL KL+ ++ D   R  AL++     + KA+LI  N E+VD AI
Sbjct: 317 VEGQKAEQARLQAEQAALSKLDRIRIDQTGRAEALDREAKEAEAKAQLIEANAEAVDQAI 376

Query: 152 LAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDS 211
            A+R  +A  LSW ++E  ++      + VA ++  L L+ N +TLL           DS
Sbjct: 377 NAVRVALAQGLSWAELERLIRDEAAAGNQVAGLVHALHLDRNAVTLL-----------DS 425

Query: 212 WVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
                 NA+   ++      +LV+VDLDL+A  NA+ +   +++ + KQ KT+ + ++AL
Sbjct: 426 ------NAESNDETGTDVPTALVEVDLDLNAQQNARAWHSDRKARSAKQAKTLDANKRAL 479

Query: 272 KSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK---------- 321
             A+KK +  L  V+ +  + + RK  WFEKF WF++SENYL     DA+          
Sbjct: 480 VEADKKVQVQLSKVKAVAAVQQLRKPAWFEKFNWFVTSENYLVVSGRDAQQNELLVKRYL 539

Query: 322 -----------------VVTN-----------------------AW---------WVKAD 332
                            VV N                       AW         WV A 
Sbjct: 540 RKDDLYVHAELHGASTTVVRNHNPSRPGMALVSQAGTACVCRSQAWDAKIVTSAWWVHAH 599

Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           QVSK+AP+GE+L TGSFMIRG+KNF PP  L MG++FLFKL+ES I+
Sbjct: 600 QVSKSAPSGEYLPTGSFMIRGRKNFLPPHPLIMGLTFLFKLDESCIA 646


>gi|156059014|ref|XP_001595430.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980]
 gi|154701306|gb|EDO01045.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 1063

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 144/436 (33%), Positives = 205/436 (47%), Gaps = 90/436 (20%)

Query: 28  SSQGYLV-QKKEKRPNSEDYIVANM------EFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           +S+GY++ +KK+ +  S+D    +       +FHP        P            EFE 
Sbjct: 257 TSKGYIIAKKKDSQVTSDDNQAEDRKGLLYDDFHPF------KPRQFEDDPTLVFLEFEG 310

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F   VDEFFS+ E  K++ +  ++E +A KK++  + +   RL  L++ Q +++ KA  +
Sbjct: 311 FNKTVDEFFSSIEGQKLESRLEERELNAKKKIQAARNEQAKRLGGLQEIQALNERKASAL 370

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N E V  A  A+   IA  + W +I   ++  Q+  +PVAS+IK  LKL+ N +TLLL
Sbjct: 371 QANVERVQEARDAVNGLIAQGMDWFEIGRLIELEQKRKNPVASMIKLPLKLDQNTVTLLL 430

Query: 200 RVHKERSNIKDSWVLHHDNADPFADSSETKKPS-----------LVDVDLDLSAYANAKR 248
                  +   S+    D +D   D  +  KP             +D++L LS +ANA+ 
Sbjct: 431 DEEVFNDDEDSSYETDSDVSDS-EDEEKAAKPVEKEEKATETRLAIDINLSLSPWANARN 489

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWF 306
           +FD KRSAA K+ KT+QS+ KALKS E K  Q LK    Q  T +   RK  WFEKF WF
Sbjct: 490 YFDQKRSAASKEDKTLQSSSKALKSTEAKIAQDLKKGLKQEKTILRPVRKQIWFEKFVWF 549

Query: 307 ISSENYLGNVAWDAK--------------------------------------------- 321
           ISS+ YL     DA+                                             
Sbjct: 550 ISSDGYLVLAGKDAQQSEILYKRYLRKGDMYLHADISGAASVIVRNNPKTPDAPIPPQTL 609

Query: 322 -------VVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLA 364
                  V T++ W          V ADQVSK APTGEFL  G F I+GKKNF PP QL 
Sbjct: 610 SQAGTLVVATSSAWDSKAGMSAWWVNADQVSKAAPTGEFLPAGKFTIQGKKNFLPPAQLL 669

Query: 365 MGISFLFKLEESSISR 380
           +G   LF++ + S +R
Sbjct: 670 LGFGILFQISDESKAR 685


>gi|407928362|gb|EKG21221.1| protein of unknown function DUF814 [Macrophomina phaseolina MS6]
          Length = 1094

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 139/437 (31%), Positives = 213/437 (48%), Gaps = 95/437 (21%)

Query: 29  SQGYLVQKKEKRP--------NSEDYIVANM----EFHPMILQPGSTPSVMRQHQGFPIQ 76
           ++GY+V KK  +P         SE+   A      +FHP        P            
Sbjct: 261 AKGYIVAKKTGQPAPTEVSEEGSEEKAPAEKLLYEDFHPF------KPKQFEADPTLTFL 314

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           EFE F   VDEFFS+ E  K++ +  ++E +A +KLE  K++H  RL  L++ Q ++  K
Sbjct: 315 EFEGFNKTVDEFFSSIEGQKLESRLQEREENAKRKLEQAKQEHLKRLGGLQRAQELNIRK 374

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
           AE I  N + V  A++A+   I   + W +I+  +++ Q H +PVA +IK  LKL  N +
Sbjct: 375 AEAIQANVDRVQEAVMAVNGLIDKGMDWIEIDRLIEREQTHGNPVAQMIKVPLKLRENTV 434

Query: 196 TLLL---RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL--------VDVDLDLSAYA 244
           TLLL    V ++  + + S      + D   +  + KKP++        +D+DL LS +A
Sbjct: 435 TLLLDEPGVEEDEEDFEGSETESEPSDD-EEEQQQRKKPAVKPQDNRLTIDIDLGLSPWA 493

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEK 302
           NAK +FD K++AA K+++T+++++KALKS +KK +  LK    Q    +   RK +WFEK
Sbjct: 494 NAKTYFDQKKTAAAKEERTLEASQKALKSTQKKIEADLKKGLKQEKELLRPVRKQFWFEK 553

Query: 303 FYWFISSENYL--------------------GNV-------------------------- 316
           F +FISS+ YL                    G++                          
Sbjct: 554 FIYFISSDGYLVIGGKDAQQNEILYRRHLKKGDIYVHADLSAAAVVIIKNRPSTPDDPIP 613

Query: 317 ------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
                 A +  V T+  W          V ADQVSKT  +GE+L  G F+I GKKNF PP
Sbjct: 614 PSTLSQAGNLSVSTSTAWDSKAVMSAWWVNADQVSKTTSSGEYLAAGGFVINGKKNFLPP 673

Query: 361 CQLAMGISFLFKLEESS 377
            QL +G + +F++ E S
Sbjct: 674 AQLLLGFAVMFQITEES 690


>gi|303312187|ref|XP_003066105.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240105767|gb|EER23960.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 1125

 Score =  191 bits (485), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 134/426 (31%), Positives = 210/426 (49%), Gaps = 82/426 (19%)

Query: 29  SQGYLVQKKEKRPNSEDYIVAN--------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           ++GY+V + E +P+                +++HP        P          I  F+S
Sbjct: 264 TRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPF------APRQFADGNDISILTFDS 317

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDE++S+ E+ K++ +  ++E    +KLE  KRDHE R+ AL+Q Q +   KAE I
Sbjct: 318 FNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEIHTRKAEAI 377

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N   V+  + A+   IA  + W +I   ++  Q   +PVA +IK  LKL  N +T+LL
Sbjct: 378 ATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLYENTVTVLL 437

Query: 200 RVHKERSNIKDSWVLHHDNADPFADS-SETKKPSL--VDVDLDLSAYANAKRFFDLKRSA 256
              +      DS     ++ +   ++ ++ ++P +  VD+DL L+ +ANA +++D K++A
Sbjct: 438 PEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQYYDQKKTA 497

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL- 313
           A K++KTI+++++ALKSAEKK    LK    Q    +  AR  +WFEKFY+FISS+ YL 
Sbjct: 498 AVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPVLRPARIPFWFEKFYFFISSDGYLV 557

Query: 314 -------------------GNV-------------------AWDAKV------------- 322
                              G+V                   A DA +             
Sbjct: 558 LGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKNKPGASDAPIPPGTLAQAGTFTV 617

Query: 323 -VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
             + AW         WV ADQVSKT P+GE+L TG  +IRG KN   P QL +G + +F+
Sbjct: 618 ATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPGQLILGFAVMFQ 677

Query: 373 LEESSI 378
           +   S+
Sbjct: 678 ISPESV 683


>gi|297297786|ref|XP_002805097.1| PREDICTED: serologically defined colon cancer antigen 1-like
           [Macaca mulatta]
          Length = 856

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 127/341 (37%), Positives = 175/341 (51%), Gaps = 38/341 (11%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFES
Sbjct: 241 NFSGKGYIIQKREIKPSLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 291

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 292 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 351

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 352 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 411

Query: 201 ----------------VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYA 244
                           V+ E++  +          +      +  KP LVDVDL LSAYA
Sbjct: 412 NPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYA 471

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFY 304
           NAK+F       + +    I   ++       K   T  D+    +++ A          
Sbjct: 472 NAKKFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTG 531

Query: 305 WFISSEN--------YLGNVAWDAKVVTNAWWVKADQVSKT 337
             I               + AWDA+V+T+AWWV   Q+ ++
Sbjct: 532 EPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQIIRS 572


>gi|73853411|gb|AAZ86776.1| IP12823p [Drosophila melanogaster]
          Length = 489

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 111/237 (46%), Positives = 150/237 (63%), Gaps = 27/237 (11%)

Query: 29  SQGYLVQKKEKRP---NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S+GY++Q KE++P    + ++   N+EFHP +           Q + F    FESF  AV
Sbjct: 276 SKGYIIQVKEEKPTENGTVEFFFRNIEFHPYLFI---------QFKNFEKATFESFMEAV 326

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+ST ES KID+K +QQER+ALKKL NVK DH  RL  L + Q VD++KAELI +NQ 
Sbjct: 327 DEFYSTQESQKIDMKTLQQEREALKKLSNVKNDHAKRLEELTKVQDVDRKKAELITSNQS 386

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD AI A++  IA+QLSW DI   VK+AQ + D VAS IKQLKL  NHI+L+L    + 
Sbjct: 387 LVDNAIRAVQSAIASQLSWPDIHELVKEAQANGDAVASSIKQLKLETNHISLMLSDPYDN 446

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK 262
                             D  +  + ++VDVDL LSA+ANA+R++D+KRSAA+K++K
Sbjct: 447 DE---------------DDDLKDPEVTVVDVDLALSAWANARRYYDMKRSAAQKKKK 488


>gi|367021400|ref|XP_003659985.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
           42464]
 gi|347007252|gb|AEO54740.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
           42464]
          Length = 1085

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 126/374 (33%), Positives = 189/374 (50%), Gaps = 71/374 (18%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F+ +   VDEFFS+ E  K++ +  ++E  A +KLE  +RD E R+  L++ Q+++  KA
Sbjct: 312 FDGYNKTVDEFFSSLEGQKLESRLQEREAIAKRKLEAARRDQEQRIEGLQEAQMLNLRKA 371

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
             I  N E V  A+ A+   +   + W D+   V++ Q+ ++PVA II+  ++L+ N IT
Sbjct: 372 AAIEANIERVQEAMDAVNGLLQQGMDWVDVNKLVEREQKLHNPVAEIIQLPMRLHENVIT 431

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSS---ETKKPS---LVDVDLDLSAYANAKRFF 250
           LLL   +E    +D     +D  +  AD     + K P+    +D++L LS + NA+ ++
Sbjct: 432 LLLGEEEEEGEAEDKLDFDYDTDEEAADDGVPDKAKGPAKRLAIDINLKLSPWNNAREYY 491

Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
           + KR+AA+KQQKT+Q +E ALK+AE+K  + LK    Q    +   RK  WFEKF WFIS
Sbjct: 492 EQKRTAAEKQQKTVQQSEIALKNAEQKIAEDLKKGLKQEKPVLQPIRKQLWFEKFIWFIS 551

Query: 309 SENYLGNVAWDAK----------------------------------------------- 321
           S+ YL     DA+                                               
Sbjct: 552 SDGYLVLGGRDAQQNEILYKRYLRKGDVYVHADMHGAPTVIVKNNPKTPDAPIPPSTLAQ 611

Query: 322 ------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
                   +NAW         WV ADQVSK+AP GE+L  GSFM+RGK+N  PP  L +G
Sbjct: 612 AGSLSVCCSNAWDSKAAMGAYWVNADQVSKSAPAGEYLPVGSFMVRGKRNPLPPALLMLG 671

Query: 367 ISFLFKLEESSISR 380
              +FK+ E S +R
Sbjct: 672 FGLMFKVSEESKAR 685


>gi|384489957|gb|EIE81179.1| hypothetical protein RO3G_05884 [Rhizopus delemar RA 99-880]
          Length = 1044

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 108/262 (41%), Positives = 161/262 (61%), Gaps = 11/262 (4%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           EFHP +          +Q      +EF +F  AVDEFFS+ E+ K++LK  +QE  ALKK
Sbjct: 250 EFHPHLY---------KQFSNRKFKEFSTFDQAVDEFFSSIEAQKLELKTRRQEEAALKK 300

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE VK + E R+ +L   QL +  KA+LI  N + VDAAI  IR  +A+Q+ W+D+   V
Sbjct: 301 LEAVKLEQEKRVESLLNQQLTNTRKAQLIELNLQFVDAAITIIRNAVASQMDWQDLNDLV 360

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           K+ +R  +P+A II  LKL  N +TLLL   +E    +       +  +      + K+ 
Sbjct: 361 KEEKRRGNPIALIIDTLKLETNQVTLLLTDPEEHEESESDDEEEEEEEE--EKEEKPKEI 418

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
             +DVD+ L+A+ANA+++++ K++ A K +KTI+++ KALKSAE+K ++ LK+ +    I
Sbjct: 419 FKIDVDIGLTAFANARKYYEQKKTTASKHEKTIEASTKALKSAERKIRKDLKETKITATI 478

Query: 292 NKARKVYWFEKFYWFISSENYL 313
           NK RK +WFEKF WFIS+E YL
Sbjct: 479 NKIRKPFWFEKFQWFISTEGYL 500



 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 45/63 (71%), Positives = 54/63 (85%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+A+WV  DQVSK+AP+GE+LTTGSFMIRGKKNF PP QL  G  +LFKL+ES
Sbjct: 566 AWDSKIVTSAYWVYPDQVSKSAPSGEYLTTGSFMIRGKKNFLPPVQLVYGFGYLFKLDES 625

Query: 377 SIS 379
           SI 
Sbjct: 626 SIG 628


>gi|440466993|gb|ELQ36234.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           Y34]
 gi|440486785|gb|ELQ66618.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           P131]
          Length = 1095

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 143/450 (31%), Positives = 210/450 (46%), Gaps = 94/450 (20%)

Query: 17  TLHVLVSNLNVSSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGST---------P 64
           TL  + S   ++  GY+V K      S   ED +       P +  PGS+         P
Sbjct: 273 TLAGITSAAQIT--GYIVAKTRDGAASVQNEDRVSEGALVKPFV--PGSSKDLLYEDFQP 328

Query: 65  SVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
            + +Q    P   I EFE F   VDEF+S+ E  K++ +  ++E  A KKL+  + +   
Sbjct: 329 FLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAK 388

Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
           R+  LE++QL++  KA  I  N E V  A+ A+   + N + W DI   V++ Q+ N+PV
Sbjct: 389 RIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPV 448

Query: 182 ASIIKQLKLNINHITLLLRVH---------KERSNIKDSWVLHHDNADPFADSSETKKPS 232
           A+IIK L +++ + T+ LR+             +  +    +  D+ +  A S +  K  
Sbjct: 449 AAIIK-LPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRE 507

Query: 233 L-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMT 289
           L VD+ L+LS ++NA  ++D KRSAA+K++KTI  +  ALKSA +K  + L+    Q   
Sbjct: 508 LEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKP 567

Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
            I   R   WFEK+ WF+SS+ YL     DA+                            
Sbjct: 568 VIQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVI 627

Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
                                      +NAW         WV ADQVSK APTGEFL  G
Sbjct: 628 IKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAG 687

Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           SFMI+GKKN  PP  L +G   LF++ E S
Sbjct: 688 SFMIKGKKNELPPATLVIGFGLLFRISEES 717


>gi|159129335|gb|EDP54449.1| DUF814 domain protein, putative [Aspergillus fumigatus A1163]
          Length = 1116

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 142/430 (33%), Positives = 213/430 (49%), Gaps = 92/430 (21%)

Query: 31  GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFP---IQE 77
           GY+V K++KRP++E    A            +FHP   +P       RQ +G P   I E
Sbjct: 265 GYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHP--FRP-------RQFEGNPEVKILE 315

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F +F A VDE+FS+ E+ K++ +  ++E  A +KL+ V+++HE RL AL++ Q +   KA
Sbjct: 316 FSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRKA 375

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
             I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA IIK  LKL  N IT
Sbjct: 376 AAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTIT 435

Query: 197 LLLRVHKERSNIKDS--WVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDL 252
           L+L    E  +  D   W    + ++     +  K   +  +D+DL LS +ANA ++++ 
Sbjct: 436 LVLGEASEEQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPWANATQYYEQ 495

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
           K+ AA K+QKT QS+ KALKS EKK  + LK    Q    +  ARK +WFEKF +FISSE
Sbjct: 496 KKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFLFFISSE 555

Query: 311 NYL--------------------GNV--------------------------------AW 318
            YL                    G++                                A 
Sbjct: 556 GYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQAG 615

Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           +  V T++ W          V A+QV+KT  TG  L TG F I+G+KNF  P QL +G +
Sbjct: 616 NLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNFLAPSQLVLGFA 674

Query: 369 FLFKLEESSI 378
            +F++ ++S+
Sbjct: 675 VMFQISKNSL 684


>gi|451850505|gb|EMD63807.1| hypothetical protein COCSADRAFT_182004 [Cochliobolus sativus
           ND90Pr]
          Length = 1128

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 137/432 (31%), Positives = 208/432 (48%), Gaps = 91/432 (21%)

Query: 30  QGYLVQKKE---KRPNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEFESFAA 83
           +GY++ K      +P+ E    + +   +FHP   Q         ++  +   EF+ F  
Sbjct: 265 KGYILAKPNPSASKPDDESSDKSRLLYDDFHPFRPQ-------QFENTDYTFLEFDGFNK 317

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
           AVDEFFS+ E  K++ K  ++E+ A KKLE  +++HE R+  L+Q Q ++  KAE I+ N
Sbjct: 318 AVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIGGLQQVQELNFRKAEAILAN 377

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL--- 199
              V  A  A+   I   + W DIE  +++ Q   + VA +I+  LKL+ N ITLLL   
Sbjct: 378 VHRVTEATEAVNGLIRQGMDWVDIERLIEREQNSGNAVAQLIRLPLKLHENTITLLLNET 437

Query: 200 ---------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRF 249
                        E S++ +      D     +      +P L +D+DL LSA+AN+  +
Sbjct: 438 NWEKGGEEEDEGNETSSVSEDTDDEDDRPRKTSPPKPVARPQLAIDIDLGLSAWANSTEY 497

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
           FD K++AA K+ +T+Q++ KALKS EKK  + LK    Q    +   RK +WFEKF +FI
Sbjct: 498 FDQKKTAADKEGRTLQASSKALKSHEKKVAEDLKKGLKQEKEVLRPVRKQHWFEKFIYFI 557

Query: 308 SSENYL--------------------GNV------------------------------- 316
           SS+ YL                    G+V                               
Sbjct: 558 SSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLS 617

Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
            A +  + T+  W          V++DQVSKT  TGEFL  G F I+GKK F PP QL +
Sbjct: 618 QAGNLSICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNIKGKKEFLPPAQLVV 677

Query: 366 GISFLFKLEESS 377
           G++ +F++ +SS
Sbjct: 678 GLAVMFEISDSS 689


>gi|452000540|gb|EMD93001.1| hypothetical protein COCHEDRAFT_1172752 [Cochliobolus
           heterostrophus C5]
          Length = 1128

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 132/404 (32%), Positives = 196/404 (48%), Gaps = 85/404 (21%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP   Q         ++  +   EF+ F  AVDEFFS+ E  K++ K  ++E+ A KK
Sbjct: 293 DFHPFRPQ-------QFENTDYTFLEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKK 345

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  +++HE R+  L+Q Q ++  KAE I+ N   V  A  A+   I   + W DIE  +
Sbjct: 346 LEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIERLI 405

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL------------RVHKERSNIKDSWVLHHDN 218
           ++ Q   + VA +I+  LKL+ N ITLLL                E S++ +      D 
Sbjct: 406 EREQNSGNAVAQLIRLPLKLHENTITLLLNETNWEEGGEEEDEGNETSSVSEDTDDEDDR 465

Query: 219 ADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
               +      +P L +D+DL LSA+AN+  +FD K++AA K+ +T+Q++ KALKS EKK
Sbjct: 466 PRKTSPPKPVARPQLAIDIDLGLSAWANSTEYFDQKKTAADKEGRTLQASTKALKSHEKK 525

Query: 278 TKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GN 315
             + LK    Q    +   RK +WFEKF +FISS+ YL                    G+
Sbjct: 526 VAEDLKKGLKQEKEVLRPVRKQHWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRFLRKGD 585

Query: 316 V--------------------------------AWDAKVVTNAWW----------VKADQ 333
           V                                A +  + T+  W          V++DQ
Sbjct: 586 VYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLSICTSDAWDSKAVMSAWWVRSDQ 645

Query: 334 VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           VSKT  TGEFL  G F I+GKK F PP QL +G++ +F++ +SS
Sbjct: 646 VSKTGQTGEFLPAGMFNIKGKKEFLPPAQLVVGLAVMFEISDSS 689


>gi|389646873|ref|XP_003721068.1| nuclear export mediator factor [Magnaporthe oryzae 70-15]
 gi|351638460|gb|EHA46325.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           70-15]
          Length = 1074

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 142/450 (31%), Positives = 210/450 (46%), Gaps = 94/450 (20%)

Query: 17  TLHVLVSNLNVSSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGST---------P 64
           TL  + S   ++  GY+V K      S   ED +       P +  PGS+         P
Sbjct: 252 TLAGITSAAQIT--GYIVAKTRDGAASVQNEDRVSEGALVKPFV--PGSSKDLLYEDFQP 307

Query: 65  SVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
            + +Q    P   I EFE F   VDEF+S+ E  K++ +  ++E  A KKL+  + +   
Sbjct: 308 FLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAK 367

Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
           R+  LE++QL++  KA  I  N E V  A+ A+   + N + W DI   V++ Q+ N+PV
Sbjct: 368 RIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPV 427

Query: 182 ASIIKQLKLNINHITLLLRVH---------KERSNIKDSWVLHHDNADPFADSSETKKPS 232
           A+II +L +++ + T+ LR+             +  +    +  D+ +  A S +  K  
Sbjct: 428 AAII-ELPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRE 486

Query: 233 L-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMT 289
           L VD+ L+LS ++NA  ++D KRSAA+K++KTI  +  ALKSA +K  + L+    Q   
Sbjct: 487 LEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKP 546

Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
            I   R   WFEK+ WF+SS+ YL     DA+                            
Sbjct: 547 VIQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVI 606

Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
                                      +NAW         WV ADQVSK APTGEFL  G
Sbjct: 607 IKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAG 666

Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           SFMI+GKKN  PP  L +G   LF++ E S
Sbjct: 667 SFMIKGKKNELPPATLVIGFGLLFRISEES 696


>gi|119193306|ref|XP_001247259.1| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
 gi|392863500|gb|EAS35746.2| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
          Length = 1125

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 138/432 (31%), Positives = 207/432 (47%), Gaps = 94/432 (21%)

Query: 29  SQGYLVQKKEKRP------------NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
           ++GY+V + E +P            +  +YI    ++HP        P          I 
Sbjct: 264 TRGYIVARNENKPPENPSFSGEAKPDKSNYI----DYHPF------APRQFVDGNDTSIL 313

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
            F+SF  AVDE++S+ E+ K++ +  ++E    +KLE  KRDHE R+ AL+Q Q +   +
Sbjct: 314 TFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEIHTRR 373

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
           AE I  N   V+  + A+   IA  + W +I   ++  Q   +PVA +IK  LKL  N +
Sbjct: 374 AEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLYENTV 433

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-----VDVDLDLSAYANAKRFF 250
           T+LL   + + + +D         D   D    KKP       VD+DL L+ +ANA +++
Sbjct: 434 TVLL--PEGQPDGEDDDSEESGEEDEENDGEAKKKPQRPEVLSVDIDLGLTPWANASQYY 491

Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
           D K++AA K+ KTI+++++ALKSAEKK    LK    Q    +  AR  +WFEKFY+FIS
Sbjct: 492 DQKKTAAIKEDKTIKASKQALKSAEKKLTTDLKRGLKQEKPVLRPARIPFWFEKFYFFIS 551

Query: 309 SENYL--------------------GNV-------------------AWDAKV------- 322
           S+ YL                    G+V                   A DA +       
Sbjct: 552 SDGYLVLGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKNKPGASDAPIPPGTLAQ 611

Query: 323 -------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
                   + AW         WV ADQVSKT P+GE+L TG  +IRG KN   P QL +G
Sbjct: 612 AGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPGQLILG 671

Query: 367 ISFLFKLEESSI 378
            + +F++   S+
Sbjct: 672 FAVMFQISPESV 683


>gi|86196391|gb|EAQ71029.1| hypothetical protein MGCH7_ch7g436 [Magnaporthe oryzae 70-15]
          Length = 1095

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 142/450 (31%), Positives = 210/450 (46%), Gaps = 94/450 (20%)

Query: 17  TLHVLVSNLNVSSQGYLVQKKEKRPNS---EDYIVANMEFHPMILQPGST---------P 64
           TL  + S   ++  GY+V K      S   ED +       P +  PGS+         P
Sbjct: 273 TLAGITSAAQIT--GYIVAKTRDGAASVQNEDRVSEGALVKPFV--PGSSKDLLYEDFQP 328

Query: 65  SVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
            + +Q    P   I EFE F   VDEF+S+ E  K++ +  ++E  A KKL+  + +   
Sbjct: 329 FLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAK 388

Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
           R+  LE++QL++  KA  I  N E V  A+ A+   + N + W DI   V++ Q+ N+PV
Sbjct: 389 RIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPV 448

Query: 182 ASIIKQLKLNINHITLLLRVH---------KERSNIKDSWVLHHDNADPFADSSETKKPS 232
           A+II +L +++ + T+ LR+             +  +    +  D+ +  A S +  K  
Sbjct: 449 AAII-ELPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRE 507

Query: 233 L-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMT 289
           L VD+ L+LS ++NA  ++D KRSAA+K++KTI  +  ALKSA +K  + L+    Q   
Sbjct: 508 LEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKP 567

Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
            I   R   WFEK+ WF+SS+ YL     DA+                            
Sbjct: 568 VIQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVI 627

Query: 322 -------------------------VVTNAW---------WVKADQVSKTAPTGEFLTTG 347
                                      +NAW         WV ADQVSK APTGEFL  G
Sbjct: 628 IKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAG 687

Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           SFMI+GKKN  PP  L +G   LF++ E S
Sbjct: 688 SFMIKGKKNELPPATLVIGFGLLFRISEES 717


>gi|326434920|gb|EGD80490.1| hypothetical protein PTSG_13144 [Salpingoeca sp. ATCC 50818]
          Length = 947

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 124/376 (32%), Positives = 186/376 (49%), Gaps = 63/376 (16%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P  ++Q +G   +EF+SF  AVD +FS  ESHK++++A+QQER A +KLE  +R H  R+
Sbjct: 111 PFPLKQFEGRTFKEFDSFDVAVDTYFSEIESHKLEMRALQQERAARQKLEQARRSHHDRV 170

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
              ++ +L D+ KA LI  N E V+ AI  I + + N L W +IE  V++++   DPVA+
Sbjct: 171 KGYKEARLEDEYKATLIELNHELVNEAIDVINKMVGNHLDWREIEELVQESRVRGDPVAN 230

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
            I +LKL  N I +    H    ++  +      + D   D  +  K +LV++DL  +A+
Sbjct: 231 AISKLKLKKNAIVM----HLTEPSMGGADDDSWSDEDEDEDEDDNTKGALVEIDLAETAH 286

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
            NA++  + K++   K++K + STE+AL+S EK+    LK  Q    I+K+R   WFEKF
Sbjct: 287 GNARKLHERKKTIRSKEEKALASTEQALRSVEKRAMDRLKKTQITATISKSRAPLWFEKF 346

Query: 304 YWFISSENYLGNVAWD---------AKVVTNAWWVKADQ------VSKTAPTGEF----- 343
            WFISSENYL     D           +  +  +V AD       V K + TGE      
Sbjct: 347 NWFISSENYLVLAGRDRLQNEALVRKHLTQHDLYVHADMNGASSVVVKNSNTGEIPPKTL 406

Query: 344 ---------------------------------------LTTGSFMIRGKKNFFPPCQLA 364
                                                  L  GSF I G K+F P  QLA
Sbjct: 407 SEAATFAVAHSPAWENNQQADAWWVHANQVEKTSSEGKPLGAGSFRITGAKHFIPIRQLA 466

Query: 365 MGISFLFKLEESSISR 380
           +  + LFK+++ S  R
Sbjct: 467 LAYAILFKVDDESAKR 482


>gi|71001140|ref|XP_755251.1| DUF814 domain protein [Aspergillus fumigatus Af293]
 gi|66852889|gb|EAL93213.1| DUF814 domain protein, putative [Aspergillus fumigatus Af293]
          Length = 1116

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 141/430 (32%), Positives = 212/430 (49%), Gaps = 92/430 (21%)

Query: 31  GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFP---IQE 77
           GY+V K++KRP++E    A            +FHP   +P       RQ +G P   I E
Sbjct: 265 GYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHP--FRP-------RQFEGNPEVKILE 315

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F +F A VDE+FS+ E+ K++ +  ++E  A +KL+ V+++HE RL AL++ Q +   KA
Sbjct: 316 FSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRKA 375

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
             I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA IIK  LKL  N IT
Sbjct: 376 AAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTIT 435

Query: 197 LLLRVHKERSNIKDS--WVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDL 252
           L+L       +  D   W    + ++     +  K   +  +D+DL LS +ANA ++++ 
Sbjct: 436 LVLGEASREQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPWANATQYYEQ 495

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
           K+ AA K+QKT QS+ KALKS EKK  + LK    Q    +  ARK +WFEKF +FISSE
Sbjct: 496 KKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFLFFISSE 555

Query: 311 NYL--------------------GNV--------------------------------AW 318
            YL                    G++                                A 
Sbjct: 556 GYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQAG 615

Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           +  V T++ W          V A+QV+KT  TG  L TG F I+G+KNF  P QL +G +
Sbjct: 616 NLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNFLAPSQLVLGFA 674

Query: 369 FLFKLEESSI 378
            +F++ ++S+
Sbjct: 675 VMFQISKNSL 684


>gi|340516439|gb|EGR46688.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1078

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 143/470 (30%), Positives = 215/470 (45%), Gaps = 94/470 (20%)

Query: 2   VSNEDVSS---DIHKCGPTLHVLVSNLN------------VSSQGYLVQKKEKRPNSEDY 46
            +N D+S+   DI    P L  LV++L+             + +GY+  K+   P+S   
Sbjct: 224 ANNFDISAKPADIVDNEPLLDALVNHLSEARDVVENIIASSTCKGYIFAKRRTTPSSAPD 283

Query: 47  IVANMEFHPMILQPGSTPSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQ 103
                + H  +L     P V ++ +  P   + EF+ +   VDEFFS+ E  K++ +   
Sbjct: 284 DAEQAQKHEGLLYEDFHPFVPQKFKNDPSIQVLEFDGYNRTVDEFFSSLEGQKLESRLTG 343

Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
           +E  A KKLE  +++   R+  L+  Q ++  KA  I  N E V  A+ A+   +A  + 
Sbjct: 344 REEAARKKLEAARQEQAKRIQGLQDAQAMNYRKAAAIEANVERVQEAMDAVNGLLAQGMD 403

Query: 164 WEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWV---LHHDNA 219
           W DI   +++ ++  +PVA II   LKL  N ITLLL       +  +         D++
Sbjct: 404 WVDIGKLIEREKKRQNPVAEIISLPLKLADNTITLLLAEEAFDEDEAEEEEDNPFETDDS 463

Query: 220 DPFADSS--------ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
           D   D          +T K   VD+ L++S ++NA+ +++ +RSAA KQ+KT Q   KAL
Sbjct: 464 DSEEDQGGKATSKDKKTDKLLTVDIVLNMSPWSNAREYYEERRSAAMKQEKTQQQATKAL 523

Query: 272 KSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL---------------- 313
           KS E+K  + LK    Q    +   RK  WFEKF WFISS+ YL                
Sbjct: 524 KSTEQKIAEDLKKGLKQEKALLQPIRKQMWFEKFLWFISSDGYLVLGGKDPQQSEILYRR 583

Query: 314 ----GNVAWDAKV---------------------------------VTNAW--------- 327
               G+V   A +                                  + AW         
Sbjct: 584 YLRKGDVYCHADIRGAANIVIKNNPNMPDAPIPPATLSQAGSLSVCTSEAWDSKAGMGAW 643

Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           WV ADQVSKT P+G+ L  G+F I+GKKN+ PP QL +G+ F FK+ E S
Sbjct: 644 WVNADQVSKTTPSGDILPAGTFTIQGKKNYLPPTQLLLGLGFAFKISEQS 693


>gi|330929686|ref|XP_003302734.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
 gi|311321722|gb|EFQ89181.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
          Length = 1133

 Score =  184 bits (466), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 136/409 (33%), Positives = 199/409 (48%), Gaps = 90/409 (22%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP   Q         ++  +   EF+ F  AVDEFFS+ E  K++ K  ++E+ A KK
Sbjct: 292 DFHPFRPQ-------QFENSDYTFLEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKK 344

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  +++HE R+  L+Q Q ++  KAE I+ N   V  A  A+   I   + W DI   +
Sbjct: 345 LEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIARLI 404

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVH--KERSNIKD-----SWVLHHDNADPFA 223
           ++ Q   + VA +IK  LKLN N ITLLL     +E   ++D     S V    + +   
Sbjct: 405 EREQNSGNAVAQLIKLPLKLNENTITLLLDETNWEEGQEVEDEGNETSSVSEDSDEEAAG 464

Query: 224 DSSETKK----------PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
           +    KK          P L +D+DL L+A+AN+  +FD K++AA K+ +T+Q++ +ALK
Sbjct: 465 EEDGAKKKSAPVKVSARPQLAIDIDLSLTAWANSTEYFDQKKTAANKEDRTLQASTRALK 524

Query: 273 SAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL----------------- 313
           S EKK  + LK    Q    +   RK +WFEKF +FISS+ YL                 
Sbjct: 525 SHEKKVAEDLKKGLKQEKEVLRPVRKQHWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRF 584

Query: 314 ---GNV-------------------AWDAKV--------------VTNAWWVKA------ 331
              G+V                     DA +               ++AW  KA      
Sbjct: 585 LRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLCICTSDAWDSKAVMSAWW 644

Query: 332 ---DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
              DQVSKT  TGEFL  G F ++GKK F PP QL +G++ +F++ ESS
Sbjct: 645 VRSDQVSKTGQTGEFLPAGMFNVKGKKEFLPPAQLVVGLAVMFEISESS 693


>gi|449299546|gb|EMC95559.1| hypothetical protein BAUCODRAFT_71160 [Baudoinia compniacensis UAMH
           10762]
          Length = 1052

 Score =  184 bits (466), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 133/407 (32%), Positives = 199/407 (48%), Gaps = 88/407 (21%)

Query: 52  EFHPM-ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALK 110
           +FHP    Q  S PS+          E E F    DEFFS+ E  K++ +  ++E +A +
Sbjct: 295 DFHPFKPAQLASDPSIT-------FLEHEGFNKTCDEFFSSLEGQKLESRLQEREDNAKR 347

Query: 111 KLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170
           K+E  +++   R+  L+  Q ++  KA+ I  N E V+ A+ A+   IA  + W DI   
Sbjct: 348 KIEQARQEQAKRIDGLQHVQELNVRKAQAIEANVERVEEAVAAVNGLIAQGMDWMDIGRL 407

Query: 171 VKQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSW------------VLHHD 217
           ++  Q  ++ VA +IK  LKL  N +TLLL    E + +++ +                D
Sbjct: 408 IENEQSRHNAVAEMIKLPLKLYENTVTLLL---SEYAGLEEDYDDMADETESEESEDEAD 464

Query: 218 NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
              P   S   +K   VDVDL LS ++NA++++D KR+AA+KQ++T Q+++KALKS E+K
Sbjct: 465 TQAPRHTSKPEEKRLAVDVDLALSPWSNARQYYDQKRTAAEKQERTAQASQKALKSTEQK 524

Query: 278 TKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GN 315
               LK    Q    +   RK  +FEKF +FISS+ YL                    G+
Sbjct: 525 VMADLKKGLKQEKDVLRPVRKQMYFEKFNYFISSDGYLVLAGRDAQQNEMLYRRYLKKGD 584

Query: 316 V--------------------------------AWDAKVVTNAWW----------VKADQ 333
           V                                A +  V T+  W          V ++Q
Sbjct: 585 VYIHADLHGAASVIVKNDPQTPEAPIPPSTLGQAGNLAVCTSTAWDSKAVMSAWWVGSEQ 644

Query: 334 VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           VSKTAPTGE+LTTG F+IRGKKN+ PP QL +G + LF++ E S +R
Sbjct: 645 VSKTAPTGEYLTTGGFVIRGKKNYLPPAQLLLGFAVLFRISEESKAR 691


>gi|361131825|gb|EHL03460.1| putative Nuclear export mediator factor Nemf [Glarea lozoyensis
           74030]
          Length = 1063

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 135/438 (30%), Positives = 207/438 (47%), Gaps = 86/438 (19%)

Query: 22  VSNLNVSSQGYLVQKKEKR------PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           V++  V++ G+++ KK++       P      V   +FHP        P+          
Sbjct: 257 VTSCEVAT-GFIIAKKKEGYEENSDPEHSSKNVLYDDFHPF------RPAQFESDPATVF 309

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
            ++E F   VDEFFS+ E  +++ K  ++E +A +K++  ++D E RL  L+  Q +++ 
Sbjct: 310 LQYEGFNKTVDEFFSSIEGQRLESKLEERELNAQRKIQAARQDQERRLDGLQAVQSLNER 369

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINH 194
           KA  I  N E V  A+ A+   +A  + W +I   ++  ++ ++PVAS+IK  LKL  N 
Sbjct: 370 KASAIQANVERVQEAMDAVNGLVAQGMDWVEIGKLIEVEKKRSNPVASMIKLPLKLEENT 429

Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS--------LVDVDLDLSAYANA 246
           ITLLL       +   ++      +D   + ++ K+P          VD+DL L+ + NA
Sbjct: 430 ITLLLDEEVFDEDEDSAYETDDAPSDSEDEVTKQKEPKEKGVEKRLTVDIDLGLTPWKNA 489

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFY 304
           + +FD KR AA K+QKT++S+ KALKS E K    LK    Q    +   R++ WFEKF 
Sbjct: 490 REYFDEKRQAATKEQKTLESSTKALKSQEAKIAHDLKKGLQQEKAVLRPVRRLMWFEKFI 549

Query: 305 WFISSENYLGNVAWDAK---------------------------VVTN------------ 325
           WFISS+ YL     DA+                           VV N            
Sbjct: 550 WFISSDGYLVLGGKDAQQNEMLYKKYMKKGDAFLHADIQGAATVVVRNDPRTPDAPIPPS 609

Query: 326 --------------AWWVKAD---------QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
                         AW  KA          Q+SK AP+G+FL  GSF + GKKNF PP Q
Sbjct: 610 TLSQAGSLVVSCSVAWDSKAGMSAWWASATQISKAAPSGDFLPPGSFSVNGKKNFLPPSQ 669

Query: 363 LAMGISFLFKLEESSISR 380
           L +G   +F++ ESS S+
Sbjct: 670 LLLGFGVIFRISESSKSK 687


>gi|67539818|ref|XP_663683.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
 gi|40738864|gb|EAA58054.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
          Length = 1588

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 135/431 (31%), Positives = 207/431 (48%), Gaps = 89/431 (20%)

Query: 31  GYLVQKKEKRP-----------NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           G++V K++ RP           +     +   +FHP        P       GF I E+ 
Sbjct: 265 GFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPF------KPRQFEGKDGFTILEYP 318

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           S  A VDE+FS+ ES K++ +  ++E  A KKL++++ +HE R+ ALEQ Q +   KA  
Sbjct: 319 SMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKASA 378

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
           I +N + V  A+ A+   +A  + W +I   V+  Q+  +PVAS+IK  LKL+ N ITLL
Sbjct: 379 IQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITLL 438

Query: 199 LR-VHKERSNIKDSWVLHHDNADPF----ADSSETKKPS--LVDVDLDLSAYANAKRFFD 251
           LR    E   +++ +                +S  KKP    +D+DL LS +ANA ++++
Sbjct: 439 LREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYYE 498

Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEKFYWFISS 309
            K+ AA K +KT QS+ KALKS E+K +  LK    Q    +  ARK +WFEKF +F+SS
Sbjct: 499 QKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQVLRPARKPFWFEKFLFFVSS 558

Query: 310 ENYL--------------------GNV--------------------------------A 317
           E YL                    G+V                                A
Sbjct: 559 EGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKNKPGALSSSISPTTLSQA 618

Query: 318 WDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
            +  V T+  W          V A QVSKT+  G+ L  G F+++G+KNF  P QL +G 
Sbjct: 619 GNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQLVLGF 678

Query: 368 SFLFKLEESSI 378
           + ++++ + S+
Sbjct: 679 AVMWQISKGSL 689


>gi|406864313|gb|EKD17358.1| serologically defined colon cancer antigen 1 [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 1052

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 142/431 (32%), Positives = 211/431 (48%), Gaps = 92/431 (21%)

Query: 29  SQGYLVQKKEK---RPNSEDY---IVANMEFHPMILQPGSTPSVMRQHQGFPIQ---EFE 79
           ++GY++ KK++     + ED     V   +FHP   +P       RQ +  P     EF+
Sbjct: 263 AKGYIIAKKKEGYEEASPEDQARKFVIYEDFHP--FRP-------RQFENDPATVFLEFQ 313

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
            F    D+FFS+ E  K++ +  ++E+ A +K+E  ++D   RL  L++ Q ++  KA  
Sbjct: 314 GFNKTADQFFSSIEGQKLESRLQEREQMAKRKIEAARQDQAKRLGGLQEVQELNIRKAGA 373

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
           +  N E V  A+ A+   +A  + W +I   V+  Q+ N+PVASIIK  LKL  N I+LL
Sbjct: 374 LQANAERVQEAMDAVNGLVAQGMDWVEIGKLVEIEQKRNNPVASIIKLPLKLQENTISLL 433

Query: 199 LRVHKERSNIKDSWVLHHDNADPFADSSETKKPS--------LVDVDLDLSAYANAKRFF 250
           L   ++  + + ++    D +D   D +  K+P          +DV+L LS +ANA+ ++
Sbjct: 434 LDEEEDADDDESNYETDSDVSDS-EDEAPKKEPKQKTVEKRLTIDVNLALSPWANAREYY 492

Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
           D KR+AA+K+QKT+QS+ KALKS E K    L+    Q    +   R+  WFEKF WFIS
Sbjct: 493 DQKRTAAEKEQKTLQSSTKALKSQEAKIAHDLRKGLKQEKAVLRPVRRQMWFEKFTWFIS 552

Query: 309 SENYL--------------------GNVAWDAKV-------------------------- 322
           S+ YL                    G+V   A+V                          
Sbjct: 553 SDGYLVLAGKDPQQKETLYRRYLKKGDVYVHAEVQGAASVVIRNNPKTPDAPIPPSTLSQ 612

Query: 323 -------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
                   ++AW         WV ADQVSK A TGEFL  GSF I+GKKNF PP  L +G
Sbjct: 613 AGTLSISCSSAWEAKAGMSAWWVNADQVSKAASTGEFLPAGSFNIKGKKNFLPPAVLLLG 672

Query: 367 ISFLFKLEESS 377
              +F + E S
Sbjct: 673 FGVIFLISEES 683


>gi|320169195|gb|EFW46094.1| serologically defined colon cancer antigen 1 [Capsaspora owczarzaki
           ATCC 30864]
          Length = 1151

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 117/331 (35%), Positives = 173/331 (52%), Gaps = 51/331 (15%)

Query: 21  LVSNLNVSSQ--GYLVQKKEKRPNSEDY------------------------IVANMEFH 54
           L+ ++  SS+  GY++Q+KE R  + +                         I    E+H
Sbjct: 240 LLIDVGASSEVPGYIIQRKESRATAANKNKGVGDEQAAVAAALASASGDASDIFVFDEYH 299

Query: 55  PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
           P + +         QH+  P+  F +F  AVDEF+S  E  ++D+K +  ER+ LKKLE 
Sbjct: 300 PFLFE---------QHKARPVVHFPTFDRAVDEFYSRIEGQRLDMKHIGDERNVLKKLEK 350

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
            K + E +L  L  TQ  +  + +LI++NQ     A+L IR  +A+ + W +I   V+ A
Sbjct: 351 FKLEQERKLVGLRTTQEEEALRGQLILDNQTK---ALLVIRSALAHAVDWSEISDMVEAA 407

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL- 233
           +   DPVASII +LKL+ N ITL+L    +    ++      + AD    S         
Sbjct: 408 KEQKDPVASIIHKLKLDSNIITLML-TSPDAVEEEEDDNSEDEGADQAVSSKGKGSAKGG 466

Query: 234 -----------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL 282
                      VD+D+  S +ANA+ +F  K+ AA K+Q+TI ++ KALKSAE++TKQ L
Sbjct: 467 KKGHHQQTRMKVDIDITASVHANAESYFSRKKQAAAKEQRTIDASSKALKSAERQTKQQL 526

Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           K V     +NK RKV WFEKF WFI+SENYL
Sbjct: 527 KQVAVKATVNKVRKVLWFEKFLWFITSENYL 557



 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 45/64 (70%), Positives = 56/64 (87%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWDAKV+T+AWWV A+QVSKTAPTGE+LTTGSFMIRG+KNF PP  L +G  FL++L+ES
Sbjct: 621 AWDAKVITSAWWVAANQVSKTAPTGEYLTTGSFMIRGRKNFLPPSPLILGFGFLYRLDES 680

Query: 377 SISR 380
            I++
Sbjct: 681 CIAK 684


>gi|398396540|ref|XP_003851728.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
 gi|339471608|gb|EGP86704.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
          Length = 1060

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 130/410 (31%), Positives = 194/410 (47%), Gaps = 102/410 (24%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        P+ +   +    +EF  F   VDEFFS+ E  K++ K  ++E  A ++
Sbjct: 293 DFHPF------KPAQLTAEETIVFREFSGFNKTVDEFFSSIEGQKLESKLQEREDHAKRR 346

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           +E  + +   R+  L++ Q ++  KA+ I  N E V+ A  A+   IA  + W DI   +
Sbjct: 347 IEQAREEQAKRIDGLQEVQELNIRKAQAIEANVERVEEATAAVNGLIAQGMDWVDIGKLI 406

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFAD------ 224
           +  Q+ ++ VA +IK  LKL+ N +TLLL           S +   D  D  A+      
Sbjct: 407 ENEQKRHNAVAELIKLPLKLHENTVTLLL-----------SELDAADGGDDEANETDSEP 455

Query: 225 -------------SSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
                          E K+ + +D+DL  S +ANA++++D KRSAA KQ+KT Q+++KAL
Sbjct: 456 DDSDDEDAAPAAKGGEDKRLT-IDIDLAASGWANARQYYDQKRSAATKQEKTAQASQKAL 514

Query: 272 KSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL---------------- 313
           KS E++    LK    Q    +   RK +WFEKF +F+SS+ YL                
Sbjct: 515 KSTEQRVMADLKKGLKQEKDVLRPVRKQFWFEKFIYFLSSDGYLVLAGKDAQQNEILYRR 574

Query: 314 ----GNV--------------------------------AWDAKVVTNAWW--------- 328
               G+V                                A +  V T++ W         
Sbjct: 575 YLKKGDVYVNADLQGAASVIIKNNPATPEAPIPPSTLSQAGNLAVCTSSAWESKAVMSAW 634

Query: 329 -VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
            V ADQVSKTAPTGE+LT G F+IRGKKN  PP QL +G   +F++ E S
Sbjct: 635 WVNADQVSKTAPTGEYLTNGGFVIRGKKNHLPPAQLLLGFGVMFQISEES 684


>gi|358369883|dbj|GAA86496.1| DUF814 domain protein [Aspergillus kawachii IFO 4308]
          Length = 1157

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 140/435 (32%), Positives = 210/435 (48%), Gaps = 92/435 (21%)

Query: 28  SSQGYLVQKKEKRPNSE-------------DYIVANMEFHPMILQPGSTPSVMRQHQGFP 74
           S  GY+V K++ RP+++              Y+  +  FHP        P       G  
Sbjct: 312 SHPGYIVAKEDTRPSADSPAQGEEDAARKPGYLYED--FHPF------KPKQFEGKPGVT 363

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
           I E+ SF A VDE+FS+ E+ K++ +  ++E  A +KLE V+++H  R+ AL++ Q +  
Sbjct: 364 ILEYPSFNATVDEYFSSIETQKLESRLTEREETAKRKLEAVRQEHAKRIGALKEVQELHI 423

Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNIN 193
            KA  I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA+IIK  LKL  N
Sbjct: 424 RKAGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYEN 483

Query: 194 HITLLLRVHKERSNIKDSW--VLHHDNADPFADSSETKKPS----LVDVDLDLSAYANAK 247
            ITL+L    E  +  +        ++ D   ++++ +K S     +D+DL LS +ANA 
Sbjct: 484 TITLMLGESGEEQDEGEDLFSDDESESEDEQEEAAKAQKQSNNMLTIDIDLGLSPWANAT 543

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
           ++++ K+ AA K+QKT QS+ KALKS EKK  Q LK    Q    +  ARK +WFEKF +
Sbjct: 544 QYYEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLF 603

Query: 306 FISSENYL--------------------GNV----------------------------- 316
           FISSE YL                    G+V                             
Sbjct: 604 FISSEGYLVLGGRDAMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSSNAPIPPST 663

Query: 317 ---AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
              A +  V T++ W          V A QVSKTA  G  L TG F+I+G+KNF  P QL
Sbjct: 664 LSQAGNLCVATSSAWDSKAIMSAYWVTASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQL 723

Query: 364 AMGISFLFKLEESSI 378
            +G   +F++ + S+
Sbjct: 724 VLGFGVMFQVSKESL 738


>gi|350636898|gb|EHA25256.1| hypothetical protein ASPNIDRAFT_49657 [Aspergillus niger ATCC 1015]
          Length = 1515

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 140/433 (32%), Positives = 204/433 (47%), Gaps = 88/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSEDYIVANME-----------FHPMILQPGSTPSVMRQHQGFPIQ 76
           S  GY+V K + RP+++       E           FHP        P       G  I 
Sbjct: 262 SHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPF------KPKQFEGKPGVTIL 315

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E+ SF A VDE+FS+ E+ K++ +  ++E  A KKL+ V+++H  R+ AL++ Q +   K
Sbjct: 316 EYPSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRK 375

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
           A  I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA+IIK  LKL  N I
Sbjct: 376 AGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTI 435

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADS--SETKKPS----LVDVDLDLSAYANAKRF 249
           TL+L    E  +  +      D+         ++ +K S     +D+DL LS +ANA ++
Sbjct: 436 TLMLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQY 495

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
           ++ K+ AA K+QKT QS+ KALKS EKK  Q LK    Q    +  ARK +WFEKF +FI
Sbjct: 496 YEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLFFI 555

Query: 308 SSENYL--------------------GNV------------------------------- 316
           SSE YL                    G+V                               
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615

Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
            A +  V T++ W          V A QVSKTA  G  L TG F+I+G+KNF  P QL +
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675

Query: 366 GISFLFKLEESSI 378
           G   +F++ + S+
Sbjct: 676 GFGVMFQVSKESL 688


>gi|367042422|ref|XP_003651591.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
 gi|346998853|gb|AEO65255.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
          Length = 1094

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 137/431 (31%), Positives = 209/431 (48%), Gaps = 80/431 (18%)

Query: 28  SSQGYLVQKKEKRPNSEDYIVANMEFHPM----ILQPGSTPSVMRQHQGFP---IQEFES 80
           S++GY++ K+   P + D +    E        +L     P + +Q +  P   +  F+ 
Sbjct: 257 SARGYIIAKRN--PRAADQVADGEETTKEKAQNLLYEDFQPFLPKQFEDDPTCQVLSFDG 314

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F+  VDEFFS+ E  K++ +  ++E  A +KLE  +RD   R+  L++ QL++  KA  I
Sbjct: 315 FSKTVDEFFSSLEGQKLESRLQEREATAKRKLEAARRDQAQRIEGLQEAQLLNLRKAAAI 374

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N E V  A+ A+   +   + W DI   V++ QR ++PVA IIK  ++L  + ITLLL
Sbjct: 375 EANVERVQEAMDAVNGLLQQGMDWVDINKLVEREQRLHNPVAEIIKLPMRLEESIITLLL 434

Query: 200 RVHKERSNIKDSWVLHH---DNADPFADSSETKKPS---LVDVDLDLSAYANAKRFFDLK 253
              +E +  + +    +   + A     + + K P     +D++L LS + NA+ +++ K
Sbjct: 435 GEEEEEAEAEANMDFDYDTDEEAAEETAAGKAKGPDKRLAIDINLKLSPWNNAREYYEQK 494

Query: 254 RSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSEN 311
           R+AA KQQKTIQ +E AL++AEKK  + LK    Q    +   RK  WFEKF WFISS+ 
Sbjct: 495 RTAADKQQKTIQQSEIALRNAEKKISEDLKKGLKQEKPVLQLIRKQMWFEKFLWFISSDG 554

Query: 312 YLGNVAWDAK-------------------------------------------------- 321
           YL     DA+                                                  
Sbjct: 555 YLVLGGRDAQQNEILYKRYLRKGDVYVHADMHGAPSVIIKNNPKTPDAPIPPSTLAQAGS 614

Query: 322 ---VVTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
                ++AW  KA         DQVSK+APTGE+L  GSFM+RGK+N  PP  L +G   
Sbjct: 615 LSVCCSSAWDSKAVMGAWWVNADQVSKSAPTGEYLPAGSFMVRGKRNALPPALLTLGFGL 674

Query: 370 LFKLEESSISR 380
           +FK+ E S S+
Sbjct: 675 MFKISEDSKSK 685


>gi|225563152|gb|EEH11431.1| DUF814 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 1158

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 137/438 (31%), Positives = 210/438 (47%), Gaps = 85/438 (19%)

Query: 20  VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
           V+  N+N S      + GY+V K E +   +  +          VA ++FHP        
Sbjct: 233 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTDPSKSSNVAYIDFHPF------E 286

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P       G  I  F++F  AVDE+FS+ ES K++ +  ++E  A +KLE  K D + R+
Sbjct: 287 PKQFESEPGTSILRFDTFNKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAKTDQDKRV 346

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L++ Q +   KA+ I  N   V+ A+ A+   IA  + W +I   ++  Q   +PVA 
Sbjct: 347 GVLKEAQELHIRKAQAIEANLLRVEEAVNAVNGLIAQGMDWGEIARLIEMEQSRQNPVAK 406

Query: 184 IIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFAD--------------SSET 228
           +IK  LKL  N +TLLL    E     D      +  +                  S +T
Sbjct: 407 VIKLPLKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSQKT 466

Query: 229 KKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLK 283
           ++P L +D+DL +S +ANA+++++ K++AA K++KT+ ST+KA+KS EKK     KQ LK
Sbjct: 467 RQPLLSIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKKAIKSTEKKVAADLKQALK 526

Query: 284 DVQTMTNINK----ARKVYWFEKFYW--------FISSE--------------------- 310
             + +    +     R V   E  Y         F+ ++                     
Sbjct: 527 QEKPVLRPTRTPFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPIIVKNKPGTPDAPIP 586

Query: 311 ----NYLGNV------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
               +  GN+      AWD+K V  AWWV ADQVSKT P GE+L TG F+I G+KN  PP
Sbjct: 587 PGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFVICGEKNQLPP 646

Query: 361 CQLAMGISFLFKLEESSI 378
            QL +G + +F++   SI
Sbjct: 647 AQLLLGFAVMFQISGESI 664


>gi|259479735|tpe|CBF70228.1| TPA: DUF814 domain protein, putative (AFU_orthologue; AFUA_2G09170)
           [Aspergillus nidulans FGSC A4]
          Length = 1100

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 135/431 (31%), Positives = 207/431 (48%), Gaps = 89/431 (20%)

Query: 31  GYLVQKKEKRP-----------NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           G++V K++ RP           +     +   +FHP        P       GF I E+ 
Sbjct: 265 GFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPF------KPRQFEGKDGFTILEYP 318

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           S  A VDE+FS+ ES K++ +  ++E  A KKL++++ +HE R+ ALEQ Q +   KA  
Sbjct: 319 SMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKASA 378

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
           I +N + V  A+ A+   +A  + W +I   V+  Q+  +PVAS+IK  LKL+ N ITLL
Sbjct: 379 IQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITLL 438

Query: 199 LR-VHKERSNIKDSWVLHHDNADPF----ADSSETKKPS--LVDVDLDLSAYANAKRFFD 251
           LR    E   +++ +                +S  KKP    +D+DL LS +ANA ++++
Sbjct: 439 LREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYYE 498

Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEKFYWFISS 309
            K+ AA K +KT QS+ KALKS E+K +  LK    Q    +  ARK +WFEKF +F+SS
Sbjct: 499 QKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQVLRPARKPFWFEKFLFFVSS 558

Query: 310 ENYL--------------------GNV--------------------------------A 317
           E YL                    G+V                                A
Sbjct: 559 EGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKNKPGALSSSISPTTLSQA 618

Query: 318 WDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
            +  V T+  W          V A QVSKT+  G+ L  G F+++G+KNF  P QL +G 
Sbjct: 619 GNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQLVLGF 678

Query: 368 SFLFKLEESSI 378
           + ++++ + S+
Sbjct: 679 AVMWQISKGSL 689


>gi|317033383|ref|XP_001395552.2| hypothetical protein ANI_1_620104 [Aspergillus niger CBS 513.88]
          Length = 1108

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 140/433 (32%), Positives = 204/433 (47%), Gaps = 88/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSEDYIVANME-----------FHPMILQPGSTPSVMRQHQGFPIQ 76
           S  GY+V K + RP+++       E           FHP        P       G  I 
Sbjct: 262 SHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPF------KPKQFEGKPGVTIL 315

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E+ SF A VDE+FS+ E+ K++ +  ++E  A KKL+ V+++H  R+ AL++ Q +   K
Sbjct: 316 EYPSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRK 375

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
           A  I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA+IIK  LKL  N I
Sbjct: 376 AGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTI 435

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADS--SETKKPS----LVDVDLDLSAYANAKRF 249
           TL+L    E  +  +      D+         ++ +K S     +D+DL LS +ANA ++
Sbjct: 436 TLMLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQY 495

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
           ++ K+ AA K+QKT QS+ KALKS EKK  Q LK    Q    +  ARK +WFEKF +FI
Sbjct: 496 YEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLFFI 555

Query: 308 SSENYL--------------------GNV------------------------------- 316
           SSE YL                    G+V                               
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615

Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
            A +  V T++ W          V A QVSKTA  G  L TG F+I+G+KNF  P QL +
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675

Query: 366 GISFLFKLEESSI 378
           G   +F++ + S+
Sbjct: 676 GFGVMFQVSKESL 688


>gi|119480773|ref|XP_001260415.1| hypothetical protein NFIA_084700 [Neosartorya fischeri NRRL 181]
 gi|119408569|gb|EAW18518.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 1116

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 139/430 (32%), Positives = 210/430 (48%), Gaps = 92/430 (21%)

Query: 31  GYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGSTPSVMRQHQGFP---IQE 77
           GY++ K++KRP +E             +   +FHP   +P       RQ +G P   I E
Sbjct: 265 GYIIAKEDKRPTAESTADTGDPSQKAGLLYEDFHP--FRP-------RQFEGKPEVTILE 315

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F +F A VDE+FS+ E+ K++ +  ++E  A +KL+ V+++HE RL AL++ Q +   KA
Sbjct: 316 FSTFNATVDEYFSSLETQKLESRLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRKA 375

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
             I +N   V   + A+   IA  + W +I   ++  Q   +PVA IIK  LKL  N IT
Sbjct: 376 AAIEDNVYRVQEVMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTIT 435

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSET--KKPSL--VDVDLDLSAYANAKRFFDL 252
           L+L    E  +  D         +  ++  E   K P +  +D+DL LS +ANA ++++ 
Sbjct: 436 LVLGEASEEQDAADDLFSDESEEESESEEQEAARKAPEMLTIDIDLGLSPWANATQYYEQ 495

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
           K+ AA K+QKT QS+ KALKS EKK  + LK    Q    +  ARK +WFEKF +FISSE
Sbjct: 496 KKMAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFLFFISSE 555

Query: 311 NYL--------------------GNV--------------------------------AW 318
            YL                    G++                                A 
Sbjct: 556 GYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQAG 615

Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           +  V T++ W          V A+QV+KT  TG  L TG F ++G+KNF  P QL +G +
Sbjct: 616 NLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEVKGEKNFLAPSQLVLGFA 674

Query: 369 FLFKLEESSI 378
            +F++ + S+
Sbjct: 675 VMFQISKESL 684


>gi|134080270|emb|CAK97173.1| unnamed protein product [Aspergillus niger]
          Length = 1180

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 140/433 (32%), Positives = 204/433 (47%), Gaps = 88/433 (20%)

Query: 28  SSQGYLVQKKEKRPNSEDYIVANME-----------FHPMILQPGSTPSVMRQHQGFPIQ 76
           S  GY+V K + RP+++       E           FHP        P       G  I 
Sbjct: 334 SHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPF------KPKQFEGKPGVTIL 387

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E+ SF A VDE+FS+ E+ K++ +  ++E  A KKL+ V+++H  R+ AL++ Q +   K
Sbjct: 388 EYPSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRK 447

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
           A  I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA+IIK  LKL  N I
Sbjct: 448 AGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTI 507

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADS--SETKKPS----LVDVDLDLSAYANAKRF 249
           TL+L    E  +  +      D+         ++ +K S     +D+DL LS +ANA ++
Sbjct: 508 TLMLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQY 567

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFI 307
           ++ K+ AA K+QKT QS+ KALKS EKK  Q LK    Q    +  ARK +WFEKF +FI
Sbjct: 568 YEQKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQVLRPARKTFWFEKFLFFI 627

Query: 308 SSENYL--------------------GNV------------------------------- 316
           SSE YL                    G+V                               
Sbjct: 628 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 687

Query: 317 -AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAM 365
            A +  V T++ W          V A QVSKTA  G  L TG F+I+G+KNF  P QL +
Sbjct: 688 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 747

Query: 366 GISFLFKLEESSI 378
           G   +F++ + S+
Sbjct: 748 GFGVMFQVSKESL 760


>gi|378733722|gb|EHY60181.1| translation factor [Exophiala dermatitidis NIH/UT8656]
          Length = 1147

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 140/429 (32%), Positives = 207/429 (48%), Gaps = 81/429 (18%)

Query: 29  SQGYLVQKKEKRPNSEDYIVANMEFHPM-----ILQPGSTPSVMRQHQGFP---IQEFES 80
           S+GY++ K  K P+SE    A  +  P      +L     P    Q +G P   I EF+ 
Sbjct: 264 SKGYIIAKI-KSPSSEAPQEAEAQTQPSSGRDNLLYEDFHPFRPSQFEGKPDLRILEFDG 322

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F   VDEF+S+ ES K++ +  ++E  A KKL+  K + E RL AL+  Q +   KA+ I
Sbjct: 323 FNRTVDEFYSSIESQKLESRLTEREEAARKKLQAAKEEQEKRLGALQHVQELHVRKAQAI 382

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N   V+ A  A+   I   + W DI   ++  Q+  + VA +IK  LKL  N +TLLL
Sbjct: 383 EANTHRVEEACAAVNGLIGQGMDWVDIGKLIENEQKRGNVVAQMIKLPLKLEENTVTLLL 442

Query: 200 ---RVHKERSNIKDSWVLHHDNADPFADSSETKKPS----LVDVDLDLSAYANAKRFFDL 252
                ++E    +       +N+D       TK  +     +D+DL LS +ANA+++++ 
Sbjct: 443 DEPGFNEESEEDEPDETDEEENSDEDTRKKPTKPATDKRLAIDIDLGLSPWANARQYYEQ 502

Query: 253 KRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSE 310
           K++AA K+++T+++   ALKSAE+K +  LK    Q    +  ARK +WFEKF +FISS+
Sbjct: 503 KKNAAVKEKRTLEAATMALKSAERKIEADLKRGLKQEKAALRPARKQFWFEKFLYFISSD 562

Query: 311 NYL--------------------GNV--------------------------------AW 318
            YL                    G+V                                A 
Sbjct: 563 GYLVIGGKDAQQNELLYRRYLKRGDVYVHADLQGASSVIVKNNPRTPDAPIPPSTLSQAG 622

Query: 319 DAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
              V T++ W          V A+QVSKTAP+GE+LTTG F+IRG KN  PP QL +G  
Sbjct: 623 ALTVCTSSAWDSKAVMGAWWVNAEQVSKTAPSGEYLTTGGFIIRGHKNLLPPSQLLLGFG 682

Query: 369 FLFKLEESS 377
            L+ + E S
Sbjct: 683 VLWLISEES 691


>gi|328774280|gb|EGF84317.1| hypothetical protein BATDEDRAFT_8510 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 695

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 126/429 (29%), Positives = 195/429 (45%), Gaps = 111/429 (25%)

Query: 20  VLVSNLNVSSQGYLVQK-------------KEKRPNSEDYIVANMEFHPMILQPGSTPSV 66
           +L S+++   QGY+V                +  P +        EFHP   +  +    
Sbjct: 186 ILSSSIDTPQQGYIVTSDSMVSQQLASSDTAQSSPTTT--FTTYQEFHPYRFEQFNQDRS 243

Query: 67  MRQHQGFPIQ----EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
                  P Q    EF SF  AVDE+FS  ES +++++A Q E  A+KKLENVK+ H+ +
Sbjct: 244 TSLSAELPKQTRFMEFVSFDKAVDEYFSKMESQRLEIRAHQAELAAVKKLENVKKSHQAQ 303

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +   +     +++ A+ I +  E +D+ +  ++  +A+ + W+D+E  VK+   + + +A
Sbjct: 304 IQNFQSNVESNEQYAQAIESRLEDIDSVLRTVQSFLASGMDWKDLEDLVKEETNNGNALA 363

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I   KLN+                                         +D+D+  +A
Sbjct: 364 KMIIGFKLNMEFFK-------------------------------------IDLDIYSTA 386

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFE 301
           YANA+R++  K+ A  KQ KT++ + K +K AE K  Q L  VQ T  +I K RK YWFE
Sbjct: 387 YANARRYYGAKKVAITKQSKTMEQSAKVVKMAEMKIFQHLASVQKTAVSITKIRKPYWFE 446

Query: 302 KFYWFISSENYL-----------------------------------------GNVAWDA 320
           KF WF+SSEN+L                                           V  DA
Sbjct: 447 KFLWFVSSENFLVVGGKDATQSNMLVTRYLKKGDAYVHSDLPGAASVIVKCMQSCVGTDA 506

Query: 321 -------------KVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
                        K++T+A+W +A QVSKT  TG+ L  G+FMIRGKKN+ PP QL  G+
Sbjct: 507 GTMSVCQSRAWDAKIITSAYWAEAHQVSKTTSTGDTLPLGTFMIRGKKNWLPPVQLIYGM 566

Query: 368 SFLFKLEES 376
           + LF+ + S
Sbjct: 567 AMLFQTDHS 575


>gi|358379255|gb|EHK16935.1| hypothetical protein TRIVIDRAFT_10609, partial [Trichoderma virens
           Gv29-8]
          Length = 1079

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 136/436 (31%), Positives = 205/436 (47%), Gaps = 79/436 (18%)

Query: 21  LVSNL--NVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFP---I 75
           +V N+  + + +GY+  KK+  P+S        + H  +L     P V ++ +  P   +
Sbjct: 257 IVENIIASATCKGYIFAKKKTAPSSGPDETDQAQKHEGLLYEDFHPFVPQKFKNDPSIQV 316

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
            EFE +   VDEFFS+ E  K++ +   +E  A KKLE  + +   R+  L+  Q ++  
Sbjct: 317 LEFEGYNRTVDEFFSSLEGQKLESRLSGREEAAKKKLEAARHEQAKRIEGLQDAQAMNLR 376

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINH 194
           KA  I  N E V  A+ A+   +A  + W DI   +++ ++  +PVA II   LKL  N 
Sbjct: 377 KAAAIEANVERVQEAMDAVNGLLAQGMDWVDIGKLIEREKKRQNPVAEIISLPLKLAENT 436

Query: 195 ITLLLRVHKERSNIKDSW-VLHHDNADPFADSSETK--------KPSLVDVDLDLSAYAN 245
           ITLLL   +   +          D++D  A++SE          K   VD+ L+ S ++N
Sbjct: 437 ITLLLAEEEFDEDEAAEDNPFETDDSDSEAEASEVTPTKDKKADKLLTVDIVLNTSPWSN 496

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
           A+ +++ +RSAA K++KT     KALKS E+K  + LK    Q    +   RK  WFEKF
Sbjct: 497 AREYYEERRSAAMKEEKTQLQANKALKSTEQKIAEDLKKGLKQEKALLQPIRKQMWFEKF 556

Query: 304 YWFISSENYL--------------------GNVAWDAKV--------------------- 322
            WFISS+ YL                    G+V   A +                     
Sbjct: 557 IWFISSDGYLVLGGKDPQQSEMLYRRYLRKGDVYCHADIRGAAHIVIKNNPNTPDAPIPP 616

Query: 323 ------------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
                        ++AW         WV ADQVSK+ PTG+ L  G+F I+GKKN+ PP 
Sbjct: 617 ATLSQAGSLSVCTSDAWDSKAGMGGWWVNADQVSKSTPTGDILPAGNFTIQGKKNYLPPT 676

Query: 362 QLAMGISFLFKLEESS 377
           QL +G+ F FK+ E S
Sbjct: 677 QLLLGLGFTFKISEQS 692


>gi|255556494|ref|XP_002519281.1| conserved hypothetical protein [Ricinus communis]
 gi|223541596|gb|EEF43145.1| conserved hypothetical protein [Ricinus communis]
          Length = 1092

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 116/320 (36%), Positives = 177/320 (55%), Gaps = 37/320 (11%)

Query: 16  PTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           P  ++L+ N NV          +  P+SE       EF P++L        MR++     
Sbjct: 300 PEGYILMQNKNVG---------KNHPSSESAFKIYDEFCPILLNQFK----MREYV---- 342

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
            +F++F AA+DEF+S  ES + + +   +E  A++KL  ++ D E R+  L +   +   
Sbjct: 343 -KFDTFDAALDEFYSKIESQRAEQQQKTKENSAIQKLNKIRLDQENRVLTLRKEVDLCVR 401

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
           KAELI  N E VDAAILA+R  +A  +SWED+   VK+ ++  +PVAS+I +L L  N +
Sbjct: 402 KAELIEYNLEDVDAAILAVRVALAKGMSWEDLTRMVKEEKKLGNPVASLIDKLHLERNCM 461

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
           TLLL                 +N D   D  +T     V++DL LSA+ANA+R++++K+ 
Sbjct: 462 TLLL----------------SNNLDDMDDDEKTLPVDKVEIDLALSAHANARRWYEMKKK 505

Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-- 313
              KQ KT+ + EKA K+AE+KT+  L   +++  I+  RKV+WFEKF WFISSENYL  
Sbjct: 506 QESKQGKTVTAHEKAFKAAERKTRLQLSQEKSVATISHMRKVHWFEKFNWFISSENYLVI 565

Query: 314 -GNVAWDAKVVTNAWWVKAD 332
            G  A   +++   +  K D
Sbjct: 566 SGRDAQQNEMIVKRYMSKGD 585



 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 43/63 (68%), Positives = 50/63 (79%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF  P  L MG   LF+L+ES
Sbjct: 627 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLSPHPLIMGFGLLFRLDES 686

Query: 377 SIS 379
           S+ 
Sbjct: 687 SLG 689


>gi|212529000|ref|XP_002144657.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
 gi|210074055|gb|EEA28142.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
          Length = 1117

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 138/434 (31%), Positives = 199/434 (45%), Gaps = 93/434 (21%)

Query: 30  QGYLVQKKEKRPNSEDYIVANM--------EFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
           +GY+V K+  R   ED  V +         +FHP        P       G+ I E++SF
Sbjct: 263 KGYIVAKE--RAKKEDQQVDSTSKENLLFEDFHPF------RPRQFEGKPGYHILEYDSF 314

Query: 82  AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
              VDE+FS+ ES K++ +  + E  A +KLE  + DH+ R  AL+Q Q +   KAE I 
Sbjct: 315 NKTVDEYFSSIESQKLESRLAEHEETAKRKLETARADHQNRAGALKQAQELHIRKAEAIQ 374

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLR 200
            N   V  A  A+   IA  + W +I   ++  Q+ N+PVA  IK  LKL  N ITLLL 
Sbjct: 375 ANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQQRNNPVAQTIKLPLKLYENTITLLLS 434

Query: 201 VHKERSNIKDSWVLHHDNADPFADSSETK------------KPSLVDVDLDLSAYANAKR 248
                   +       +        SE +            +P  +D+DL LS ++NA +
Sbjct: 435 EENTEVEEEQEEFSESEPEVSEDSDSENEIEKDEGPKQKIAEPLAIDIDLSLSPWSNATQ 494

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEKFYWF 306
           +++ KR+AA K+QKTIQS+EKALKS EKK  + LK    Q    +  +RK +WFEK+ +F
Sbjct: 495 YYEQKRTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPSRKPFWFEKYLYF 554

Query: 307 ISSENYL--------------------GNV------------------------------ 316
           ISSE YL                    G+V                              
Sbjct: 555 ISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTPDAPIPPGTL 614

Query: 317 --AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLA 364
             A    V T+  W          V A QVS+T   GE L  G+FM++G+KN+  P Q  
Sbjct: 615 TQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLANGAFMVKGEKNYLAPGQPI 674

Query: 365 MGISFLFKLEESSI 378
           +G + LF++ + S+
Sbjct: 675 LGFAVLFQISKESV 688


>gi|226292279|gb|EEH47699.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 1261

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 129/403 (32%), Positives = 196/403 (48%), Gaps = 82/403 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        P       G  I  F +F  AVDE+FS+ ES K++ +  ++E  A +K
Sbjct: 400 DFHPF------KPKQFENVPGMTILTFNTFNKAVDEYFSSVESQKLEYRLTEREEIARRK 453

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  ++D E R+ AL++ Q +   KA+ I  N   V+ AI A+   IA  + W +I   +
Sbjct: 454 LEAAQKDQENRVGALKEVQELHVRKAQAIEANLLRVEEAINAVNGLIAQGMDWVEIARLI 513

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL----------RVHKERSNIKDSWVLHHDNAD 220
           +  +   +PVA +IK  LKL  N +TLLL              E     +S      N  
Sbjct: 514 EMEKSRQNPVAKVIKLPLKLYENTVTLLLGEPTEDEEPADESDEEEEDSESGDEDGGNKV 573

Query: 221 PFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
               S + ++  L +D+DL +S +ANA+++++ +++AA K++KT++ST+KA+KS EKK  
Sbjct: 574 KLERSKKAQQQLLSIDIDLGISPWANARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVT 633

Query: 280 QTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVA 317
             LK    Q    +   R  +WFEKF +F+SS+ YL                    G+V 
Sbjct: 634 TDLKHALKQEKPILRPTRTPFWFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVY 693

Query: 318 WDAKV---------------------------------VTNAW---------WVKADQVS 335
             A V                                  ++AW         WV ADQVS
Sbjct: 694 VHADVQGATPIFVKNKPGTPDAPIPPGTLSQAGNLCVATSSAWDSKAVMGAWWVNADQVS 753

Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           KTAP+GEF+ TG F+IRG+K+  PP QL +G + +F++ E SI
Sbjct: 754 KTAPSGEFVGTGGFVIRGEKHQLPPAQLLLGYAVMFQISEDSI 796


>gi|169612956|ref|XP_001799895.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
 gi|111061751|gb|EAT82871.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
          Length = 1132

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 132/408 (32%), Positives = 194/408 (47%), Gaps = 86/408 (21%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP   Q         +   +   EF+ F  AVDEFFS+ E  K++ +  ++E+ A KK
Sbjct: 297 DFHPFRPQ-------QFEESDYTFLEFDGFNKAVDEFFSSIEGQKLESRLTEREQQAKKK 349

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  +R+HE RL  L+Q Q V+  KAE I+ N   V  A  A+   I   + W DI + +
Sbjct: 350 LEKARREHEERLGGLQQVQEVNFRKAEAILANVHRVAEATEAVNGLIRQGMDWGDIASLI 409

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL------RVHKERSNIKDSWVLHHDNADPFAD 224
           ++ Q H + VA  IK  LKL+ N ITLLL         ++     ++  +  D+ D    
Sbjct: 410 EREQSHGNAVAETIKLPLKLHENTITLLLDETDFDHAEEDDDEGNETSSVSEDSEDEDEG 469

Query: 225 SSETKKPSL--------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEK 276
             +   P+         +D+DL LS +AN+  ++D K++AA K+ +T+Q++ KALKS EK
Sbjct: 470 PKKKAAPAKPAARPKLAIDIDLALSPWANSTEYYDQKKTAASKEDRTLQASTKALKSHEK 529

Query: 277 KTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK------------- 321
           K  + LK    Q    +   RK  WFEKF +FISS+ YL     DA+             
Sbjct: 530 KVAEDLKKGLKQEKDILRPVRKQQWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRYFRKG 589

Query: 322 --------------VVTN--------------------------AWWVKA---------D 332
                         ++ N                          AW  KA         D
Sbjct: 590 DVYVHADLKGAVPMIIKNKPTTPDAPIPPSTLSQAGHLSVCSSDAWESKAVMSAWWVLAD 649

Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           QVSKT  TGEFL  G F I+GKK + PP QL +G++ +F++ E+S +R
Sbjct: 650 QVSKTGQTGEFLPPGLFNIKGKKEYLPPAQLIVGLAVMFEISEASKAR 697


>gi|240275734|gb|EER39247.1| DUF814 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 1183

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 135/438 (30%), Positives = 210/438 (47%), Gaps = 85/438 (19%)

Query: 20  VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
           V+  N+N S      + GY+V K E +   +  +          VA ++FHP        
Sbjct: 258 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPF------E 311

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P       G  I  F++F+ AVDE+FS+ ES K++ +  ++E  A +KLE  ++D + R+
Sbjct: 312 PKQFESEPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRV 371

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L++ Q +   KA+ I  N   V+ AI A+   IA  + W +I   ++  Q   +PVA 
Sbjct: 372 GVLKEAQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAK 431

Query: 184 IIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFAD--------------SSET 228
           +IK  LKL  N +TLLL    E     D      +  +                  S +T
Sbjct: 432 VIKLPLKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKT 491

Query: 229 KKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLK 283
           ++P L +D+DL +S +ANA+++++ K++AA K++KT+ ST+ A+KS EKK     KQ LK
Sbjct: 492 RQPLLSIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALK 551

Query: 284 DVQTMTNINK----ARKVYWFEKFYW--------FISSE--------------------- 310
             + +    +     R V   E  Y         F+ ++                     
Sbjct: 552 QEKPVLRPTRTPFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPIIVKNKPGTPDAPIP 611

Query: 311 ----NYLGNV------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
               +  GN+      AWD+K V  AWWV ADQVSKT P GE+L TG F+I G+KN   P
Sbjct: 612 PGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFVICGEKNHLSP 671

Query: 361 CQLAMGISFLFKLEESSI 378
            QL +G + +F++   SI
Sbjct: 672 AQLLLGFAVMFQISGESI 689


>gi|391869409|gb|EIT78607.1| putative RNA-binding protein [Aspergillus oryzae 3.042]
          Length = 1103

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 133/439 (30%), Positives = 211/439 (48%), Gaps = 90/439 (20%)

Query: 24  NLNVSSQ----GYLVQKKEKRP-------NSEDYIVANM---EFHPMILQPGSTPSVMRQ 69
           N  +S+Q    GY+V K + R        N +     N+   +FHP        P     
Sbjct: 253 NTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNLLYEDFHPF------KPRQFEG 306

Query: 70  HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
             G  I EF S  A VDE+FS+ E+ K++ +  ++E  A +KLE V+++HE ++ AL++ 
Sbjct: 307 KPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALKEQ 366

Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QL 188
           Q +   KA  I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA IIK  L
Sbjct: 367 QELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKLPL 426

Query: 189 KLNINHITLLLRVHKERSNIKDSWVL--HHDNADPFADSSETKKPS---LVDVDLDLSAY 243
           KL+ N ITLLL    +  +  D        + ++   D+ E+++P     +D+DL +S +
Sbjct: 427 KLHENTITLLLGEAGDEQDEGDELFSSDESEKSEDEQDNGESQQPPSVLTIDIDLGISPW 486

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFE 301
           ANAK++++ K+ AA K+Q+T QS+ KALKS EKK  + LK    +    + + R+ +WFE
Sbjct: 487 ANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFWFE 546

Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
           KF +FISSE YL                    G++                         
Sbjct: 547 KFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTAPI 606

Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                  A +  V T++ W          V+A Q++KTA  G  L  G F+++G+KNF  
Sbjct: 607 PPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNFLA 666

Query: 360 PCQLAMGISFLFKLEESSI 378
           P QL +G    F++ + S+
Sbjct: 667 PSQLVLGFGVTFQISKDSL 685


>gi|342879256|gb|EGU80511.1| hypothetical protein FOXB_08971 [Fusarium oxysporum Fo5176]
          Length = 1060

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 138/460 (30%), Positives = 209/460 (45%), Gaps = 91/460 (19%)

Query: 2   VSNEDVSSDIHKCGPTLHVLVSNLNVS--SQGYLVQKKEKRPNSEDYI---------VAN 50
           ++NE + +D+ K        V  L  S    GY+  K+ +R    +           +  
Sbjct: 218 LANETLLADLVKSLQEARQSVEELTSSEACTGYIFAKRRERTEGAEATDESKTQRDNLLY 277

Query: 51  MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALK 110
            +FHP +      P  +++     + EF  +   VDEFFS+ E  +++ +  ++E  A +
Sbjct: 278 EDFHPFV------PYKLKKDPTIEVLEFTGYNETVDEFFSSLEGQRLESRLSEREAAAKR 331

Query: 111 KLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170
           KLE  + +   R+  L++ Q ++  KA  I  N E V  A+ A+   ++  + W D+   
Sbjct: 332 KLEAARNEQSKRIEGLQEAQALNFRKAAAIEANAERVQEAMDAVNGLLSQGMDWVDVGKL 391

Query: 171 VKQAQRHNDPVASIIKQLKLNI--NHIT-------LLLRVHKERSNIKDSWVLHHDNADP 221
           V++ ++ ++PVA IIK L LN+  N IT                    D   L  D    
Sbjct: 392 VEREKKRHNPVAEIIK-LPLNLAENLITLELAEEEFEPEEDDPYETDDDDSALGDDEGTS 450

Query: 222 FADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
            A   +  K   V+++L LS ++NA+ +FD +++AA K++KT Q   +ALK+AE+K  + 
Sbjct: 451 AAKGKQANKALSVEINLGLSPWSNAREYFDQRKTAAVKEEKTQQQASRALKNAEQKITED 510

Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAWD 319
           LK    Q    +   RK  WFEKF WFISS+ YL                    G+V   
Sbjct: 511 LKKGLKQEKALLQPIRKPMWFEKFVWFISSDGYLVIGGKDAQQNEMIYKKYLRKGDVYCH 570

Query: 320 AKV---------------------------------VTNAW---------WVKADQVSKT 337
           A +                                  +NAW         WV ADQVSK+
Sbjct: 571 ADLHGASSVIIKNNPKTPDAPIPPATLSQAGSLAVCSSNAWDSKAGMSAWWVNADQVSKS 630

Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           APTGEFL  GSFMIRGKKNF PP QL +G+   FK+ E S
Sbjct: 631 APTGEFLPAGSFMIRGKKNFLPPAQLLLGLGVAFKISEES 670


>gi|295673284|ref|XP_002797188.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226282560|gb|EEH38126.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 1258

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 136/435 (31%), Positives = 210/435 (48%), Gaps = 91/435 (20%)

Query: 29  SQGYLVQKKEKR-------PNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEF 78
           + GY++ K E +        +++     NM   +FHP        P       G  I +F
Sbjct: 461 TPGYIISKAESKTGEAITEADTDSPKPKNMLYHDFHPF------EPKQFENVPGMTILKF 514

Query: 79  ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
           ++F  AVDE+FS+ ES K++ +  ++E  A +KLE  ++D E R+ AL++ Q +   KA+
Sbjct: 515 KTFNKAVDEYFSSVESQKLEYRLTEREEIARRKLEAAQKDQENRIGALKEVQELHVRKAQ 574

Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITL 197
            I  N   V+ AI A+   IA  + W +I   ++  +   +PVA++IK  LKL  N +TL
Sbjct: 575 AIEANLLRVEEAIKAVNGLIAQGMDWVEIARLIEMEKSRQNPVANVIKLPLKLYENTVTL 634

Query: 198 LL---------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAK 247
           LL             E     +S      N      S + ++  L +D+DL +S +ANA+
Sbjct: 635 LLGEPTEDEEPADESEEEEDSESDDEDGGNKVKLEGSKKAQQQLLSIDIDLGISPWANAR 694

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
           ++++ KR AA K++KT++ST+KA+KS EKK    LK    Q    +   R  +WFEKF +
Sbjct: 695 QYYEQKRVAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPILRPTRTPFWFEKFMF 754

Query: 306 FISSENYL--------------------GNVAWDAKV----------------------- 322
           F+SS+ YL                    G+V   A V                       
Sbjct: 755 FVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPGTPDAPIPPGT 814

Query: 323 ----------VTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
                      ++AW  KA         DQVSKTAP+GEF+ TG F+IRG+K+  PP QL
Sbjct: 815 LSQAGNLCVASSSAWDSKAVMGAWWVNADQVSKTAPSGEFVGTGGFVIRGEKHQLPPAQL 874

Query: 364 AMGISFLFKLEESSI 378
            +G + +F++ E SI
Sbjct: 875 LLGFAVMFQISEDSI 889


>gi|355718192|gb|AES06188.1| serologically defined colon cancer antigen 1 [Mustela putorius
           furo]
          Length = 547

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 102/210 (48%), Positives = 120/210 (57%), Gaps = 59/210 (28%)

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
           +P LVDVDL LSAYANAK+++D KR AAKK QKT+++ EKA KSAEKKTKQTLK+VQT+T
Sbjct: 39  RPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVT 98

Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------VVTNAWWVKADQVSKTA-- 338
           +I KARKVYWFEKF WFISSENYL     D +         + T   +V AD    T+  
Sbjct: 99  SIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCV 158

Query: 339 ---PTG---------------------------------------------EFLTTGSFM 350
              PTG                                             E+LTTGSFM
Sbjct: 159 IKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFM 218

Query: 351 IRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           IRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 219 IRGKKNFLPPSYLMMGFSFLFKVDESCVWR 248


>gi|121698891|ref|XP_001267840.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119395982|gb|EAW06414.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 1111

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 142/431 (32%), Positives = 208/431 (48%), Gaps = 94/431 (21%)

Query: 31  GYLVQKKEKRPNSEDYIVANME----------FHPMILQPGSTPSVMRQHQGFP---IQE 77
           GY+V K++ RP S      N E          FHP   +P       RQ +G P   I E
Sbjct: 265 GYIVAKEDTRP-SAGPTAENQEGKKSGLLYQDFHP--FKP-------RQFEGKPEIKILE 314

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F SF A VDE+FS+ E+ K++ +  ++E  A +KL+ V+++HE RL AL+Q Q +   KA
Sbjct: 315 FGSFNATVDEYFSSLETQKLESRLTEREEAAKRKLDAVRQEHEKRLGALKQAQELHIRKA 374

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
             I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA IIK  LKL  N IT
Sbjct: 375 GAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQDRGNPVARIIKLPLKLYENTIT 434

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSE---TKKPSL--VDVDLDLSAYANAKRFFD 251
           L+L    E  +  D      ++ +      +    K P +  +D+DL LS +ANA ++++
Sbjct: 435 LVLGEASEEQDEADELFSDDESEEDSESEEQEAIKKAPEMLTIDIDLGLSPWANATQYYE 494

Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISS 309
            K+ AA K+QKT QS+ KALKS EKK  + LK    Q    +  ARK +WFEKF +FISS
Sbjct: 495 QKKMAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQVLRPARKPFWFEKFIFFISS 554

Query: 310 ENYL--------------------GNV--------------------------------A 317
           E YL                    G++                                A
Sbjct: 555 EGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLSQA 614

Query: 318 WDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
            +  V T++ W          V A+QV+KT  TG  L TG F  +G+K+F  P QL +G 
Sbjct: 615 GNLSVATSSAWDSKAVMAAWWVNANQVTKTT-TGGLLPTGEFETKGEKSFLAPSQLVLGF 673

Query: 368 SFLFKLEESSI 378
           + +F++ + S+
Sbjct: 674 AVMFQISKESL 684


>gi|169783790|ref|XP_001826357.1| hypothetical protein AOR_1_1306054 [Aspergillus oryzae RIB40]
 gi|83775101|dbj|BAE65224.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1103

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 133/439 (30%), Positives = 212/439 (48%), Gaps = 90/439 (20%)

Query: 24  NLNVSSQ----GYLVQKKEKRP-------NSEDYIVANM---EFHPMILQPGSTPSVMRQ 69
           N  +S+Q    GY+V K++ R        N +     N+   +FHP        P     
Sbjct: 253 NTRLSTQESHPGYIVAKEDNRSVSQSANENEKPSETGNLLYEDFHPF------KPRQFEG 306

Query: 70  HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
             G  I EF S  A VDE+FS+ E+ K++ +  ++E  A +KLE V+++HE ++ AL++ 
Sbjct: 307 KPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALKEQ 366

Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QL 188
           Q +   KA  I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA IIK  L
Sbjct: 367 QELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKLPL 426

Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA--DSSETKKPS---LVDVDLDLSAY 243
           KL+ N ITLLL    +  +  D      ++ +     D+ E+++P     +D+DL +S +
Sbjct: 427 KLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLGISPW 486

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFE 301
           ANAK++++ K+ AA K+Q+T QS+ KALKS EKK  + LK    +    + + R+ +WFE
Sbjct: 487 ANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFWFE 546

Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
           KF +FISSE YL                    G++                         
Sbjct: 547 KFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTAPI 606

Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                  A +  V T++ W          V+A Q++KTA  G  L  G F+++G+KNF  
Sbjct: 607 PPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNFLA 666

Query: 360 PCQLAMGISFLFKLEESSI 378
           P QL +G    F++ + S+
Sbjct: 667 PSQLVLGFGVTFQISKDSL 685


>gi|302854251|ref|XP_002958635.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
           nagariensis]
 gi|300256024|gb|EFJ40301.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
           nagariensis]
          Length = 744

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 133/398 (33%), Positives = 188/398 (47%), Gaps = 80/398 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           EF P+ L P         + G P  E  +F  A+DEF+S  E  +  +     ER AL K
Sbjct: 276 EFSPLPLLP---------YSGQPCLELSTFDDALDEFYSKIEGQRAGIARADAERAALSK 326

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+ +K D  TR  AL +     + KA+LI  N E VDA +LA+ Q +A  + W  +   V
Sbjct: 327 LDKIKLDQGTRAEALLRQAEECELKAQLITYNLEMVDAVLLAVNQMLATGMDWSALADLV 386

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLL--------RVHKERSNIKDSWVLHHDNADPFA 223
           +  +R  +PVA++I  L+L  N +++LL           +E +  + +  +  +   P  
Sbjct: 387 RNERRAGNPVAALIASLELENNRVSVLLANTLDDTGEEGEEEAMTRKAVKVASEECFPQH 446

Query: 224 DSSETKKPS----LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL----KSAE 275
               T++ +    LV VDL LSA ANA  +F+ +R    K  KT+ + E AL    K  E
Sbjct: 447 TQRHTQRHTHTHILVFVDLSLSAAANASTYFEARRRHLAKHAKTLAANEAALAAAEKKVE 506

Query: 276 KKTKQTLKDVQTMTNINKA---RKVYWF--EKFYWFISSEN----------YL--GNV-- 316
            + KQ       +  + K     + +WF   + Y  +S  +          Y   G+V  
Sbjct: 507 AQLKQVRAAPPALQPVRKPMWFERFHWFISSENYLVVSGRDAQQNELLVKRYFRKGDVYV 566

Query: 317 ------------------------------------AWDAKVVTNAWWVKADQVSKTAPT 340
                                               AWD+K+VT+AWWV   QVSKTAPT
Sbjct: 567 HAELHGTTICVRWRSGGPIPPLTLQQAGCACVCRSRAWDSKLVTSAWWVHHQQVSKTAPT 626

Query: 341 GEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           GE+LTTGSFMIRGKKNF PP  L MG  FLFKL++SSI
Sbjct: 627 GEYLTTGSFMIRGKKNFLPPQPLVMGFGFLFKLDDSSI 664


>gi|428183447|gb|EKX52305.1| hypothetical protein GUITHDRAFT_65529, partial [Guillardia theta
           CCMP2712]
          Length = 703

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 147/258 (56%), Gaps = 16/258 (6%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P V+RQ+ G  +  F+SF  A+D +FS AE  K++ + VQQ++ A+ K+E VKR HE  +
Sbjct: 274 PVVLRQYAGKKLDMFDSFDKAMDAYFSIAEDKKLEQQKVQQKKAAVSKVERVKRAHEASI 333

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
            AL++ +  +  +A LI  N   VD AIL I   ++  + W  ++  VK+  R  +P+A 
Sbjct: 334 QALQEEEAENYHRATLIEANLSDVDNAILVINSMLSQGMDWASLKKLVKEEGRKGNPIAQ 393

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           +I  LKL+ N ITLLL                    D   +  +T     VDVDL ++AY
Sbjct: 394 MIHGLKLDSNQITLLLTF----------------GLDAMEEEEQTLPVVAVDVDLGMNAY 437

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
            NA+ ++  K+  A K +KT+Q+  KA+K AE+K K+ LK   T  +I + RK +WFEKF
Sbjct: 438 QNAQSYYSSKKKVALKAEKTMQAAGKAIKGAERKAKEDLKKADTKASIQQIRKTHWFEKF 497

Query: 304 YWFISSENYLGNVAWDAK 321
            WFISSEN+L     DA+
Sbjct: 498 IWFISSENFLVLCGRDAQ 515



 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 40/63 (63%), Positives = 53/63 (84%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWDAK+VT+A+WV  +QVSK+APTGE+L+TGSFMIRGKKN+ PP  L MG   LF+++ES
Sbjct: 570 AWDAKMVTSAYWVHPEQVSKSAPTGEYLSTGSFMIRGKKNYLPPNSLIMGFGLLFRIDES 629

Query: 377 SIS 379
            ++
Sbjct: 630 CLA 632


>gi|238493615|ref|XP_002378044.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
 gi|220696538|gb|EED52880.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
          Length = 1105

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 133/439 (30%), Positives = 211/439 (48%), Gaps = 90/439 (20%)

Query: 24  NLNVSSQ----GYLVQKKEKRP-------NSEDYIVANM---EFHPMILQPGSTPSVMRQ 69
           N  +S+Q    GY+V K + R        N +     N+   +FHP        P     
Sbjct: 255 NTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNLLYEDFHPF------KPRQFEG 308

Query: 70  HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
             G  I EF S  A VDE+FS+ E+ K++ +  ++E  A +KLE V+++HE ++ AL++ 
Sbjct: 309 KPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALKEQ 368

Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QL 188
           Q +   KA  I +N   V  A+ A+   IA  + W +I   ++  Q   +PVA IIK  L
Sbjct: 369 QELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKLPL 428

Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA--DSSETKKPS---LVDVDLDLSAY 243
           KL+ N ITLLL    +  +  D      ++ +     D+ E+++P     +D+DL +S +
Sbjct: 429 KLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLGISPW 488

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFE 301
           ANAK++++ K+ AA K+Q+T QS+ KALKS EKK  + LK    +    + + R+ +WFE
Sbjct: 489 ANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFWFE 548

Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
           KF +FISSE YL                    G++                         
Sbjct: 549 KFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTAPI 608

Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                  A +  V T++ W          V+A Q++KTA  G  L  G F+++G+KNF  
Sbjct: 609 PPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNFLA 668

Query: 360 PCQLAMGISFLFKLEESSI 378
           P QL +G    F++ + S+
Sbjct: 669 PSQLVLGFGVTFQISKDSL 687


>gi|225681027|gb|EEH19311.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 1161

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 128/403 (31%), Positives = 194/403 (48%), Gaps = 82/403 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        P       G  I  F +F  AVDE+FS+ ES K+  +  ++E  A +K
Sbjct: 301 DFHPF------KPKQFENVPGMTILTFNTFNKAVDEYFSSVESQKLKYRLTEREEVARRK 354

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  ++D E R+ AL++ Q +   KA+ I  N   V+ AI A+   IA  + W +I   +
Sbjct: 355 LEAAQKDQENRVGALKEVQELHVRKAQAIEANLLRVEEAINAVNGLIAQGMDWVEIARLI 414

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL----------RVHKERSNIKDSWVLHHDNAD 220
           +  +   +PVA +IK  LKL  N +TLLL              E     +S      N  
Sbjct: 415 EMEKSSQNPVAKVIKLPLKLYENTVTLLLGEPTEDEEPADESDEEEEDSESGDEDGGNKV 474

Query: 221 PFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
               S + ++  L +D+DL +S +ANA+++++ +++AA K++KT++ST+KA+KS EKK  
Sbjct: 475 KLEGSKKAQQQLLSIDIDLGISPWANARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVT 534

Query: 280 QTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVA 317
             LK    Q    +   R  +WFEKF +F+SS+ YL                    G+V 
Sbjct: 535 TDLKHALKQEKPILRPTRTPFWFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVY 594

Query: 318 WDAKV---------------------------------VTNAW---------WVKADQVS 335
             A V                                  ++AW         WV A QVS
Sbjct: 595 VHADVQGATPIFVKNKPGTPDAPIPPGTLSQAGNLCVATSSAWDSKAVMGAWWVNAGQVS 654

Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           KTAP+GEF+ TG F+IRG+K+  PP QL +G + +F++ E SI
Sbjct: 655 KTAPSGEFVGTGGFVIRGEKHQLPPAQLLLGFAVMFQISEDSI 697


>gi|115443352|ref|XP_001218483.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114188352|gb|EAU30052.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 858

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 136/424 (32%), Positives = 210/424 (49%), Gaps = 76/424 (17%)

Query: 31  GYLVQKKEKRPNSED---YIVANMEFH-PMILQPGSTPSVMRQHQGFP---IQEFESFAA 83
           GY++ K++ R  + +    + AN E   P +L     P   RQ +G P   I EF S  A
Sbjct: 11  GYIIAKEDTRSKAREEGQAVEANGEGKKPGLLYEDFHPFKPRQFEGKPGVTILEFPSMNA 70

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
            VDE+FS+ ES +++ +  ++E  A KKL+ V+++HE ++ AL+  Q +   KA  I +N
Sbjct: 71  TVDEYFSSIESQRLESRLTEREEAAKKKLDAVRQEHEKKIGALKHAQELHIRKAGAIEDN 130

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
              V  A+ A+   IA  + W +I   ++  Q   +PVA IIK  LKL  N ITLLL   
Sbjct: 131 VYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVAKIIKLPLKLYENTITLLLGEA 190

Query: 203 KERSNIKDSWVLHHDNADPFADSSET--KKPSL--VDVDLDLSAYANAKRFFDLKRSAAK 258
            E     +      + ++   ++++T   KP +  +D+DL LS +ANA ++++ K+ AA 
Sbjct: 191 GEEQEEAEELFSESEESEDEQETTQTAQNKPEVLTIDIDLGLSPWANATQYYEQKKMAAV 250

Query: 259 KQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--- 313
           K+Q+T QS+ KALKS EKK  + LK    Q    +  ARK +WFEKF +F+SSE YL   
Sbjct: 251 KEQRTAQSSTKALKSHEKKVTEDLKKSLKQEKQVLRPARKPFWFEKFLFFVSSEGYLVLG 310

Query: 314 -----------------GNV--------------------------------AWDAKVVT 324
                            G++                                A +  V T
Sbjct: 311 GRDAMQSELLYRRHLKKGDIFVHADLEGATPIVVKNRPGTPNAPIPPSTLSQAGNLCVAT 370

Query: 325 NAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           ++ W          V A QVSKTA  G  L  G F+++G+KNF  P QL +G + +F++ 
Sbjct: 371 SSAWDSKAVMPAWWVHAHQVSKTAEAGGLLPAGDFLVKGEKNFLAPSQLVLGFAVMFQIS 430

Query: 375 ESSI 378
           + S+
Sbjct: 431 KESL 434


>gi|358398026|gb|EHK47384.1| hypothetical protein TRIATDRAFT_238226 [Trichoderma atroviride IMI
           206040]
          Length = 1068

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 137/445 (30%), Positives = 209/445 (46%), Gaps = 95/445 (21%)

Query: 21  LVSNLNVSSQ--GYLVQKKEKRPNSEDYIVANM----------EFHPMILQPGSTPSVMR 68
           +V N+  S+   GY+  KK K  +S   +  N           +FHP I      P   +
Sbjct: 257 IVENITASATCTGYIFAKK-KATSSSGLVEGNASQKREGLLYDDFHPFI------PHKFK 309

Query: 69  QHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQ 128
           +   F I EFE +   VDEFFS+ E  K++ +   +E  A KKLE+ + +   R+  L+ 
Sbjct: 310 KDSSFEILEFEGYNRTVDEFFSSLEGQKLESRLTGREEAAKKKLEDARHEQGKRIQGLQD 369

Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-Q 187
            Q ++  KA  I  N E V  A+ A+   IA  + W DI   +++ ++  +PVA  I   
Sbjct: 370 AQAMNLRKAAAIEANVERVQEAMDAVNGLIAQGMDWIDIGKLIEREKKRQNPVAETINLP 429

Query: 188 LKLNINHITLLLRVHKERSNIKDSWVLH------HDNADPFADSSETK--KPS---LVDV 236
           LKL+ N ITLLL   +   +  ++   +       D+ +  ++++ TK  KP+    VD+
Sbjct: 430 LKLSENTITLLLAEEEFDEDEDEAQEANPYETDESDSEEGLSEANATKDTKPAKLLTVDI 489

Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKA 294
            L++S ++NA+ +++ +RSAA K++KT Q   KALKS E K  + LK    Q    +   
Sbjct: 490 VLNVSPWSNAREYYEQRRSAAIKEEKTQQQATKALKSTEHKIAEDLKKGLKQEKALLQPI 549

Query: 295 RKVYWFEKFYWFISSENYL--------------------GNVAWDAKV------------ 322
           RK  WFEKF WFISS+ YL                    G++   A +            
Sbjct: 550 RKQLWFEKFLWFISSDGYLVLGGKDPQQSEILYRRYLRKGDIYCHADIRGAANIVIKNNP 609

Query: 323 ---------------------VTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIR 352
                                 + AW  KA         DQVSK+A TGE +  G+F+I 
Sbjct: 610 NTPDAPIPPATLSQAGSLSVCSSEAWDSKAGMGAWWVNTDQVSKSASTGEIMPAGNFIIE 669

Query: 353 GKKNFFPPCQLAMGISFLFKLEESS 377
           GKKN+ PP QL +G+ F F++ E S
Sbjct: 670 GKKNYLPPTQLLLGLGFAFRISEQS 694


>gi|296083204|emb|CBI22840.3| unnamed protein product [Vitis vinifera]
          Length = 993

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/273 (38%), Positives = 159/273 (58%), Gaps = 19/273 (6%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P ++ Q +     +FE+F AA+DEF+S  ES + + +   +E  A++KL  ++ D E R
Sbjct: 316 CPILLNQFKSREFVKFETFDAALDEFYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENR 375

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L++      + AELI  N E VDAAILA+R  +AN ++WED+   VK+ ++  +PVA
Sbjct: 376 VHTLKKEVDHCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEEKKSGNPVA 435

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I +L L  N +TLLL                 +N D   D  +T     V+VDL LSA
Sbjct: 436 GLIDKLYLERNCMTLLL----------------SNNLDEMDDDEKTLPVDKVEVDLALSA 479

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           +ANA+R+++ K+    KQ+KT+ + EKA K+AEKKT+  L   +T+  I+  RKV+WFEK
Sbjct: 480 HANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQLSQEKTVATISHMRKVHWFEK 539

Query: 303 FYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
           F WFISSENYL   G  A   +++   +  K D
Sbjct: 540 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 572



 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 44/63 (69%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 614 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 673

Query: 377 SIS 379
           S+ 
Sbjct: 674 SLG 676


>gi|189211034|ref|XP_001941848.1| serologically defined colon cancer antigen 1 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187977941|gb|EDU44567.1| serologically defined colon cancer antigen 1 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 1151

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 134/408 (32%), Positives = 194/408 (47%), Gaps = 89/408 (21%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP   Q         ++  +   EF+ F  AVDEFFS+ E  K++ K  ++E+ A +K
Sbjct: 311 DFHPFRPQ-------QFENSDYTFIEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKRK 363

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  +++HE R+  L+Q Q ++  KAE I+ N   V  A  A+   I   + W DI   +
Sbjct: 364 LEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIARLI 423

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL-------RVHKERSNIKDSWVLHHDNADPFA 223
           ++ Q   + VA +IK  LKLN N ITLLL           E    + S V    + D   
Sbjct: 424 EREQNSGNAVAQLIKLPLKLNENTITLLLDETNWEEGEEVEDEGNETSSVSEDSDEDAGE 483

Query: 224 DSSETKK---------PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKS 273
           +    KK         P L +D+DL L+A+AN+  +FD K++AA K+ +T+Q++ +ALKS
Sbjct: 484 EDGAKKKSAPVKVSARPQLAIDIDLSLTAWANSTEYFDQKKTAANKEDRTLQASTRALKS 543

Query: 274 AEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL------------------ 313
            EKK  + LK    Q    +   RK  WFEKF +FISS+ YL                  
Sbjct: 544 HEKKVAEDLKKGLKQEKEVLRPVRKQQWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRFL 603

Query: 314 --GNV-------------------AWDAKV--------------VTNAWWVKA------- 331
             G+V                     DA +               ++AW  KA       
Sbjct: 604 RKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLCICTSDAWDSKAVMSAWWV 663

Query: 332 --DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
             DQVSKT  TGEFL  G F ++GKK F P  QL +G++ +F++ ESS
Sbjct: 664 RSDQVSKTGQTGEFLPAGMFNVKGKKEFLPLAQLVVGLAVMFEISESS 711


>gi|148704666|gb|EDL36613.1| mCG3169, isoform CRA_b [Mus musculus]
          Length = 658

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 101/210 (48%), Positives = 121/210 (57%), Gaps = 59/210 (28%)

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
           KP LVDVDL LSAYANAK+++D KR AAKK Q+T+++ EKA KSAEKKTKQTLK+VQT+T
Sbjct: 49  KPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVT 108

Query: 290 NINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-- 338
           +I KARKVYWFEKF WFISSENYL   G      +++   +      +V AD    T+  
Sbjct: 109 SIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCV 168

Query: 339 ---PTG---------------------------------------------EFLTTGSFM 350
              PTG                                             E+LTTGSFM
Sbjct: 169 IKNPTGEPIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFM 228

Query: 351 IRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           IRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 229 IRGKKNFLPPSYLMMGFSFLFKVDESCVWR 258


>gi|242764776|ref|XP_002340841.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
 gi|218724037|gb|EED23454.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
          Length = 1111

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 136/439 (30%), Positives = 204/439 (46%), Gaps = 99/439 (22%)

Query: 29  SQGYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEF 78
           ++GY+V K+  + ++ D+  +N           +FHP        P       G+ I E+
Sbjct: 262 TKGYIVAKERAKKDT-DFDQSNDSASKENLLFEDFHPF------RPRQFEGKPGYHILEY 314

Query: 79  ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
           ++F   VDE+FS+ ES K++ +  + E  A +KLE  + DH  R  AL+Q Q +   KAE
Sbjct: 315 DNFNKTVDEYFSSIESQKLESRLAEHEETAKRKLEAARADHLDRAGALKQAQELHIRKAE 374

Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITL 197
            I  N   V  A  A+   IA  + W +I   ++  Q  N+PVA  IK  LKL  N ITL
Sbjct: 375 AIQANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQERNNPVAKTIKLPLKLFENTITL 434

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETK--------------KPSLVDVDLDLSAY 243
           LL    E S   +        ++P    S ++              +P  +D+DL LS +
Sbjct: 435 LL---SEESAKGEGDKEEFSESEPEGSDSNSESEFEKDGGPKRKNAEPLAIDIDLSLSPW 491

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFE 301
           +NA ++++ K++AA K+QKTIQS+EKALKS EKK  + LK    Q    +  +RK +WFE
Sbjct: 492 SNATQYYEQKKTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPSRKPFWFE 551

Query: 302 KFYWFISSENYL--------------------GNV------------------------- 316
           K+ +FISSE YL                    G+V                         
Sbjct: 552 KYLYFISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTSNAPI 611

Query: 317 -------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                  A    V T+  W          V A QVS+T   GE L +G FM++G+KN+  
Sbjct: 612 PPGTLTQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLASGGFMVKGEKNYLA 671

Query: 360 PCQLAMGISFLFKLEESSI 378
           P Q  +G + LF++ + S+
Sbjct: 672 PGQPVLGFAVLFQISKESV 690


>gi|402074990|gb|EJT70461.1| serologically defined colon cancer antigen 1 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 1086

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 129/392 (32%), Positives = 195/392 (49%), Gaps = 81/392 (20%)

Query: 63  TPSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
           +P + +Q  G P   +  FE F   VDEFFS+ E  K++ +  ++E  A +KL+  KR+H
Sbjct: 303 SPFLPQQFAGDPSNVVLTFEGFNKTVDEFFSSLEGQKLESRLTEREAGAKRKLDAAKREH 362

Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
           E R+  L++ QL++  KA  I  N E V  A+ A+   +   + W D+   V++ Q+ ++
Sbjct: 363 EKRIEGLQEYQLLNLRKAAAIEANVERVQEAMDAVIGLLEQGMDWVDVGKLVEREQKRHN 422

Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS-----ETKKPSLV 234
           PVA II +L +++ + T+ L + ++     DS   +   +    D       +T K   +
Sbjct: 423 PVAEII-ELPMDLANNTITLVIAEQDDVDDDSEDGYETESSASDDDDDAAAVQTGKAKTL 481

Query: 235 DVD--LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
           +VD  L L+ + NA  ++D KRSAA KQ+KT+Q +  ALKSA++K     KD+Q      
Sbjct: 482 EVDIKLSLTPWGNAGEYYDQKRSAAVKQEKTVQQSSIALKSAQEKIA---KDLQKGLKKE 538

Query: 293 K-----ARKVYWFEKFYWFISSENYL--------------------GNV----------- 316
           K     AR+  WFEKF+WFISS+ YL                    G+V           
Sbjct: 539 KPVMQLARRQMWFEKFHWFISSDGYLVLGGRDAQQNEILYRRYLKRGDVYVHADLHGAPS 598

Query: 317 --------AWDAKV--------------VTNAWWVK---------ADQVSKTAPTGEFLT 345
                     DA V               ++AW  K         ADQVSK+APTGEFL 
Sbjct: 599 VIIKNNPRTPDAPVPPSTLSQAGQLAVCASSAWESKAGMGAYWVGADQVSKSAPTGEFLP 658

Query: 346 TGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           TGSFM+RGK+N  PP  L +G   +F++ + S
Sbjct: 659 TGSFMVRGKRNELPPAPLIVGFGVMFRISDES 690


>gi|85091915|ref|XP_959135.1| hypothetical protein NCU09191 [Neurospora crassa OR74A]
 gi|28920536|gb|EAA29899.1| conserved hypothetical protein [Neurospora crassa OR74A]
 gi|29150083|emb|CAD79644.1| conserved hypothetical protein [Neurospora crassa]
          Length = 1097

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 124/386 (32%), Positives = 182/386 (47%), Gaps = 83/386 (21%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +F P +      P      + + I  F  +   VDEFFS+ E  +++ K  ++E  A +K
Sbjct: 297 DFQPFL------PKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRK 350

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  + D   R+  L++ ++++  KA  I  N E V  A+ A+   +   + W DI   +
Sbjct: 351 LEAARMDQAKRIEGLQEMEMLNYRKAATIQANIERVQEAMDAVNGLLQEGMDWVDITKLI 410

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKD--------SWVLHHDNADPF 222
           ++ Q+  +PVA IIK  +KL  N ITLLL    E  +  D         +    D+AD  
Sbjct: 411 EKEQKQGNPVAEIIKLPMKLKENTITLLLGEGVEEEDEADQDKEDDEFDYSDSEDDAD-G 469

Query: 223 ADSSETKKPSL---VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
           AD+++ K P     VD++L LS + NA+ ++D KR+AA K QKT+Q +  ALK+AE+K  
Sbjct: 470 ADTTKDKAPVKRLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIA 529

Query: 280 QTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVA 317
           + L+    Q    +   RK  WFEKF WFISS+ YL                    G+V 
Sbjct: 530 EDLRKGLKQEKPVLQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVY 589

Query: 318 WDAKV---------------------------------VTNAW---------WVKADQVS 335
             A V                                  ++AW         WV ADQVS
Sbjct: 590 VHADVHGAASVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVS 649

Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPC 361
           K+AP GE+L  GSFM+RGK+N  PP 
Sbjct: 650 KSAPAGEYLPVGSFMVRGKRNLLPPA 675


>gi|396473834|ref|XP_003839430.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312215999|emb|CBX95951.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 1115

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 126/382 (32%), Positives = 184/382 (48%), Gaps = 79/382 (20%)

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           EF+ F  AVDEFFS+ E  K++ K  ++E+ A KKLE  +++HE R+  L+Q Q ++  K
Sbjct: 311 EFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEERIGGLQQVQEMNFRK 370

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHI 195
           AE I+ N   V  A  A+   I   + W DI   +++ Q   + VA  I+  LKL+ N I
Sbjct: 371 AEAILANVHRVTEATEAVNGLIRQGMDWVDISRLIEREQAQGNAVAQSIRLPLKLHQNTI 430

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPF-------------ADSSETKKPSL-VDVDLDLS 241
           TLLL         ++    +  ++                A +  T++P L +D+DL LS
Sbjct: 431 TLLLNETDWDHEEEEEDEGNETSSVSEDSEEEEEGSKKKAAPTKVTQQPQLAIDIDLGLS 490

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYW 299
           A+AN+  ++D K++AA K+ +T  ++ KALKS EKK  + LK    Q    +   RK  W
Sbjct: 491 AWANSTEYYDQKKTAASKEDRTAAASSKALKSHEKKVTEDLKKGLKQEKEVLRPVRKQQW 550

Query: 300 FEKFYWFISSENYL--------------------GNV----------------------- 316
           FEK+ +FISS+ YL                    G+V                       
Sbjct: 551 FEKYIYFISSDGYLVLGGKDAQQNEIIYKRFLRKGDVYVHADLKGAVPMIIKNKPDTPDA 610

Query: 317 ---------AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
                    A    V T+  W          V++ QVSKT  TGEFL  G F I GKK F
Sbjct: 611 PIPPSTLSQAGHLSVCTSEAWESKAVMSAWWVRSTQVSKTGQTGEFLPAGMFNITGKKEF 670

Query: 358 FPPCQLAMGISFLFKLEESSIS 379
            PP QL +G++ +F++ ESSIS
Sbjct: 671 LPPAQLVVGLAVMFEISESSIS 692


>gi|449017191|dbj|BAM80593.1| unknown RNA-binding protein, conserved [Cyanidioschyzon merolae
           strain 10D]
          Length = 1371

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 126/418 (30%), Positives = 196/418 (46%), Gaps = 100/418 (23%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
           TP ++RQHQ    + F SF  AVDE+F+  E  +   +   ++R A   LE ++R+ ETR
Sbjct: 412 TPRLLRQHQHLEPRMFPSFDEAVDEYFARMEELRYRQEIENRQRQAQGTLERMRRELETR 471

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L+Q +     KA LI  N   VD A+  IR  +A+ + W++++  +   +R  +PVA
Sbjct: 472 VLTLKQQEERCLRKAALIETNLVDVDNALQVIRAALASGIDWKELDQMLVLERRRGNPVA 531

Query: 183 SIIKQLKLNINHITLLL---------------------RVHKER--SNIKDSWVLHHDNA 219
            +I  L+L  N +TL+L                     R  + R  SN   +  +   + 
Sbjct: 532 QLIHSLQLQENQMTLMLADDSGSVDNTDAETGSSSRQRRPAETRDLSNEDSASSVESASE 591

Query: 220 DPFADSSE--TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
           D   DS+   + +  LV VDL LSA+ANA+R+++ ++ AA+K  KT++++ +AL++AEKK
Sbjct: 592 DESGDSTSVCSSRVELVQVDLSLSAFANARRYYEQRKKAAEKGTKTMEASAQALRAAEKK 651

Query: 278 TKQTL---------KDVQTMTNINKARKVYWFEKFYWFISSENYL--------------- 313
             + L         K    +  +   RK  WFEKF +FI+SENYL               
Sbjct: 652 ALEVLAGTASKNKRKKATPLNTLKAIRKPLWFEKFRYFITSENYLVIAGKDSQQNEQLVR 711

Query: 314 -------GNVAWDAKV------------------------------VTNAW--------- 327
                  G++   A V                               ++AW         
Sbjct: 712 RYLEENTGDLYMHADVHGAASVIIKGKKNRPAPPLSIQEAAIFAAACSSAWDAKVAVNAY 771

Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-----PCQLAMGISFLFKLEESSISR 380
           WV  +QVS+TAP+G +L  GSF+IRG +N+ P        L MG  FLF+L   S+ R
Sbjct: 772 WVYPEQVSRTAPSGMYLQQGSFVIRGSRNYVPVTTSGSGPLVMGFGFLFRLAPESVWR 829


>gi|327348881|gb|EGE77738.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 1166

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 130/437 (29%), Positives = 201/437 (45%), Gaps = 95/437 (21%)

Query: 31  GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           GY+V K E +   +  + A           ++FHP        P          I +F++
Sbjct: 267 GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPF------EPKQFENQADMAILKFDT 320

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDE+FS+ E  K++ +  ++E  A +KLE  ++D E R+  L++ + +   KA+ I
Sbjct: 321 FNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQAI 380

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N   V+ A+ A+   IA  + W +I   ++  Q   +PVA +IK  LKL  N +TLLL
Sbjct: 381 EANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLLL 440

Query: 200 RVHKERSNIKDSWVLHHDNADPFADS------SETKKPSL--------VDVDLDLSAYAN 245
               E     D      ++ +   D         +KKP          +D+DL +S +AN
Sbjct: 441 GEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWAN 500

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
           A+++++ K++AA K++KT+ S +KA+KS EKK    LK    Q    +   R  +WFEKF
Sbjct: 501 ARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEKF 560

Query: 304 YWFISSENYLGNVAWDAK------------------------------------------ 321
            +FISS+ YL     DA+                                          
Sbjct: 561 IYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIPP 620

Query: 322 ----------VVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
                     V T++ W          V ADQVSKT P+GE+L TG F+IRG+KN  PP 
Sbjct: 621 GTLSQAGNLCVATSSAWHSKAVMGAWWVNADQVSKTTPSGEYLETGGFVIRGEKNQLPPA 680

Query: 362 QLAMGISFLFKLEESSI 378
           QL +G + +F++   SI
Sbjct: 681 QLLLGFAVMFQISSESI 697


>gi|350296215|gb|EGZ77192.1| hypothetical protein NEUTE2DRAFT_99766 [Neurospora tetrasperma FGSC
           2509]
          Length = 1095

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 122/385 (31%), Positives = 181/385 (47%), Gaps = 81/385 (21%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +F P +      P      + + I  F  +   VDEFFS+ E  +++ K  ++E  A +K
Sbjct: 297 DFQPFL------PKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRK 350

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  + D   R+  L++ ++++  KA  I  N E V  A+ A+   +   + W DI   +
Sbjct: 351 LEAARMDQAKRIEGLQEMEMLNYRKAATIQANTERVQEAMDAVNGLLQEGMDWVDITKLI 410

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNI----KDSWVLHHDNADPFADSS 226
           ++ Q+  +PVA IIK  +KL  N ITLLL    E  +     K+     + +++  AD +
Sbjct: 411 EKEQKQGNPVAEIIKLPMKLKENTITLLLGEGVEEEDEGDQDKEDDEFDYSDSEDDADGA 470

Query: 227 ETKKPSL------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
           ET K         VD++L LS + NA+ ++D KR+AA K QKT+Q +  ALK+AE+K  +
Sbjct: 471 ETTKHKAPVKRLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIAE 530

Query: 281 TLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAW 318
            L+    Q    +   RK  WFEKF WFISS+ YL                    G+V  
Sbjct: 531 DLRKGLKQEKPVLQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYV 590

Query: 319 DAKV---------------------------------VTNAW---------WVKADQVSK 336
            A V                                  ++AW         WV ADQVSK
Sbjct: 591 HADVHGAASVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSK 650

Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPC 361
           +AP GE+L  GSFM+RGK+N  PP 
Sbjct: 651 SAPAGEYLPVGSFMVRGKRNLLPPA 675


>gi|449485009|ref|XP_004157045.1| PREDICTED: nuclear export mediator factor NEMF homolog [Cucumis
           sativus]
          Length = 1090

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 32/310 (10%)

Query: 30  QGY-LVQKKEKRPNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           +GY L+QKK+ +    +   AN    EF P++L          Q       +FE+F AA+
Sbjct: 297 EGYILMQKKDVKKEESEAATANEIYDEFCPILLN---------QFMSRKYTKFETFDAAL 347

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+S  ES + + +   +E  A  KL  ++ D   R+  L+Q      + AELI  N E
Sbjct: 348 DEFYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLE 407

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VDA ILA+R  +A  +SWED+   VK+ ++  +PVA +I +L L  N +TLLL      
Sbjct: 408 DVDAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLS----- 462

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
                      +N D   D  +T+    V+VD+ LSA+ANA+R+++LK+    KQ+KTI 
Sbjct: 463 -----------NNLDEMDDDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTIT 511

Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKV 322
           + EKA K+AE+KT+  L   +T+  I+  RKV+WFEKF WFISSENYL   G  A   ++
Sbjct: 512 AHEKAFKAAERKTRLQLSQEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 571

Query: 323 VTNAWWVKAD 332
           +   +  K D
Sbjct: 572 IVKRYMSKGD 581



 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 44/63 (69%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 623 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 682

Query: 377 SIS 379
           S+ 
Sbjct: 683 SLG 685


>gi|449441522|ref|XP_004138531.1| PREDICTED: nuclear export mediator factor Nemf-like [Cucumis
           sativus]
          Length = 1119

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 32/310 (10%)

Query: 30  QGY-LVQKKEKRPNSEDYIVANM---EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           +GY L+QKK+ +    +   AN    EF P++L          Q       +FE+F AA+
Sbjct: 297 EGYILMQKKDVKKEESEAATANEIYDEFCPILLN---------QFMSRKYTKFETFDAAL 347

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           DEF+S  ES + + +   +E  A  KL  ++ D   R+  L+Q      + AELI  N E
Sbjct: 348 DEFYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLE 407

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VDA ILA+R  +A  +SWED+   VK+ ++  +PVA +I +L L  N +TLLL      
Sbjct: 408 DVDAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLS----- 462

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
                      +N D   D  +T+    V+VD+ LSA+ANA+R+++LK+    KQ+KTI 
Sbjct: 463 -----------NNLDEMDDDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTIT 511

Query: 266 STEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKV 322
           + EKA K+AE+KT+  L   +T+  I+  RKV+WFEKF WFISSENYL   G  A   ++
Sbjct: 512 AHEKAFKAAERKTRLQLSQEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 571

Query: 323 VTNAWWVKAD 332
           +   +  K D
Sbjct: 572 IVKRYMSKGD 581



 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 44/63 (69%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 623 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 682

Query: 377 SIS 379
           S+ 
Sbjct: 683 SLG 685


>gi|327287378|ref|XP_003228406.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Anolis carolinensis]
          Length = 635

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/180 (50%), Positives = 117/180 (65%), Gaps = 14/180 (7%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N   +GY++QK+EK+P+ E     + I+   EFHP +           Q+   P  EF+S
Sbjct: 240 NFDGKGYIIQKREKKPSLEPEKPAEEILTYEEFHPFLFS---------QYTKCPFVEFDS 290

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKLENV++DHE RL AL Q Q +DK K EL+
Sbjct: 291 FNKAVDEFYSKLEGQKIDLKALQQEKQALKKLENVRKDHEHRLEALHQAQEIDKVKGELV 350

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N E VD AI  +R  +ANQ+ W +I A VK+AQ   DP+AS IK+LKL  NHIT+LL+
Sbjct: 351 EMNLEMVDRAITVVRSALANQIDWTEIGALVKEAQAQGDPLASAIKELKLQTNHITMLLK 410


>gi|336276025|ref|XP_003352766.1| hypothetical protein SMAC_01600 [Sordaria macrospora k-hell]
 gi|380094654|emb|CCC08036.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1086

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 132/421 (31%), Positives = 193/421 (45%), Gaps = 97/421 (23%)

Query: 29  SQGYLVQKKEKRPNSEDYIVA----------NM---EFHPMILQPGSTPSVMRQHQGFPI 75
           + GY++ K   RP  ED  V           N+   +F P +      P     ++ + I
Sbjct: 263 ANGYIIAKP--RPGFEDAEVVVDAPPAEKAKNLLYEDFQPFL------PKQFENNKDYRI 314

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
             F  +   VDEFFS+ E  +++ K  ++E  A +KLE  + D   R+  L++ ++++  
Sbjct: 315 LPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRKLEAARMDQAKRIEGLQEMEMLNYR 374

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINH 194
           KA  I  N E V  A+ A+   +   + W DI   +++ Q+  +PVA IIK  +KL  N 
Sbjct: 375 KAATIQANIERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPVAEIIKLPMKLKENT 434

Query: 195 ITLLL-----RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV-----DVDLDLSAYA 244
           ITLLL        +   + +D    + D+ D   D  E K    V     D++L LS + 
Sbjct: 435 ITLLLGEGVEEEEEGDEDKEDDEFDYSDDEDE-GDVGEPKDKVPVNRLEIDINLTLSVWN 493

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEK 302
           NA+ ++D KR+AA K QKT+Q +  ALKSAE+K  + L+    Q    +   RK  WFEK
Sbjct: 494 NAREYYDQKRTAAHKAQKTVQQSVIALKSAEQKISEDLRKGLKQEKPVLQPIRKAMWFEK 553

Query: 303 FYWFISSENYL--------------------GNVAWDAKV-------------------- 322
           F WFISS+ YL                    G+V   A V                    
Sbjct: 554 FTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAASVIIKNNPKTPDAPIP 613

Query: 323 -------------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
                         ++AW         WV ADQVSK+APTGE+L  GSFM+RGK+N  PP
Sbjct: 614 PSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPTGEYLPVGSFMVRGKRNLLPP 673

Query: 361 C 361
            
Sbjct: 674 A 674


>gi|357161759|ref|XP_003579195.1| PREDICTED: nuclear export mediator factor Nemf-like [Brachypodium
           distachyon]
          Length = 1163

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 106/273 (38%), Positives = 158/273 (57%), Gaps = 19/273 (6%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P +++Q +     EFE+F AA+DEF+S  ES +++ +   +E  A+++L  +K D E R
Sbjct: 354 CPILLKQFKAREYDEFETFDAALDEFYSKIESQRVNQQQKAKEDSAVQRLNKIKLDQENR 413

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L +      + AELI  N E VDAAI+A+R  +AN +SWE +   +K+ +R  +PVA
Sbjct: 414 VHTLRKEADHCIKMAELIEYNLEDVDAAIVAVRVSLANGMSWEALARMIKEERRAGNPVA 473

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I +L    N ITLLL                 +N D   +  +T     V+VDL LSA
Sbjct: 474 GLIDKLSFENNCITLLL----------------SNNLDDMDEDEKTAPVEKVEVDLSLSA 517

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           +ANA+R++++K+    KQ+KTI + +KA K+AEKKT+  L   +T+  I   RKV+WFEK
Sbjct: 518 HANARRWYEMKKKQETKQEKTITAHDKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEK 577

Query: 303 FYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
           F WFISSENYL   G  A   ++V   +  K D
Sbjct: 578 FNWFISSENYLIVSGRDAQQNELVVKRYMSKGD 610



 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 43/63 (68%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 652 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDES 711

Query: 377 SIS 379
            ++
Sbjct: 712 CLA 714


>gi|117938818|gb|AAH06001.1| SDCCAG1 protein [Homo sapiens]
          Length = 398

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 91/180 (50%), Positives = 116/180 (64%), Gaps = 14/180 (7%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 199 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 249

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 250 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 309

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 310 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 369


>gi|34784822|gb|AAH56687.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 426

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 91/180 (50%), Positives = 116/180 (64%), Gaps = 14/180 (7%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P  E     + I+   EFHP +           QH   P  EFES
Sbjct: 218 NFSGKGYIIQKREIKPCLEADKPVEDILTYEEFHPFLFS---------QHSQCPYIEFES 268

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 269 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 328

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVAS IK+LKL  NH+T+LLR
Sbjct: 329 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 388


>gi|299471369|emb|CBN79324.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1380

 Score =  167 bits (424), Expect = 5e-39,   Method: Composition-based stats.
 Identities = 120/379 (31%), Positives = 183/379 (48%), Gaps = 86/379 (22%)

Query: 64  PSVMRQHQGFP-IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
           P ++ QH+G   I  F SF  AVD FF      K+   A+  E    +K+  ++ D E R
Sbjct: 454 PQLLAQHEGGAVIHSFASFDQAVDAFFGRIVEQKLKQTAMAAEAAVERKVAWIRNDQERR 513

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           + ALE+ Q      A+L     + V+ A++ +R  +AN + W+D+E  VK    + +P+A
Sbjct: 514 VLALEERQEKMLRHAQLAEAWADEVEKALMVVRSALANGMDWQDLEDLVKAETANGNPIA 573

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
           S+I +L+L+ N + L L                     P A+  E  +  LV+VD+ LSA
Sbjct: 574 SLIHELRLDRNQVVLSL---------------------PTAEDGEDDQ--LVEVDIMLSA 610

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           +ANA+  ++ K+ A  K+ KT+ ++EK LK AE++ ++TL+      ++  ARKVYWFEK
Sbjct: 611 HANARVMYENKKLARAKELKTLTASEKVLKIAEQQAERTLQRQAHKRSLQVARKVYWFEK 670

Query: 303 FYWFISSENYL---GNVAWDAKVVTNAW------WVKADQ-------VSKTAPTGE---- 342
           F WFISSENYL   G  A   +VV   +      +V AD        V    P+G+    
Sbjct: 671 FNWFISSENYLVISGRNAQQNEVVVKKYLRPGDIYVHADLHGASSCVVRNKDPSGKRAVS 730

Query: 343 ------------------------------------------FLTTGSFMIRGKKNFFPP 360
                                                     +L TGSFM+RG+K+F PP
Sbjct: 731 PLALEEAGCMTVCRSGAWGAKMVTSAWWVYADQVSKTAPTGEYLVTGSFMVRGRKHFLPP 790

Query: 361 CQLAMGISFLFKLEESSIS 379
             L MG + LFKL++S ++
Sbjct: 791 RALEMGFALLFKLDDSCLA 809


>gi|125579741|gb|EAZ20887.1| hypothetical protein OsJ_36526 [Oryza sativa Japonica Group]
          Length = 1176

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 104/272 (38%), Positives = 156/272 (57%), Gaps = 19/272 (6%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P ++ Q +     EFE+F AA+DEF+S  ES +++ +   +E  A ++L  +K D E R+
Sbjct: 354 PVLLNQFKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRV 413

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L +      + AELI  N E VDAAI+A+R  +AN +SW+ +   +K+ ++  +PVA 
Sbjct: 414 HTLRKEVDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAG 473

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           +I +L    N ITLLL                 +N D   +  +T     V+VDL LSA+
Sbjct: 474 LIDKLSFERNCITLLL----------------SNNLDDMDEEEKTAPVEKVEVDLSLSAH 517

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
           ANA+R+++LK+    KQ+KT+ + EKA K+AEKKT+  L   +T+  I   RKV+WFEKF
Sbjct: 518 ANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKF 577

Query: 304 YWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
            WFISSENYL   G  A   +++   +  K D
Sbjct: 578 NWFISSENYLIISGRDAQQNELIVKRYMSKGD 609



 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 44/63 (69%), Positives = 52/63 (82%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 668 AWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDES 727

Query: 377 SIS 379
           S++
Sbjct: 728 SLA 730


>gi|336464133|gb|EGO52373.1| hypothetical protein NEUTE1DRAFT_71883 [Neurospora tetrasperma FGSC
           2508]
          Length = 1095

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 122/385 (31%), Positives = 180/385 (46%), Gaps = 81/385 (21%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +F P +      P      + + I  F  +   VDEFFS+ E  ++  K  ++E  A +K
Sbjct: 297 DFQPFL------PKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLKSKLSEREAAAKRK 350

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  + D   R+  L++ ++++  KA  I  N E V  A+ A+   +   + W DI   +
Sbjct: 351 LEAARMDQAKRIEGLQEMEMLNYRKAATIQANIERVQEAMDAVNGLLQEGMDWVDITKLI 410

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLL----RVHKERSNIKDSWVLHHDNADPFADSS 226
           ++ Q+  +PVA IIK  +KL  N ITLLL       +E    K+     + +++  AD +
Sbjct: 411 EKEQKQGNPVAEIIKLPMKLKENTITLLLGEGVEEEEEGDQDKEDDEFDYSDSEDDADGA 470

Query: 227 ETKKPSL------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
           ET K         VD++L LS + NA+ ++D KR+AA K QKT+Q +  ALK+AE+K  +
Sbjct: 471 ETTKDKAPVKRLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIAE 530

Query: 281 TLKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAW 318
            L+    Q    +   RK  WFEKF WFISS+ YL                    G+V  
Sbjct: 531 DLRKGLKQEKPVLQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYV 590

Query: 319 DAKV---------------------------------VTNAW---------WVKADQVSK 336
            A V                                  ++AW         WV ADQVSK
Sbjct: 591 HADVHGAASVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSK 650

Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPC 361
           +AP GE+L  GSFM+RGK+N  PP 
Sbjct: 651 SAPAGEYLPVGSFMVRGKRNLLPPA 675


>gi|125537046|gb|EAY83534.1| hypothetical protein OsI_38746 [Oryza sativa Indica Group]
          Length = 1153

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 104/272 (38%), Positives = 156/272 (57%), Gaps = 19/272 (6%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P ++ Q +     EFE+F AA+DEF+S  ES +++ +   +E  A ++L  +K D E R+
Sbjct: 331 PVLLNQFKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRV 390

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L +      + AELI  N E VDAAI+A+R  +AN +SW+ +   +K+ ++  +PVA 
Sbjct: 391 HTLRKEVDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAG 450

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           +I +L    N ITLLL                 +N D   +  +T     V+VDL LSA+
Sbjct: 451 LIDKLSFERNCITLLL----------------SNNLDDMDEEEKTAPVEKVEVDLSLSAH 494

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF 303
           ANA+R+++LK+    KQ+KT+ + EKA K+AEKKT+  L   +T+  I   RKV+WFEKF
Sbjct: 495 ANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKF 554

Query: 304 YWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
            WFISSENYL   G  A   +++   +  K D
Sbjct: 555 NWFISSENYLIISGRDAQQNELIVKRYMSKGD 586



 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 44/63 (69%), Positives = 52/63 (82%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 645 AWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDES 704

Query: 377 SIS 379
           S++
Sbjct: 705 SLA 707


>gi|328909421|gb|AEB61378.1| serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 302

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/180 (50%), Positives = 116/180 (64%), Gaps = 14/180 (7%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E       I+   EFHP +           QH   P  EFES
Sbjct: 103 NFSGKGYIIQKREMKPSLEVDKPTQDILTYEEFHPFLFS---------QHSQCPYIEFES 153

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 154 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELI 213

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 214 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 273


>gi|346325475|gb|EGX95072.1| serologically defined colon cancer antigen 1 [Cordyceps militaris
           CM01]
          Length = 1048

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 123/400 (30%), Positives = 192/400 (48%), Gaps = 81/400 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP +      P   + +    I  FE F   VD+FFS+ E  K+  + V++E  A +K
Sbjct: 295 DFHPFV------PQKFQNNSKIEILRFEGFNRTVDDFFSSLEGQKLQSRVVEREAAAQRK 348

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+  K+D E RL  L+ +Q  +  KA  I  N E V  A+ +I   +A  + W DI   V
Sbjct: 349 LDAAKQDQENRLKGLQTSQSDNFRKAAAIEANIERVQEAMDSINGLLAQGMDWVDIGKLV 408

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKE---RSNIKDSWVLHHDNADPFADSSET 228
            + Q+ N+ VA++I  L L++    + +R+ +E    S ++D +     +AD   D +  
Sbjct: 409 AREQKKNNAVANLI-CLPLSLADNVISIRLSEEDDAGSEVEDPFETDDSDADSETDLNAA 467

Query: 229 K-------KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
           K       K  +V++ L LS ++NA+ ++D +++A  K++KT    ++A+KS E+K K  
Sbjct: 468 KSVQNYSDKTIIVELTLTLSPWSNAREYYDQRKTAVVKEEKTQLQADRAIKSTEQKIKHD 527

Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNV--- 316
           LK    Q    +   R + WFEKFYWFISS+ YL                    G++   
Sbjct: 528 LKRALKQEKALLQPIRNLMWFEKFYWFISSDGYLVVGAKDKSQAEILYRRHLGSGDIFCH 587

Query: 317 --AWDAKVV----------------------------TNAWWVKAD---------QVSKT 337
             A +A +V                            + AW  KA          QVSK+
Sbjct: 588 ADANNAAIVIVKNNSNTEDAHIAPATLAQAGQLSICSSEAWDSKAGIGAWWVNSSQVSKS 647

Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
            PTG+ L  G+F I G+KNF PP QL +G+S +FK+ E S
Sbjct: 648 TPTGDILQPGNFNISGEKNFLPPGQLILGLSIMFKISEES 687


>gi|51593729|gb|AAH80716.1| Sdccag1 protein, partial [Mus musculus]
          Length = 443

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410

Query: 200 R 200
           R
Sbjct: 411 R 411


>gi|74152610|dbj|BAE42589.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410

Query: 200 R 200
           R
Sbjct: 411 R 411


>gi|26334499|dbj|BAC30950.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410

Query: 200 R 200
           R
Sbjct: 411 R 411


>gi|54887337|gb|AAH37106.2| Sdccag1 protein [Mus musculus]
          Length = 415

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/181 (49%), Positives = 117/181 (64%), Gaps = 16/181 (8%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410

Query: 200 R 200
           R
Sbjct: 411 R 411


>gi|414878087|tpg|DAA55218.1| TPA: hypothetical protein ZEAMMB73_985047 [Zea mays]
          Length = 608

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 104/273 (38%), Positives = 156/273 (57%), Gaps = 19/273 (6%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P ++ Q +     EF +F AA+DEF+S  ES K++ +   +E  A ++L  +K D E R
Sbjct: 351 CPVLLNQFKSREYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENR 410

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L +      + AELI  N E VDAAILA+R  +AN++SWE +   +K+ ++  +PVA
Sbjct: 411 VHTLRKEVDHCVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVA 470

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I +L    N ITLLL                 +N D   +  +T     V+VD+ LSA
Sbjct: 471 GLIDKLNFERNCITLLL----------------SNNLDDMDEDEKTAPVEKVEVDIALSA 514

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           +ANA+R++++K+    KQ+KTI + +KA K+AEKKT+  L   +T+  I   RKV+WFEK
Sbjct: 515 HANARRWYEMKKKQESKQEKTITAHDKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEK 574

Query: 303 FYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
           F WFISSENYL   G  A   +++   +  K D
Sbjct: 575 FNWFISSENYLIISGRDAQQNELIVKRYMSKGD 607


>gi|356558107|ref|XP_003547349.1| PREDICTED: nuclear export mediator factor NEMF homolog [Glycine
           max]
          Length = 1119

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/303 (36%), Positives = 168/303 (55%), Gaps = 30/303 (9%)

Query: 44  EDYIVA---NMEFHPMILQPGS--------TPSVMRQHQGFPIQEFESFAAAVDEFFSTA 92
           E YI+    N+     I QPGS         P ++ Q +     +FE+F AA+DEF+S  
Sbjct: 288 EGYILMQNKNLGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDEFYSKI 347

Query: 93  ESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
           ES + + +   +E  A +KL  +++D E R+  L +      + AELI  N E VDAAIL
Sbjct: 348 ESQRAEQQQKSKENSAAQKLNKIRQDQENRVHVLRKEADHCVKMAELIEYNLEDVDAAIL 407

Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSW 212
           A+R  +A  ++W+D+   VK+ ++  +PVA +I +L L  N + LLL             
Sbjct: 408 AVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLERNCMNLLL------------- 454

Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
               +N D   D  +T     V+VDL LSA+ANA+R+++ K+    KQ+KT+ + EKA K
Sbjct: 455 ---SNNLDEMDDDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQEKTVTAHEKAFK 511

Query: 273 SAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWV 329
           +AE+KT+  L   +T+ +I+  RKV+WFEKF WFISSENYL   G  A   +++   +  
Sbjct: 512 AAERKTRLQLNQEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNELIVKRYMS 571

Query: 330 KAD 332
           K D
Sbjct: 572 KGD 574



 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 44/63 (69%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 616 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 675

Query: 377 SIS 379
           S+ 
Sbjct: 676 SLG 678


>gi|349605644|gb|AEQ00813.1| Serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 388

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 89/179 (49%), Positives = 115/179 (64%), Gaps = 14/179 (7%)

Query: 26  NVSSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           N S +GY++QK+E +P+ E       I+   EFHP +           QH   P  EFES
Sbjct: 219 NFSGKGYIIQKREMKPSLEVDKPTQDILTYEEFHPFLFS---------QHSQCPYIEFES 269

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI
Sbjct: 270 FDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELI 329

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
             N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LL
Sbjct: 330 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLL 388


>gi|113911846|gb|AAI22665.1| SDCCAG1 protein [Bos taurus]
          Length = 443

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 88/178 (49%), Positives = 116/178 (65%), Gaps = 14/178 (7%)

Query: 28  SSQGYLVQKKEKRPNSE-----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           + +GY++QK+E +P+ E     + I+   EFHP +           QH   P  EFESF 
Sbjct: 243 NGKGYIIQKREIKPSLEVDKPTEDILTYEEFHPFLFS---------QHSQCPYIEFESFD 293

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
            AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K ELI  
Sbjct: 294 KAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEM 353

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
           N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA+ IK+LKL  NH+T+LLR
Sbjct: 354 NLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLR 411


>gi|12837616|dbj|BAB23886.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 89/181 (49%), Positives = 116/181 (64%), Gaps = 16/181 (8%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF   VDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKPVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NH+T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410

Query: 200 R 200
           R
Sbjct: 411 R 411


>gi|116193227|ref|XP_001222426.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
 gi|88182244|gb|EAQ89712.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
          Length = 1115

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 125/377 (33%), Positives = 189/377 (50%), Gaps = 71/377 (18%)

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
           I  F+ +   VD FFS+ E  K++ +  ++E  A +KLE  +RD   R+  L+  Q+++ 
Sbjct: 317 ILSFDGYNKTVDNFFSSLEGQKLESRLQEREITAKRKLEAARRDQAQRIEGLQDVQMLNL 376

Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNIN 193
            KA  +  N E V  A+ A+   I   + W DI   V++ Q+ ++PVA +IK  +KL+ +
Sbjct: 377 RKAAAVEANIERVQEAMDAVNGLIQQGMDWVDINKLVEREQKQHNPVAEMIKLPMKLHES 436

Query: 194 HITLLLRVHKERSNIKDSWVLHHDNADPFADSS---ETKKPS---LVDVDLDLSAYANAK 247
            ITLLL   +E   +++     +D  +  AD +   ++K P     +D++L LS   NA+
Sbjct: 437 VITLLLGEEEEEGKVEEEMDFDYDTDEETADDAAEEKSKGPDKRLAIDINLKLSPRNNAR 496

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
            ++D KR+AA KQ+KT+Q +E ALK+AE+K  + LK    Q    +   RK  WFEKF W
Sbjct: 497 YYYDQKRTAADKQEKTVQRSEIALKNAEQKIAEDLKKGLKQEKPILQPIRKQMWFEKFTW 556

Query: 306 FISSENYL--------------------GNV----------------------------- 316
           F+SS+ YL                    G+V                             
Sbjct: 557 FVSSDGYLVLGGRDAQQNEILYKRYLRKGDVYVHADMHGASSVVIKNNPKTPDAPIPPST 616

Query: 317 ---AWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL 363
              A +  V  ++ W          V ADQVSK+AP+GE+L  GSFM+RGK+N  PP  L
Sbjct: 617 LAQAGNLSVCCSSAWDSKAAMGAWWVNADQVSKSAPSGEYLPVGSFMVRGKRNLLPPSLL 676

Query: 364 AMGISFLFKLEESSISR 380
            +G   LFK+ E S SR
Sbjct: 677 MLGFGLLFKISEESKSR 693


>gi|12857277|dbj|BAB30959.1| unnamed protein product [Mus musculus]
          Length = 415

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/181 (49%), Positives = 116/181 (64%), Gaps = 16/181 (8%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +GY++QK+E +P+      +ED I+   EFHP +           QH   P  EFE
Sbjct: 241 NFNGKGYIIQKREAKPSLDADKPAED-ILTYEEFHPFLFS---------QHLQCPYIEFE 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF  AVDEF+S  E  KIDLKA+QQE+ ALKKL+NV++DHE RL AL+Q Q +DK K EL
Sbjct: 291 SFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  N +T+LL
Sbjct: 351 IEMNLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNQVTMLL 410

Query: 200 R 200
           R
Sbjct: 411 R 411


>gi|171684415|ref|XP_001907149.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942168|emb|CAP67820.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1070

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 122/400 (30%), Positives = 186/400 (46%), Gaps = 81/400 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +F P +      P      QG     F+ +   VDEFFS+ E  K++ K  ++E  A +K
Sbjct: 296 DFQPFL------PKQFEDDQGLTTLSFDGYNKTVDEFFSSIEGQKLESKLQEREATAKRK 349

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+  ++D   R+  L   Q ++  KA  I  N E V  A+ A+   +   + W +I   V
Sbjct: 350 LDAARQDQAKRIEGLVGFQTLNLRKAAAIEANIERVQEAMDAVNGLLEQGMDWVNINKLV 409

Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLL-RVHKERSNIKDSWVLHHDNADPFADSS-- 226
           ++ Q   +PVA IIK L +N+  + ITLLL    +E +   +    ++D  +   D++  
Sbjct: 410 EREQAQGNPVAEIIK-LPVNLAESTITLLLGEEEEEEAGEDEDMEFNYDTDEEVVDAAPE 468

Query: 227 --ETKKPS---LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
             + K P     +D++L LS + NA+ +++ KR+AA K++KT+  +  ALKSAE+K  + 
Sbjct: 469 PEKAKGPDKRLAIDINLKLSVWNNAREYYEQKRTAADKEKKTVAQSVIALKSAEQKITED 528

Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK------------------ 321
           L+    Q    +   RK  WFEKF WFISS+ YL     DA+                  
Sbjct: 529 LRKGLKQEKPVLQLIRKQMWFEKFVWFISSDGYLVLGGRDAQQNEILYKRYLKKGDVYVH 588

Query: 322 -----------------------------------VVTNAWWVKA---------DQVSKT 337
                                                ++AW  KA         DQVSK+
Sbjct: 589 ADMHGASTVIIKNSPKTPDAPIPPSTLAQAGSLSVCCSSAWDSKAAMGAWWVNADQVSKS 648

Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           APTGE+L  GSFM+RGKKN  PP  L +G   +F++ E S
Sbjct: 649 APTGEYLPAGSFMVRGKKNPLPPALLMLGFGLMFRISEES 688


>gi|154281559|ref|XP_001541592.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150411771|gb|EDN07159.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 1177

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 135/455 (29%), Positives = 211/455 (46%), Gaps = 102/455 (22%)

Query: 20  VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
           V+  N+N S      + GY+V K E +   +  +          VA ++FHP        
Sbjct: 257 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTVPSKSSNVAYIDFHPF------E 310

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P       G  I  F++F  AVDE+FS+AES K++ +  ++E  A +KLE  ++D + R+
Sbjct: 311 PKQFESEPGTSILRFDTFNKAVDEYFSSAESQKLESRLTEREEIAKRKLEAAQKDQDKRV 370

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L++ Q +   KA+ I  N   V+ AI A+   IA  + W +I   ++  Q   +PVA+
Sbjct: 371 GVLKEAQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQGRQNPVAN 430

Query: 184 IIK-QLKLNINHITLLLRVHKERSN-----IKDSWVLHHDNADPFADSSETKKPSL---- 233
           +IK  LKL  N +TLLL    E          ++ V   +  +   D    KKP +    
Sbjct: 431 VIKLPLKLYENAVTLLLGEPTENEEPMDESEDEAEVEEEEEQESSEDEDSGKKPGVSKKP 490

Query: 234 ------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-- 285
                 +D+DL +S +ANA+++++ K+ AA K++KT+ ST++A+KS +KK    LK    
Sbjct: 491 RQPLLSIDIDLGISPWANARQYYEQKKVAAVKEKKTLNSTKEAIKSTKKKVAADLKQALK 550

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL--------------------GNV--------- 316
           Q    +   R  +WFEKF +F+SS+ YL                    G+V         
Sbjct: 551 QEKPVLRPTRTPFWFEKFIFFLSSDGYLVLGGRDVQQTEILYRRHLKRGDVFVHADVQGA 610

Query: 317 -----------------------AWDAKVVTNAWW----------VKADQVSKTAPTGEF 343
                                  A +  V T+  W            A+QVSKT P GE+
Sbjct: 611 IPVIVKNKPGTLDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWANANQVSKTTPLGEY 670

Query: 344 LTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           L TG F+I G+KN  PP QL +G + +F++   SI
Sbjct: 671 LVTGGFVICGEKNQLPPAQLLLGFAVMFQISGESI 705


>gi|357448763|ref|XP_003594657.1| Serologically defined colon cancer antigen-like protein [Medicago
           truncatula]
 gi|355483705|gb|AES64908.1| Serologically defined colon cancer antigen-like protein [Medicago
           truncatula]
          Length = 1146

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 106/283 (37%), Positives = 158/283 (55%), Gaps = 29/283 (10%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH--- 119
            P ++ Q +     +FE+F  A+DEF+S  ES + + +   +E  AL+KL  ++ D    
Sbjct: 318 CPILLNQFKSRDHTKFETFDLALDEFYSKIESQRSEQQHTAKENSALQKLNKIRNDQVGT 377

Query: 120 -------ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
                  E R+  L +      + AELI  N E VDAAILA+R  +A  +SW+D+   VK
Sbjct: 378 HVQTSTIENRVHTLRKEADNCIKMAELIEYNLEDVDAAILAVRVSLAKGMSWDDLARMVK 437

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           + ++  +PVA +I +L L  N +TLLL                 +N D   D  +T    
Sbjct: 438 EEKKAGNPVAGLIDKLHLERNCMTLLLS----------------NNLDEMDDDEKTLPAD 481

Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
            V+VDL LSA+ANA+R+++LK+    KQ+KTI + EKA K+AE+KT+  L   +T+ +I+
Sbjct: 482 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHEKAFKAAERKTRLQLNQEKTVASIS 541

Query: 293 KARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
             RKV+WFEKF WFISSENYL   G  A   +++   +  K D
Sbjct: 542 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 584



 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 43/63 (68%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKN+ PP  L MG   LF+L+ES
Sbjct: 626 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNYLPPHPLIMGFGLLFRLDES 685

Query: 377 SIS 379
           S+ 
Sbjct: 686 SLG 688


>gi|224101503|ref|XP_002312307.1| predicted protein [Populus trichocarpa]
 gi|222852127|gb|EEE89674.1| predicted protein [Populus trichocarpa]
          Length = 796

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 107/284 (37%), Positives = 160/284 (56%), Gaps = 28/284 (9%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           EF P++L        MR+H      +F++F AA+DEF+S  ES K + +   +E  A++K
Sbjct: 4   EFCPLLLNQFR----MREHV-----KFDAFDAALDEFYSKIESQKSEHQQKTKEGSAIQK 54

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L  ++ D E R+  L +      + AELI  N E V++AILA+R  +A  + WED+   V
Sbjct: 55  LNKIRLDQENRVEMLRKEVDHSVKMAELIEYNLEDVNSAILAVRVALAKGMGWEDLARMV 114

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           K  ++  +PVA +I +L    N +TLLL                 +N D   D  +T   
Sbjct: 115 KDEKKAGNPVAGLIDKLHFEKNCMTLLLS----------------NNLDEMDDDEKTFPV 158

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
             V+VDL LSA+ANA+R+++LK+    KQ+KT+ + EKA K+AEKKT+  L   +++  I
Sbjct: 159 DKVEVDLALSAHANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLSQEKSVATI 218

Query: 292 NKARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
           +  RKV+WFEKF WFISSENYL   G  A   +++   +  K D
Sbjct: 219 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYVSKGD 262



 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 44/63 (69%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 304 AWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 363

Query: 377 SIS 379
           S+ 
Sbjct: 364 SLG 366


>gi|63054438|ref|NP_588145.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe
           972h-]
 gi|48475020|sp|Q9USN8.2|YJY1_SCHPO RecName: Full=Uncharacterized protein C132.01c
 gi|157310510|emb|CAA22870.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe]
          Length = 1021

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 131/398 (32%), Positives = 191/398 (47%), Gaps = 77/398 (19%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP   QP       R+   FP     ++   VDEFFS+ E+ K+  +A  +   A ++
Sbjct: 315 DFHP--FQPLQLVQANRKCMEFP-----TYNECVDEFFSSIEAQKLKKRAHDRLATAERR 367

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE+ K D   +L +L+  Q     +A+ I  N E V+A I  I   +   + W DIE ++
Sbjct: 368 LESAKEDQARKLQSLQDAQATCALRAQAIEMNPELVEAIISYINSLLNQGMDWLDIE-KL 426

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKK 230
            Q+Q+   PVA+ I+  LKL  N +T+ L   +   N  +S     D+ D   D ++ K+
Sbjct: 427 IQSQKRRSPVAAAIQIPLKLIKNAVTVFLPNPESVDNSDESSETSDDDLDDSDDDNKVKE 486

Query: 231 PSL------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD 284
             +      V++DL L A+ANA++ ++L+R A  K+ KT ++  KALKS ++K +Q LK 
Sbjct: 487 GKVSSKFIAVELDLSLGAFANARKQYELRREALIKETKTAEAASKALKSTQRKIEQDLKR 546

Query: 285 VQT--MTNINKARKVYWFEKFYWFISSENYLGNVAWDAK--------------------- 321
             T     I   RK ++FEKF+WFISSE YL     DA+                     
Sbjct: 547 STTADTQRILLGRKTFFFEKFHWFISSEGYLVLGGRDAQQNELLFQKYCNTGDIFVCADL 606

Query: 322 ------VVTN------------------------AW--------W-VKADQVSKTAPTGE 342
                 +V N                        AW        W V+ D+VSK APTGE
Sbjct: 607 PKSSIIIVKNKNPHDPIPPNTLQQAGSLALASSKAWDSKTVISAWWVRIDEVSKLAPTGE 666

Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
            L TGSF IR KKN+ PP  L MG   L++L+E S  R
Sbjct: 667 ILPTGSFAIRAKKNYLPPTVLIMGYGILWQLDEKSSER 704


>gi|297795761|ref|XP_002865765.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
 gi|297311600|gb|EFH42024.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
          Length = 1080

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 111/311 (35%), Positives = 171/311 (54%), Gaps = 33/311 (10%)

Query: 30  QGYLVQKKEKRPN---SEDYIVANM--EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           +GY++ +K+   N   SE   V  M  EF  ++L          Q +    ++FE+F AA
Sbjct: 285 EGYILMQKQILANDTPSESGGVKKMYDEFCSILLN---------QFKSRVYEKFETFDAA 335

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           +DEF+S  ES + + +   +E  A +KL  +++D E R+  L++        AELI  N 
Sbjct: 336 LDEFYSKIESQRSEQQQKAKEDSASQKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNL 395

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           E VDAAILA+R  +A  + W+D+   VK+ ++  +PVA +I +L L  N +TLLL     
Sbjct: 396 EDVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGLIDKLYLEKNCMTLLL----- 450

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                       +N D   D  +T     V+VDL LSA+ NA+R++++K+    KQ+KT+
Sbjct: 451 -----------CNNLDEMDDDEKTLPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTV 499

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAK 321
            + EKA ++AEKKT+  L   + +  I+  RKV+WFEKF WFISSENYL   G  A   +
Sbjct: 500 SAHEKAFRAAEKKTRHQLSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 559

Query: 322 VVTNAWWVKAD 332
           ++   +  K D
Sbjct: 560 MIVKRYMSKGD 570



 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 43/63 (68%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QV+KTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 612 AWDSKIVTSAWWVYPHQVTKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 671

Query: 377 SIS 379
           S+ 
Sbjct: 672 SLG 674


>gi|290975413|ref|XP_002670437.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
 gi|284083996|gb|EFC37693.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
          Length = 1146

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 117/376 (31%), Positives = 184/376 (48%), Gaps = 75/376 (19%)

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           EF+SF  + DEFFS  E+ KI+ +    E    KK+  VKR+ E +L  L+ +    +  
Sbjct: 397 EFDSFDRSADEFFSAIEAKKIESQKSSIENTVEKKMSKVKREQELKLQELQASFDKYETI 456

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
           A LI  + E VD AI  I   +A   SWE I+  +K+  R  DP+A++I +LKL  + IT
Sbjct: 457 ATLIETHYEIVDQAIQVICSALAQSQSWETIKQIIKE-HRDVDPIAAMIHKLKLESSQIT 515

Query: 197 LLL---------------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
           + L                   E ++ +D      +  +  +D  + ++P  +D+D+ L+
Sbjct: 516 VTLPPPSIDDDDEDEFEYEESDEENDDEDEESDDEEKKEKKSDKKKKEEPMRIDIDISLT 575

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFE 301
           A+ANA +++ L++ + + ++K   +++KA+K  E+KT ++ K  Q  + I   RK +WFE
Sbjct: 576 AHANAAKYYSLRKKSGENKEKAAFASKKAIKKTEQKTLESAKKSQIKSEITIRRKRFWFE 635

Query: 302 KFYWFISSENYL---GNVAWDAKVVTNAW------WVKADQVSKTA-----PTGE----- 342
           KFYWFI+SENYL   G  A   ++V   +      ++ AD    ++     PTGE     
Sbjct: 636 KFYWFITSENYLVLGGRDAQQNELVVKRYMRKGDIYIHADVHGASSCIIKNPTGEPIPPL 695

Query: 343 ----------------------------------------FLTTGSFMIRGKKNFFPPCQ 362
                                                   +LT GSFMIRGKKNF PP  
Sbjct: 696 SLQEAGMFCVCRSVAWDNKVMSSAYWVYDHQVSKTAPTGEYLTVGSFMIRGKKNFLPPSP 755

Query: 363 LAMGISFLFKLEESSI 378
           L MG + +FK++ES I
Sbjct: 756 LVMGFAVMFKVDESCI 771


>gi|15240582|ref|NP_199804.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
 gi|8777424|dbj|BAA97014.1| unnamed protein product [Arabidopsis thaliana]
 gi|332008489|gb|AED95872.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
          Length = 1080

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 111/311 (35%), Positives = 170/311 (54%), Gaps = 33/311 (10%)

Query: 30  QGYLVQKKEKRPN---SEDYIVANM--EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           +GY++ +K+   N   SE   V  M  EF  ++L          Q +    ++FE+F AA
Sbjct: 285 EGYILMQKQILANDTTSESGGVKKMYDEFCSILLN---------QFKSRVYEKFETFDAA 335

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           +DEF+S  ES + + +   +E  A  KL  +++D E R+  L++        AELI  N 
Sbjct: 336 LDEFYSKIESQRSEQQQKAKEDSASLKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNL 395

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           E VDAAILA+R  +A  + W+D+   VK+ ++  +PVA +I +L L  N +TLLL     
Sbjct: 396 EDVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGVIDRLYLEKNCMTLLL----- 450

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                       +N D   D  +T     V+VDL LSA+ NA+R++++K+    KQ+KT+
Sbjct: 451 -----------CNNLDEMDDDEKTVPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTV 499

Query: 265 QSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAWDAK 321
            + EKA ++AEKKT+  L   + +  I+  RKV+WFEKF WFISSENYL   G  A   +
Sbjct: 500 SAHEKAFRAAEKKTRHQLSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 559

Query: 322 VVTNAWWVKAD 332
           ++   +  K D
Sbjct: 560 MIVKRYMSKGD 570



 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 43/63 (68%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QV+KTAPTGE+LT GSFMIRGKKNF PP  L MG   LF+L+ES
Sbjct: 612 AWDSKIVTSAWWVYPHQVTKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 671

Query: 377 SIS 379
           S+ 
Sbjct: 672 SLG 674


>gi|170576547|ref|XP_001893673.1| Serologically defined colon cancer antigen 1 [Brugia malayi]
 gi|158600188|gb|EDP37492.1| Serologically defined colon cancer antigen 1, putative [Brugia
           malayi]
          Length = 307

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 98/268 (36%), Positives = 130/268 (48%), Gaps = 82/268 (30%)

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           QA     P+A+ I  L LN N +TLLL                    DP+    + KK  
Sbjct: 4   QACEAGSPIAASIVGLNLNSNQMTLLL-------------------GDPYRPEIDPKK-- 42

Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
            V +D+ LS+Y NA++    K++A +K+QKTI ++ KALKS + K K+TLK V T   + 
Sbjct: 43  -VTIDIALSSYQNARKLHTEKKAAQQKEQKTICASSKALKSTKMKMKETLKVVHTKAEVM 101

Query: 293 KARKVYWFEKFYWFISSENYL--------------------GNVAWDAKV---------- 322
           K R V WFEKF+WF+SSENYL                    G++   A V          
Sbjct: 102 KKRHVMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRPGDIYMHADVRGASSIIIRN 161

Query: 323 ---------------------VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIR 352
                                 ++AW         WV   QVS+TAPTGE+LT GSFMIR
Sbjct: 162 KLGGGDVPPRTLNEAATMAISYSSAWEAKITSSAWWVHQHQVSRTAPTGEYLTPGSFMIR 221

Query: 353 GKKNFFPPCQLAMGISFLFKLEESSISR 380
           GKKN+ P CQL MG   +F+L+E S+ R
Sbjct: 222 GKKNYLPTCQLQMGFGVMFQLDEESLER 249


>gi|347828082|emb|CCD43779.1| similar to serologically defined colon cancer antigen 1
           [Botryotinia fuckeliana]
          Length = 674

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 114/330 (34%), Positives = 173/330 (52%), Gaps = 33/330 (10%)

Query: 28  SSQGYLVQKKE--KRPNSEDYI-----VANMEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           +++GY+V KK+  + P+ E+       +   +FHP        P   +        EFE 
Sbjct: 336 TAKGYIVAKKKDPQTPSDENETDIRKGLLYDDFHPF------KPKQFQDDPSLVFLEFEG 389

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F   VDEFFS+ E  K++ K  ++E+ A KK++  + +   RL  L++ Q +++ KA  +
Sbjct: 390 FNKTVDEFFSSIEGQKLESKLEEREKQAQKKIQAARNEQAKRLGGLQEIQALNERKASAL 449

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N E V  A  A+   IA  + W +I   +++ Q+ N+PVAS+IK  LKL  N +T+LL
Sbjct: 450 QANVERVQEATDAVNGLIAQGMDWFEIGRLIEREQKFNNPVASMIKLPLKLEENTVTILL 509

Query: 200 RVHKERSNIK-----DSWVLHHDNADPFADS-------SETKKPSLVDVDLDLSAYANAK 247
                          DS V   ++ D  A +       ++T+ P  +D+DL LS +ANA+
Sbjct: 510 DEEAFDEEEDSTYETDSDVSESEDEDDTAKTNKKKEKVADTRIP--IDIDLALSPWANAR 567

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
            +FD KRSAA K+ KT+QS+ KALKS E K  Q LK    Q    +   RK  WFEKF W
Sbjct: 568 NYFDQKRSAASKEDKTLQSSSKALKSTEAKIAQDLKKGLKQEKAILRPVRKQMWFEKFIW 627

Query: 306 FISSENYL---GNVAWDAKVVTNAWWVKAD 332
           FISS+ YL   G  A  ++++   +  K D
Sbjct: 628 FISSDGYLVLAGKDAQQSEILYKRYLKKGD 657


>gi|308808798|ref|XP_003081709.1| zinc knuckle (CCHC-type) family protein (ISS) [Ostreococcus tauri]
 gi|116060174|emb|CAL56233.1| zinc knuckle (CCHC-type) family protein (ISS), partial
           [Ostreococcus tauri]
          Length = 1090

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/324 (32%), Positives = 171/324 (52%), Gaps = 28/324 (8%)

Query: 63  TPSVMRQHQGFPIQEFE---------SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
            P  ++Q++G   + +E         +F   VDE+F+  +S  + L+  + E  A+ +LE
Sbjct: 426 APFELKQNEGRRTKTYELPKGLDPALAFDHYVDEYFNELDSQSVILQRRKAEAQAIARLE 485

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
              RD + R+  LE+ + +++++A LI  N E+VD AI A+   +A+ +SW+++EA +K+
Sbjct: 486 KTLRDQKNRVEQLERERELEEQRAVLIEYNHEAVDVAIEAVNSALASGMSWDELEAMIKE 545

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
            +R  +PVA +IK + L  N IT+ L  H +         L  D      D+   KK   
Sbjct: 546 ERRLGNPVAGMIKSMDLANNEITITLENHLDE--------LGEDE-----DALGKKKRVA 592

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V VDL LSA+ANA   F  K+  A K +KT+ +  KA+ +AE K K  ++    +    +
Sbjct: 593 VSVDLGLSAHANASVRFAAKKKNADKFEKTLNAQNKAVAAAESKMKSAMERAANVVVATR 652

Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVKADQVSKTAP-TGEFLTTGSFMI- 351
           AR+  WFEKF+WFI+         W  +   +AWW+  ++VSK +P  G+ L  G   + 
Sbjct: 653 ARQPLWFEKFHWFITXXXX--XXXW--RGCDSAWWIPIERVSKISPVAGDALPPGVVHVA 708

Query: 352 RGKKNFFPPCQLAMGISFLFKLEE 375
             +K F P  QL MG   +F + E
Sbjct: 709 HAEKQFLPHTQLVMGFGLMFVVSE 732


>gi|320581674|gb|EFW95893.1| hypothetical protein HPODL_2176 [Ogataea parapolymorpha DL-1]
          Length = 940

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 127/447 (28%), Positives = 201/447 (44%), Gaps = 91/447 (20%)

Query: 7   VSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEK--RPNSEDYIVANME-----FHPMILQ 59
           V++ ++ C    + L +       GY++ K     +P  ED    N+E     FHP   +
Sbjct: 238 VTAAVNSCEQEFYRLTNLPAGELSGYIIGKHNPFFKP-EEDASYDNLEYVYDEFHP--FE 294

Query: 60  PGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
           P     V ++ +   ++E + +   +D+FFST ES K  LK  QQ+ +A K+L+ VK +H
Sbjct: 295 P-----VHKKKENTRVEEVKGYNRTLDKFFSTLESSKAVLKIQQQQANAAKRLQTVKDEH 349

Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
            T+L  LE+ Q ++  K ELI  + E ++    +++  +  Q+ W +IE  V   Q+  +
Sbjct: 350 MTKLQRLEEQQAINYRKGELITFHSEQIEQCKQSVQALLDQQMDWTNIEKLVAMEQKRRN 409

Query: 180 PVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
           P+A++IK L LN+  N IT+LL          D       ++D  ++      P  V +D
Sbjct: 410 PIANMIK-LPLNLAKNEITVLL---------PDIEEQSDSDSDSDSEEKRKSGPVAVAID 459

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM----TNINK 293
           L LSAYANA R+FD  R+A  KQ KT  S   A+K+ E+  +Q LK +Q      + + +
Sbjct: 460 LSLSAYANATRYFDAMRAALDKQNKTKNSASIAIKNTERTIQQDLKRMQKKSQEPSGLKQ 519

Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------------------VKAD 332
            R  +WFEKF+WFI+S+N+L     D   V   ++                     VK  
Sbjct: 520 IRAKFWFEKFWWFITSDNHLCIAGRDDTQVDLIYYRYFDKNNDVLVSNDLDGLKVIVKNP 579

Query: 333 QVSKTAPTGEFLTTGSFM---------------------------------------IRG 353
             +K  P    L  G F                                        I+G
Sbjct: 580 FKNKDIPPSTLLQAGIFSLSASKAWDNKMVTSPWMVKGTQVSKKDFDGSIVPAGMLNIQG 639

Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
           +K F PPCQL MG   L+  +E +  +
Sbjct: 640 EKTFLPPCQLVMGFGLLWLGDEETTRK 666


>gi|320040092|gb|EFW22026.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 1136

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/299 (33%), Positives = 165/299 (55%), Gaps = 20/299 (6%)

Query: 29  SQGYLVQKKEKRPNSEDYIVAN--------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           ++GY+V + E +P+                +++HP        P          I  F+S
Sbjct: 264 TRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPF------APRQFADGNDISILTFDS 317

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDE++S+ E+ K++ +  ++E    +KLE  KRDHE R+ AL+Q Q +   KAE I
Sbjct: 318 FNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEIHTRKAEAI 377

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N   V+  + A+   IA  + W +I   ++  Q   +PVA +IK  LKL  N +T+LL
Sbjct: 378 ATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLYENTVTVLL 437

Query: 200 RVHKERSNIKDSWVLHHDNADPFADS-SETKKPSL--VDVDLDLSAYANAKRFFDLKRSA 256
              +      DS     ++ +   ++ ++ ++P +  VD+DL L+ +ANA +++D K++A
Sbjct: 438 PEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQYYDQKKTA 497

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           A K++KTI+++++ALKSAEKK    LK    Q    +  AR  +WFEKFY+FISS+ YL
Sbjct: 498 AVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPVLRPARIPFWFEKFYFFISSDGYL 556



 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 31/62 (50%), Positives = 43/62 (69%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW++K +  AWWV ADQVSKT P+GE+L TG  +IRG KN   P QL +G + +F++   
Sbjct: 633 AWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPGQLILGFAVMFQISPE 692

Query: 377 SI 378
           S+
Sbjct: 693 SV 694


>gi|68533893|gb|AAH99277.1| LOC733300 protein [Xenopus laevis]
          Length = 453

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 85/181 (46%), Positives = 112/181 (61%), Gaps = 16/181 (8%)

Query: 26  NVSSQGYLVQKKEKRPN------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N + +G+++QK+EK+P+      SED I  N EFHP +           QH      E +
Sbjct: 241 NFNGKGFIIQKREKKPSLEPDKASED-IFTNEEFHPFLFA---------QHANSTYIELD 290

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
           SF   VDEFFS  E  KID+KA+QQE+ ALKKL NV++DHE RL +L+  Q  DK K EL
Sbjct: 291 SFNKTVDEFFSKLEGQKIDIKALQQEKQALKKLGNVRKDHEHRLESLQYAQDADKAKGEL 350

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           I  N + VD AI  +R  +ANQ+ W +I   VK+AQ   DPVA  IK+LKL  NHIT++L
Sbjct: 351 IEMNLDIVDRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMML 410

Query: 200 R 200
           +
Sbjct: 411 K 411


>gi|425773025|gb|EKV11400.1| hypothetical protein PDIG_50370 [Penicillium digitatum PHI26]
 gi|425782195|gb|EKV20118.1| hypothetical protein PDIP_19610 [Penicillium digitatum Pd1]
          Length = 1107

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 109/305 (35%), Positives = 163/305 (53%), Gaps = 23/305 (7%)

Query: 26  NVSSQGYLVQKKEKRPNSEDYIVANM------EFHPMILQPGSTPSVMRQHQGFPIQEFE 79
             S  GY+V K++ RP  E    +        +FHP        P       G  I EFE
Sbjct: 260 GASHPGYIVAKEDTRPIPEGETSSKAAGLLYEDFHPF------KPRQFENKPGIKILEFE 313

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
            F A VDE+FS+ ES +++ +  ++E  A KKLE+V+ +H+ R+  L+  Q +   KA  
Sbjct: 314 RFNATVDEYFSSLESQRLESRLTEREEAAKKKLESVRFEHKKRIDELKNVQELHIRKANA 373

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLL 198
           I +N   V  A+ A+   +A  + W +I   ++  Q   +PVA IIK  LKL  N ++LL
Sbjct: 374 IQDNVYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQDRGNPVAQIIKLPLKLYENTVSLL 433

Query: 199 L-------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFF 250
           L          +E S+  +S     + AD    S+E +   L +D+DL LS +ANA +++
Sbjct: 434 LGEAGDDEDEEEEFSSSDESDSDSENEADQETSSAERESKLLTIDIDLGLSPWANASQYY 493

Query: 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFIS 308
           D K+ A++K+Q+T QS+ KALKS EKK    LK    +    + +AR  +WFEKF +FIS
Sbjct: 494 DQKKQASEKEQRTTQSSTKALKSHEKKVTTELKRGLKKEKQVLRQARTPFWFEKFVFFIS 553

Query: 309 SENYL 313
           SE YL
Sbjct: 554 SEGYL 558



 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 1/65 (1%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGE-FLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           + AWD+K V +AWW  A QVSK A  G   + TG F I+G+KNF  P QL +G   +F++
Sbjct: 638 STAWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKGEKNFLAPSQLVLGFGIMFQV 697

Query: 374 EESSI 378
            + S+
Sbjct: 698 SQESV 702


>gi|261195108|ref|XP_002623958.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239587830|gb|EEQ70473.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 1150

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 121/411 (29%), Positives = 190/411 (46%), Gaps = 79/411 (19%)

Query: 31  GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           GY+V K E +   +  + A           ++FHP        P          I +F++
Sbjct: 267 GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPF------EPKQFENQADMAILKFDT 320

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDE+FS+ E  K++ +  ++E  A +KLE  ++D E R+  L++ + +   KA+ I
Sbjct: 321 FNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQAI 380

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N   V+ A+ A+   IA  + W +I   ++  Q   +PVA +IK  LKL  N +TLLL
Sbjct: 381 EANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLLL 440

Query: 200 RVHKERSNIKDSWVLHHDNADPFADS------SETKKPSL--------VDVDLDLSAYAN 245
               E     D      ++ +   D         +KKP          +D+DL +S +AN
Sbjct: 441 GEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWAN 500

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
           A+++++ K++AA K++KT+ S +KA+KS EKK    LK    Q    +   R  +WFEKF
Sbjct: 501 ARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEKF 560

Query: 304 YWFISSENYLGNVAWDAK--------------VVTNA-------WWVKADQVSKTAP--- 339
            +FISS+ YL     DA+              V  +A       ++VK    +  AP   
Sbjct: 561 IYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIPP 620

Query: 340 ----------------------TGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
                                  GE+L TG F+IRG+KN  PP QL +G +
Sbjct: 621 GTLSQAGNLCVATSSAWHSKAVMGEYLETGGFVIRGEKNQLPPAQLLLGFA 671


>gi|50555916|ref|XP_505366.1| YALI0F13277p [Yarrowia lipolytica]
 gi|49651236|emb|CAG78173.1| YALI0F13277p [Yarrowia lipolytica CLIB122]
          Length = 1134

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 174/380 (45%), Gaps = 75/380 (19%)

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
           +  FE +   VD +FST ES +  L+   QE+ A K+L   + + + R+  L+Q Q    
Sbjct: 328 VDTFEGYNLTVDRYFSTVESTRYSLRVNAQEQIAEKRLNAARNETKKRVDGLQQVQDRSI 387

Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
                +      V+ AI A++Q     + W+D+E  +   ++  +PVA ++  + L  N 
Sbjct: 388 LMGTALQTYAGRVEEAIAAVKQLQDQGMDWKDMEHLIDLEKKKGNPVAQMVSSMNLEKNR 447

Query: 195 ITLLL-----------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           +TL+L               +              ++  ++ +E+K    V+V+LDL+AY
Sbjct: 448 VTLILPNPDVEDESDSDSDSDMDETDSEGESEESGSESDSNKNESKTLK-VEVNLDLTAY 506

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFE 301
           ANA  +FD+K+ AA+KQ+KT +++  ALKSAE+K K  LK    Q    +   R  YWFE
Sbjct: 507 ANANNYFDIKKVAAQKQEKTEKNSATALKSAEQKVKLDLKRSLAQEQHALRPMRPSYWFE 566

Query: 302 KFYWFISSENYL---GNVAWDAKVVTNAWWVKAD-----------------QVSKTAP-- 339
           KF+WF SS+ YL   G  A   +++   ++ K D                  +  TAP  
Sbjct: 567 KFWWFFSSDGYLVIGGKDAQQNEMLYKRYFRKGDAYVHAEIQGASTVIVKNHLGPTAPLP 626

Query: 340 ---------------------------------------TGEFLTTGSFMIRGKKNFFPP 360
                                                  +GEFL TGSFMIRGKKNF PP
Sbjct: 627 PSTLSQAGSLSICTSKAWDSKVLISAWWVEHGQVSKSAPSGEFLPTGSFMIRGKKNFLPP 686

Query: 361 CQLAMGISFLFKLEESSISR 380
             L +G++ L+  +E S ++
Sbjct: 687 TSLDVGLAILWIADEDSTAK 706


>gi|239610682|gb|EEQ87669.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 1131

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 121/411 (29%), Positives = 190/411 (46%), Gaps = 79/411 (19%)

Query: 31  GYLVQKKEKRPNSEDYIVAN----------MEFHPMILQPGSTPSVMRQHQGFPIQEFES 80
           GY+V K E +   +  + A           ++FHP        P          I +F++
Sbjct: 267 GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPF------EPKQFENQADMAILKFDT 320

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F  AVDE+FS+ E  K++ +  ++E  A +KLE  ++D E R+  L++ + +   KA+ I
Sbjct: 321 FNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQAI 380

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
             N   V+ A+ A+   IA  + W +I   ++  Q   +PVA +IK  LKL  N +TLLL
Sbjct: 381 EANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLLL 440

Query: 200 RVHKERSNIKDSWVLHHDNADPFADS------SETKKPSL--------VDVDLDLSAYAN 245
               E     D      ++ +   D         +KKP          +D+DL +S +AN
Sbjct: 441 GEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWAN 500

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
           A+++++ K++AA K++KT+ S +KA+KS EKK    LK    Q    +   R  +WFEKF
Sbjct: 501 ARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEKF 560

Query: 304 YWFISSENYLGNVAWDAK--------------VVTNA-------WWVKADQVSKTAP--- 339
            +FISS+ YL     DA+              V  +A       ++VK    +  AP   
Sbjct: 561 IYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIPP 620

Query: 340 ----------------------TGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
                                  GE+L TG F+IRG+KN  PP QL +G +
Sbjct: 621 GTLSQAGNLCVATSSAWHSKAVMGEYLETGGFVIRGEKNQLPPAQLLLGFA 671


>gi|255941192|ref|XP_002561365.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211585988|emb|CAP93725.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1160

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 108/311 (34%), Positives = 165/311 (53%), Gaps = 26/311 (8%)

Query: 28  SSQGYLVQKKEKRPNSEDYIVANM------EFHPMILQPGSTPSVMRQHQGFPIQEFESF 81
           S  GY+V K++ RP  E    +        +FHP        P       G  I EFE F
Sbjct: 315 SHPGYIVAKEDTRPVPEGETASKAPALLYEDFHPF------KPRQFENKPGTKILEFERF 368

Query: 82  AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
            A VDE+FS+ ES +++ +  ++E  A KKLE+V+ +H+ R+  L+  Q +   KA+ I 
Sbjct: 369 NATVDEYFSSLESQRLESRLTEREEAAKKKLESVRSEHKKRIDELKNVQEIHIRKADAIQ 428

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL- 199
           +N   V  A+ A+   +A  + W +I   ++  Q   +PVA  IK  LKL  N +TL+L 
Sbjct: 429 DNVYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQGRGNPVAQTIKLPLKLYENTVTLVLG 488

Query: 200 ------RVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
                    +E S+  +     ++     A +    K   +D+DL LS +ANA +++D K
Sbjct: 489 EAGDDEDEDEEFSSSDEESDSENEAEQETARAERESKLLTIDIDLGLSPWANASQYYDQK 548

Query: 254 RSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           + A++K+Q+T QS+ KALKS EKK     K+ LK  + +  + +AR  +WFEKF +FISS
Sbjct: 549 KQASEKEQRTTQSSAKALKSHEKKVTTDLKRGLKKEKQV--LRQARTPFWFEKFIFFISS 606

Query: 310 ENYLGNVAWDA 320
           E YL   A DA
Sbjct: 607 EGYLVIGARDA 617



 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 29/63 (46%), Positives = 40/63 (63%), Gaps = 1/63 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGE-FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
           AWD+K V +AWW  A QVSK A  G   + TG F I+G+KNF  P QL +G   +F++ +
Sbjct: 676 AWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKGEKNFLAPSQLVLGFGIMFQISQ 735

Query: 376 SSI 378
            S+
Sbjct: 736 ESV 738


>gi|156101618|ref|XP_001616502.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805376|gb|EDL46775.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 2067

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 129/437 (29%), Positives = 197/437 (45%), Gaps = 94/437 (21%)

Query: 26  NVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
            V  +G + + KE+R   E   +A  EF P+IL          +     +  F+ F   V
Sbjct: 421 GVGGEGEM-EAKEERSGEE---IAFTEFSPIILNNHKNKV---EENKLEVVHFDDFNKCV 473

Query: 86  DEFFSTAESHKIDLKAVQQE----RDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
           D +FS  E  K D    QQE    + +L K++ +K DHE R+  LE+     ++K  LI 
Sbjct: 474 DTYFSRMELSKYDK---QQEVIKIKKSLTKMDKIKLDHERRIDQLEKEVSTLRKKISLIQ 530

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRV 201
            N E V+ AI  +R  +A   +WE I   +K  ++ N P+A  I  +  N   + LLL  
Sbjct: 531 MNDELVEQAIQLMRAAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDD 590

Query: 202 HKERS-NIKDSWVLHHDNADPFADSSE--------------TKKPSLVDVDLDLSAYANA 246
            +E      DS   +  + DP ++++E              T     V ++L+ S Y N 
Sbjct: 591 GEENGLGSDDSSEANGRSDDPSSEANEQPSKGKKSSNKKAATNNRFAVTINLNNSVYGNV 650

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-----QTMTNINKARKVYWFE 301
           + +  L++ A +K +KT  ST  A+K  EKK K+          +T+  I K RKVYWFE
Sbjct: 651 EDYQKLRKKAEEKIRKTKISTNFAVKKVEKKKKEKENKQKGKHNKTVGQIQKIRKVYWFE 710

Query: 302 KF--------YWFISSENYLGN-------------------------------------- 315
           KF        Y  I+  + L N                                      
Sbjct: 711 KFHWFISSENYLVIAGRDALQNEILFRRYFQKNDVYVHADIHGASTCIIKNPHKDIPIPE 770

Query: 316 --------------VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC 361
                          AW+ K++T+AWWV   QVSK+APTGE+L TGSF+IRGKKN+ P  
Sbjct: 771 KTLSEAGQLAICRSSAWNNKIITSAWWVHYHQVSKSAPTGEYLKTGSFVIRGKKNYLPHV 830

Query: 362 QLAMGISFLFKLEESSI 378
           +L MG+  +F+++ +++
Sbjct: 831 KLEMGLCIIFQVDNAAL 847


>gi|297736760|emb|CBI25961.3| unnamed protein product [Vitis vinifera]
          Length = 321

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/185 (45%), Positives = 105/185 (56%), Gaps = 47/185 (25%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V+VDL LSA+ANA+R+++ K+    KQ+KTI + EKA K+AEKK+   L  V        
Sbjct: 11  VEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQV-------G 63

Query: 294 ARKVYWFEKFYWFISSENYL--------------------GNV----------------- 316
              ++WFEKF WFISSENYL                    G++                 
Sbjct: 64  EHYIHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLHGASRCFTVCH 123

Query: 317 --AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
             AWD+K+VT+AWWV   QVSKTA TGE+LT GSFMIRG KNF PP  L MG   LF L+
Sbjct: 124 SQAWDSKIVTSAWWVYPHQVSKTASTGEYLTVGSFMIRG-KNFLPPHPLMMGFGLLFCLD 182

Query: 375 ESSIS 379
           ESS+ 
Sbjct: 183 ESSLG 187


>gi|322693747|gb|EFY85597.1| serologically defined colon cancer antigen 1 [Metarhizium acridum
           CQMa 102]
          Length = 1063

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 119/403 (29%), Positives = 184/403 (45%), Gaps = 81/403 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP I      P  +++       EF+ +   VDEFFS+ E  K++ +  ++E  A +K
Sbjct: 295 DFHPFI------PHKLQRDPSIKALEFKGYNQTVDEFFSSLEGQKLETRLNEREAAAKRK 348

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+  K D   R+  L+  Q ++  KA  I  N E V  A+ A+   +A  + W DI   V
Sbjct: 349 LDAAKADQAKRIEGLQDAQTLNMRKAAAIEANVEWVQEAMDAVNGLLAQGMDWVDIGKLV 408

Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERS-------NIKDSWVLHHDNADPF 222
           ++ ++  +PVA II  L LN+  N ITL L   +E            DS     + A   
Sbjct: 409 EREKKRKNPVADII-VLPLNLAENLITLSLAEEEEEEAEEADPFETDDSDSEDENEASTI 467

Query: 223 ADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAA----KKQQKTIQSTEKALKSAEKK 277
           +  SE     L V+++L LS ++NA+ +++ +R+A     K QQ+  ++ + A +   + 
Sbjct: 468 SKKSEKPAKGLNVEINLKLSPWSNAREYYEQRRTAVVKEEKTQQQASRALKNAEQKIVED 527

Query: 278 TKQTLKDVQTMTN-INKA---RKVYWF--EKFYWFISSEN----------YL-------- 313
            K+ LK  + +   I K     K  WF     Y  +  ++          YL        
Sbjct: 528 LKKGLKQEKALLQPIRKQLWFEKFLWFISSDGYLVLGGKDAQQNEILYKRYLRKGDVYCH 587

Query: 314 ------------------------------GNV------AWDAKVVTNAWWVKADQVSKT 337
                                         GN+      AWD K    AWWVKADQVSK+
Sbjct: 588 ADLRGAPSVIIKNNPSTPDAPIPPATLAQAGNLSVCASEAWDQKAGMGAWWVKADQVSKS 647

Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
            P G+FL TG+FM+RG+KNF  P QL +G+  +FK+ E S +R
Sbjct: 648 GPAGDFLPTGNFMVRGQKNFLAPAQLLLGLGIMFKISEESKAR 690


>gi|322712137|gb|EFZ03710.1| DUF814 domain-containing protein [Metarhizium anisopliae ARSEF 23]
          Length = 959

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 120/403 (29%), Positives = 184/403 (45%), Gaps = 81/403 (20%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP I      P  ++      + EF+ +   VDEFFS+ E  K++ +  ++E  A +K
Sbjct: 190 DFHPFI------PHKLQGDPSIKVLEFKGYNRTVDEFFSSLEGQKLETRLNEREAAAKRK 243

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+  K D   R+  L+  Q ++  KA  I  N E V  A+ A+   +A  + W DI   V
Sbjct: 244 LDATKADQAKRIEGLQDAQTLNMRKAAAIEANVEWVQEAMDAVNGLLAQGMDWVDIGKLV 303

Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERS-------NIKDSWVLHHDNADPF 222
           ++ ++  + VA II  L LN+  N ITL L   +E            DS     + A   
Sbjct: 304 EREKKRKNAVAGII-VLPLNLAENLITLSLAEEEEEEEEEADPFETDDSDSEDENEASTI 362

Query: 223 ADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAA----KKQQKTIQSTEKALKSAEKK 277
           +  SE     L V+++L LS ++NA+ +++ +R+A     K QQ+  ++ + A +   + 
Sbjct: 363 SKKSEKPVRGLNVEINLKLSPWSNAREYYEQRRTAVVKEEKTQQQASRALKNAEQKIAED 422

Query: 278 TKQTLKDVQTMTN-INKA---RKVYWF--EKFYWFISSEN----------YL-------- 313
            K+ LK  + +   I K     K  WF     Y  +  ++          YL        
Sbjct: 423 LKKGLKQEKALLQPIRKQLWFEKFLWFISSDGYLVLGGKDAQQNEILYKRYLRKGDVYCH 482

Query: 314 ------------------------------GNV------AWDAKVVTNAWWVKADQVSKT 337
                                         GN+      AWD K    AWWVKADQVSK+
Sbjct: 483 ADLRGAPSVIIKNNPSTPDAPIPPATLAQAGNLSVCASEAWDQKAGMGAWWVKADQVSKS 542

Query: 338 APTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           AP G+FL TGSFM+RG+KNF  P QL +G+  +FK+ E S +R
Sbjct: 543 APAGDFLPTGSFMVRGQKNFLAPAQLLLGLGIMFKISEESKAR 585


>gi|325093107|gb|EGC46417.1| DUF814 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 1136

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 167/328 (50%), Gaps = 40/328 (12%)

Query: 20  VLVSNLNVS------SQGYLVQKKEKRPNSEDYI----------VANMEFHPMILQPGST 63
           V+  N+N S      + GY+V K E +   +  +          VA ++FHP        
Sbjct: 266 VVAENVNSSLSTAEETPGYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPF------E 319

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P       G  I  F++F+ AVDE+FS+ ES K++ +  ++E  A +KLE  ++D + R+
Sbjct: 320 PKQFESEPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRV 379

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L++ Q +   KA+ I  N   V+ AI A+   IA  + W +I   ++  Q   +PVA 
Sbjct: 380 GVLKEAQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAK 439

Query: 184 IIK-QLKLNINHITLLLRVHKE--------------RSNIKDSWVLHHDNADPFADSSET 228
           +IK  LKL  N +TLLL    E                  +       D+      S +T
Sbjct: 440 VIKLPLKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKT 499

Query: 229 KKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-- 285
           ++P L +D+DL +S +ANA+++++ K++AA K++KT+ ST+ A+KS EKK    LK    
Sbjct: 500 RQPLLSIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALK 559

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL 313
           Q    +   R  +WFEKF +F+SS+ YL
Sbjct: 560 QEKPVLRPTRTPFWFEKFIFFLSSDGYL 587



 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 34/64 (53%), Positives = 44/64 (68%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AWD+K V  AWWV ADQVSKT P GE+L TG F+I G+KN   P QL +G + +F++ 
Sbjct: 672 STAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFVICGEKNHLSPAQLLLGFAVMFQIS 731

Query: 375 ESSI 378
             SI
Sbjct: 732 GESI 735


>gi|440634980|gb|ELR04899.1| hypothetical protein GMDG_00158 [Geomyces destructans 20631-21]
          Length = 1072

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 106/329 (32%), Positives = 166/329 (50%), Gaps = 38/329 (11%)

Query: 31  GYLVQKKEKRPN--------SEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           GY++ KK++R +        +E   +   +FHP        P             FE F 
Sbjct: 264 GYIIAKKQERTDKVASRDEETERQALLYEDFHPF------KPRQFENDPACTFVPFEGFN 317

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
             VDEFFS+ E  +++ +  ++E  A KKL+  K D + RL  L++ Q +++ KA  I  
Sbjct: 318 NTVDEFFSSIEGQRLESRLYEREVTAKKKLQAAKDDQQKRLGGLQEIQTLNERKAGAIET 377

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL-- 199
           N + V  A  A+   IA  + W +I   +   Q+  +PVASIIK  LKL+ N +TLLL  
Sbjct: 378 NVQRVQEATDAVNGLIAQGMDWIEIGKLIDIEQKRGNPVASIIKLPLKLHENTVTLLLDE 437

Query: 200 ---------RVHKERSNIKDSWVLHHDNADPFADSSE--TKKPSLVDVDLDLSAYANAKR 248
                      ++  S++ DS     D A P  ++ +    K   +D++L  S ++NA+ 
Sbjct: 438 EIFVEDLNDEAYETGSDVSDS----EDEA-PIKEAVKKVVDKRLAIDINLGASPWSNARE 492

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWF 306
           ++  +RSAA+K++KT++S+ KALKS   K +Q LK    Q    +   RK  WFEKF WF
Sbjct: 493 YYGQRRSAAEKEKKTLESSTKALKSTSHKIEQDLKKGLKQEKAILRPVRKHMWFEKFMWF 552

Query: 307 ISSENYL---GNVAWDAKVVTNAWWVKAD 332
           ISS+ YL   G  A   +++   +  K D
Sbjct: 553 ISSDGYLVLGGRDAQQNEILYKRYLRKGD 581



 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 31/61 (50%), Positives = 40/61 (65%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW++K    AWW  ADQVSK+APTG++   GSF +RGKKNF PP  L +G   +F +   
Sbjct: 625 AWESKAGMPAWWANADQVSKSAPTGDYFKPGSFDVRGKKNFLPPAPLLLGFGVMFHVSNE 684

Query: 377 S 377
           S
Sbjct: 685 S 685


>gi|340975808|gb|EGS22923.1| hypothetical protein CTHT_0014010 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 1116

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 125/443 (28%), Positives = 196/443 (44%), Gaps = 90/443 (20%)

Query: 2   VSNEDVSSDIHKCGPTLHVLVSNLNVSS--QGYLVQKKE-----------KRPNSEDYIV 48
           + NED+   + K       ++  +  S   +GY++ K             + PN +   +
Sbjct: 229 LKNEDLFESLFKALEQGRAILDEITSSPVLKGYIIAKPNPHAQEQASEGGEAPNGKASSL 288

Query: 49  ANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA 108
              +F P +      P    +     +  F+ F   VDEFFS+ E  K+  +  ++E  A
Sbjct: 289 LYEDFQPFL------PKQFEEDPNLEVLTFDGFNKTVDEFFSSLEGQKLQSRLQEREATA 342

Query: 109 LKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIE 168
            KKLE  ++D   R+  L++ Q+++  KA  I  N E V  A+ A+   +   + W DI 
Sbjct: 343 KKKLEAARQDQAKRIEGLQEAQVLNLRKAAAIEANIERVQEAMDAVNGLLQQGMDWVDIN 402

Query: 169 ARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSE 227
             V++ Q+ ++PVA IIK  ++L+ N ITLLL   +E     +     +D  +  A+  +
Sbjct: 403 KLVEREQKLHNPVAEIIKLPMRLHENIITLLLGEEEEEGPEDEEMDFEYDTDEEAANDPQ 462

Query: 228 TKKPS------LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
            +K         VD++L LS + NA+ +++ KRSAA K QKTIQ  E ALK+AE K  + 
Sbjct: 463 PEKAKGPDKRLAVDINLKLSPWNNAREYYEQKRSAADKAQKTIQQAEIALKNAEMKIAKD 522

Query: 282 L-KDV-QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK------------------ 321
           L KD+ Q    +   R+  WFEKF WFISS+ YL     DA+                  
Sbjct: 523 LKKDLKQEKPILQPIRQQLWFEKFIWFISSDGYLVLGGRDAQQNEILYKRYFKKGDVFVH 582

Query: 322 -----------------------------------VVTNAWWVKA---------DQVSKT 337
                                                ++AW  KA         D+VSK 
Sbjct: 583 SDVKGAATVIIKNDPKTPDAPIPPATLTQAGCLSVCCSSAWDSKAAMGAWWVTADKVSKL 642

Query: 338 APTGEFLTTGSFMIRGKKNFFPP 360
            PTG+ +  G+FMI G++N   P
Sbjct: 643 GPTGDPMPEGTFMINGERNPLEP 665


>gi|389585510|dbj|GAB68240.1| hypothetical protein PCYB_131140 [Plasmodium cynomolgi strain B]
          Length = 1898

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 122/422 (28%), Positives = 192/422 (45%), Gaps = 92/422 (21%)

Query: 37  KEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHK 96
           KE+  N E  ++A  EF P+IL          +     +  F+ F   VD +FS  E  K
Sbjct: 453 KEEDKNGE--VIAFTEFSPIILNNHKNKV---EENKLEVINFDDFNKCVDTYFSRMELSK 507

Query: 97  IDLKAVQQE----RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152
            D    QQE    + +L K++ +K DHE R+  LE+     K+K  LI  N E V+ AI 
Sbjct: 508 YDK---QQEVIKIKKSLTKMDKIKLDHERRIEQLEKEVSSLKKKISLIQMNDELVEQAIQ 564

Query: 153 AIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL---RVHKERSNIK 209
            +R  +A   +WE I   +K  ++ N P+A  I  +  N   + LLL      ++ S+  
Sbjct: 565 LMRAAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDDEEATEQGSDDL 624

Query: 210 DSWVLHHDNADPFADSSE-------------TKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
            S      + DP ++++E             T+    V ++L+ S Y N + +  L++ A
Sbjct: 625 SSEANEQGSDDPSSEANEQQSKGKASNREVATRSRFAVTINLNNSVYGNVEDYQKLRKKA 684

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF--------YWFIS 308
            +K +KT  ST  A+K  EKK K+ +   + +T     ++VYWFEKF        Y  I+
Sbjct: 685 EEKIRKTKISTNFAVKKVEKKKKKKINRRENITR----QRVYWFEKFHWFISSENYLVIA 740

Query: 309 SENYLGN----------------------------------------------------V 316
             + L N                                                     
Sbjct: 741 GRDALQNEILFRRYFQKNDIYVHADIHGASTCIIKNPHKDIPIPEKTLSEAGQLAICRSS 800

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW+ K++T+AWWV   QVSK+AP GE+L TGSF+IRGKKN+ P  +L MG+  +F+++ +
Sbjct: 801 AWNNKIITSAWWVHYHQVSKSAPAGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNA 860

Query: 377 SI 378
           ++
Sbjct: 861 AL 862


>gi|258574555|ref|XP_002541459.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237901725|gb|EEP76126.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 1070

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 133/432 (30%), Positives = 211/432 (48%), Gaps = 85/432 (19%)

Query: 19  HVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEF 78
           +++  N N  S+    Q  E +P+   YI    ++HP        P     +    I   
Sbjct: 244 YIITRNENKPSEPP-TQGTETKPDKSSYI----DYHPF------EPKQFADNPDTRILPL 292

Query: 79  ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
           ESF  AVDE++S+ E+ K++ +   +E    +KLE  KRDHE R+ AL++ Q ++  KA+
Sbjct: 293 ESFNKAVDEYYSSVEAQKLESRLTDREETMKRKLEATKRDHEKRVGALKEVQQLNVRKAQ 352

Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITL 197
            I  N   V+ AI A    IA  + W +I   ++  Q   +P+A +IK  LKL  N IT+
Sbjct: 353 AIEANLSKVEEAINAANSLIAQGMDWVEIARLIEMEQSRRNPIAKMIKLPLKLYENTITI 412

Query: 198 LL--RVHKERSNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLK 253
           LL   +  +  +  +S     ++        ++++P +  +D+DL L+ +ANA +++D K
Sbjct: 413 LLPDGMPVDDESESESEDEDEEDESGDEPEKKSREPEVLSIDIDLALTPWANASQYYDQK 472

Query: 254 RSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           ++AA K+ KTI++++KALKSAEKK     KQ LK  Q    +  AR  +WFEKF++FISS
Sbjct: 473 KTAAMKEDKTIKASKKALKSAEKKVTADLKQGLK--QEKPVLRPARTPFWFEKFFFFISS 530

Query: 310 ENYL--------------------GNV-------------------AWDAKV-------- 322
           + YL                    G+V                   A+D  +        
Sbjct: 531 DGYLVLGGQDARQDEILYHRHLQKGDVYVHTDTEGAMPMIIKNKPGAFDDPIPPGTLAQA 590

Query: 323 ------VTNAWWVKA---------DQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGI 367
                  + AW  KA         +QVS+T  TGE+L T S +I G+KN   P QL +G 
Sbjct: 591 GTFTVATSRAWDTKALLGAWWVKAEQVSRTTATGEYLPT-SVVISGEKNHLAPGQLILGF 649

Query: 368 SFLFKLEESSIS 379
           + LF++   S++
Sbjct: 650 AVLFQISPESVA 661


>gi|118350963|ref|XP_001008760.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89290527|gb|EAR88515.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 1213

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 110/378 (29%), Positives = 178/378 (47%), Gaps = 83/378 (21%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
           +P  +  H+G    E  SF A+VD++F    + KI  +    E  A KK EN+K D   R
Sbjct: 372 SPLYLTCHEGKKFIENNSFNASVDKYFQVM-AQKIQEEQNDVESIAWKKYENIKNDQLNR 430

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L+  Q     KA+LI  N + VDA I  I+   ++  SW+ I   + + +++ DP+A
Sbjct: 431 IQKLKNEQEEYVVKAQLIEMNIDYVDAIINIIKTLKSSGESWDKITKMINEGKKNGDPMA 490

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I  L    N I++LL                    DP  D  E    ++V +D+  SA
Sbjct: 491 YLIHSLDFENNEISVLL-------------------GDPCDDMEEY---TVVAIDIAYSA 528

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARK------ 296
           + NA+ +++ K+    K++KT+ +++ ALK AEK   + +++++   N+   RK      
Sbjct: 529 HQNARNYYENKKKNIVKEKKTLDASKLALKQAEKTALKEIENLKLKNNVVNTRKQYWFEK 588

Query: 297 VYWF--EKFYWFISSEN----------YL--GNV-------------------------- 316
            YWF   + Y  IS  +          Y+  G++                          
Sbjct: 589 FYWFISSENYLVISGRDMQQNEIIVKKYMRKGDIYMHADFHGAASTIIKNPFKDIPVSQQ 648

Query: 317 --------------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ 362
                         AW+AK++ +AWWV   QVSK A TGE+L +GSFMIRGKKNF  P +
Sbjct: 649 TIEEAAIATICRSKAWEAKIIASAWWVYDHQVSKRAETGEYLPSGSFMIRGKKNFIYPAR 708

Query: 363 LAMGISFLFKLEESSISR 380
           + MG + L+KL++  + +
Sbjct: 709 MEMGCTLLYKLDDQFVEK 726


>gi|213403135|ref|XP_002172340.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
 gi|212000387|gb|EEB06047.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
          Length = 1013

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 117/397 (29%), Positives = 178/397 (44%), Gaps = 77/397 (19%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        P  + Q+      EF S+   VDEFFS+ ES KI+ +   + +   K+
Sbjct: 310 DFHPF------KPLQLLQNNNRTCIEFPSYNECVDEFFSSLESQKIEKQNHDRLKTFAKR 363

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           +EN KRD E +L  L++ Q + ++KA+ I  N + V+ AI  +   +   + W DIE  +
Sbjct: 364 IENAKRDVENKLKELQKAQELSEKKAQAIELNPQLVEGAIEYVNSLVGQAMDWLDIEKLI 423

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR--------VHKERSNIKDSWVLHHDNADPF 222
              QR     AS+I+  L+L  N ITL+L            E+S  +           P 
Sbjct: 424 TVQQRRQHAFASVIRLPLQLKKNLITLVLPDPNPLAVDEESEQSESESDSEPESTIITPV 483

Query: 223 ADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL 282
                  K   V+VDL L A+ANA+  ++ +R AA K++KTI+S+ KA+K+ +K+ +  L
Sbjct: 484 QRRLIQPKGLAVEVDLALGAFANARVHYNNRRLAALKEEKTIESSSKAIKNTQKRAEADL 543

Query: 283 KD--VQTMTNINKARKVYWFEKFYWFISSENYL--------------------GNVAWDA 320
           K    +    +  +R+ ++FEKF+WFISS+ YL                    G+V   A
Sbjct: 544 KTAAAEAKQALTASRRTFFFEKFHWFISSDGYLVLGGRDNQQRELLYEKYCNKGDVYVSA 603

Query: 321 KV-------------------------------VTNAW---------WVKADQVSKTAPT 340
            +                                + AW         WV    VSK   T
Sbjct: 604 DLPNSSSVIIKNRNENDPIPPNTLQQAGALALATSKAWDTKTVISAWWVPIHAVSKVDQT 663

Query: 341 GEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
            + L TG F I  +KN+ PP  L MG   L+ L+E S
Sbjct: 664 KQILPTGHFWINEEKNYLPPTNLVMGYGILWFLDEVS 700


>gi|380483775|emb|CCF40411.1| hypothetical protein CH063_10996 [Colletotrichum higginsianum]
          Length = 1087

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 94/272 (34%), Positives = 150/272 (55%), Gaps = 17/272 (6%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP +      PS         + EF+ +   VDEFFS+ E  K++ K  ++E  A +K
Sbjct: 300 DFHPFL------PSKFANDPTVKVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRK 353

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+  + D E R+  L   Q ++ +KA  I  N E V  A+ A+   +   + W DI   +
Sbjct: 354 LDAARSDQEKRIEGLRGAQSINVQKATAIEANVERVQEAMDAVNGLLQQGMDWVDISKLI 413

Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSS--- 226
           ++ Q+  +PVA IIK L LN+  N ITLLL   ++  + + ++    D +D   + S   
Sbjct: 414 EREQKRRNPVAEIIK-LPLNLAENKITLLLGEEEDIEDDESNYETDSDASDSENEESSNN 472

Query: 227 ---ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL- 282
              +  K   ++VD+ LS +AN++ + + KRSAAKK +KT+Q ++ ALK+AE+K +  L 
Sbjct: 473 NKQKNDKRLEIEVDITLSPWANSRGYHEQKRSAAKKAEKTVQQSQMALKNAEQKIQAELK 532

Query: 283 KDVQTMTNI-NKARKVYWFEKFYWFISSENYL 313
           K ++T   +    RK  WFEKF WF+SS+ YL
Sbjct: 533 KGLKTEKAVLQPIRKQSWFEKFIWFVSSDGYL 564



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 40/64 (62%), Positives = 50/64 (78%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K    AWWV A+QVSK+APTGE+L TGSFM+RG+KNF PP QL +GI  +FK+ E 
Sbjct: 630 AWDSKAGMGAWWVNANQVSKSAPTGEYLPTGSFMVRGQKNFLPPAQLLLGIGIMFKISEE 689

Query: 377 SISR 380
           S +R
Sbjct: 690 SKAR 693


>gi|221059774|ref|XP_002260532.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|193810606|emb|CAQ42504.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 2040

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 122/417 (29%), Positives = 186/417 (44%), Gaps = 79/417 (18%)

Query: 31  GYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS 90
           G  V K+E++   E   +   EF P+IL          +     I  F+ F   VD +FS
Sbjct: 439 GKGVVKEEEKSGEE---ITFTEFSPIILNNHKNKV---EENKLEIIHFDDFNKCVDSYFS 492

Query: 91  TAESHKIDLKAVQQE----RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQES 146
             E  K D    QQE    + +L K++ +K DHE R+  LE+     ++K  LI  N E 
Sbjct: 493 RMELSKYDK---QQEVIKIKKSLTKMDKIKLDHERRIEQLEKEVSSLRKKISLIQMNDEL 549

Query: 147 VDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERS 206
           V+ AI  +R  +A   +WE I   +K  ++ N P+A  I  +  N   + LLL   +E  
Sbjct: 550 VEQAIQLMRAAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDDGEENE 609

Query: 207 NIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS 266
              D      D   P   +    K + V ++L+ S Y N + +  +++ A +K +KT  S
Sbjct: 610 EGSDDSSREADEESPKRATGRESKLA-VTINLNNSVYGNVEDYQKMRKKAEEKIRKTKIS 668

Query: 267 TEKALKSAEKKTKQTLKDV-----QTMTNINKARKVY------WF--EKFYWFISSENYL 313
           T  A+K  EKK K+          +T+  I K RKVY      WF   + Y  I+  + L
Sbjct: 669 TNFAVKKVEKKKKEKENKQKGKHNKTVGQIQKLRKVYWFEKFHWFISSENYLVIAGRDAL 728

Query: 314 GN----------------------------------------------------VAWDAK 321
            N                                                     AW+ K
Sbjct: 729 QNEILFRRYFQKNDVYVHADIHGASTCIIKNPYKDIPIPEKTLSEAGQLAICRSSAWNNK 788

Query: 322 VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           ++T+AWWV   QVSK+APTGE+L TGSF+IRGKKN+ P  +L MG+  +F+++ +++
Sbjct: 789 IITSAWWVHYHQVSKSAPTGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAAV 845


>gi|346976277|gb|EGY19729.1| DUF814 domain-containing protein [Verticillium dahliae VdLs.17]
          Length = 1086

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 101/309 (32%), Positives = 160/309 (51%), Gaps = 30/309 (9%)

Query: 26  NVSSQGYLVQK-------KEKRPNS------EDYIVANMEFHPMILQP-GSTPSVMRQHQ 71
           + ++ GY++ K        E++P+       ED +  +  FHP + Q     PSV     
Sbjct: 309 SATTTGYVIAKYRQKSEETEEKPDDGAETKREDLLYDD--FHPFLPQKFADDPSVK---- 362

Query: 72  GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQL 131
              +  F+ F   VDEFFS+ E  K++ K  ++E  A KKLE  ++DH  R+  L++ Q 
Sbjct: 363 ---VLTFDGFNKTVDEFFSSLEGQKLESKLTEREAAAKKKLEATRQDHAQRIEGLQEAQS 419

Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKL 190
           ++++KA  I  N E V  A+ A+   +   + W +I   +++ Q+  +PVA  IK   KL
Sbjct: 420 LNEQKAAAIEANVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRRNPVAETIKLPRKL 479

Query: 191 NINHITLLLR---VHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANA 246
             N +TLLL    V  E    +          D     ++     L ++++L LS +ANA
Sbjct: 480 GENLMTLLLGTEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEINLGLSPWANA 539

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFY 304
           + ++D +R+AA K+QKT+Q +  AL++AEKK  + LK    Q    +   RK  WFEKF 
Sbjct: 540 REYYDQRRTAAVKEQKTVQHSTMALRNAEKKITEDLKKGLKQEKAVLQPIRKQMWFEKFI 599

Query: 305 WFISSENYL 313
           WFISS+ YL
Sbjct: 600 WFISSDGYL 608



 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 36/61 (59%), Positives = 47/61 (77%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K    AWWV+ADQVSK+APTGE+L  GSFM+RG+KNF PP  L +G+  +F++ E 
Sbjct: 674 AWDSKAGMGAWWVRADQVSKSAPTGEYLPAGSFMVRGQKNFLPPAPLVLGLGIMFRISEE 733

Query: 377 S 377
           S
Sbjct: 734 S 734


>gi|310791286|gb|EFQ26815.1| hypothetical protein GLRG_02635 [Glomerella graminicola M1.001]
          Length = 1073

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 93/272 (34%), Positives = 150/272 (55%), Gaps = 17/272 (6%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP +      P          + EF+ +   VDEFFS+ E  K++ K  ++E  A +K
Sbjct: 299 DFHPFL------PKKFADDPTVKVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRK 352

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L+  + D E R+  L   Q ++ +KA  I  N E V  A+ A+   +   + W DI   +
Sbjct: 353 LDAARSDQEKRIEGLRGAQSINVQKATAIEANVERVQEAMDAMNGLLQQGMDWVDISKLI 412

Query: 172 KQAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADP------FA 223
           ++ Q+ ++PVA IIK L LN+  N ITLLL   ++  + + ++    D +D        +
Sbjct: 413 EREQKRHNPVAEIIK-LPLNLAENTITLLLGEEEDIEDDESNYETDSDASDSEDEDNGNS 471

Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL- 282
           +  ++ K   VDV++ LS +AN++ + + KRSAAKK +KT+Q +  ALK+AE+K +  L 
Sbjct: 472 NKQKSDKRLEVDVNIALSPWANSREYHEQKRSAAKKAEKTVQQSVIALKNAEQKIQAELK 531

Query: 283 KDVQTMTNI-NKARKVYWFEKFYWFISSENYL 313
           K ++T   +    RK  WFEKF WF+SS+ YL
Sbjct: 532 KGLKTEKAVLQPIRKQIWFEKFIWFVSSDGYL 563



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 41/64 (64%), Positives = 50/64 (78%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K    AWWV ADQVSK+APTGE+L TGSFM+RG+KNF PP QL +GI  +FK+ E 
Sbjct: 629 AWDSKAGMGAWWVNADQVSKSAPTGEYLPTGSFMVRGQKNFLPPAQLLLGIGIMFKISEE 688

Query: 377 SISR 380
           S +R
Sbjct: 689 SKAR 692


>gi|307109165|gb|EFN57403.1| hypothetical protein CHLNCDRAFT_57209 [Chlorella variabilis]
          Length = 1158

 Score =  135 bits (339), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 95/233 (40%), Positives = 137/233 (58%), Gaps = 19/233 (8%)

Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
           +E+ A+ KLE ++RDHE RL +L Q     + KA LI  N E+VDAA+ A+R+ +A+ + 
Sbjct: 335 KEKAAVGKLEAIRRDHEKRLGSLGQEAEAAELKAALIEYNLEAVDAALNAVREALASGMD 394

Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHK---------ERSNIKDSWVL 214
           W D+   VK+ +R  +PVA +I  L+L  + +TLLLR  +                +W+ 
Sbjct: 395 WRDLARMVKEERRAGNPVAGLIDSLQLERSRVTLLLRRARVCAWGGGGVAGGVRGGNWLD 454

Query: 215 HHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS-----AAKKQQKTIQSTEK 269
             D  +  A    TK    V+VDL LSA+ANA+ ++D +R      A  KQQKT+ + +K
Sbjct: 455 EEDGDEEAATRPATK----VEVDLGLSAHANARTYYDSRRKHQARGAGVKQQKTLDANQK 510

Query: 270 ALKSAEKKTKQTLKDVQTMTNINK-ARKVYWFEKFYWFISSENYLGNVAWDAK 321
           ALK+AEKK +Q LK V++        RK +WFEKF+WFISSENYL     DA+
Sbjct: 511 ALKAAEKKAQQQLKQVRSAAAAPAITRKPFWFEKFFWFISSENYLVLSGRDAQ 563



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 29/62 (46%), Positives = 36/62 (58%), Gaps = 17/62 (27%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWDAK+VT+AWWV  +QVSKTAP+GE+L                  L MG  ++F L E 
Sbjct: 619 AWDAKIVTSAWWVHPEQVSKTAPSGEYLP-----------------LVMGYGYMFGLAEE 661

Query: 377 SI 378
           SI
Sbjct: 662 SI 663


>gi|400593352|gb|EJP61303.1| DUF814 domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 1062

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 90/274 (32%), Positives = 148/274 (54%), Gaps = 19/274 (6%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP +      P+   ++    I  FE +   VDEFFS+ E  +++ + +++E  A +K
Sbjct: 295 DFHPFV------PTKFEKNDDIEILRFEGYNRTVDEFFSSLEGQRLESRLMEREAAAQRK 348

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           ++  ++D E R+  L+  QL +  KA  I  N E V  A+ +I   +   + W DI   V
Sbjct: 349 IDAARQDQENRIRGLQTAQLDNFRKAAAIEANIERVQEAMDSINGLLNQGMDWVDIGKLV 408

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERS-NIKDSWVLHHDNAD-PFADSSETK 229
            + Q+ N+PVA++I  L LN+    + +R+ +E     +D      D++D  F D  +T 
Sbjct: 409 AREQKKNNPVATLI-CLPLNLVDNVISVRLSEEDDVASEDEEPYETDDSDVRFEDDLDTT 467

Query: 230 KPSL--------VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
           +  L        V++ L+LS ++NA+ ++D +++A  K++KT    +KA+KS E K KQ 
Sbjct: 468 ESGLKNSDKTIVVELTLNLSPWSNARGYYDQRKNAVVKEEKTQLQADKAIKSTEHKVKQD 527

Query: 282 LKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           LK V  Q    +   R   WFEKFYWFISS+ YL
Sbjct: 528 LKKVLKQEKALLQPIRNPMWFEKFYWFISSDGYL 561



 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 33/61 (54%), Positives = 43/61 (70%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K    AWWV ++QVSK+  TG+ L  G+F I G+KNF PP QL +G+S LFK+ E 
Sbjct: 627 AWDSKAGIGAWWVNSNQVSKSTSTGDILQPGNFNISGEKNFLPPGQLVLGLSVLFKISEE 686

Query: 377 S 377
           S
Sbjct: 687 S 687


>gi|403374308|gb|EJY87098.1| DUF3441 multi-domain protein [Oxytricha trifallax]
          Length = 1126

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 171/378 (45%), Gaps = 73/378 (19%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P  + Q    P  E+ SF   VDE+FS  +  +   K   +E +  KK+  +K D   R+
Sbjct: 354 PIPLAQFASDPCLEYASFDQCVDEYFSQLDKQREQSKYSNKEDEIWKKMSRIKDDQAKRI 413

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L++ Q + + KA+L+      V A I  ++    + +SW DI+  VK+ ++  +P+A 
Sbjct: 414 QGLQKEQDLSEFKAQLLQKYIYEVQALIDILQVMQTSGISWNDIQRMVKEEKKAGNPLAD 473

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHH--DNADPFADSSETKKPSLVDVDLDLS 241
           +I ++    N +TL+L    E  + ++ + +    +N DP            VDVDL +S
Sbjct: 474 LIYKMNFEKNSVTLMLDACNE-EDAENEFAVDEKFENFDPVV---------RVDVDLHIS 523

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT-KQTLKDVQTMTNINKARKVYWF 300
           A  N +++F++K+ + +K+ KT  + + A K AE    K+ +K  QT   I++ RKVYWF
Sbjct: 524 AQMNIRKYFEIKKKSYEKEVKTKTAADIAFKDAETNALKEIVKHRQTQ-KIDRMRKVYWF 582

Query: 301 EKFYWFISSENYL---GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLT------------ 345
           EKF WFISSENYL   G  A   +V+   +  K D    T   G  +T            
Sbjct: 583 EKFDWFISSENYLCISGKNAQLNEVLVKRYMDKGDLFMHTDMPGAAVTIIKNPSGLIVPP 642

Query: 346 --------------------------------------TGSF------MIRGKKNFFPPC 361
                                                 TG +      MIRGK+N   P 
Sbjct: 643 ITLNEAAIFELCHSKAWEGKIVTSVYWVHADQVSKTPPTGLYIPTGSFMIRGKRNIMTPS 702

Query: 362 QLAMGISFLFKLEESSIS 379
           +L +G + +F L E SI+
Sbjct: 703 KLELGFTIMFTLNEESIA 720


>gi|302509578|ref|XP_003016749.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
 gi|291180319|gb|EFE36104.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
          Length = 1073

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/290 (33%), Positives = 153/290 (52%), Gaps = 36/290 (12%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        PS  ++     I  FE+F +AVD +FS+ E+ K++ +  ++E  A KK
Sbjct: 254 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 307

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE+ KR+HE R++AL++ Q     KA  I  N   V+ A+ A+   +A  + W +I   +
Sbjct: 308 LESTKREHEKRVNALKEKQEFHVRKARAIETNLPRVEEAMNAVNGLVAQGMDWVEIARLI 367

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR------------------------VHKERS 206
           +  Q   +PVA  IK  LKL  N IT+LL                          + +  
Sbjct: 368 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEEESEEEEEEDDDGYGDDE 427

Query: 207 NIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
             + S   H  +A P  +  E K   L +D+DL +S +ANA++++D K+ AA K++KT++
Sbjct: 428 YERPSQKKH--SAKPLKEKKEKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLK 485

Query: 266 STEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           ++ KA+KS E+K K  LK    Q    + + R   WFEKF++FISS+ YL
Sbjct: 486 ASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 535



 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 38/62 (61%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD K     WWV A QVSK   TG+ L  G FMI+G+KN  PP Q+ +G + LF++   
Sbjct: 601 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 660

Query: 377 SI 378
           S+
Sbjct: 661 SV 662


>gi|326471330|gb|EGD95339.1| hypothetical protein TESG_02825 [Trichophyton tonsurans CBS 112818]
          Length = 1099

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/291 (32%), Positives = 150/291 (51%), Gaps = 35/291 (12%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        PS  ++     I  FE+F +AVD +FS+ E+ K++ +  ++E  A KK
Sbjct: 277 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 330

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE+ KR+HE R++AL++ Q     KA  I  N   V+ A+ A+   +A  + W +I   +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIEINLPRVEEAMNAVNGLVAQGMDWVEIARLI 390

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDN------------ 218
           +  Q   +PVA  IK  LKL  N IT+LL       + ++      +             
Sbjct: 391 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEEESEEEEEEEEEDDDGYG 450

Query: 219 -------------ADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                          P  +  E K   L +D+DL +S +ANA++++D K+ AA K++KT+
Sbjct: 451 DDEYERPSQKKQLTKPLKEKKEMKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTL 510

Query: 265 QSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           +++ KA+KS E+K K  LK    Q    + + R   WFEKF++FISS+ YL
Sbjct: 511 KASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 561



 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 38/62 (61%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD K     WWV A QVSK   TG+ L  G FMI+G+KN  PP Q+ +G + LF++   
Sbjct: 627 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 686

Query: 377 SI 378
           S+
Sbjct: 687 SV 688


>gi|320589532|gb|EFX01993.1| duf814 domain containing protein [Grosmannia clavigera kw1407]
          Length = 1969

 Score =  132 bits (332), Expect = 3e-28,   Method: Composition-based stats.
 Identities = 103/299 (34%), Positives = 154/299 (51%), Gaps = 22/299 (7%)

Query: 28   SSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGST--------PSVMRQHQGFP---IQ 76
            + Q Y+V  KE   +++       E    I  P ST        P   RQ +  P   + 
Sbjct: 1183 TCQSYIVATKES-ADTDASAATTEEGGSAIPVPASTKLDYVDFHPFKPRQFEADPKCVLL 1241

Query: 77   EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
             F+SF  A DEF+S  +  K D +  QQE  A KKLE  +RD   R+ +L++TQ ++  K
Sbjct: 1242 PFDSFNKAADEFYSHLQGLKADRQLHQQESVAFKKLEATRRDQAMRIESLQETQQLNTRK 1301

Query: 137  AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
            A  I  NQE V AAI A+   +     WED+   ++ +   N PVA++IK L + +    
Sbjct: 1302 AAAIEANQEWVQAAIDAVNDQLHVGTDWEDLAHLIENSADSN-PVAALIK-LPMRLADGI 1359

Query: 197  LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
            + L++  E +   D       + D   + +E ++   V+V L LSA+ NA+ ++D KR A
Sbjct: 1360 ITLQLSDEPAADFDE------DFDEDEEEAEEEELLDVNVKLALSAWGNAREYYDQKRVA 1413

Query: 257  AKKQQKTIQSTEKALKSAEKKTKQTLKDVQT--MTNINKARKVYWFEKFYWFISSENYL 313
            A K+QKT + T  AL++AEKK  + LK VQ          R+  WFEKF WF+SS+ +L
Sbjct: 1414 ASKEQKTKEVTSMALRNAEKKVAEELKRVQKGGKPAPQLIRRQLWFEKFLWFVSSDGHL 1472



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 28/60 (46%), Positives = 37/60 (61%)

Query: 317  AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
            AWD K    A+WV A+QV KT  +G+ L  GSF IRG+KN  PP Q  +G   LF++  +
Sbjct: 1540 AWDNKAGFGAYWVHANQVFKTTASGDVLPLGSFDIRGEKNHLPPPQRVLGFGLLFQISNA 1599


>gi|326479424|gb|EGE03434.1| DUF814 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 979

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 94/291 (32%), Positives = 150/291 (51%), Gaps = 35/291 (12%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        PS  ++     I  FE+F +AVD +FS+ E+ K++ +  ++E  A KK
Sbjct: 277 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 330

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE+ KR+HE R++AL++ Q     KA  I  N   V+ A+ A+   +A  + W +I   +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIEINLPRVEEAMNAVNGLVAQGMDWVEIARLI 390

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDN------------ 218
           +  Q   +PVA  IK  LKL  N IT+LL       + ++      +             
Sbjct: 391 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEEESEEEEEEEEEDDDGYG 450

Query: 219 -------------ADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                          P  +  E K   L +D+DL +S +ANA++++D K+ AA K++KT+
Sbjct: 451 DDEYERPSQKKQLTKPLKEKKEMKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTL 510

Query: 265 QSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           +++ KA+KS E+K K  LK    Q    + + R   WFEKF++FISS+ YL
Sbjct: 511 KASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 561



 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 38/62 (61%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD K     WWV A QVSK   TG+ L  G FMI+G+KN  PP Q+ +G + LF++   
Sbjct: 627 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 686

Query: 377 SI 378
           S+
Sbjct: 687 SV 688


>gi|374109900|gb|AEY98805.1| FAGL017Wp [Ashbya gossypii FDAG1]
          Length = 1006

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 121/442 (27%), Positives = 196/442 (44%), Gaps = 106/442 (23%)

Query: 31  GYLVQKK----EKRPNSEDYIVANMEFHPM--ILQPGSTPSVMRQHQGFPIQEFES-FAA 83
           GY+V ++    +++ ++E   +   +FHP    L  GS        + F I E +  +  
Sbjct: 273 GYIVARENPLYDEKKDAEGIRLTYEQFHPFRPYLPDGS-------QKKFEIVEVDGDYNR 325

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
            VD+FFST +S K  L+   QE++A KKLE  K +++ ++  L + Q  ++++   IINN
Sbjct: 326 TVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQELVEVQHTNEQRGNAIINN 385

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
            E V+ A  AI+  +  Q+ W  IE  +K  Q  ++ +A +IK  L L  N I++ L + 
Sbjct: 386 IELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKISVELPLS 445

Query: 203 KERSNIKD-SW----------------------------VLHHDNADPFADSSETKKPSL 233
            E     D SW                            V    + +    S  + K   
Sbjct: 446 NEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKVSNKSIT 505

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--I 291
           V +DL +SAYANA  +F++K++ AKKQ    Q+ +KA+K+ E+K ++ LK      +  +
Sbjct: 506 VSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKKKLKEQHDVL 565

Query: 292 NKARKVYWFEKFYWFISSENYL-----------------------------GNVAW---- 318
              R  Y+FEK++WFIS+E +L                             G+  W    
Sbjct: 566 QVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVSNGFGSQVWIKNF 625

Query: 319 ------------------------DAKVVTNAWWVKADQVSKTAPT-GEFLTTGSFMIRG 353
                                     KV T+ WW  A  +SK     G  L +G+F ++ 
Sbjct: 626 ERTEIPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVGGGLLPSGAFRLKS 685

Query: 354 --KKNFFPPCQLAMGISFLFKL 373
              KNF PP QL MG +F++K+
Sbjct: 686 DEAKNFLPPAQLVMGFAFMWKI 707


>gi|302917991|ref|XP_003052561.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
           77-13-4]
 gi|256733501|gb|EEU46848.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
           77-13-4]
          Length = 1072

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 107/330 (32%), Positives = 171/330 (51%), Gaps = 25/330 (7%)

Query: 2   VSNEDVSSDIHKCGPTLHVLVSNLNVSSQ--GYL-VQKKEKRPNSEDYIVANME------ 52
           ++NE +  D+ K        V  L  S +  GY+  +++EK   +ED      E      
Sbjct: 234 LANETLLDDLVKSLTEARKTVDELTSSGECTGYIFAKRREKLEGAEDDDKPKRENLLYED 293

Query: 53  FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
           FHP +      P  + +     + EF+ +   VDEFFS+ E  K++ +  ++E  A +KL
Sbjct: 294 FHPFV------PQKLSKDPTIEVLEFKGYNETVDEFFSSLEGQKLESRLTEREAAAKRKL 347

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           +  K++   R+  L++ Q ++  KA  I  N E V  A+ A+   ++  + W D+   V+
Sbjct: 348 DAAKQEQAKRIEGLQEAQNLNFRKAAAIEANVERVQEAMDAVNGLLSQGMDWVDVGKLVE 407

Query: 173 QAQRHNDPVASIIKQLKLNI--NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK- 229
           + ++ ++PVA IIK L LN+  N ITL L   +      D +    D +    DS+ TK 
Sbjct: 408 REKKRHNPVAEIIK-LPLNLAENLITLELAEEEFEPEEDDPYETDDDESADEEDSTPTKG 466

Query: 230 ----KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
               K   V+++L LS ++NA+ +FD ++SAA K++KT Q   +ALK+AE+K  Q LK  
Sbjct: 467 KHASKALSVEINLGLSPWSNAREYFDQRKSAAVKKEKTEQQASRALKNAEQKITQDLKKG 526

Query: 286 --QTMTNINKARKVYWFEKFYWFISSENYL 313
             Q    +   RK  WFEKF WFISS+ YL
Sbjct: 527 LKQEKALLQPIRKQLWFEKFIWFISSDGYL 556



 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 40/61 (65%), Positives = 49/61 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K   +AWWV ADQVSK+APTGEFL TGSFM+RGKKNF PP QL +G+  +F++ E 
Sbjct: 622 AWDSKAGMSAWWVNADQVSKSAPTGEFLPTGSFMVRGKKNFLPPAQLLLGLGLVFRISEE 681

Query: 377 S 377
           S
Sbjct: 682 S 682


>gi|315050252|ref|XP_003174500.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
 gi|311339815|gb|EFQ99017.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
          Length = 1093

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 98/295 (33%), Positives = 150/295 (50%), Gaps = 45/295 (15%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        PS  +         FESF +AVD++FS+ E+ K++ +  ++E  A KK
Sbjct: 277 DFHPF------EPSQTKDLPNTTTLRFESFNSAVDKYFSSIEARKLESRLTEKEDAARKK 330

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE+ KR+HE R++AL++ Q     KA  I  N   V+ A+ A+   +A  + W +I   +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIETNLLQVEEAMTAVNGLVAQGMDWVEIARLI 390

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR-----------------------------V 201
           +  Q   +PVA  IK  LKL  N IT+LL                               
Sbjct: 391 EMEQGKRNPVALSIKLPLKLYENTITVLLNEEVAEEEEEEESDESDEEEDEDDDDGYGDD 450

Query: 202 HKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQ 260
             ER   K   V      +P  +  E K   L +D+DL +S +ANA++++D K+ AA K+
Sbjct: 451 EYERPKQKKRLV------NPQREKKEKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKE 504

Query: 261 QKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           +KT++++ KA+KS E+K K  LK    Q    + + R   WFEKF++FISS+ YL
Sbjct: 505 EKTLKASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 559



 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 30/62 (48%), Positives = 38/62 (61%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD K     WWV A QVSK   TG+ L  G FMI+G+KN  PP Q+ +G + LF++   
Sbjct: 625 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISSQ 684

Query: 377 SI 378
           SI
Sbjct: 685 SI 686


>gi|190345457|gb|EDK37344.2| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 873

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 124/455 (27%), Positives = 194/455 (42%), Gaps = 122/455 (26%)

Query: 21  LVSNLNVSSQGYLVQKK----EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
           LVS+L     G +V KK    E   +++D      EFHP   +P          + F   
Sbjct: 127 LVSSLGTEVTGIIVSKKNPAFEPSDDNKDLEYLYDEFHP--FKP-----YKENLESFKFT 179

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E   +   VD FFST +S K +L+  QQ+ +A K+L N + + + ++  L   Q ++ +K
Sbjct: 180 EIRGYNKTVDTFFSTLDSKKHELRMEQQKHNAKKRLLNAREERDKQIDNLRIQQEMNSKK 239

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
            + II + + V   I +++  +  Q+ W +IE+ +K  Q   + VA  IK L LN+    
Sbjct: 240 GDAIIYHADLVSECIASVQTLLDQQMDWANIESLIKLEQSRGNSVAKTIK-LPLNLTENK 298

Query: 197 LLLRVHKERSNIKDSWVLHHDNAD--------------------------------PFAD 224
           + L++    S         +D AD                                P   
Sbjct: 299 IGLKLPDTDS--------MYDPADIDSESDSETSSESETESESESESGSESEDETPPKRM 350

Query: 225 SSETKK---PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
           S + K    P+L V +DL LS +ANA+ +F+ K+ A  KQ+K  ++T+ AL++A+KK +Q
Sbjct: 351 SKKAKSKEIPALSVWIDLSLSPFANARTYFESKKQAESKQEKVEKNTDMALRNAQKKIEQ 410

Query: 281 TLKDVQTMTNIN----KARKVYWFEKFYWFISSENYL----------------------- 313
            L   + + N N    + R  YWFEKF+WF+SSE YL                       
Sbjct: 411 DL--AKNLKNENETLRQVRPKYWFEKFFWFVSSEGYLCIAGRDDAQVDMIYYRHFSDNDF 468

Query: 314 ---GNVAWDAKVV-------------------------TNAW--------WVKA-DQVSK 336
               ++    KVV                         + AW        W  A + V+K
Sbjct: 469 FVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAMSASAAWNGKITTSPWFLAGNDVTK 528

Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
               G  + +G+F  +GKK F PP QL MG+ F F
Sbjct: 529 LDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYF 563


>gi|219109751|ref|XP_002176629.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411164|gb|EEC51092.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1238

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 169/324 (52%), Gaps = 36/324 (11%)

Query: 5   EDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKE-------KRPNSEDY-----IVANME 52
           E + + + + GP +   + +  + + GY++ +          +P+SE+      +VA   
Sbjct: 309 ERLQTSLSEQGPAIMYNLHSAAIDTPGYILYQPRVEEDIVNGKPHSENLSSAVAVVAKEL 368

Query: 53  FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
            H   +     P ++ QHQ  P  E++ F AAV +FF+   + K  LK    E    +KL
Sbjct: 369 AHADKVLLEFQPHLLAQHQNCPRLEYKHFGAAVADFFAHMVAQKRLLKVQASEMAVQEKL 428

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
             V++D   R+ ALE+ Q   +  A+++ NN E+VD A+L I   + + + W+ +   V 
Sbjct: 429 RKVQQDQADRVMALERDQQTLQAYAQVVKNNAENVDKALLVINSALDSGMDWDQLIELVS 488

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
             Q + +P+A++I +L+L  N I ++LR+ +                DPF + S+     
Sbjct: 489 VEQANRNPIANLIVRLELE-NEI-MILRLPR----------------DPFDELSDVLN-- 528

Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
            V+V L  SA+ANA   F   R++ +K QKT++S+ KAL++AE+  ++ L + Q  T   
Sbjct: 529 -VNVSLKDSAHANASALFAKYRASKEKTQKTLESSSKALQAAEESAQRQLIEAQRRTKQT 587

Query: 293 KA---RKVYWFEKFYWFISSENYL 313
            A   RK  W+EKF+WF++S+NYL
Sbjct: 588 VAAVKRKPAWYEKFHWFVTSDNYL 611



 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 41/65 (63%), Positives = 55/65 (84%), Gaps = 1/65 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
           AW +++VT+AWWV++ QVSKTAP+GEFLT GSFM+RGKKNF PP  L MG++ LF+L ++
Sbjct: 683 AWASRMVTSAWWVESHQVSKTAPSGEFLTVGSFMVRGKKNFLPPSPLEMGLAVLFRLGDD 742

Query: 376 SSISR 380
            SI+R
Sbjct: 743 DSIAR 747


>gi|385304258|gb|EIF48283.1| tae2-like protein [Dekkera bruxellensis AWRI1499]
          Length = 979

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 114/422 (27%), Positives = 202/422 (47%), Gaps = 53/422 (12%)

Query: 5   EDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQK------KEKRPNSEDYIVANMEFHPMIL 58
           E V + + +   T   L+       QG++++K        K  +SE+      EFHP   
Sbjct: 259 EKVVTSLQESENTFKSLLQTPPGKVQGWILRKINPLFDNTKEESSENLKYTYEEFHP--F 316

Query: 59  QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD 118
           +P     V ++++   +   + +   VD FF+  E  K  L   QQ+  A K+L+ VK +
Sbjct: 317 EP-----VHKENEDSKVDVVDGYNKTVDTFFTMIELSKASLSRQQQKAAAAKRLQLVKEE 371

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
           +E +L+ L+  Q ++++K  LI  +   ++    +I+  +  Q+ W++I+  ++  +R  
Sbjct: 372 NEKKLAKLDAVQELNRKKGYLITLHSSEIEDCRSSIQALLDQQMDWQNIDKLIEVERRRG 431

Query: 179 DPVASIIKQLKLNINHITLLLRVHKE-----RSNIKDSWVLHHDNADPFADSSETKKPSL 233
           +P A +IK L L  +  T+LL   +E       +  DS      + D   D +E KK ++
Sbjct: 432 NPTAKMIKSLNLLKHEFTVLLPDEQEVVDDENEDESDSDSDSDSDDDDDDDETEDKKSNI 491

Query: 234 VDVDLDL--SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK-------- 283
           + V +D+  SA+AN+ R+FD K++A +KQ+KT ++   A+K++E K  + +K        
Sbjct: 492 ISVSIDIRESAFANSTRYFDAKKNAQEKQEKTKENAAIAIKNSEMKIHRDMKRLENESKN 551

Query: 284 ----------------DVQTMTNINKARKVYWFEKF-------YWFISSENYLGNV--AW 318
                           D    ++++K+ KV     +         F+ +  Y      AW
Sbjct: 552 TVDIHSIYYRYLDNNTDYLVSSDVDKSLKVVIKNPYKNKEIPPSTFVQAGIYCLTTSKAW 611

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           D+K+  + W+VK D VSK    G  L  G   I+G KNF PP QL MGI  L+  +E + 
Sbjct: 612 DSKMSPSPWFVKGDAVSKKDFDGSLLPPGLLNIKGDKNFLPPSQLVMGIGLLWLPDEKTK 671

Query: 379 SR 380
           +R
Sbjct: 672 AR 673


>gi|407039370|gb|EKE39608.1| zinc knuckle domain containing protein [Entamoeba nuttalli P19]
          Length = 959

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 164/360 (45%), Gaps = 87/360 (24%)

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           EFESF  A+DEF S  E  + + +  ++E    KK++ V   H+ R   L       K +
Sbjct: 290 EFESFEKAMDEFHSHIEKQEYEAEVEKKEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNE 349

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP--VASIIKQLKLNINH 194
           AE +  N + VD  I  I   +  ++ WE IE  + ++ + NDP  +A  IK+     N 
Sbjct: 350 AEAVEENIQVVDQLIQEINVFLKEKMKWEQIEGII-ESLKENDPTSIAKYIKRFDFK-NE 407

Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
           + +L               L H N D   +         V+V L+ + + N + F+++++
Sbjct: 408 VVVL--------------ELKHTNEDKIIE---------VEVALNKNGFENIRNFYEMRK 444

Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFEKFYWFISSENYL 313
           +   K +KT++S + A+K AE K ++  K+ + T+ ++ K RK +WFEKF+WF+SSEN++
Sbjct: 445 NILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRKRFWFEKFHWFLSSENFI 504

Query: 314 ---GNVAWDAKVVTNAW------WVKAD-----------------------QVSKTA--- 338
              G  A    V+   +      +V AD                       Q  K A   
Sbjct: 505 IISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKGIPGKTIGAPTLEQAGKIAVCR 564

Query: 339 ------------------------PTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
                                   P+GE+LTTGSFMIRGKKN+ PP  L  GI  +F +E
Sbjct: 565 SSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKNYLPPVPLVFGIGIMFAVE 624


>gi|296813237|ref|XP_002846956.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
           113480]
 gi|238842212|gb|EEQ31874.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
           113480]
          Length = 1103

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/289 (33%), Positives = 150/289 (51%), Gaps = 33/289 (11%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        PS  +      +  F++F +AVD++FS+ E+ K++ +  ++E  A KK
Sbjct: 277 DFHPF------EPSQSKDLPNTTMLRFDTFNSAVDKYFSSIEARKLESRLTEREDAARKK 330

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE  KRDHE R++AL++ Q     KA  I  N   V+ AI A+   +A  + W +I   +
Sbjct: 331 LEATKRDHEKRVNALKEKQEFHVRKAHAIEANLPQVEDAINAVNGLVAQGMDWVEIARLI 390

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHD------------- 217
           +  Q   +PVA  IK  LKL  N IT+LL      +  +D      +             
Sbjct: 391 EMEQAKGNPVALCIKLPLKLYENTITILLTEETAETEDEDEESDESEGDDEDEDNDYGDD 450

Query: 218 ----------NADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS 266
                      A    +  E K   L +D+DL +S +ANA++++D K+ AA K++KT+++
Sbjct: 451 EYERPKHKKMTAKTQKEKKERKDNRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLKA 510

Query: 267 TEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           + KA+KS EKK K  LK    Q    + +AR   WFEKF++FISS+ YL
Sbjct: 511 STKAIKSTEKKVKADLKLALKQEKPVLRRARNPAWFEKFFFFISSDGYL 559



 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 30/62 (48%), Positives = 38/62 (61%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD K     WWV A QVSK   TG+ L  G FMI+G+KN  PP Q+ +G + LF+L   
Sbjct: 625 AWDTKAAMGGWWVHASQVSKVTSTGDILKAGHFMIKGEKNHLPPGQIVLGFAVLFQLSPQ 684

Query: 377 SI 378
           S+
Sbjct: 685 SV 686


>gi|327303108|ref|XP_003236246.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
 gi|326461588|gb|EGD87041.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
          Length = 1098

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 96/292 (32%), Positives = 152/292 (52%), Gaps = 38/292 (13%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        PS  ++     I  F +F +AVD +FS+ E+ K++ +  ++E  A KK
Sbjct: 277 DFHPF------EPSQSKEAPNTTILRFGNFNSAVDRYFSSIEAQKLESRLTEKEDAARKK 330

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE+ KR+HE R++AL++ Q     KA  I  N   V+ A+ A+   +A  + W +I   +
Sbjct: 331 LESTKREHEKRVNALKEKQEFHVRKARAIETNLPRVEEAMNAVNGLVAQSMDWVEIARLI 390

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR--------------------------VHKE 204
           +  Q   +PVA  IK  LKL  N IT+LL                            + +
Sbjct: 391 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEGGTEDDEEEEEEEEPEEEEEEDDDDGYGD 450

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKT 263
               + S   H  +A P  +  E K   L +D+DL +S +ANA++++D K+ AA K++KT
Sbjct: 451 DEYERPSQKKH--SAKPLKEKKEKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKT 508

Query: 264 IQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           ++++ KA+KS E+K K  LK    Q    + + R   WFEKF++FISS+ YL
Sbjct: 509 LKASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 560



 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 38/62 (61%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD K     WWV A QVSK   TG+ L  G FMI+G+KN  PP Q+ +G + LF++   
Sbjct: 626 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 685

Query: 377 SI 378
           S+
Sbjct: 686 SL 687


>gi|67468480|ref|XP_650274.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56466879|gb|EAL44894.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449704977|gb|EMD45123.1| zinc knuckle domain containing protein [Entamoeba histolytica KU27]
          Length = 959

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 164/360 (45%), Gaps = 87/360 (24%)

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           EFESF  A+DEF S  E  + + +  ++E    KK++ V   H+ R   L       K +
Sbjct: 290 EFESFEKAMDEFHSHIEKQEYEAEVEKKEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNE 349

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP--VASIIKQLKLNINH 194
           AE +  N + VD  I  I   +  ++ WE IE  + ++ + NDP  +A  IK+     N 
Sbjct: 350 AEAVEENIQVVDQLIQEINVFLKEKMKWEQIEGII-ESLKENDPTSIAKYIKRFDFK-NE 407

Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
           + +L               L H N D   +         V++ L+ + + N + F+++++
Sbjct: 408 VVVL--------------ELKHTNEDKIIE---------VEIALNKNGFENIRNFYEMRK 444

Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFEKFYWFISSENYL 313
           +   K +KT++S + A+K AE K ++  K+ + T+ ++ K RK +WFEKF+WF+SSEN++
Sbjct: 445 NILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRKRFWFEKFHWFLSSENFI 504

Query: 314 ---GNVAWDAKVVTNAW------WVKAD-----------------------QVSKTA--- 338
              G  A    V+   +      +V AD                       Q  K A   
Sbjct: 505 IISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKGIPGKTIGAPTLEQAGKIAVCR 564

Query: 339 ------------------------PTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
                                   P+GE+LTTGSFMIRGKKN+ PP  L  GI  +F +E
Sbjct: 565 SSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKNYLPPVPLVFGIGIMFAVE 624


>gi|340059520|emb|CCC53907.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 1048

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/249 (30%), Positives = 137/249 (55%), Gaps = 6/249 (2%)

Query: 63  TPSVMRQHQGFPIQE--FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP ++ Q+ G  + +    SF +  D FF   E  KI+ +  ++    + K E  +RDH+
Sbjct: 305 TPVLLAQYDGENVTKSYLPSFGSVCDTFFLHTEEGKIEQQKEKKTVAVMSKKERCERDHQ 364

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R+ ALE+ +L +  K EL+I N E +DAAI  I   +A+ + W+ +   +KQ      P
Sbjct: 365 RRIEALERMELENARKGELLIQNAEKIDAAIGLINGALASGIQWDALRRLLKQRHAEGHP 424

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VA ++ +L L+ N++++L+  + +   I +   + +++     D    K P +++VDL  
Sbjct: 425 VAYMVHELFLDRNNMSVLVETNDDDDCIDEGGSVSYESK---VDDC-NKPPWVIEVDLSK 480

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +A+ANA  +F  K++   K  +T+ +T +A++ AEKK ++     QT+ +I   R   W+
Sbjct: 481 TAHANAAAYFSQKKANRAKLDRTVAATAQAMRGAEKKGERMAARHQTVKDIATERHRCWW 540

Query: 301 EKFYWFISS 309
           EKF WF +S
Sbjct: 541 EKFNWFRTS 549



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 13/74 (17%)

Query: 301 EKFYWFISSENYLGNVAWDAKVVTNAWWVKADQ-VSKTAPTGEFLTTGSFMIRGKKNFFP 359
           E   W +S  +     AW+ K    AWWV A Q +  TA       TG ++  G+K+   
Sbjct: 632 EAAAWCVSRSS-----AWEGKFNVGAWWVYASQIIGGTA-------TGCYLFSGEKHHVL 679

Query: 360 PCQLAMGISFLFKL 373
           P  LA+G   LF++
Sbjct: 680 PQPLALGCGLLFRV 693


>gi|255710571|ref|XP_002551569.1| KLTH0A02530p [Lachancea thermotolerans]
 gi|238932946|emb|CAR21127.1| KLTH0A02530p [Lachancea thermotolerans CBS 6340]
          Length = 1058

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 127/457 (27%), Positives = 197/457 (43%), Gaps = 131/457 (28%)

Query: 31  GYLVQKKEKRPNSEDYIVANMEF-----HPMILQPGSTPSVMRQHQGFP--IQEFESFAA 83
           GY++ KK    +SE    AN+EF     HP        P +    +G+   I+    +  
Sbjct: 266 GYIIAKKNPHYDSEKGD-ANLEFVYEQFHPF------PPHLSEDEKGYTKIIEVPGQYNK 318

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
            VD+FFST ES K  L+   QE  A  +LE+ K D+E R+ AL   Q  ++ +   II  
Sbjct: 319 TVDDFFSTIESSKYALRIQNQEFQAKNRLESAKLDNEKRIQALIDVQTQNEVRGHAIIAA 378

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
            + V+ A  AI+  +  Q+ W+ IE  +   Q+ N+ +A +IK  L L  N  TL L  +
Sbjct: 379 ADLVEEAQNAIKALVEQQMDWKTIEVLISNEQKKNNRIARLIKLPLDLKNNKFTLSLPRN 438

Query: 203 KERSNIKDSWVLHHDNADPFADS------------------------------------- 225
            E         +  DN+D   D+                                     
Sbjct: 439 DE---------IESDNSDEEEDNLTSSEDETSSSDSSDSSLSDFEADDNDEDELTSVSNI 489

Query: 226 -------SETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
                   + +KPS+   +DL LSAYANA  +F++K+S  +KQ+K  ++ +KALK+ E++
Sbjct: 490 KKDRNDNKKKEKPSIDATIDLTLSAYANASNYFNIKKSNVEKQKKVEKNAQKALKNIEQR 549

Query: 278 TKQTLKDVQTMTN--INKARKVYWFEKFYWFISSENYL--------------GNVAWDAK 321
            ++ LK     ++  +NK RK Y+FEKF+WF+SSE +L              G    D  
Sbjct: 550 IEKDLKKKLKESHDVLNKTRKPYFFEKFHWFVSSEGFLVLMGKSGMESDQIYGKYIHDND 609

Query: 322 V-VTNAW----WVKADQVSKTAPT------------------------------------ 340
           V V+N++    W+K    ++  P                                     
Sbjct: 610 VFVSNSFDTHVWIKNPDETEVPPNTLMQAGIMCMSASPAWSKKIQSSAWWCFAKELSKFD 669

Query: 341 ---GEFLTTGSFMIRG--KKNFFPPCQLAMGISFLFK 372
              GE L  G+F ++   KK+F PP QL MG + L+K
Sbjct: 670 NYGGEVLPAGTFRLKDEKKKSFLPPSQLVMGFALLWK 706


>gi|46128721|ref|XP_388914.1| hypothetical protein FG08738.1 [Gibberella zeae PH-1]
          Length = 1077

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/308 (31%), Positives = 156/308 (50%), Gaps = 30/308 (9%)

Query: 28  SSQGYLVQKKEKRPNS----------EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
           +  GY+  K+ +RP             D ++ + +FHP I      P  ++      + E
Sbjct: 262 TCTGYIFAKRRERPEGTEVDEETKTKRDNLLYD-DFHPFI------PYKLKNDPAIEVLE 314

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F+ +   VDEFFS+ E  +++ K  ++E  A +KLE  K +   R+  L++ Q ++  KA
Sbjct: 315 FQGYNETVDEFFSSLEGQRLESKLTEREATAKRKLEAAKNEQNKRIEGLQEAQSLNFRKA 374

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
             I  N E V  A+ A+   +   + W D+   V++ ++ ++PVA IIK L LN+  N I
Sbjct: 375 AAIEANVERVQEAMDAVNGLLNQGMDWVDVGKLVEREKKRHNPVAEIIK-LPLNLAENLI 433

Query: 196 T--------LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAK 247
           T                 ++  D   L  D A   A   ++ K   V+++L LS ++NA+
Sbjct: 434 TLELAEEEFEPEEDDPYETDDDDDSALGDDEATSAAKGKQSNKALNVEINLGLSPWSNAR 493

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
            +FD +++AA K++KT Q   KALK+AE+K  + LK    Q    +   RK  WFEKF W
Sbjct: 494 EYFDQRKTAAVKEEKTQQQASKALKNAEQKITEDLKKGLKQEKALLQPIRKQMWFEKFTW 553

Query: 306 FISSENYL 313
           FISS+ YL
Sbjct: 554 FISSDGYL 561



 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 40/61 (65%), Positives = 46/61 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K    AWWV ADQVSK+APTGEFL  GSFMIRGKKNF PP QL +G+   F++ E 
Sbjct: 627 AWDSKAGMPAWWVNADQVSKSAPTGEFLQAGSFMIRGKKNFLPPAQLLLGLGLAFRISEE 686

Query: 377 S 377
           S
Sbjct: 687 S 687


>gi|301106825|ref|XP_002902495.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262098369|gb|EEY56421.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1051

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 91/261 (34%), Positives = 146/261 (55%), Gaps = 13/261 (4%)

Query: 63  TPSVMRQH-QGFP-IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP +  QH Q +  ++ F++F  AVDE+FS  E+   ++     +  A  KL  +K++ +
Sbjct: 345 TPYLYAQHLQAYKKVKSFDTFDEAVDEYFSRFEAETAEVAKQSAQLAAENKLAKLKKNQQ 404

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            +L+ L + Q    + A+LI  NQ+ V+  +L IR  +A+ + W  +E  V+  Q++ +P
Sbjct: 405 QQLAQLREVQEQSFQDAQLIEANQQDVENVLLVIRSALASGMDWRGLEELVRYEQKNGNP 464

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VAS+I QL L  N + +LL          DS    +++        E KK  ++ +DL L
Sbjct: 465 VASLIHQLDLEHNRVAILL---------CDSDEDDYEDGG-DGTGEEDKKAHVIWIDLSL 514

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-INKARKVYW 299
           SA ANA+  +  K+ A  K +K  ++T+KA+  AEK TK+TL+  QT  N I + RK  W
Sbjct: 515 SALANAREIYTKKKKAGAKVKKATEATDKAIALAEKNTKKTLEKQQTKRNVIYQRRKTLW 574

Query: 300 FEKFYWFISSENYLGNVAWDA 320
           FEKF+WF+++E YL     DA
Sbjct: 575 FEKFHWFLTNEKYLVVAGKDA 595



 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 38/63 (60%), Positives = 51/63 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW ++V+  A+WV ADQVSKTAP GE+LTTGSFMIRGKKN+  P +L MG++ LF++++S
Sbjct: 660 AWTSQVIAGAYWVHADQVSKTAPAGEYLTTGSFMIRGKKNYIQPSRLEMGLAILFRIDDS 719

Query: 377 SIS 379
            I 
Sbjct: 720 CIG 722


>gi|302665563|ref|XP_003024391.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
 gi|291188443|gb|EFE43780.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
          Length = 1074

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 93/290 (32%), Positives = 149/290 (51%), Gaps = 35/290 (12%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP        PS  ++     I  FE+F +AVD +FS+ E+ K++ +  ++E  A KK
Sbjct: 254 DFHPF------EPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARKK 307

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           LE+ KR+HE R++AL++ Q     KA  I  N   V+ A+ A+   +A  + W +I   +
Sbjct: 308 LESTKREHEKRVNALKEKQEFHVRKARAIETNLPQVEEAMNAVNGLVAQGMDWVEIARLI 367

Query: 172 KQAQRHNDPVASIIK-QLKLNINHITLLLR-------------------------VHKER 205
           +  Q   +PVA  IK  LKL  N IT+LL                           + + 
Sbjct: 368 EMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEDESEEEEEDDDDDGYGDD 427

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
              + S   H           +  + S+ D+DL +S +ANA++++D K+ AA K++KT++
Sbjct: 428 EYERPSQKKHSAKPLKEKKGKKDTRLSI-DIDLGISPWANARQYYDEKKIAAVKEEKTLK 486

Query: 266 STEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYWFISSENYL 313
           ++ KA+KS E+K K  LK    Q    + + R   WFEKF++FISS+ YL
Sbjct: 487 ASTKAIKSTERKVKADLKMALKQEKPVLRRTRNPTWFEKFFFFISSDGYL 536



 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 38/62 (61%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD K     WWV A QVSK   TG+ L  G FMI+G+KN  PP Q+ +G + LF++   
Sbjct: 602 AWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNR 661

Query: 377 SI 378
           S+
Sbjct: 662 SV 663


>gi|410077749|ref|XP_003956456.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
 gi|372463040|emb|CCF57321.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
          Length = 1038

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 124/449 (27%), Positives = 200/449 (44%), Gaps = 113/449 (25%)

Query: 30  QGYLVQKK-------EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE-SF 81
           +G+++ KK       + +P  E Y+  N  FHP   +P  + +   + +   I E E S+
Sbjct: 294 KGFILAKKNVNYDSLKDKPELE-YLYEN--FHP--FKPYISGA---EEKSVRILEIEGSY 345

Query: 82  AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
              +D FFST ES K  L+   QE  A KKLE+ + D++ R+ +L   Q++++ KA  I+
Sbjct: 346 NRTLDVFFSTIESLKYSLRIQNQELQAKKKLEDARSDNQKRIQSLSDVQILNETKANAIL 405

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
           NN + VD+A  A++  +  Q  W  IE  +   ++  + +A II +L LN+  N I + +
Sbjct: 406 NNTDLVDSAKQAVQDLLEQQTDWNMIEKLIMNEKKRRNKIAEII-ELPLNLKNNKINIKI 464

Query: 200 RVHK----ERSNIKDS---------------------------WVLHHDNADPFADSSET 228
            +      E     D+                            V  H+N      + + 
Sbjct: 465 PLQSPSQFEEETFSDNESVKSSLSDSDFSDESDSELSDFSMEEVVGRHENTRKIR-AKDD 523

Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQ 286
           K+   V +DL LS+YANA ++F+ K+ +A+KQ+K  +   KA+ + E +  Q LK    +
Sbjct: 524 KQHVTVTIDLSLSSYANASQYFNSKKDSAEKQKKMEKHMAKAMTNIENRIDQQLKKKLRE 583

Query: 287 TMTNINKARKVYWFEKFYWFISSENYL-----------------------------GNVA 317
           + T + K RK Y+FEK+ WFISSE YL                             G+ A
Sbjct: 584 SHTVLKKIRKPYFFEKYNWFISSEGYLVMTGKSALENDQIYMKYIEDDDIFMSTSFGSKA 643

Query: 318 W----------------------------DAKVVTNAWWVKADQVSKTAPTGEFLT-TGS 348
           W                              KVV +  W  A  ++K    G  +  TG 
Sbjct: 644 WIKNPDRGEIPPNTLMQAGIFCASSSKAWSNKVVCSPKWCYARNITKFTQDGSIVAETGE 703

Query: 349 FMI--RGKKNFFPPCQLAMGISFLFKLEE 375
           F++    K++  PP QL MGI FL+KL++
Sbjct: 704 FVLIDEQKQSTLPPAQLIMGIGFLWKLKQ 732


>gi|83033024|ref|XP_729296.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23486663|gb|EAA20861.1| strong similarity to unknown protein-related [Plasmodium yoelii
           yoelii]
          Length = 1768

 Score =  128 bits (322), Expect = 4e-27,   Method: Composition-based stats.
 Identities = 113/393 (28%), Positives = 187/393 (47%), Gaps = 73/393 (18%)

Query: 51  MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDL-KAVQQERDAL 109
           +EF P++L+           +   I +F++F   VD +FS  E  K D  + + + ++AL
Sbjct: 440 VEFSPILLKNHINKI---NEKKIEIIKFDNFNMCVDTYFSKIELTKYDKHQEMNKNKNAL 496

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
            K++ +K DHE R+  LE+   + K+K  LI  N + V  AI  +R  I+   +WE I  
Sbjct: 497 TKMDKIKLDHEKRIEGLEKEVSMLKKKILLIELNYQFVGEAIKLMRSAISTSANWEKIWD 556

Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITL-LLRVHKERSNIKDSWVLHHDNADPFADS--S 226
            +K  ++ N P+A  +K + +N N+  + LL    +  ++++S   ++   D + +    
Sbjct: 557 HIKLFKKRNHPIA--LKIMSVNFNNCEMELLLDDNDDDDVEESGDDNNLKNDKWKEKVIE 614

Query: 227 ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV- 285
           E  K   V ++L+ S + N + +  L++ A +K +K   ST  A+K  EKK K       
Sbjct: 615 EKNKTCAVTINLNNSVFGNIEDYEKLRKKAEEKIRKIKMSTNIAVKKVEKKKKDKDIKQK 674

Query: 286 ---QTMTNINKARKVYWFEKF--------YWFISSENYLGN------------------- 315
              +++  I K RK++WFEKF        Y  IS  + L N                   
Sbjct: 675 GKNKSVFQIKKIRKIFWFEKFNWFISSENYLVISGRDSLQNEILFRRYFQNNDIYVHADI 734

Query: 316 ---------------------------------VAWDAKVVTNAWWVKADQVSKTAPTGE 342
                                             AW+ K++T+AWWV   QVSKTAPTGE
Sbjct: 735 HGAASCIIKNPYKDIPIPEKTLSEAGQLAMCRSSAWNNKMITSAWWVYYHQVSKTAPTGE 794

Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
           ++ TGSF+IRGKKN+ P  +L MG+  +F++ +
Sbjct: 795 YIKTGSFVIRGKKNYLPYAKLEMGLCIIFQINK 827


>gi|302309325|ref|NP_986649.2| AGL017Wp [Ashbya gossypii ATCC 10895]
 gi|299788305|gb|AAS54473.2| AGL017Wp [Ashbya gossypii ATCC 10895]
          Length = 1006

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 120/442 (27%), Positives = 197/442 (44%), Gaps = 106/442 (23%)

Query: 31  GYLVQKK----EKRPNSEDYIVANMEFHPM--ILQPGSTPSVMRQHQGFPIQEFES-FAA 83
           GY++ ++    +++ ++E   +   +FHP    L  GS        + F I E +  +  
Sbjct: 273 GYILARENPLYDEKKDAEGIRLTYEQFHPFRPYLPDGS-------QKKFEIVEVDGDYNR 325

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
            VD+FFST +S K  L+   QE++A KKLE  K +++ ++ AL + Q  ++++   IINN
Sbjct: 326 TVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQALVEVQHTNEQRGNAIINN 385

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLRVH 202
            E V+ A  AI+  +  Q+ W  IE  +K  Q  ++ +A +IK  L L  N I++ L + 
Sbjct: 386 IELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKISVELPLS 445

Query: 203 KERSNIKD-SW----------------------------VLHHDNADPFADSSETKKPSL 233
            E     D SW                            V    + +    S  + K   
Sbjct: 446 NEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKVSNKSIT 505

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--I 291
           V +DL +SAYANA  +F++K++ AKKQ    Q+ +KA+K+ E+K ++ LK      +  +
Sbjct: 506 VSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKKKLKEQHDVL 565

Query: 292 NKARKVYWFEKFYWFISSENYL-----------------------------GNVAW---- 318
              R  Y+FEK++WFIS+E +L                             G+  W    
Sbjct: 566 QVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVSNGFGSQVWIKNF 625

Query: 319 -----------DAKVVTNA-------------WWVKADQVSKTAPT-GEFLTTGSFMIRG 353
                       A +  N+             WW  A  +SK     G  L +G+F ++ 
Sbjct: 626 ERTEIPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVGGGLLPSGAFRLKS 685

Query: 354 --KKNFFPPCQLAMGISFLFKL 373
              KNF PP QL MG +F++K+
Sbjct: 686 DEAKNFLPPAQLVMGFAFMWKI 707


>gi|408392777|gb|EKJ72097.1| hypothetical protein FPSE_07722 [Fusarium pseudograminearum CS3096]
          Length = 1078

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 96/310 (30%), Positives = 155/310 (50%), Gaps = 32/310 (10%)

Query: 28  SSQGYLVQKKEKRPNS----------EDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
           +  GY+  K+ +RP             D ++ + +FHP I      P  ++      + E
Sbjct: 262 TCTGYIFAKRRERPEGTEVDEETKTKRDNLLYD-DFHPFI------PYKLKNDPAIEVLE 314

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           FE +   VDEFFS+ E  +++ K  ++E  A +KLE  K +   R+  L++ Q ++  KA
Sbjct: 315 FEGYNETVDEFFSSLEGQRLESKLTEREATAKRKLEAAKNEQNKRIEGLQEAQSLNFRKA 374

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
             I  N E V  A+ A+   +   + W D+   V++ ++ ++PVA IIK L LN+  N I
Sbjct: 375 AAIEANVERVQEAMDAVNGLLNQGMDWVDVGKLVEREKKRHNPVADIIK-LPLNLAENLI 433

Query: 196 T----------LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
           T               ++   +  D   L  D     A   +T K   V+++L  S ++N
Sbjct: 434 TLELAEEEFEPEEDDPYETDDDDDDDSALGDDEGTSAAKGKQTNKALNVEINLGFSPWSN 493

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKF 303
           A+ +FD +++AA K++KT Q   +ALK+AE+K  + LK    Q    +   RK  WFEKF
Sbjct: 494 AREYFDQRKTAAVKEEKTQQQASRALKNAEQKITEDLKKGLKQEKALLQPIRKQMWFEKF 553

Query: 304 YWFISSENYL 313
            WFISS+ YL
Sbjct: 554 TWFISSDGYL 563



 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 40/61 (65%), Positives = 46/61 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K    AWWV ADQVSK+APTGEFL  GSFMIRGKKNF PP QL +G+   F++ E 
Sbjct: 629 AWDSKAGMPAWWVNADQVSKSAPTGEFLQAGSFMIRGKKNFLPPAQLLLGLGLAFRISEE 688

Query: 377 S 377
           S
Sbjct: 689 S 689


>gi|146419620|ref|XP_001485771.1| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 873

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 124/455 (27%), Positives = 193/455 (42%), Gaps = 122/455 (26%)

Query: 21  LVSNLNVSSQGYLVQKK----EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQ 76
           LVS+L     G +V KK    E   +++D      EFHP   +P          + F   
Sbjct: 127 LVSSLGTEVTGIIVSKKNPAFEPSDDNKDLEYLYDEFHP--FKP-----YKENLELFKFT 179

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           E   +   VD FFST +S K +L+  QQ+ +A K+L N + + + ++  L   Q ++ +K
Sbjct: 180 EIRGYNKTVDTFFSTLDSKKHELRMEQQKHNAKKRLLNAREERDKQIDNLRIQQEMNSKK 239

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
            + II + + V   I +++  +  Q+ W +IE+ +K  Q   + VA  IK L LN+    
Sbjct: 240 GDAIIYHADLVSECIASVQTLLDQQMDWANIESLIKLEQSRGNSVAKTIK-LPLNLTENK 298

Query: 197 LLLRVHKERSNIKDSWVLHHDNAD--------------------------------PFAD 224
           + L++    S         +D AD                                P   
Sbjct: 299 IGLKLPDTDS--------MYDPADIDSELDSETSSESETESESESESGSESEDETPPKRM 350

Query: 225 SSETKK---PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
           S + K    P+L V +DL LS +ANA+ +F+ K+ A  KQ+K  ++T+ AL++A+KK +Q
Sbjct: 351 SKKAKSKEIPALSVWIDLLLSPFANARTYFESKKQAESKQEKVEKNTDMALRNAQKKIEQ 410

Query: 281 TLKDVQTMTNIN----KARKVYWFEKFYWFISSENYL----------------------- 313
            L   + + N N    + R  YWFEKF+WF+SSE YL                       
Sbjct: 411 DL--AKNLKNENETLRQVRPKYWFEKFFWFVSSEGYLCIAGRDDAQVDMIYYRHFSDNDF 468

Query: 314 ---GNVAWDAKVVTN-------------------------AW--------WVKA-DQVSK 336
               ++    KVV                           AW        W  A + V+K
Sbjct: 469 FVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAMSASAAWNGKITTSPWFLAGNDVTK 528

Query: 337 TAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
               G  + +G+F  +GKK F PP QL MG+ F F
Sbjct: 529 LDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYF 563


>gi|145351275|ref|XP_001420008.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580241|gb|ABO98301.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1069

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/267 (36%), Positives = 148/267 (55%), Gaps = 22/267 (8%)

Query: 63  TPSVMRQHQGFPIQEFE---------SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
           +P  ++Q++G P ++FE         +F  AVDE+F   E+    L   + E  AL KLE
Sbjct: 307 SPFSLKQNEGRPTRKFELPKGLDPVCAFDHAVDEYFIALEAQSQILARRKAEAQALAKLE 366

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
              +D ++R+  LE+ +  ++++A LI  N E+VD AI A+   +A+ +SW ++EA + +
Sbjct: 367 KSLKDQKSRVEQLEREREKEEQRAVLIEYNHEAVDTAIDAVNSALASGMSWPELEAMINE 426

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
            +R  +PVA +IK L L  N IT+ L      +N       H D  D    +S  +K   
Sbjct: 427 ERRLGNPVAGMIKSLDLANNQITITL------AN-------HLDEVDEVDAASGKRKRVA 473

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V VDL LSA+ANA   F  K+  A+K  KT+ +  KA+ +AE K K  ++     ++I +
Sbjct: 474 VGVDLGLSAHANASMRFAAKKKHAEKFSKTVDAQSKAVAAAEAKAKAAMEKAANGSSIAR 533

Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDA 320
           AR+  WFEKF WFI+SEN L   A DA
Sbjct: 534 ARQPLWFEKFNWFITSENCLVLQAKDA 560



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 35/61 (57%), Gaps = 2/61 (3%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPT-GEFLTTG-SFMIRGKKNFFPPCQLAMGISFLFKLE 374
           AW+++ V +AWW  +++VSK +P  G+ L  G + +    K F P  QL MG   +F + 
Sbjct: 619 AWNSRAVKSAWWTSSERVSKISPVAGDALPPGVTHVAHADKQFLPHAQLVMGFGLMFVVS 678

Query: 375 E 375
           E
Sbjct: 679 E 679


>gi|345565416|gb|EGX48366.1| hypothetical protein AOL_s00080g336 [Arthrobotrys oligospora ATCC
           24927]
          Length = 1207

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 172/382 (45%), Gaps = 73/382 (19%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +FHP +      P+  +           ++   VD FFS+ ES K++ K  +++  A K+
Sbjct: 385 DFHPFL------PTQFKDKPNVHTIPITTYNKTVDSFFSSIESQKLEQKTAEKKSLAAKR 438

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
           L N + +H+ ++ +L+  Q V   KA+ I  N E V+  I A+   IA  + W +I + V
Sbjct: 439 LANARNEHKNKIESLKSAQEVHVRKAQAIEANVERVEEVIDAVNGLIAQGMDWTEIRSLV 498

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           ++ +   + VA +I+ +K   N  T+++R+++E              ++      +  + 
Sbjct: 499 EREKSAGNGVAEMIRDVKFMEN--TVVVRLYEEEEEDDSDDDDDESGSEDGNGEEKEGRS 556

Query: 232 SL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM-- 288
            L +++DL L+ YANA+ +++ KRSAA K+ KT+QS+ KALKS EKK ++ LK       
Sbjct: 557 HLDIEIDLALTGYANARIYYEQKRSAAVKETKTLQSSAKALKSTEKKIQKDLKQAYKAEK 616

Query: 289 TNINKARKVYWFEKFYWFISSENYL--------------------GNVAWDAKVVTNAWW 328
             +   R+  W+EKFYWF SSE YL                    G+V   A+V  +   
Sbjct: 617 MELRTLRRQGWWEKFYWFRSSEGYLVLGAKDPTQADMLYKKYFKKGDVWVHAEVPGSCHV 676

Query: 329 V---KADQVSKTAPTGEFLTTGS------------------------------------- 348
           V   K + V+   P G     GS                                     
Sbjct: 677 VVKNKVEDVNSPIPPGTLSQAGSLAVASSDAWEKKMVISAWWAGYEQVGKIGAGGIVLGT 736

Query: 349 --FMIRGKKNFFPPCQLAMGIS 368
             F+++G+K + PP  L MG +
Sbjct: 737 GEFVVKGEKKWLPPAMLVMGFA 758


>gi|50312521|ref|XP_456296.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49645432|emb|CAG99004.1| KLLA0F27335p [Kluyveromyces lactis]
          Length = 1027

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 124/451 (27%), Positives = 195/451 (43%), Gaps = 117/451 (25%)

Query: 31  GYLVQKKEK-------RPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAA 83
           GY+V  K K       +P  E Y  +N  FHP     G +  V     G        +  
Sbjct: 273 GYIVAHKNKLYDPLRDKPELE-YTFSN--FHPFKPFVGDSTDVKIIEIG------GMYNN 323

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
            VD FFST ES+K   +   Q+  A KKL+  K ++ET + +L   Q  ++EK  ++I N
Sbjct: 324 TVDTFFSTIESNKYASRIQNQDFQAQKKLDEAKNNNETIIKSLLHAQQTNEEKGNILIAN 383

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHK 203
              V+ A  A++  +  Q+ W+ +E  +   QR  + +A IIK L +++ +  + + + K
Sbjct: 384 ANLVEEAKNAVKSLLDQQMDWQSMETLIANEQRKGNKIARIIK-LPMDLPNNKITIELPK 442

Query: 204 ERSNIKDSWVLHHDNADPFAD--------------------------------------S 225
           +  +  DS   H   AD  ++                                      +
Sbjct: 443 DGYSEDDSTEHHQSEADYSSNESDVNQSDSSVSSDYSDSDFEELTSSKSKQQSRRKSKIT 502

Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KD 284
           SE ++  L+ VDL LSA+ANA  +F+ K++ ++KQ+K  ++ EKALKS ++K ++ L K 
Sbjct: 503 SEKRETVLLTVDLSLSAFANASSYFNAKKATSEKQKKVEKNAEKALKSIQQKIEKDLQKK 562

Query: 285 VQTMTNINKA-RKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNA-- 326
            +   +I KA R  Y+FEK+YWFISSE++L                 V  D  +VTNA  
Sbjct: 563 SKESHDILKAIRTPYFFEKYYWFISSESFLVLMGKSPVETDQLYAKYVNDDDIMVTNAFD 622

Query: 327 ----------------------------------------WWVKADQVSKTAPT-GEFLT 345
                                                   WW  A  V+K     G  L 
Sbjct: 623 VKAWILNPQKTEVPPNTLMQAGTFANSASDAWSKKIASSPWWCFAKNVTKFDDIDGSVLP 682

Query: 346 TGSFMIR--GKKNFFPPCQLAMGISFLFKLE 374
            GSF ++    KN  PP QL MG+  ++K++
Sbjct: 683 VGSFRMKQPKAKNMLPPAQLVMGLGLVWKVK 713


>gi|348681953|gb|EGZ21769.1| hypothetical protein PHYSODRAFT_557667 [Phytophthora sojae]
          Length = 1063

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 88/261 (33%), Positives = 145/261 (55%), Gaps = 12/261 (4%)

Query: 63  TPSVMRQHQGF--PIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP +  QHQ     ++ F++F  AVDE+FS  E+   ++     +  A  KL  +K++ +
Sbjct: 346 TPFLYAQHQQAHKKVKSFDTFDEAVDEYFSRFEADTAEVAKQSAQLAAENKLAKLKKNQQ 405

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            +L+ L + Q    + A+LI  NQ+ V+  +L IR  +A+ + W  +E  V+  Q++ +P
Sbjct: 406 QQLAQLREVQEQSFQHAQLIEANQQDVENVLLVIRSALASGMDWRGLEELVRYEQKNGNP 465

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VAS+I +L L  N + +LL   ++    +D                E K+  ++ +DL L
Sbjct: 466 VASLIHKLDLEHNRVAILLCDEEDDDEGEDGGD---------GTGEEDKQAHVIWIDLSL 516

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-INKARKVYW 299
           SA ANA+  +  K+ A +K +K  ++T+KA+  AEK TK+TL+  QT  N I + RK  W
Sbjct: 517 SALANAREIYTKKKKAGEKVKKATEATDKAIALAEKNTKKTLEKQQTKRNVIYQRRKTLW 576

Query: 300 FEKFYWFISSENYLGNVAWDA 320
           FEKF+WF+++E YL     DA
Sbjct: 577 FEKFHWFLTNEKYLVVAGKDA 597



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 41/63 (65%), Positives = 53/63 (84%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW ++V+  A+WV ADQVSKTAP GE+LTTGSFMIRGKKN+  P +L MG++ LF+++ES
Sbjct: 662 AWTSQVIAGAYWVHADQVSKTAPAGEYLTTGSFMIRGKKNYIQPSRLEMGLAVLFRIDES 721

Query: 377 SIS 379
           SIS
Sbjct: 722 SIS 724


>gi|401842736|gb|EJT44818.1| TAE2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1032

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 159/311 (51%), Gaps = 34/311 (10%)

Query: 30  QGYLVQKKEKRPNS----EDYIVANMEFHPMILQPGSTPSVMRQHQG-FPIQEFES-FAA 83
           +GY++ KK +  NS     D       FHP        P +  ++ G F I + E  +  
Sbjct: 281 KGYILAKKNENFNSIKDPADLEFIYDTFHPF------RPYINEKNSGSFRIADVEGPYNK 334

Query: 84  AVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN 143
            +D+FFST ES K  L+   QE  A KK+++ + +++ ++ AL   Q +++ K  LII N
Sbjct: 335 TLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLNVQELNERKGHLIIEN 394

Query: 144 QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLLRV 201
              ++   LA++  +  Q+ W  IE  +K  Q+  + +A ++  L LN+  N I++ L +
Sbjct: 395 ASLIEEVKLAVQGLVDQQMDWSTIEKLIKSEQKKGNKIAQLL-NLPLNLKQNKISVKLDI 453

Query: 202 HKERSNI------------KDSWVLHHDNADPFADSSETKKPSL-----VDVDLDLSAYA 244
            +E  +I                     +   F + +  KK        V +DL LSAYA
Sbjct: 454 SREEESITSSDEDDESEDSSSEGSSDSGDMSTFKEENSKKKGQSNNALNVTIDLGLSAYA 513

Query: 245 NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD--VQTMTNINKARKVYWFEK 302
           NA  +F++K+++A+KQ+K  ++  KA+K+ E K  Q LK    ++ + + K R  Y+FEK
Sbjct: 514 NASEYFNIKKTSAEKQKKVEKNVGKAMKNIEVKIDQQLKRKLKESHSVLKKVRTPYFFEK 573

Query: 303 FYWFISSENYL 313
           + WFISSE +L
Sbjct: 574 YNWFISSEGFL 584


>gi|367011407|ref|XP_003680204.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
 gi|359747863|emb|CCE90993.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
          Length = 1016

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 103/329 (31%), Positives = 166/329 (50%), Gaps = 50/329 (15%)

Query: 26  NVSSQGYLVQKKEKR----PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFP-IQEFES 80
           N  S+G+++ KK        +SED       FHP        P +  Q QG   I E   
Sbjct: 276 NKDSRGFILAKKNVNYTLGKDSEDLEFIFENFHPF------KPFIEEQDQGRSRITEVPG 329

Query: 81  -FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
            +   +D FFST ES K  L+  QQE+ A KK+E+ + +++ R+ AL   Q  +++K   
Sbjct: 330 EYNKTLDTFFSTIESSKYALRIQQQEQLAKKKIEDARLENQKRIQALLDVQSSNEQKGHA 389

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITL 197
           II N + V+ A +A++  I  Q+ W+ IE  ++  Q   + +A +I  L LN+  N + +
Sbjct: 390 IIANADLVEEAKIAVQGLIDQQMDWQTIEKLIRNEQLKKNKIAMVI-DLPLNLKENAVNI 448

Query: 198 LLRV-----HKERSNIKDSWVLHH----------DNADPFADSSETKKPS---------- 232
           L+ V     H   S   +S+V             D+ D      ET++            
Sbjct: 449 LVPVSHDDEHNNESESDESFVESSSDESDSDEGTDSDDSEVSDFETEESRNESRTSKRKV 508

Query: 233 ----LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE----KKTKQTLKD 284
                + +DL LSAYANA ++F +K+++A KQ+K  ++ EKA+K+ E    K+ KQ LK+
Sbjct: 509 ENKLKIRIDLGLSAYANASKYFTVKKTSADKQKKVEKNVEKAMKNIEQRIDKQLKQKLKE 568

Query: 285 VQTMTNINKARKVYWFEKFYWFISSENYL 313
             ++  + +AR  Y+FEK +WF SSE +L
Sbjct: 569 SHSV--LKRARSPYFFEKHFWFYSSEGFL 595


>gi|325185450|emb|CCA19934.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1061

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 91/314 (28%), Positives = 164/314 (52%), Gaps = 42/314 (13%)

Query: 33  LVQKKEKRPNSEDYIVANMEFHPMILQPGS--------TPSVMRQHQGFPIQEFESFAAA 84
           ++ + EK+P     I+ N        Q G+        TP +  QH+      F++F   
Sbjct: 299 ILSESEKKPKGGYIILGNA-------QTGTKTDTYESFTPVLYAQHREKAYVSFDTFDQT 351

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           VDE+FS  E+ K    +   ++ A  KLE ++++   +L  L  ++ ++ + A+LI  +Q
Sbjct: 352 VDEYFSYHEARKTQTGSQAAQQAASSKLEKMRKNQIQQLDELHHSEEINLKHAQLIELHQ 411

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
             ++  +  IR  +A+ + W+ ++  VK  Q + +PVAS+I +  L+ N +++LL     
Sbjct: 412 LDIEKVLSVIRSALASGMDWKALKDLVKYEQTNANPVASMIHEFDLSKNRVSVLL----- 466

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSL--VDVDLDLSAYANAKRFFDLKRSAAKKQQK 262
                        + DP+ + +E   P++  + +DL LSA  NA   +  K+++A+K +K
Sbjct: 467 -------------SDDPYFEDAE---PAVHAIWLDLSLSALGNAAELYAKKKTSAEKAKK 510

Query: 263 TIQSTEKALKSAEKKTKQTLK-DVQTMTNINKARKVYWFEKFYWFISSENYL---GNVAW 318
              +TEKA+K A  KT++ +K  +   T I++ RK +WFEKF+WF+SSEN L   G  A 
Sbjct: 511 AEVATEKAIKLAASKTEKFMKTQLIKPTPIHQRRKTFWFEKFHWFLSSENILVISGKDAQ 570

Query: 319 DAKVVTNAWWVKAD 332
             +++ N +  K D
Sbjct: 571 QNELLVNRYVRKND 584



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 29/60 (48%), Positives = 41/60 (68%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW  +V+T A+WVKA+ VSK+  +GE L  G+F+I GKKNF    +L MG++ L+  EE 
Sbjct: 631 AWKNQVITGAYWVKAECVSKSTSSGELLPPGTFLILGKKNFLQALRLEMGLAILYHTEEG 690


>gi|358339725|dbj|GAA47729.1| nuclear export mediator factor NEMF [Clonorchis sinensis]
          Length = 449

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/262 (35%), Positives = 135/262 (51%), Gaps = 30/262 (11%)

Query: 21  LVSNLNVS---SQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
           LV   NV    S+GY+  KK  +P  E+ +    +F P +          R H  FP   
Sbjct: 205 LVGTDNVEMGMSRGYIFGKK-LQPEDEE-LSRQEDFQPFLFDQFRN----RPHVAFP--- 255

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
             +F+ AVD +FS  E  K     VQ E  A KK EN+K+DHE RL+AL+  Q  D  KA
Sbjct: 256 --TFSKAVDTYFSKIERDKTTELLVQNENKANKKFENIKKDHELRLAALKADQEQDVHKA 313

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           +L+  N++ VD  IL I   ++NQL W  ++  +++A+   D +AS I QL L  N IT+
Sbjct: 314 QLLEKNRQLVDNIILMINHALSNQLDWGTLDTMIQEARARGDLLASHIVQLNLQQNQITV 373

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKK---------PSLVDVDLDLSAYANAKR 248
            L+          S  L     DPF   SE +          P+ V + LDL+A  NA++
Sbjct: 374 SLKYGF-------SLYLLIMPRDPFESESEGENCERDQTISAPTEVVISLDLNALNNARK 426

Query: 249 FFDLKRSAAKKQQKTIQSTEKA 270
           ++D KR+A KK++KT+ ++ K 
Sbjct: 427 YYDRKRAALKKEEKTLIASRKV 448


>gi|297736754|emb|CBI25955.3| unnamed protein product [Vitis vinifera]
          Length = 712

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 88/260 (33%), Positives = 134/260 (51%), Gaps = 36/260 (13%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P ++ Q +     +FE+F AA DEF+S  E  + +    QQ+          K   E R
Sbjct: 314 CPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSE----QQQ----------KAKEENR 359

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L++      + AELI  N E VDAAILA+R  +AN ++WED+   VK+ ++  +PVA
Sbjct: 360 VHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEKKKSGNPVA 419

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I +L L  N +TLLL                 +N D   D  +T     V+VDL LSA
Sbjct: 420 GLIDKLYLERNCMTLLLS----------------NNLDEMDDDEKTLHVDKVEVDLALSA 463

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           +ANA+++++ K+    K++KTI + EK LK  +++   +      +   N      WFEK
Sbjct: 464 HANARQWYEQKKRQENKREKTIIAHEKLLKLLKRRLASSFHSYWPLVLFN------WFEK 517

Query: 303 FYWFISSENYLGNVAWDAKV 322
           F WFISS+NY      DA++
Sbjct: 518 FNWFISSKNYFVISGRDAQL 537



 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 40/60 (66%), Positives = 46/60 (76%)

Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           WD+K+VT+AWWV   QVSKTAPTGE+LT GSFMI GKKNF PP  L MG   LF L+E +
Sbjct: 643 WDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIHGKKNFLPPHPLMMGFGLLFCLDERA 702


>gi|444317477|ref|XP_004179396.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
 gi|387512437|emb|CCH59877.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
          Length = 1053

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 125/464 (26%), Positives = 196/464 (42%), Gaps = 133/464 (28%)

Query: 30  QGYLVQKKEKRPNSEDYIVA----NME--------FHPMILQPGSTPSVMRQHQGFPIQE 77
           +GY+V KK   PN   Y++     ++E        F P I +   T S +    G     
Sbjct: 273 RGYIVAKK--NPN---YVIGRDADDLEYVYETFNPFEPFIDETHRTNSKIIIVDG----- 322

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
              +   +D+FF+T ES K  LK   QE  A KK+E+   +++ R+ AL   Q  +++K 
Sbjct: 323 --PYNLTLDKFFTTIESSKYALKIQTQEEQAKKKIEDAHLENKKRIDALINVQTSNEQKG 380

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ-RHNDPVASIIKQLKLNINHIT 196
             II N E ++    A++  +  Q+ W  IE  +K  Q R N+   +II  L L  N I 
Sbjct: 381 YAIIANTELIETTKYAVQGLVDQQMDWNTIEKLIKNEQVRGNEVAENIILPLNLKENTIN 440

Query: 197 LLLRVH-------KERSNIKDSWVLHHDNADPFADSSE---------------------- 227
           ++L +           S  +D +    DN     ++S+                      
Sbjct: 441 MILPLKSETSSIENSSSEEQDEYCSESDNEPANENTSDEESDISVEQDVSDFVEVTTIGN 500

Query: 228 ----TKKPS-----------LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
               +KK             +V +DL LSAYANA R+FD K+  A+KQ++  ++ EKA+K
Sbjct: 501 SPLISKKSKHKRLQNNENSIIVSIDLSLSAYANASRYFDTKKKTAEKQKRVEENAEKAMK 560

Query: 273 SAEKKTKQTL--KDVQTMTNINKARKVYWFEKFYWFISSEN------------------- 311
           + E+  + +L  K  ++   + K RK Y+FEK++WFISSE                    
Sbjct: 561 NIEQGIETSLQRKLKESHEVLKKIRKPYFFEKYHWFISSEKILVLMGKSSTETDQIYSKY 620

Query: 312 ------YLGNVAWDAKV------------------------VTNAW---------WVKAD 332
                 Y+ N ++D +V                         + AW         W KA 
Sbjct: 621 IEDDDIYMSN-SFDTQVWIKNPEKIEISPNTLMQAGVFCMSSSEAWSKKIASSPWWCKAK 679

Query: 333 QVSKTAPTGE-FLTTGSFMIR--GKKNFFPPCQLAMGISFLFKL 373
            VSK    G   L  G F+++   +K+  PP QL MGI  L+K+
Sbjct: 680 NVSKFDKEGNTCLEPGKFILKNENEKHSLPPAQLVMGIGLLWKV 723


>gi|340505619|gb|EGR31934.1| hypothetical protein IMG5_099620 [Ichthyophthirius multifiliis]
          Length = 1423

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 156/316 (49%), Gaps = 60/316 (18%)

Query: 30  QGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFF 89
           QGYLV  ++++ N         EF P+IL           +QG  I++ ESF   ++++F
Sbjct: 714 QGYLVISEKEKANY-------FEFSPLILNS---------YQGKQIEQMESFNDCINKYF 757

Query: 90  S-----TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
                   E  K D++++     A KK  N+K D E R+  L+  Q     KA+LI  N 
Sbjct: 758 QKMSKKIEEEQKEDVESI-----AWKKYLNIKTDQENRIKKLKDEQEEFITKAQLIEENY 812

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKE 204
           + V+A    ++   ++ L+W+ I   + + ++  DP+A++I Q+    N +++ L     
Sbjct: 813 QDVEAITNILKTMKSSGLAWDKIIKMINEGKKQGDPLANLIHQIDFENNEVSIYL----- 867

Query: 205 RSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
                            F D   ++    V VD+  SA+ NA+ +++ KR    K++KT+
Sbjct: 868 ----------------GFIDDQMSELIP-VSVDIYQSAHQNARNYYENKRKNVLKEKKTL 910

Query: 265 QSTEKALKSAEKKTKQTLKDVQ-----TMTNINKARKVYWFEKFYWFISSENYL---GNV 316
            +T+ ALK AE   K  LK+++     TM  +N  RK YWFEKFYWFI+SENYL   G  
Sbjct: 911 DATKTALKQAE---KTALKEIETQKHKTMQLVN-VRKQYWFEKFYWFITSENYLVISGRD 966

Query: 317 AWDAKVVTNAWWVKAD 332
           +   +++   +  K D
Sbjct: 967 SQQNEILVKKYMKKGD 982



 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 48/64 (75%)

Query: 317  AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
            AW+AK++ +AWWV + QVSK A TGE+L +GSFMIRGKKNF  P ++ M  + LFKL + 
Sbjct: 1025 AWEAKIIASAWWVDSHQVSKRAETGEYLPSGSFMIRGKKNFVYPSRMEMACTILFKLNDD 1084

Query: 377  SISR 380
            S+ R
Sbjct: 1085 SLER 1088


>gi|254566655|ref|XP_002490438.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238030234|emb|CAY68157.1| hypothetical protein PAS_chr1-4_0316 [Komagataella pastoris GS115]
 gi|328350832|emb|CCA37232.1| Uncharacterized protein YPL009C [Komagataella pastoris CBS 7435]
          Length = 1007

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 122/446 (27%), Positives = 187/446 (41%), Gaps = 124/446 (27%)

Query: 31  GYLVQKKEK--RPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           GY+V K+     P++E   +YI    EFHP   +P    S   Q     I+  + +   +
Sbjct: 271 GYIVSKRNPIHDPSTEGSLEYIYD--EFHPY--EPTHKRSSDTQ-----IKTIKGYNKTI 321

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           D+FF+T E  K  LK  QQ+ +A ++L++VK ++  +++ L + QL++ +K E+I+   +
Sbjct: 322 DDFFTTIEVSKHSLKEQQQKVNAERRLQSVKSENLEKIAKLTEAQLLNIQKGEVIMVYSD 381

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLLRV-- 201
            V+    A++  +  Q+ W  IE  +   ++  + +A +I  L LN+  N I+L L +  
Sbjct: 382 VVEQCKAAVQSLLDQQMDWNHIEKLIGVEKKRGNEIAKLI-NLPLNLLENKISLALPLVN 440

Query: 202 --------------------------HKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
                                      +E  N K S   H    D         K   V+
Sbjct: 441 FDESSEEEDESDSEDESDSEDSSSSDEQETKNKKQSSTKHSRKKD---------KTINVN 491

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK-----DVQTMTN 290
           +DL LSAYANA  +FD K+ A  K  KT +++E A+KSAE K  + LK     +   + N
Sbjct: 492 IDLSLSAYANASTYFDAKKIAQDKLVKTEKNSELAIKSAESKINRDLKKQKKTESSQVNN 551

Query: 291 INKA----RKVYWFEKFYWFISSENYLGNVAWD--------------------------- 319
            N A    R  +WFEK++WFISS+ +L     D                           
Sbjct: 552 SNAALRQIRDKFWFEKYFWFISSDGFLCVAGRDDQQFDHIYFEYFDNDNDFLVSNELEGA 611

Query: 320 AKVV-------------------------TNAW--------W-VKADQVSKTAPTGEFLT 345
            KV+                         T AW        W V    VSK    G  L 
Sbjct: 612 LKVIVKNPFLNKDVAPNTFIQAGAFSLSTTKAWENKMVSSPWIVTGSSVSKRDVDGSALA 671

Query: 346 TGSFMIRGKKNFFPPCQLAMGISFLF 371
            G   I  +K F PPCQ+ MG   L+
Sbjct: 672 PGLVNITTEKQFLPPCQMVMGFGMLW 697


>gi|68475252|ref|XP_718344.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
 gi|68475451|ref|XP_718248.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
 gi|46440007|gb|EAK99318.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
 gi|46440107|gb|EAK99417.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
          Length = 1018

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 120/459 (26%), Positives = 191/459 (41%), Gaps = 107/459 (23%)

Query: 19  HVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           ++ + N  ++++GY+V KK  + + +   +YI    EFHP   +P   P+     +G   
Sbjct: 257 YIDLINGEIATEGYIVAKKNNKVSEKSDLEYIYD--EFHP--FEP-YKPN----QEGIKF 307

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
                +   +D+FFS  ES K  +K  QQ+ +A K+LE  + + + ++ +L   Q ++ +
Sbjct: 308 ISVSGYNKTLDKFFSNIESTKFSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAK 367

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
           K ELI  + E V+     ++  I  Q+ W +IE  +   Q+  + +A  I QL LN+  N
Sbjct: 368 KGELIQYHSELVEECRSYVQSFIDQQMDWTNIETVISLEQKKKNELAQHI-QLPLNLKEN 426

Query: 194 HITLLLRVHKERSNIKDS-------------------WVLHHDNADPFADSSETKKPSLV 234
            I +LL    +   I +S                        DN +       T++ +  
Sbjct: 427 KIKVLLEDFDDYEEITESASATETGSETETESESESSSESESDNDEDKIPVKRTQRKTNT 486

Query: 235 D-----------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
                       +DL  SA+ANA+ +FD K++A  KQ K   ST  ALK+AE+K  Q L 
Sbjct: 487 KEKPKRKTIPTWIDLSQSAFANARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLT 546

Query: 284 DVQTMTN--INKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------VKAD 332
                 N  + + R  YWFEKF+WF+SSE YL     DA      ++         V AD
Sbjct: 547 RSLKQENDTLKEIRPKYWFEKFFWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSAD 606

Query: 333 QVS------KTAPTGEFL-----------------------TTGSFMIRG---------- 353
                    K    GE L                       TT ++++ G          
Sbjct: 607 MEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDG 666

Query: 354 ------------KKNFFPPCQLAMGISFLFKLEESSISR 380
                       +K + PP QL MG  F   L++ S  R
Sbjct: 667 SIVPEGEFNYLVQKEYLPPAQLIMGFGFYCLLDDESTKR 705


>gi|349581807|dbj|GAA26964.1| K7_Ypl009cp [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1027

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/441 (25%), Positives = 197/441 (44%), Gaps = 104/441 (23%)

Query: 30  QGYLVQKKEKRPNSEDYIVANMEF--------HPMILQPGSTPSVMRQHQGFPIQEFESF 81
           +GY++ K+ +  NSE    A++EF         P I    S  S + + +G        +
Sbjct: 282 KGYILAKRNENYNSEKD-TADLEFIYDTFHPFKPYINGGDSDSSCIIEVEG-------PY 333

Query: 82  AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
              +D+FFST ES K  L+   QE  A KK+++ + +++ ++ AL   Q +++ K  LII
Sbjct: 334 NRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLII 393

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
            N   ++   LA++  I  Q+ W  IE  +K  Q+  + +A ++  L LN+  N I++ L
Sbjct: 394 ENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKISVKL 452

Query: 200 RVHKERSNIKD----------SWVLHHDNADPFADSSETKKPSL---------VDVDLDL 240
            +  +  N             +      +++    S E    S+         V +DL L
Sbjct: 453 DLSSKELNTSSDEDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGL 512

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNINKARK 296
           SAYANA  +F++K+++A+KQ+K  ++  KA+K+ E K     K+ LKD  ++  + K R 
Sbjct: 513 SAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV--LKKIRT 570

Query: 297 VYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WVKADQVSKT 337
            Y+FEK+ WFISSE +L                 +  D   ++N++    W+K  + ++ 
Sbjct: 571 PYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWIKNPERTEV 630

Query: 338 APT---------------------------------------GEFLTTGSFMIRGK--KN 356
            P                                           L  G+F ++ +  +N
Sbjct: 631 PPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQN 690

Query: 357 FFPPCQLAMGISFLFKLEESS 377
             PP QL MG  FL+K++ S 
Sbjct: 691 HLPPAQLVMGFGFLWKVKTSG 711


>gi|406604691|emb|CCH43887.1| putative RNA-binding protein [Wickerhamomyces ciferrii]
          Length = 983

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/314 (32%), Positives = 158/314 (50%), Gaps = 35/314 (11%)

Query: 25  LNVS-SQGYLVQKK----EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGF-PIQEF 78
           LNV  + G++V KK    +   + ED      +FHP        P  +++ +   PIQ  
Sbjct: 253 LNVKPTNGHVVLKKNPLYDSEKDEEDLKYLYEQFHPF------EPINLKEDEELVPIQ-- 304

Query: 79  ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAE 138
             +   +D+FFST ES K  L+   QE  A K+L+  + D + ++  L   Q V+  K E
Sbjct: 305 -GYNKTLDKFFSTIESSKYALRIQNQENQAKKRLQQARDDKQQQVQRLLDVQAVNTLKGE 363

Query: 139 LIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHIT 196
            II N E V+ A  A++  +  Q+ W+ +E  +   +   + VA +I  L LN+  N I+
Sbjct: 364 TIIFNAEIVEEAKAAVQALLDQQMDWKTMEKLINVEKAKGNRVAKVI-NLPLNLKENKIS 422

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADS----------SETKKPSL-----VDVDLDLS 241
           L L      +N +D      ++                 S+ KK ++     V +DL LS
Sbjct: 423 LSLSTEDPYANDEDEDESSSESEPESESDSDEDEPKPVKSQAKKDNVKNTINVTIDLTLS 482

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--INKARKVYW 299
           +YANA  +F++K+S  +KQ+K  QS  KALK+ E+K ++ LK      N  + K R  Y+
Sbjct: 483 SYANASEYFNVKKSTVEKQKKVEQSATKALKNIEQKIEKDLKKNLKQENDILRKLRNPYF 542

Query: 300 FEKFYWFISSENYL 313
           FEKF WFIS+ENYL
Sbjct: 543 FEKFNWFISNENYL 556



 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 27/57 (47%), Positives = 37/57 (64%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AW+ K+VT++WW+ A  V+K    G  L  GSF    +KNF PP QL MG +FL+K+
Sbjct: 619 AWENKMVTSSWWLYASDVTKKDIDGTILNAGSFRYLKEKNFLPPSQLVMGFAFLWKV 675


>gi|3859683|emb|CAA22020.1| conserved hypothetical protein [Candida albicans]
          Length = 1018

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 119/459 (25%), Positives = 191/459 (41%), Gaps = 107/459 (23%)

Query: 19  HVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           ++ + N  ++++GY+V KK  + + +   +YI    EFHP   +P   P+     +G   
Sbjct: 257 YIDLINGAIATEGYIVAKKNNKVSEKSDLEYIYD--EFHP--FEP-YKPN----QEGIKF 307

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
                +   +D+FFS  ES K+ +K  QQ+ +A K+LE  + + + ++ +L   Q ++ +
Sbjct: 308 ISVSGYNKTLDKFFSNIESTKLSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAK 367

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
           K ELI  + E V+     ++  I  Q+ W +IE  +   Q+  + +A  I QL LN+  N
Sbjct: 368 KGELIQYHSELVEECRSYVQSFIDQQMDWTNIETVISLEQKKKNELAQHI-QLPLNLKEN 426

Query: 194 HITLLLRVHKERSNIKDS-------------------WVLHHDNADPFADSSETKKPSLV 234
            I +LL    +     +S                        DN +       T++ +  
Sbjct: 427 KIKVLLEDFDDYEESTESASATETGSETETESESESSSESESDNDEDKIPVKRTQRKTNT 486

Query: 235 D-----------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
                       +DL  SA+ANA+ +FD K++A  KQ K   ST  ALK+AE+K  Q L 
Sbjct: 487 KEKPKRKTIPTWIDLSQSAFANARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLT 546

Query: 284 DVQTMTN--INKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------VKAD 332
                 N  + + R  YWFEKF+WF+SSE YL     DA      ++         V AD
Sbjct: 547 RSLKQENDTLKEIRPKYWFEKFFWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSAD 606

Query: 333 QVS------KTAPTGEFL-----------------------TTGSFMIRG---------- 353
                    K    GE L                       TT ++++ G          
Sbjct: 607 MEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDG 666

Query: 354 ------------KKNFFPPCQLAMGISFLFKLEESSISR 380
                       +K + PP QL MG  F   L++ S  R
Sbjct: 667 SIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDDESTKR 705


>gi|392296002|gb|EIW07105.1| Tae2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1036

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 116/445 (26%), Positives = 199/445 (44%), Gaps = 112/445 (25%)

Query: 30  QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
           +GY++ K+     +E+YI     A++EF     HP    +  G T S         I E 
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329

Query: 79  ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           E  +   +D+FFST ES K  L+   QE  A KK+++ + +++ ++ AL   Q +++ K 
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
            LII N   ++   LA++  I  Q+ W  IE  +K  Q+  + +A ++  L LN+  N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448

Query: 196 TLLLRVHKERSNIKD----------SWVLHHDNADPFADSSETKKPSL---------VDV 236
           ++ L +  +  N             +      +++    S E    S+         V +
Sbjct: 449 SVKLDLSSKELNTSSDEDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTI 508

Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTMTNIN 292
           DL LSAYANA  +F++K+++A+KQ+K  ++  KA+K+ E K     K+ LKD  ++  + 
Sbjct: 509 DLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV--LK 566

Query: 293 KARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WVKADQ 333
           K R  Y+FEK+ WFISSE +L                 +  D   ++N++    W+K  +
Sbjct: 567 KIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWIKNPE 626

Query: 334 VSKTAPT---------------------------------------GEFLTTGSFMIRGK 354
            ++  P                                           L  G+F ++ +
Sbjct: 627 KTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNE 686

Query: 355 --KNFFPPCQLAMGISFLFKLEESS 377
             +N  PP QL MG  FL+K++ S 
Sbjct: 687 NDQNHLPPAQLVMGFGFLWKVKTSG 711


>gi|6325248|ref|NP_015316.1| Tae2p [Saccharomyces cerevisiae S288c]
 gi|74676621|sp|Q12532.1|TAE2_YEAST RecName: Full=Translation-associated element 2
 gi|683781|emb|CAA88377.1| unknown [Saccharomyces cerevisiae]
 gi|965084|gb|AAB68096.1| Ypl009cp [Saccharomyces cerevisiae]
 gi|1314067|emb|CAA95032.1| unknown [Saccharomyces cerevisiae]
 gi|285815527|tpg|DAA11419.1| TPA: Tae2p [Saccharomyces cerevisiae S288c]
          Length = 1038

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 117/449 (26%), Positives = 197/449 (43%), Gaps = 120/449 (26%)

Query: 30  QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
           +GY++ K+     +E+YI     A++EF     HP    +  G T S         I E 
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329

Query: 79  ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           E  +   +D+FFST ES K  L+   QE  A KK+++ + +++ ++ AL   Q +++ K 
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
            LII N   ++   LA++  I  Q+ W  IE  +K  Q+  + +A ++  L LN+  N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSS--------------ETKKPSL-------- 233
           ++ L +  +  N         DN      +               E    S+        
Sbjct: 449 SVKLDLSSKELNTSSD----EDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKI 504

Query: 234 -VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTM 288
            V +DL LSAYANA  +F++K+++A+KQ+K  ++  KA+K+ E K     K+ LKD  ++
Sbjct: 505 NVTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV 564

Query: 289 TNINKARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WV 329
             + K R  Y+FEK+ WFISSE +L                 +  D   ++N++    W+
Sbjct: 565 --LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWI 622

Query: 330 KADQVSKTAPT---------------------------------------GEFLTTGSFM 350
           K  + ++  P                                           L  G+F 
Sbjct: 623 KNPEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFR 682

Query: 351 IRGK--KNFFPPCQLAMGISFLFKLEESS 377
           ++ +  +N  PP QL MG  FL+K++ S 
Sbjct: 683 LKNENDQNHLPPAQLVMGFGFLWKVKTSG 711


>gi|190407936|gb|EDV11201.1| hypothetical protein SCRG_02481 [Saccharomyces cerevisiae RM11-1a]
          Length = 1030

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 117/449 (26%), Positives = 197/449 (43%), Gaps = 120/449 (26%)

Query: 30  QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
           +GY++ K+     +E+YI     A++EF     HP    +  G T S         I E 
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329

Query: 79  ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           E  +   +D+FFST ES K  L+   QE  A KK+++ + +++ ++ AL   Q +++ K 
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
            LII N   ++   LA++  I  Q+ W  IE  +K  Q+  + +A ++  L LN+  N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSS--------------ETKKPSL-------- 233
           ++ L +  +  N         DN      +               E    S+        
Sbjct: 449 SVKLDLSSKELNTSSD----EDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKI 504

Query: 234 -VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTM 288
            V +DL LSAYANA  +F++K+++A+KQ+K  ++  KA+K+ E K     K+ LKD  ++
Sbjct: 505 NVTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV 564

Query: 289 TNINKARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WV 329
             + K R  Y+FEK+ WFISSE +L                 +  D   ++N++    W+
Sbjct: 565 --LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWI 622

Query: 330 KADQVSKTAPT---------------------------------------GEFLTTGSFM 350
           K  + ++  P                                           L  G+F 
Sbjct: 623 KNPEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFR 682

Query: 351 IRGK--KNFFPPCQLAMGISFLFKLEESS 377
           ++ +  +N  PP QL MG  FL+K++ S 
Sbjct: 683 LKNENDQNHLPPAQLVMGFGFLWKVKTSG 711


>gi|151942783|gb|EDN61129.1| conserved protein [Saccharomyces cerevisiae YJM789]
          Length = 1040

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 117/449 (26%), Positives = 197/449 (43%), Gaps = 120/449 (26%)

Query: 30  QGYLVQKKEKRPNSEDYI----VANMEF-----HPM--ILQPGSTPSVMRQHQGFPIQEF 78
           +GY++ K+     +E+YI     A++EF     HP    +  G T S         I E 
Sbjct: 282 KGYILAKR-----NENYISEKDTADLEFIYDTFHPFKPYINGGDTDSSC-------IIEV 329

Query: 79  ES-FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           E  +   +D+FFST ES K  L+   QE  A KK+++ + +++ ++ AL   Q +++ K 
Sbjct: 330 EGPYNRTLDKFFSTIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKG 389

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHI 195
            LII N   ++   LA++  I  Q+ W  IE  +K  Q+  + +A ++  L LN+  N I
Sbjct: 390 HLIIENAPLIEEVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLL-NLPLNLKQNKI 448

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSS--------------ETKKPSL-------- 233
           ++ L +  +  N         DN      +               E    S+        
Sbjct: 449 SVKLDLSSKELNTSSD----EDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKI 504

Query: 234 -VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----KQTLKDVQTM 288
            V +DL LSAYANA  +F++K+++A+KQ+K  ++  KA+K+ E K     K+ LKD  ++
Sbjct: 505 NVTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQLKKKLKDSHSV 564

Query: 289 TNINKARKVYWFEKFYWFISSENYL---------------GNVAWDAKVVTNAW----WV 329
             + K R  Y+FEK+ WFISSE +L                 +  D   ++N++    W+
Sbjct: 565 --LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFNSHVWI 622

Query: 330 KADQVSKTAPT---------------------------------------GEFLTTGSFM 350
           K  + ++  P                                           L  G+F 
Sbjct: 623 KNPEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFR 682

Query: 351 IRGK--KNFFPPCQLAMGISFLFKLEESS 377
           ++ +  +N  PP QL MG  FL+K++ S 
Sbjct: 683 LKNENDQNHLPPAQLVMGFGFLWKVKTSG 711


>gi|412992730|emb|CCO18710.1| predicted protein [Bathycoccus prasinos]
          Length = 1191

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 78/253 (30%), Positives = 130/253 (51%), Gaps = 21/253 (8%)

Query: 68  RQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALE 127
           R   G   +E   F   +DE+++  E  +  ++  + E  ALKK E +K+D E RL  LE
Sbjct: 378 RDSSGEEKEEQNVFNDVLDEYYAKLELQQGVVQRNRAEAQALKKFEKLKKDQENRLKVLE 437

Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
           +    + ++AELI  N + V   + A+   +A+ ++W+D+E+ + + +R  +PVA  I  
Sbjct: 438 RDAEKEMQRAELISRNVDDVANVLNAVNDALASGMAWDDLESMIHEEKRRGNPVAKFISS 497

Query: 188 LKLNINHITLLLR----VHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           L L  N +T+ L     V  + +  ++  V  H             KP  + +DL LSA+
Sbjct: 498 LDLKNNQVTIKLAPKEDVEIDETTGEEVVVKKH-------------KPIDIPLDLSLSAF 544

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK---QTLKDVQTMTNINKARKVYWF 300
           AN    FDLK+    K  KT +  ++ +  AEKK     +TL++ + +     AR  +WF
Sbjct: 545 ANCNARFDLKKKHDTKLDKTAKHHDRVVSDAEKKALMKIETLRE-KALPGAKAARTPFWF 603

Query: 301 EKFYWFISSENYL 313
           EKF W+++SEN L
Sbjct: 604 EKFIWWVTSENCL 616



 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 30/61 (49%), Gaps = 6/61 (9%)

Query: 317 AWDAKVVTNAWWVKADQVSKT----APTGEFLTTGSF-MIRG-KKNFFPPCQLAMGISFL 370
            W  K   +AWW +A  +SK           L +GSF +++G +K F PP +  MG   L
Sbjct: 697 CWAKKRSASAWWARARDISKRRWDLPEDASLLPSGSFALVKGARKKFLPPAKPEMGFGLL 756

Query: 371 F 371
           F
Sbjct: 757 F 757


>gi|238879662|gb|EEQ43300.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1018

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 119/459 (25%), Positives = 190/459 (41%), Gaps = 107/459 (23%)

Query: 19  HVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           ++ + N  ++++GY+V KK  + + +   +YI    EFHP   +P   P+     +G   
Sbjct: 257 YIDLINGAIATEGYIVAKKNNKVSEKSDLEYIYD--EFHP--FEP-YKPN----QEGIKF 307

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
                +   +D+FFS  ES K  +K  QQ+ +A K+LE  + + + ++ +L   Q ++ +
Sbjct: 308 ISVSGYNKTLDKFFSNIESTKFSIKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAK 367

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
           K ELI  + E V+     ++  I  Q+ W +IE  +   Q+  + +A  I QL LN+  N
Sbjct: 368 KGELIQYHSELVEECRSYVQSFIDQQMDWTNIETVISLEQKKKNELAQHI-QLPLNLKEN 426

Query: 194 HITLLLRVHKERSNIKDS-------------------WVLHHDNADPFADSSETKKPSLV 234
            I +LL    +     +S                        DN +       T++ +  
Sbjct: 427 KIKVLLEDFDDYEESTESASATETGSETETESESESSSESESDNDEDKIPVKRTQRKTNT 486

Query: 235 D-----------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
                       +DL  SA+ANA+ +FD K++A  KQ K   ST  ALK+AE+K  Q L 
Sbjct: 487 KEKPKRKTIPTWIDLSQSAFANARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLT 546

Query: 284 DVQTMTN--INKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWW---------VKAD 332
                 N  + + R  YWFEKF+WF+SSE YL     DA      ++         V AD
Sbjct: 547 RSLKQENDTLKEIRPKYWFEKFFWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSAD 606

Query: 333 QVS------KTAPTGEFL-----------------------TTGSFMIRG---------- 353
                    K    GE L                       TT ++++ G          
Sbjct: 607 MEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDG 666

Query: 354 ------------KKNFFPPCQLAMGISFLFKLEESSISR 380
                       +K + PP QL MG  F   L++ S  R
Sbjct: 667 SIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDDESTKR 705


>gi|297736763|emb|CBI25964.3| unnamed protein product [Vitis vinifera]
          Length = 403

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 114/206 (55%), Gaps = 16/206 (7%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P ++ Q +     +FE+F AA DEF+S  E  + + +   +E  A++KL  +  D E R
Sbjct: 198 CPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSEQQQKAKEVTAMQKLSKICMDQENR 257

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +  L++      + AELI  N E VDAAILA+R  +AN ++WED+   VK+ ++  +PVA
Sbjct: 258 VHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDLARMVKEKKKSGNPVA 317

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I +L L  N +TLLL                 +N D   D  +T     V+VDL LSA
Sbjct: 318 GLIDKLYLERNCMTLLL----------------SNNLDEMDDDEKTLHVDKVEVDLALSA 361

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTE 268
           +ANA+R+++ K+    K++KTI + E
Sbjct: 362 HANARRWYEQKKRQENKREKTIIAHE 387


>gi|367000852|ref|XP_003685161.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
 gi|357523459|emb|CCE62727.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
          Length = 1016

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 121/453 (26%), Positives = 193/453 (42%), Gaps = 121/453 (26%)

Query: 26  NVSSQGYLVQKK------EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFE 79
           N  S+GY++ K+      EK     +YI     FHP    P   P+ +   +   ++   
Sbjct: 267 NRYSKGYILAKRNPKFIEEKDSKDTEYIYET--FHP--FAPYVDPNEIDISKVIEVEG-- 320

Query: 80  SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAEL 139
            +   +D FFST ES K  L+   QE  A KKL++   ++ T+++AL   Q +++EK  L
Sbjct: 321 PYNNTLDLFFSTIESSKYALRIQNQEFLAKKKLDDAVNENLTKINALRDIQSINEEKGVL 380

Query: 140 IINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITL 197
           II   + ++    A++  I  Q+ W  IE  ++  Q+  + +A +I  L LN+  N I +
Sbjct: 381 IIEKADLIEEVKGAVQSLIDQQMDWNAIENIIRNEQKKRNNIARLI-MLPLNLKENKINI 439

Query: 198 LLRVHK--------------------------------ERSNIKDSWVLHHDNADPFADS 225
           +L                                    E++ IK     +  N       
Sbjct: 440 ILPAEDNNSDDSDNSSSSSDSDSEYSDNSDSDSSDDDIEKNRIKRKNRKNSKNV------ 493

Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK----QT 281
               K + + +DL LSA+ANA  +F+ K+++A+KQ+K  ++ EKALK+ E++ K    + 
Sbjct: 494 --KIKGTQITIDLALSAFANASEYFNKKKTSAEKQKKVEKNAEKALKNIEERIKVQLNKK 551

Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYL---------------------------- 313
           LKD   +  + K R  Y+FE+F WF SSE +L                            
Sbjct: 552 LKDSHDI--LKKIRAPYFFERFNWFFSSEGFLILMGKSPLDTDQIYSKYIEDDDIYMSNS 609

Query: 314 -GNVAW---------------DAKVV----TNAW---------WVKADQVSKTAPTGE-F 343
            G   W                A V+    + AW         W  A  VSK +  G+  
Sbjct: 610 FGTQVWIKNPEKTEIPPNTLMQAGVLCMSASEAWSKKIASSPWWCFAKNVSKFSSDGKSV 669

Query: 344 LTTGSFMIRGKK--NFFPPCQLAMGISFLFKLE 374
           L  G F ++  K  NF PP QL MG  FL+K++
Sbjct: 670 LEPGLFRMKNDKQQNFLPPAQLVMGFGFLWKVK 702


>gi|440301763|gb|ELP94149.1| zinc knuckle domain containing protein, partial [Entamoeba invadens
           IP1]
          Length = 703

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 162/371 (43%), Gaps = 104/371 (28%)

Query: 71  QGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQ 130
           +G     F+SF  A+DEF S  E  + + +  +++    KK++ V   HE R   L    
Sbjct: 1   KGRLFDTFDSFCDAMDEFHSHIEKQEYEEELEKKDATMKKKIQAVIDGHEKRYKGL---- 56

Query: 131 LVDKEKAELIINNQESVDAAILAIRQDI-------ANQLSWEDIEARVKQAQRHNDP--V 181
               EKAE ++   + V++ I+ + Q I       + ++ WE +E  + Q+ + NDP  V
Sbjct: 57  ---LEKAEEMVVKAKVVESHIIIVDQLIKEINVFLSEKMQWERVE-EIIQSAKENDPTSV 112

Query: 182 ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
           A  IK+                      D  VL  +NA+         +   VDV L  +
Sbjct: 113 AQYIKKFDF-----------------ANDVVVLSLENAN--------NQKIDVDVLLTKN 147

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQT-MTNINKAR----- 295
            + N + F++++R    K  KT++S E A++ A +K ++  K  Q  + ++ K R     
Sbjct: 148 GFENVRNFYEMRRVVLAKADKTLESRETAIQQATQKQERVAKTKQIDLADLKKMRRRFWF 207

Query: 296 -KVYWF---EKFYWFISSENYLGN------------------------------------ 315
            K +WF   E F   IS ++ L N                                    
Sbjct: 208 EKFHWFISSENFV-IISGKDALQNDVMYRRYMKNTDIYVHADIHGAASCLIKGVKGKVIG 266

Query: 316 ---------------VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
                           AW  K+VT+A+WV +DQVSKTAP+GE+L TGSFMIRGKKN+ PP
Sbjct: 267 AATLEQAGKVAVCRSSAWTNKIVTSAYWVYSDQVSKTAPSGEYLVTGSFMIRGKKNYLPP 326

Query: 361 CQLAMGISFLF 371
             L  G+  +F
Sbjct: 327 APLVFGLGIVF 337


>gi|145509741|ref|XP_001440809.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408037|emb|CAK73412.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1071

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 132/245 (53%), Gaps = 25/245 (10%)

Query: 70  HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
           +QG  + E  +F  AV ++F   +  + + +++  E  A KK EN+K+D  +R+  L++ 
Sbjct: 274 YQGKYVVETPTFNQAVHQYFLVVDRQEENKQSI--EDIAWKKFENIKQDQMSRIQKLQEE 331

Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLK 189
           Q     KA LI  N   V A I  I++ I N + W+ I+  +  +++  +P++++I  + 
Sbjct: 332 QDEYIMKAGLIQENINDVQAIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMN 391

Query: 190 LNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
           L  N +T+LL                 +  D ++D  +      +++D+  SAY NA+++
Sbjct: 392 LKQNKVTILL----------------GNKDDEYSDLIQ------IEIDITQSAYQNARKY 429

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKT-KQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
           ++ K+    K+ KT ++ E+ALK AEK   K+  ++   +  +   RK YWFEKF+WFIS
Sbjct: 430 YESKKKNRDKEIKTKEAVEQALKQAEKTALKEIEREKNKIQKVQNQRKKYWFEKFFWFIS 489

Query: 309 SENYL 313
           S+ YL
Sbjct: 490 SDGYL 494



 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 37/64 (57%), Positives = 48/64 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           +WDAK+V +AWWV A QVSK+APTG  +  GSFMI GKKNF  P +L MG + L++L++ 
Sbjct: 557 SWDAKIVVSAWWVHASQVSKSAPTGMNIPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQD 616

Query: 377 SISR 380
           SI R
Sbjct: 617 SIKR 620


>gi|255083452|ref|XP_002504712.1| predicted protein [Micromonas sp. RCC299]
 gi|226519980|gb|ACO65970.1| predicted protein [Micromonas sp. RCC299]
          Length = 1219

 Score =  111 bits (277), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 84/254 (33%), Positives = 134/254 (52%), Gaps = 14/254 (5%)

Query: 81  FAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140
           F AA+D FF++ E+ +   +  +    A KKLE V+RD E R+ ALE+ +   +  A LI
Sbjct: 344 FDAALDLFFASFEARRDRSRREKSANAAAKKLEKVRRDQEARVRALEKERESQELAATLI 403

Query: 141 INNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLR 200
             N   VDA + A+   +A  ++W+D+   +K+  R  +PVA ++K L L  N +T+ L+
Sbjct: 404 EYNLTQVDAVLAAVNGALAGGMAWDDLTLMIKEEARAGNPVARLVKTLDLPKNKVTVTLK 463

Query: 201 VH-----------KERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
            H            +  +  D+  +   +A P +   +      V++DL L A+ANA+  
Sbjct: 464 NHLDVDDDEGDDDGDDGDGGDADDVGEGDAKPRSRRLKRDGGVSVELDLALGAHANAREH 523

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT---LKDVQTMTNINKARKVYWFEKFYWF 306
           FD K+    K  KT+   ++A+ +AEKK K+    +    T   I +AR   WFEKF+WF
Sbjct: 524 FDRKKKHDAKHGKTLAQNKRAVAAAEKKAKEAGARMASKGTGMGIARARVPEWFEKFHWF 583

Query: 307 ISSENYLGNVAWDA 320
           I++EN L   A DA
Sbjct: 584 ITTENCLVLSARDA 597



 Score = 47.8 bits (112), Expect = 0.009,   Method: Composition-based stats.
 Identities = 21/61 (34%), Positives = 32/61 (52%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AWD++ V +A+W+  + V K  P G+ L  G     G K + PP  L MG   +F L 
Sbjct: 682 SAAWDSRHVVSAFWIPPENVRKVTPDGDPLAPGVVWHVGAKTYLPPAPLVMGFGCVFLLR 741

Query: 375 E 375
           +
Sbjct: 742 D 742


>gi|193787557|dbj|BAG52763.1| unnamed protein product [Homo sapiens]
          Length = 481

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 47/66 (71%), Positives = 55/66 (83%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AWDA+V+T+AWWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++
Sbjct: 6   SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 65

Query: 375 ESSISR 380
           ES + R
Sbjct: 66  ESCVWR 71


>gi|34364937|emb|CAE45889.1| hypothetical protein [Homo sapiens]
          Length = 505

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 47/66 (71%), Positives = 55/66 (83%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AWDA+V+T+AWWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++
Sbjct: 30  SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 89

Query: 375 ESSISR 380
           ES + R
Sbjct: 90  ESCVWR 95


>gi|349602918|gb|AEP98908.1| Serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 517

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 47/66 (71%), Positives = 55/66 (83%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AWDA+V+T+AWWV   QVSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++
Sbjct: 41  SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 100

Query: 375 ESSISR 380
           ES + R
Sbjct: 101 ESCVWR 106


>gi|167395586|ref|XP_001741648.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165893772|gb|EDR21907.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 960

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 159/360 (44%), Gaps = 87/360 (24%)

Query: 77  EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           EFESF  A+DEF S  E  + + +  ++E    KK++ V   H+ R   L       K +
Sbjct: 290 EFESFEKAMDEFHSHIEKQEYEAEVEKKEMIMKKKVQAVIDGHQKRYQGLLDKAETLKNE 349

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP--VASIIKQLKLNINH 194
           AE +  N + VD  I  I   +  ++ WE IE  +++  + NDP  +A  IK+       
Sbjct: 350 AEAVEENIQVVDQLIQEINVFLKEKMKWEQIEGIIEEL-KENDPTSIAKYIKRFDFKNEV 408

Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
           + L LR               H N D   D         V++ L+ + + N + F+++++
Sbjct: 409 VVLELR---------------HTNEDKIID---------VEIALNKNGFENVRNFYEMRK 444

Query: 255 SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-TMTNINKARKVYWFEKFYWFISSENYL 313
           +   K +KTI+S + A+K AE K ++  K+ + T+ ++ K RK +WFEKF+WF+SSEN++
Sbjct: 445 NILAKAEKTIESKDLAIKQAENKQERVAKEKKITLVDVKKMRKRFWFEKFHWFLSSENFI 504

Query: 314 ---GNVAWDAKVVTNAW------WVKAD-----------------------QVSKTAP-- 339
              G  A    ++   +      +V AD                       Q  K A   
Sbjct: 505 IISGKDALQNDIIYRRYMKNTDIYVHADIHGAASCIIKGIPGKTIGAPTLEQAGKIAVCR 564

Query: 340 ----TGEFLT---------------TGSFM------IRGKKNFFPPCQLAMGISFLFKLE 374
               T + +T               +G ++      IRGKKN+ PP  L  GI  +F +E
Sbjct: 565 SSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKNYLPPVPLVFGIGIMFVVE 624


>gi|313215449|emb|CBY16187.1| unnamed protein product [Oikopleura dioica]
          Length = 404

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 43/66 (65%), Positives = 58/66 (87%)

Query: 315 NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           + AW+AKV+T+AWWV A+QVSKTAP+GE+L+TGSFMIRGKKN+ PP QL +G  FLFKL+
Sbjct: 36  SAAWNAKVLTSAWWVHANQVSKTAPSGEYLSTGSFMIRGKKNYLPPSQLVLGFGFLFKLD 95

Query: 375 ESSISR 380
           ++ ++R
Sbjct: 96  DACVAR 101


>gi|145494650|ref|XP_001433319.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400436|emb|CAK65922.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1070

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 131/245 (53%), Gaps = 25/245 (10%)

Query: 70  HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQT 129
           +QG  + E  +F  AV ++F   +  + + +++  E  A KK EN+K+D  +R+  L+  
Sbjct: 274 YQGKYVIETPTFNEAVHQYFLVVDRQEDNKQSI--EDIAWKKFENIKQDQMSRIQKLQSE 331

Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLK 189
           Q     KA LI  N   V A I  I++ I N + W+ I+  +  +++  +P++++I  + 
Sbjct: 332 QDEYIMKAGLIQENINDVQAIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMN 391

Query: 190 LNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRF 249
           L  N +T+LL                 +  D ++D  +      +++D+  SA+ NA+++
Sbjct: 392 LKQNKVTILL----------------GNKEDEYSDLIQ------IEIDITQSAHQNARKY 429

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEKKT-KQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
           ++ K+    K+ KT ++ E+ALK AEK   K+  ++   +  +   RK YWFEKF+WFIS
Sbjct: 430 YESKKKNRDKEIKTKEAVEQALKQAEKTALKEIEREKNKIQKVQNQRKKYWFEKFFWFIS 489

Query: 309 SENYL 313
           S+ YL
Sbjct: 490 SDGYL 494



 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 37/64 (57%), Positives = 48/64 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           +WDAK+V +AWWV A QVSK+APTG  +  GSFMI GKKNF  P +L MG + L++L++ 
Sbjct: 557 SWDAKIVVSAWWVHASQVSKSAPTGMNIPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQD 616

Query: 377 SISR 380
           SI R
Sbjct: 617 SIKR 620


>gi|303290793|ref|XP_003064683.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453709|gb|EEH51017.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 807

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/383 (26%), Positives = 153/383 (39%), Gaps = 111/383 (28%)

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
           +Q +ESF  A+D +F++ E+     +  + E+  + +LE V++D   R +ALE+ +  D+
Sbjct: 313 VQAYESFDDALDAYFASFETQAATRQRERAEKAVVDRLEKVRKDQSQRAAALEREREADE 372

Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
            +A LI  N E VD A+ A+   +A  + W D+E  +++  R  +PVA  IK L L  N 
Sbjct: 373 LRATLIEYNLERVDVALAAVNNALAGGMGWGDLEIMIREETRAGNPVAGTIKSLDLANNK 432

Query: 195 ITLLLRVH------------KERSNIKDSWVLHHDNADPFADSSETKKPSL--------- 233
           IT+ L  H            ++  +                 SSE K+            
Sbjct: 433 ITVTLANHLDDDEDDEDEEEEDGEDEDKDGDEDDAGEGDDEKSSERKRKQQQKKLRRKRR 492

Query: 234 --VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
             V V+LD  LSAYANA+  F+ K+  A K  KT+  TE+A                   
Sbjct: 493 KAVAVELDLSLSAYANARTHFEKKKKHATKHDKTLAQTERA------------------- 533

Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------- 321
                       KF+WF+++EN L   A DA                             
Sbjct: 534 ------------KFWWFVTTENCLVVSARDAAQTDAMLKKYAPPGSSVVVGGGGGGGGAG 581

Query: 322 ------------------VVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGK 354
                               +NAW         +VK +Q+ K  P GE L  G     GK
Sbjct: 582 WCNGVPPASLAQAGAACLCRSNAWDSRQVISAWYVKPEQIRKETPEGEPLLNGVVWTVGK 641

Query: 355 KNFFPPCQLAMGISFLFKLEESS 377
           K F PP  L MG +++F L + +
Sbjct: 642 KTFLPPAPLVMGFAYMFVLGDDA 664


>gi|430813962|emb|CCJ28739.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 631

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 83/291 (28%), Positives = 142/291 (48%), Gaps = 30/291 (10%)

Query: 28  SSQGYLVQKKEK--RPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAV 85
           S  GY+V+K E   R N  D  + +    P+ +     P + +Q+   P     +F    
Sbjct: 277 SVTGYIVEKIESKTRINLNDITLESTP-DPVKIYVDFNPFIPKQYSNNPNYSVITFDDGY 335

Query: 86  DEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQE 145
           ++      S K D+K   Q+  A ++L+  K +H+ ++  L++ Q +  +KA+ I  NQE
Sbjct: 336 NK------SQKFDMKLKNQKDIAYRRLQITKEEHQKKIDDLQKFQNICIKKAKAIEENQE 389

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
            VD  I A+   +   + WEDI   VK  + +     +I    +L   H+          
Sbjct: 390 IVDETIKAVNTCVLRSMDWEDIAKLVKTEKEYESNTITI----QLPCPHL---------- 435

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTI 264
               D  +  +D+     +    K  +L +D+ L L+A+ NA+ +++ K++A+ K++KTI
Sbjct: 436 ----DDNIYENDSTTGLFNGQNDKTETLNIDIKLSLNAWTNARDYYEKKKAASVKEEKTI 491

Query: 265 QSTEKALKSAEKKTKQTLK--DVQTMTNINKARKVYWFEKFYWFISSENYL 313
            ++ KALK+AE+K    LK    Q    +   R + WFEKF WFISS+ YL
Sbjct: 492 AASSKALKNAERKINSDLKRNTAQEKKKLVPMRNLQWFEKFLWFISSDGYL 542


>gi|358254228|dbj|GAA54239.1| nuclear export mediator factor Nemf, partial [Clonorchis sinensis]
          Length = 527

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 41/62 (66%), Positives = 52/62 (83%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW+A+VVT+AWWV+ DQVSKTAP+GE+LTTG+FMIRG+KN+ PPC    G   LFKL+E 
Sbjct: 123 AWNARVVTSAWWVRQDQVSKTAPSGEYLTTGAFMIRGRKNYLPPCHFMYGFGVLFKLDEE 182

Query: 377 SI 378
           S+
Sbjct: 183 SV 184



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/44 (65%), Positives = 34/44 (77%)

Query: 270 ALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           ALKSA  K +QT KD++T+  I K RK  WFEKF+WFISSENYL
Sbjct: 1   ALKSAAFKAQQTRKDLRTVAQITKIRKPMWFEKFFWFISSENYL 44


>gi|403216659|emb|CCK71155.1| hypothetical protein KNAG_0G00970 [Kazachstania naganishii CBS
           8797]
          Length = 1006

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 150/321 (46%), Gaps = 49/321 (15%)

Query: 31  GYLVQKKEKRPNSE--------DYIVANME-FHPMILQPGSTPSVMRQHQGFPIQEFESF 81
           GY+V KK   PN E        +Y   N   F P I +   +   + + +G       ++
Sbjct: 270 GYIVAKK--NPNFEPSRDQEDLEYFFENFHPFKPFIKECERSILHVLKIEG-------TY 320

Query: 82  AAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELII 141
              VD+FFST ES K  +K   QE  A KKLE  + ++  R+ AL   Q  +++K  LII
Sbjct: 321 NRTVDKFFSTLESSKYAMKIQNQETLAGKKLEEARSENGKRIQALIDVQSQNEQKGHLII 380

Query: 142 NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLLR 200
            + E V+ A  A++  +  QL W  IE  +   Q+  + +A  IK  LKL  N I L L 
Sbjct: 381 THAELVEDAKGAVQGLLDQQLDWNIIEKLIITEQKKGNKIAKAIKLPLKLKKNTIVLELP 440

Query: 201 VHKERSNIKDSWV------------------------LHHDNADPFADSSETKKPSLVDV 236
           +        D+ +                          H +    ++     KP+ V V
Sbjct: 441 LEDNNDTEDDTELSEEVDSSDISSSELSSDEESDQGSTQHQHRK--SNRIRALKPTTVSV 498

Query: 237 D--LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL--KDVQTMTNIN 292
           D  LDLS YANA  +F +K+   +KQ+K  Q+ +KA+K+ E K  + L  K  ++   + 
Sbjct: 499 DIKLDLSTYANASEYFMVKKHTVEKQKKVEQNLDKAMKNIETKVNKQLNSKLKESHKVLK 558

Query: 293 KARKVYWFEKFYWFISSENYL 313
           + R  Y+FEK+ WFISSE +L
Sbjct: 559 RLRTPYFFEKYNWFISSEGHL 579


>gi|378754807|gb|EHY64836.1| hypothetical protein NERG_02239 [Nematocida sp. 1 ERTm2]
          Length = 697

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 152/369 (41%), Gaps = 113/369 (30%)

Query: 72  GFPIQEFESFAAAVDEFFSTAE-----SHKIDLKAVQ-QERDALKKLENVKRDHETRLSA 125
           G  IQEFE F AA+D  F+  E     S K   K  + QERD  KK+E            
Sbjct: 263 GQKIQEFEGFGAAMDAVFNVQEITETASQKKQRKIREAQERDLHKKIE------------ 310

Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII 185
            E T L  K+KAEL+  NQ  V   I  I    A  LS ++ E R ++ ++  +P A II
Sbjct: 311 -EMTIL--KDKAELLSENQAEVKNVISVIEAANAASLSEKEFE-RFRETEKDTNPTAQII 366

Query: 186 KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
           K  K+N  + T+ L + K+                             V +D   S +  
Sbjct: 367 K--KVNFGNKTVDLTLDKK----------------------------AVSIDYTKSIFEQ 396

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
               +      AKK ++ ++ T KAL  ++ K  +    V+ +  I   R  +WFEKF W
Sbjct: 397 INMLY----QKAKKIEEKLKKTRKALDESKHKEVEIASKVEKIEKIE--RNPFWFEKFRW 450

Query: 306 FISSENYLGNVAWDAKV------------------------------------------- 322
           FI+ ++ L     D+K                                            
Sbjct: 451 FITKDSDLIIAGRDSKQNEILVKKYLLDTDYYFHADIRGGSSVIVGEHATDHTKEIAASM 510

Query: 323 ---VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
              ++ AW          V+ DQVSKTAP GE+LT GSFMI GKK F+ P +L  G S +
Sbjct: 511 AMHLSKAWENNLITEVYCVRGDQVSKTAPAGEYLTHGSFMITGKKEFYHPTRLEYGFSLI 570

Query: 371 FKLEESSIS 379
           +K+E+  I+
Sbjct: 571 YKIEDEEIT 579


>gi|407846065|gb|EKG02413.1| hypothetical protein TCSYLVIO_006562 [Trypanosoma cruzi]
          Length = 1080

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 75/249 (30%), Positives = 126/249 (50%), Gaps = 17/249 (6%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP ++ Q+   G      +SF +  D FF   E+ KI+    ++    + K    +RDH+
Sbjct: 323 TPVLLAQYSSDGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNKFERDHQ 382

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            RL+ALE  +  ++ K E II N   +D AI  I   +A  + W+ + + +K+      P
Sbjct: 383 RRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLLKRRHAEGHP 442

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VA ++  L L  N I++L+  +++            D  +   D +    P +++V+L  
Sbjct: 443 VAYMVHDLFLERNSISVLVESNEQ-----------EDEGEEDCDVT----PMVIEVELSK 487

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +AYANA  +F   +S   K +KT+ +T KAL  AEKK ++     +T   I K R+ +W+
Sbjct: 488 TAYANATTYFSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWW 547

Query: 301 EKFYWFISS 309
           EKF WF +S
Sbjct: 548 EKFSWFRTS 556


>gi|343472755|emb|CCD15168.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 559

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 77/281 (27%), Positives = 137/281 (48%), Gaps = 18/281 (6%)

Query: 31  GYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS 90
           GYL+ K  KR   E       +F P++L      +V  QH       F +F++  DEFFS
Sbjct: 254 GYLL-KSSKRGGQEANDSRYDDFSPVLLDQYKKDAVAYQH-------FPNFSSVCDEFFS 305

Query: 91  TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAA 150
            +E  +I+    + +   + K E  +R+H  R+  L++++     K  LI  N E++D  
Sbjct: 306 YSEKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRKGHLIFQNTETIDKI 365

Query: 151 ILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKD 210
           I  I + +  ++ W+D  + +KQ +    P+AS+IK++      + +L+       N   
Sbjct: 366 IGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVVVLM-------NEDA 418

Query: 211 SWVLHHDNADPFADSSETKKPSLVDVDLDL--SAYANAKRFFDLKRSAAKKQQKTIQSTE 268
                    D   +  E +  +  ++++DL  +A+ NA+ +F   +S A K ++TI +TE
Sbjct: 419 DDDDDEQTEDEEGEKREDRDRATYEIEIDLTKTAHTNAEEYFARAKSTAAKLKRTIAATE 478

Query: 269 KALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           KA+  AE+K +      Q    I + R  +W+EKF WF +S
Sbjct: 479 KAMAGAERKGRTVTGKTQEKKIITE-RCRFWWEKFNWFRTS 518


>gi|71413048|ref|XP_808681.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70872935|gb|EAN86830.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1082

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 76/258 (29%), Positives = 129/258 (50%), Gaps = 22/258 (8%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +F P++L   S+  V+           +SF +  D FF   E+ KI+    ++    + K
Sbjct: 321 DFSPVLLAQYSSDGVVTSF-------LKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISK 373

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
               +RDH+ RL+ALE  +  ++ K E II N   +D AI  I   +A  + W+ + + +
Sbjct: 374 RNKFERDHQRRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLL 433

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           K+      PVA ++  L L  N I++L+  +++            D  +   D +    P
Sbjct: 434 KRRHAEGHPVAYMVHDLFLERNSISVLVESNEQ-----------EDEGEEDCDVT----P 478

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
            +++V+L  +AYANA  +F   +S   K +KT+ +T KAL  AEKK ++     +T   I
Sbjct: 479 MVIEVELSKTAYANATTYFAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAI 538

Query: 292 NKARKVYWFEKFYWFISS 309
            K R+ +W+EKF WF +S
Sbjct: 539 VKERRRFWWEKFSWFRTS 556


>gi|74025594|ref|XP_829363.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834749|gb|EAN80251.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1100

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 84/263 (31%), Positives = 131/263 (49%), Gaps = 16/263 (6%)

Query: 63  TPSVMRQHQGFPIQE--FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           +P ++ Q++G  +      SF +  D FF   E  KI+    +     L K E  +RDH 
Sbjct: 317 SPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVLSKKEKFERDHN 376

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R++ALE+++  +  K ELII N E +D AI  I   +A  + WE +   +KQ      P
Sbjct: 377 RRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRRLLKQRHAEGHP 436

Query: 181 VASIIKQLKLNINHITLLLRVHKE------------RSNIKDSWVLHHDNADPFADSSE- 227
           VA ++ +L L+ N I++L+  + E            +   K     +  N+    D  E 
Sbjct: 437 VAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGGNSGEKKDRVEG 496

Query: 228 -TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
            ++ PS+++VDL  +AYANA  +F  K++   K +KTI +T KA   AEKK ++     Q
Sbjct: 497 CSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAEKKGERLAAKKQ 556

Query: 287 TMTNINKARKVYWFEKFYWFISS 309
           T   I   R   W+EKF WF +S
Sbjct: 557 TKKAIATERHRCWWEKFNWFRTS 579



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 8/63 (12%)

Query: 317 AWDAKVVTNAWWVKADQ-VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
           AW++K    AWWV A Q V  TA        G +++ G+KN+  P  L +G   LF++  
Sbjct: 672 AWESKFSVGAWWVHASQIVGGTA-------AGCYLLSGEKNYLRPRPLMLGCGLLFRISS 724

Query: 376 SSI 378
            +I
Sbjct: 725 RAI 727


>gi|261335340|emb|CBH18334.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1100

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 84/263 (31%), Positives = 131/263 (49%), Gaps = 16/263 (6%)

Query: 63  TPSVMRQHQGFPIQE--FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           +P ++ Q++G  +      SF +  D FF   E  KI+    +     L K E  +RDH 
Sbjct: 317 SPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVLSKKEKFERDHN 376

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R++ALE+++  +  K ELII N E +D AI  I   +A  + WE +   +KQ      P
Sbjct: 377 RRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRRLLKQRHAEGHP 436

Query: 181 VASIIKQLKLNINHITLLLRVHKE------------RSNIKDSWVLHHDNADPFADSSE- 227
           VA ++ +L L+ N I++L+  + E            +   K     +  N+    D  E 
Sbjct: 437 VAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGGNSGEKKDRVEG 496

Query: 228 -TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
            ++ PS+++VDL  +AYANA  +F  K++   K +KTI +T KA   AEKK ++     Q
Sbjct: 497 CSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAEKKGERLAAKKQ 556

Query: 287 TMTNINKARKVYWFEKFYWFISS 309
           T   I   R   W+EKF WF +S
Sbjct: 557 TKKAIATERHRCWWEKFNWFRTS 579



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 8/63 (12%)

Query: 317 AWDAKVVTNAWWVKADQ-VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
           AW++K    AWWV A Q V  TA        G +++ G+KN+  P  L +G   LF++  
Sbjct: 672 AWESKFSVGAWWVHASQIVGGTA-------AGCYLLSGEKNYLRPRPLMLGCGLLFRISS 724

Query: 376 SSI 378
            +I
Sbjct: 725 RAI 727


>gi|224014996|ref|XP_002297159.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
 gi|220968134|gb|EED86484.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
          Length = 968

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 147/315 (46%), Gaps = 40/315 (12%)

Query: 14  CGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGF 73
           CG    V+ +  +  S GY++ K ++  +  DY    +EF P +L          QH+  
Sbjct: 234 CGEGAKVIENLSSGESGGYILYKPKQTDDKNDYNKTLLEFQPHLLH---------QHKNQ 284

Query: 74  PIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVD 133
               + +FA A DEFFS   S +I  +A   E  A ++L  ++ D + R+  L   Q   
Sbjct: 285 HALSYTTFATATDEFFSHLSSQRIAQRADAAEAAARERLSKIQLDQQRRVDGLVAEQEKS 344

Query: 134 KEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNIN 193
           ++ A L+  + E VD  +  I   + + ++W+ +E  V   Q + +P+A +I +L+L  +
Sbjct: 345 RDCARLVEMHAEDVDRVLGVINSALESGMNWDALEQLVLVEQGNENPIALLIFKLELCKD 404

Query: 194 HITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
            + L L    +  +                      K   V V +  SA+ NA+  F   
Sbjct: 405 QVVLALPDIDDWDDSD---------------PDRPPKLHYVTVSIKESAHGNARNMF--- 446

Query: 254 RSAAKKQQKTIQSTEKALKSAEKKTKQTLKD-------VQTMTNINKARKVYWFEKFYWF 306
             A  KQ KT++++  ALK+AE K KQ L +       +Q M N    RK YWFEKF WF
Sbjct: 447 --ATIKQSKTLEASTTALKAAEAKAKQQLAEAQKKKQRIQVMPN----RKTYWFEKFAWF 500

Query: 307 ISSENYLGNVAWDAK 321
           I+S+NYL     DA+
Sbjct: 501 ITSDNYLVVAGQDAQ 515



 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 39/65 (60%), Positives = 54/65 (83%), Gaps = 1/65 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
           AW +K+V +A+WV++ QVSKTAPTGE+LT GSFMIRG+KNF PP  L MG+  LF+L ++
Sbjct: 579 AWSSKMVCSAYWVESHQVSKTAPTGEYLTVGSFMIRGRKNFLPPSSLEMGMGVLFRLGDD 638

Query: 376 SSISR 380
           +S++R
Sbjct: 639 ASVAR 643


>gi|71411706|ref|XP_808091.1| hypothetical protein Tc00.1047053507483.60 [Trypanosoma cruzi
           strain CL Brener]
 gi|70872222|gb|EAN86240.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1081

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 76/258 (29%), Positives = 129/258 (50%), Gaps = 22/258 (8%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +F P++L   S+  V+       +   +SF +  D FF   E+ KI+    ++    + K
Sbjct: 321 DFSPVLLAQYSSDGVV-------MSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISK 373

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
               +RDH  RL+ALE  +  ++ K E II N   +D AI  I   +A  + W+ + + +
Sbjct: 374 RNKFERDHLRRLNALEMEEQENQRKGECIIQNVVKIDEAIGLINGALAAGIQWDALRSLL 433

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           K+      PVA ++  L L  N I++L+  +++            D  +   D +    P
Sbjct: 434 KRRHAEGHPVAYMVHDLFLERNSISVLVESNEQ-----------EDEGEEDCDVT----P 478

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
            +++V+L  +AYANA  +F   +S   K +KT+ +T KAL  AEKK ++     +T   I
Sbjct: 479 MVIEVELSKTAYANATTYFAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAI 538

Query: 292 NKARKVYWFEKFYWFISS 309
            K R+ +W+EKF WF +S
Sbjct: 539 VKERRRFWWEKFSWFRTS 556


>gi|170582502|ref|XP_001896158.1| hypothetical protein [Brugia malayi]
 gi|158596691|gb|EDP34993.1| conserved hypothetical protein [Brugia malayi]
          Length = 643

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 40/64 (62%), Positives = 51/64 (79%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW+AK+ ++AWWV   QVS+TAPTGE+LT GSFMIRGKKN+ P CQL MG   +F+L+E 
Sbjct: 40  AWEAKITSSAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEE 99

Query: 377 SISR 380
           S+ R
Sbjct: 100 SLER 103


>gi|407406699|gb|EKF30889.1| hypothetical protein MOQ_005283 [Trypanosoma cruzi marinkellei]
          Length = 1098

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 74/258 (28%), Positives = 129/258 (50%), Gaps = 22/258 (8%)

Query: 52  EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +F P++L   S+  V+           +SF +  D FF   E+ KI+    ++    + K
Sbjct: 333 DFSPVLLAQYSSEGVVTSF-------LKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISK 385

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
               +RDH+ RL+ LE  +  ++ K E II +   +D AI  I   +A  + W+ + + +
Sbjct: 386 RNKFERDHQRRLNTLEMEEQENQRKGECIIQHAVKIDEAIGLINGALAAGIQWDALRSLL 445

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           K+      PVA ++ +L L  N I++L+  +++            D  +   D +    P
Sbjct: 446 KRRHAEGHPVAYMVHELFLERNSISVLVESNEQ-----------EDEGEEDCDVT----P 490

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
            +++V+L  +AYANA  +F   +S   K +KT+ +T KAL  AEKK ++     +T   I
Sbjct: 491 MVIEVELSKTAYANATTYFSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAI 550

Query: 292 NKARKVYWFEKFYWFISS 309
            K R+ +W+EKF WF +S
Sbjct: 551 VKERRRFWWEKFSWFRTS 568


>gi|422293271|gb|EKU20571.1| hypothetical protein NGA_2069500, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 107

 Score =  100 bits (249), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 44/64 (68%), Positives = 50/64 (78%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW AK+VT+AWWV A QVSKTAP GEFL TGSFM+RGKKNF  P  L MG+  LFKL+E 
Sbjct: 28  AWKAKMVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFLAPQPLEMGLGLLFKLDEG 87

Query: 377 SISR 380
           S+ R
Sbjct: 88  SVGR 91


>gi|387595331|gb|EIJ92956.1| hypothetical protein NEPG_02355 [Nematocida parisii ERTm1]
          Length = 700

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 151/368 (41%), Gaps = 113/368 (30%)

Query: 72  GFPIQEFESFAAAVDEFFSTAE-----SHKIDLKAVQ-QERDALKKLENVKRDHETRLSA 125
           G  I+EF+ F +A+D  F+  E     S K   K  + QERD  KK++            
Sbjct: 263 GQKIKEFDGFGSAMDAAFAVQEITETVSQKKHRKIREAQERDLHKKID------------ 310

Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII 185
            E T L  K KAEL+  NQ  V   I  I    A  LS ++ E R K++++  +P A II
Sbjct: 311 -EMTIL--KTKAELLSENQAEVKNVISVIEAAHAASLSEKEFE-RFKESEKDKNPTAKII 366

Query: 186 KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
           K  K N    T+ L + K+                            LV +D   S +  
Sbjct: 367 K--KANFGKKTVDLIIDKQ----------------------------LVTIDYTASIFEQ 396

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF-- 303
               +      AKK ++ ++ T  AL+ +  K  +  K ++ +  I+  R V+WFEKF  
Sbjct: 397 INALY----QKAKKIEEKLKKTRVALEESRTKEIEVTKRIEKIEKID--RNVFWFEKFRW 450

Query: 304 ------------------------------YWF----------ISSENYLGNV------- 316
                                         Y+F          I  EN   +        
Sbjct: 451 LITKDSDLILAGRDSKQNEILVKKHLLDTDYYFHADVRGGSSVIVGENATVHTKEVAAAM 510

Query: 317 ------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
                 AW+   +T  + V+ +QVSKTAP GE+LT GSFMI GKK F+ P +L  G S +
Sbjct: 511 ALHLSKAWENSTITEVYCVRGEQVSKTAPAGEYLTHGSFMITGKKEFYHPTKLEYGFSIM 570

Query: 371 FKLEESSI 378
           +KL++  I
Sbjct: 571 YKLKDKEI 578


>gi|387592702|gb|EIJ87726.1| hypothetical protein NEQG_02273 [Nematocida parisii ERTm3]
          Length = 700

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 151/368 (41%), Gaps = 113/368 (30%)

Query: 72  GFPIQEFESFAAAVDEFFSTAE-----SHKIDLKAVQ-QERDALKKLENVKRDHETRLSA 125
           G  I+EF+ F +A+D  F+  E     S K   K  + QERD  KK++            
Sbjct: 263 GQKIKEFDGFGSAMDAAFAVQEITETASQKKHRKIREAQERDLHKKID------------ 310

Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII 185
            E T L  K KAEL+  NQ  V   I  I    A  LS ++ E R K++++  +P A II
Sbjct: 311 -EMTIL--KTKAELLSENQAEVKNVISVIEAAHAASLSEKEFE-RFKESEKDKNPTAKII 366

Query: 186 KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYAN 245
           K  K N    T+ L + K+                            LV +D   S +  
Sbjct: 367 K--KANFGKKTVDLIIDKQ----------------------------LVTIDYTASIFEQ 396

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF-- 303
               +      AKK ++ ++ T  AL+ +  K  +  K ++ +  I+  R V+WFEKF  
Sbjct: 397 INALY----QKAKKIEEKLKKTRVALEESRTKEIEVTKRIEKIEKID--RNVFWFEKFRW 450

Query: 304 ------------------------------YWF----------ISSENYLGNV------- 316
                                         Y+F          I  EN   +        
Sbjct: 451 LITKDSDLILAGRDSKQNEILVKKHLLDTDYYFHADVRGGSSVIVGENATVHTKEVAAAM 510

Query: 317 ------AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
                 AW+   +T  + V+ +QVSKTAP GE+LT GSFMI GKK F+ P +L  G S +
Sbjct: 511 ALHLSKAWENSTITEVYCVRGEQVSKTAPAGEYLTHGSFMITGKKEFYHPTKLEYGFSIM 570

Query: 371 FKLEESSI 378
           +KL++  I
Sbjct: 571 YKLKDKEI 578


>gi|221502557|gb|EEE28284.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1859

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 73/249 (29%), Positives = 121/249 (48%), Gaps = 20/249 (8%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F      VDE+FS+ +  K +    Q  ++AL ++E +K D E R+  LE+      ++A
Sbjct: 454 FRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAANLLQQA 513

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           + +  N   V+  I  +R  +A  + W+++  ++K   +   P+A  + +LKL      L
Sbjct: 514 QAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLEKQRAML 573

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           LL   +                    +  E  +  LV VD+ LSA+ NA+      +   
Sbjct: 574 LLEAPRREEA---------------EEPGEASETILVPVDVALSAHGNAQLLHSQVKQLK 618

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKD-----VQTMTNINKARKVYWFEKFYWFISSENY 312
            K QKT  +T  AL +A++K ++TLK      +Q    + K RK +WFEKF+WFISS++Y
Sbjct: 619 AKTQKTSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 678

Query: 313 LGNVAWDAK 321
           L     DA+
Sbjct: 679 LVLAGRDAQ 687



 Score = 71.6 bits (174), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
           AW  K  + AWWV   QVSK+AP+G +L+TGSFMIRG++NF    +L MG   LF+L +E
Sbjct: 750 AWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLLFRLADE 809

Query: 376 SSISR 380
           +S++R
Sbjct: 810 ASVAR 814


>gi|221482059|gb|EEE20420.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 1859

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 73/249 (29%), Positives = 121/249 (48%), Gaps = 20/249 (8%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F      VDE+FS+ +  K +    Q  ++AL ++E +K D E R+  LE+      ++A
Sbjct: 454 FRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAANLLQQA 513

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           + +  N   V+  I  +R  +A  + W+++  ++K   +   P+A  + +LKL      L
Sbjct: 514 QAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLEKQRAML 573

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           LL   +                    +  E  +  LV VD+ LSA+ NA+      +   
Sbjct: 574 LLEAPRREEA---------------EEPGEASETILVPVDVALSAHGNAQLLHSQVKQLK 618

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKD-----VQTMTNINKARKVYWFEKFYWFISSENY 312
            K QKT  +T  AL +A++K ++TLK      +Q    + K RK +WFEKF+WFISS++Y
Sbjct: 619 AKTQKTSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 678

Query: 313 LGNVAWDAK 321
           L     DA+
Sbjct: 679 LVLAGRDAQ 687



 Score = 71.6 bits (174), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
           AW  K  + AWWV   QVSK+AP+G +L+TGSFMIRG++NF    +L MG   LF+L +E
Sbjct: 750 AWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLLFRLADE 809

Query: 376 SSISR 380
           +S++R
Sbjct: 810 ASVAR 814


>gi|159477991|ref|XP_001697091.1| hypothetical protein CHLREDRAFT_181058 [Chlamydomonas reinhardtii]
 gi|158269999|gb|EDO96040.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 246

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 42/59 (71%), Positives = 48/59 (81%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
           AWD+K+VT+AWWV   QVSKTAP+GE+L TGSFMIRGKKNF PP  L MG  FLFK +E
Sbjct: 128 AWDSKIVTSAWWVHHHQVSKTAPSGEYLVTGSFMIRGKKNFLPPQPLVMGFGFLFKWDE 186


>gi|237842889|ref|XP_002370742.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
 gi|211968406|gb|EEB03602.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
          Length = 1859

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 73/249 (29%), Positives = 121/249 (48%), Gaps = 20/249 (8%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F      VDE+FS+ +  K +    Q  ++AL ++E +K D E R+  LE+      ++A
Sbjct: 454 FRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAANLLQQA 513

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           + +  N   V+  I  +R  +A  + W+++  ++K   +   P+A  + +LKL      L
Sbjct: 514 QAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLEKQRAML 573

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           LL   +                    +  E  +  LV VD+ LSA+ NA+      +   
Sbjct: 574 LLEAPRREEA---------------EEPGEASETILVPVDVALSAHGNAQLLHSQVKQLK 618

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKD-----VQTMTNINKARKVYWFEKFYWFISSENY 312
            K QKT  +T  AL +A++K ++TLK      +Q    + K RK +WFEKF+WFISS++Y
Sbjct: 619 AKTQKTSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 678

Query: 313 LGNVAWDAK 321
           L     DA+
Sbjct: 679 LVLAGRDAQ 687



 Score = 71.6 bits (174), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
           AW  K  + AWWV   QVSK+AP+G +L+TGSFMIRG++NF    +L MG   LF+L +E
Sbjct: 750 AWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLLFRLADE 809

Query: 376 SSISR 380
           +S++R
Sbjct: 810 ASVAR 814


>gi|156083749|ref|XP_001609358.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154796609|gb|EDO05790.1| conserved hypothetical protein [Babesia bovis]
          Length = 1006

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 88/366 (24%), Positives = 160/366 (43%), Gaps = 66/366 (18%)

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
           + F+ F  A+D +F+ +E  KI+ K  +Q +  +K L+ +K D   R    E+       
Sbjct: 315 ERFDDFNMALDAYFTKSELRKIERK--EQPKKPIK-LQKIKDDQNRRELEREREVHRLGV 371

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
              L+  ++++ D  +  +R  +A+  SW++I  ++ + +     +A  I+ + +    +
Sbjct: 372 SIALVEGHRDTFDTVLDLMRSLVASGASWQEITDQLSRQRDSGHLLARHIRSVNIPDRRV 431

Query: 196 TLLLRVHKER--SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLK 253
            + L        +N+       +      + SS+    + V +D  L+ + N +  +  K
Sbjct: 432 DVCLPNDDPGYYTNVTSMGDKRNKRGSKKSQSSDQFDDTSVTLDYGLTCFQNLEIMYSQK 491

Query: 254 RSAAKKQQKTIQSTEKALKSAEK-KTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
           +  A+K ++T    + ALK  ++ K KQ         ++ K RK  WFEKF+WFI+S+ +
Sbjct: 492 KRMAEKLERTRAGHQFALKRVDREKEKQVKSRGDRNVSLVKVRKRMWFEKFHWFITSDGF 551

Query: 313 L--------------------GNVAWDAKV------------------------------ 322
           L                    G++ + A V                              
Sbjct: 552 LVLGGRDSTQNELLVKRYLTKGDLYFHADVHGAASCILKNPSGNAESFPNTIDEAACFSL 611

Query: 323 -VTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFK 372
            +++AW         WV   QVS++AP+GE+L  GSFMIRGKKN+  P +L M I  +F 
Sbjct: 612 CLSSAWSQKMVVPAWWVHHHQVSRSAPSGEYLPHGSFMIRGKKNYVQPQRLEMAIGVVFH 671

Query: 373 LEESSI 378
           +E   I
Sbjct: 672 IEVPDI 677


>gi|297736765|emb|CBI25966.3| unnamed protein product [Vitis vinifera]
          Length = 369

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/63 (66%), Positives = 48/63 (76%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTAPT E+LT GSFMIRGKKNF PP  L MG   L  L+ES
Sbjct: 145 AWDSKIVTSAWWVYPHQVSKTAPTVEYLTVGSFMIRGKKNFLPPHPLMMGFGLLLCLDES 204

Query: 377 SIS 379
           S+ 
Sbjct: 205 SLG 207


>gi|257215816|emb|CAX83060.1| Serologically defined colon cancer antigen 1 [Schistosoma
           japonicum]
          Length = 521

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 38/62 (61%), Positives = 47/62 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW + V+T AWWV  DQVSKTAP+GE+LT+GSF+IRGKKN+ PPC    G   +FKL E 
Sbjct: 115 AWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGIMFKLHED 174

Query: 377 SI 378
           S+
Sbjct: 175 SV 176



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 27/40 (67%)

Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK 321
           +K  +T+  I + RK  WFEKF+WFISSENYL     D++
Sbjct: 1   MKTAKTIAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQ 40


>gi|56753953|gb|AAW25169.1| SJCHGC08981 protein [Schistosoma japonicum]
          Length = 414

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 38/62 (61%), Positives = 47/62 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW + V+T AWWV  DQVSKTAP+GE+LT+GSF+IRGKKN+ PPC    G   +FKL E 
Sbjct: 8   AWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGIMFKLHED 67

Query: 377 SI 378
           S+
Sbjct: 68  SV 69


>gi|254583608|ref|XP_002497372.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
 gi|238940265|emb|CAR28439.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
          Length = 1024

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 92/202 (45%), Gaps = 62/202 (30%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNI 291
           V +DL LSAYANA  +F++K++ A+KQ+K  ++ EKA K+ E+K  + LK    +T   +
Sbjct: 519 VSIDLGLSAYANASYYFNIKKNNAEKQKKVEKNVEKAFKNIEEKVGRQLKQKLKETHNVL 578

Query: 292 NKARKVYWFEKFYWFISSEN-------------------------YLGNV---------- 316
            K R  Y+FEK +WFISSE                          YL N           
Sbjct: 579 RKVRTPYFFEKHHWFISSEGFLVLMGKSDSETDLIYSKYIEDDDVYLFNTFGTQVWIKNP 638

Query: 317 ----------------------AWDAKVVTNAWWVKADQVSKTAPT-GEFLTTGSFMIRG 353
                                 AW  K+ ++ WW  A  +SK  P+    L  G F+++ 
Sbjct: 639 DSTEVPPNTLMQAGILCMSASEAWSKKISSSPWWCFAKNISKFEPSDNSVLPPGRFLLKN 698

Query: 354 K--KNFFPPCQLAMGISFLFKL 373
           +  KNF PP QL MG  FL+K+
Sbjct: 699 ENNKNFMPPAQLVMGFGFLWKV 720



 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 87/175 (49%), Gaps = 9/175 (5%)

Query: 31  GYLVQKKEKRPNSE----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVD 86
           GY++ K+    NSE    D       FHP   +P   P  +   +   I+    +   +D
Sbjct: 279 GYILAKRNVNFNSERDEKDLEFVYETFHP--FEPFVAPQNVGDTRTIKIEG--GYNKVLD 334

Query: 87  EFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQES 146
            FFST ES K  L+  QQE+ A K+LE  + D++ ++ AL   Q  ++EK   II N + 
Sbjct: 335 SFFSTIESSKYALRIQQQEQQATKRLEAARLDNQKKIQALVDAQSFNEEKGHSIIANADL 394

Query: 147 VDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRV 201
           V+    A++  +  Q+ W  IE  ++  Q+  + +A +I QL LN+    + +R+
Sbjct: 395 VEQTKSAVQGYVDQQMDWSTIEKLIQVEQKRGNKIAQLI-QLPLNLQENKIAIRL 448


>gi|294875379|ref|XP_002767293.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239868856|gb|EER00011.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 1087

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 41/62 (66%), Positives = 50/62 (80%), Gaps = 1/62 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC-QLAMGISFLFKLEE 375
           AWD K+VT+AWWV A QVSKTAPTGEFL+TGSFMIRGKKNF PP  +L MG+  ++ + +
Sbjct: 659 AWDNKIVTSAWWVWAKQVSKTAPTGEFLSTGSFMIRGKKNFLPPTGRLEMGLGVMWTVTD 718

Query: 376 SS 377
            S
Sbjct: 719 DS 720



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/205 (27%), Positives = 100/205 (48%), Gaps = 17/205 (8%)

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
           + E+ SF   VD++++     +++ + VQ++   + K+EN+K D   R+  LE+ Q    
Sbjct: 365 VVEYPSFTECVDDYYTRLMRAQLEGQLVQKQSQMISKVENIKSDQRRRMGELEKEQQSLW 424

Query: 135 EKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINH 194
           E+A  +  N    DAAI  +   +A +L W+++   VKQ QR   P+A  I+QL L+ N 
Sbjct: 425 EQAVALEANTTLADAAIQMVNALLAAKLRWDELTIAVKQQQRAGHPLAMHIRQLALDKNR 484

Query: 195 ITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKR 254
           I+++L                 + A    D  +      V +DL  +A AN     + ++
Sbjct: 485 ISIVL-----------------EKAASTDDDDDGATTVEVWLDLGRTAQANVALLHEKRK 527

Query: 255 SAAKKQQKTIQSTEKALKSAEKKTK 279
              +K  KT +   +A+K AEK+ K
Sbjct: 528 GMQEKMGKTEEQMARAVKMAEKRLK 552


>gi|297736751|emb|CBI25952.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 42/63 (66%), Positives = 48/63 (76%), Gaps = 1/63 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K+VT+AWWV   QVSKTA TGE+LT GSFMIRG KNF PP  L MG   LF L+ES
Sbjct: 149 AWDSKIVTSAWWVYPHQVSKTASTGEYLTVGSFMIRG-KNFLPPHPLMMGFGLLFCLDES 207

Query: 377 SIS 379
           S+ 
Sbjct: 208 SLG 210



 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/141 (36%), Positives = 76/141 (53%), Gaps = 12/141 (8%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V+VDL LSA+ANA+R+++ K+    KQ+KTI + EKA K+AEKK+   L  V        
Sbjct: 11  VEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQV-------G 63

Query: 294 ARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFM 350
              ++WFEKF WFISSENYL   G  A   K++   +  K D       T     + S  
Sbjct: 64  EHYIHWFEKFNWFISSENYLVISGRDAQQNKMIVKRYMSKGDLFIHFKSTNN--NSSSTF 121

Query: 351 IRGKKNFFPPCQLAMGISFLF 371
           +  +++    C++ +  S LF
Sbjct: 122 LFFQRHLNTCCRIPLNYSSLF 142


>gi|397618049|gb|EJK64734.1| hypothetical protein THAOC_14501 [Thalassiosira oceanica]
          Length = 1217

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 39/65 (60%), Positives = 53/65 (81%), Gaps = 1/65 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
           AW +K+VT+A+WV++ QVSKTAPTGE+LT GSFMIRG+KNF P   L MG+  LF+L ++
Sbjct: 707 AWSSKMVTSAYWVESHQVSKTAPTGEYLTVGSFMIRGRKNFLPASTLEMGVGVLFRLGDD 766

Query: 376 SSISR 380
            S++R
Sbjct: 767 VSVAR 771



 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 73/267 (27%), Positives = 131/267 (49%), Gaps = 26/267 (9%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P ++ QH+  P   + +FA A DEFFS   S K   +A   E  A ++L  +  D   R+
Sbjct: 394 PHLLIQHKNQPHLTYSTFATATDEFFSNLSSQKAAARADAAESAARERLAKIHADQARRV 453

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
             L + Q   ++ A L+  + + VD A+  I   + + + W+ +E  V   Q + +P+A 
Sbjct: 454 DGLVREQDKFRDAARLVELHADDVDRALGVINGALQSGMDWDQLEQLVTVEQGNENPIAL 513

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           +I +L L+ + I L L                  + D + D SE     +V V++  SA+
Sbjct: 514 LIHKLVLDKDEIMLAL-----------------PDIDNWEDESEAPPIVIVTVNIKESAH 556

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKS---------AEKKTKQTLKDVQTMTNINKA 294
            NA+  + + R++ +K++KTI+++E ALK+         AE + ++  K +   + + + 
Sbjct: 557 GNARAKYAVYRASKEKERKTIEASETALKAAEAKAKQQLAEAQKRKARKQLSVNSQVYQG 616

Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAK 321
              +   KF WFI+S+NYL     DA+
Sbjct: 617 NLQFCLNKFAWFITSDNYLVVAGKDAQ 643


>gi|387220185|gb|AFJ69801.1| hypothetical protein NGATSA_2069500, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 75

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 40/59 (67%), Positives = 46/59 (77%)

Query: 322 VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           +VT+AWWV A QVSKTAP GEFL TGSFM+RGKKNF  P  L MG+  LFKL+E S+ R
Sbjct: 1   MVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFLAPQPLEMGLGLLFKLDEGSVGR 59


>gi|154332902|ref|XP_001562713.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059716|emb|CAM41838.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 1198

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 122/253 (48%), Gaps = 17/253 (6%)

Query: 63  TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP ++ Q+    +  F   SF    DEFF   E+ +ID    +++  +  K +    DH 
Sbjct: 359 TPILLAQYTEDGVVSFYRASFGRVCDEFFLITETARIDASNEKRKNTSKNKEDKFAADHA 418

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R++ALE     ++ K + +I N + VD AI  I   +A  +SWE +   +K+      P
Sbjct: 419 RRINALETDIAANQLKGQQLILNADRVDEAIQLINGALATGISWEALRILLKRRHAEGHP 478

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VA +I  L L  N I++LL             VL  +  +   D      P +V+V L  
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------VLDEEAGEEDCDVP----PMVVEVALSK 523

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +A+ANA  +F  ++    K ++TI +T++A   A +K ++   + +    I K R+  W+
Sbjct: 524 TAHANAADYFGRQKQHRSKLERTIAATDRAAAGAARKGERKAAEQKERKVIVKERQRSWW 583

Query: 301 EKFYWFISSENYL 313
           EKF+WF +S   L
Sbjct: 584 EKFFWFRTSAGDL 596



 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 6/55 (10%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW+ K  T +WWV A QV+    TG +L T      G+++  PP  +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGTATGTYLYT------GERHHLPPQSMSLGCALLF 727


>gi|154304164|ref|XP_001552487.1| hypothetical protein BC1G_08352 [Botryotinia fuckeliana B05.10]
          Length = 484

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 38/64 (59%), Positives = 46/64 (71%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AWD+K   +AWWV ADQVSK+APTGEFL  GSF   GKKNF PP QL +G   LF++ + 
Sbjct: 48  AWDSKAGMSAWWVTADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDE 107

Query: 377 SISR 380
           S +R
Sbjct: 108 SKAR 111


>gi|67624075|ref|XP_668320.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54659500|gb|EAL38073.1| hypothetical protein Chro.50204 [Cryptosporidium hominis]
          Length = 1375

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 75/251 (29%), Positives = 119/251 (47%), Gaps = 24/251 (9%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
            ++F   VDEF+S+ +  K    A Q+ +    K++ VK D E RL  L   +     +A
Sbjct: 367 LDNFCKCVDEFYSSIDIVKESKFATQEHKTIYSKVDKVKIDQERRLEGLSSEKEACIVRA 426

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           + + ++QE ++  +  IR  IA    W+DI   ++Q +++N P+A  IK L L  + + +
Sbjct: 427 KFMESHQEILEKILQLIRHLIATGAQWQDIWNEIQQQKKNNHPLARHIKSLNLKDDKVKI 486

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           L       S I            P  D  +  K    D+ +  S  +N +  +   ++ A
Sbjct: 487 LFSQRDLGSEIT-----------PVVD--QIGKSIEFDLIISKSIQSNIRFQYMESKALA 533

Query: 258 KKQQKTIQSTEKAL--------KSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           +K +KT  + + AL        K AEK +K  + +V     I K R  YWFEKFYWFISS
Sbjct: 534 EKFEKTQHAYKIALKKVTNIAKKDAEKASKGLVSNV---PRIKKLRAQYWFEKFYWFISS 590

Query: 310 ENYLGNVAWDA 320
           + YL     DA
Sbjct: 591 DGYLIIGGHDA 601



 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 35/55 (63%), Positives = 42/55 (76%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW  K V +AWWV  DQVSK AP+GE+L+TGSF+IRGKKNF PP +L MG +  F
Sbjct: 658 AWVNKTVISAWWVYPDQVSKNAPSGEYLSTGSFVIRGKKNFLPPLKLEMGCALYF 712


>gi|70949333|ref|XP_744087.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56523889|emb|CAH79538.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 1345

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/268 (30%), Positives = 124/268 (46%), Gaps = 33/268 (12%)

Query: 51  MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDL-KAVQQERDAL 109
           +EF P++L+           +   + +F  F   VD +FS  E  K D  + + + ++AL
Sbjct: 406 VEFSPILLKNHINKI---DEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKNAL 462

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
            K++ +K DHE R+ ALE+   + K+K  LI  N E V  AI  +R  I+   +WE I  
Sbjct: 463 TKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKIWD 522

Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
            VK  ++ N PVA  I  +  N   I LLL       N  D+     +++       E  
Sbjct: 523 HVKLFKKRNHPVALKIMSVNFNNCEIELLL-------NEGDTEESSSEDSSKEKGMEEKN 575

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV---- 285
           K                     L++ A +K +K   ST  A+K  EKK K          
Sbjct: 576 KACT------------------LRKKAEEKIRKIKMSTNVAIKKVEKKKKDKDTKQKGKH 617

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL 313
           +++  I K RK++WFEKF WF+SSENYL
Sbjct: 618 KSVFQIQKLRKIFWFEKFNWFLSSENYL 645



 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 35/60 (58%), Positives = 49/60 (81%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW+ KV+T+AWWV   QVSKTAPTGE++ TGSF+IRGKKN+ P  +L MG+S +F++ ++
Sbjct: 709 AWNNKVITSAWWVYYHQVSKTAPTGEYIKTGSFVIRGKKNYLPYAKLEMGLSIIFQVNKN 768


>gi|300706574|ref|XP_002995542.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
 gi|239604689|gb|EEQ81871.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
          Length = 644

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 92/384 (23%), Positives = 156/384 (40%), Gaps = 108/384 (28%)

Query: 52  EFHPMILQPGSTPSV-MRQHQGFPI----------QEFESFAAAVDEFFSTAESHKIDLK 100
           EF+  I   G   +V M++ + F            + F+SF  AV+ FF      KI   
Sbjct: 202 EFNEFISNIGGFGAVQMKKGKPFSFFPFDIEIDNKKTFQSFNEAVEFFFMDRRKKKI--- 258

Query: 101 AVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN 160
              ++ D L+K+ N + +H   L  + +   +   KA+LI+ N + V+  +      I N
Sbjct: 259 ---EKVDKLQKIRNKQYEHIKELENMVKDMTM---KADLILKNADIVENVLDIHNYVIKN 312

Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNAD 220
           +L+W D     +  +   + +A II                   +S+ K        N  
Sbjct: 313 KLNWNDFLKFKEDEKSKGNEIADII------------------VKSDFK--------NKS 346

Query: 221 PFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280
              D  + +    +++  D S ++NA+ +F+ KR   KK ++ I  TEKA+ + + KT  
Sbjct: 347 CIIDLKDNEDSHFIEISFDKSLHSNAQNYFE-KR---KKFEEKILKTEKAIDTIKIKTYT 402

Query: 281 TLKDVQTMTNINKARKVYWFEKFYWFISSENYL--------------------------- 313
             + ++        R V+WFEKF +  +++  L                           
Sbjct: 403 KEEKIKI------QRSVFWFEKFNFCFTTDKKLVIGGKNAQQNEIIVKKHLTPNHLYFHT 456

Query: 314 -------------------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGS 348
                                     +  W+  VV+  ++VK+DQVSKT PTG+FL  GS
Sbjct: 457 ESSGGSSVISEADVNIDEVALVALCNSACWEVNVVSPVFYVKSDQVSKTPPTGQFLPKGS 516

Query: 349 FMIRGKKNFFPPCQLAMGISFLFK 372
           F+IRG K +    +L  G+  LFK
Sbjct: 517 FLIRGTKTYVNVYKLEYGVGLLFK 540


>gi|428671810|gb|EKX72725.1| hypothetical protein BEWA_012840 [Babesia equi]
          Length = 842

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 77/297 (25%), Positives = 125/297 (42%), Gaps = 88/297 (29%)

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           E++  N + VD  +  IR  IA   SW+DI+ +++  +    P+A  I ++ +    +T 
Sbjct: 229 EILQENIDLVDKCLNLIRTLIATGASWKDIQDQIEIQKTLKHPIAQRIAKIDIPNQIVTF 288

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
                                     D ++   P   +++  LS + N ++ +  K+   
Sbjct: 289 A------------------------GDPNDKSGP---EIEYGLSCHQNLEKMYSKKKKLE 321

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-INKARKVYWFEKFYWFISSENYL--- 313
            K ++T    E ALK   K+ ++    V+  T+ I   RK +WFEKFYWFI+S+ YL   
Sbjct: 322 SKLERTKIGREFALKKVNKEKEKEQTQVKGPTHKITTLRKRFWFEKFYWFITSDGYLVLG 381

Query: 314 -----------------GNVAWDAKV-------------------------------VTN 325
                            G++ + A +                                +N
Sbjct: 382 GRDALQNEILVKKYLTKGDLYFHADIHGASSCILKNPGQVAEIPQTSIDQGGCFAVCHSN 441

Query: 326 AW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AW         WV   QVSKT PTGEF+  GSF+IRGKKN+  P +L MG++ +F +
Sbjct: 442 AWNQKFMVPSWWVYHHQVSKTPPTGEFVPQGSFVIRGKKNYIQPQKLEMGLTVVFHI 498


>gi|398011164|ref|XP_003858778.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322496988|emb|CBZ32058.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 1228

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 17/253 (6%)

Query: 63  TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP ++ Q+    ++     SF    DEFF   E+ +ID    ++ + A  K +    DH 
Sbjct: 359 TPILLAQYTNDGVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHA 418

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R++ALE     ++ K E +I N + VD AI  I   +A  +SW+ +   +K+      P
Sbjct: 419 RRINALETDIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 478

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VA +I  L L  N I++LL             VL  +  +   D      P +V+V L  
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------VLDEEKGEEDCDV----PPLVVEVTLSK 523

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +A+ANA  +F  ++    K ++T+ +TEKA   A +K  +     +    I K R+  W+
Sbjct: 524 TAHANAADYFSKQKQHRSKLERTVAATEKAAAGAARKGARKAAAQKEKKVIVKERQRNWW 583

Query: 301 EKFYWFISSENYL 313
           EKF+WF ++   L
Sbjct: 584 EKFFWFRTTAGDL 596



 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW+ K  T +WWV A QV+          TG+++  G+++  PP  +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 727


>gi|302761992|ref|XP_002964418.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
 gi|300168147|gb|EFJ34751.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
          Length = 382

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 7/96 (7%)

Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGE 342
           KDV T+++   ++ +   +++  +       G       ++T+AWWV   QVSK APTGE
Sbjct: 142 KDVATISHTTGSKSLTVHKQYVCYA-----FGPAG--PTIITSAWWVYDHQVSKNAPTGE 194

Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           +LT GS MIRGKKNF PP  L MG    F+L++SSI
Sbjct: 195 YLTVGSLMIRGKKNFLPPYPLVMGFGLFFRLDKSSI 230



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 1/85 (1%)

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
           ++F++F A +DEFFS  E  ++D +   QE  A  KLE ++ D  +R+ +L++       
Sbjct: 63  EKFDTFDAVLDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRVDQRSRVESLKREVDQAVH 122

Query: 136 KAELIINNQESVDAAILAIRQDIAN 160
            AELI  N   VD AI  + +D+A 
Sbjct: 123 TAELIEYNLADVDLAI-DVAKDVAT 146


>gi|146078492|ref|XP_001463556.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134067642|emb|CAM65921.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 1228

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 17/253 (6%)

Query: 63  TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP ++ Q+    ++     SF    DEFF   E+ +ID    ++ + A  K +    DH 
Sbjct: 359 TPILLAQYTNDGVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHA 418

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R++ALE     ++ K E +I N + VD AI  I   +A  +SW+ +   +K+      P
Sbjct: 419 RRINALETDIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 478

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VA +I  L L  N I++LL             VL  +  +   D      P +V+V L  
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------VLDEEKGEEDCDV----PPLVVEVTLSK 523

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +A+ANA  +F  ++    K ++T+ +TEKA   A +K  +     +    I K R+  W+
Sbjct: 524 TAHANAADYFSKQKQHRSKLERTVAATEKAAAGAARKGARKAAAQKEKKVIVKERQRNWW 583

Query: 301 EKFYWFISSENYL 313
           EKF+WF ++   L
Sbjct: 584 EKFFWFRTTAGDL 596



 Score = 46.2 bits (108), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW+ K  T +WWV A QV+          TG+++  G+++  PP  +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 727


>gi|157865120|ref|XP_001681268.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68124563|emb|CAJ02783.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1224

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 120/253 (47%), Gaps = 17/253 (6%)

Query: 63  TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP ++ Q+    ++     SF    DEFF   E+ +ID    +++  A  K +    DH 
Sbjct: 359 TPILLAQYANDGVEALYRTSFGRVCDEFFLLTETERIDASNAKRKNTAKSKEDKFAADHA 418

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R++ALE     ++ K E +I N + VD AI  I   +A  +SW+ +   +K+      P
Sbjct: 419 RRINALETDIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 478

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VA +I  L L  N I++LL              L  +N +   D      P +V+V L  
Sbjct: 479 VAYMIHDLFLERNSISVLLET-----------ALDEENGEEDCDVP----PLVVEVALSK 523

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +A+ANA  +F  ++    K ++T+ +TEKA   A +K  +     +    I K R+  W+
Sbjct: 524 TAHANAADYFSKQKQYRSKLERTVAATEKAAAGAARKGARKAAGQKEKKVIVKERQRNWW 583

Query: 301 EKFYWFISSENYL 313
           EKF+WF ++   L
Sbjct: 584 EKFFWFRTTAGDL 596



 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW+ K  T +WWV A QV+          TG+++  G+++  PP  +++G + LF
Sbjct: 679 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 727


>gi|124805420|ref|XP_001350435.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
 gi|23496557|gb|AAN36115.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
          Length = 2158

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 34/60 (56%), Positives = 49/60 (81%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW+ K++T+AWWV  +QVSK+AP+GE+L TGSF+IRGKKN+ P  +L MG   LF++E++
Sbjct: 753 AWNNKIITSAWWVYYNQVSKSAPSGEYLKTGSFVIRGKKNYLPHVKLEMGFCVLFQIEKN 812



 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 81/277 (29%), Positives = 130/277 (46%), Gaps = 26/277 (9%)

Query: 52  EFHPMILQPGSTPSVMRQHQG-FPIQEFESFAAAVDEFFSTAESHKID-LKAVQQERDAL 109
           EF P+IL+       M+ ++G      F+ +   VD +FS  E  K D  + + + ++A+
Sbjct: 438 EFSPIILKNHE----MKLNEGKIKYISFDDYNLCVDTYFSKLELSKYDKQQEITKSKNAI 493

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
            K++ +K DHE R+  LE+  L+ K+K  LI  N   ++  I  +R  ++   +WE I  
Sbjct: 494 TKVDKIKLDHERRIEQLEKEVLLLKKKITLIQLNDVLIEEGIKLMRSALSTSANWEKIWE 553

Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
            +K  ++   P+A  IK +      +  LL    +R   K              D ++  
Sbjct: 554 HIKIFKKQEHPIAVRIKSVNFKNCEMDYLLSDCDDRKGNK---------MGDDGDDNDDD 604

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV---- 285
                D + +  +    K F     +A +K +KT  +T+ A+K  EKK K    +     
Sbjct: 605 DDGDDDNNNNNKSCVKPKTF-----AAEEKIRKTKMATDFAVKKVEKKKKNKDNNKQKGK 659

Query: 286 --QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
              ++  I K RKVYWFEKF+WFISSENYL     DA
Sbjct: 660 AKSSVGQIQKLRKVYWFEKFHWFISSENYLVIAGRDA 696


>gi|396081612|gb|AFN83228.1| putative RNA-binding protein [Encephalitozoon romaleae SJ-2008]
          Length = 648

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 83/352 (23%), Positives = 148/352 (42%), Gaps = 102/352 (28%)

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
           ++F +F  A + FF   +      K  + +R++  K++ V++  E  +  +EQ +   ++
Sbjct: 243 RDFSTFNDAAEFFFQNRK------KFGRNDRES--KVDKVRKRQENYMKEMEQERQSYRK 294

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
           KAEL+  N + V+  +   +    N++ W D E   +Q  +  + ++  I          
Sbjct: 295 KAELLEENADFVNKILDIFKIVKKNKVRWTDFEKFREQENKKGNEISKAI---------- 344

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
                       +K  ++ H       A   E      + +D + S ++N  RF+   + 
Sbjct: 345 ------------VKTDFISH---TCTIALEGEE-----IQIDFETSLFSNISRFYQKNKK 384

Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKF-YWF-------I 307
             +K +KT  S E+ LK    K      + + +T     R +YWFEKF ++F       I
Sbjct: 385 LEEKIKKTRDSLEEVLKKVAPKV-----ETKKVT-----RALYWFEKFHFFFSSDGILVI 434

Query: 308 SSEN------------------YLGNV----------------------------AWDAK 321
             +N                  + G++                             W+A 
Sbjct: 435 GGKNAQQNEILVKKHLEPTDLYFHGDMHGSSSIIVKKPTQKTIEEAASMALCMSKCWEAN 494

Query: 322 VVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           VV+  W+V  +QVSKTAP+GE+LT GSFMI+GKKN+    ++  G+  LFK+
Sbjct: 495 VVSPVWYVYGEQVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546


>gi|209875685|ref|XP_002139285.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209554891|gb|EEA04936.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 1427

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 159/369 (43%), Gaps = 76/369 (20%)

Query: 72  GFPIQEFES-FAAAVDEFFSTAESHKIDLKAVQQERDALK-KLENVKRDHETRLSALEQT 129
           G  +  F S F+  VDEF+S+ + +K + K +QQE  A+  K++ ++ D E RL  L + 
Sbjct: 379 GLIVNRFTSKFSDCVDEFYSSIDINK-ETKEIQQEEKAINSKIDKLRIDQERRLKELVEE 437

Query: 130 QLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLK 189
           +    ++A  +   +  ++  +L  R  IA    W+DI   V+Q ++   P+A  IK L 
Sbjct: 438 KEACIKRANFMECCELLLEKILLLTRHLIATGAQWKDICNEVRQQRKIGHPIAKYIKSLD 497

Query: 190 LNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL--SAYANAK 247
           L  + + +     +   +   S   + ++      +S+ K    +++ L++  S  AN +
Sbjct: 498 LEHDRVVVYFGADEFPEDFDYSRYGYGES------NSKLKSQEGIEIYLNISKSMQANIR 551

Query: 248 RFFDLKRSAAKKQQKTIQSTEKAL----KSAEKKTKQTLKDVQTMTN-INKARKVYWFEK 302
             ++  +  + K ++T  + ++AL    K+  + T++    + T  N I+K R+ YWFEK
Sbjct: 552 SEYEESKHISAKLERTKSAYKRALNKVTKTVNRNTEKLTGPLNTGVNRIHKIRQSYWFEK 611

Query: 303 FYWFISSENYL---GN------VAWDAKVVTNAWWVKADQVSKTA------------PTG 341
           F+WFISS+ +L   GN      + +   +  N  ++ AD    T             P  
Sbjct: 612 FHWFISSDGFLVIGGNDSSQNELLYRRYLEKNDRYIHADTHGATTCIVKNPKNLADIPMN 671

Query: 342 EFLTTGSFMI---------------------------------------RGKKNFFPPCQ 362
                G   I                                       RGKKNF PP +
Sbjct: 672 TLCEAGQMSICYSRSWANKTVISAWWVYPDQVSKTAPSGEYLTTGSFVIRGKKNFLPPLK 731

Query: 363 LAMGISFLF 371
           L MGI+ +F
Sbjct: 732 LEMGIALVF 740


>gi|76156132|gb|AAX27365.2| SJCHGC07862 protein [Schistosoma japonicum]
          Length = 241

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 54/161 (33%), Positives = 94/161 (58%), Gaps = 11/161 (6%)

Query: 28  SSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDE 87
           S +GY+  K  +  ++E  + +  EFHP + +         Q++  P   F+SF  AVD 
Sbjct: 82  SLKGYIFGKPHQ--STEKGLQSYEEFHPFMFE---------QYREKPHLAFDSFNRAVDA 130

Query: 88  FFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESV 147
           FFS  ES K   +  + E+ A +K+EN+K+D E R+  L+  Q +D +KA LI  N++ V
Sbjct: 131 FFSKIESQKTLGQISRNEQKANRKVENIKKDQERRIMLLKTEQELDMQKAYLIEANRQLV 190

Query: 148 DAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
           D  I+ I   ++NQ+ W+++E  +++A++ NDP++  I +L
Sbjct: 191 DNIIILINHALSNQIDWKELELIIEEAKQRNDPLSCHIVEL 231


>gi|300176454|emb|CBK23765.2| unnamed protein product [Blastocystis hominis]
          Length = 767

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 90/359 (25%), Positives = 153/359 (42%), Gaps = 80/359 (22%)

Query: 102 VQQERDAL-KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN 160
           +++ R+A+ K++E +      R+ ALE      + +   + +    V+ AI+ + Q ++ 
Sbjct: 1   MRRSREAINKRVETISAKQSARIRALESEIAACRRRIRALQSYPTVVNTAIVVLNQYLSQ 60

Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI---------------TLLLRVHKER 205
            + W  ++ +VK  ++    V   IKQL L  N +                    +  E 
Sbjct: 61  GVQWSVLQEQVKLLKQRPYNVFHHIKQLDLEHNRVKLEFDDDFDDDDDDDDDDDNLEDED 120

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS--AYANAKRFFDLKRSAAKKQQKT 263
            N++D        A   A SS ++ P  ++VD++LS     N    F  K+    K  KT
Sbjct: 121 DNLEDDDDNLEGEAKMAAASSRSESPHKINVDVELSLNCNQNISLLFSQKKDLQDKLDKT 180

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTM--TNINKARKVYWFEKFYWFISSENYL-------- 313
           +Q+ + A+  A K+ +  L+  +      I + R+  WFEKF W ++++ ++        
Sbjct: 181 VQAAQAAVAEASKQRQTELRVAEAAHPAEIARQREKRWFEKFDWCVTTDGFIVLAGKSGE 240

Query: 314 ------------------GNVAWDAKVV-------------------------TNAW--- 327
                              +V   A VV                         ++AW   
Sbjct: 241 QNEILVRRYLRPGDLFLHADVHGAATVVLRNYRAPELPGEAALLQAAAFALCHSSAWDAQ 300

Query: 328 ------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
                 WV A QVSKTAP+GE+L TGSFMIRGKKNF  P ++ MG++ LF++    + R
Sbjct: 301 LLCKVYWVPARQVSKTAPSGEYLPTGSFMIRGKKNFLAPYRMEMGLTVLFEVRPEDVQR 359


>gi|223478404|ref|YP_002582764.1| fibronectin-binding protein A domain-containing protein
           [Thermococcus sp. AM4]
 gi|214033630|gb|EEB74457.1| Fibronectin-binding protein A domain protein [Thermococcus sp. AM4]
          Length = 650

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/365 (21%), Positives = 147/365 (40%), Gaps = 89/365 (24%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ ++G     F++F+ A+DE+F      K  ++  ++  +  ++L    R  E  
Sbjct: 242 VPIELKIYEGLEKHYFKTFSEALDEYFGKLTIEKAKIERTRKLENKKRQLLATLRKQEEM 301

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   E+    ++E  +LI  N   ++  +   R+    +L WE+ + R++  ++  + VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYALIERLLEEFRK-ATEKLGWEEFKKRIEAGKKEGNRVA 360

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            ++K +      +T+ L   K +  +  S     +NA+ +                    
Sbjct: 361 LMVKSIDPKEKAVTIELEGKKVKLYLNKSI---GENAELY-------------------- 397

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           Y  AK+F    R   +   K  + T++ L   EK  ++ ++    +  I + RK  WFEK
Sbjct: 398 YEKAKKF----RHKYEGALKAYEDTKRKLDEVEKLIEEEMRKELNVKRIER-RKKKWFEK 452

Query: 303 FYWFISSENYLGNVAWDAKV---------------------------------------- 322
           F WF+SSE +L     DA                                          
Sbjct: 453 FRWFVSSEGFLVLAGKDASTNEVLIKKHMTENDLYCHADVYGAPHVVIKDGQKAGERTIF 512

Query: 323 --------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQ 362
                   ++ AW          W   +QV+K AP+GE+L  G+FM+ GK+N+    P +
Sbjct: 513 EACQFAVSMSRAWSQGLYGADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLK 572

Query: 363 LAMGI 367
           LA+G+
Sbjct: 573 LAVGV 577


>gi|66357888|ref|XP_626122.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
           II]
 gi|46227289|gb|EAK88239.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
           II]
          Length = 1378

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 35/55 (63%), Positives = 42/55 (76%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW  K V +AWWV  DQVSK AP+GE+L+TGSF+IRGKKNF PP +L MG +  F
Sbjct: 661 AWVNKTVISAWWVYPDQVSKNAPSGEYLSTGSFVIRGKKNFLPPLKLEMGCALYF 715



 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/251 (29%), Positives = 118/251 (47%), Gaps = 24/251 (9%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
            ++F   VDEF+S+ +  K    A Q+ +    K++ VK D E RL  L   +     +A
Sbjct: 370 LDNFCKCVDEFYSSIDIVKESKFATQEHKTIYSKVDKVKIDQERRLEGLSSEKEACIVRA 429

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           + + ++QE ++  +  IR  IA    W+DI   ++Q +++N P+A  IK L L  + + +
Sbjct: 430 KFMESHQEILEKILQLIRHLIATGAQWQDIWNEIQQQKKNNHPLARHIKSLNLKDDKVKI 489

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           L       S              P  D  +  K    D+ +  S  +N +  +   ++ A
Sbjct: 490 LFSQRDLGS-----------ETTPVVD--QIGKSIEFDLIISKSIQSNIRFQYMESKALA 536

Query: 258 KKQQKTIQSTEKAL--------KSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           +K +KT  + + AL        K AEK +K  + +V     I K R  YWFEKFYWFISS
Sbjct: 537 EKFEKTQLAYKIALKKVTNIAKKDAEKASKGLVSNV---PRIKKLRAQYWFEKFYWFISS 593

Query: 310 ENYLGNVAWDA 320
           + YL     DA
Sbjct: 594 DGYLIIGGHDA 604


>gi|341581973|ref|YP_004762465.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
 gi|340809631|gb|AEK72788.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
          Length = 650

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 86/394 (21%), Positives = 155/394 (39%), Gaps = 106/394 (26%)

Query: 38  EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKI 97
           E RPN    ++ +   H ++      P  +R ++GF  + F +F+ A+DE+F      K 
Sbjct: 226 EARPN---IVLKDGNMHDVV------PIELRIYEGFEKRYFTTFSEALDEYFGKITMEKA 276

Query: 98  DLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD 157
            ++  ++     ++L    R  E  L   E+    ++E  +LI  N   ++  +   R+ 
Sbjct: 277 RVEQTKRLEAKKRQLLMTLRKQEEMLKGFEEGAKANQEIGDLIYANYALIERLLEEFRK- 335

Query: 158 IANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER----SNIKDSWV 213
               L W++ + R+++ +R  + VA ++K        +T+ L   K R     +I ++  
Sbjct: 336 ATETLGWDEFKKRIEEGKRAGNRVALMVKGTDPKEKAVTIELEGKKVRLYLNRSIGENAE 395

Query: 214 LHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKS 273
           L+++ A  F    E            L AY + KR  D      +++ K     E ++K 
Sbjct: 396 LYYEKAKKFRHKHEGA----------LKAYEDTKRKLDEIERLIEEELK----KELSVKR 441

Query: 274 AEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKV----------- 322
            E++ K+                  WFEKF WF+SSE +L     DA             
Sbjct: 442 IERRKKK------------------WFEKFRWFVSSEGFLVLAGKDAGTNEILIKRHMDD 483

Query: 323 -------------------------------------VTNAW----------WVKADQVS 335
                                                ++ AW          W   +QV+
Sbjct: 484 NDLYCHADVYGAPHVVIKDGQKAGEKTIFEACQFAVSMSKAWSRGVYSEDAYWAHPNQVT 543

Query: 336 KTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           K  P+GE+L  G+FM+ GK+N+    P +LA+G+
Sbjct: 544 KQTPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGV 577


>gi|342186351|emb|CCC95837.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1015

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 70/279 (25%), Positives = 128/279 (45%), Gaps = 15/279 (5%)

Query: 31  GYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS 90
           GYL+ K  KR   E  I       P  +      S+  +   + I  F+   A    FF 
Sbjct: 254 GYLL-KSSKRGGQEAMI-------PGTMISALFCSISTRRMLWLINIFQISVAFAMNFFH 305

Query: 91  TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAA 150
             +  +I+    + +   + K E  +R+H  R+  L++++     K  LI  N E++D  
Sbjct: 306 IRKKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRKGHLIFQNTETIDKI 365

Query: 151 ILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKD 210
           I  I + +  ++ W+D  + +KQ +    P+AS+IK++      + +L+    +  + + 
Sbjct: 366 IGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVVVLMNEDADDDDEQT 425

Query: 211 SWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKA 270
                    D    + E      +++DL  +A+ NA+ +F   +S A K ++TI +TEKA
Sbjct: 426 EDEEGEKREDRDRATYE------IEIDLTKTAHTNAEEYFARAKSTAAKLKRTIAATEKA 479

Query: 271 LKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           +  AE+K +      Q    I + R  +W+EKF WF +S
Sbjct: 480 MAGAERKGRTVTGKTQEKKIITE-RCRFWWEKFNWFRTS 517


>gi|302419577|ref|XP_003007619.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261353270|gb|EEY15698.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 1107

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 136/308 (44%), Gaps = 70/308 (22%)

Query: 26  NVSSQGYLVQK-------KEKRPNS------EDYIVANMEFHPMILQP-GSTPSVMRQHQ 71
           + ++ GY++ K        E++P+       ED +  +  FHP + Q     PSV     
Sbjct: 677 SATTTGYVIAKYRQKSEETEEKPDDGAETKREDLLYDD--FHPFLPQKFADDPSVK---- 730

Query: 72  GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQL 131
              +  F+ F   VDEFF  A                        R  ETR     + Q 
Sbjct: 731 ---VLTFDGFNKTVDEFFFLA------------------------RGPETR-----EAQS 758

Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
           ++++KA  I  N E V  A+ A+   +   + W +I   +++ Q+ ++P           
Sbjct: 759 LNEQKAAAIEANVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRHNP----------- 807

Query: 192 INHITLLLR---VHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAK 247
            N +TLLL    V  E    +          D     ++     L ++++L LS +ANA+
Sbjct: 808 -NLMTLLLGTEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEINLGLSPWANAR 866

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV--QTMTNINKARKVYWFEKFYW 305
            ++D +R+AA K+ KT+Q +  ALK+AEKK  + LK    Q    +   RK  WFEKF W
Sbjct: 867 EYYDQRRTAAVKELKTVQHSTMALKNAEKKITEDLKKGLKQEKAVLQPIRKQMWFEKFIW 926

Query: 306 FISSENYL 313
           F+SS+ YL
Sbjct: 927 FLSSDGYL 934



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 27/45 (60%), Positives = 33/45 (73%), Gaps = 1/45 (2%)

Query: 317  AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGK-KNFFPP 360
            AWD+K    AWWV+ADQVSK+APTGE+L   +FM  G  +NF PP
Sbjct: 1000 AWDSKAGMGAWWVRADQVSKSAPTGEYLPAAAFMGAGPGRNFLPP 1044


>gi|401416565|ref|XP_003872777.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322489002|emb|CBZ24251.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 1189

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 72/253 (28%), Positives = 117/253 (46%), Gaps = 17/253 (6%)

Query: 63  TPSVMRQHQGFPIQEF--ESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           TP ++ Q+    ++     SF    DEFF   E+ +ID    +++  A  K +    DH 
Sbjct: 333 TPILLAQYTNDGVEALYRSSFGRVCDEFFLITETERIDASNAKRKNTAKSKEDKFATDHA 392

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            R++ALE     ++ K E +I N + VD AI  I   +A  +SW+ +   +K+      P
Sbjct: 393 RRINALEADIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHP 452

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           VA +I  L L  N I++LL             VL  +  +   D      P +V+V L  
Sbjct: 453 VAYMIHDLFLERNSISVLLEA-----------VLDEEKGEEDCDV----PPLVVEVALSK 497

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
           +A+ANA  +F  ++    K ++T+ +T KA   A  K  +     +    I K R+  W+
Sbjct: 498 TAHANAADYFSKQKHHRSKLERTVAATAKAAAGAALKGARKAAAQKERKVIVKERQRQWW 557

Query: 301 EKFYWFISSENYL 313
           EKF WF ++   L
Sbjct: 558 EKFLWFRTTAGDL 570



 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 6/55 (10%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW+ K  T +WWV A QV+          TG+++  G+++  PP  +++G + LF
Sbjct: 653 AWERKQTTGSWWVYASQVTGGT------ATGAYLYAGERHHLPPQSMSLGCALLF 701


>gi|390960715|ref|YP_006424549.1| hypothetical protein containing fibronectin-binding protein
           [Thermococcus sp. CL1]
 gi|390519023|gb|AFL94755.1| hypothetical protein containing fibronectin-binding protein
           [Thermococcus sp. CL1]
          Length = 649

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 90/390 (23%), Positives = 157/390 (40%), Gaps = 108/390 (27%)

Query: 50  NMEFHP-MILQPGS----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQ 104
           N E  P ++ + G+     P  ++ ++G   + F +F+ A+DE+F      +I ++  + 
Sbjct: 223 NDELRPNIVFKEGNMHDVVPIELKIYEGLEKKYFNTFSEALDEYFG-----RITIEKAKI 277

Query: 105 ERDALKKLENVKRD-------HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD 157
           ER   +KLEN KR         E  L   E     ++E  +LI  N   ++  +   R+ 
Sbjct: 278 ERT--RKLENKKRQLLMTLRKQEEMLKGFEGAMRENQEIGDLIYANYALIERLLDEFRK- 334

Query: 158 IANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHD 217
              +L WE+   R+++ ++  + VA ++K +      +T+ L   K +  +  S     +
Sbjct: 335 ATEKLGWEEFRKRIEEGKKAGNRVAMMVKGINPKEKAVTIELDGKKVKLYLNRSI---GE 391

Query: 218 NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK 277
           NA+ +                    Y  AK+F    R   +   K  + T++ L   EK 
Sbjct: 392 NAELY--------------------YEKAKKF----RHKHEGALKAYEDTKRKLNEVEKL 427

Query: 278 TKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKV--------------- 322
            ++ +K    +  I + RK  WFEKF WFISSE +L     DA                 
Sbjct: 428 IEEEMKKELNVKRIER-RKKKWFEKFRWFISSEGFLVLAGKDASTNEILIKRHMGENDLY 486

Query: 323 ---------------------------------VTNAW----------WVKADQVSKTAP 339
                                            ++ AW          W   +QV+K  P
Sbjct: 487 CHADVYGAPHVVIKDGQKAGERTIFEACQFAVSMSKAWSRGVYSEDAYWAYPNQVTKQTP 546

Query: 340 TGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           +GE+L  G+FM+ GK+N+    P +LA+G+
Sbjct: 547 SGEYLGKGAFMVYGKRNWLHGLPLKLAVGV 576


>gi|389852774|ref|YP_006355008.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
 gi|388250080|gb|AFK22933.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
          Length = 642

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 102/394 (25%), Positives = 174/394 (44%), Gaps = 115/394 (29%)

Query: 48  VANMEFHP-MILQPGST----PSVMRQHQGFPIQEFESFAAAVDEFFS--TAESHKID-L 99
           V N+E +P ++ + GS     P  +  +  +    ++SF+ A+DE+F   T E  K +  
Sbjct: 217 VLNLEKNPNIVYKNGSMVDVLPVDLVWYSNYEKVFYDSFSKALDEYFGKLTIEKAKRERT 276

Query: 100 KAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
           KA++++R AL+   ++KR  E ++   E+    ++E+ +L+  N   V   +  IR+ I 
Sbjct: 277 KALEEKRKALEI--SLKRIEE-QIRGFEKEAQENQERGDLLYANYTLVKEILETIRRGIK 333

Query: 160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
             L  E++  R+++A++   P A+II ++                    KDS V+     
Sbjct: 334 T-LGVEEVVKRIEEAKKKGYPWANIISKVS-------------------KDSLVIE---- 369

Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
                  E KK   + +D++ +   NA+ F++     AKK ++ ++   KA +  +KK +
Sbjct: 370 ------LEGKK---IKLDINKTLEENAEIFYE----KAKKARQKLEGARKAYEETKKKIE 416

Query: 280 QTLKDVQ------TMTNINKARKVYWFEKFYWFISSENYL---GNVA------------- 317
              +++        +  + K RK  WFEKF WFISSE +L   G  A             
Sbjct: 417 NIEQEIMEEEKKIAVKKLEKRRK-KWFEKFRWFISSEGFLVIGGKDATTNEIVVKRHMSE 475

Query: 318 ---------WDAKVV-----------------------TNAW----------WVKADQVS 335
                    W A  V                       + AW          WV  +QVS
Sbjct: 476 NDLYCHADIWGAPHVVIKEGRKASEKTIFEACQFAVSMSRAWSEGLASADAYWVYPEQVS 535

Query: 336 KTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           K AP GE+L  G+FM+ GK+N+    P +LA+GI
Sbjct: 536 KQAPAGEYLPKGAFMVYGKRNWLHGIPLKLAVGI 569


>gi|401410580|ref|XP_003884738.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
 gi|325119156|emb|CBZ54708.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
          Length = 1853

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 67/249 (26%), Positives = 118/249 (47%), Gaps = 26/249 (10%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           F      VDE+FS+ +  K +        +AL ++E ++ D   R+  LE+      E+A
Sbjct: 441 FRDINVCVDEYFSSVDVQKGERAEALARHEALSRVEKIRSDQAQRMQQLEEEAASLLEEA 500

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           + +  N   V+  I  +R  +A  + W+++  ++KQ  +   P+A  +++L+L      L
Sbjct: 501 QAVEANVGLVEQIIQLLRAALATGVDWDELGRQMKQQAKEGHPLAVHVQELRLEKQRALL 560

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAA 257
           LL    E +                     +   ++V +D+ LSA+ NA+      +   
Sbjct: 561 LLEAPGEEA---------------------SGATTVVSIDITLSAHGNAQLLHSQVKQLK 599

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQ-----TMTNINKARKVYWFEKFYWFISSENY 312
            K  KT  +T  AL +A++K ++TLK  +         + K RK +WFEKF+WFISS++Y
Sbjct: 600 AKTLKTSSATAAALAAADRKAQRTLKQKEQQVLQAQQQLQKVRKAFWFEKFHWFISSDHY 659

Query: 313 LGNVAWDAK 321
           L     DA+
Sbjct: 660 LVLAGRDAQ 668



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 34/65 (52%), Positives = 47/65 (72%), Gaps = 1/65 (1%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL-EE 375
           AW+ K    AWWV   QVSK+AP+G +L+TGSFMIRG++NF    +L MG   LF+L +E
Sbjct: 731 AWNTKTPAAAWWVYGHQVSKSAPSGLYLSTGSFMIRGRRNFIQIHRLEMGFGLLFRLADE 790

Query: 376 SSISR 380
           +S++R
Sbjct: 791 ASVAR 795


>gi|448089209|ref|XP_004196743.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|448093427|ref|XP_004197774.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|359378165|emb|CCE84424.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|359379196|emb|CCE83393.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
          Length = 1056

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/212 (30%), Positives = 100/212 (47%), Gaps = 68/212 (32%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-----KDVQTM 288
           V +DL LS++AN++ +FD K++A  KQ K  ++TE AL++AEKK  + L     K+ +T+
Sbjct: 513 VSIDLSLSSFANSRIYFDNKKNAETKQAKVEKNTEIALRNAEKKINRDLSSNLKKESETL 572

Query: 289 TNINKARKVYWFEKFYWFISSENYL---GNVAWDAKVV------TNAWWVKAD------- 332
             I   R  +WFEKFYWF+S+E YL   GN      ++       N ++V +D       
Sbjct: 573 KQI---RPKFWFEKFYWFVSNEGYLCLAGNDDTQTDMIYYRHFNDNDYFVTSDIEGSLKV 629

Query: 333 ------QVSKTAPT-----GEF-----------LTT----------------------GS 348
                 Q  + +P+     G F           +TT                      G+
Sbjct: 630 FVKNPYQGKEVSPSTLTQAGIFSMSASKAWDNKITTSAWYLKGSEVSKKDFDGSLVSFGN 689

Query: 349 FMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           F  +G+K F PP QL MG++F F  +E +  R
Sbjct: 690 FNYKGEKQFLPPSQLVMGLAFYFLGDEETTQR 721



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 82/161 (50%), Gaps = 10/161 (6%)

Query: 41  PNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLK 100
           P+S+D      EFHP   +P            +   E E +   +D +FST ES +  L+
Sbjct: 286 PDSKDLKYIMDEFHPY--KPHK-----ENEDSYQFMEVEGYNKTLDTYFSTIESSRYALR 338

Query: 101 AVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN 160
             QQ+  A K+LE  + + + ++ +L   + ++ +K E II + + ++    ++ Q I  
Sbjct: 339 IEQQKEQARKRLEKARNERDKQIQSLLDQKNLNIKKGEAIIYHADVIEECKESVLQLIRQ 398

Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
           Q+ WE+IE  ++  Q   + +A +IK L LN+  N I +LL
Sbjct: 399 QMDWENIEKLIQLEQTRGNKLAQMIK-LPLNLVQNKINVLL 438


>gi|448508289|ref|XP_003865916.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
 gi|380350254|emb|CCG20475.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
          Length = 654

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 86/379 (22%), Positives = 149/379 (39%), Gaps = 104/379 (27%)

Query: 102 VQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQ 161
           + Q+  A ++L   K   +  + +L+  Q  + +K ELI  +   VD     +++ +  Q
Sbjct: 26  IHQKNLAAERLNKAKTRRDQEIQSLKDEQASNAKKGELIQLHSGLVDECRQYVQRFVDQQ 85

Query: 162 LSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVL------- 214
           + W ++E  +K  ++    +A  I +L L++     ++R+    + I+ +  +       
Sbjct: 86  MDWTNMELAIKLEKKRPGSIARYI-ELPLHLKENRFVIRLKDPENGIESNGKITSDSEPD 144

Query: 215 -------------------------HHDNADPFADSSETKKP--------SLVDVDLDLS 241
                                     ++ +D    S+    P        + V +D  LS
Sbjct: 145 SESDSESDSESDSDSDSDNSSDGDEENERSDSKGTSTSESHPKDKYTNEGTPVSIDYTLS 204

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDV--QTMTNINKARKVY 298
           +YANA  +F+ K++A  KQ K  +  E A K+AEKK  Q L K++  +  T+    R+ +
Sbjct: 205 SYANASVYFENKKAAEAKQTKVEKGAEIAYKNAEKKINQDLVKNLRRENGTSSKSEREKF 264

Query: 299 WFEKFY-------------------------------WFISSE----------NYL---- 313
           WFE FY                               +F+SSE          N L    
Sbjct: 265 WFESFYWFVSSEGYLCLAGRTKSQIDLLYFKYFSDDDFFVSSEIEGSLKVFVKNPLKGES 324

Query: 314 ---------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
                           + AW+ K+ T AW +   +VSK   +G  L  G F    +K+F 
Sbjct: 325 VPPSTILQAGIFAMSASQAWNGKINTAAWVLHGSEVSKYNQSGALLPPGEFEYLARKHFL 384

Query: 359 PPCQLAMGISFLFKLEESS 377
           PP QL MG    F ++E S
Sbjct: 385 PPAQLVMGFGLYFLVDEGS 403


>gi|50293495|ref|XP_449159.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528472|emb|CAG62129.1| unnamed protein product [Candida glabrata]
          Length = 1031

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 61/203 (30%), Positives = 93/203 (45%), Gaps = 62/203 (30%)

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL--KDVQTMT 289
           ++V +DL LSAYANA  +F++K+  A+KQ+K  ++ EKA+K+ E K  + L  K  ++  
Sbjct: 514 TVVAIDLGLSAYANASTYFNMKKDHAEKQKKVEKNIEKAMKNIEDKIGKQLQKKLKESHD 573

Query: 290 NINKARKVYWFEKFYWFISSENYL-----GNV-------------------AWDAKV--- 322
            + K RK Y+FEK++WF S+E +L      NV                   ++D KV   
Sbjct: 574 VLKKIRKPYFFEKYFWFYSTEGFLVMLGKSNVETDQIYSRYIEDDDIFMSNSFDTKVWIK 633

Query: 323 ------------------------------VTNAWWVKADQVSKTAPT-GEFLTTGSFMI 351
                                          ++ WW  A  V+K     G  L  G F +
Sbjct: 634 NPERVEVPPNTLMQAGILCMSASPAWQKKIASSPWWCFAKNVTKFDDVDGSVLAPGVFRL 693

Query: 352 RGKK--NFFPPCQLAMGISFLFK 372
           R +K  N  PP QL MG+ F++K
Sbjct: 694 RNEKQINMLPPAQLVMGVGFMWK 716



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 86/176 (48%), Gaps = 13/176 (7%)

Query: 31  GYLVQKKE-----KRPNSE-DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           GY++ KK        P  + +YI  N  FHP I    +T     + +   I+    +   
Sbjct: 273 GYIIAKKNPLYKIDTPGYDLEYIYEN--FHPFIPHIPATD----EDKSKVIKIEGDYNKT 326

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           +D+FFST ES K  LK   QE+ A +K+E  +++++ R+ AL + Q  ++ K  L+I N 
Sbjct: 327 LDDFFSTIESSKYALKIQNQEQQAKQKIEAARQENKKRIDALREQQASNETKGNLLIANV 386

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
           + V+    A+   +  Q+ W  IE  ++  Q   + +A  +   L L  N I +LL
Sbjct: 387 DLVEEVKSAVLGLVNQQMDWNTIEKLIQSEQNKGNKIAKHVSLPLDLKNNKIKILL 442


>gi|401826788|ref|XP_003887487.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
           50504]
 gi|395460005|gb|AFM98506.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
           50504]
          Length = 648

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 78/352 (22%), Positives = 144/352 (40%), Gaps = 102/352 (28%)

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
           ++F +F  A + FF + +      K  + +R++  K++ V++  E  +  +EQ     ++
Sbjct: 243 RDFPTFNDAAEFFFQSRK------KFGKNDRES--KVDKVRKRQENYMKEMEQEGESYRK 294

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHI 195
           KAEL+  N + V+  +   +    N++ W D E   +Q  R  + ++  I +     +  
Sbjct: 295 KAELLEANADFVNKILDIFKVVKKNKVKWTDFEKFREQENRKGNEISKAIVKTDFISHTC 354

Query: 196 TLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRS 255
           T++L   +                              + +D +++ + N  RF+   + 
Sbjct: 355 TIVLEGEE------------------------------IQIDFEVTLFNNVSRFYQKSKK 384

Query: 256 AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-- 313
             +K  KT  S E+ LK    K      + + +T     R +YWFEKF++F SS+  L  
Sbjct: 385 LEEKIMKTRDSLEEVLKKIAPKV-----ETKKIT-----RALYWFEKFHFFFSSDGVLVI 434

Query: 314 ------------------------GNVAWDAKVVTN------------------------ 325
                                   G++   + ++                          
Sbjct: 435 GGRNAQQNEILVKKHLEPNDLYFHGDMHGSSSIIVKKPTPKTIEEAASMALCMSKCWEAN 494

Query: 326 ----AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
                W+V  +QVSKTAP+GE+LT GSFMI+GKKN+    ++  G+  LFK+
Sbjct: 495 VVSPVWYVYGEQVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546


>gi|68071251|ref|XP_677539.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497695|emb|CAH96713.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 1012

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 32/57 (56%), Positives = 46/57 (80%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           +W+ K++T+AWWV   QVSKTAPTGE++ TGSF+IRGKKN+ P  +L MG+  +F++
Sbjct: 282 SWNNKIITSAWWVYYHQVSKTAPTGEYIKTGSFVIRGKKNYLPYAKLEMGLCIIFQV 338



 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/232 (31%), Positives = 118/232 (50%), Gaps = 26/232 (11%)

Query: 88  FFSTAESHKIDL-KAVQQERDALKKLENVKRDHETRLSALEQTQL-VDKEKAELIINNQE 145
           + +  ES K D  + + + ++AL K++ +K DHE R+    + Q+ + K+K  LI  N E
Sbjct: 7   YLTKMESTKYDKHQEMNKRKNALTKIDKIKLDHERRIEGSTKKQVSILKKKISLIQLNDE 66

Query: 146 SVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKER 205
           SV  AI  +R  I+   +WE I   +K  ++ + P+A  +K + +N N+  + L      
Sbjct: 67  SVGEAIKLMRSAISTSANWEQIWDHIKLFKKRDHPIA--LKIMSVNFNNCEMEL------ 118

Query: 206 SNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
                  +L+ D+ +   D +  K  S  +   D ++     +   L++ A +K +K   
Sbjct: 119 -------LLNDDDIEENGDDNNLKNNSWKEKIADKNS-----KTCTLRKKAEEKIRKIKM 166

Query: 266 STEKALKSAEKKTKQTLKDV----QTMTNINKARKVYWFEKFYWFISSENYL 313
           ST  A+K  EKK K          +++  I K RKV+WFEKF WFISSENYL
Sbjct: 167 STNMAVKKVEKKKKDKDTKQKGKNKSVFQIKKLRKVFWFEKFNWFISSENYL 218


>gi|399216143|emb|CCF72831.1| unnamed protein product [Babesia microti strain RI]
          Length = 933

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 33/55 (60%), Positives = 45/55 (81%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           AW AK +T+AWWV++ QVSKT  TGE+L +GSF+IRGKKNF PP +L MG++ ++
Sbjct: 582 AWKAKFMTSAWWVESSQVSKTPETGEYLPSGSFVIRGKKNFLPPSKLEMGLAVIY 636


>gi|18977764|ref|NP_579121.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
 gi|397651884|ref|YP_006492465.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
 gi|18893505|gb|AAL81516.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
 gi|393189475|gb|AFN04173.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
          Length = 649

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 97/393 (24%), Positives = 164/393 (41%), Gaps = 106/393 (26%)

Query: 38  EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKI 97
           EK+PN    IV N E     +     P  ++ ++ +  + +ESF+ A+DE+F      K 
Sbjct: 227 EKKPN----IVYNKEG----VMVDVVPIDLQWYREYTKRYYESFSEALDEYFGKLTIEKA 278

Query: 98  DLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD 157
            L+  +Q  +  K LE   R  E ++   E+  + ++EK + +  +   V+  IL +   
Sbjct: 279 RLEKTKQLEERRKALEISLRRIEEQIKGFEKEAMTNQEKGDALYAHYSIVN-EILRVISS 337

Query: 158 IANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHD 217
              Q   E+++ R+++ ++   P A +I  + +  N +TL L        IK        
Sbjct: 338 ALKQYGVEEVKKRIEEGKKAGYPWAKMI--IDVTDNKVTLNL------DGIK-------- 381

Query: 218 NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSA 274
                           V +D++ S   NA+ +++  + A KK +      + T++ L   
Sbjct: 382 ----------------VSLDVEKSLEENAELYYERAKKAKKKLEGAKIAYEETKRKLIEL 425

Query: 275 EKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---GNVA-------------- 317
           EK+ ++  K++       K +K  WFEKF WFISSE +L   G  A              
Sbjct: 426 EKEIERESKEINIKKITRKKKK--WFEKFRWFISSEGFLVIGGKDATTNEIVVKKHMDEN 483

Query: 318 --------WDAK-----------------------VVTNAW----------WVKADQVSK 336
                   W A                         ++ AW          WV  +QVSK
Sbjct: 484 DIYCHADIWGAPHVIIKNGRNASEKTIREACQFAVAMSRAWSEGLASADAYWVYPEQVSK 543

Query: 337 TAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
            AP GE+L  G+FM+ GK+N+    P +LA+GI
Sbjct: 544 QAPAGEYLPKGAFMVYGKRNWIHGIPLKLAVGI 576


>gi|154418675|ref|XP_001582355.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121916590|gb|EAY21369.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 875

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 115/251 (45%), Gaps = 27/251 (10%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
           +P  + Q+     Q F++F  A DEF+S  E  +   +  + E    KK+++VK++ + +
Sbjct: 283 SPFPLAQYDPSQSQVFDTFDKACDEFWSVRELERAQKEHKENEAAPDKKVQSVKKNFDKK 342

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
               +    +      LI  N   ++     I   IAN++ W++I   ++  Q   + +A
Sbjct: 343 RKQFQDELDLLNRTGHLIQANATQIEQCRNVINSFIANRVRWDEIRMSIRAYQECGNELA 402

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
           S+I ++               E+S     + L +D         E  K   + ++L  +A
Sbjct: 403 SMIDKVDF-------------EKSGF---YCLVND---------EEGKTERIFIELKKTA 437

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           YANA  +FD +    KK +      E+ LK  EK      K V   + I + RK +WFE+
Sbjct: 438 YANASAYFDKRAVLVKKLEGANAKEEEVLKKVEKDAIAAKKKV--TSTIQERRKTWWFER 495

Query: 303 FYWFISSENYL 313
           F+WFI++ENYL
Sbjct: 496 FHWFITTENYL 506


>gi|212223298|ref|YP_002306534.1| fibronectin-binding protein [Thermococcus onnurineus NA1]
 gi|212008255|gb|ACJ15637.1| predicted fibronectin-binding protein [Thermococcus onnurineus NA1]
          Length = 649

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/365 (21%), Positives = 144/365 (39%), Gaps = 89/365 (24%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ ++ F  + F +F+ A+DE+F      K  ++  ++     ++L    +  E  
Sbjct: 241 VPVELKVYENFEKRYFSTFSEALDEYFGKVTLEKAKIEQTKKLEAKKRQLLMTLKKQEEL 300

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   E+    ++E  +LI  N   V+  +   R+    +L WE+ + R+ + ++  +  A
Sbjct: 301 LKGFEEQAKANQEIGDLIYANFTMVERLLDEFRK-ATERLGWEEFKKRIDEGKKAGNKAA 359

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            ++K +      +T+ L   K R  +  S     +NA+ + + ++  K  L      L A
Sbjct: 360 LMVKSIDPKEKAVTIELEGKKVRLYLNKSI---GENAELYYEKAKKAKHKLEGA---LKA 413

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           Y + KR  D                       EK  ++ +K    +  I + RK  WFEK
Sbjct: 414 YEDTKRKLD---------------------EIEKLIEEEMKKELAVKRIER-RKKKWFEK 451

Query: 303 FYWFISSENYLGNVAWDAKV---------------------------------------- 322
           F WF+SSE +L     DA                                          
Sbjct: 452 FRWFVSSEGFLVLAGKDASTNENLIKKHMDENDLYCHADVYGAPHVVIKDGQKAGEKTIF 511

Query: 323 --------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQ 362
                   ++ AW          W   +QV+K AP+GE+L  G+FM+ GK+N+    P +
Sbjct: 512 EACQFAVSMSKAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLK 571

Query: 363 LAMGI 367
           LA+G+
Sbjct: 572 LAVGV 576


>gi|78395025|gb|AAI07765.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 458

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 36/47 (76%), Positives = 40/47 (85%)

Query: 334 VSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           VSKTAPTGE+LTTGSFMIRGKKNF PP  L MG SFLFK++ES + R
Sbjct: 2   VSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWR 48


>gi|76156824|gb|AAX27946.2| SJCHGC07203 protein [Schistosoma japonicum]
          Length = 184

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 39/52 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW   V+T AWWV  DQVSKTAP+GE+LT+GSF+IRGKKN+ PPC    G  
Sbjct: 133 AWQNHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFG 184



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 31/43 (72%)

Query: 271 LKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           LKSA  K + T+K  +T+  I + RK  WFEKF+WFISSENYL
Sbjct: 8   LKSAIHKAEATMKTAKTIAQITEVRKPMWFEKFFWFISSENYL 50


>gi|332157694|ref|YP_004422973.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
 gi|331033157|gb|AEC50969.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
          Length = 650

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 89/370 (24%), Positives = 158/370 (42%), Gaps = 100/370 (27%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  +R + G+    F++F+ A+DE+F      K   +  ++  +  K+L    +  E  
Sbjct: 243 VPIDLRWYDGYEKVYFDTFSKALDEYFGKLTIEKAREEKTKKLEEKKKQLIATLKRQENM 302

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +   ++    ++E A+LI  N + VD  +  + + +  +L WE++  RV++ ++  + +A
Sbjct: 303 IKGFKEEMRRNQEIADLIYANYQLVDNLLKELSKAV-ERLGWEELIRRVEEGKKKGNRIA 361

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +IK +    N +T+ +                           E KK  L  +D D++ 
Sbjct: 362 MMIKSINPQENSVTIEI---------------------------EDKKVRLY-IDRDIN- 392

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV----- 297
             NA+ +++     AKK +  ++  +KA +  +KK +Q  K+++      + +K+     
Sbjct: 393 -ENAEIYYE----KAKKAKHKLEGAKKAYEELKKKLEQVEKEIEEEEKKVQVKKIERRKK 447

Query: 298 YWFEKFYWFISSENYL---GNVA----------------------WDAKVV--------- 323
            WFEKF WFISSE +L   G  A                      W A  V         
Sbjct: 448 KWFEKFRWFISSEGFLVIGGKDATTNEIVVRKYMGENDIYCHADIWGAPHVIIKDGRRAS 507

Query: 324 --------------TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                         + AW          WV  +QV K AP+GEFL  G+FM+ GK+N+  
Sbjct: 508 EKTIFEACQFAVSMSRAWSEGLYSADAYWVYPEQVKKQAPSGEFLPKGAFMVYGKRNWMH 567

Query: 360 --PCQLAMGI 367
             P +LA+GI
Sbjct: 568 GIPLKLAVGI 577


>gi|150865765|ref|XP_001385110.2| highly conserved hypothetical protein Predicted RNA-binding
           [Scheffersomyces stipitis CBS 6054]
 gi|149387021|gb|ABN67081.2| conserved hypothetical protein Predicted RNA-binding protein
           [Scheffersomyces stipitis CBS 6054]
          Length = 1038

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 60/183 (32%), Positives = 94/183 (51%), Gaps = 18/183 (9%)

Query: 26  NVSSQGYLVQKK--------EKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQE 77
           N  + G++V KK        E   N  +Y+    EFHP   +P       +  +G+   E
Sbjct: 270 NEQTNGFIVSKKNPLYNPSEEHSDNDLEYVYD--EFHP--FKP-----FKKNLEGYKFTE 320

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
            E +   +D FFS  ES K  LK  QQ+++A K+LEN + +   ++ +L Q Q  + +K 
Sbjct: 321 IEGYNKTLDTFFSALESTKFALKIEQQKQNANKRLENARSERNKQIQSLIQQQETNSKKG 380

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHIT 196
           + II + + V + I AI++ +  Q+ W +IEA VK  Q   + + S IK  L LN N I 
Sbjct: 381 DTIIYHADLVASCISAIQKMLDKQMDWGNIEAIVKHEQSSGNEIMSTIKLPLNLNENKIN 440

Query: 197 LLL 199
           L+L
Sbjct: 441 LVL 443



 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 63/207 (30%), Positives = 86/207 (41%), Gaps = 67/207 (32%)

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN----I 291
           +DL LS YANA+ +F+ K+SA  K++K  ++TE ALK+AE+K KQ L     + N    +
Sbjct: 523 IDLSLSPYANARLYFESKKSAESKKEKVEKNTEMALKNAERKIKQDL--AHNLKNEHDTL 580

Query: 292 NKARKVYWFEKF------------------------YWFISSENYL-------------- 313
            + R  YWFEKF                        Y F +  ++               
Sbjct: 581 KQLRPKYWFEKFYWFVSSEGYLCLAGRDPSQTDMIYYRFFNDNDFFVSAEMEGSLKVFVK 640

Query: 314 ----------------GN------VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMI 351
                           GN       AW  KV T+AW +    VSK    G  L  G F  
Sbjct: 641 NPFKGESVPPYTLMQAGNFAKSTSTAWSGKVSTSAWVLHGSDVSKKDFDGSLLAGGEFNY 700

Query: 352 RGKKNFFPPCQLAMGIS-FLFKLEESS 377
           + KK F PP QL MG   +L   EE++
Sbjct: 701 KSKKEFLPPTQLTMGFGLYLLGDEETA 727


>gi|409095360|ref|ZP_11215384.1| Fibronectin-binding protein A (FbpA) [Thermococcus zilligii AN1]
          Length = 650

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 83/365 (22%), Positives = 149/365 (40%), Gaps = 89/365 (24%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ + G   + F +F+ A+DE+F      K  ++  Q+     K+L       E  
Sbjct: 242 VPIELKVYGGLEKKYFSTFSEALDEYFGRITVEKARIEQTQKLEAKKKQLLTTLMMQEEM 301

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   E+    ++E  +LI  N   V+  +   ++    +L WE+ + R+++ ++  + VA
Sbjct: 302 LRGFEKAMKENQELGDLIYANYPVVERLLEEFKR-ATEKLGWEEFKKRIEEGKKAGNRVA 360

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            ++K++      +T+ L    E   +K    LH D +    +++E               
Sbjct: 361 LMVKEIDPKEKAVTVEL----EGKEVK----LHVDRS--LGENAEL-------------Y 397

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
           Y NAK+F    R   +   K  + T + ++  EK  ++ +K    +  I + RK  WFEK
Sbjct: 398 YENAKKF----RHKYEGALKAYEDTRRKIEEIEKLIEEEMKKELNVRRI-EGRKKRWFEK 452

Query: 303 FYWFISSENYLGNVAWDAKV---------------------------------------- 322
           F WF+SSE +L     DA                                          
Sbjct: 453 FRWFVSSEGFLVLAGKDANTNETLVKKHMDKNDLYCHADVYGAPHVVIKDGQKAGEKTIF 512

Query: 323 --------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQ 362
                   ++ AW          W   +QV+K AP+GE+L  G+FM+ GK+N+    P +
Sbjct: 513 EACQFAVSMSRAWSQGLYSADAYWAYPEQVTKQAPSGEYLGKGAFMVYGKRNWLHGLPLK 572

Query: 363 LAMGI 367
           LA+G+
Sbjct: 573 LAVGV 577


>gi|255722283|ref|XP_002546076.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136565|gb|EER36118.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 857

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 97/194 (50%), Gaps = 16/194 (8%)

Query: 11  IHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE---DYIVANMEFHPMILQPGSTPSVM 67
           +H C      L+S     +QGY+V +K K  + +   DYI    EFHP   +P  +    
Sbjct: 96  LHVCEDKYMDLISG-KTETQGYIVSRKNKNASEDSEFDYICD--EFHP--FKPYKSNVT- 149

Query: 68  RQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALE 127
                    E   +   VD+FFST ES K  LK  QQ+ +A K+LE  K + + ++ +L 
Sbjct: 150 ----DLKFTEVSGYNKTVDQFFSTLESSKFSLKIEQQKENASKRLEKAKSERDKQIESLV 205

Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
             Q ++ +K ELI  + E V+     ++Q +  Q+ W +IE  +   Q+ N+P +  I Q
Sbjct: 206 AQQQLNSKKGELIQYHSELVEECRRYVQQYLDQQMDWTNIETVIALEQKKNNPTSKSI-Q 264

Query: 188 LKLNI--NHITLLL 199
           L LN+  N I +LL
Sbjct: 265 LPLNLKDNKIKVLL 278



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 35/67 (52%)

Query: 314 GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
            + AW+ KV T+AW +   ++SK    G  +  G F    KK + PP QL MG+ F   +
Sbjct: 482 ASTAWNGKVTTSAWVLHGTEISKRDFDGSIVPDGEFKYLAKKEYLPPAQLVMGLGFYCLV 541

Query: 374 EESSISR 380
           +E S  +
Sbjct: 542 DEESTKK 548


>gi|85000891|ref|XP_955164.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65303310|emb|CAI75688.1| hypothetical protein, conserved [Theileria annulata]
          Length = 1185

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 32/59 (54%), Positives = 44/59 (74%)

Query: 316 VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
            AW+ K    +WWV   QVSKT PTGE++  GSF+IRGKKN+ PP +L MGI++LF+++
Sbjct: 613 TAWNEKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLPPQKLEMGITYLFQVQ 671



 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 70/258 (27%), Positives = 123/258 (47%), Gaps = 25/258 (9%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR-LSALEQTQLVDKEK 136
           FE F  AVD FF+  E  K + K+V +    + K   +K D   R L+ +E  Q +D  K
Sbjct: 274 FEDFNDAVDTFFTKHELAKQEKKSVDKRPTKINK---IKIDQNKRELNLMEDIQKID-SK 329

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
            +L+  + +  +  +   +  IA+  SW DI  +++  ++ N P+   IK+  ++I   T
Sbjct: 330 IKLLEEHVDVAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKE--IHIPTQT 387

Query: 197 LLLRV------HKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF 250
           L+         H E++             +   +   T+   +V++D  L+++ N K+ +
Sbjct: 388 LIFYSNQNQDQHNEQNKQNQFQQNIQQKNENKQNKKNTRDEVVVELDYRLNSHQNLKKLY 447

Query: 251 DLKRSAAKKQQKTIQSTEKALKSA--------EKKTKQTLKDVQTMTNINKARKVYWFEK 302
           + ++    K ++T    E ALK           KKT +  +DV+    I+  R+ +WFEK
Sbjct: 448 NERKRLENKLERTRIGKEYALKKVTKSLKKEENKKTDKKGRDVK----ISSVRRRFWFEK 503

Query: 303 FYWFISSENYLGNVAWDA 320
           FYWFI+S+ YL     DA
Sbjct: 504 FYWFITSQGYLVLAGRDA 521


>gi|159906014|ref|YP_001549676.1| hypothetical protein MmarC6_1632 [Methanococcus maripaludis C6]
 gi|159887507|gb|ABX02444.1| protein of unknown function DUF814 [Methanococcus maripaludis C6]
          Length = 680

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 99/373 (26%), Positives = 152/373 (40%), Gaps = 113/373 (30%)

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDK 134
           I+ +ESF  A+DE+FS      I  K ++Q    L+KL  VK+      S LE  +  +K
Sbjct: 276 IKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQERILKSQLETKEKYEK 329

Query: 135 E------KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
           +      + +LI  N   VD  +  I+     ++ WE I+  +K+ + H  PV S  K +
Sbjct: 330 QSRSNHKRGDLIYANYSFVDEIVSTIKL-AREKMGWEGIKNVIKENKTH--PVLS--KII 384

Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKR 248
            +N  +  L+L++  +  N      L  DN               V VDL  +A+ NA  
Sbjct: 385 NVNEKNAELMLKLSADYGN-----GLIEDN---------------VPVDLRKNAFENA-- 422

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----------INKARKV 297
             D+    +KK +  +Q   +ALK +EKK  + LKD + + +             + + +
Sbjct: 423 --DIVYQKSKKFKNKVQGVIEALKISEKKLAE-LKDKEKLDSEVLKEKEENIKKKERKVL 479

Query: 298 YWFEKFYWF--------------------------------------------------I 307
            W+EK  W                                                   I
Sbjct: 480 KWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEI 539

Query: 308 SSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
             EN L  VA  A   + AW          WV+ DQ+SKTA +GE+L  G+F+IRGK+NF
Sbjct: 540 PDENILFEVAKFASSHSRAWKLGVGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNF 599

Query: 358 FPPCQLAMGISFL 370
                L +GI  L
Sbjct: 600 IRSAALELGIGML 612


>gi|347828081|emb|CCD43778.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 430

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 34/56 (60%), Positives = 41/56 (73%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           +AWWV ADQVSK+APTGEFL  GSF   GKKNF PP QL +G   LF++ + S +R
Sbjct: 2   SAWWVTADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDESKAR 57


>gi|71027701|ref|XP_763494.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350447|gb|EAN31211.1| hypothetical protein, conserved [Theileria parva]
          Length = 1249

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 32/58 (55%), Positives = 43/58 (74%)

Query: 316 VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
            AW+ K    +WWV   QVSKT PTGE++  GSF+IRGKKN+ PP +L MGI++LF++
Sbjct: 598 TAWNEKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLPPQKLEMGITYLFQV 655



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 79/256 (30%), Positives = 124/256 (48%), Gaps = 26/256 (10%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           FE F  AVD FF+  E  K + K  Q ++    KL  +K D + R    EQ  + D  K 
Sbjct: 275 FEDFNDAVDAFFTKHELAKQE-KKTQDKKPT--KLNKIKIDQDKR----EQKLVEDIRKL 327

Query: 138 ELIINN-QESVDAA---ILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNIN 193
           +L I   +E+VD A   +   +  IA+  SW DI  +++  ++ N P+   IK+    IN
Sbjct: 328 DLEIKLLEENVDIAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKE----IN 383

Query: 194 HITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP----SLVDVDLDLSAYANAKRF 249
             T  L  H   S   D       +  P    +++K      + V +D  L+++ N K+ 
Sbjct: 384 IPTQTLIFHNPISG-SDQLSQGGQSGKPGKSGTQSKLSKDLTASVSLDYRLNSHQNLKKL 442

Query: 250 FDLKRSAAKKQQKTIQSTEKALKSAEK-----KTKQTLKDVQTMTNINKARKVYWFEKFY 304
           ++ ++    K ++T    E ALK   K     +TK+T K+ + +  I+  RK +WFEKFY
Sbjct: 443 YNERKRLENKLERTKIGKEYALKKVTKSLKKQETKKTDKNKRDV-RISSVRKRFWFEKFY 501

Query: 305 WFISSENYLGNVAWDA 320
           WFI+S+ YL     DA
Sbjct: 502 WFITSQGYLVLAGRDA 517


>gi|134045609|ref|YP_001097095.1| hypothetical protein MmarC5_0566 [Methanococcus maripaludis C5]
 gi|132663234|gb|ABO34880.1| protein of unknown function DUF814 [Methanococcus maripaludis C5]
          Length = 680

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 99/388 (25%), Positives = 155/388 (39%), Gaps = 116/388 (29%)

Query: 63  TPSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDH 119
           +P  ++ H+      I+ +ESF  A+DE+FS      I  K ++Q    L+KL  VK+  
Sbjct: 261 SPIDLKMHENLKENEIKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQE 314

Query: 120 ETRLSALEQTQLVDKE------KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
               S LE  +  +K+      + +LI  N   VD  +  I+ D   ++ W  I+  +K+
Sbjct: 315 RILKSQLETKEKYEKQSLSNHKRGDLIYANYSLVDEIVGTIK-DAREKMDWNGIKKIIKE 373

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
            + H  P+ S I    +N+N             N + +  L  D  +   + +       
Sbjct: 374 NKTH--PILSKI----INVNE-----------KNAELTLKLSADYGNGLIEDT------- 409

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V VDL  +A+ NA    D+    +KK +  +Q   +ALK +EKK  + LKD + + +   
Sbjct: 410 VPVDLRKNAFENA----DIVYQKSKKFKHKVQGVIEALKISEKKLAE-LKDKEKLDSEIL 464

Query: 294 ARK-----------VYWFEKFYWF------------------------------------ 306
             K           + W+EK  W                                     
Sbjct: 465 KEKEEKIKKKERKVLKWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLME 524

Query: 307 --------------ISSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGE 342
                         I  EN L  VA  A   + AW          WV+ DQ+SKTA +GE
Sbjct: 525 GAPFTIIRTEGSEEIPDENVLFEVAKFASSHSRAWKLGIGSADVYWVRPDQISKTAESGE 584

Query: 343 FLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           +L  G+F+IRGK+NF     L +GI  L
Sbjct: 585 YLKKGAFVIRGKRNFIRSAALELGIGML 612


>gi|147920849|ref|YP_685344.1| hypothetical protein RCIX612 [Methanocella arvoryzae MRE50]
 gi|110620740|emb|CAJ36018.1| conserved hypothetical protein [Methanocella arvoryzae MRE50]
          Length = 670

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 83/365 (22%), Positives = 148/365 (40%), Gaps = 95/365 (26%)

Query: 69  QHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQ 128
           + +G+    FE+F  AVD +F      +     V+++ + L   E   R  +  ++  E+
Sbjct: 259 EGEGYEKVYFETFNKAVDAYFGARIKTEAKAAIVEKKAEKLGVFERRLRQQQDAIAKFER 318

Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
            +  +  K E+I    + V+  I  I+       SW+DI   +K A++  +  A+ I+ +
Sbjct: 319 EEQENARKGEVIYAEYQKVEEIIKVIKGARDRGYSWDDIRKILKDAKKAGNQAAAAIQAI 378

Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKR 248
                 IT++L                                + V++D+ L+   NA+ 
Sbjct: 379 DSATGLITVVL------------------------------PEATVNIDVKLTVPQNAQA 408

Query: 249 FFD-LKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
           ++D +K+  AKK+   K I+ T KA+  A+ K  +  K VQ   +  K RK  W+++F W
Sbjct: 409 YYDKVKKVQAKKEGALKAIEETRKAMAKAQPKVAEPGKPVQKKVS-AKPRKPKWYDRFRW 467

Query: 306 FIS-------------------------------------------------SENYLGNV 316
           F +                                                 +E  L  V
Sbjct: 468 FFTSDGFLVVAGRDADTNEEIVKKYMEKNDVFFHAQAHGAPITVLKTAGKPVTEQALAEV 527

Query: 317 A---------WDA-KVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL--A 364
           A         W A +   + +WVK +QVSKT   GE++  G+F++RG++N+    Q+  A
Sbjct: 528 AQFAVSYSSVWKAGQFSGDCYWVKPEQVSKTPEPGEYVAKGAFIVRGERNYVKDVQVRAA 587

Query: 365 MGISF 369
           +GI F
Sbjct: 588 IGIRF 592


>gi|403222989|dbj|BAM41120.1| uncharacterized protein TOT_030000383 [Theileria orientalis strain
           Shintoku]
          Length = 1119

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/279 (27%), Positives = 130/279 (46%), Gaps = 27/279 (9%)

Query: 65  SVMRQHQGFPIQE---FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHET 121
            VM     F +Q     E +  A+D FF+  E  K + K    E     KL  +K D + 
Sbjct: 255 GVMDDFGPFELQNAEYHEDYNYALDAFFTKNELVKQEKKT---ESKKPTKLTKIKADQDK 311

Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
           R S L +  +   ++  ++  N + VD  +   +  IA+  SW DI  +++  ++ N P+
Sbjct: 312 RESKLMEEIMGYDKQIRVLEENIDIVDNCLNLTKALIASGASWNDIYEQLQIQRKQNHPL 371

Query: 182 ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
              IK++  NI + TL+  V     N +D         +P  +  E  +  +V +D  L+
Sbjct: 372 VCYIKEI--NIPNQTLVF-VSNPEGNERDD--------EP--ERKELVEEQVVVLDYRLT 418

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALK----SAEKKTKQTLKDVQTM-TNINKARK 296
            Y N K+F+  ++ A  K ++T    E ALK    S  K+ +    D +T    I+  RK
Sbjct: 419 GYQNLKKFYINRKKAENKLERTKIGKEYALKKVAKSLSKQPEVKKGDRRTREVKISSLRK 478

Query: 297 VYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKAD 332
            +WFEKFYWFI+S+ YL   G  +   +++   +  K D
Sbjct: 479 RFWFEKFYWFITSQGYLVLAGRDSLQNELLVKKYLTKGD 517



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 31/57 (54%), Positives = 42/57 (73%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AW+ K    +WWV   QVSKT PTGE++  GSF+IRGKKN+  P +L MGI++LF++
Sbjct: 584 AWNDKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLQPQKLEMGITYLFQV 640


>gi|57641373|ref|YP_183851.1| fibronectin-binding protein [Thermococcus kodakarensis KOD1]
 gi|57159697|dbj|BAD85627.1| predicted fibronectin-binding protein [Thermococcus kodakarensis
           KOD1]
          Length = 650

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/367 (22%), Positives = 146/367 (39%), Gaps = 93/367 (25%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ ++GF  + F +F+ A+DE+F      K  ++  ++  +  + L    R  E  
Sbjct: 242 VPIELKIYEGFEKRYFPTFSEALDEYFGKITLEKAKIEQTKKLEEKKRGLMATLRKQEEM 301

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   E+    ++E  +LI  N   ++  +   R+     L W++   R+ + ++  + VA
Sbjct: 302 LKGFEKAMRENQEIGDLIYANYTLIERLLEEFRK-ATETLGWDEFRRRIDEGKKTGNKVA 360

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            ++K +      +T+ L   K +  ++ S     +NA+ +                    
Sbjct: 361 LMVKGIDPKEKAVTIELDGKKVKLYLEKSL---GENAEIY-------------------- 397

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA--RKVYWF 300
           Y  AK+F      A K  + T +  E+  K  E++ K+ L       N+ K   RK  WF
Sbjct: 398 YEKAKKFRHKYEGALKAYEDTKRKLEEIEKLIEEEQKKEL-------NVKKLERRKRKWF 450

Query: 301 EKFYWFISSENYLGNVAWDAKV-------------------------------------- 322
           EKF WF+SSE +L     DA                                        
Sbjct: 451 EKFRWFVSSEGFLVLAGKDASTNEVLVKKHMEDNDLYCHADVYGAPHVVIKDGQKAGEKT 510

Query: 323 ----------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--P 360
                     ++ AW          W   +QV+K AP+GE+L  G+FM+ GK+N+    P
Sbjct: 511 IFEACQFAVSMSRAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWMHGLP 570

Query: 361 CQLAMGI 367
            +LA+G+
Sbjct: 571 LKLAVGV 577


>gi|260942807|ref|XP_002615702.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
 gi|238850992|gb|EEQ40456.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
          Length = 605

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 92/213 (43%), Gaps = 63/213 (29%)

Query: 231 PSL-VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDVQTM 288
           P+L VD+DL LSA+ANA  +F+ K+ A  KQ +  ++T+ ALK+AE+K +  L K+++  
Sbjct: 71  PTLTVDIDLALSAFANASVYFESKKVAVTKQTRVEKNTKIALKNAERKIQSDLNKNLKNE 130

Query: 289 TNINKA-RKVYWFEKFYWFISSENYLGNVAWD--------------------------AK 321
           T   +A R  +WFEK++WF +S+ YL     D                          AK
Sbjct: 131 TESLRAFRHKFWFEKYFWFTTSDGYLCLAGRDDLQTDMIYYRHFSDGDYFVSSDLDGAAK 190

Query: 322 V-------------------------VTNAW---------WVKADQVSKTAPTGEFLTTG 347
           V                          + AW         W+    V+K    G  L  G
Sbjct: 191 VFILNPYKAQNVSPSALFQAGIFALSTSTAWSAKISSSAWWMSGADVTKREFDGSLLGPG 250

Query: 348 SFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
               + KKN+ PP Q+ MG  F +  +E +  +
Sbjct: 251 ILKYKAKKNYMPPAQMVMGFGFYWLCDEETTQK 283


>gi|19074389|ref|NP_585895.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi GB-M1]
 gi|19069031|emb|CAD25499.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329389|gb|AGE95661.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi]
          Length = 648

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/377 (22%), Positives = 152/377 (40%), Gaps = 103/377 (27%)

Query: 53  FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
           +  +I++ G  PS +   +    ++FE+F  A + +F + +      K  + +R++  K+
Sbjct: 221 YGGVIMEKGK-PSSLVPFKTAGARDFETFNEAAEFYFQSRK------KFGKNDRES--KV 271

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + V++  E  +  +EQ   + + KAEL+  N + V+  +   +    N++ W D E    
Sbjct: 272 DKVRKRQEEYVKEMEQQGELLRRKAELLERNSKLVNRILDIFKVVKKNRIKWTDFEKFWG 331

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           Q  +  + V+  I         +      HK        W++                  
Sbjct: 332 QENKKGNEVSKAI---------VKTDFMAHK-------CWIVLEGEE------------- 362

Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
            +++D D S ++N    +   +   +K ++T  S E+ LK    K +      + +T   
Sbjct: 363 -IEIDFDSSLFSNISGLYQKSKKLEEKIRRTRDSLEEVLKRIAPKIES-----KKIT--- 413

Query: 293 KARKVYWFEKFYWFISSENYL--------------------GNVAWD------------- 319
             R  YWFEKF++F SS+  L                    G++ +              
Sbjct: 414 --RAPYWFEKFHFFFSSDGVLVIGGKNAQQNEILVKKHLEPGDLYFHSDMHGSSSIIVKK 471

Query: 320 ------------AKVVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
                       A  ++  W         +V  DQVSKTAP+GE+L  GSFMI GKKN+ 
Sbjct: 472 ATQKTIEEAASMALCMSKCWEANVVSPVWYVYGDQVSKTAPSGEYLKKGSFMITGKKNYV 531

Query: 359 PPCQLAMGISFLFKLEE 375
              ++  G+  LF++ E
Sbjct: 532 ECHRIEYGLGLLFRVSE 548


>gi|303389736|ref|XP_003073100.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
           50506]
 gi|303302244|gb|ADM11740.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
           50506]
          Length = 648

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 83/377 (22%), Positives = 148/377 (39%), Gaps = 103/377 (27%)

Query: 53  FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
           +  +I++ G  PS +   +    ++F +F  A + FF   +      K  + +R+   K+
Sbjct: 221 YGGVIMEKGK-PSNLIPFKADGAKDFNTFNDAAEYFFQGRK------KFGKNDRET--KV 271

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + V++  E  +  +EQ     ++KAEL+  N + V+  +   +    N++ W D E   +
Sbjct: 272 DKVRKRQENYMKEMEQQGECYRKKAELLEKNADLVNRILEIFKVVRKNKVKWTDFEKFRE 331

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           Q  +    V+  I +    ++H                 W+       P           
Sbjct: 332 QENKKGSEVSKAIVKTDF-VSHTC---------------WITLEGEEIP----------- 364

Query: 233 LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN 292
              +D ++S + N   F+   +   +K +KT  S  + LK    K      + + +T   
Sbjct: 365 ---IDFNISLFNNVSEFYQKSKKLEEKIRKTRDSLGEVLKKIAPKV-----ETKKIT--- 413

Query: 293 KARKVYWFEKFYWFISSENYL---GNVAWD------------------------------ 319
             R +YWFEKF++F SS+  L   G  A                                
Sbjct: 414 --RTLYWFEKFHFFFSSDGVLVIGGKTAQQNEILVKKHLEPTDLYFHSDVHGASSIIVKK 471

Query: 320 ------------AKVVTNAW---------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
                       A  ++  W         +V  +QVSKTAP+GE+L  GSFMI+GKKN+ 
Sbjct: 472 PTEKTIVETASMALCMSRCWETNVVSPVWYVYGEQVSKTAPSGEYLGKGSFMIKGKKNYV 531

Query: 359 PPCQLAMGISFLFKLEE 375
              ++  G+  LF++ E
Sbjct: 532 DCHKIEYGLGLLFRVFE 548


>gi|294658357|ref|XP_002770767.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
 gi|202953070|emb|CAR66294.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
          Length = 1064

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 88/207 (42%), Gaps = 62/207 (29%)

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--INK 293
           +D+ LS +ANA+ +F+ K+SA  KQ K  +STE ALK+A+KK +Q L +     N  + +
Sbjct: 516 IDISLSPFANARVYFESKKSAESKQIKVEKSTEFALKNAKKKIEQDLNNKLKNENDSLKQ 575

Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAKVV---------TNAWWVKAD-----QV----- 334
            R  YWFEKF WF+SSE YL     D   +          N +++ +D     +V     
Sbjct: 576 IRPKYWFEKFLWFVSSEGYLCLAGRDNSQIDMIYYRHFNDNDYFISSDIEGSLKVFIKNP 635

Query: 335 --SKTAPTGEFLTTGSFMI---------------------------------------RG 353
              ++ P    +  G F I                                       + 
Sbjct: 636 FKGESIPPSTLMQAGIFAISASSAWNGKVTTSAWLLHGADISKKDFDGTLISSGNFNYKA 695

Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
           KK + PPCQL MG  F +  +E +  +
Sbjct: 696 KKTYLPPCQLIMGFGFYWLGDEETTKK 722



 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 88/179 (49%), Gaps = 20/179 (11%)

Query: 31  GYLVQKK--EKRPNSED------YIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           GY+V KK     P+++D      YI+   EF+P   +P  +         +   E + + 
Sbjct: 273 GYIVSKKNLSYNPDNDDSTNDLEYIMD--EFYPY--KPYKS-----DMDNYRFTEIQGYN 323

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
             +D FFST ES K  L+  QQ++ A K+L+  + + + ++ +L   Q  + +K + I+ 
Sbjct: 324 RTMDSFFSTIESTKYALRIDQQKQQATKRLDYAREERDKQIQSLLAQQESNIKKGDAIMY 383

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
             + VD    ++ + I  Q+ W +IE+ ++  Q   + +A  I  L LN+  N I L L
Sbjct: 384 YADLVDQCKDSVVKLIDQQMDWTNIESLIELEQSRGNKIARFI-NLPLNLKENKINLHL 441


>gi|354544800|emb|CCE41525.1| hypothetical protein CPAR2_800770 [Candida parapsilosis]
          Length = 661

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 86/207 (41%), Gaps = 63/207 (30%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDV--QTMTN 290
           V +D  LS+YANA  +F+ K++A  KQ K  +  E A K+AEKK  Q L K++  +  T+
Sbjct: 200 VSIDYTLSSYANASIYFESKKAAESKQAKIEKGAEIAYKNAEKKINQDLVKNLRRENGTS 259

Query: 291 INKARKVYWFEK------------------------FYWFISSENYL------------- 313
            N  R+ +WFE                         ++ + S +++L             
Sbjct: 260 SNAEREKFWFESFYWFVSSEGYLCLAGRSKSQTDLLYFKYFSDDDFLVSSEIEGSLKVFV 319

Query: 314 -----------------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFM 350
                                   + AW+ K+ T AW +   ++SK   +G  L  G F 
Sbjct: 320 KNPLKGESVPPTTILQAGIFAMAASQAWNGKINTAAWVLHGSEISKYNSSGALLPAGEFE 379

Query: 351 IRGKKNFFPPCQLAMGISFLFKLEESS 377
              KK+F PP QL MG    F ++E S
Sbjct: 380 YLAKKHFLPPAQLVMGFGLYFLVDEGS 406


>gi|344304197|gb|EGW34446.1| hypothetical protein SPAPADRAFT_70556 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 865

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 89/207 (42%), Gaps = 62/207 (29%)

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT--------KQTLKDVQT 287
           +DL L+++ANA+ +FD K++AA KQ+K  +ST+ ALK+AE+K         KQ    ++ 
Sbjct: 355 IDLSLTSFANARAYFDTKKTAATKQEKVEKSTKIALKNAERKITHDLTKNLKQENDSLKQ 414

Query: 288 MTNINKARKVYWF-----------------EKFYW--------FISSE----------NY 312
           M       K +WF                 +  Y+        F+SS+          N 
Sbjct: 415 MRPKYWFEKFFWFVTNEGYLCLAGRDNAQVDMIYYRHFSDNDCFVSSDLEGSLKVFVKNP 474

Query: 313 L-------------------GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRG 353
           L                    +  W+ KV T+AW +  D++SK    G  + +G F   G
Sbjct: 475 LKGETVPPSSIMQAGIFAMSASSGWNGKVTTSAWVLHGDEISKRDFDGSLIASGEFRYTG 534

Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
           KK + PP QL MG      ++E S  R
Sbjct: 535 KKVYLPPAQLVMGFGLYSLVDEESTKR 561



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 95/195 (48%), Gaps = 10/195 (5%)

Query: 7   VSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNS-EDYIVANMEFHPMILQPGSTPS 65
           V++ +  C      LV + N ++ G++V K+ K  ++   +     EFHP        P 
Sbjct: 110 VANALGACEDAYLSLVDSKNENT-GFIVAKRNKASDTNSSFEFIYDEFHPF------KPY 162

Query: 66  VMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSA 125
              Q + +   E   +   +D FFST ES K +LK  Q ++ A K+L+  K + + ++ +
Sbjct: 163 KANQ-EDYQYTEVSGYNKTLDRFFSTLESSKFELKVEQLKQTAAKRLDKAKSERDKQIQS 221

Query: 126 LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK-QAQRHNDPVASI 184
           L + Q ++ +K ELI  + + VD     I+  +   + W +IE  ++ + +++N    +I
Sbjct: 222 LLEQQDLNAKKGELIQYHADLVDDCRAYIQSFLDQSMDWTNIETVLELEKKKNNKLAKAI 281

Query: 185 IKQLKLNINHITLLL 199
              L L  N I +LL
Sbjct: 282 GLPLNLKDNKIKVLL 296


>gi|14591254|ref|NP_143331.1| hypothetical protein PH1465 [Pyrococcus horikoshii OT3]
 gi|3257889|dbj|BAA30572.1| 650aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 650

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/373 (23%), Positives = 158/373 (42%), Gaps = 106/373 (28%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDALKKLENVKRDH 119
            P  ++ ++G+    FE+F+ A+DE+F   +  ++     K ++++R  L  L  +KR  
Sbjct: 243 VPIDLKWYEGYEKVYFETFSQALDEYFGKLTIEKAKAEKTKKLEEKRKQL--LATLKRQE 300

Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
           E  +   E+    ++E   LI  N   +D  +    + + N L W++ + R+++ ++  +
Sbjct: 301 EM-IKGFEKELKKNQEIGNLIYANYTLIDGLLREFSKAVKN-LGWDEFKKRIEEGKKKGN 358

Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD 239
            +A ++K ++   N IT+ +                           E K+  L  +D D
Sbjct: 359 KIALMVKGIEPESNSITVEI---------------------------EGKRVKLY-LDKD 390

Query: 240 LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA----- 294
           L+   NA+ +++     AKK +  ++   KA +  ++K +   ++++      +      
Sbjct: 391 LN--ENAEIYYE----KAKKAKHKLEGARKAYEDLKRKLESIEREIEEEEKKIQVKKIEK 444

Query: 295 RKVYWFEKFYWFISSENYL---GNVA----------------------WDAKVV------ 323
           RK  WFEKF WFISSE +L   G  A                      W A  V      
Sbjct: 445 RKKKWFEKFRWFISSEGFLVIGGKDATTNEIVVRKYLEENDLYCHADIWGAPHVVIKDGQ 504

Query: 324 -----------------TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKN 356
                            + AW          WV  +QV K AP+GEFL  G+FM+ GK+N
Sbjct: 505 KAGEKTIFEACQFAVSMSRAWSEGLYSADAYWVYPNQVKKQAPSGEFLPKGAFMVYGKRN 564

Query: 357 --FFPPCQLAMGI 367
             +  P +LA+GI
Sbjct: 565 WMYGIPLKLAVGI 577


>gi|383318475|ref|YP_005379316.1| RNA-binding protein, eukaryotic snRNP-like protein [Methanocella
           conradii HZ254]
 gi|379319845|gb|AFC98797.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Methanocella conradii HZ254]
          Length = 662

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 109/237 (45%), Gaps = 35/237 (14%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           FESF  A+DE+FS   + +   + V+++ + L   E   R  E  ++  E+ +  +  K 
Sbjct: 266 FESFNQALDEYFSRHVAAEAKAEVVERKAEKLGVYERRLRQQEEAIAKFEREEAENVRKG 325

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITL 197
           E I     ++   I  IR   A   SW+DI   ++ A++  +  AS+I+ +         
Sbjct: 326 EAIYAEYNTISEVIGVIRGARAKGYSWDDIRKILRDARKAGNKAASLIQSV--------- 376

Query: 198 LLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL-VDVDLDLSAYANAKRFFDLKRSA 256
                                 DP A++   K  S+ V+V++DL+   NA+ ++D  + A
Sbjct: 377 ----------------------DPAANTVNVKLSSVSVNVNIDLTVPQNAQAYYDKAKKA 414

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
             K++  +++ E+  K+  K+T    ++     +  K R   W+EK+ WF +S+ +L
Sbjct: 415 RLKKEGALKAIEETKKAMAKETPAPPREPSAKAHPRKPR---WYEKYRWFYTSDGFL 468



 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 32/47 (68%), Gaps = 2/47 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQL--AMGISF 369
           + +WV+ +QVSKT   GE++  G+F+IRG +N+    ++  A+GI F
Sbjct: 539 DCFWVRPEQVSKTPEHGEYVAKGAFIIRGDRNYVKNVEVRAAVGIRF 585


>gi|149246271|ref|XP_001527605.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146447559|gb|EDK41947.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 701

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/209 (28%), Positives = 85/209 (40%), Gaps = 65/209 (31%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ----TLKDVQTMT 289
           V +DL  S++ANA+ +FD K++A + Q K  +  E A ++AEKK  Q     +K     T
Sbjct: 210 VKIDLTQSSFANARIYFDSKKAAEQLQLKVEKGAEIAYRNAEKKISQDFVRNVKKELGST 269

Query: 290 NINKARKVYWFEKFYW------------------------FISSENYL------GNV--- 316
           + +  R   WFEKFYW                        ++  ++YL      G++   
Sbjct: 270 DSSALRSKLWFEKFYWFVSSEGYLCLAGRDKTQVDMIYFKYVGDDDYLVSSEIEGSLKVF 329

Query: 317 ---------------------------AWDAKVVTNAWWVKADQVSK-TAPTGEFLTTGS 348
                                      AW  KV T AW ++A  VSK  +  G  L  G 
Sbjct: 330 IKNPIKDEAIPPSTILQAGIFAMSASHAWSGKVNTAAWVMQASDVSKYDSAAGNLLPPGE 389

Query: 349 FMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           F    KK+  PP QL MG  F   ++E S
Sbjct: 390 FEYFAKKDLLPPAQLVMGFGFYCDVDEES 418


>gi|429961918|gb|ELA41462.1| hypothetical protein VICG_01446 [Vittaforma corneae ATCC 50505]
          Length = 351

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 32/60 (53%), Positives = 43/60 (71%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
            WD  V+   ++V+ DQVSK+AP+GE++T GSFMI+GK+N   P +L  GI  LFKLE S
Sbjct: 200 CWDEGVIKPVFYVEPDQVSKSAPSGEYITKGSFMIKGKRNIMNPYRLEYGIGLLFKLEGS 259


>gi|269863550|ref|XP_002651263.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064852|gb|EED42793.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 335

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 49/170 (28%), Positives = 71/170 (41%), Gaps = 60/170 (35%)

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKA------RKVYWFEKFYWF----------- 306
           I  T + ++   +KTK  ++D+Q      K       R  YWFEKF++F           
Sbjct: 135 IYKTRRRMREKAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGG 194

Query: 307 ----------------------------------------ISSENYLGNV---AWDAKVV 323
                                                   I    Y   V   AWD +V+
Sbjct: 195 KNAQQNDQIVNKYMEDRDLYFHCDVKGASSVICKGSADRNIEDATYFALVYSKAWDEQVI 254

Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
            + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 255 KDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 304


>gi|282165250|ref|YP_003357635.1| hypothetical protein MCP_2580 [Methanocella paludicola SANAE]
 gi|282157564|dbj|BAI62652.1| conserved hypothetical protein [Methanocella paludicola SANAE]
          Length = 666

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/261 (23%), Positives = 121/261 (46%), Gaps = 37/261 (14%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P  + ++ G+    FESF  A+DE++S     +   + V+++ + L  LE   +  E  +
Sbjct: 252 PIELSRYAGYQKVYFESFNKALDEYYSKHIVAEAKAEVVEKKAEKLGVLERRLKQQEDAI 311

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
           +  E+ +     K ELI     +VD  I  I+   +  +SW+DI   +K A++  +P AS
Sbjct: 312 AKFEKEEKEYVRKGELIYAEYGAVDDIIKVIKGARSRGISWDDIRKILKDAKKAGNPAAS 371

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP-SLVDVDLDLSA 242
           +I+ +                               DP A++   K P + +++++DL+ 
Sbjct: 372 MIQSV-------------------------------DPAANTVAVKFPEATININVDLTV 400

Query: 243 YANAKRFFDLKRSAAKKQQ---KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW 299
             N++ ++D  +    K+    K I+ T++A+  A++  ++   + +      K RK  W
Sbjct: 401 PQNSQTYYDKAKKVQSKKDGALKAIEDTKRAM--AKEMPREKPAEPKKPAVKMKPRKPKW 458

Query: 300 FEKFYWFISSENYLGNVAWDA 320
           +EK+ WF +S+ +L     DA
Sbjct: 459 YEKYRWFFTSDGFLVIAGRDA 479



 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 32/47 (68%), Gaps = 2/47 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
           + +WV+ +QVSKT  +GE++  G+F+IRG +N+      + A+GI F
Sbjct: 543 DCFWVRPEQVSKTPESGEYVAKGAFIIRGDRNYVKNVEARAAVGIRF 589


>gi|150402208|ref|YP_001329502.1| hypothetical protein MmarC7_0281 [Methanococcus maripaludis C7]
 gi|150033238|gb|ABR65351.1| protein of unknown function DUF814 [Methanococcus maripaludis C7]
          Length = 680

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 96/386 (24%), Positives = 152/386 (39%), Gaps = 114/386 (29%)

Query: 64  PSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           P  ++ H+      I+ +ESF  A+DE+FS      I  K ++Q    L+KL  VK+   
Sbjct: 262 PIDLKMHENLKENEIKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQER 315

Query: 121 TRLSALEQTQ------LVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
              S LE  +      +++ ++ +LI  N   VD  +  I+     ++ W  I+  +K+ 
Sbjct: 316 ILKSQLETKEKYEKQSILNHKRGDLIYANYSLVDEIVSTIKL-AREKMDWNGIKNVIKEN 374

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
           + H  PV S I    +N+N             N + +  L  D  +   + +       V
Sbjct: 375 KTH--PVLSKI----INVNE-----------KNAELTLNLSADYGNGLIEDT-------V 410

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT----------KQTLKD 284
            VDL  +A+ NA    D+    +KK +  +    +ALK +EKK            + LK+
Sbjct: 411 PVDLRKNAFENA----DIVYQKSKKFKNKVHGVIEALKISEKKLAELKEKEKLDSEVLKE 466

Query: 285 VQTMTNINKARKVYWFEKFYWF-------------------------------------- 306
            +      + + + W+EK  W                                       
Sbjct: 467 KEENIKKKERKVLKWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLMEGA 526

Query: 307 ------------ISSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFL 344
                       I  EN L  VA  A   + AW          WV+ DQ+SKTA +GEFL
Sbjct: 527 PFTIIRTEGSEEIPDENILFEVAKFAASHSRAWKLGIGSADVYWVRPDQISKTAESGEFL 586

Query: 345 TTGSFMIRGKKNFFPPCQLAMGISFL 370
             G+F+IRGK+NF     L +GI  L
Sbjct: 587 KKGAFVIRGKRNFIRSAALELGIGML 612


>gi|269862592|ref|XP_002650899.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065446|gb|EED43157.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 480

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 344 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 400


>gi|269862824|ref|XP_002650989.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065304|gb|EED43067.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 506

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 423 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 479


>gi|14520906|ref|NP_126381.1| hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
 gi|5458123|emb|CAB49612.1| Hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
 gi|380741455|tpe|CCE70089.1| TPA: hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
          Length = 649

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 83/377 (22%), Positives = 153/377 (40%), Gaps = 114/377 (30%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRD---- 118
            P  ++ ++G+    FE+F+ A+DE+F      K+ ++  ++ER   +KLE  K+     
Sbjct: 242 VPIELKWYEGYERVYFETFSQALDEYFG-----KLTIEKAKEERT--RKLEEKKKQLMAT 294

Query: 119 ---HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
               E  +   E+    ++E  +LI  N   +D  +    + +  +  W + + R+++ +
Sbjct: 295 LERQERMIKGFEEEARKNQEIGDLIYANYTIIDGILREFSKAV-EKFGWNEFKKRLEEGK 353

Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
           +  + +A ++K +    + IT+ L   K                              + 
Sbjct: 354 KQGNKIALLVKNVNPEEDSITIELEGRK------------------------------IK 383

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK-----TKQTLKDVQTMTN 290
           + L+ S   NA+ +++     AKK +  ++  +KA +  ++K      +   ++ +    
Sbjct: 384 LYLNRSINDNAELYYE----KAKKAKHKLEGAKKAYEELKRKLEQIEKEIEEEEKKIQVK 439

Query: 291 INKARKVYWFEKFYWFISSENYL---GNVA----------------------WDAKVV-- 323
             + RK  WFEKF WFISSE +L   G  A                      W A  V  
Sbjct: 440 KIEKRKKKWFEKFRWFISSEGFLVIGGKDATTNEIVVRKYMQENDIYCHADIWGAPHVII 499

Query: 324 ---------------------TNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
                                + AW          WV  +QV K AP+GEFL  G+FM+ 
Sbjct: 500 KDGQKASERTIFEACQFAVSMSRAWSEGLYSGDAYWVYPEQVKKQAPSGEFLPKGAFMVY 559

Query: 353 GKKNFFP--PCQLAMGI 367
           GK+N+    P +LA+G+
Sbjct: 560 GKRNWMHGIPLKLAVGV 576


>gi|269867209|ref|XP_002652521.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220062310|gb|EED41535.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 265

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 127 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 183


>gi|45358591|ref|NP_988148.1| hypothetical protein MMP1028 [Methanococcus maripaludis S2]
 gi|44921349|emb|CAF30584.1| unnamed protein product [Methanococcus maripaludis S2]
          Length = 680

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 99/403 (24%), Positives = 160/403 (39%), Gaps = 126/403 (31%)

Query: 58  LQPGST----------PSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQ 104
           L+P ST          P  ++ H+      I+ +ESF  A+DE+FS      I  K ++Q
Sbjct: 246 LKPKSTLKNGEFVNIDPIDLKIHENLKENEIKHYESFLTALDEYFSRF----IMKKEIKQ 301

Query: 105 ERDALKKLENVKRDHETRLSAL------EQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
               L+KL  VK+      S L      E+  + + ++ +LI  N   VD  +  I+ D 
Sbjct: 302 AESKLQKL--VKKQERILKSQLDTKDKYEKQSVSNHKRGDLIYANYSLVDEIVSTIK-DA 358

Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDN 218
             ++ W  I+  +K+ + H  P+ S I    +N+N             N + +  L  D 
Sbjct: 359 REKMDWNGIKNVIKENKTH--PILSKI----INVNE-----------KNAELTLKLSADY 401

Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
            +   + S       V VDL  +A+ NA    D+    +KK +  +Q   +ALK +EKK 
Sbjct: 402 GNGLIEDS-------VPVDLRKNAFENA----DIVYQKSKKFKNKVQGVIEALKISEKKL 450

Query: 279 KQTLKDVQTMTN-----------INKARKVYWFEKFYWFI-------------------- 307
            + LK+ + + +             + + + W+EK  W +                    
Sbjct: 451 AE-LKEKEKLDSEVFKEKEEKIKKKERKVLKWYEKLKWTVIGGYLIVAGKDATTNEMLIK 509

Query: 308 ------------------------------SSENYLGNVAWDAKVVTNAW---------- 327
                                           EN L  VA  A   + AW          
Sbjct: 510 RYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFAASHSRAWKLGIGSADVY 569

Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           WV+ DQ+SKTA +GE+L  G+F+IRGK+NF     L +GI  +
Sbjct: 570 WVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGII 612


>gi|240103770|ref|YP_002960079.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
           EJ3]
 gi|239911324|gb|ACS34215.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
           EJ3]
          Length = 650

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 79/342 (23%), Positives = 150/342 (43%), Gaps = 43/342 (12%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ ++G   + F++F+ A+DE+F      K  ++  ++     K+L    R  E  
Sbjct: 242 VPIELKIYEGLEKRYFKTFSEALDEYFGKLTIEKAKIEKTRKLESKKKQLLATLRKQEEM 301

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   E+    ++E  +LI  N   V+  +   R+    +L WE+ + R++  ++  + VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYAMVERLLDEFRK-ATEKLGWEEFKRRIEAGKKEGNKVA 360

Query: 183 SIIKQLKLNINHITLLLRVHKER----SNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
            ++K +      +T+ L   K +     +I ++  L+++ A  F    E    +  D   
Sbjct: 361 LMVKAIDPKEKTVTIELEGRKVKLYLNKSIGENAELYYEKAKKFRHKYEGALKAYEDTRR 420

Query: 239 DLSAYAN-----AKRFFDLKRSAAKKQQ-----KTIQSTEKALKSAEK--KTKQTL---- 282
            L           K+  ++KR   +K++     +   S+E  L  A K   T +TL    
Sbjct: 421 KLDEVEKLIEEEMKKELNVKRIERRKKKWFEKFRWFISSEGFLVLAGKDASTNETLIKKH 480

Query: 283 ---------KDVQTMTNI-----NKARKVYWFEKFYWFISSENYLGNVAWDAKVV-TNAW 327
                     DV    ++      KA +   FE   + +S        AW   +   +A+
Sbjct: 481 MSDNDLYCHADVYGAPHVVIKDGQKAGEKTIFEACQFAVSMSR-----AWSQGLYGADAY 535

Query: 328 WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           W   +QV+K AP+GE+L  G+FM+ GK+N+    P +LA+G+
Sbjct: 536 WAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGV 577


>gi|224108804|ref|XP_002314973.1| predicted protein [Populus trichocarpa]
 gi|222864013|gb|EEF01144.1| predicted protein [Populus trichocarpa]
          Length = 235

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 94/177 (53%), Gaps = 23/177 (12%)

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
           AE I +N + VD+AILA+   +A  + WED+   VK  ++  +P+A +I +L    N + 
Sbjct: 2   AEFIEHNLQGVDSAILAVPVALAKGIGWEDLARMVKDEKKAGNPIAGLIDKLHFEKNCMA 61

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
           LL+ +      I   W++   +    +                 S++ANA+R+++LK+  
Sbjct: 62  LLIAI------ISMKWMMMKRHFQCIS-----------------SSHANAQRWYELKKKQ 98

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
             KQ+KT  + +KA K+AEKK    L   +++  I+   KV+W EKF WFI + NYL
Sbjct: 99  ECKQEKTFTAHKKAFKAAEKKIHLQLSQEKSVATISHMHKVHWLEKFNWFIGTWNYL 155


>gi|269865041|ref|XP_002651784.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063882|gb|EED42272.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 243

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163


>gi|344230527|gb|EGV62412.1| hypothetical protein CANTEDRAFT_126343 [Candida tenuis ATCC 10573]
          Length = 969

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 84/207 (40%), Gaps = 62/207 (29%)

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL-KDVQTMTN-INK 293
           +DL LS YANA  FFD K+SA  KQ K  ++T  AL++AE+K    L K +Q  +  +NK
Sbjct: 459 IDLSLSPYANASTFFDSKKSAEVKQLKVEKNTGIALQNAERKITHDLTKALQNESEALNK 518

Query: 294 ARKVYWFEKF--------YWFISSENYLGN------------------------------ 315
            R+ +WFEKF        Y  +S ++ L N                              
Sbjct: 519 VREKFWFEKFYWFVTSDGYLCLSGKDDLQNDMIYYRYFNDDDFFVYSDIEGALKVFIKNP 578

Query: 316 ---------VAWDAKVVT-------------NAWWVKADQVSKTAPTGEFLTTGSFMIRG 353
                      W A + +             +AW++    VSK    G  L  G F  +G
Sbjct: 579 YKGETVPPSTIWQAGMFSLSNSESWSNKSSSSAWYLPGPGVSKKDIDGSLLRPGKFNFKG 638

Query: 354 KKNFFPPCQLAMGISFLFKLEESSISR 380
           KK   PP QL MG    F  ++ +  R
Sbjct: 639 KKEHMPPVQLVMGFGIYFVGDDETTKR 665



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 85/177 (48%), Gaps = 18/177 (10%)

Query: 30  QGYLVQKKEKRPNSEDYIVANM-----EFHPMILQPGSTPSVMRQHQGFPIQEFESFAAA 84
           +G +V KK    N+ +    N+     EFHP   +P    S+      +     + +   
Sbjct: 227 EGIIVSKK--NTNATEETAENLQYLFDEFHP--FRPFKDGSL------YKFTSIQGYNKT 276

Query: 85  VDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144
           +D+FFST ES K ++K   Q++ A+K+L+  K +   ++ +L   +  + +K +LII N 
Sbjct: 277 LDQFFSTLESLKNEIKIENQKQLAMKRLDKAKNERVKQIESLINEKNANIKKGDLIILNA 336

Query: 145 ESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--NHITLLL 199
             V   I  I   +  Q+ W DIE  ++  +   D + + I QL LN+  N I L L
Sbjct: 337 NLVSGCIDFINGMLEKQMDWHDIEKYIELQKSSGDDITNAI-QLPLNLLENKIKLNL 392


>gi|269864556|ref|XP_002651614.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064197|gb|EED42442.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 320

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 170 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 226


>gi|340624350|ref|YP_004742803.1| fibronectin-binding A domain-containing protein [Methanococcus
           maripaludis X1]
 gi|339904618|gb|AEK20060.1| Fibronectin-binding A domain protein [Methanococcus maripaludis X1]
          Length = 680

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 150/373 (40%), Gaps = 113/373 (30%)

Query: 75  IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSAL------EQ 128
           I+ +ESF  A+DE+FS      I  K ++Q    L+KL  VK+      S L      E+
Sbjct: 276 IKHYESFLTALDEYFSRF----IMKKEIKQAESKLQKL--VKKQERILKSQLDTKDKYEK 329

Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
             + + ++ +LI  N   VD  +  I+ D   ++ W  I+  +K+ + H  P+ S I   
Sbjct: 330 QSISNHKRGDLIYANYSLVDEIVSTIK-DAREKMDWNGIKNVIKENKTH--PILSKI--- 383

Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKR 248
            +N+N             N + +  L  D  +   + S       V VDL  +A+ NA  
Sbjct: 384 -INVNE-----------KNAELTLKLSADYGNGLIEDS-------VPVDLRKNAFENA-- 422

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----------INKARKV 297
             D+    +KK +  +Q   +ALK +EKK  + LK+ + + +             + + +
Sbjct: 423 --DIVYQKSKKFKNKVQGVIEALKISEKKLAE-LKEKEKLDSEVFKEKEEKIKKKERKVL 479

Query: 298 YWFEKFYWFI-------------------------------------------------- 307
            W+EK  W +                                                  
Sbjct: 480 KWYEKLKWTVIGGYLIVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEI 539

Query: 308 SSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
             EN +  VA  A   + AW          WV+ DQ+SKTA +GE+L  G+F+IRGK+NF
Sbjct: 540 PDENIMFEVAKFAASHSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNF 599

Query: 358 FPPCQLAMGISFL 370
                L +GI  +
Sbjct: 600 IRSAALELGIGII 612


>gi|159111661|ref|XP_001706061.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
           50803]
 gi|157434154|gb|EDO78387.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
           50803]
          Length = 1063

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 30/61 (49%), Positives = 45/61 (73%)

Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           W  KV T ++WV ADQVSKTAP G ++ TGSF+IRGK+NF P   L +G++ L++ + ++
Sbjct: 691 WAQKVGTQSYWVYADQVSKTAPAGMYIGTGSFVIRGKRNFIPQQPLELGVALLWRYDTAN 750

Query: 378 I 378
           +
Sbjct: 751 V 751



 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 11/107 (10%)

Query: 226 SETKKPSLVDVDLDLSAYAN------AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
           S+  +P ++ VD+     A       AK  F+  ++A +K ++T+  +       EKK  
Sbjct: 516 SDPTEPIVISVDVPFKGAAGTTAHIIAKTLFEAAKAAEEKCKRTLGHSSAYFDKVEKKAT 575

Query: 280 QTLKDVQTMTN-----INKARKVYWFEKFYWFISSENYLGNVAWDAK 321
             +  V   T+     +   R   WFEKF+WF S++ YL     DA+
Sbjct: 576 ADIDSVMKETDAELIALQHQRSPLWFEKFHWFFSTDGYLVLSGRDAQ 622


>gi|308160802|gb|EFO63274.1| Serologically defined colon cancer antigen 1 [Giardia lamblia P15]
          Length = 1063

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 30/61 (49%), Positives = 44/61 (72%)

Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           W  KV T ++WV ADQVSKTAP G ++ TGSF+IRGK+NF P   L +G++ L++ +  +
Sbjct: 691 WTQKVGTQSYWVYADQVSKTAPAGMYIGTGSFVIRGKRNFIPQQPLELGVALLWRYDTPN 750

Query: 378 I 378
           +
Sbjct: 751 V 751



 Score = 44.7 bits (104), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 46/103 (44%), Gaps = 11/103 (10%)

Query: 230 KPSLVDVDLDLSAYAN------AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
           +P ++ VD+     A       AK  F+  ++A +K ++T+  +       EKK    + 
Sbjct: 520 EPIIISVDVPFKGTAGTNAHTIAKTLFEAAKAAEEKCKRTLGHSSAYFDKVEKKATADID 579

Query: 284 DVQTMTN-----INKARKVYWFEKFYWFISSENYLGNVAWDAK 321
            V   T+     +   R   WFEKF+WF S+  YL     DA+
Sbjct: 580 SVMKETDAELIALQHQRSPLWFEKFHWFFSTNGYLVLSGRDAQ 622


>gi|428179079|gb|EKX47951.1| hypothetical protein GUITHDRAFT_106038 [Guillardia theta CCMP2712]
          Length = 841

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 102/396 (25%), Positives = 168/396 (42%), Gaps = 100/396 (25%)

Query: 30  QGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFF 89
           +GY++  +    +++  +++  EF P++L         +Q +   + E+ S   A+DEFF
Sbjct: 380 KGYVILDRISDSSNQTLVIS--EFSPILL---------KQQEDKMVLEYPSIDVAMDEFF 428

Query: 90  STAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDA 149
           ST + ++    A +      KK+   K+D ++    L+Q +L++ +KA L+  N   +D 
Sbjct: 429 STIDFNRDQKDANEAVETVSKKVNKAKKDIKSHTEGLKQEELLNHKKATLLELNSFHIDE 488

Query: 150 AILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIK 209
           A+  IR  +                              K++ N   +L  +H+  S   
Sbjct: 489 ALDKIRGLV------------------------------KIHRNAANVLHEIHEMNSTA- 517

Query: 210 DSWVLHHDNADPFADSSETKKPSLVDV----DLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
            S+ L  +      +S   K     D+    D  +S+ ANA+ FF  K+  A KQQ+  +
Sbjct: 518 -SFRLPQEG---IVESEAVKSRGATDITLVLDYSISSLANARNFFQKKKKVAAKQQRAEE 573

Query: 266 STEKALK------SAEKKTKQTLKDVQTMTNINK---ARKVYWFEKFYWFISSENYL--- 313
             + +LK      S  K TK +  D Q+ ++       R+ +WFEKF+WFISS+  L   
Sbjct: 574 MADISLKNTQIKASQRKNTKASKNDFQSKSSSIGISSVRRKFWFEKFFWFISSDQILVIA 633

Query: 314 ------------------------------------GNVAWDAKVV--TNAWWVKADQVS 335
                                                + AW +K    T A+WV  DQVS
Sbjct: 634 GKDAQQNELLVKRNELKERKVLPENTILQAAEFAVCRSSAWKSKTASGTAAYWVYPDQVS 693

Query: 336 KTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           K   +GE+L+ G F+IRGKKNF     L MG    F
Sbjct: 694 KAPQSGEYLSKGGFVIRGKKNFVSISTLCMGFGIFF 729


>gi|269863594|ref|XP_002651278.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064823|gb|EED42778.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 262

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163


>gi|269864527|ref|XP_002651604.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064216|gb|EED42452.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 257

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163


>gi|269862884|ref|XP_002651013.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065270|gb|EED43045.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 191

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 41  AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 97


>gi|269864419|ref|XP_002651566.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064286|gb|EED42490.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 290

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 107 AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 163


>gi|269862032|ref|XP_002650678.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065783|gb|EED43376.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 166

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 43/57 (75%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           AWD +V+ + ++V +DQVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 16  AWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 72


>gi|366991987|ref|XP_003675759.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
 gi|342301624|emb|CCC69395.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
          Length = 1020

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/179 (30%), Positives = 89/179 (49%), Gaps = 15/179 (8%)

Query: 30  QGYLVQKKEKRPNSE------DYIVANME-FHPMILQPGSTPSVMRQHQGFPIQEFESFA 82
           +G+++ KK +  NSE      +Y+      F P +    S  + + +  G        + 
Sbjct: 276 EGFILAKKNENYNSERDNSDLEYVYETFHPFRPYVDDKNSANTHIIEIDG-------QYN 328

Query: 83  AAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIIN 142
             +D FFST ES K  L+   QE  A KK+E+ + ++E R+ AL   Q  ++EK  LII 
Sbjct: 329 KTLDNFFSTIESSKYALRIQAQELQAKKKIEDARLENERRIQALVDVQASNEEKGHLIIA 388

Query: 143 NQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRV 201
           N + V+ A LAI+  +  Q+ W  IE  +   QR  + +A +I QL LN+      +R+
Sbjct: 389 NADLVEEAKLAIQGLLDQQMDWNTIEKLILSEQRKGNMIAQLI-QLPLNLKENKFTVRL 446



 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 95/214 (44%), Gaps = 69/214 (32%)

Query: 229 KKPSLVD-------VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQT 281
           KKP+ VD       +DL L+AYANA  +F++K+++ +KQ++  ++ EKA+K+ E++  + 
Sbjct: 507 KKPATVDKNTVKVTIDLGLTAYANASEYFNIKKTSVEKQKRMEKNIEKAMKNIEERVDKQ 566

Query: 282 LKDVQTMTN--INKARKVYWFEKFYWFISSENYLGNVA---------------------- 317
           LK     ++  + + R  Y+FEK+ WFISSE +L  +                       
Sbjct: 567 LKKKLKDSHDVLKRIRTPYFFEKYNWFISSEGFLVMMGKNGVETDQIYSKFIEDNDIYMS 626

Query: 318 -------W----DAKVV---------------TNAW---------WVKADQVSKTAPTGE 342
                  W    D K V               + AW         W  A  VSK     +
Sbjct: 627 NHFNTQVWIKNPDGKEVPPNTLMQAGILCMSSSEAWSKKIAASAWWCFAKNVSKFDDVDQ 686

Query: 343 -FLTTGSFMIRGKK--NFFPPCQLAMGISFLFKL 373
             L+ G F ++ +K  N  P  QL MG  F++K+
Sbjct: 687 SVLSPGVFRLKNEKDRNELPASQLVMGFGFMWKV 720


>gi|242399100|ref|YP_002994524.1| fibronectin-binding protein [Thermococcus sibiricus MM 739]
 gi|242265493|gb|ACS90175.1| Predicted fibronectin-binding protein [Thermococcus sibiricus MM
           739]
          Length = 650

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 89/377 (23%), Positives = 156/377 (41%), Gaps = 113/377 (29%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ ++ +  + F +F+ A+DE+F      KI L++ + ER   KKL+N KR  E  
Sbjct: 242 VPIELKWYESYKKKFFTTFSEALDEYFG-----KILLESAKIERT--KKLQNKKRQLEAT 294

Query: 123 LSALEQTQLV---------DKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
           L   +Q +++         ++E  +LI  N   ++  +  + + +  +L W++ + RV+ 
Sbjct: 295 LR--KQEEMINGFKNQIQENQEIGDLIYTNFAFIENLLKELSKAV-EKLGWKEFKERVEN 351

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
            ++  + +A IIK +      +T+ L   K                              
Sbjct: 352 GKKSGNKIAQIIKNIDAKEKAVTIELDGKK------------------------------ 381

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKK---QQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
           V + L+ S   NA+ +++  + A  K    QK  + T K +K  EK  ++  K   ++  
Sbjct: 382 VKLYLNKSVGENAEIYYEKAKKAKHKLEGAQKAHKETLKKIKEIEKLIEEEEKKELSVRK 441

Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
           + K RK  WFEKF WF+SSE +L     DA                              
Sbjct: 442 LEK-RKKKWFEKFRWFLSSEGFLIIAGKDATTNEIVVKRYMSENDLYCHADIYGAPHVVI 500

Query: 323 --------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
                               ++ AW          W   +QV+K AP+GE+L  G+FM+ 
Sbjct: 501 KDGKKAGEKTLFEACQFAVSMSRAWKEGLYSGDAYWTDPNQVTKKAPSGEYLGKGAFMVY 560

Query: 353 GKKNFFP--PCQLAMGI 367
           GK+N+    P +LA+GI
Sbjct: 561 GKRNWMHGLPVKLAVGI 577


>gi|385773877|ref|YP_005646444.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323477992|gb|ADX83230.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 609

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 64/206 (31%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           +++D  LSA  NA RFFD  +   +K ++ ++S ++  +  EK  KQ ++    +  +  
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV-- 392

Query: 294 ARKVYWFEKFYWFIS------------SEN------YL--------GNVA---------- 317
            RK  W+EK+ W IS            S+N      YL         ++A          
Sbjct: 393 LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQ 452

Query: 318 ----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
                     +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI G
Sbjct: 453 ENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYG 512

Query: 354 KKNFFPPCQLAMGISFLFKLEESSIS 379
           KKNF    +L + I  +  L E+S+S
Sbjct: 513 KKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|238620391|ref|YP_002915217.1| hypothetical protein M164_1946 [Sulfolobus islandicus M.16.4]
 gi|238381461|gb|ACR42549.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.4]
          Length = 609

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 64/206 (31%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           +++D  LSA  NA RFFD  +   +K ++ ++S ++  +  EK  KQ ++    +  +  
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV-- 392

Query: 294 ARKVYWFEKFYWFIS------------SEN------YL--------GNVA---------- 317
            RK  W+EK+ W IS            S+N      YL         ++A          
Sbjct: 393 LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQ 452

Query: 318 ----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
                     +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI G
Sbjct: 453 ENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYG 512

Query: 354 KKNFFPPCQLAMGISFLFKLEESSIS 379
           KKNF    +L + I  +  L E+S+S
Sbjct: 513 KKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|227828200|ref|YP_002829980.1| hypothetical protein M1425_1938 [Sulfolobus islandicus M.14.25]
 gi|229585429|ref|YP_002843931.1| hypothetical protein M1627_2016 [Sulfolobus islandicus M.16.27]
 gi|227459996|gb|ACP38682.1| protein of unknown function DUF814 [Sulfolobus islandicus M.14.25]
 gi|228020479|gb|ACP55886.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.27]
          Length = 609

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 64/206 (31%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           +++D  LSA  NA RFFD  +   +K ++ ++S ++  +  EK  KQ ++    +  +  
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV-- 392

Query: 294 ARKVYWFEKFYWFIS------------SEN------YL--------GNVA---------- 317
            RK  W+EK+ W IS            S+N      YL         ++A          
Sbjct: 393 LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQ 452

Query: 318 ----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
                     +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI G
Sbjct: 453 ENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYG 512

Query: 354 KKNFFPPCQLAMGISFLFKLEESSIS 379
           KKNF    +L + I  +  L E+S+S
Sbjct: 513 KKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|284998447|ref|YP_003420215.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284446343|gb|ADB87845.1| protein of unknown function DUF814 [Sulfolobus islandicus L.D.8.5]
          Length = 609

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)

Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
           V+++LD  LSA  NA RFFD  +   +K ++ ++S ++  +  EK  KQ ++    +  +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392

Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
              RK  W+EK+ W IS            S+N      YL         ++A        
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450

Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                       +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510

Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
            GKKNF    +L + I  +  L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|333910763|ref|YP_004484496.1| fibronectin-binding A domain-containing protein [Methanotorris
           igneus Kol 5]
 gi|333751352|gb|AEF96431.1| Fibronectin-binding A domain protein [Methanotorris igneus Kol 5]
          Length = 675

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 83/376 (22%), Positives = 149/376 (39%), Gaps = 98/376 (26%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  +++++ F  +E+  F  A+D++F+   +     K   + +  +KK E + +     
Sbjct: 261 VPINLKKYENFEKKEYGEFLEALDDYFAQFMAKVETKKEESKLQKLIKKQERILKTQLET 320

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   E+    ++EK +LI  N   VD  IL   ++   ++ W  I+  V + + H  P+ 
Sbjct: 321 LEKYEKQMQENQEKGDLIYANYTLVD-EILNTLRNAREKMEWYKIKKIVNEHKDH--PIL 377

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I+ +      I + L                       AD  + K    V +D+  +A
Sbjct: 378 GLIQNINEKNGEIVIKLS----------------------ADYGDKKIEKNVSLDIRKNA 415

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD----------VQTMTNIN 292
           + NA+ ++    + +KK +  I+  ++A+K +EKK  + LK+           +      
Sbjct: 416 FENAETYY----TKSKKLKSKIEGIKEAIKLSEKKLAE-LKEKGEIELKELKEKEKIKKK 470

Query: 293 KARKVYWFEKFYWF---------------------------------------------- 306
           + ++  W+EKF W                                               
Sbjct: 471 ERKERKWYEKFKWTVINGFLVIAGKDAVTNELLIKKYTEDDDIVFHAQIEGAPFTVIKTN 530

Query: 307 --ISSENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGK 354
             I  E  L  VA  +   + AW          WVK +QVSKTA +GE+L  G+F+IRGK
Sbjct: 531 KRIVDEETLNEVAKFSVAHSRAWKLGWGALDTYWVKPEQVSKTAESGEYLKKGAFVIRGK 590

Query: 355 KNFFPPCQLAMGISFL 370
           +NF     L +GI  +
Sbjct: 591 RNFIRNVPLELGIGII 606


>gi|385776519|ref|YP_005649087.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323475267|gb|ADX85873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 609

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)

Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
           V+++LD  LSA  NA RFFD  +   +K ++ ++S ++  +  EK  KQ ++    +  +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392

Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
              RK  W+EK+ W IS            S+N      YL         ++A        
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450

Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                       +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510

Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
            GKKNF    +L + I  +  L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|229579837|ref|YP_002838236.1| hypothetical protein YG5714_2060 [Sulfolobus islandicus Y.G.57.14]
 gi|228010552|gb|ACP46314.1| protein of unknown function DUF814 [Sulfolobus islandicus
           Y.G.57.14]
          Length = 609

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)

Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
           V+++LD  LSA  NA RFFD  +   +K ++ ++S ++  +  EK  KQ ++    +  +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392

Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
              RK  W+EK+ W IS            S+N      YL         ++A        
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450

Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                       +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510

Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
            GKKNF    +L + I  +  L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|227830959|ref|YP_002832739.1| hypothetical protein LS215_2101 [Sulfolobus islandicus L.S.2.15]
 gi|227457407|gb|ACP36094.1| protein of unknown function DUF814 [Sulfolobus islandicus L.S.2.15]
          Length = 609

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)

Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
           V+++LD  LSA  NA RFFD  +   +K ++ ++S ++  +  EK  KQ ++    +  +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV 392

Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
              RK  W+EK+ W IS            S+N      YL         ++A        
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450

Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                       +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510

Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
            GKKNF    +L + I  +  L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|253745574|gb|EET01418.1| Serologically defined colon cancer antigen 1 [Giardia intestinalis
           ATCC 50581]
          Length = 1065

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 28/61 (45%), Positives = 44/61 (72%)

Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESS 377
           W  K+   ++WV ADQVSKTAP G ++ TGSF+IRGK+NF P   L +G++ L++ + ++
Sbjct: 693 WAQKIGAQSYWVYADQVSKTAPPGMYIGTGSFVIRGKRNFIPQQPLELGVALLWRYDAAN 752

Query: 378 I 378
           +
Sbjct: 753 V 753



 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 49/112 (43%), Gaps = 10/112 (8%)

Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANA----KRFFDLKRSAAKKQQKTIQSTEKALKSA 274
           A P ++ +E    S VDV    SA  NA       F+  + A +K ++T+  +       
Sbjct: 514 AKPISEPTEAIIIS-VDVPFKGSAGTNAHTIANTLFEAAKEAEQKCERTLGHSSAYFNKV 572

Query: 275 EKKTKQTLKDVQTMTN-----INKARKVYWFEKFYWFISSENYLGNVAWDAK 321
           EKK    +      T+     +   R   WFEKF+WF S++ YL     DA+
Sbjct: 573 EKKATAEIDSAIKETDAKLIALQHQRPPLWFEKFHWFFSTDGYLVLSGRDAQ 624


>gi|402470262|gb|EJW04606.1| hypothetical protein EDEG_01190 [Edhazardia aedis USNM 41457]
          Length = 393

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query: 318 WDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           W  KV  N ++VK+DQVSK A +GE+L  GSFMIRGKKN+    +L  GI  +FK+
Sbjct: 147 WKEKVTGNVYYVKSDQVSKKAQSGEYLKAGSFMIRGKKNYVDVYRLEYGIGIVFKI 202


>gi|15897146|ref|NP_341751.1| hypothetical protein SSO0195 [Sulfolobus solfataricus P2]
 gi|13813331|gb|AAK40541.1| Membrane conserved hypothetical protein [Sulfolobus solfataricus
           P2]
          Length = 609

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 87/207 (42%), Gaps = 66/207 (31%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
           +++D  LSA  NA RFFD  +   +K ++ ++S E+  +   K  KQ   +++    I  
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALKSLEELKEKLAKIEKQ---EIEKQNEIKL 391

Query: 293 KARKVYWFEKFYWFISSENY---LGNVA-------------------------------- 317
             RK  W+EK+ W IS   Y   LG  A                                
Sbjct: 392 TLRKKEWYEKYRWSISRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIIT 451

Query: 318 -----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
                      +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI 
Sbjct: 452 QDNKTISEEDIYDAAVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIY 511

Query: 353 GKKNFFPPCQLAMGISFLFKLEESSIS 379
           GKKNF    +L + I  +  L E+ +S
Sbjct: 512 GKKNFIKNVKLQLAIGLI--LSENGVS 536


>gi|15679889|ref|NP_277007.1| hypothetical protein MTH1907 [Methanothermobacter
           thermautotrophicus str. Delta H]
 gi|2623041|gb|AAB86367.1| conserved protein [Methanothermobacter thermautotrophicus str.
           Delta H]
          Length = 655

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 75/288 (26%), Positives = 130/288 (45%), Gaps = 46/288 (15%)

Query: 51  MEFHPMILQPGS--TPSVMRQHQGFPIQEFESFAAAVDEFFSTA---ESHKIDLKAVQQE 105
           ++  P I+  G    P  +R + G   + FESF  A DEFFS+    E  +   +  ++E
Sbjct: 236 LDLKPHIIGDGEDVLPIELRVYSGRERRYFESFNDAADEFFSSIFREEIRRAHEEEWERE 295

Query: 106 RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWE 165
            D  +K   ++R+    L   ++T  V   + +L+  N   V+  +  IR+    + SW+
Sbjct: 296 VDRFRKRLRIQRET---LEKFKKTIEVSTRRGDLLYANYSLVEEVLATIRR-AREKYSWD 351

Query: 166 DIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
           +I+  +  A++   P AS I ++   + +IT+ L                  + +P    
Sbjct: 352 EIKNIIADARKRGLPEASNITEID-RMGNITIFL------------------DGEP---- 388

Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKT---IQSTEKALKSAEKKTKQTL 282
                   V +D  L    NA+ +++  + A +K +     I+ TEK ++  EKK    L
Sbjct: 389 --------VRIDSKLGVPENAEVYYEKAKKAKRKIKGVMTAIEKTEKEIERIEKKRDDAL 440

Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
           +++       K RK+ WFEKF WF+SS+ +L     DA   TN   VK
Sbjct: 441 RNIMVPRRRVK-RKLRWFEKFRWFVSSDGFLVIGGRDAG--TNEMVVK 485



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 32/45 (71%), Gaps = 2/45 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +WV  +QVSKT  +GEF+  G+F+IRG +N+    P ++A+G+
Sbjct: 541 DVYWVHPEQVSKTPRSGEFVARGAFIIRGSRNYLRGVPLKIAIGV 585


>gi|284174391|ref|ZP_06388360.1| hypothetical protein Ssol98_07002 [Sulfolobus solfataricus 98/2]
 gi|384433658|ref|YP_005643016.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|261601812|gb|ACX91415.1| protein of unknown function DUF814 [Sulfolobus solfataricus 98/2]
          Length = 609

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 87/207 (42%), Gaps = 66/207 (31%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
           +++D  LSA  NA RFFD  +   +K ++ ++S E+  +   K  KQ   +++    I  
Sbjct: 335 IELDTSLSATKNAMRFFDEAKEYKRKIERALKSLEELKEKLAKIEKQ---EIEKQNEIKL 391

Query: 293 KARKVYWFEKFYWFISSENY---LGNVA-------------------------------- 317
             RK  W+EK+ W IS   Y   LG  A                                
Sbjct: 392 TLRKKEWYEKYRWSISRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIIT 451

Query: 318 -----------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
                      +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI 
Sbjct: 452 QDNKTISEEDIYDAAVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIY 511

Query: 353 GKKNFFPPCQLAMGISFLFKLEESSIS 379
           GKKNF    +L + I  +  L E+ +S
Sbjct: 512 GKKNFIKNVKLQLAIGLI--LSENGVS 536


>gi|440491782|gb|ELQ74392.1| putative RNA-binding protein [Trachipleistophora hominis]
          Length = 886

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 44/60 (73%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
            W  ++V+++++V  DQVSK+AP+GEFL+ G FM++GKKN+    +L   I+ LF LE++
Sbjct: 730 CWKDRIVSDSYYVNEDQVSKSAPSGEFLSKGGFMVKGKKNYVHNVRLEYAIALLFALEKN 789


>gi|147771936|emb|CAN75697.1| hypothetical protein VITISV_035984 [Vitis vinifera]
          Length = 431

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 61/105 (58%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P ++ Q +     +FE+F AA DEF+S  E  + + +   +E  A++KL  +  D E R
Sbjct: 245 CPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSEQQQKAKEVXAMQKLSKICMDQENR 304

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
           +  L++      + AELI  N E VDAAILA+R  +AN ++WED+
Sbjct: 305 VHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMNWEDL 349


>gi|449066809|ref|YP_007433891.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
 gi|449069082|ref|YP_007436163.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
           Ron12/I]
 gi|449035317|gb|AGE70743.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
 gi|449037590|gb|AGE73015.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
           Ron12/I]
          Length = 588

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/199 (24%), Positives = 81/199 (40%), Gaps = 67/199 (33%)

Query: 234 VDVDLDLSAYANAKRFFDLKRS---AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
           +D+D  L+ Y NA +++DL +     AKK  + ++   K L   + K  +  ++++    
Sbjct: 320 IDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERKEEIRI--- 376

Query: 291 INKARKVYWFEKFYWFIS------------------------------------------ 308
               RK  W+EK++W I+                                          
Sbjct: 377 --SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIFLHADIQGAAATVL 434

Query: 309 -------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                  SE+ + + A+ A   + AW          WV  +QVSK+ P+GE+L  GSFMI
Sbjct: 435 KANSGQVSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSKSPPSGEYLAKGSFMI 494

Query: 352 RGKKNFFPPCQLAMGISFL 370
            G+KNF    +L + I  +
Sbjct: 495 YGRKNFIKNVKLELAIGIM 513


>gi|70606588|ref|YP_255458.1| hypothetical protein Saci_0795 [Sulfolobus acidocaldarius DSM 639]
 gi|68567236|gb|AAY80165.1| conserved Prokaryal protein [Sulfolobus acidocaldarius DSM 639]
          Length = 594

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/199 (24%), Positives = 81/199 (40%), Gaps = 67/199 (33%)

Query: 234 VDVDLDLSAYANAKRFFDLKRS---AAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
           +D+D  L+ Y NA +++DL +     AKK  + ++   K L   + K  +  ++++    
Sbjct: 326 IDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERKEEIRI--- 382

Query: 291 INKARKVYWFEKFYWFIS------------------------------------------ 308
               RK  W+EK++W I+                                          
Sbjct: 383 --SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIFLHADIQGAAATVL 440

Query: 309 -------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                  SE+ + + A+ A   + AW          WV  +QVSK+ P+GE+L  GSFMI
Sbjct: 441 KANSGQVSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSKSPPSGEYLAKGSFMI 500

Query: 352 RGKKNFFPPCQLAMGISFL 370
            G+KNF    +L + I  +
Sbjct: 501 YGRKNFIKNVKLELAIGIM 519


>gi|429964304|gb|ELA46302.1| hypothetical protein VCUG_02190 [Vavraia culicis 'floridensis']
          Length = 943

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 41/60 (68%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
            W  ++V ++++V  +QVSK+AP+GEFLT G FM++GKKN+    +L   +  LF LE+ 
Sbjct: 742 CWKDRIVCDSYYVNEEQVSKSAPSGEFLTKGGFMVKGKKNYVHNVRLEYAVGLLFALEKG 801


>gi|333987711|ref|YP_004520318.1| fibronectin-binding A domain-containing protein [Methanobacterium
           sp. SWAN-1]
 gi|333825855|gb|AEG18517.1| Fibronectin-binding A domain protein [Methanobacterium sp. SWAN-1]
          Length = 663

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 94/385 (24%), Positives = 159/385 (41%), Gaps = 104/385 (27%)

Query: 52  EFHPMIL---QPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA 108
           +FHP I+   +    P  + +++ +  + FE+F  A DEF+S+      D+K V ++  A
Sbjct: 234 QFHPQIISSEKDDVLPLNLTKYEKYEKKTFETFNQAADEFYSSIVGD--DIKQVHEDVWA 291

Query: 109 --LKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
             + K E   +     L   + T +  K K E I +N +++   IL I  +     SW D
Sbjct: 292 AEVGKFEKRLKIQMETLEKFKDTIVKTKIKGEAIYSNYQNIQN-ILDIIHNARETYSWLD 350

Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
           I   +K+ ++       II+ L K+ +  +TL                    N D     
Sbjct: 351 IIDIIKKGKKEKVSGLDIIESLDKMGV--LTL--------------------NLDG---- 384

Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEK-KTKQTLKD 284
                 ++V+VD ++S   NA+ +++  + A +K      + EK +K  E+ K K+ +  
Sbjct: 385 ------TIVNVDSNMSIPENAEIYYNKGKKAKRKISGVNIAIEKTMKEVERAKNKREIAM 438

Query: 285 VQTMTNINKARK-VYWFEKFYWFISSENYL--------------------------GNVA 317
            + +    + RK + WFEK  WF+SS+  L                           ++ 
Sbjct: 439 EKVLVPQKRVRKELKWFEKLRWFLSSDGLLVIGGRDATTNEMIVKKHMENRDIYFHSDIH 498

Query: 318 WDAKVV-----------------------TNAW----------WVKADQVSKTAPTGEFL 344
             A VV                       ++AW          WV  DQVSKT  +GEF+
Sbjct: 499 GAASVVVKAGEGEVPESTLNETASFAGSFSSAWSAGFGSTDVYWVHPDQVSKTPQSGEFV 558

Query: 345 TTGSFMIRGKKNFF--PPCQLAMGI 367
             G+F+IRG +NF    P  +A+GI
Sbjct: 559 GKGAFIIRGSRNFIRNAPLLVAVGI 583


>gi|386003039|ref|YP_005921338.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
 gi|357211095|gb|AET65715.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
          Length = 668

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/270 (21%), Positives = 121/270 (44%), Gaps = 30/270 (11%)

Query: 52  EFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLK---AVQ 103
           E +P ++  G       P+ +  + G   +EF SF+ A+DEFFS+ E+   + K   A++
Sbjct: 233 EAYPHLVAEGERIVDVVPAPLAVYDGLERREFGSFSEALDEFFSSKEAEAEEAKPKTALE 292

Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
           + R+  ++     R+ E  L+ L           E +      V+A + AI +      +
Sbjct: 293 RRREMQERSIQEFRERERELARL----------GEKVYERYGEVEAVLAAISKGFERGFT 342

Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
           + +I A++K +     P+A  I  L      + L L    +    +       D      
Sbjct: 343 YSEILAKIKTS---GLPIAEKILALDYQ-GELRLRLDDPGDGDGGEGKGGTVGDTG---- 394

Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
                 + ++++++ +L+   NA+R++DL +  AKK++   ++ E+ ++   +K      
Sbjct: 395 -GKGEARGAVLELNSNLTVPQNAQRYYDLAKEQAKKREGAEKALEETIRLIARKAGPEKA 453

Query: 284 DVQTMTNINKARKVYWFEKFYWFISSENYL 313
             +    + + RK  W+E+F WF SS+ +L
Sbjct: 454 KTRA---VYRRRKPKWYERFRWFTSSDGFL 480



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 35/54 (64%), Gaps = 3/54 (5%)

Query: 318 WDAKVV-TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGIS 368
           W A +   + + V ADQV+KT   GEFL  G+F++RG++ ++   P  LA+GI+
Sbjct: 543 WKAGLFEGDCYLVAADQVTKTPEPGEFLKKGAFVVRGERRYYRDVPLGLALGIA 596


>gi|70918391|ref|XP_733179.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56504739|emb|CAH85243.1| hypothetical protein PC301461.00.0 [Plasmodium chabaudi chabaudi]
          Length = 169

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 76/150 (50%), Gaps = 4/150 (2%)

Query: 51  MEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDL-KAVQQERDAL 109
           +EF P++L+           +   + +F  F   VD +FS  E  K D  + + + ++AL
Sbjct: 19  VEFIPILLKNHINKI---DEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKNAL 75

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
            K++ +K DHE R+ ALE+   + K+K  LI  N E V  AI  +R  I+   +WE I  
Sbjct: 76  TKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKIWD 135

Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLL 199
            VK  ++ N PVA  I  +  N   I LLL
Sbjct: 136 HVKLFKKRNHPVALKIMSVNFNNCEIELLL 165


>gi|15920412|ref|NP_376081.1| hypothetical protein ST0231 [Sulfolobus tokodaii str. 7]
 gi|15621194|dbj|BAB65190.1| hypothetical protein STK_02310 [Sulfolobus tokodaii str. 7]
          Length = 595

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 90/204 (44%), Gaps = 69/204 (33%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQTMTN 290
           +++D  LS Y NA ++FD+ +  A+K++K   T+ + ++ LK  +K+ ++  ++++    
Sbjct: 326 IELDPKLSVYKNASKYFDIAKEYAEKRKKAEETLNNLKQKLKELDKQIEERTEEIRI--- 382

Query: 291 INKARKVYWFEKF--------YWFIS---------------------------------- 308
               RK  W+EK+        Y  I+                                  
Sbjct: 383 --SLRKREWYEKYRWSFTRNGYLVIAGRDIDQNESLVRKLLEPKDIFLHADIQGAPATII 440

Query: 309 -------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                  +E+ + + A  A   + AW          WV  DQVSK+ P+GE+L  GSFMI
Sbjct: 441 KTQGNNVTEDDIRDAAVIAACYSKAWKVGMGAIDVFWVNGDQVSKSPPSGEYLKKGSFMI 500

Query: 352 RGKKNFF--PPCQLAMGISFLFKL 373
            GKKNF      QL +G++  FK+
Sbjct: 501 YGKKNFINNVKMQLFLGLTEDFKI 524


>gi|337284225|ref|YP_004623699.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
 gi|334900159|gb|AEH24427.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
          Length = 648

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/317 (23%), Positives = 141/317 (44%), Gaps = 48/317 (15%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFS--TAESHKID-LKAVQQERDALK-KLENVKRD 118
            P  ++ + G+  + FE+F+ A+DE+F   T E  K +  + ++++R AL+  LE ++  
Sbjct: 241 VPIELKWYDGYERKYFETFSEALDEYFGKLTVEKAKAEKTRKLEEKRKALEISLERIR-- 298

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
              ++ A E+    ++E  +LI  N   V+  +  +R  +  +L WE++E RV++ ++  
Sbjct: 299 --EQMMAFEEEAKKNQELGDLIYANYSLVERLLEELRAAV-KKLGWEELERRVEEGKKTG 355

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
           +  A +IK +  + N +T+ +                        D    K      + L
Sbjct: 356 NKAAEVIKGIHPSENAVTVEI------------------------DGKAIK------LYL 385

Query: 239 DLSAYANAKRFFD-LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
           + S   NA+ +++  KR+ AK +       E  +K  E +     +  +      + RK 
Sbjct: 386 NRSLGENAELYYERAKRAKAKLEGARKAYEETKIKIEELERLIEEEGKKVGVKKLERRKK 445

Query: 298 YWFEKFYWFISSENYLGNVAWDAKVVTNAWWVKAD-QVSKTAPTGEFLTTGSFMIR---- 352
            WFEKF WFISSE +L     DA   TN   VK   + +      +       +I+    
Sbjct: 446 KWFEKFRWFISSEGFLVIGGKDA--TTNEMVVKRHMEENDIYCHADVYGAPHVVIKDGRK 503

Query: 353 -GKKNFFPPCQLAMGIS 368
            G++  F  CQ A+ +S
Sbjct: 504 AGERTIFEACQFAVSMS 520



 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 38/54 (70%), Gaps = 3/54 (5%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           AW   + + +A+WV  +QVSK +P GE+L  G+FM+ GK+N+F   P +LA+G+
Sbjct: 522 AWGQGLYSADAYWVYPEQVSKKSPAGEYLPKGAFMVYGKRNWFHGIPLKLAVGV 575


>gi|222445070|ref|ZP_03607585.1| hypothetical protein METSMIALI_00687 [Methanobrevibacter smithii
           DSM 2375]
 gi|222434635|gb|EEE41800.1| fibronectin-binding protein A domain protein [Methanobrevibacter
           smithii DSM 2375]
          Length = 668

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 83/384 (21%), Positives = 155/384 (40%), Gaps = 98/384 (25%)

Query: 53  FHPMILQPGSTPSV----MRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--R 106
           F P I++  S   V    + ++ GF    F++F  A DEF+S   +   D+K +++    
Sbjct: 239 FKPQIVKKDSKEDVVAFDLVKYDGFEKTYFDNFNEACDEFYSKKVN--TDIKNIKEAAWN 296

Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
             + K E   +  E  L    +T    + K E+I +N  +++  +  +   I+   S+++
Sbjct: 297 KKVNKFEKRLKLQEETLDNFHKTIETSQHKGEVIYSNYTTIENLVKVVNNAISKDYSYKE 356

Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
           I   +K+A+++    A I + + K+ +    L L++++   NI     +  +NA+ + + 
Sbjct: 357 IGKTLKEAKKNGLKEAEIFESIDKMGV----LTLKLNETSININPKLTI-PENAEIYYEK 411

Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
           ++  K       +   A  N K+   L++  AKK+        + +   +K+ K+ LK  
Sbjct: 412 AKKAKKKTKGATI---AIENTKK--QLEKIKAKKEVAM-----EHISVPKKRVKKNLK-- 459

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL-------------------------------- 313
                        W+EK  WF++S+N L                                
Sbjct: 460 -------------WYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKYMDNNDIYLHADIHGA 506

Query: 314 --------GNVAWD---------AKVVTNAW----------WVKADQVSKTAPTGEFLTT 346
                   GN   D         A   + AW          WV  +QV+KT   GEFL  
Sbjct: 507 TSTVIKLEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVNPEQVTKTPEAGEFLPK 566

Query: 347 GSFMIRGKKNFFPPCQLAMGISFL 370
           GSF+IRG +N+    ++ + I  +
Sbjct: 567 GSFVIRGNRNYIRSAKVRIAIGIV 590


>gi|375084281|ref|ZP_09731287.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
 gi|374741041|gb|EHR77473.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
          Length = 650

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 138/316 (43%), Gaps = 47/316 (14%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P  ++ ++G+  + FE+F+ A+DE+F         ++  ++ +D  + LE   R  E  +
Sbjct: 243 PIELKWYEGYEKKFFETFSEALDEYFGKILIESAKIERTKKLQDKKRGLEVTLRKQEEMI 302

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
              E+    ++E  +LI  N   V+  +  + + +  +L WE+ + R+++ ++  + VA 
Sbjct: 303 KGFERQMQENQEIGDLIYANFTFVENLLKELSKAV-EKLGWEEFKKRIEEGRKSGNKVAQ 361

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
           IIK +      +T+ L                           E KK   V + L+ S  
Sbjct: 362 IIKGIDPKEKAVTVEL---------------------------EGKK---VKLYLNKSIG 391

Query: 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI--NKARKVYWFE 301
            NA+ +++  + A  K +   ++ E  LK  ++  K   ++ +   ++   + RK  WFE
Sbjct: 392 ENAEIYYEKAKKAKHKLEGARKAYEDTLKKIQEIEKLIEEEEKKELSVKKLEKRKKKWFE 451

Query: 302 KFYWFISSENYL---GNVAWDAKVVT------NAWWVKADQVSKTAPTGEFLTTGSFMIR 352
           KF WF+SSE +L   G  A   ++V       N  +  AD     AP    +        
Sbjct: 452 KFRWFVSSEGFLVIGGKDATTNEIVVKRHMSENDLYCHADIYG--AP---HVVIKDGKKA 506

Query: 353 GKKNFFPPCQLAMGIS 368
           G+K  F  CQ A+ +S
Sbjct: 507 GEKTIFEACQFAVSMS 522



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 35/54 (64%), Gaps = 3/54 (5%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           AW D     +A+W    QV+K AP+GE+L  G+FM+ GK+N+    P +LA+GI
Sbjct: 524 AWKDGIYSGDAYWADPSQVTKKAPSGEYLGKGAFMVYGKRNWMHGLPVKLAIGI 577


>gi|261350362|ref|ZP_05975779.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
 gi|288861145|gb|EFC93443.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
          Length = 668

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 83/384 (21%), Positives = 155/384 (40%), Gaps = 98/384 (25%)

Query: 53  FHPMILQPGSTPSV----MRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--R 106
           F P I++  S   V    + ++ GF    F++F  A DEF+S   +   D+K +++    
Sbjct: 239 FKPQIVKKDSKEDVVAFDLVKYDGFEKTYFDNFNEACDEFYSKKVN--TDIKNIKEAAWN 296

Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
             + K E   +  E  L    +T    + K E+I +N  +++  +  +   I+   S+++
Sbjct: 297 KKVNKFEKRLKLQEETLDNFHKTIETSQHKGEVIYSNYTTIENLVKVVNNAISKDYSYKE 356

Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
           I   +K+A+++    A I + + K+ +    L L++++   NI     +  +NA+ + + 
Sbjct: 357 IGKTLKEAKKNGLKEAEIFESIDKMGV----LTLKLNETSININPKLTI-PENAEIYYEK 411

Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
           ++  K       +   A  N K+   L++  AKK+        + +   +K+ K+ LK  
Sbjct: 412 AKKAKKKTKGATI---AIENTKK--QLEKIKAKKEVAM-----EHISVPKKRVKKNLK-- 459

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL-------------------------------- 313
                        W+EK  WF++S+N L                                
Sbjct: 460 -------------WYEKLRWFVTSDNVLVIGGRDAGTNETVVKKYMDNNDIYLHADIHGA 506

Query: 314 --------GNVAWD---------AKVVTNAW----------WVKADQVSKTAPTGEFLTT 346
                   GN   D         A   + AW          WV  +QV+KT   GEFL  
Sbjct: 507 TSTVIKLEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVNPEQVTKTPEAGEFLPK 566

Query: 347 GSFMIRGKKNFFPPCQLAMGISFL 370
           GSF+IRG +N+    ++ + I  +
Sbjct: 567 GSFVIRGNRNYIRSAKVRIAIGIV 590


>gi|156844590|ref|XP_001645357.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156116018|gb|EDO17499.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 1019

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/174 (29%), Positives = 84/174 (48%), Gaps = 9/174 (5%)

Query: 31  GYLVQKKEKRPNSE----DYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVD 86
           GY++ KK +  N E    D       FHP   +P      + + +   I     +   +D
Sbjct: 277 GYILAKKNENYNPEKHSPDTEFIYETFHP--FEPFVESKDLEKTKIIEIPG--DYNKTLD 332

Query: 87  EFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQES 146
           +FFST ES K  L+   QE  A KKL++ K ++E R+ AL   Q  +++K  LII +   
Sbjct: 333 QFFSTIESSKYSLRIQNQELQAKKKLDDAKLENERRIQALVDVQTSNEQKGHLIIAHSNL 392

Query: 147 VDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIK-QLKLNINHITLLL 199
           ++    A++  I  Q+ W  IE  +   Q+  + +A  +K  LKL  N I ++L
Sbjct: 393 IEEVKFAVQGLIDQQMDWNTIENLIGSEQKKGNKIAQKVKLPLKLKNNKIDVIL 446



 Score = 41.2 bits (95), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 317 AWDAKVVTNAWWVKADQVSK-TAPTGEFLTTGSFMIRGKKNF--FPPCQLAMGISFLFKL 373
           AW  K+  +  W  A  +SK  + T   L  G F ++ +K+    PP QL MG  F +K+
Sbjct: 661 AWSKKIAASPRWCYARNISKFDSTTNTLLPRGRFALKDEKSMIHLPPAQLVMGFGFAWKV 720

Query: 374 EESS 377
           +  S
Sbjct: 721 KTES 724


>gi|410730361|ref|XP_003671360.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
 gi|401780178|emb|CCD26117.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
          Length = 1037

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)

Query: 53  FHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
           F P + +     S + +  G+       +   +D+FFST ES K  L+   QE  A KKL
Sbjct: 310 FEPYVEEKDKASSHISEIPGY-------YNKTLDKFFSTIESSKYALRIQNQELQAKKKL 362

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           +  K D++ +L AL   Q  ++EK  LI+ N + V+ A  AI+  +  Q+ W  IE  +K
Sbjct: 363 DEAKLDNQKKLQALIDVQSSNEEKGHLIVANADLVEEAKSAIQGLVDQQMDWNTIEKLIK 422

Query: 173 QAQRHNDPVASIIKQLKLNI 192
             Q+ +  +A +I  L LN+
Sbjct: 423 SEQKKHVKIAELI-VLPLNL 441



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/218 (26%), Positives = 93/218 (42%), Gaps = 66/218 (30%)

Query: 225 SSETKKPS----LVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE----- 275
           SS T +P+     V +DL  SAYANA  +F+ K+++A+KQ++  ++ EKA+K+ E     
Sbjct: 516 SSATSQPNEKMITVTIDLGFSAYANASEYFNAKKTSAEKQKRVEKNIEKAMKNIEEKVNT 575

Query: 276 ------KKTKQTLKDVQT------------------MTNINKARKVYWFEKFYW---FIS 308
                 K++ + LK ++T                  M   N A     + K+        
Sbjct: 576 QLKKKLKESHEVLKKIRTPYFFEKYHWFISSEGYLVMMGKNDAETDQIYSKYIEDDDVFM 635

Query: 309 SENYLGNV---------------------------AWDAKVVTNAWWVKADQVSKTAPTG 341
           S N+   V                           AW  K+ ++AWW  A  V+K     
Sbjct: 636 SNNFGTKVWIKNPMKHEVPPNTLMQAGILCMSSSEAWSKKIASSAWWCNAKNVTKFDKFD 695

Query: 342 E-FLTTGSFMIRGKK--NFFPPCQLAMGISFLFKLEES 376
           +  L  G F+++ +K  N  P  QL MG+ FL+K++ S
Sbjct: 696 KSVLPPGVFVLKDEKDQNTLPASQLVMGLGFLWKVKTS 733


>gi|229581503|ref|YP_002839902.1| hypothetical protein YN1551_0858 [Sulfolobus islandicus Y.N.15.51]
 gi|228012219|gb|ACP47980.1| protein of unknown function DUF814 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 609

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 95/208 (45%), Gaps = 66/208 (31%)

Query: 234 VDVDLD--LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
           V+++LD  LSA  NA RFFD  +   +K ++ ++S ++  +  +K  KQ ++    +  +
Sbjct: 333 VEIELDTLLSATKNAMRFFDEAKEYKRKIERALESLDELKEKLKKIEKQEIEKQNEIKLV 392

Query: 292 NKARKVYWFEKFYWFIS------------SEN------YL--------GNVA-------- 317
              RK  W+EK+ W IS            S+N      YL         ++A        
Sbjct: 393 --LRKKEWYEKYRWSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIII 450

Query: 318 ------------WDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                       +DA V+    + AW          WV  +QVSK+ P+GE+L  GSFMI
Sbjct: 451 AQENNTILEDDIYDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMI 510

Query: 352 RGKKNFFPPCQLAMGISFLFKLEESSIS 379
            GKKNF    +L + I  +  L E+S+S
Sbjct: 511 YGKKNFVKNVKLQLAIGLI--LSENSVS 536


>gi|448677723|ref|ZP_21688913.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
           12282]
 gi|445773398|gb|EMA24431.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
           12282]
          Length = 717

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 117/257 (45%), Gaps = 19/257 (7%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKR---DH 119
           TP  + +++G   + F  F +A+D++F   +  +       Q  D   ++E  KR     
Sbjct: 275 TPIPLSEYEGLYSESFTEFNSALDDYFFNFQREEEVEGGETQRPDFEVEIEKQKRIIQQQ 334

Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
           E  +   E    V++EKAEL+  N + VD  +  +R    + +SW+DIEA+  +      
Sbjct: 335 EQAIEDFEADAEVEREKAELLYANYDLVDDVLSTVRAAREDDVSWDDIEAKFDEGADRG- 393

Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDV 236
            +A+    + L+ +  T+ L +   R  + D++     NAD     A   E KK      
Sbjct: 394 -IAAAEAVVSLDGSEGTVTLDIGGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA--- 448

Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARK 296
              L+A  N +   DL+  A K+++   ++ +   + AE + +    D  +M +I     
Sbjct: 449 ---LAAIENTRE--DLE--AVKERRDEWEADDGEDEVAEDEGEDEPTDWLSMQSIPTRST 501

Query: 297 VYWFEKFYWFISSENYL 313
             W+E+F WF +S+ +L
Sbjct: 502 ERWYEQFRWFHTSDGFL 518



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F IRG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVAVGIT 641


>gi|322368861|ref|ZP_08043428.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
           DX253]
 gi|320551592|gb|EFW93239.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
           DX253]
          Length = 711

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 63/268 (23%), Positives = 111/268 (41%), Gaps = 46/268 (17%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFS---------TAESHKIDLKAVQQERDALKKLE 113
           TP  + +++G   + F+SF AAVD++F+           E  K D +A       ++K +
Sbjct: 275 TPFPLEEYEGLDSEAFDSFNAAVDDYFTNLDVSENEDAGEPQKPDFQA------QIEKQQ 328

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
            +    E  +   E+    ++EKAEL+  N   VD  +  +R   A    W DIEAR ++
Sbjct: 329 RIIEQQEGAIEGFERKADAEREKAELLYANYGFVDEILATVRNARAEDTPWADIEARFEE 388

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
                   A  ++ +  +   +T+          I D+ +        F D    K  + 
Sbjct: 389 GAERGIEAAEAVQGIDPSEGTVTV---------EIDDTKITL------FPDDGVEKNANR 433

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
           +        Y  AKR  + K  A    + T +  E+  K AE+  ++  ++     NI+ 
Sbjct: 434 L--------YQEAKRIEEKKEGALAAIEDTREELEEVKKRAEQWEEEPEEERTEPENIDW 485

Query: 293 ------KARK-VYWFEKFYWFISSENYL 313
                   RK   W+E+F WF +S+ +L
Sbjct: 486 LSRASIPVRKQEQWYERFRWFRTSDGFL 513



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D +   +A+ V  DQVSKT  +GE+L  G F IRG + +F   P  +A+GI+
Sbjct: 585 DGRFAGDAYMVTPDQVSKTPESGEYLEKGGFAIRGDRTYFEDTPVGVAVGIT 636


>gi|148642838|ref|YP_001273351.1| RNA-binding protein snRNP-like protein [Methanobrevibacter smithii
           ATCC 35061]
 gi|148551855|gb|ABQ86983.1| predicted RNA-binding protein, eukaryotic snRNP-like protein
           [Methanobrevibacter smithii ATCC 35061]
          Length = 668

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/384 (21%), Positives = 155/384 (40%), Gaps = 98/384 (25%)

Query: 53  FHPMILQPGSTPSV----MRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--R 106
           F P I++  S   V    + ++ GF    F++F  A DEF+S   +   D+K +++    
Sbjct: 239 FKPQIVKKDSKEDVVAFDLVKYDGFEKTYFDNFNEACDEFYSKKVN--TDIKNIKEAAWN 296

Query: 107 DALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED 166
             + K E   +  E  L    +T    + K E+I +N  +++  +  +   I+   S+++
Sbjct: 297 KKVNKFEKRLKLQEETLDNFHKTIETSQHKGEVIYSNYTTIENLVKVVNNAISKDYSYKE 356

Query: 167 IEARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS 225
           I   +K+A+++    A I + + K+ +    L L++++   NI     +  +NA+ + + 
Sbjct: 357 IGKTLKEAKKNCLKEAEIFESIDKMGV----LTLKLNETSININPKLTI-PENAEIYYEK 411

Query: 226 SETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV 285
           ++  K       +   A  N K+   L++  AKK+        + +   +K+ K+ LK  
Sbjct: 412 AKKAKKKTKGATI---AIENTKK--QLEKIKAKKEVAM-----EHISVPKKRVKKNLK-- 459

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYL-------------------------------- 313
                        W+EK  WF++S+N L                                
Sbjct: 460 -------------WYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKYMDNNDIYLHADIHGA 506

Query: 314 --------GNVAWD---------AKVVTNAW----------WVKADQVSKTAPTGEFLTT 346
                   GN   D         A   + AW          WV  +QV+KT   GEFL  
Sbjct: 507 TSTVIKLEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVNPEQVTKTPEAGEFLPK 566

Query: 347 GSFMIRGKKNFFPPCQLAMGISFL 370
           GSF+IRG +N+    ++ + I  +
Sbjct: 567 GSFVIRGNRNYIRSAKVRIAIGIV 590


>gi|10640760|emb|CAC12538.1| conserved hypothetical protein [Thermoplasma acidophilum]
          Length = 588

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 52/88 (59%), Gaps = 8/88 (9%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V++D  +SA  NA R+F    S AK  ++ I+   KA++ AEK   Q L ++Q      K
Sbjct: 331 VEIDYTVSAGENANRYF----SQAKDYRRKIEGAMKAIEEAEK---QRLTEMQKAEK-KK 382

Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAK 321
            RKV+WFE ++WFISSE YL     DAK
Sbjct: 383 RRKVFWFETYHWFISSEGYLVIAGRDAK 410



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 35/54 (64%), Gaps = 1/54 (1%)

Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           AW A + +  A+WV   QVSKT  +GE++ TGS++IRGK+N+    +L + I  
Sbjct: 465 AWPAGIASGTAYWVYPSQVSKTPESGEYVATGSWIIRGKRNYITDLKLELCIGM 518


>gi|16082623|ref|NP_394872.1| RNA-binding protein snRNP [Thermoplasma acidophilum DSM 1728]
          Length = 601

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 52/88 (59%), Gaps = 8/88 (9%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V++D  +SA  NA R+F    S AK  ++ I+   KA++ AEK   Q L ++Q      K
Sbjct: 344 VEIDYTVSAGENANRYF----SQAKDYRRKIEGAMKAIEEAEK---QRLTEMQKAEK-KK 395

Query: 294 ARKVYWFEKFYWFISSENYLGNVAWDAK 321
            RKV+WFE ++WFISSE YL     DAK
Sbjct: 396 RRKVFWFETYHWFISSEGYLVIAGRDAK 423



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 35/54 (64%), Gaps = 1/54 (1%)

Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           AW A + +  A+WV   QVSKT  +GE++ TGS++IRGK+N+    +L + I  
Sbjct: 478 AWPAGIASGTAYWVYPSQVSKTPESGEYVATGSWIIRGKRNYITDLKLELCIGM 531


>gi|410722235|ref|ZP_11361543.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
           sp. Maddingley MBC34]
 gi|410597380|gb|EKQ52002.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
           sp. Maddingley MBC34]
          Length = 742

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/339 (23%), Positives = 150/339 (44%), Gaps = 49/339 (14%)

Query: 70  HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--RDALKKLENVKRDHETRLSALE 127
           +Q F  + F++F  A DEF+S       D+K VQ++     + K E   R  E  L   +
Sbjct: 341 YQNFHKERFDTFNQAADEFYSGKVG--ADIKKVQEDIWAKEVGKYEKRLRIQEETLEKFQ 398

Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
           +T +  K+K  L+ ++   +   +L I      + SW +I ++ K+A++     A II+ 
Sbjct: 399 KTIVETKKKGNLLYSHYSEIQD-LLDIIHQAREKFSWMEIASKFKKARKEGMKEAQIIES 457

Query: 188 L-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL-------D 239
           + K+ +    L L +  ER  + D+ +   +NA+ + +  +  K  +  V++       D
Sbjct: 458 MDKMGV----LTLNLEGERVTV-DANLEIPENAEKYYNKGKKAKRKIRGVNIAIERTKKD 512

Query: 240 LSAYANAKRF-FDLKRSAAKKQQKTIQSTEK-----------ALKSAEKKTKQTL----- 282
           +    N +    +  R   K+ +K ++  EK            +   +  T + +     
Sbjct: 513 VERKRNKREMALERVRVPQKRVRKELKWFEKLRWFLSSDGYLVIGGRDAGTNEMVVKRHL 572

Query: 283 --KDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAW----------WVK 330
             +D+   ++I+ A  V   +        E+ +      A   ++AW          WV 
Sbjct: 573 DNQDIYLHSDIHGAPSVVIKKGEVEGEIPESTVQEAGTLAASFSSAWSKGYGSQDVYWVH 632

Query: 331 ADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
            DQVSKT  +GEF+  G+F+IRG +N+    P ++A+GI
Sbjct: 633 PDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIAVGI 671


>gi|429217609|ref|YP_007175599.1| RNA-binding protein [Caldisphaera lagunensis DSM 15908]
 gi|429134138|gb|AFZ71150.1| putative RNA-binding protein, snRNP like protein [Caldisphaera
           lagunensis DSM 15908]
          Length = 669

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 38/55 (69%), Gaps = 3/55 (5%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPC--QLAMGIS 368
           AW A + +  A+WV  +QVSK+ P+GE+L  GSFMI GKKNF  P   +L +GIS
Sbjct: 524 AWKAGMGSVGAYWVLGNQVSKSPPSGEYLAKGSFMIYGKKNFLKPINMELYLGIS 578


>gi|241958102|ref|XP_002421770.1| conserved hypothetical protein [Candida dubliniensis CD36]
 gi|223645115|emb|CAX39711.1| conserved hypothetical protein [Candida dubliniensis CD36]
          Length = 1012

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 55/186 (29%), Positives = 95/186 (51%), Gaps = 15/186 (8%)

Query: 19  HVLVSNLNVSSQGYLVQKKEKRP--NSE-DYIVANMEFHPMILQPGSTPSVMRQHQGFPI 75
           ++ + N  + ++GY+V KK  +   NSE +YI    EF P   +P   P+     +G   
Sbjct: 257 YIDLINGAIDNEGYIVAKKNNKASENSELEYIYD--EFDP--FEP-YKPN----QEGLKF 307

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
                +   +D+FFS  ES K  +K  QQ+ +A K+LE  + + + ++ +L   Q ++ +
Sbjct: 308 IPVSGYNKTLDKFFSNIESTKFSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQKLNAK 367

Query: 136 KAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI--N 193
           K ELI  + E V+     ++  I  Q+ W +IE  +   Q+  + +A  I QL LN+  N
Sbjct: 368 KGELIQYHSELVEECRNYVQSFIDQQMDWTNIETVISLEQKKKNDLAKHI-QLPLNLKEN 426

Query: 194 HITLLL 199
            I +LL
Sbjct: 427 KIKVLL 432



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 32/61 (52%)

Query: 317 AWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           AW+ KV T+AW +   ++SK    G  +  G F    +K + PP QL MG  F   L+E 
Sbjct: 642 AWNGKVTTSAWVLHGTEISKRDYDGSIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDEE 701

Query: 377 S 377
           S
Sbjct: 702 S 702


>gi|304314240|ref|YP_003849387.1| hypothetical protein MTBMA_c04780 [Methanothermobacter marburgensis
           str. Marburg]
 gi|302587699|gb|ADL58074.1| conserved hypothetical protein [Methanothermobacter marburgensis
           str. Marburg]
          Length = 653

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 70/290 (24%), Positives = 132/290 (45%), Gaps = 47/290 (16%)

Query: 50  NMEFHPMILQPGST---PSVMRQHQGFPIQEFESFAAAVDEFFSTA---ESHKIDLKAVQ 103
           +++F+P I++ G     P  +  ++    + FE+F  A DEFFS+    E  K+     +
Sbjct: 232 DLKFNPHIIKNGEGDVLPIELMVYRDREREYFETFNEAADEFFSSIFREELRKVHEAEWE 291

Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
           +E +  +K   ++R+    L   + T      K +L+  +  +V+  +  IR D   + S
Sbjct: 292 KEVEKFRKRLRIQRET---LQKFQDTIDTSTRKGDLLYAHYAAVEDVLRTIR-DAREKYS 347

Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
           W++I   +  A+      A +I+++   + ++TLL  +  ER                  
Sbjct: 348 WKEIRKIIADARSKGMVEAQMIQEID-GMGNMTLL--IDGER------------------ 386

Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS---TEKALKSAEKKTKQ 280
                     + +D  L    NA+ +++  + A +K +  +Q+   TE+ ++  EK+   
Sbjct: 387 ----------IRIDPTLGVPENAEVYYEKAKKAKRKIKGVLQAIEKTEREIEKVEKRRDD 436

Query: 281 TLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
            L+++       K RK+ WFEKF WFISS+++L     DA   TN   VK
Sbjct: 437 ALRNIMVPQKRVK-RKLRWFEKFRWFISSDDFLVIGGRDAG--TNEMVVK 483



 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 32/45 (71%), Gaps = 2/45 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +WV  +QVSKT  +GEF+  G+F+IRG +N+    P ++A+G+
Sbjct: 539 DVYWVHPEQVSKTPRSGEFVARGAFIIRGTRNYIRGVPLKVAVGV 583


>gi|13542268|ref|NP_111956.1| RNA-binding protein snRNP [Thermoplasma volcanium GSS1]
 gi|14325702|dbj|BAB60605.1| hypothetical protein [Thermoplasma volcanium GSS1]
          Length = 604

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 10/89 (11%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEK-KTKQTLKDVQTMTNIN 292
           +D+D   SA  NA R+FDL    +K  +K I+  +KA++ AE+ + K   K V+++    
Sbjct: 345 IDIDYTKSAGENANRYFDL----SKDYRKKIEGAKKAIEEAEQERIKLQEKKVKSVN--- 397

Query: 293 KARKVYWFEKFYWFISSENYLGNVAWDAK 321
             R+++WFE ++WFISSE YL     DAK
Sbjct: 398 --RRIFWFETYHWFISSEGYLVIAGRDAK 424



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 36/54 (66%), Gaps = 1/54 (1%)

Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           AW A + +  A+WV   QVSKT  +GE+++TGS++IRGK+N+    +L + I  
Sbjct: 479 AWPAGIASGTAYWVYPSQVSKTPESGEYVSTGSWIIRGKRNYVTNLKLELCIGL 532


>gi|424813826|ref|ZP_18239004.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalina sp. J07AB43]
 gi|339757442|gb|EGQ42699.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalina sp. J07AB43]
          Length = 632

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/346 (22%), Positives = 157/346 (45%), Gaps = 68/346 (19%)

Query: 6   DVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPN-----SEDYIVANMEFHPMILQP 60
           +++S +   G     ++   N+  + ++ Q  ++R +     +ED + +     P++   
Sbjct: 167 EIASKLSFGGTYAEEVLKRANIDKETHVEQLDDERKDIISEKTEDMLNSAENPDPILYVD 226

Query: 61  GSTPSVMRQHQGFPIQ--------EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKL 112
           G  P    +   FP++        +F++F+ A+DE++   ++ K   +  +  ++  + +
Sbjct: 227 GEEPV---RSSPFPLERYADDESIDFDTFSEAIDEYYYRKKALKEKKEKEEAYQEKKQGI 283

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQL---SWEDIEA 169
           E  K+  E ++  LE++   ++EKAE I  N +     +  I++ I N L    WE    
Sbjct: 284 ERQKQQQERKIQGLEKSAEQNREKAERIYENYQ----LLQRIKRQIENSLDEDGWEQTRQ 339

Query: 170 RVKQAQRHN-DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSET 228
           ++++++  + D VAS+ KQ     + I++                          D+ E 
Sbjct: 340 KLEKSESEDADKVASLNKQE----DFISV--------------------------DTGEE 369

Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK----TKQTLKD 284
                +  DL+    A A +++D     AK  ++ I+S ++ALK  +K+     K+ +  
Sbjct: 370 NLKVYLFQDLE----ATASQYYD----KAKNSEEKIESAKEALKETKKELEDLKKEEINT 421

Query: 285 VQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
            + + +  + RK  WFEK+ WF SSE+YL     DA+  TN   VK
Sbjct: 422 DEVLEDKTQKRKKKWFEKYRWFYSSEDYLVLCGRDAQ--TNDMLVK 465


>gi|146302942|ref|YP_001190258.1| hypothetical protein Msed_0157 [Metallosphaera sedula DSM 5348]
 gi|145701192|gb|ABP94334.1| protein of unknown function DUF814 [Metallosphaera sedula DSM 5348]
          Length = 601

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 82/196 (41%), Gaps = 66/196 (33%)

Query: 234 VDVDLDLSAYANAKRFFDLKR---SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
           +++D  +SA  NA ++F+  +   +  ++ ++TI+  EK  +  + K+K+T++  + +  
Sbjct: 323 IEIDPKISASKNASQYFEKAKELDAKIRRTRETIEELEKKKQEIKAKSKETIEGSKILV- 381

Query: 291 INKARKVYWFEKFYWFIS------------------------------------------ 308
               RK  W+E+++W I+                                          
Sbjct: 382 ----RKKEWYERYHWTITSNGFIVIAGRDIDQNESIVRKMLEDKDIFLHADIQGAPATVI 437

Query: 309 ------SENYLGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIR 352
                  E  L + A  A   + AW          WV  +QVSK+ P+GE+L  GSFMI 
Sbjct: 438 KNPVGIGEQDLMDAAVLAGCYSKAWKLGLASIDVFWVYGEQVSKSPPSGEYLPKGSFMIY 497

Query: 353 GKKNFFPPCQLAMGIS 368
           GKKN+    +L + I 
Sbjct: 498 GKKNYIKNVKLELTIG 513


>gi|150400994|ref|YP_001324760.1| hypothetical protein Maeo_0563 [Methanococcus aeolicus Nankai-3]
 gi|150013697|gb|ABR56148.1| protein of unknown function DUF814 [Methanococcus aeolicus
           Nankai-3]
          Length = 686

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 34/46 (73%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + +WVK DQ+SKTA +GE+L+ G+FM+RGK+NF     L +GI  +
Sbjct: 575 DVYWVKPDQISKTAESGEYLSKGAFMVRGKRNFIRSAILDLGIGII 620



 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 64/276 (23%), Positives = 120/276 (43%), Gaps = 54/276 (19%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFST--------AESHKIDLKAVQQERDALKKLEN 114
           +P  + ++  +  + +++F  A+D++FS          +  KI     +QER    ++E+
Sbjct: 261 SPIELLKYANYDKKYYDNFLTAMDDYFSIFILKTEIKKQETKIQKMVNRQERILNSQIES 320

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDA---AILAIRQDIANQLSWEDIEARV 171
           +K+         E+  + +K K +LI  N   VD     I++ R+    +L W++I+  V
Sbjct: 321 LKK--------YEKQDIENKLKGDLIYANYAMVDEILNTIISARE----KLEWKEIKKIV 368

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
           KQ +  ++P+   I  +      I L L V                    + D +   K 
Sbjct: 369 KQNK--DNPILGKIVSINEKNGEIILNLTVD-------------------YGDGAPITKN 407

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQ---QKTIQSTEKALKSA---EKKTKQTLKDV 285
            ++D+  +  A+ NA  ++   +    K       I+ +EK LK     E+   +TLK+ 
Sbjct: 408 VILDIRKN--AFENADNYYGKSKKFKHKIKGVHTAIEISEKKLKKLKIQEESEMETLKEK 465

Query: 286 QTMTNINKARKVY-WFEKFYWFISSENYLGNVAWDA 320
           +  T + K RK   W+EKF W + + +YL     DA
Sbjct: 466 EETTMVKKERKKRKWYEKFKWTVIN-DYLVIAGKDA 500


>gi|448502987|ref|ZP_21612851.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
           10284]
 gi|445693389|gb|ELZ45541.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
           10284]
          Length = 730

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/286 (22%), Positives = 116/286 (40%), Gaps = 57/286 (19%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
           TP  + +H+G P   F+SF AAVDE+F    +   D      +  A        + K E 
Sbjct: 269 TPFPLAEHEGLPSVGFDSFNAAVDEYFYRLGNEATDDGEAPADATASRPDFEAEIAKQER 328

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +       +   E+    ++E+AEL+  N + VD  +  +R+   N++ W++I A +   
Sbjct: 329 IVEQQRGAIEGFEEQAQAERERAELLYANYDLVDEVLSTVREARENEVPWDEIAATLDAG 388

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A+ +  +      +T+ L       + +D   +  D      D+SE      V
Sbjct: 389 AERGIPAAAAVVDVDGGEGTVTVAL-------DDEDGGSVRID-----LDASEG-----V 431

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK----------- 283
           +V+ D   Y  AKR  + K++ AK     I+ST + L++  ++  +  +           
Sbjct: 432 EVNAD-RLYQEAKR-VEEKKAGAK---AAIESTREELEAVNERKAEWEEQEAAADESAGA 486

Query: 284 ----------------DVQTMTNINKARKVYWFEKFYWFISSENYL 313
                           D  + ++I       WFE+F WF +S  YL
Sbjct: 487 DGDGEDGEDGDEAYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 532



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 604 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 654


>gi|385805336|ref|YP_005841734.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Fervidicoccus fontis Kam940]
 gi|383795199|gb|AFH42282.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Fervidicoccus fontis Kam940]
          Length = 629

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/54 (50%), Positives = 35/54 (64%), Gaps = 1/54 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           AW A + V + +WV +DQVSKT P GE+L  GSFMI G+KN+     L +GI  
Sbjct: 500 AWKASLKVVDVFWVHSDQVSKTPPAGEYLEKGSFMIYGEKNYVRNVPLKLGIGL 553


>gi|307354208|ref|YP_003895259.1| Fibronectin-binding A domain-containing protein [Methanoplanus
           petrolearius DSM 11571]
 gi|307157441|gb|ADN36821.1| Fibronectin-binding A domain protein [Methanoplanus petrolearius
           DSM 11571]
          Length = 636

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/379 (22%), Positives = 149/379 (39%), Gaps = 106/379 (27%)

Query: 56  MILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENV 115
           +I + G  P  +   +  P+  F SF  A+  +F   +S   D K     ++ L K E +
Sbjct: 232 VITKSGCWPYPLPGEE--PVSSFSSFNDALSAYFPLPQSAAKDAK-----KEKLPKSEII 284

Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
           +R  +  +   E+     +EK + I  N + +   I  +R D +++LSW++IE  +K + 
Sbjct: 285 RRRQQEAIVNFEKKIAELQEKVDAIYENYQDISGIIDTLR-DASSKLSWQEIEETLKNSS 343

Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
               P A  I             +R++   S +           D  A   + K      
Sbjct: 344 L---PAAKSI-------------VRIYPSESAV-----------DVMAGGKKVK------ 370

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL-KSAEKKTKQTLKDVQTMTNINKA 294
           + ++ +  ANA R++   +   KK+   + + EK + K  + K +Q            K 
Sbjct: 371 IFINENPEANANRYYGEIKKYKKKKAGALVAMEKFMPKEKQAKKRQDY----------KP 420

Query: 295 RKVYWFEKFYWFISSENYL-------------------------------GNVA------ 317
           +K  W+ K+ WF++S+  L                               G+V       
Sbjct: 421 QKKKWYHKYRWFVTSDGVLVIGGQDAGSNEDIGKKYLEGRDYFVHADVHGGSVVVVKGET 480

Query: 318 --WD-----AKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
             W+     A   +NAW            K +QVSKTA +GEF+  G+F+IRG++ +F  
Sbjct: 481 ENWEEVAEFAASYSNAWKAGHFNCDVYAAKPEQVSKTAESGEFVKRGAFIIRGERRYFRN 540

Query: 361 CQLAMGISFLFKLEESSIS 379
             L + I    + E + I 
Sbjct: 541 IGLKVAIGLQLEPELAVIG 559


>gi|284161856|ref|YP_003400479.1| fibronectin-binding A domain-containing protein [Archaeoglobus
           profundus DSM 5631]
 gi|284011853|gb|ADB57806.1| Fibronectin-binding A domain protein [Archaeoglobus profundus DSM
           5631]
          Length = 626

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 83/374 (22%), Positives = 150/374 (40%), Gaps = 121/374 (32%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P  +  ++ +  + FESF  A+DEF+S     K   +  ++E + LKKL   ++  E +L
Sbjct: 231 PIELEYYRDYEKKYFESFNKALDEFYS-----KTIAETEEEESEELKKL---RKRLEIQL 282

Query: 124 SA---LEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDP 180
            +   LE+     K   + I  N  +++ A+ A RQ    ++S+E+ +A+ K  +   D 
Sbjct: 283 ESKRKLEEEAEKFKSLGDFIYENYATIEKALNAFRQ-AKEKMSFEEFKAKAKSLKFVKD- 340

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
                    +  +++ ++L       N K+                       + +DLD 
Sbjct: 341 ---------VGKDYVVIVL-------NGKE-----------------------IRLDLDK 361

Query: 241 SAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
             +  A+ +++  + A +K +     I+ T+K ++ AE+K K     ++    I   RK 
Sbjct: 362 DIHGIAESYYEKAKKAREKLEGLLIAIEKTKKEIEEAERKEK-----LKYTAPIRIVRKR 416

Query: 298 YWFEKFYWFISSENYLG------------------------------------------- 314
            WFE+F WFI+S+ +L                                            
Sbjct: 417 EWFERFRWFITSDGFLAIGGRNAQMNEEIVSKYLEPKDLFFHTQTPGAPAVVLKKGLEAP 476

Query: 315 -----NVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                  A  A + ++ W          +V ADQV K+A  GE+L  GSF I GK+N+  
Sbjct: 477 EISIVETAQFAAIYSSLWKQGLHSGEVYYVTADQVKKSAKAGEYLPKGSFYIVGKRNYIT 536

Query: 360 ---PCQLAMGISFL 370
               C + + +S L
Sbjct: 537 VELKCAVGVDLSNL 550


>gi|374630447|ref|ZP_09702832.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
           2279]
 gi|373908560|gb|EHQ36664.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
           2279]
          Length = 629

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 147/368 (39%), Gaps = 107/368 (29%)

Query: 55  PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
           P I + G  P +       P + FE+++ A+D +F   E  + ++K        L K E 
Sbjct: 234 PAITKSGCWPLIFEGE--IPEETFETYSQALDSYFGLPEVSEAEVK------KKLSKAEI 285

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +++  +  +   E+   +  EK E+I  N +++ A I+    D + ++SW++IE  +K A
Sbjct: 286 IRKRQQEAIVKFEEKITLASEKVEIIYANYQTI-ADIVKTLSDASLKMSWQEIEDILKNA 344

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
              ++P+A +IK++  +   + +LL                        D    K  +  
Sbjct: 345 ---DNPMAKMIKRVYPSEAAVDILL------------------------DGKTIKLYASE 377

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
            V+       NA R++   +   KK+   + + E+   +   + K+T  D++ +      
Sbjct: 378 GVE------GNAGRYYSEIKKFKKKKAGALVAMERFKVTERPERKRT--DIKFI------ 423

Query: 295 RKVYWFEKFYWFISSENYLGNVAWDAKV-------------------------------- 322
            K  W+ KF WF +S++ L     DA                                  
Sbjct: 424 -KPKWYHKFRWFYTSDDVLVIGGRDAGTNEDIVRKYLEGKDTFLHADIHGGSAVAVKGET 482

Query: 323 ------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP 360
                        +NAW           V  DQVSKTA +GE L  G+F+IRG++ ++  
Sbjct: 483 ECMDEAAVFAVSYSNAWKSGFYSADVYAVPRDQVSKTAESGESLKRGAFVIRGERKYYR- 541

Query: 361 CQLAMGIS 368
             +A GIS
Sbjct: 542 -NVAPGIS 548


>gi|312136934|ref|YP_004004271.1| fibronectin-binding a domain-containing protein [Methanothermus
           fervidus DSM 2088]
 gi|311224653|gb|ADP77509.1| Fibronectin-binding A domain protein [Methanothermus fervidus DSM
           2088]
          Length = 645

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 90/198 (45%), Gaps = 66/198 (33%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQTMTN 290
           V++D++ S   NA+ +++  +   KK +     ++ TEK L + +KK    +K++  +  
Sbjct: 382 VNIDINKSLEKNAEIYYEKAKKIRKKIKGAIKAMEETEKKLNNLKKKRDIEIKNI--LIP 439

Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAK---------------------------VV 323
           I K RK+ WFEKF WFISS+ +L     DA+                           V+
Sbjct: 440 IKKRRKLKWFEKFRWFISSDGFLVIGGRDAQTNEIIVKKYMEENDIYLHADIHGAPSVVI 499

Query: 324 TN----------------------AW----------WVKADQVSKTAPTGEFLTTGSFMI 351
            N                      AW          WV   QV+K+ P+GE+++ G+F+I
Sbjct: 500 KNKNKKIPENTINEAAIFAASFSKAWTYGLGSADVYWVYPQQVTKSPPSGEYISKGAFVI 559

Query: 352 RGKKNFF--PPCQLAMGI 367
           RGK+N+    P +LA+GI
Sbjct: 560 RGKRNYIRNVPIELAVGI 577


>gi|448474105|ref|ZP_21602073.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
           13560]
 gi|445818385|gb|EMA68244.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
           13560]
          Length = 731

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 60/282 (21%), Positives = 104/282 (36%), Gaps = 56/282 (19%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
           TP  + +H+G P   F+SF AAVDE+F   E    D      +  A        + K E 
Sbjct: 277 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEHEDTDAGEAPADASASRPDFEEEIAKQER 336

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +       +   ++    ++E+AEL+    + VD  +  +R   AN + W++I   +   
Sbjct: 337 IIEQQRGAIEGFDEQAAQERERAELLYAEYDLVDEVLSTVRDARANDVPWDEIADTLAAG 396

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  +  +  +   +T+ L     R  I D+      NAD              
Sbjct: 397 AERGIPAAEAVVDVDGSDGTVTVELGDDGTRVEI-DTGAGVEVNADRL------------ 443

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ-------- 286
                   Y  AKR  D K  A    ++ I+ST   L++ +++  +              
Sbjct: 444 --------YQEAKRIEDKKAGA----EQAIESTRAELEAVKERKAEWAAQQAAADDDQSD 491

Query: 287 ---------------TMTNINKARKVYWFEKFYWFISSENYL 313
                          + ++I   R   W+E+F WF ++  YL
Sbjct: 492 SEEDDDEEEHEIDWLSRSSIPIRRPEDWYERFRWFHTASGYL 533



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 605 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRVAVGV 655


>gi|389860344|ref|YP_006362583.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
 gi|388525247|gb|AFK50445.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
          Length = 644

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 38/53 (71%), Gaps = 2/53 (3%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQLAMGI 367
           AW A + +   +WV+  QVSK+ P+GE+L+ GSFM+ GK+N+   P +LA+G+
Sbjct: 510 AWKAGLASIEVFWVRGGQVSKSPPSGEYLSKGSFMVYGKRNYLSIPLELALGV 562


>gi|336476370|ref|YP_004615511.1| fibronectin-binding A domain-containing protein [Methanosalsum
           zhilinae DSM 4017]
 gi|335929751|gb|AEH60292.1| Fibronectin-binding A domain protein [Methanosalsum zhilinae DSM
           4017]
          Length = 660

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 132/320 (41%), Gaps = 36/320 (11%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKA 137
           +ESF  A+DE+F  A   K+D K    ++D     E   +  +T +S  E+      + A
Sbjct: 273 YESFNEALDEYFGKASLEKVDEKVETVKKDKAGVFERRLQQQKTAISKFEKQAEKYVQAA 332

Query: 138 ELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN---------DPVASII--- 185
           E I +  + ++    A+    +   SW +I++ +K ++            DP   II   
Sbjct: 333 EKIYSYYQDIEHITDALNNARSKGYSWSEIKSIIKSSKDSTQAAKSIINIDPGKGIIVLD 392

Query: 186 ---KQLKLNIN-----HITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
                +++NIN     +  +     K+ +  +D  +   +         E K+PS   + 
Sbjct: 393 LDGTNVEININKSIPQNAEMYYEKAKKVTRKRDGALKALEETKASMQKKEKKEPSKRKII 452

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK-----TKQTLKDVQTMTNIN 292
              S Y   + F           +    + E   K  EK+     T+     V  +    
Sbjct: 453 RKPSWYERFRWFISSDGFLVVGGRDADTNEEIVKKYMEKRDLFFHTQAPGAPVTIIKTEG 512

Query: 293 KARKVYWFEKFYWFISSENYL---GNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSF 349
           K       E+   F+ S + L   G+ A D       + VK +QVSKT  +GE+L  GSF
Sbjct: 513 KEVPSTTIEEASRFVVSYSSLWKLGHFAGDC------YMVKPEQVSKTPESGEYLKKGSF 566

Query: 350 MIRGKKNFFP--PCQLAMGI 367
           +IRG++N+F   P ++A+GI
Sbjct: 567 VIRGERNYFKNVPMRVAVGI 586


>gi|310752298|gb|ADP09459.1| FbpA and DUF814 domain protein [uncultured marine crenarchaeote
           E48-1C]
          Length = 608

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 79/377 (20%), Positives = 146/377 (38%), Gaps = 101/377 (26%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK-LENVKR--DHE 120
           P  ++ +  F  + +ESF   +DEF+    + +  L    +E  +LK+ +E +KR  + +
Sbjct: 180 PFRLKCYADFEHKCYESFNETLDEFYVRVGAIEKALTVATEEVGSLKQEMERLKRIIEMQ 239

Query: 121 TRLSALEQTQLVDKEKAELII----NNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
               A  +T + + ++   II       E++    LA R++     +W++I + V   ++
Sbjct: 240 EEACATAKTNMQENKRMGDIIHVHAGELEALLHRFLAGREE---GKAWDEIVSEVLAEKK 296

Query: 177 HNDPVASII-----KQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
                +  +     K L +++    L   +   RS          DNA  F         
Sbjct: 297 TGVKSSGFLVSFDDKHLVVDVCLDGLQFGLSLRRS--------LFDNAARF--------- 339

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNI 291
                      Y  AK+  D  + A ++  + ++  E  L+ AE     +  +V      
Sbjct: 340 --------YRRYKRAKQKLDGAKIAMEESHRKLEEVEARLEKAEAAGSVSPVEVIEEVAE 391

Query: 292 NKARKVYWFEKFYWFISSENYL--------------------GNVAWDAKVV-------- 323
            K  +  WFEKF WF+SS+  L                    G++ + A VV        
Sbjct: 392 RKIERKKWFEKFRWFVSSDGVLVVAGKDAVSNEVLVNKYATDGDIVFHADVVGAPFVVVK 451

Query: 324 -------------------------------TNAWWVKADQVSKTAPTGEFLTTGSFMIR 352
                                           + +WVK DQ+ K+A +G+++  G F++R
Sbjct: 452 MNGEKPSEECLRQAGVFAASFSRGWREGFASVDVYWVKPDQLDKSAKSGQYVPKGGFVVR 511

Query: 353 GKKNFF--PPCQLAMGI 367
           GK+N+    P +LA+GI
Sbjct: 512 GKRNWMRGSPLRLAVGI 528


>gi|448435995|ref|ZP_21587011.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
           14210]
 gi|445683155|gb|ELZ35558.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
           14210]
          Length = 743

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 67/289 (23%), Positives = 113/289 (39%), Gaps = 53/289 (18%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
           TP  + +H+  P   F+SF  AVDE+F    S   D +A     DA          + K 
Sbjct: 272 TPFPLAEHENLPSVGFDSFNDAVDEYFYRLGSE--DTEAGDAPADASASRPDFEGEIAKQ 329

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    E  +   E+    ++E+AEL+  N + VD  I  +R+   +++ W++IE  + 
Sbjct: 330 ERIIEQQEGAIEGFEEQAQAERERAELLYANYDLVDEVISTVREARESEVPWDEIEETLD 389

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
                  P A  +  +      +T+ L    +     D  V   D A     S  T +  
Sbjct: 390 AGAERGIPAAEAVVDVDGGEGTVTVELAEEPD-----DDAVDGEDGA-----SGGTTR-- 437

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQK--TIQSTEKALKS-----AEKKTKQTLKD 284
            +++D       NA R + + KR   KK+     I+ST   L++     AE + +Q   D
Sbjct: 438 -IELDASEGVEVNADRLYQEAKRVEEKKEGAVAAIESTRAELEAVKERKAEWEEQQAADD 496

Query: 285 VQ--------------------TMTNINKARKVYWFEKFYWFISSENYL 313
                                 +  +I       W+++F WF +S  YL
Sbjct: 497 GSAQGGDGDDEDDDEEYETDWLSRASIPIRSPDDWYDRFRWFHTSTGYL 545



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 35/51 (68%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V  DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 617 DGRGAGDAYMVDPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 667


>gi|448491980|ref|ZP_21608648.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
           19288]
 gi|445692198|gb|ELZ44379.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
           19288]
          Length = 729

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 63/288 (21%), Positives = 112/288 (38%), Gaps = 63/288 (21%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
           TP  + +H+G P   F++F AAVDE+F    + + D      +  A        + K E 
Sbjct: 270 TPFPLAEHEGLPSVGFDTFNAAVDEYFYRLGNEETDEGEAPADAGASRPDFEEEIAKQER 329

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +       +   E+    ++E+AEL+  + + VD  +  +R+   N++ W++I A +   
Sbjct: 330 IIEQQLGAIEGFEEQAQAERERAELLYAHYDLVDEVLSTVREARENEVPWDEIAATLDAG 389

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A+ +  +      +T+ L                           E      V
Sbjct: 390 AERGIPAAAAVVDVDGGEGTVTVEL-------------------------DEEGDGEGTV 424

Query: 235 DVDLDLS--AYANAKRFF-DLKRSAAKK--QQKTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V+LD S     NA R + + KR   KK   ++ I+ST + L++  K+ K   ++ Q   
Sbjct: 425 TVELDASEGVEVNADRLYREAKRVEEKKAGAKEAIESTREELEAV-KERKAEWEEQQAAD 483

Query: 287 ---------------------TMTNINKARKVYWFEKFYWFISSENYL 313
                                + ++I       WFE+F WF +S  YL
Sbjct: 484 DGSGGDDGGEDDEEEYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 531



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 603 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 653


>gi|448546430|ref|ZP_21626594.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
 gi|448548417|ref|ZP_21627684.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
 gi|448557611|ref|ZP_21632800.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
 gi|445702883|gb|ELZ54823.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
 gi|445714168|gb|ELZ65935.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
 gi|445714512|gb|ELZ66274.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
          Length = 702

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 62/281 (22%), Positives = 113/281 (40%), Gaps = 65/281 (23%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +R+HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    E  +   EQ    ++E+AEL+  N + VD  +  +R      + W+DI A ++
Sbjct: 313 ERIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRDAREEGVPWDDIGATLE 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +D+ +    NA R + + KR   KK+     I+ T + L++ +K+  +   D     
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELEAVKKRRDEWEADDDEDD 462

Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
                          + ++      +WFE+F WF +S  YL
Sbjct: 463 EEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503



 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF++RG + +F   P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 625


>gi|395645660|ref|ZP_10433520.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
           4140]
 gi|395442400|gb|EJG07157.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
           4140]
          Length = 635

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 83/399 (20%), Positives = 156/399 (39%), Gaps = 126/399 (31%)

Query: 47  IVANMEFHP--MILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQ 104
           ++ ++E HP  ++ + G  P    +  G   Q F+++ AA++ F+    +    +   ++
Sbjct: 225 LIRDVETHPEPVVTESGCWPV---RGLGTVQQAFDTYNAALESFYPEVPA---SVTKEEE 278

Query: 105 ERDALKKLENVKRDHETRLSALEQTQLVDKEKA-ELIINNQESVDAAILAIRQDIANQLS 163
           +R  L + E ++   ET +   E +++   EKA E I  N   V   I  + Q  +  +S
Sbjct: 279 KRPKLTREEVIRLQQETAIKKFE-SKIARAEKAVEAIYTNYPLVQEVITTL-QRASRSMS 336

Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
           W++IE  +K +   + P A  +             + VH                    A
Sbjct: 337 WQEIEKILKSS---DLPAAKAV-------------VSVHP-------------------A 361

Query: 224 DSSETKKPSLVDVDLDL--------SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE 275
           D++       VDVD+ +        S  AN +R++D  +   KK++  + + E+ +   +
Sbjct: 362 DAA-------VDVDVGMQVTIHVHESVEANVERYYDQIKKFRKKKEGALAAMERGVPKQK 414

Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSEN--YLGN------------------ 315
           +K K+TL  +          K  WF +F WF +++    LG                   
Sbjct: 415 EKPKETLHLL----------KKKWFHRFRWFYTTDGTLVLGGRDASQNEELVKRYMEGKD 464

Query: 316 -------------------------VAWDAKVVTNAW----------WVKADQVSKTAPT 340
                                    VA  A   +NAW            + DQVSKT  +
Sbjct: 465 TFVHADVHGGSVVIVKGPTEHLEDEVACFAASYSNAWKAGHFAADVYIARPDQVSKTPES 524

Query: 341 GEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           GE+++ G+F++RG++ +     L + I    K + + I 
Sbjct: 525 GEYVSRGAFIVRGERQYVRDVPLGVAIGVQLKPDVTVIG 563


>gi|320100405|ref|YP_004175997.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
 gi|319752757|gb|ADV64515.1| protein of unknown function DUF814 [Desulfurococcus mucosus DSM
           2162]
          Length = 665

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 45/176 (25%), Positives = 69/176 (39%), Gaps = 63/176 (35%)

Query: 258 KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT----NINKARKVYWFEKFYWFIS----- 308
           KK+ + +   E A+K  E+  +Q  +   TM+     + +     W+E+F+W  +     
Sbjct: 398 KKRGELVGKIESAVKVLEEMRRQLNEASSTMSIEDKTVRRLSPTLWYERFHWLFTRNGFL 457

Query: 309 -------SEN------YLGN------------------------------VAWDAKVVTN 325
                  S+N      YLG+                               ++ A   + 
Sbjct: 458 AIGGRDQSQNEMVVRKYLGDNDVFIHADIHGGSAVVLKSRGLHSVEDVVDASYLAACYSR 517

Query: 326 AW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQLAMGISFL 370
           AW          WV   QVSKT P+GE+L  G+FMI G KNF   P +LA+G  F 
Sbjct: 518 AWRAGFSFIEVFWVPGSQVSKTPPSGEYLPRGAFMIYGSKNFLSIPLRLAVGARFF 573


>gi|14601515|ref|NP_148055.1| hypothetical protein APE_1611 [Aeropyrum pernix K1]
 gi|5105298|dbj|BAA80611.1| conserved hypothetical protein [Aeropyrum pernix K1]
          Length = 650

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 81/204 (39%), Gaps = 70/204 (34%)

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN--INKARKVYWFEKF 303
            K    L R A + + K  +  EKA   A  + ++ ++  +  +   I + RK +WFEK+
Sbjct: 363 GKLILRLYREAGELEAKA-ERAEKAFAEARSRLEEAVRRARLRSLRRIIEGRKRFWFEKY 421

Query: 304 YWFISS------------------ENYLGNV---------------------------AW 318
           +W I+                   + YLG+                             +
Sbjct: 422 HWTITRNGFLAIGGRDAGQNESVVKRYLGDDDIFLHADIHGAPATVLLTRRLQPGDDDIY 481

Query: 319 DAKVVTNAW--------------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQ 362
           DA V+  A+              WV   QVSK+ P GE+L  G+FM+ GK+N+    P +
Sbjct: 482 DAAVLAAAYSRAWKAGAGGVSVYWVYGSQVSKSPPAGEYLARGAFMVYGKRNYIHHVPLK 541

Query: 363 LAMGI------SFLFKLEESSISR 380
           LA+GI       ++    E ++SR
Sbjct: 542 LALGIVMHKDVPYIVAGSEEAVSR 565


>gi|432328279|ref|YP_007246423.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
           sp. MAR08-339]
 gi|432134988|gb|AGB04257.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
           sp. MAR08-339]
          Length = 596

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 47/195 (24%), Positives = 81/195 (41%), Gaps = 66/195 (33%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKAL-KSAEKKTKQTLKDVQTMTNIN 292
           V++ +D S   NA  +++     AKK ++ I+  ++AL K+ E+       + +    I 
Sbjct: 330 VELRVDKSVGENASLYYE----KAKKMREKIKGAQQALEKAKEELKSVKKMEEKKKREIR 385

Query: 293 KARKVYWFEKFYWFISSENYLGNVAWDAKV------------------------------ 322
           K+R+ +WFEK+ WFISSE+ L     DAK                               
Sbjct: 386 KSRRRFWFEKYRWFISSEDILVIAGRDAKTNEEVVKKHLGDKDLYMHADIHGAPSVVIKS 445

Query: 323 -------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRG 353
                              ++ AW          WV   QVSK   +GE++  G++++ G
Sbjct: 446 EGKEIGEKTLYEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVVHG 505

Query: 354 KKNFF--PPCQLAMG 366
           ++N+    P +LA+G
Sbjct: 506 RRNYIHKVPLRLAVG 520


>gi|448424081|ref|ZP_21582207.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
           10247]
 gi|448478971|ref|ZP_21603977.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
 gi|445682746|gb|ELZ35159.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
           10247]
 gi|445822801|gb|EMA72563.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
          Length = 735

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 64/289 (22%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF-------STAESHKIDLKAVQQE-RDALKKLEN 114
           TP  + +H+  P   F+SF AAVDE+F       +       D  A + +  + + K E 
Sbjct: 275 TPFPLAEHEDLPSVGFDSFNAAVDEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQER 334

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +    +  +   E+    ++E+AEL+  + + VD  I  +R+   N++ W++IE  +   
Sbjct: 335 IIEQQKGAIEGFEEQAQAERERAELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAG 394

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  +  +      +T+ L                           E      V
Sbjct: 395 AERGIPAAEAVAGVDGGEGTVTVEL-------------------------EEEGDDGGTV 429

Query: 235 DVDLDLS--AYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLKDVQTM- 288
            V+LD S     NA R + + KR   KK+  ++ I+ST + L++  K+ K+  ++ Q   
Sbjct: 430 TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREELEAV-KERKREWEEQQAAD 488

Query: 289 ------------------------TNINKARKVYWFEKFYWFISSENYL 313
                                   ++I       WFE+F WF +S  YL
Sbjct: 489 DGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWFHTSTGYL 537



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 35/51 (68%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F    C++A+G+
Sbjct: 609 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVACRIAVGV 659


>gi|219852170|ref|YP_002466602.1| hypothetical protein Mpal_1566 [Methanosphaerula palustris E1-9c]
 gi|219546429|gb|ACL16879.1| protein of unknown function DUF814 [Methanosphaerula palustris
           E1-9c]
          Length = 629

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 81/370 (21%), Positives = 147/370 (39%), Gaps = 108/370 (29%)

Query: 55  PMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLEN 114
           P++   G+ P V+      PI    +F+ A++  +     H+       Q++  + + E 
Sbjct: 234 PVVTSGGAWPIVLTGMT--PISHHPTFSEALEAIYPLVTRHE-----GPQKKAPIPREER 286

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQ-LSWEDIEARVKQ 173
           ++   E  L + ++  +++K   +LI  N   V   I  +  D A++ LSW++I + +K+
Sbjct: 287 IRLQQEAALKSFDKKIVLNKAIVDLIYENYTLVTDVIKTL--DAASKTLSWQEIGSMLKE 344

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
           +   ND VA  I  +      + LLL   K   ++ +S                      
Sbjct: 345 SD--ND-VARQIAGVHPAEAAVDLLLDGKKVLIHVHES---------------------- 379

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           ++V+L+   YA  K+F        KK+   + + E+ +  A+K T +       +T + K
Sbjct: 380 IEVNLE-RYYAQVKKF-------KKKRDGAVSAMERPV--AKKATSKV-----HLTPLKK 424

Query: 294 ARKVYWFEKFYWFISSEN--YLGN------------------------------------ 315
                W+ +F WF +S+N   LG                                     
Sbjct: 425 R----WYHRFRWFFTSDNCLVLGGRDAGQNEELVKRYMEGGDTFVHADVHGASVVIVKGK 480

Query: 316 ------VAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                 VA  A   + AW           V+ DQVSKT   GEF++ GSF++RG++ +F 
Sbjct: 481 TEQMDEVAQFAASYSGAWRSGHFSADVYAVRPDQVSKTPEAGEFVSRGSFIVRGERTYFK 540

Query: 360 PCQLAMGISF 369
              L + I +
Sbjct: 541 SVPLGVAIGY 550


>gi|448583074|ref|ZP_21646543.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
 gi|445730031|gb|ELZ81623.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
          Length = 702

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/281 (21%), Positives = 113/281 (40%), Gaps = 65/281 (23%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +R+HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSNRPDFEEEIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + +    E  +   EQ    ++E+AEL+  N + VD  +  +R      + W+DI AR++
Sbjct: 313 QRIIDQQEGAIEGFEQQAEDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAARLE 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTIEL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +++ +    NA R + + KR   KK+     I+ T + L + +K+  +   D     
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDEDD 462

Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
                          + ++      +WFE+F WF +S  YL
Sbjct: 463 EEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYL 503



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF+IRG + +F     ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVAAKVAVGI 625


>gi|374635672|ref|ZP_09707266.1| Fibronectin-binding A domain protein [Methanotorris formicicus
           Mc-S-70]
 gi|373561525|gb|EHP87758.1| Fibronectin-binding A domain protein [Methanotorris formicicus
           Mc-S-70]
          Length = 673

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 36/55 (65%), Gaps = 3/55 (5%)

Query: 316 VAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + W A    + +WVK +Q+SKTA +GE+L  G+F+IRGK+NF     L +GI  +
Sbjct: 554 LGWGA---LDTYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRNVPLELGIGVI 605


>gi|297619525|ref|YP_003707630.1| Fibronectin-binding A domain-containing protein [Methanococcus
           voltae A3]
 gi|297378502|gb|ADI36657.1| Fibronectin-binding A domain protein [Methanococcus voltae A3]
          Length = 722

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/364 (23%), Positives = 139/364 (38%), Gaps = 100/364 (27%)

Query: 78  FESFAAAVDEFFSTAESHKIDLKAVQQERDAL-KKLENVKRDHETRLSALEQTQLVDKEK 136
           +E++  A+DE+FS     K D+K  + + D L +K E +        +  E+    + +K
Sbjct: 311 YENYLNALDEYFSQFILQK-DIKKEETKLDKLIRKQERIVNSQIETKAKYEKQSAKNHQK 369

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASII-----KQLKLN 191
            +LI  N   +D  I  IR     ++ W+ I+  V  ++  ++P+ S I     K  +LN
Sbjct: 370 GDLIYANFTEIDEIINTIRS-AREKMEWKQIKKIV--SENKDNPILSKIESINEKNAELN 426

Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFD 251
           +    L+     E   IK +                      V +D+  SA+ NA  ++ 
Sbjct: 427 LK---LIAEYGGELGTIKGN----------------------VAIDIRESAFENANSYYT 461

Query: 252 LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN------KARKVYWFEKF-- 303
             +    K    I + E + K  EK  +QT  D + +          + R + W+EK   
Sbjct: 462 KAKKFKNKVSGVIVALEISQKKLEKIRQQTELDAELLKQKQQNIKKKERRVLKWYEKLKW 521

Query: 304 -----YWFIS------------------------------------------SENYLGNV 316
                Y  I+                                          SE  L  V
Sbjct: 522 TIIDNYLIIAGKDATTNEIIVKKYLEKNDVVFHTLMEGAPFTVIKNTSEETPSEETLLEV 581

Query: 317 AWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMG 366
           A  A   + AW          WV  +Q+SKTA +GEFL  G+F+IRGK+NF     L +G
Sbjct: 582 AKFAVSHSKAWKLGLGSADVYWVLPEQISKTAESGEFLKKGAFVIRGKRNFIRSAPLDLG 641

Query: 367 ISFL 370
           +  +
Sbjct: 642 VGIV 645


>gi|298675852|ref|YP_003727602.1| fibronectin-binding A domain-containing protein [Methanohalobium
           evestigatum Z-7303]
 gi|298288840|gb|ADI74806.1| Fibronectin-binding A domain protein [Methanohalobium evestigatum
           Z-7303]
          Length = 670

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 119/275 (43%), Gaps = 72/275 (26%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFS------------TAESHKIDL--KAVQQERDAL 109
           P  + Q++ +  Q F+SF  A+DEFF             +A+  K+D+  + +QQ++ A+
Sbjct: 262 PFELEQYKEYEKQYFDSFNKALDEFFGKEVVEAERKIQESAKKEKVDIYQRRLQQQQGAI 321

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
           +K E     + +               AE I ++   V+  I  +     +  SW+DI++
Sbjct: 322 EKFEKEANKYNS--------------IAEAIYSHYPFVEEVITVLTNARKSGYSWDDIKS 367

Query: 170 RVKQAQRHND-PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSET 228
           ++K+A   ND P A +I+ +      I +                          D   T
Sbjct: 368 KLKEA---NDIPSAKLIQSIDPKSGTIVM--------------------------DLDGT 398

Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQ---KTIQSTEKALKSAEKKTKQTLKDV 285
           K      +D+  S   NA+ +++  +   KK++   + I+ T++ +++ +K  +QT K  
Sbjct: 399 K----ATLDIRYSVPQNAQTYYEKAKRVMKKREGALRAIEETKRIIENRDKPQQQTRKR- 453

Query: 286 QTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
                    RK +W+ +F WFISS+ +L     DA
Sbjct: 454 ------KVIRKKHWYSRFRWFISSDGFLVVGGRDA 482



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 32/47 (68%), Gaps = 2/47 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
           + +WV  +QVSKT  +GEFL  GSF+IRG +N+F   P  +A+G+  
Sbjct: 546 DCYWVYPNQVSKTPESGEFLKKGSFIIRGDRNYFKNVPVSVAIGLEL 592


>gi|315231919|ref|YP_004072355.1| RNA-binding protein [Thermococcus barophilus MP]
 gi|315184947|gb|ADT85132.1| RNA-binding protein [Thermococcus barophilus MP]
          Length = 650

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/312 (21%), Positives = 131/312 (41%), Gaps = 37/312 (11%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ ++ +  + FE+F+ A+DE+F      K  ++  ++  +  +++    R  E +
Sbjct: 242 VPIELKWYENYEKKYFETFSEALDEYFGKITVEKAKIERTKRLEEKKRQILATLRRQEEQ 301

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +   E     ++E  +LI  N   +D  +    + +  +L WE+ + R+++ ++  + +A
Sbjct: 302 MKGFEAEMKKNQELGDLIYANFTFIDNLLREFSKAV-EKLGWEEFKKRIEEGKKAGNKIA 360

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            ++K +      +T+ +   K +  +  S     +NA+ + + ++  K  L         
Sbjct: 361 LMVKSIDPKEKAVTIEIEGRKIKLYLNKSI---GENAEIYYEKAKKAKHKL--------- 408

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
              AKR       A +  +K +Q  EK ++   KK  +  K  +            WFEK
Sbjct: 409 -EGAKR-------AYEDTKKKLQEIEKLIEEEMKKELKVKKLEKRKKK--------WFEK 452

Query: 303 FYWFISSENYLGNVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMI------RGKKN 356
           F WFISSE +L     DA   TN   VK             +     ++       G+K 
Sbjct: 453 FRWFISSEGFLVIGGKDA--TTNEMVVKRHMGDNDLYCHADVHGAPHVVIKDGQKAGEKT 510

Query: 357 FFPPCQLAMGIS 368
            F  CQ A+ +S
Sbjct: 511 IFEACQFAVSMS 522



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 38/54 (70%), Gaps = 3/54 (5%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           AW   V + +A+W   +QV+K AP+GE+L  G+FM+ GK+N++   P +LA+GI
Sbjct: 524 AWSEGVYSADAYWAYPNQVTKKAPSGEYLGKGAFMVYGKRNWYHGIPLKLAVGI 577


>gi|150399105|ref|YP_001322872.1| hypothetical protein Mevan_0351 [Methanococcus vannielii SB]
 gi|150011808|gb|ABR54260.1| protein of unknown function DUF814 [Methanococcus vannielii SB]
          Length = 680

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW   V + + +WV  DQ+SKTA +GE+L  G+F+IRGK+NF     L +G+   
Sbjct: 558 AWKLGVGSADVYWVMPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALDLGVGIF 612



 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 59/257 (22%), Positives = 110/257 (42%), Gaps = 46/257 (17%)

Query: 64  PSVMRQHQGFP---IQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHE 120
           P  ++ H+  P   I+ +ESF  A+DE+FS      I  K ++Q    + KL  VK+   
Sbjct: 262 PISLKIHEKIPENEIKNYESFLVALDEYFSRF----IIKKEIKQAETKINKL--VKKQER 315

Query: 121 TRLSALEQTQ------LVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
              S LE  +      ++++EK +LI  N   VD  +  IR     ++ W  I+  + + 
Sbjct: 316 ILNSQLETKEKYEKQSVLNQEKGDLIYANYMDVDEILSTIRS-AREKMDWNAIKEVINKN 374

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
           + H      I+ ++ +++N             N + S  L  D  +   + +       V
Sbjct: 375 KDH-----QILGKI-ISVNE-----------KNAEISLKLSLDYGNGIIEKN-------V 410

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQS---TEKA---LKSAEKKTKQTLKDVQTM 288
            +DL  +A+ +A  F+   +    K    I++   +EK    LK  EK   + L++ +  
Sbjct: 411 VLDLRKNAFESADDFYQKSKKFKNKVSGVIEALKISEKKLNELKEKEKTDSEVLREKEEN 470

Query: 289 TNINKARKVYWFEKFYW 305
               + + + W+EK  W
Sbjct: 471 IKKKEKKLLKWYEKLKW 487


>gi|336122066|ref|YP_004576841.1| Fibronectin-binding A domain-containing protein
           [Methanothermococcus okinawensis IH1]
 gi|334856587|gb|AEH07063.1| Fibronectin-binding A domain protein [Methanothermococcus
           okinawensis IH1]
          Length = 684

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 33/46 (71%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + +WVK +Q+SKTA +GE+L  G+F+IRGK+NF     L +GI  +
Sbjct: 572 DVYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRSVPLELGIGIV 617


>gi|448512226|ref|ZP_21616340.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           9100]
 gi|448520849|ref|ZP_21618182.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           10118]
 gi|445694546|gb|ELZ46671.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           9100]
 gi|445702985|gb|ELZ54924.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           10118]
          Length = 735

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 64/289 (22%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF-------STAESHKIDLKAVQQE-RDALKKLEN 114
           TP  + +H+  P   F+SF AAVDE+F       +       D  A + +  + + K E 
Sbjct: 275 TPFPLAEHEDLPSVGFDSFNAAVDEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQER 334

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +    +  +   E+    ++E+AEL+  + + VD  I  +R+   N++ W++IE  +   
Sbjct: 335 IIEQQKGAIEGFEEQAQAERERAELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAG 394

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  +  +      +T+ L                           E      V
Sbjct: 395 AERGIPAAEAVVGVDGGEGTVTVEL-------------------------EEEGDDGGTV 429

Query: 235 DVDLDLS--AYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLKDVQTM- 288
            V+LD S     NA R + + KR   KK+  ++ I+ST + L++  K+ K+  ++ Q   
Sbjct: 430 TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREELEAV-KERKREWEEQQAAD 488

Query: 289 ------------------------TNINKARKVYWFEKFYWFISSENYL 313
                                   ++I       WFE+F WF +S  YL
Sbjct: 489 DGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWFHTSTGYL 537



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 35/51 (68%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F    C++A+G+
Sbjct: 609 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVACRIAVGV 659


>gi|408381973|ref|ZP_11179520.1| fibronectin-binding A domain-containing protein [Methanobacterium
           formicicum DSM 3637]
 gi|407815421|gb|EKF86006.1| fibronectin-binding A domain-containing protein [Methanobacterium
           formicicum DSM 3637]
          Length = 711

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 71/266 (26%), Positives = 118/266 (44%), Gaps = 42/266 (15%)

Query: 70  HQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--RDALKKLENVKRDHETRLSALE 127
           +Q F  + FE+F  A DEF+S       D+K VQ++     + K E   R  E  L   +
Sbjct: 310 YQNFHKERFETFNQAADEFYSGKVG--ADIKKVQEDIWAKEVGKYEKRLRIQEETLEKFQ 367

Query: 128 QTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQ 187
           +T +  K K  LI ++   +   +L I      + SW +I +++K+A++     A II+ 
Sbjct: 368 KTIVETKRKGNLIYSHYSEIQ-NLLDIIHQAREKFSWMEIASKLKKARKEGMVQAQIIQS 426

Query: 188 L-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANA 246
           + K+ +  +TL L                           ET     V VD +L    NA
Sbjct: 427 MDKMGV--LTLNL-------------------------EGET-----VTVDANLEIPENA 454

Query: 247 KRFFDLKRSAAKKQQKTIQSTEKALKSAE-KKTKQTLKDVQTMTNINKARK-VYWFEKFY 304
           +++++  + A +K +    + E+  K  E K+ K+ L   +      + RK + WFEK  
Sbjct: 455 EKYYNKGKKAKRKIKGVNMAIERTKKDVERKRNKRELALERVRVPQKRVRKELKWFEKLR 514

Query: 305 WFISSENYLGNVAWDAKVVTNAWWVK 330
           WF+SS+ +L     DA   TN   VK
Sbjct: 515 WFLSSDGFLVIGGRDAG--TNEMVVK 538



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 32/45 (71%), Gaps = 2/45 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +WV  DQVSKT  +GEF+  G+F+IRG +N+    P ++A+GI
Sbjct: 595 DVYWVHPDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIAVGI 639


>gi|269865201|ref|XP_002651841.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063780|gb|EED42216.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 142

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/41 (58%), Positives = 31/41 (75%)

Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKL 373
           QVSKTAP+GEFL  GSFMI+GKKN   P +L  G+  +F++
Sbjct: 8   QVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRI 48


>gi|84489327|ref|YP_447559.1| RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
 gi|84372646|gb|ABC56916.1| predicted RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
          Length = 666

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/45 (55%), Positives = 33/45 (73%), Gaps = 2/45 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           +A+WV  DQVSKT  +GEFL  G+F+IRGKKNF    P  +A+G+
Sbjct: 541 DAYWVTLDQVSKTPQSGEFLKKGAFVIRGKKNFIRNVPVLIAIGV 585



 Score = 45.1 bits (105), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 53/263 (20%), Positives = 108/263 (41%), Gaps = 38/263 (14%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  + ++  F  + FE+F  A DEF+S      I  K  +     + K E   +  E  
Sbjct: 247 VPITLNKYAQFKSKSFETFNMAADEFYSKKIVSDIKNKEEKLWAKRIGKFEKRLKMQEET 306

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND-PV 181
           L    +T    + K + I  +   +   I  I Q   N  SW++I + +K++++    P 
Sbjct: 307 LEGFYKTIEDKQHKGDTIYAHYNEIQQIINVIHQAREN-YSWKEIGSIIKKSKKEGKIPE 365

Query: 182 ASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
             +I+ + K+ + ++ L                                  + V +D ++
Sbjct: 366 LEMIESIDKMGVINLKL--------------------------------DDTHVQIDSNI 393

Query: 241 SAYANAKRFFDLKRSAAKK---QQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKV 297
               + +++++  + A +K     K I++T+  +K  E K +  ++ ++      + R++
Sbjct: 394 GIPESTEKYYNKGKKAKRKIDGVNKAIENTKSEIKKLEDKKEVAIELLRQKQEKREKREL 453

Query: 298 YWFEKFYWFISSENYLGNVAWDA 320
            W+EK  WFIS + YL     DA
Sbjct: 454 KWYEKLRWFISRDGYLVIGGRDA 476


>gi|296109018|ref|YP_003615967.1| Fibronectin-binding A domain protein [methanocaldococcus infernus
           ME]
 gi|295433832|gb|ADG13003.1| Fibronectin-binding A domain protein [Methanocaldococcus infernus
           ME]
          Length = 666

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 32/46 (69%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + +WVK DQ+SK A +GE+L  G+F+IRGK+N+     L +GI  L
Sbjct: 553 DTYWVKPDQISKRAESGEYLKRGAFVIRGKRNYIRNVPLELGIGVL 598



 Score = 38.9 bits (89), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 59/274 (21%), Positives = 124/274 (45%), Gaps = 39/274 (14%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  +R+++ +  + F SF  A+DE+F+   +     K   +    ++K E++ R     
Sbjct: 257 VPIELRKYKDYEKRYFNSFYEALDEYFAKFLTSVEIKKEKSKLEKEIEKQESILRRQLET 316

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L A E+    ++ K +LI +N + V+  + AIR    ++  WE+++  +++ + H  P+ 
Sbjct: 317 LKAYEEEVRKNQIKGDLIYSNYQLVEEILNAIRV-AKDKKGWEEVKRVIRENKEH--PII 373

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I                  E  N K   ++   ++D      + K    V +D+  S 
Sbjct: 374 KLI------------------EGVNEKKGEIIVRLSSD-----LDGKIEERVVLDIRKST 410

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSA--EKKTKQTLKDV----QTMTNINKARK 296
           + NA+ +++     AKK +  I+  +KA++ +  + +  +  +DV    +        ++
Sbjct: 411 FENAESYYN----KAKKFKSKIEGIKKAIEMSKKKLEELKKKRDVEIEEKKALKKKVKKE 466

Query: 297 VYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
             W+EKF W + + N+L     DA  +TN   +K
Sbjct: 467 RKWYEKFKWTVIN-NFLVIAGKDA--ITNEIIIK 497


>gi|374632982|ref|ZP_09705349.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
           yellowstonensis MK1]
 gi|373524466|gb|EHP69343.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
           yellowstonensis MK1]
          Length = 602

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 84/198 (42%), Gaps = 68/198 (34%)

Query: 234 VDVDLDLSAYANAKRFFDLKR---SAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
           V++D +LS    A  +F+  +   S A++ ++TI   +K ++  + K ++T ++ +++  
Sbjct: 324 VEIDPNLSLTRVASSYFERAKELESKARRAEETIAELKKKVEELKLKLRET-EESKSLV- 381

Query: 291 INKARKVYWFEKFYWFISSENYL------------------------------------- 313
               RK  W+EK+ W  +  NYL                                     
Sbjct: 382 ---IRKKEWYEKYRWSFTRNNYLVIAGRDVDQNESLVKKMLGEEEIFLHADIQGAPATII 438

Query: 314 --------GNVAWDAKVV----TNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                   G++ +DA VV    + AW          WV   QVSK+ P+GE+L  GSFMI
Sbjct: 439 KDSKGVQEGDI-YDAAVVAACYSKAWKLGLGSVDVFWVYGSQVSKSPPSGEYLPKGSFMI 497

Query: 352 RGKKNFFPPCQLAMGISF 369
            GKKNF    +L + I  
Sbjct: 498 YGKKNFIKNVRLELAIGL 515


>gi|222479900|ref|YP_002566137.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
           49239]
 gi|222452802|gb|ACM57067.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
           49239]
          Length = 733

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 607 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 657



 Score = 38.1 bits (87), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 54/289 (18%), Positives = 109/289 (37%), Gaps = 58/289 (20%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF-----STAESHKIDLKAVQQERD---ALKKLEN 114
           TP  + +H+G P   F+SF AAVDE+F       +E  +    A     D    + K E 
Sbjct: 267 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLDRDGSEEGEAPADASPSRPDFEEEIGKQER 326

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +    +  +   E+    ++E+AEL+    + VD  +  +++    ++ W++I   +   
Sbjct: 327 IVEQQQGAIEGFEEQAEAERERAELLYAEYDLVDEVLSTVQEAREAEVPWDEIAETLDAG 386

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  +  +      +T+ LR      +  ++                    + +
Sbjct: 387 AEQGIPAAETVVDVDGGEGTVTVELRGGDGEDDDGET--------------------TRI 426

Query: 235 DVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLK-------- 283
           ++D       NA R + + KR   KK+   + I+ST   L++ +++  +           
Sbjct: 427 ELDASAGVEVNADRLYQEAKRIEGKKEGAMEAIKSTRAELEAVKERKAEWEAKEAAADET 486

Query: 284 -------------------DVQTMTNINKARKVYWFEKFYWFISSENYL 313
                              D  + ++I       W+++F WF +S  YL
Sbjct: 487 AGDGADDGEEEEDGEEYQTDWLSRSSIPIRSPDDWYDRFRWFYTSTGYL 535


>gi|292656996|ref|YP_003536893.1| hypothetical protein HVO_2883 [Haloferax volcanii DS2]
 gi|448293595|ref|ZP_21483700.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
 gi|291371020|gb|ADE03247.1| conserved protein [Haloferax volcanii DS2]
 gi|445570456|gb|ELY25019.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
          Length = 702

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/281 (21%), Positives = 112/281 (39%), Gaps = 65/281 (23%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +R+HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + +    E  +   EQ    ++E+AEL+  N + VD  +  +R      + W+DI A ++
Sbjct: 313 QRIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLE 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +D+ +    NA R + + KR   KK+     I+ T + L + +K+  +   D     
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDGDED 462

Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
                          + ++      +WFE+F WF +S  YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503



 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF++RG + +F   P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 625


>gi|433431126|ref|ZP_20407596.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
 gi|448568141|ref|ZP_21637718.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
 gi|448601017|ref|ZP_21656300.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
 gi|432194170|gb|ELK50822.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
 gi|445727091|gb|ELZ78705.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
 gi|445734620|gb|ELZ86178.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
          Length = 702

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/281 (21%), Positives = 112/281 (39%), Gaps = 65/281 (23%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +R+HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + +    E  +   EQ    ++E+AEL+  N + VD  +  +R      + W+DI A ++
Sbjct: 313 QRIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLE 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +D+ +    NA R + + KR   KK+     I+ T + L + +K+  +   D     
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDGDDD 462

Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
                          + ++      +WFE+F WF +S  YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503



 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF++RG + +F   P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 625


>gi|448454957|ref|ZP_21594359.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
           21995]
 gi|445814337|gb|EMA64302.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
           21995]
          Length = 736

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 610 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 660



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 63/308 (20%), Positives = 121/308 (39%), Gaps = 60/308 (19%)

Query: 42  NSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKA 101
            SED   A+ +  P ++    TP  + +H+G P   F+SF AAVDE+F   E    D   
Sbjct: 255 GSED---ADGDGDPRVVD--VTPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEREDGDAGE 309

Query: 102 VQQE--------RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILA 153
              +         + + K E +       +   E+    ++E+AEL+    + VD  +  
Sbjct: 310 APADASPSRPEFEEEIAKQERIIEQQRGAIEGFEEQAEAERERAELLYARYDLVDEVLST 369

Query: 154 IRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWV 213
           +R+   N++ W++I   ++       P A  +  +      +T+ L    +R   +D   
Sbjct: 370 VREARENEVPWDEIAETLEAGAERGIPAAEAVADVDGGEGTVTVEL----DREGGED--- 422

Query: 214 LHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKA 270
                        E+     V++D       NA R + + KR   KK+   + I+ST + 
Sbjct: 423 ------------GESGDSVRVELDASTGVEVNADRLYQEAKRIEGKKEGAMEAIESTRRE 470

Query: 271 LKSAEKKTK--QTLKDVQ-----------------------TMTNINKARKVYWFEKFYW 305
           L++ E++    + ++                          + ++I       W+++F W
Sbjct: 471 LEAVEERKAEWEAMEAADDGDGDGGDSEDEDDEEEYETDWLSRSSIPIRSPDDWYDRFRW 530

Query: 306 FISSENYL 313
           F +S  YL
Sbjct: 531 FHTSTGYL 538


>gi|448528898|ref|ZP_21620278.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
           700873]
 gi|445710346|gb|ELZ62165.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
           700873]
          Length = 740

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 35/51 (68%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V  DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 614 DGRGAGDAYMVDPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 664



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 62/292 (21%), Positives = 112/292 (38%), Gaps = 59/292 (20%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
           TP  + +H+  P   F+SF AAVDE+F    S   D +A     DA          + K 
Sbjct: 269 TPFPLAEHEDLPSVGFDSFNAAVDEYFYRLGSE--DAEAGDAPADASASRPDFEGEIAKQ 326

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + +    E  +   E+    ++E+AEL+  N + VD  +  +R+   +++ W++IE  + 
Sbjct: 327 QRIIEQQEGAIEGFEEQAQAERERAELLYANYDLVDEVLSTVREARESEVPWDEIEETLD 386

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
                  P A  +  +      +T+ L                 D +   AD        
Sbjct: 387 AGAERGIPAAEAVVDVDGGEGTVTVEL----------------ADESGDDADDEGGANGG 430

Query: 233 LVDVDLDLS--AYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKK---------- 277
              ++LD S     NA R + + KR   KK+     I+ST + L++ +++          
Sbjct: 431 TTRIELDASEGVEVNADRLYQEAKRVEEKKEGAMAAIESTREELEAVKERKAEWEEQQAA 490

Query: 278 ----------------TKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
                            ++   D  +  +I       W+++F WF +S  YL
Sbjct: 491 NDGSGQGDDGDDGADDEEEYETDWLSRASIPIRSPDNWYDRFRWFHTSTGYL 542


>gi|448448413|ref|ZP_21591226.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
           13561]
 gi|445814829|gb|EMA64787.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
           13561]
          Length = 736

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 62/288 (21%), Positives = 117/288 (40%), Gaps = 61/288 (21%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF-------STAESHKIDLKAVQQE-RDALKKLEN 114
           TP  + +H+  P   F+SF AAVDE+F       +       D  A + +  + + K E 
Sbjct: 275 TPFPLAEHEDLPSVGFDSFNAAVDEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQER 334

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +    +  +   E+    ++E+AEL+  + + VD  I  +R+   N++ W++IE  +   
Sbjct: 335 IIEQQKGAIEGFEEQAQAERERAELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAG 394

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  +  + ++    T+ + + +E            D+    A          V
Sbjct: 395 AERGIPAAEAV--VGVDGGEGTVTVELEEE-----------GDDGGTMA----------V 431

Query: 235 DVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQTLKDVQTM--- 288
           ++D       NA R + + KR   KK+  ++ I+ST + L++  K+ K+  ++ Q     
Sbjct: 432 ELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREELEAV-KERKREWEEQQAADDG 490

Query: 289 -----------------------TNINKARKVYWFEKFYWFISSENYL 313
                                  ++I       WFE+F WF +S  YL
Sbjct: 491 SGGDGGDNEGEEDEEYETDWLARSSIPIRSPDDWFERFRWFHTSTGYL 538



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 35/51 (68%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F    C++A+G+
Sbjct: 610 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVACRIAVGV 660


>gi|302347972|ref|YP_003815610.1| fibronectin-binding protein [Acidilobus saccharovorans 345-15]
 gi|302328384|gb|ADL18579.1| Predicted fibronectin-binding protein [Acidilobus saccharovorans
           345-15]
          Length = 647

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 3/56 (5%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGISF 369
           AW +     + +W    QVSK+ P+GE+LT GSFM+ GKKN+  P   +L +GI+ 
Sbjct: 501 AWKEGMASVSVYWAYGSQVSKSPPSGEYLTKGSFMVYGKKNYLRPLRLELYLGIAL 556


>gi|448470211|ref|ZP_21600408.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
 gi|445808289|gb|EMA58361.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
          Length = 735

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 609 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRVAVGV 659



 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 64/292 (21%), Positives = 113/292 (38%), Gaps = 68/292 (23%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
           TP  + +H+G P   F+SF AAVDE+F   E+   D      +  A        + K E 
Sbjct: 273 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEAEDTDAGEAPADASASRPDFEEEIAKQER 332

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +       +   E+    ++E+AEL+    + VD  +  +++    ++ W++I   +   
Sbjct: 333 IIEQQRGAIEGFEEQAEAERERAELLYAEYDLVDEVLSTVQEAREAEVPWDEIAETLDAG 392

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  +  +      +T+ L                 D+ D   DS        V
Sbjct: 393 ADRGIPAAEAVVDVDGGEGTVTVEL-----------------DDED--GDS--------V 425

Query: 235 DVDLDLSAY--ANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKK------------ 277
            V+LD SA    NA R + + KR   KK+   + I+ST + L++ +++            
Sbjct: 426 RVELDASAGVEVNADRLYQEAKRIEGKKEGAMEAIESTRRELEAVKERKAEWEAKEAAAD 485

Query: 278 ----------------TKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
                            ++   D  + ++I       WFE+F WF +S  YL
Sbjct: 486 ETPGGGGDGDGDDDADDEEYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 537


>gi|332796292|ref|YP_004457792.1| hypothetical protein Ahos_0606 [Acidianus hospitalis W1]
 gi|332694027|gb|AEE93494.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 566

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 1/54 (1%)

Query: 317 AWDAKVV-TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           AW   +   + +WV   QVSK+ P GE+L  GSFMI GKKN+    +L + I F
Sbjct: 433 AWKLGLAAVDVFWVYGSQVSKSPPAGEYLPKGSFMIYGKKNYIKSVKLNLAIGF 486


>gi|448439536|ref|ZP_21588100.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
           1137]
 gi|445691070|gb|ELZ43265.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
           1137]
          Length = 733

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           D +   +A+ V+ DQVSKT  +GE++  GSF+IRG + +F   PC++A+G+
Sbjct: 607 DGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRVAVGV 657



 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 62/289 (21%), Positives = 108/289 (37%), Gaps = 66/289 (22%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA--------LKKLEN 114
           TP  + +H+G P   F+SF AAVDE+F   E  + D      +  A        + K E 
Sbjct: 275 TPFPLSEHEGLPSVGFDSFNAAVDEYFYRLEHEESDAGEAPTDASASRPDFEEEIAKQER 334

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +    +  +   E+    ++E+AEL+    + VD  +  +++   N + W++I   +   
Sbjct: 335 IIEQQKGAIEGFEEQAEAERERAELLYAEYDLVDEVLSTVQEARENDVPWDEIAETLDAG 394

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  +  +      +T+ L     R                            V
Sbjct: 395 AERGIPAAEAVVDVDGGEGTVTVELGEDDTR----------------------------V 426

Query: 235 DVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKS-----AEKKTKQTLKDVQ 286
           ++D       NA R + + KR   KK+   + I+ST + L++     AE K K+   D +
Sbjct: 427 ELDASAGVEVNADRLYQEAKRIEGKKEGAMEAIESTRQDLEAVKERKAEWKAKEAADDEE 486

Query: 287 ----------------------TMTNINKARKVYWFEKFYWFISSENYL 313
                                 + ++I       WFE+F WF +S  YL
Sbjct: 487 GGSDAGGGEGDEGEEEYETDWLSRSSIPIRSPDDWFERFRWFRTSTGYL 535


>gi|435850617|ref|YP_007312203.1| putative RNA-binding protein, snRNP like protein
           [Methanomethylovorans hollandica DSM 15978]
 gi|433661247|gb|AGB48673.1| putative RNA-binding protein, snRNP like protein
           [Methanomethylovorans hollandica DSM 15978]
          Length = 664

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 66/282 (23%), Positives = 120/282 (42%), Gaps = 53/282 (18%)

Query: 52  EFHPMILQPGS---------TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAV 102
           E  P I+  G           P  +  ++    + + SF  A+DE+F   E+  I  +A 
Sbjct: 239 ELDPCIIMKGEGDTAQSIDVVPFELEVYRELTKERYPSFNKALDEYFGKREAASITEQAF 298

Query: 103 QQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAA--ILAIRQDIAN 160
             +++ +  LE   R  E  +    +        AE I  N ++V+    +LAI +D   
Sbjct: 299 SVKKEKVDLLERRLRQQEEAVEKYGKESEKHTSIAETIYANYQAVEDVLKVLAIARD--K 356

Query: 161 QLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT--LLLRVHKERSNIKDSWVLHHDN 218
             SW+ I++ +K A + + P A  I    L+I+  T  ++L +   ++NI          
Sbjct: 357 GYSWDQIKSTIKAA-KDSVPAAKSI----LSIDSATGIVVLDLMGMKTNI---------- 401

Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
                             D+  +   NA+ +++  +  AKKQ+  I+S E+   + +KK 
Sbjct: 402 ------------------DVTKTVPQNAQVYYERSKKLAKKQEGAIRSIEQTKLAMQKKE 443

Query: 279 KQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDA 320
           K   +   T+  I K     W+++F WF+SS+ +L     DA
Sbjct: 444 KTATRKRGTV-RIKKQ----WYDRFRWFVSSDGFLVIGGRDA 480



 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 31/45 (68%), Gaps = 2/45 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +WV   QVSKT  +GE++  GSF+IRG++N+    P  +A+GI
Sbjct: 544 DCYWVNPTQVSKTPESGEYVKKGSFIIRGERNYLKDVPVGVAVGI 588


>gi|399576519|ref|ZP_10770274.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
 gi|399237963|gb|EJN58892.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
          Length = 706

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 58/294 (19%), Positives = 114/294 (38%), Gaps = 62/294 (21%)

Query: 52  EFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFFS----TAESHKIDLKAV 102
           EF P + +        TP  + +++G     F+SF AA+D++F     + E+ K    A 
Sbjct: 241 EFDPRVYEKDGNVVDVTPFPLEEYEGLDSAAFDSFNAALDDYFFRLDLSDEAEKGGGGAE 300

Query: 103 QQERDALKKLENVKR---DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
               D  +++E  KR     E  +   E+    ++EKAEL+  N E  D  +  +R    
Sbjct: 301 ANRPDFQEEIEKQKRIIQQQEGAIEGFEEQAQEEREKAELLYANYELADEVLSTVRGARE 360

Query: 160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
             + W DI   + +      P A  ++ +  +   +T+ +   +                
Sbjct: 361 ENIPWADIADTLAEGAEQGIPAAEAVEDVDGSTGTVTITIDGQR---------------- 404

Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEK 276
                         +D+D+ +    NA R + + KR   KK    + I++T + L++ EK
Sbjct: 405 --------------IDLDVSMGVEKNADRIYTEAKRVEEKKAGALEAIENTREKLEAVEK 450

Query: 277 KTKQTLKDVQ-----------------TMTNINKARKVYWFEKFYWFISSENYL 313
           +  +                       +  +I    +  W+++F WF +S+ +L
Sbjct: 451 RRDEWEASDDEPDEDEDDEEKPDIDWLSRNSIPIRNQDKWYDRFRWFETSDGFL 504



 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V ADQVSKT  +GE++  GSF++RG + ++     ++A+G+
Sbjct: 576 EGRFADDAYMVSADQVSKTPESGEYVEKGSFVVRGDRTYYEDVAAEVAVGL 626


>gi|448659123|ref|ZP_21683091.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
 gi|445760625|gb|EMA11882.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
          Length = 717

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 60/263 (22%), Positives = 116/263 (44%), Gaps = 31/263 (11%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---------STAESHKIDLKAVQQERDALKKLE 113
           TP+ + +++    + F  F  A+D++F            E+ + D +A       ++K E
Sbjct: 275 TPTPLAEYEELYSESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEA------EIEKQE 328

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
            + +  E  +   E    V++EKAEL+  N + VD  +  ++    + +SW+DIEA+  +
Sbjct: 329 RIIQQQEQAIEDFEADAEVEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDE 388

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKK 230
                  +A+    + L+ +  T+ L +   R  + D++     NAD     A   E KK
Sbjct: 389 GADRG--IAAAEAVVSLDGSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKK 445

Query: 231 PSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
                    L+A  N +   DL+    ++++      E    + E + + T  D  +M +
Sbjct: 446 EGA------LAAIENTRE--DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQS 495

Query: 291 INKARKVYWFEKFYWFISSENYL 313
           I       W+E+F WF +S+ +L
Sbjct: 496 IPTRSTERWYEQFRWFHTSDGFL 518



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F +RG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641


>gi|330835774|ref|YP_004410502.1| hypothetical protein Mcup_1916 [Metallosphaera cuprina Ar-4]
 gi|329567913|gb|AEB96018.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
          Length = 508

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 60/255 (23%), Positives = 103/255 (40%), Gaps = 77/255 (30%)

Query: 185 IKQLKLNINHITLLLRVHKERSN---IKDSWVLHHDNADPFADSSETKKPS-------LV 234
           IK+L++ I+ +  +L   K+ ++   +K   ++ +    P+ ++  +  PS        +
Sbjct: 181 IKKLEVTISKLYEILNQTKQEADLNRVKGRLIMENY---PYVETMMSNGPSEIEINGVKI 237

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
           ++D   S   NA  +FD     AK+ ++ I+ TE+ +   E+K +  L   +     +K 
Sbjct: 238 EIDPSKSIAKNAALYFD----KAKELEEKIKKTEETIVELERKKQDLLSKTKEEIESSKV 293

Query: 295 --RKVYWFEKF--------YWFISSEN----------YLGN------------------- 315
             RK  WFEK+        Y  I+  +          +LG+                   
Sbjct: 294 LIRKREWFEKYHWTITKNGYIVIAGRDIDQNESLVKKFLGDDDIFLHADIQGAPATVIKS 353

Query: 316 -----------VAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGK 354
                       A  A   + AW          WV   QVSK+ P+GE+L  GSFMI GK
Sbjct: 354 PNSISDEDLLDAATLAASYSKAWKLGLGSIDVFWVYGKQVSKSPPSGEYLPKGSFMIYGK 413

Query: 355 KNFFPPCQLAMGISF 369
           KNF    +L + +  
Sbjct: 414 KNFIKNVKLELTVGI 428


>gi|389848295|ref|YP_006350534.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
 gi|448618500|ref|ZP_21666737.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
 gi|388245601|gb|AFK20547.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
 gi|445746871|gb|ELZ98329.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
          Length = 701

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 59/280 (21%), Positives = 115/280 (41%), Gaps = 64/280 (22%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +++HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLQEHQNAGLDEEAYDTFNEALDEYF-----FRLDLTADEQEATSNRPDFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    +  +   ++    ++E+AEL+  N + VD  +  +R+     + W+DI   + 
Sbjct: 313 ERIIEQQKQAIEGFDEQANEERERAELLYANYDLVDDVLSTVREAREQGVPWDDIAVTLD 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPAAEAVTNVDGANGTVTIKL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +D+ +    NA R + + KR   KKQ     I+ T + L++A+++  +   D Q   
Sbjct: 403 TVTLDVSMGVEKNADRLYTEAKRIQEKKQGALAAIEDTREELEAAKRRRDEWEADDQEDE 462

Query: 287 -------------TMTNINKARKVYWFEKFYWFISSENYL 313
                        ++ ++      +WFE+F WF +S  YL
Sbjct: 463 SDEDEEPEETDWLSLDSVPVKSTEHWFERFRWFHTSSGYL 502



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF+IRG + +F   P ++A+GI
Sbjct: 574 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRRYFEDVPAKVAVGI 624


>gi|126466189|ref|YP_001041298.1| hypothetical protein Smar_1299 [Staphylothermus marinus F1]
 gi|126015012|gb|ABN70390.1| protein of unknown function DUF814 [Staphylothermus marinus F1]
          Length = 663

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 2/53 (3%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
           AW+      + +WV A QVSKT P+GE+L+ G+FMI GKKN+     +LA+GI
Sbjct: 521 AWNEGFGAIDVFWVHASQVSKTPPSGEYLSKGAFMIYGKKNYVKTKLELAIGI 573



 Score = 38.1 bits (87), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 8/90 (8%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM---TN 290
           + +D+ L  + N  R+  L     KK ++     E+AL+  EK  ++ +   Q +   T 
Sbjct: 382 IPIDIRLDTWNNILRYKKLSGELLKKAKRA----EEALRELEKSLEEAVNKKQLIEKKTE 437

Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDA 320
           I    ++ W+E+F+W I+SE +L     DA
Sbjct: 438 IGIKPRL-WYERFHWMITSEGFLVIAGRDA 466


>gi|408405775|ref|YP_006863758.1| hypothetical protein Ngar_c31850 [Candidatus Nitrososphaera
           gargensis Ga9.2]
 gi|408366371|gb|AFU60101.1| hypothetical protein with domain of unknown function DUF814 and
           fibronectin-binding A protein [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 661

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 1/55 (1%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW D     +A+WV  +QV K APTG+FL  GSF+I GK+N+    ++ + I  +
Sbjct: 516 AWKDGLSSADAYWVMPEQVKKGAPTGQFLPKGSFVIEGKRNYLKGVEIRLAIGIV 570


>gi|448688255|ref|ZP_21694088.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
 gi|445779316|gb|EMA30246.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
          Length = 717

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 56/263 (21%), Positives = 115/263 (43%), Gaps = 31/263 (11%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---------STAESHKIDLKAVQQERDALKKLE 113
           TP  + +++    + F  F +A+D++F            E+ + D +A       ++K +
Sbjct: 275 TPIPLSEYEELYSESFTEFNSALDDYFFNFQREEEVEGGETQRPDFEA------EIEKQK 328

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
            + +  E  +   E    V++EKAEL+  N + VD  +  ++    + +SW+DIEA+  +
Sbjct: 329 RIIQQQEQAIEDFEADAEVEREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFDE 388

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKK 230
                  +A+    + L+ +  T+ L +   R  + D++     NAD     A   E KK
Sbjct: 389 GADRG--IAAAEAVVSLDGSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKK 445

Query: 231 PSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN 290
                    L+A  N +   DL+    ++ +      +      + + + T  D  +M +
Sbjct: 446 EGA------LAAIENTRE--DLEEVKERRDEWEADDGDDETDEDQSEDEPT--DWLSMQS 495

Query: 291 INKARKVYWFEKFYWFISSENYL 313
           +      +W+E+F WF +S+ +L
Sbjct: 496 VPTRSTEHWYEQFRWFHTSDGFL 518



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F IRG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVAVGIT 641


>gi|161527567|ref|YP_001581393.1| hypothetical protein Nmar_0055 [Nitrosopumilus maritimus SCM1]
 gi|160338868|gb|ABX11955.1| protein of unknown function DUF814 [Nitrosopumilus maritimus SCM1]
          Length = 652

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW +     +A+WV  DQV K+AP+GEFL  GSF I G++NF     L + +  +
Sbjct: 507 AWREGMYGVSAYWVNPDQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGII 561


>gi|73669087|ref|YP_305102.1| hypothetical protein Mbar_A1575 [Methanosarcina barkeri str.
           Fusaro]
 gi|72396249|gb|AAZ70522.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 797

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 35/55 (63%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           + +WVKA+QVSKT  +GE++  G+F+IRG++N+F    L + +    K E   I 
Sbjct: 599 DCYWVKAEQVSKTPESGEYVKKGAFIIRGERNYFKDIPLGVAVGLELKGETRVIG 653



 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 62/287 (21%), Positives = 119/287 (41%), Gaps = 37/287 (12%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERD-ALKKLENVKRDHETR 122
           P  + ++ GF  + F+SF  A+DEFF      +I+     ++++  L   E      E  
Sbjct: 311 PFDLTRYSGFEKEYFDSFNTALDEFFGKKALEQIEEVKAAKKKEKTLGVYERRLLQQEGS 370

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   E+    +   AE +  N + ++  +  +    +   SW++I + +KQA++   P A
Sbjct: 371 LKKFEKEIEKNNTLAETVYANYQGIEELLSVLNGARSTGYSWDEIRSILKQAKK-TVPAA 429

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
             I     NI+  T  + V+                   F   S       + +D+  + 
Sbjct: 430 QKI----TNIDPRTGTVTVN-------------------FDGKS-------ISLDIRKTV 459

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEK 302
             NA+ +++  +   KK+   +++ E   K+ EKK    +   +    +  +RK +W+++
Sbjct: 460 PQNAQEYYEKVKKFNKKKDGALKAIEDTRKAMEKKAVAKVA--KAGRKLRASRKKHWYDR 517

Query: 303 FYWFISSENYLGNVAWDAKV---VTNAWWVKADQVSKTAPTGEFLTT 346
           F WF+SS+ +      DA     +   +  K D V  T   G  LT 
Sbjct: 518 FRWFVSSDGFFIVGGRDADTNEEIFKKYLEKRDLVFHTQTPGAPLTV 564


>gi|327401161|ref|YP_004342000.1| fibronectin-binding A domain-containing protein [Archaeoglobus
           veneficus SNP6]
 gi|327316669|gb|AEA47285.1| Fibronectin-binding A domain protein [Archaeoglobus veneficus SNP6]
          Length = 637

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 87/361 (24%), Positives = 150/361 (41%), Gaps = 117/361 (32%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P  ++ + G   + F +F  A+DE+++   S     +  Q+E + LKKL+        RL
Sbjct: 241 PIELQIYDGLERKYFPTFNEALDEYYARRIS-----EVKQEESEELKKLK-------ARL 288

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA-RVKQAQRHNDPVA 182
               + QL  K++ E   N  E   AA  A+ ++   QL  + +EA R  + Q+  D + 
Sbjct: 289 ----EKQLETKKEFE---NEMERYRAAGDAVYENY--QLLEQILEAFRQARQQKSWDEIK 339

Query: 183 SIIK-QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
            I++   KL+     L++ +H E    K+S V+   N  P                ++L+
Sbjct: 340 KIVRAHPKLS----KLVVEIHPE----KNSVVV---NIGP---------------KIELA 373

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM------TNINKAR 295
              N  +  D+    AKK ++ ++   KA++    KTK+ ++ V+ +        +  AR
Sbjct: 374 LDKNLPQIADVYYERAKKVRQKLEGLLKAIE----KTKEEMQRVEELEAKKYVKGLRVAR 429

Query: 296 KVYWFEKFYWFISSENYL----GNVAWDAKVVT--------------------------- 324
           K  WFE+F WFI+S+ +L     N A + ++V+                           
Sbjct: 430 KREWFERFRWFITSDGFLVIGGRNAAMNEEIVSKYMEPKDLFFHTQTPGAPATVLKLGQE 489

Query: 325 ----------------NAWW-----------VKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
                           +A W           VK +QV + A  GE+L  GSF I GK+N+
Sbjct: 490 APETSIIEAAQFAATYSALWKEGKYSGEVYYVKPEQVKRAAKHGEYLARGSFYIEGKRNY 549

Query: 358 F 358
            
Sbjct: 550 L 550


>gi|20089538|ref|NP_615613.1| hypothetical protein MA0651 [Methanosarcina acetivorans C2A]
 gi|19914450|gb|AAM04093.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 788

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 35/55 (63%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           + +W+KA+QV+KT  +GE+L  G+F+IRG++N+F    L + +    K E   I 
Sbjct: 575 DCYWIKAEQVTKTPESGEYLKKGAFVIRGERNYFKDIPLGVAVGLELKGETRVIG 629



 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 63/288 (21%), Positives = 121/288 (42%), Gaps = 37/288 (12%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKI-DLKAVQQERDALKKLENVKRDHET 121
            P  + ++  F  + F+SF  A+DEFF      ++ ++KA +++   L   E      E 
Sbjct: 286 VPFDLIRYSEFEKEYFDSFNTALDEFFGKKALEQVAEVKAAEKKEKTLGVYERRLLQQEE 345

Query: 122 RLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181
            L+   +    +   AE++  N + ++     +    A   SW++I + +KQA++   P 
Sbjct: 346 SLAKFGKEIEKNNTLAEIVYANYQLIEELFSVLNGARAKGYSWDEIRSILKQAKK-TVPA 404

Query: 182 ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLS 241
           A  I     NI+  T  + V  +  N                          V++D+  +
Sbjct: 405 AQKI----TNIDQKTGTVTVDLDGRN--------------------------VNLDIRKT 434

Query: 242 AYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFE 301
              NA+ +++  +  +KK+   +++ E+  K+ EKK        +    +   RK +W++
Sbjct: 435 VPQNAQEYYEKVKKFSKKRDGALKAIEETKKAMEKKAASKAA--KAGRKLQAFRKKHWYD 492

Query: 302 KFYWFISSENYLGNVAWDAKV---VTNAWWVKADQVSKTAPTGEFLTT 346
           +F WF+SS+ +L     DA     +   +  K D V  T   G  LT 
Sbjct: 493 RFRWFVSSDGFLVVGGRDADTNEEIFKKYLEKRDIVFHTQTPGAPLTV 540


>gi|297527127|ref|YP_003669151.1| hypothetical protein Shell_1151 [Staphylothermus hellenicus DSM
           12710]
 gi|297256043|gb|ADI32252.1| protein of unknown function DUF814 [Staphylothermus hellenicus DSM
           12710]
          Length = 663

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/44 (54%), Positives = 33/44 (75%), Gaps = 1/44 (2%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
           + +WV A+QVSKT P+GE+L  G+FMI GKKN+     +LA+GI
Sbjct: 530 DVFWVYANQVSKTPPSGEYLPKGAFMIYGKKNYVKTKLELAIGI 573



 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 45/83 (54%), Gaps = 8/83 (9%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD---VQTMTN 290
           + +D+ L+A+ N  R+  L     KK +K     E+AL+  E+  ++ +     +Q  T 
Sbjct: 382 IPIDIRLNAWNNILRYKKLSGELLKKAKKA----EEALRELERSLEEAVNKKQLIQRRTE 437

Query: 291 INKARKVYWFEKFYWFISSENYL 313
           I    ++ W+E+F+W I+SE +L
Sbjct: 438 IGIKPRL-WYERFHWMITSEGFL 459


>gi|261403479|ref|YP_003247703.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
           vulcanius M7]
 gi|261370472|gb|ACX73221.1| Fibronectin-binding A domain protein [Methanocaldococcus vulcanius
           M7]
          Length = 670

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 34/46 (73%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + +WVK +Q+SKTA +GE+L  G+F+IRG+++++    L +GI  L
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVL 603



 Score = 45.1 bits (105), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 132/283 (46%), Gaps = 37/283 (13%)

Query: 56  MILQPGS----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK 111
           +IL+ G      P  +++++    + +ESF  AVD++F+   ++    K   +    +++
Sbjct: 249 IILKDGEHYDVVPVNLKKYEDLEKKYYESFLDAVDDYFAKFLTNVEVKKKKSKIEKEIER 308

Query: 112 LENV-KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170
            EN+ KR  ET L   ++    ++ K +LI  N + V+  + AI+Q    ++ W  I+  
Sbjct: 309 QENILKRQLET-LERYKKDAEKNQIKGDLIYANYQIVENLLSAIKQ-AREKMDWARIKKI 366

Query: 171 VKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKK 230
           VK+ + H  P+  +++ ++ NI  I + L+                      AD  +   
Sbjct: 367 VKENKDH--PILDLVEDIRENIGEIIVRLK----------------------ADVGDKTI 402

Query: 231 PSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAEKKTKQTLKDVQT 287
              + +D+  +A  NA+RF++  +    K +     I+ T+K ++  +KK ++TL +   
Sbjct: 403 EERIPLDIRKNASENAERFYEKAKKLKHKVEGIKTAIELTKKKIEELKKKEEKTLGEEIP 462

Query: 288 MTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
                K ++  W+EKF W + +  +L     DA  +TN   +K
Sbjct: 463 EMKKKKRKERKWYEKFKWTVIN-GFLVIAGKDA--ITNEILIK 502


>gi|407461558|ref|YP_006772875.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407045180|gb|AFS79933.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 651

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/176 (25%), Positives = 66/176 (37%), Gaps = 63/176 (35%)

Query: 257 AKKQQKTIQSTE----KALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENY 312
           AKKQ   I S E    K LK  EK   +T  +  ++  +++ RK  W+E++ WF +S+ +
Sbjct: 387 AKKQSGAISSIEEIKSKTLKKLEKLQNKTDSEKDSVL-VSEIRKKNWYERYRWFYTSDGF 445

Query: 313 L------------------------------------------------GNVAWDAKVVT 324
           L                                                  VA      +
Sbjct: 446 LVIGGRDAASNSAVVRKHLAKNDKIFHGDIFGSPFFIIKDAQNAPDTSMNEVAHATVCFS 505

Query: 325 NAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
            AW          WV  +QV K+AP+GEFL  GSF I G++NF     L + +  +
Sbjct: 506 RAWREGMYGVSAYWVNPEQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGII 561


>gi|268323401|emb|CBH36989.1| conserved hypothetical protein containing fibronectin-binding
           protein A N-terminal domain, DUF814 family [uncultured
           archaeon]
 gi|268324037|emb|CBH37625.1| conserved hypothetical protein containing fibronectin-binding
           protein A N-terminal (FbpA) domain and DUF814 domain
           [uncultured archaeon]
          Length = 631

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/197 (21%), Positives = 80/197 (40%), Gaps = 63/197 (31%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKT---IQSTEKALKSAEKKTKQTLKDVQTMTN 290
           +++D  +S + NA  ++D  +   KK++     I+ T++ +++ ++K  +  +++     
Sbjct: 360 LEIDTSVSLHKNAGAYYDKAKVFRKKREGVEPAIEMTKEKIRTEKEKEVRIEEELIPTKK 419

Query: 291 INKARKVYWFEKFYWFISSENY-------------------------------------- 312
             +  K  W+EKF WF +S+ +                                      
Sbjct: 420 EVRTEKEEWYEKFRWFETSDGFLVVGGKDATTNEILAKKHMEPNDLFFHTQAEGAPVVIA 479

Query: 313 -----------LGNVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                      L  +A  A   +N W           V  +QVSKT P+GE++  GSFM+
Sbjct: 480 KAGGKEISESGLKEIAQFAASYSNLWKYGFYEGECYCVVGEQVSKTPPSGEYIKKGSFMV 539

Query: 352 RGKKNFF-PPCQLAMGI 367
           RGK+ +F     L +GI
Sbjct: 540 RGKRKYFKAELGLCIGI 556


>gi|448666601|ref|ZP_21685246.1| fibronectin-binding A domain-containing protein [Haloarcula
           amylolytica JCM 13557]
 gi|445771732|gb|EMA22788.1| fibronectin-binding A domain-containing protein [Haloarcula
           amylolytica JCM 13557]
          Length = 717

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/248 (23%), Positives = 113/248 (45%), Gaps = 31/248 (12%)

Query: 78  FESFAAAVDEFF---------STAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQ 128
           F  F  A+D++F            E+ + D +A       ++K + + +  E  +   E 
Sbjct: 290 FTEFNPALDDYFFNFQREEEVEGGETQRPDFEA------EIEKQKRIIQQQEQAIEDFEA 343

Query: 129 TQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQL 188
              V++EKAEL+  N + VD  +  ++   A+ +SW+DIEA+  +       +A+    +
Sbjct: 344 DAEVEREKAELLYANYDLVDDVLSTVQAARADDVSWDDIEAKFNEGADRG--IAAAEAVV 401

Query: 189 KLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYAN 245
            L+ +  T+ L +   R  + D++     NAD     A   E KK         L+A  N
Sbjct: 402 SLDGSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIEN 454

Query: 246 AKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYW 305
            +   DL+  A K+++   ++ +   ++ E + +    D  +M +I      +W+E+F W
Sbjct: 455 TRE--DLE--AVKERRDEWEADDGDDEADEDEGEDEPTDWLSMQSIPTRSTEHWYEQFRW 510

Query: 306 FISSENYL 313
           F +S+ +L
Sbjct: 511 FHTSDGFL 518



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F IRG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPAGIAVGIT 641


>gi|118577090|ref|YP_876833.1| RNA-binding protein [Cenarchaeum symbiosum A]
 gi|118195611|gb|ABK78529.1| RNA-binding protein [Cenarchaeum symbiosum A]
          Length = 631

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 37/54 (68%), Gaps = 3/54 (5%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF--FPPCQLAMGI 367
           AW +A    +A+WV  +QV K AP+G+FL  GSF+I G++NF   P  +LA+G+
Sbjct: 496 AWREAMYGLSAYWVLPEQVKKAAPSGQFLPKGSFVIEGRRNFVKIPTLRLAVGL 549


>gi|354610742|ref|ZP_09028698.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
 gi|353195562|gb|EHB61064.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
          Length = 745

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 64/271 (23%), Positives = 117/271 (43%), Gaps = 48/271 (17%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER-DALKKLENVKRDHET 121
           TP  +R+ +  P   F+ F  A+D++F+  ++ + +       R D   ++E  KR  E 
Sbjct: 305 TPVPLREREELPADAFDRFNDALDDYFTNLDTTEEEESGEAVSRPDFEAEIEKQKRIIEQ 364

Query: 122 RLSAL---EQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
           +  A+   EQ    ++EKAEL+  N + VD  I  +         W+DI  R ++A   +
Sbjct: 365 QQQAIDDFEQQAEAEREKAELLYGNYDLVDELIGVVADARGAGHGWQDIAERFEEAA-GD 423

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
            P A +   + +N +  T+ +R+             H    DP  +S   K    +    
Sbjct: 424 VPGADVF--VGVNESEGTVRVRIDD-----------HTIELDP--ESGVEKNADRI---- 464

Query: 239 DLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ--------------TLKD 284
               Y  AKR  + +  A    +  I++T   L+SA+++ ++               L D
Sbjct: 465 ----YTEAKRIEEKQEGA----RAAIENTRGDLESAKQRREEWEAEPDEQESEADDELAD 516

Query: 285 VQ--TMTNINKARKVYWFEKFYWFISSENYL 313
           V   + ++I    +  W+E+F WF +SE +L
Sbjct: 517 VDWLSRSSIPIRNQEQWYERFRWFRTSEGFL 547



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGIS 368
           D +   +A+ V  DQVSKT  +GE+L  G F IRG + +F   P + A+GI+
Sbjct: 619 DGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAIRGDRTYFRDLPAEWAVGIA 670


>gi|218883339|ref|YP_002427721.1| hypothetical protein DKAM_0025 [Desulfurococcus kamchatkensis
           1221n]
 gi|218764955|gb|ACL10354.1| protein of unknown function DUF814 [Desulfurococcus kamchatkensis
           1221n]
          Length = 659

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 69/184 (37%), Gaps = 68/184 (36%)

Query: 251 DLKRSAA---KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----INKARKVYWFEK 302
           DL R      +K  + ++ TE A K  E+   Q L  +    N     I K    +W+E+
Sbjct: 392 DLSRQVIELERKHGELVRKTESAKKVLEE-MHQQLNTISISMNTEEKTIRKPSPTFWYER 450

Query: 303 FYWFIS------------SEN------YLG------------------------------ 314
           F+W  +            S+N      YLG                              
Sbjct: 451 FHWLFTRNGFLAIGGRDQSQNELVVRKYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV 510

Query: 315 NVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQL 363
           + ++ A   + AW          WV   QVSKT P GE+L  G+FM+ G KN+   P +L
Sbjct: 511 DASYLAACYSKAWKAGFSYIEVYWVPGRQVSKTPPPGEYLPRGAFMVYGSKNYLQVPLRL 570

Query: 364 AMGI 367
            +GI
Sbjct: 571 GIGI 574


>gi|256811227|ref|YP_003128596.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
           fervens AG86]
 gi|256794427|gb|ACV25096.1| Fibronectin-binding A domain protein [Methanocaldococcus fervens
           AG86]
          Length = 671

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 34/46 (73%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + +WVK +Q+SKTA +GE+L  G+F+IRG+++++    L +GI  +
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVI 603



 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 44/209 (21%), Positives = 95/209 (45%), Gaps = 29/209 (13%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++++ G   + + SF  AVD++F+   ++ +  K   +    ++K EN+ +     
Sbjct: 259 VPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLTNIVVKKEKSKIEREIEKQENILKRQMDT 318

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   ++    ++ K +LI  N + V+  + AIRQ    ++ W  I+  V++ + H  P+ 
Sbjct: 319 LKKYKEDAEKNQIKGDLIYANYQIVEELLSAIRQ-AREKMDWARIKKIVRENKEH--PIL 375

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
            +I+ +  N+  I + L     +S + D  +                    V +D+  +A
Sbjct: 376 GLIENINENVGEIVIRL-----KSEVDDKVIEER-----------------VSLDIRKNA 413

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKAL 271
           + NA+ +++     AKK +  I+  E A+
Sbjct: 414 FENAENYYE----KAKKLKNKIEGIENAI 438


>gi|170291097|ref|YP_001737913.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
           cryptofilum OPF8]
 gi|170175177|gb|ACB08230.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
           cryptofilum OPF8]
          Length = 624

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 114/284 (40%), Gaps = 80/284 (28%)

Query: 148 DAAILAIRQDIANQLSW-EDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERS 206
           D  I ++R++I  Q+S  E+ E    Q +R  D + S       NI+ +   L   + RS
Sbjct: 277 DERISSLRREIERQISLKEEYERTYAQLRRIGDTILS-------NIHEVEEAL--GRARS 327

Query: 207 NIKDSWVLHHD-NADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQ 265
             + + V   D  +     S E ++   + +D+  SA  NA  ++D     AKK ++   
Sbjct: 328 GEEHALVKRVDWKSGKVIISLEGEE---IQLDIRRSASENASEYYD----KAKKAREKAL 380

Query: 266 STEKALKSAEKKTKQ---TLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKV 322
             +KAL +  ++ KQ   +L++ +   +    ++  W+EKF WF +S   L     DA+ 
Sbjct: 381 RIDKALSNIMERLKQIESSLEERKLELSPKPRKRERWYEKFRWFYTSSGNLVICGRDAQT 440

Query: 323 ------------------------------------------------VTNAW------- 327
                                                            + AW       
Sbjct: 441 NSEIVSKYMDDKDLFFHVDMPGGAVVVLKVEREVDQRSIEQAAVAAASFSRAWKEGLSYA 500

Query: 328 ---WVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
              +VK +QVSK AP G +L  GSF I GK+N+     +L++GI
Sbjct: 501 DVYYVKGEQVSKHAPPGMYLPKGSFYITGKRNYLRAKLELSVGI 544


>gi|126178886|ref|YP_001046851.1| hypothetical protein Memar_0936 [Methanoculleus marisnigri JR1]
 gi|125861680|gb|ABN56869.1| protein of unknown function DUF814 [Methanoculleus marisnigri JR1]
          Length = 632

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/381 (20%), Positives = 145/381 (38%), Gaps = 129/381 (33%)

Query: 55  PMILQPGSTPSVMRQHQGFPIQE-FESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
           P+I   G  P V+    G  ++E F +F+ A+D F+      K +  A +     L + E
Sbjct: 234 PVITASGCWPVVL---AGDEVRERFATFSEALDAFYPKTVGGKEEAAAGKPR---LSQAE 287

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
            ++R     +   E+    ++   E+I  N  +V A I+A   + +   SW++IE  +K+
Sbjct: 288 VIRRRQAEAIKGFEKKIERNQRIVEVIYENYTAV-AGIIATLDEASKNRSWQEIEKILKE 346

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
               ++P A +++ +                                          P+ 
Sbjct: 347 --NGDNPAAKMVRAIH-----------------------------------------PAD 363

Query: 234 VDVDLDLSA-----YA------NAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL 282
             VD+DLS      Y       N  R++D  +   KK+   + + E+ +   + +TK+ L
Sbjct: 364 AAVDVDLSGERVKIYVHETIEQNLGRYYDQIKKFKKKKTGALAAMERTVPE-KPRTKRNL 422

Query: 283 KDVQTMTNINKARKVYWFEKFYWFISSENYL----------------------------- 313
                   + K R   W+ +F WF +S+  L                             
Sbjct: 423 P-------LQKKR---WYHRFRWFTTSDGTLVIGGRDASQNEELVKKYMEGGDLFVHADV 472

Query: 314 --GNV----------------------AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGS 348
             G+V                      AW A   T + +  + DQVSKTA +GE++  G+
Sbjct: 473 HGGSVVIVKGTTEHMDEAVRFAASYSNAWKAGHFTADVYAARPDQVSKTAESGEYVARGA 532

Query: 349 FMIRGKKNFF--PPCQLAMGI 367
           F++RG++ +F   P  +A+G+
Sbjct: 533 FIVRGERQYFRNAPLGVAIGL 553


>gi|407465827|ref|YP_006776709.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
 gi|407049015|gb|AFS83767.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
          Length = 648

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 35/54 (64%), Gaps = 3/54 (5%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           AW +     +A+WV  DQ+ K+AP+GEFL  GSF I G++NF      +LA+GI
Sbjct: 504 AWREGMYGVSAYWVYPDQIKKSAPSGEFLPKGSFTIEGQRNFIKSDTLRLAVGI 557


>gi|452210388|ref|YP_007490502.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
 gi|452100290|gb|AGF97230.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
          Length = 775

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 35/55 (63%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           + +W+K++QV+KT  +GE+L  G+F+IRG++N+F    L + +    K E   I 
Sbjct: 573 DCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKDVPLGIAVGLELKGETRIIG 627



 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 62/273 (22%), Positives = 113/273 (41%), Gaps = 37/273 (13%)

Query: 78  FESFAAAVDEFFST-AESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEK 136
           F+SF  A+DEFF   A     ++K  +++   L   E      E  L+  E+    +   
Sbjct: 299 FDSFNTALDEFFGKKALEQVAEVKEAEKKEKTLGVFERRLMQQEESLAKFEKEIEKNNAL 358

Query: 137 AELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHIT 196
           AE +  N + ++     +    A   SW++I + +KQA++   P A  I     NI+  T
Sbjct: 359 AETVYANYQIIEELFSVLNGARAKGYSWDEIRSILKQAKK-TVPAAQTI----TNIDQKT 413

Query: 197 LLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSA 256
             + V+ +  +I                          ++D+  +   NA+ +++  +  
Sbjct: 414 GTVTVNLDGKSI--------------------------NLDIRKTVPQNAQEYYEKVKKF 447

Query: 257 AKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNV 316
            KK+   I++ E   K+ EKK        +    +  +RK +W+++F WF+SS+ +L   
Sbjct: 448 TKKKDGAIRAIEDTKKAMEKKAATKSA--KAGRKLQASRKKHWYDRFRWFVSSDGFLVVG 505

Query: 317 AWDAKV---VTNAWWVKADQVSKTAPTGEFLTT 346
             DA     +   +  K D V  T   G  LT 
Sbjct: 506 GRDADTNEEIFKKYMEKRDIVFHTQTPGAPLTV 538


>gi|387219995|gb|AFJ69706.1| hypothetical protein NGATSA_2054800, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 94

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 24/54 (44%), Positives = 36/54 (66%)

Query: 268 EKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLGNVAWDAK 321
           EKA+K+AE++   +L   Q    ++  RK YWFEKF+WFI+S+N+L     DA+
Sbjct: 2   EKAVKAAERQAAASLSKQQRKRTLSVVRKPYWFEKFHWFITSDNHLVVSGRDAQ 55


>gi|289192132|ref|YP_003458073.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
           FS406-22]
 gi|288938582|gb|ADC69337.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
           FS406-22]
          Length = 671

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 34/46 (73%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + +WVK +Q+SKTA +GE+L  G+F+IRG+++++    L +GI  +
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVI 603



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 54/255 (21%), Positives = 110/255 (43%), Gaps = 45/255 (17%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++++ G   + + SF  AVD++F+         K   +    +++ EN+ +     
Sbjct: 259 VPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLVKVEVKKEKSKFEREIERQENILKRQLGT 318

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND--P 180
           L   ++    ++ K +LI  N + V+  + AIRQ    ++ W    AR+K+  R N   P
Sbjct: 319 LKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQ-AREKMDW----ARIKKIIRENKEHP 373

Query: 181 VASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDL 240
           +  +I+ +  N+  I + L+     S + D+ +                    V +D+  
Sbjct: 374 ILGLIENINENVGEIVVRLK-----SEVDDNVIEER-----------------VSLDIRK 411

Query: 241 SAYANAKRFFDLKRSAAKKQQKTIQSTEKALK----------SAEKKTKQTLKDVQTMTN 290
           +A+ NA+ +++     AKK +  I+  E A++             ++  +  + +Q    
Sbjct: 412 NAFENAESYYE----KAKKLRNKIEGIENAIELTKKKIDELKKKGEEELKEKESIQMKKK 467

Query: 291 INKARKVYWFEKFYW 305
           + K RK  W+EKF W
Sbjct: 468 VRKERK--WYEKFKW 480


>gi|390937875|ref|YP_006401613.1| putative RNA-binding protein [Desulfurococcus fermentans DSM 16532]
 gi|390190982|gb|AFL66038.1| putative RNA-binding protein, snRNP like protein [Desulfurococcus
           fermentans DSM 16532]
          Length = 659

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 70/184 (38%), Gaps = 68/184 (36%)

Query: 251 DLKRSAA---KKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTN-----INKARKVYWFEK 302
           DL R      +K+ + ++ TE A K  E+   Q L  +    N     I K    +W+E+
Sbjct: 392 DLSRQVIELERKRGELVRKTESAKKVLEE-MHQQLNTISISMNTEEKTIRKPSPTFWYER 450

Query: 303 FYWFIS------------SEN------YLG------------------------------ 314
           F+W  +            S+N      YLG                              
Sbjct: 451 FHWLFTRNGFLAIGGRDQSQNELVVRKYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV 510

Query: 315 NVAWDAKVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQL 363
           + ++ A   + AW          WV   QVSKT P GE+L  G+FM+ G KN+   P +L
Sbjct: 511 DASYLAACYSKAWKAGFSYIEVYWVSGRQVSKTPPPGEYLPRGAFMVYGSKNYLQVPLRL 570

Query: 364 AMGI 367
            +G+
Sbjct: 571 GIGV 574


>gi|91773364|ref|YP_566056.1| hypothetical protein Mbur_1391 [Methanococcoides burtonii DSM 6242]
 gi|91712379|gb|ABE52306.1| FbpA, DUF814 containing protein [Methanococcoides burtonii DSM
           6242]
          Length = 663

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 34/47 (72%), Gaps = 2/47 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
           + +W+K +QVSKT  +GE+L  GSF+IRG++N++   P  +A+G+  
Sbjct: 543 DCYWIKPEQVSKTPESGEYLKKGSFIIRGERNYYKDVPVGVAIGLDL 589



 Score = 39.7 bits (91), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 55/278 (19%), Positives = 108/278 (38%), Gaps = 52/278 (18%)

Query: 48  VANMEFHPMILQP---------GSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKID 98
           + N E +P I++             P  + Q+     + + SF  A+DEFF    S ++ 
Sbjct: 235 IVNHELNPCIIKKTVKDELQAVDVLPLELTQYSDAEKEFYPSFNKALDEFFGKKASEEVI 294

Query: 99  LKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
            + V ++++     E   R  +  +   E         AE I  N ++V+  +  +    
Sbjct: 295 EQVVAKKKEKEDVFERRLRKQQEAILKFETDSTRYTLIAESIYGNYQTVEEVLSVLEAAR 354

Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDN 218
               SW+DI   +K+A +   P A  I  +      + + L V     N++ +       
Sbjct: 355 DKGYSWKDIWDTLKKA-KDTLPAAKAIVSIDPAEGSVVVDLDVVNANINVRKTI------ 407

Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQK---TIQSTEKALKSAE 275
                       P             NA+ +++  +  +KK+      I+ T++A++  E
Sbjct: 408 ------------PQ------------NAQMYYNKAKKISKKRDGALIAIEDTKRAMQKRE 443

Query: 276 KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
           +K  +  K V          K +W+++F WF SS+ +L
Sbjct: 444 QKVSKRRKAV---------FKKHWYDRFRWFFSSDGFL 472


>gi|167044451|gb|ABZ09127.1| putative domain of unknown function (DUF814) [uncultured marine
           crenarchaeote HF4000_APKG6D9]
          Length = 648

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 72/178 (40%), Gaps = 71/178 (39%)

Query: 257 AKKQQKTIQS-------TEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISS 309
           AKKQ + I S       TEK L+  + KT ++ +D+  +T I   RK  W+E++ WF ++
Sbjct: 384 AKKQSRAISSIEEIKSKTEKKLEKFQNKT-ESEQDIMLVTEI---RKKSWYERYRWFYTT 439

Query: 310 ENYLGNVAWDAK------------------------------------------------ 321
           + YL     DA                                                 
Sbjct: 440 DGYLAVGGRDAASNSAVVRKHLVKNDKIFHADIFGSPFFIIKDAEHAPATSMDEVAHATV 499

Query: 322 VVTNAW-----WVKA-----DQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGI 367
             + AW      VKA     +QV K+AP+GEFL  GSF I G++NF      +LA+GI
Sbjct: 500 CFSRAWREGLYGVKAYWIHPEQVKKSAPSGEFLPKGSFTIEGQRNFINSKNLKLAVGI 557


>gi|15669822|ref|NP_248636.1| hypothetical protein MJ_1625 [Methanocaldococcus jannaschii DSM
           2661]
 gi|42559938|sp|Q59020.1|Y1625_METJA RecName: Full=Uncharacterized protein MJ1625
 gi|1592339|gb|AAB99643.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
           2661]
          Length = 671

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 35/45 (77%), Gaps = 2/45 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           + +WVK +Q+SKTA +GE+L  G+F+IRG+++++   P +L +G+
Sbjct: 558 DTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGVGV 602



 Score = 38.9 bits (89), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 54/253 (21%), Positives = 106/253 (41%), Gaps = 41/253 (16%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  +++++G   + + SF  AVD++F+   +  +  K   +    +++ EN+ R     
Sbjct: 259 VPIDLKKYKGLEKKYYNSFLEAVDDYFAKFLTKVVVKKEKSKIEKEIERQENILRRQLET 318

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           L   ++    ++ K +LI  N + V+  + AIRQ    ++ W    AR+K+  R N    
Sbjct: 319 LKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQ-AREKMDW----ARIKKIIRENK--- 370

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
                      H  L L  +   +  +    L  +  D   +         V +D+  +A
Sbjct: 371 ----------EHPILGLIENINENIGEIIIRLKSEVDDKVIEER-------VSLDIRKNA 413

Query: 243 YANAKRFFDLKRSAAKKQQKTIQSTEKALK----------SAEKKTKQTLKDVQTMTNIN 292
           + NA+ +++     AKK +  I+  E A++             ++  +  + +Q    I 
Sbjct: 414 FENAESYYE----KAKKLRNKIEGIENAIELTKKKIEELKKKGEEELKEKESMQMKKKIR 469

Query: 293 KARKVYWFEKFYW 305
           K RK  W+EKF W
Sbjct: 470 KERK--WYEKFKW 480


>gi|448612034|ref|ZP_21662464.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
 gi|445742795|gb|ELZ94289.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
          Length = 701

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/280 (20%), Positives = 110/280 (39%), Gaps = 64/280 (22%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +++HQ  G   + +E+F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLQEHQNAGLDEEAYETFNEALDEYF-----FRLDLTADEQEATSNRPDFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    E  +   +Q    ++E+AEL+  N +  D  +  +R      + W++I   + 
Sbjct: 313 ERIIEQQEQAIEGFDQQADEERERAELLYANYDLADDVLSTVRDAREQGVPWDEIAVTLD 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                                +
Sbjct: 373 EGADQGIPAAEAVTNVDSANGTVTVEL------------------------------DGT 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +D+ +    NA R + + KR   KK+     I+ T + L++A+++  +   D     
Sbjct: 403 SVTLDVSMGVEKNADRLYTEAKRIQEKKEGALAAIEDTREELEAAKRRRDEWEADDGGGD 462

Query: 287 -------------TMTNINKARKVYWFEKFYWFISSENYL 313
                        ++ ++      +WFE+F WF +S  YL
Sbjct: 463 ADEDDEPEETDWLSLESVPVKSTEHWFERFRWFYTSSGYL 502



 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF+IRG + +F   P ++A+GI
Sbjct: 574 EGRFADDAYMVEPSQVSKTPESGEYIDKGSFVIRGDRRYFEDVPAKVAVGI 624


>gi|124027973|ref|YP_001013293.1| hypothetical protein Hbut_1105 [Hyperthermus butylicus DSM 5456]
 gi|123978667|gb|ABM80948.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
          Length = 672

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEE 375
           AW A + + + +WV   QVSK AP GE+L  G+FM+ GK+N+    ++ + I    + E 
Sbjct: 521 AWKAGIGSVDVYWVWGSQVSKAAPAGEYLAKGAFMVYGKRNYIRNVEVKIAIGIGLEDEY 580

Query: 376 SSI 378
           S +
Sbjct: 581 SPV 583



 Score = 45.4 bits (106), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 60/262 (22%), Positives = 100/262 (38%), Gaps = 53/262 (20%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKK-----LENVKRD 118
           P  +    GF  + F   + A DE+F T          V  E +A +K     LE  +R+
Sbjct: 253 PRGLAARYGFEYRAFNDPSTAYDEYFLTVAREAAGASTVAAEIEAERKKLLASLEAARRN 312

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
            E     L + +    E AE++  N   V  A+   R+ +     WE I           
Sbjct: 313 LEHLRKKLRELE----ELAEIVSTNIADVYDAVECARK-MRETAGWEQIPGNCPGVV-DV 366

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
           +P   IIK + +  N + + +R+                            +P  + VDL
Sbjct: 367 EPNKGIIK-ISIVGNIVPIDIRM----------------------------EPGRLVVDL 397

Query: 239 DLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVY 298
                   KR  ++ R+  ++ +K ++  E  L   E+K +Q L   + M      R+  
Sbjct: 398 -------YKRIGEV-RAKIERGEKAVKDIEARLAELEEKVRQRLLRARAMV-----RRKE 444

Query: 299 WFEKFYWFISSENYLGNVAWDA 320
           W+EK++W I+S  YL     DA
Sbjct: 445 WYEKYHWVITSHGYLAIGGRDA 466


>gi|288932692|ref|YP_003436752.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
           10642]
 gi|288894940|gb|ADC66477.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
           10642]
          Length = 646

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 89/389 (22%), Positives = 161/389 (41%), Gaps = 107/389 (27%)

Query: 48  VANMEFHP-MILQPGST----PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAV 102
           V N  F+P +IL+ G      P  +++++G+  + FESF  AVDEF++ +   +I++K  
Sbjct: 220 VENCNFNPHIILKDGDYVDYQPIDLKKYEGYEKKYFESFNKAVDEFYTRSALKEIEVK-- 277

Query: 103 QQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQL 162
           +++ + ++KLEN       RL    + QL  KE+ E     +ES    +  I   I  + 
Sbjct: 278 EKKSEVIEKLEN-------RL----KIQLETKERYE-----RES--EKLRRIGDLIYEKY 319

Query: 163 SW-EDIEARVKQA--QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
              E I + +K+A   +  D V  I+ + K       +L  + KE+     + VL  D+ 
Sbjct: 320 PIVERIHSALKKAVELKGFDEVKKILAEQKKAGKLKEILDIIPKEK-----AVVLSIDD- 373

Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279
                    K    +D +L    + NA+ ++D  +   +K    +++ EK  +   +  +
Sbjct: 374 --------VKFKLFLDKNL----HENAEYYYDQAKKLKEKVNGIVKAIEKTREEIRRAEE 421

Query: 280 QTLKDVQTMTNINKARKVYWFEKFYWFISSENYL-------------------------- 313
              K +  ++     R+  W+EK+ W+I+SE +L                          
Sbjct: 422 IEAKKI--LSEFRVVRRREWYEKYRWYITSEGFLVIGGRNAEMNEEIVSKHFESKDLFFH 479

Query: 314 -----GNV----------------AWDAKVVTNAWW-----------VKADQVSKTAPTG 341
                G V                A +   + +A W           V  +QV + A  G
Sbjct: 480 TQTPGGAVTILKRGLEAGEKSIKEAAEFAAIYSALWKHGMHSGEVYYVTYEQVKRAAKPG 539

Query: 342 EFLTTGSFMIRGKKNFFP-PCQLAMGISF 369
           E+L  GSF I GK+N+     + A+G+  
Sbjct: 540 EYLPKGSFYIVGKRNYLSVELKAAIGVDL 568


>gi|257053989|ref|YP_003131822.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
           12940]
 gi|256692752|gb|ACV13089.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
           12940]
          Length = 707

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 59/278 (21%), Positives = 115/278 (41%), Gaps = 57/278 (20%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKR---DH 119
           TP  + +++  P + F+SF  A++ +F   E    + +      D   ++E  KR     
Sbjct: 260 TPVPLVEYEDTPGESFDSFNDALEAYFLGLEQEPDEEETGSNRPDFEAEIEKQKRIIQQQ 319

Query: 120 ETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
           E  +   E+    ++EKAEL+  N + VD  +  ++   A +  W++IEA +   +    
Sbjct: 320 EGAIEDFEEDAEAEREKAELLYANYDLVDEVLSTVQDARAAETPWDEIEATLSAGKDQGI 379

Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD 239
           P A  ++ +  +   +T+          I D    HH                 +++D D
Sbjct: 380 PAAEAVRDVDGSEGTVTV---------QIDD----HH-----------------IELDAD 409

Query: 240 LSAYANAKRFF-DLKRSAAKK--QQKTIQSTEKALKSAEKKTK---------------QT 281
                NA R + + KR   KK   ++ I +T + L++ +++ +               +T
Sbjct: 410 TGVEKNADRLYQEAKRIEGKKAGAEEAIANTREQLEAVKQRREAWEASDGDDGGDGSGET 469

Query: 282 LKDVQ------TMTNINKARKVYWFEKFYWFISSENYL 313
            +D Q      T  +I       W+E+F WF +S+ +L
Sbjct: 470 HEDDQEDVDWLTRESIPIRTSEEWYERFRWFTTSDGFL 507



 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           + K   + + V  DQV+KT  +GE+L  GSF IRG + ++   P  +A+GI+
Sbjct: 580 EGKYAGDVYCVGPDQVTKTPESGEYLEKGSFAIRGDRTYYDDTPVGVAVGIT 631


>gi|21227916|ref|NP_633838.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
 gi|20906336|gb|AAM31510.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
          Length = 343

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 35/55 (63%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           + +W+K++QV+KT  +GE+L  G+F+IRG++N+F    L + +    K E   I 
Sbjct: 141 DCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKDVPLGIAVGLELKGETRIIG 195


>gi|386874769|ref|ZP_10116995.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
           BD31]
 gi|386807392|gb|EIJ66785.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
           BD31]
          Length = 539

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 35/54 (64%), Gaps = 3/54 (5%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGI 367
           AW +     +A+WV  +QV K+AP+GEFL  GSF I G++NF      +LA+GI
Sbjct: 395 AWREGMYGVSAYWVNPEQVKKSAPSGEFLPKGSFTIEGQRNFIKSDTLRLAVGI 448


>gi|410670434|ref|YP_006922805.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
 gi|409169562|gb|AFV23437.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
          Length = 664

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 34/55 (61%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           + +WV  +QVSKT  +GE++  G+F+IRG++N+F   Q+ M ++     E   I 
Sbjct: 544 DCYWVLPEQVSKTPESGEYVKKGAFIIRGERNYFRDVQVGMAVALELGAETRVIG 598



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/289 (20%), Positives = 112/289 (38%), Gaps = 45/289 (15%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETR 122
            P  ++ H G   + F SF  A+D FF    + ++       +++ +   E   R  E  
Sbjct: 259 APFELKTHAGLEKEVFPSFNKALDGFFGKRSAEEVTEVVEAVKKEKVDVFERRLRKQEEA 318

Query: 123 LSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVA 182
           +    +      + AE I  + + ++  I  + +   N  SW++I++ +K A +   P A
Sbjct: 319 IENFGREAERHVDVAEKIYAHYQVIEDVIGVLEKARQNGYSWDEIKSILKGA-KETVPAA 377

Query: 183 SIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSA 242
             I  +      I L L   K                                +D+ L+ 
Sbjct: 378 KSISSIDSATGRIVLDLEGTK------------------------------ATIDIKLTI 407

Query: 243 YANAKRFFDLKRSAAKKQQ---KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW 299
             NA+ +++  +   +K++   + I+ T  A++  EKK     + V          K +W
Sbjct: 408 PQNAQSYYEKAKKLTRKKEGAIRAIEDTRVAMQKKEKKVSGNKRKVHM--------KKHW 459

Query: 300 FEKFYWFISSENYLGNVAWDAKV---VTNAWWVKADQVSKTAPTGEFLT 345
           +++F WF SSE +L     DA+    +   +  K+D V  T   G  +T
Sbjct: 460 YDRFRWFYSSEGFLVVGGRDAETNEELVKKYMDKSDVVFHTQDPGAPMT 508


>gi|429192346|ref|YP_007178024.1| RNA-binding protein [Natronobacterium gregoryi SP2]
 gi|448325749|ref|ZP_21515133.1| Fibronectin-binding A domain-containing protein [Natronobacterium
           gregoryi SP2]
 gi|429136564|gb|AFZ73575.1| putative RNA-binding protein, snRNP like protein [Natronobacterium
           gregoryi SP2]
 gi|445614570|gb|ELY68242.1| Fibronectin-binding A domain-containing protein [Natronobacterium
           gregoryi SP2]
          Length = 710

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 54/257 (21%), Positives = 109/257 (42%), Gaps = 16/257 (6%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
           TP  + +H+  G   + ++SF A +D++F   E  + D     ++R    + + K E + 
Sbjct: 264 TPFPLEEHEAEGLAAEAYDSFLAVLDDYFFRLELEEEDDSDPTEQRPDFEEEIAKYERII 323

Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
              E  +   EQ     +EKAEL+      VD  +  +R+       W++IE R ++ + 
Sbjct: 324 EQQEGAIEGFEQQAEQLREKAELLYAEYGLVDEVLSTVREAREQDRPWDEIEERFEEGKE 383

Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDV 236
                A  +  +  +   +T+ L        + D    + D     A   E KK      
Sbjct: 384 RGIEAAKAVVDVDGSEGTVTVTLDGEHVELAVHDGVEQNADRLYKEAKDIEGKKEGA--- 440

Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARK 296
              L+A  + +   DL+   AK+++   +  ++     ++  +   KD  +M ++     
Sbjct: 441 ---LAAIEDTRE--DLEE--AKRRRDQWEVDDEDDGDDDEIDEADSKDWLSMPSVPIREN 493

Query: 297 VYWFEKFYWFISSENYL 313
             W+++F WF +S++YL
Sbjct: 494 EPWYDRFRWFYTSDDYL 510



 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P  +A+GI
Sbjct: 583 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 633


>gi|167043365|gb|ABZ08068.1| putative domain of unknown function (DUF814) [uncultured marine
           crenarchaeote HF4000_ANIW141O9]
          Length = 632

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 35/53 (66%), Gaps = 1/53 (1%)

Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEES 376
           ++A+WV  DQV K AP+G+ +  GSFMI G++NF     L M ++ + K EES
Sbjct: 509 SSAYWVNPDQVKKGAPSGQSMAKGSFMIEGQRNFVKISTLKMCVA-IIKHEES 560



 Score = 38.5 bits (88), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 6/89 (6%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNIN- 292
           + +DL+ S    A   F+     +KKQ+  I S EK L   E + ++ ++  ++  +++ 
Sbjct: 363 IKIDLNSSLPTTASTLFN----ESKKQKAAIGSIEKLLIKTENELEKVIEKGESAKSVSF 418

Query: 293 -KARKVYWFEKFYWFISSENYLGNVAWDA 320
            + RK  WFE++ WF +++  L     D+
Sbjct: 419 TQVRKKNWFERYRWFYTTDGVLAVGGRDS 447


>gi|448565126|ref|ZP_21636097.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
 gi|445715785|gb|ELZ67538.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
          Length = 702

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 59/281 (20%), Positives = 110/281 (39%), Gaps = 65/281 (23%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +R+HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + +    E  +   EQ    ++E+AEL+  N + VD  +  +R      + W+DI   + 
Sbjct: 313 QRIIDQQEGAIEGFEQQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLA 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +++ +    NA R + + KR   KK+     I+ T + L + +K+  +   D     
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDEDD 462

Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
                          + ++      +WFE+F WF +S  YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYL 503



 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF+IRG + +F     ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVAAKVAVGI 625


>gi|288560094|ref|YP_003423580.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
 gi|288542804|gb|ADC46688.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
          Length = 669

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 35/55 (63%), Gaps = 1/55 (1%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW     T + +WV+ +QVSKT  +GEF+  G+F+IRGK+N+    +L + I  +
Sbjct: 548 AWSRNYGTQDVYWVEPEQVSKTPVSGEFVPKGAFIIRGKRNYIRGAKLEIAIGIV 602


>gi|374724028|gb|EHR76108.1| putative RNA-binding protein [uncultured marine group II
           euryarchaeote]
          Length = 723

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 80/345 (23%), Positives = 142/345 (41%), Gaps = 64/345 (18%)

Query: 1   NVSNEDVS-SDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE--DYIVANMEFHPM- 56
           N++ +D +  D+H     L  L+ +L    QG L+ K       E  +   A+ME + + 
Sbjct: 206 NMATQDANVEDVHTA---LQTLLGHLGEQRQGILLMKPSDEMGVEALEQKAASMEANGLR 262

Query: 57  -----ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEF--------FSTAESHKIDLKAVQ 103
                     +TP+++  H G    +F +   AVD +         +  E+ K+D+ A  
Sbjct: 263 DRFFEAHASEATPTILPSHAGMAQAKFATLCEAVDAWKGAHDAGALARREAEKLDIAAPG 322

Query: 104 QERDA-LKKLENVKRDHETRLSA----LEQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
           +     +++LE  K   E  LS     +E+ Q++       I NN   V++ ++ + + I
Sbjct: 323 RGHSTDVERLERRKVQQEKALSGFSKKIEKQQMI----GHTIQNNWTHVESLLIQVTEAI 378

Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDN 218
             +  W+++++  K          SI   + LN            ERS +    VL  +N
Sbjct: 379 EAK-GWKEVKSMAK----------SIPWIVSLN----------PAERSFLS---VLPDEN 414

Query: 219 ADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKT 278
            +P       K P    + +D S + NA+RFF   R    K +  + + E  L   ++  
Sbjct: 415 GEP-------KGPQ-ATLSIDESVHQNAQRFFTAARKQKDKTKGAVDALEDTLLQLQRAQ 466

Query: 279 KQTLKDVQTMTNINKAR--KVYWFEKFYWFISSENYLGNVAWDAK 321
           K+  K  Q    +NK +  K  WFE   W + +  +L     DAK
Sbjct: 467 KKEAKQ-QATGKLNKIKRSKRLWFEHHRWSMITGGHLLVGGKDAK 510



 Score = 45.4 bits (106), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 30/44 (68%)

Query: 327 WWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           + VK  QVSKTA TGEF+  GSF++RG++ +F    + +GI  +
Sbjct: 602 YSVKPAQVSKTAQTGEFVGKGSFIVRGQRQWFKDLDVQIGIGIV 645


>gi|307595006|ref|YP_003901323.1| hypothetical protein Vdis_0882 [Vulcanisaeta distributa DSM 14429]
 gi|307550207|gb|ADN50272.1| protein of unknown function DUF814 [Vulcanisaeta distributa DSM
           14429]
          Length = 668

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 38/59 (64%), Gaps = 1/59 (1%)

Query: 323 VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF-LFKLEESSISR 380
           V +A++V+ +QV+K AP+GE+L  GSFMI G + +    +L +GI   +    E+SI R
Sbjct: 528 VLDAFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWVRNAELGLGIGVRVDSFGEASIIR 586


>gi|48478297|ref|YP_024003.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
 gi|48430945|gb|AAT43810.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
          Length = 611

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW A + +  A+WV   QVSKT  +GEF++ GS++IRGK+N+     L + I+ +
Sbjct: 489 AWPAGIGSGTAYWVYPSQVSKTPESGEFISKGSWVIRGKRNYIFDLPLKLKITVI 543


>gi|325958497|ref|YP_004289963.1| fibronectin-binding A domain-containing protein [Methanobacterium
           sp. AL-21]
 gi|325329929|gb|ADZ08991.1| Fibronectin-binding A domain protein [Methanobacterium sp. AL-21]
          Length = 661

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 32/46 (69%), Gaps = 2/46 (4%)

Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           T+ +WV  +QVSKT  +GEF+  G+F+IRG +N+    P  L++GI
Sbjct: 538 TDVYWVHPEQVSKTPQSGEFVAKGAFIIRGSRNYMRGLPLTLSLGI 583



 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 73/284 (25%), Positives = 120/284 (42%), Gaps = 41/284 (14%)

Query: 53  FHPMILQPGS---TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--RD 107
           F+P I++       P  +  ++ F  + FESF  A DEF+S+      D+  V +E    
Sbjct: 235 FNPQIVKGKKEDVLPLDLLMYKDFEKESFESFNDAADEFYSSIVGE--DIVNVNEEVWSG 292

Query: 108 ALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
            + K E         L   E+T    K K E I ++ ++++  IL I        SW +I
Sbjct: 293 EVGKFEKRLNIQLETLEKFEKTVKDSKIKGEAIYSDYQAIE-NILNIIHSARETNSWLEI 351

Query: 168 EARVKQAQRHNDPVASIIKQL-KLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS 226
            A VK+A++   P   II+ + K+ +    L L +   R NI DS +   +NA+ + +  
Sbjct: 352 IATVKKAKKDKVPGLEIIESIDKMGV----LTLNLDGVRVNI-DSSMGIPENAEIYYNKG 406

Query: 227 ETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ 286
           +  K  +  V + +            ++   K + K     EK L   +K+ K+ LK   
Sbjct: 407 KKAKRKIKGVHIAIEK---------TRKEIDKAKNKREIEMEKVL-VPQKRVKKDLK--- 453

Query: 287 TMTNINKARKVYWFEKFYWFISSENYLGNVAWDAKVVTNAWWVK 330
                       W+EK  WF++S+  L     DA   TN   VK
Sbjct: 454 ------------WYEKLRWFVTSDGLLAIGGRDA--TTNEMVVK 483


>gi|347524253|ref|YP_004781823.1| hypothetical protein Pyrfu_1716 [Pyrolobus fumarii 1A]
 gi|343461135|gb|AEM39571.1| protein of unknown function DUF814 [Pyrolobus fumarii 1A]
          Length = 668

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 1/54 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           AW   +   + +WV   QVSK AP GE+LT G+FM+ G++N+    +L + I  
Sbjct: 516 AWKEGLGAIDVYWVWGRQVSKAAPPGEYLTKGAFMVYGERNYIRGVELKLAIGL 569


>gi|11499620|ref|NP_070862.1| hypothetical protein AF2038 [Archaeoglobus fulgidus DSM 4304]
 gi|2648497|gb|AAB89216.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 627

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 64/270 (23%), Positives = 119/270 (44%), Gaps = 46/270 (17%)

Query: 51  MEFHP---MILQPGS----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQ 103
           M F P   +IL+ G      P  +  +  +  + FESF  A+D++FS   +   +L++++
Sbjct: 205 MNFTPKPQIILKDGDYLDVVPMDLLYYSNYEKKYFESFNDALDDYFSKKLAEMDELESMK 264

Query: 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLS 163
            E   L+KL+      +  L   E      ++  + I  N + V+  I A R     + S
Sbjct: 265 SEE--LEKLKKRLEIQKESLRKFEDEAESFRKIGDAIYENYQMVEKIIEAFRA-ARERKS 321

Query: 164 WEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFA 223
           W++I    K+    ++ +  ++K +K   N I + +        IK S    H+NAD + 
Sbjct: 322 WDEI----KEIVARDEKLKKLVKAIKPEKNAIVVKVGDFDVELEIKKSI---HENADLY- 373

Query: 224 DSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLK 283
                              Y  AK+     R  A+  ++ I++T + ++  E+K ++ L 
Sbjct: 374 -------------------YEKAKK----AREKAEGVKRAIEATLREMERVEEKLEKKL- 409

Query: 284 DVQTMTNINKARKVYWFEKFYWFISSENYL 313
               +T+I   RK  W+E + WF +SE +L
Sbjct: 410 ----VTSIKVRRKKEWYENYRWFFTSEGFL 435



 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP-PCQLAMGI 367
           + K     ++V  +QVSK+A  GE+L  GSF I GK+N+       A+G+
Sbjct: 499 EGKHAGEVYYVLPEQVSKSAKAGEYLPKGSFYITGKRNYLTVELNCAVGV 548


>gi|430813961|emb|CCJ28738.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 441

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 28/39 (71%)

Query: 342 EFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSISR 380
           E+ T G+FMI+GKKNF PP QL +G   L+ ++E S +R
Sbjct: 3   EYSTVGTFMIQGKKNFLPPSQLILGYGILWTIDEVSKAR 41


>gi|330506586|ref|YP_004383014.1| hypothetical protein MCON_0325 [Methanosaeta concilii GP6]
 gi|328927394|gb|AEB67196.1| protein of unknown function (DUF814) [Methanosaeta concilii GP6]
          Length = 641

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 32/46 (69%), Gaps = 2/46 (4%)

Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
            + + VK DQVSKT  +GEFL  G+F+IRG++ +F   P  +A+GI
Sbjct: 523 ADCYLVKGDQVSKTPESGEFLKKGAFVIRGERRYFRDVPLGIALGI 568



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 61/264 (23%), Positives = 106/264 (40%), Gaps = 74/264 (28%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAE---------SHKIDLKAVQQERDALKKLEN 114
           P  ++ + G   + F +F+ A+D FF   E          H+I+L     +R A+++  +
Sbjct: 250 PRPLKLYSGLEKKRFVTFSEALDAFFVEREKETTRQDPLEHRIEL-----QRKAIEEFRS 304

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
                       ++ +LV   K ELI     SV+  +  +    A   S+  I  R+  +
Sbjct: 305 ------------QEAELV--RKGELIYQLYGSVEQILTLMNDARARGFSYNQIWERISGS 350

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLV 234
                P A  I  L          +RV                    F D  E      +
Sbjct: 351 ---GLPQAKTILSLDGRGE-----MRV--------------------FLDGEE------L 376

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQ---QKTIQSTE--KALKSAEKKTKQTLKDVQTMT 289
           +++ +L+   NA+R++D  +   +K    Q  +  TE  KA K A KKT+        ++
Sbjct: 377 ELNAELAVPQNAQRYYDKAKDMVRKARGAQSALAITEELKAGKVAPKKTR-------AVS 429

Query: 290 NINKARKVYWFEKFYWFISSENYL 313
           +  + RK  W+E+F WF SS+ +L
Sbjct: 430 SYYRRRKPKWYERFRWFYSSDGFL 453


>gi|345005767|ref|YP_004808620.1| fibronectin-binding A domain-containing protein [halophilic
           archaeon DL31]
 gi|344321393|gb|AEN06247.1| Fibronectin-binding A domain protein [halophilic archaeon DL31]
          Length = 717

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPP--CQLAMGIS 368
           D +   +A+ V  DQVSKT  +GE+L  G F IRG + +F    C++A+GI+
Sbjct: 589 DGRGAGDAYMVDPDQVSKTPESGEYLEKGGFAIRGDREYFNGINCEVAVGIT 640



 Score = 42.7 bits (99), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 55/261 (21%), Positives = 103/261 (39%), Gaps = 55/261 (21%)

Query: 78  FESFAAAVDEFFS--TAESHKIDLKAVQQERDALKKLENVKR---DHETRLSALEQTQLV 132
           F+SF AA+DE+F   T +  + +    ++  D    +E  KR     E  +   E+    
Sbjct: 287 FDSFNAALDEYFYRLTHQERREEEGGGRKRPDFEADIEKEKRIIQQQEGAIEGFEEEAEQ 346

Query: 133 DKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNI 192
            + +AEL     E VD  +  I+Q    +  W++I+  + Q      P A  +  +    
Sbjct: 347 RRREAELCYERYELVDEVLSTIQQARQQEHGWDEIQETLAQGAEQGIPAAEAVVDVNSAK 406

Query: 193 NHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF-D 251
             +T+ L  H+                              + +D  +    NA R + +
Sbjct: 407 GMVTIELDDHR------------------------------ITLDASMGVEKNADRLYRE 436

Query: 252 LKRSAAKKQ--QKTIQSTEKALKSAEKKTKQ-----------------TLKDVQTMTNIN 292
            KR   KK+  ++ I+ T K L++A+++ ++                    D  T  +I 
Sbjct: 437 AKRVEGKKEGAREAIEDTRKRLEAAKQRREEWEAEDDPEPEPDPDEEQEEVDWLTREDIP 496

Query: 293 KARKVYWFEKFYWFISSENYL 313
             +  +W+E+F WF +S+ YL
Sbjct: 497 IRQPEHWYEEFRWFRTSDGYL 517


>gi|305663918|ref|YP_003860206.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
 gi|304378487|gb|ADM28326.1| protein of unknown function DUF814 [Ignisphaera aggregans DSM
           17230]
          Length = 667

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 36/56 (64%), Gaps = 3/56 (5%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISF 369
           AW + ++  + +WV   Q+S + PTGE+L  GS+M+ GKKN+      +LA+GI+ 
Sbjct: 524 AWKSGLLAIDVFWVYGSQISLSPPTGEYLPKGSYMVYGKKNYIKNVSLKLALGIAI 579


>gi|119719655|ref|YP_920150.1| hypothetical protein Tpen_0745 [Thermofilum pendens Hrk 5]
 gi|119524775|gb|ABL78147.1| protein of unknown function DUF814 [Thermofilum pendens Hrk 5]
          Length = 610

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 29/42 (69%), Gaps = 1/42 (2%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
           AW   +   + +WV A+QVSK  P+GE+L  GSFM+ GKKN+
Sbjct: 482 AWKGGLYAVDVFWVPAEQVSKKPPSGEYLAKGSFMVYGKKNY 523


>gi|448622787|ref|ZP_21669436.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
           35960]
 gi|445753295|gb|EMA04712.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
           35960]
          Length = 701

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 58/280 (20%), Positives = 110/280 (39%), Gaps = 64/280 (22%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +R+HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPDFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + +    E  +   E+    ++E+AEL+  N + VD  +  +R      + W+DI   + 
Sbjct: 313 QRIIDQQEGAIEGFEKQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLA 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +++ +    NA R + + KR   KK+     I+ T + L + +K+  +   D     
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDDDE 462

Query: 287 -------------TMTNINKARKVYWFEKFYWFISSENYL 313
                         + ++      +WFE+F WF +S  YL
Sbjct: 463 EDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYL 502



 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF++RG + +F   P ++A+GI
Sbjct: 574 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKVAVGI 624


>gi|257077022|ref|ZP_05571383.1| hypothetical protein Faci_08161 [Ferroplasma acidarmanus fer1]
          Length = 615

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 36/55 (65%), Gaps = 1/55 (1%)

Query: 317 AWDAKVVTN-AWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW A + +  A+WV   QVSKT  +GEF++ GS+++RGK+N+     L + +S +
Sbjct: 489 AWPAGIGSGTAYWVYPSQVSKTPESGEFVSKGSWIVRGKRNYVLNIPLELKVSIV 543


>gi|448633897|ref|ZP_21674396.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
           29715]
 gi|445750588|gb|EMA02026.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
           29715]
          Length = 717

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 58/264 (21%), Positives = 118/264 (44%), Gaps = 33/264 (12%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---------STAESHKIDLKA-VQQERDALKKL 112
           TP  + +++    + F  F  A+D++F            E+ + D +A +++++  +++ 
Sbjct: 275 TPIPLAEYEELYSESFTEFNTALDDYFFNFQREEEVEGGETQRPDFEAEIEKQKRIIQQQ 334

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E    D E    A       ++EKAEL+  N + VD  +  ++    + +SW+DIEA+  
Sbjct: 335 EQAIEDFEADAEA-------EREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFD 387

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
           +        A  +  + L+ +  T+ L +   R  + D++     NAD     A   E K
Sbjct: 388 EGADRGIEAAEAV--VSLDGSEGTVTLDIEGTRVTV-DAFTGVEKNADELYKEAKRIEEK 444

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
           K         L+A  N +   DL+  A K+++   ++ +   +  E  ++    D  ++ 
Sbjct: 445 KEGA------LAAIENTRE--DLE--AVKERRDEWEADDGEDEVDEDGSEDEPTDWLSIQ 494

Query: 290 NINKARKVYWFEKFYWFISSENYL 313
           +I       W+E+F WF +S+ +L
Sbjct: 495 SIPTRSTERWYEQFRWFHTSDGFL 518



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F +RG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPVGVAVGIT 641


>gi|340345857|ref|ZP_08668989.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
 gi|339520998|gb|EGP94721.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
          Length = 638

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 35/54 (64%), Gaps = 3/54 (5%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF--FPPCQLAMGI 367
           AW +     +A+WV  +QV K+AP+G+FL  GSF I G++NF      +LA+GI
Sbjct: 504 AWREGMYGVSAFWVNPEQVKKSAPSGQFLPKGSFTIEGQRNFVKISTLKLAVGI 557


>gi|448725341|ref|ZP_21707802.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
 gi|445798677|gb|EMA49073.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
          Length = 695

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/291 (19%), Positives = 116/291 (39%), Gaps = 55/291 (18%)

Query: 48  VANMEFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAV 102
           +++ +F P I +        TP  + +   F  +EF+SF AA+D +F   ++ + +    
Sbjct: 237 LSSGDFDPRIYRDDGEPVDVTPFPLDERAEFDSEEFDSFTAALDAYFVELDTTEDEESGE 296

Query: 103 QQERDALKKLENVKR---DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
           ++  D  +++E  +R     E  +   E     ++E AE +  N E VD  +  +R    
Sbjct: 297 RERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYANYELVDEILTTVRNARE 356

Query: 160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNA 219
             + WE IE R  + +      A  +  ++ +   +T+ +                 D+ 
Sbjct: 357 EGIGWEAIEERFAEGEERGIAAAGAVTGIEPSEGTVTIEI-----------------DDR 399

Query: 220 DPFADSSETKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEK 276
           D             V++D       NA R + + KR   KK+  ++ +  T + L++ E+
Sbjct: 400 D-------------VELDPQEGVEQNADRLYREAKRVVEKKEGAEEAVVETREELEAIER 446

Query: 277 KTKQTLKDVQ--------------TMTNINKARKVYWFEKFYWFISSENYL 313
           +  +                    +  +I   +   W+E+F WF +S+ YL
Sbjct: 447 QRDEWEAGDVDDDPDEESEDVDWLSRRSIPTRKNEQWYERFRWFHTSDGYL 497



 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           + +   +A+  + DQVSKT  +GE+L  G F IRG + +F      +A+GI+
Sbjct: 569 NGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGIT 620


>gi|448602394|ref|ZP_21656450.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445747909|gb|ELZ99363.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
           BAA-897]
          Length = 702

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 58/281 (20%), Positives = 110/281 (39%), Gaps = 65/281 (23%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKL 112
           TP  +R+HQ  G   + +++F  A+DE+F      ++DL A +QE         + + K 
Sbjct: 258 TPFPLREHQNAGLDEEAYDTFNDALDEYF-----FRLDLTADEQEATSDRPNFEEQIAKQ 312

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           + +    E  +   E+    ++E+AEL+  N + VD  +  +R      + W+DI   + 
Sbjct: 313 QRIIDQQEGAIEGFEKQAQDERERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLA 372

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPS 232
           +      P A  +  +      +T+ L                 D+A             
Sbjct: 373 EGAEQGIPEAEAVTNVDGANGTVTVDL-----------------DDA------------- 402

Query: 233 LVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ--- 286
            V +++ +    NA R + + KR   KK+     I+ T + L + +K+  +   D     
Sbjct: 403 TVTLEVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVKKRRDEWEADDDDED 462

Query: 287 --------------TMTNINKARKVYWFEKFYWFISSENYL 313
                          + ++      +WFE+F WF +S  YL
Sbjct: 463 EDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSTGYL 503



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V+  QVSKT  +GE++  GSF+IRG + +F   P ++A+GI
Sbjct: 575 EGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVPAKVAVGI 625


>gi|257388236|ref|YP_003178009.1| fibronectin-binding A domain-containing protein [Halomicrobium
           mukohataei DSM 12286]
 gi|257170543|gb|ACV48302.1| Fibronectin-binding A domain protein [Halomicrobium mukohataei DSM
           12286]
          Length = 708

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ--LAMGIS 368
           D K   +A+ V  DQVSKT  +GE+L  G F IRG + +F   +  +A+GIS
Sbjct: 582 DGKYAGDAYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFRDLEVDVAVGIS 633



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 59/276 (21%), Positives = 114/276 (41%), Gaps = 55/276 (19%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF------STAE---SHKIDLKAVQQERDALKKLE 113
           TP  + ++     + FE+F  A+DE+F       TAE      +D    + E   ++K E
Sbjct: 265 TPIPLEEYDDVESRAFETFTEALDEYFYEVEREDTAEEIADAGVDRPDFESE---IEKYE 321

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
            + +  ++ +   E     ++EKAEL+    + VD  +  I+        W++IEA  ++
Sbjct: 322 RIIQQQQSAIEDFESDAEAEREKAELLYARYDLVDEILSTIQGARTQDTPWDEIEATFEE 381

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
            +      A  ++   L+ +  T+ L +   R  I D+ +    NAD             
Sbjct: 382 GKEQGIAAAEAVE--GLDGSEGTVTLSIDDVRVTI-DATMGVEKNADQL----------- 427

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK-----------TKQTL 282
                    Y  AKR  + K  A    Q  I+ T + L++ E++           T++  
Sbjct: 428 ---------YQAAKRIEEKKEGA----QAAIEDTREDLEAVERRRENWEAEDTTETQEQT 474

Query: 283 KDVQTMTNINKA-----RKVYWFEKFYWFISSENYL 313
            +   +  +++A     R+  W+++F WF +S  +L
Sbjct: 475 AEADDVDWLSRASIPVRRQEPWYDRFRWFRTSNGFL 510


>gi|393796641|ref|ZP_10380005.1| hypothetical protein CNitlB_10052 [Candidatus Nitrosoarchaeum
           limnia BG20]
          Length = 638

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW +     +A+WV  +QV K+AP+G+FL  GSF I G++NF     L + +  +
Sbjct: 504 AWREGMYGVSAFWVNPEQVKKSAPSGQFLPKGSFTIEGQRNFVKISTLKLAVGLM 558


>gi|329766254|ref|ZP_08257812.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329137313|gb|EGG41591.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 590

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFL 370
           AW +     +A+WV  +QV K+AP+G+FL  GSF I G++NF     L + +  +
Sbjct: 456 AWREGMYGVSAFWVNPEQVKKSAPSGQFLPKGSFTIEGQRNFVKISTLKLAVGLM 510


>gi|448639710|ref|ZP_21676858.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
 gi|445762237|gb|EMA13458.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
          Length = 717

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F +RG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 85/185 (45%), Gaps = 16/185 (8%)

Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
           V++EKAEL+  N + VD  +  ++    + +SW+DIEA+  +       +A+    + L+
Sbjct: 347 VEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRG--IAAAEAVVSLD 404

Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYANAKR 248
            +  T+ L +   R  + D++     NAD     A   E KK         L+A  N + 
Sbjct: 405 GSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIENTRE 457

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
             DL+    ++++      E    + E + + T  D  +M +I       W+E+F WF +
Sbjct: 458 --DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQSIPTRSTERWYEQFRWFHT 513

Query: 309 SENYL 313
           S+ +L
Sbjct: 514 SDGFL 518


>gi|344211873|ref|YP_004796193.1| fibronectin-binding A domain-containing protein [Haloarcula
           hispanica ATCC 33960]
 gi|343783228|gb|AEM57205.1| fibronectin-binding A domain protein [Haloarcula hispanica ATCC
           33960]
          Length = 717

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F +RG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 85/185 (45%), Gaps = 16/185 (8%)

Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
           V++EKAEL+  N + VD  +  ++    + +SW+DIEA+  +       +A+    + L+
Sbjct: 347 VEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRG--IAAAEAVVSLD 404

Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYANAKR 248
            +  T+ L +   R  + D++     NAD     A   E KK         L+A  N + 
Sbjct: 405 GSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIENTRE 457

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
             DL+    ++++      E    + E + + T  D  +M +I       W+E+F WF +
Sbjct: 458 --DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQSIPTRSTERWYEQFRWFHT 513

Query: 309 SENYL 313
           S+ +L
Sbjct: 514 SDGFL 518


>gi|55377795|ref|YP_135645.1| hypothetical protein rrnAC0969 [Haloarcula marismortui ATCC 43049]
 gi|55230520|gb|AAV45939.1| unknown [Haloarcula marismortui ATCC 43049]
          Length = 717

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE+L  G F +RG + +F   P  +A+GI+
Sbjct: 590 DGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVGIT 641



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 85/185 (45%), Gaps = 16/185 (8%)

Query: 132 VDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLN 191
           V++EKAEL+  N + VD  +  ++    + +SW+DIEA+  +       +A+    + L+
Sbjct: 347 VEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRG--IAAAEAVVSLD 404

Query: 192 INHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVDVDLDLSAYANAKR 248
            +  T+ L +   R  + D++     NAD     A   E KK         L+A  N + 
Sbjct: 405 GSEGTVTLDIDGTRVTV-DAFTGVEKNADELYKEAKRIEEKKEGA------LAAIENTRE 457

Query: 249 FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFIS 308
             DL+    ++++      E    + E + + T  D  +M +I       W+E+F WF +
Sbjct: 458 --DLEAVKERREEWEADDGEDEADNDEAEDEPT--DWLSMQSIPTRSTERWYEQFRWFHT 513

Query: 309 SENYL 313
           S+ +L
Sbjct: 514 SDGFL 518


>gi|335438854|ref|ZP_08561586.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
 gi|334890357|gb|EGM28628.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
          Length = 707

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 34/52 (65%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           D K   + + V+ DQV+KT  +GE+L  GSF IRG++ ++   P  +A+GI+
Sbjct: 580 DGKYAGDVYCVEHDQVTKTPESGEYLEKGSFAIRGERTYYDDTPVGVAVGIT 631


>gi|254167318|ref|ZP_04874170.1| conserved domain protein [Aciduliprofundum boonei T469]
 gi|197623581|gb|EDY36144.1| conserved domain protein [Aciduliprofundum boonei T469]
          Length = 589

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/137 (26%), Positives = 51/137 (37%), Gaps = 61/137 (44%)

Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
           I K R+ +WFEK+ WFISSE  L     DAK                             
Sbjct: 383 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 442

Query: 323 ---------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                                ++ AW          WV   QVSK   +GE++  G++++
Sbjct: 443 KSEGKEIGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVV 502

Query: 352 RGKKNFF--PPCQLAMG 366
            GK+N+    P QLA+G
Sbjct: 503 HGKRNYIHKVPLQLAVG 519


>gi|289596339|ref|YP_003483035.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
 gi|289534126|gb|ADD08473.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
          Length = 589

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/137 (26%), Positives = 51/137 (37%), Gaps = 61/137 (44%)

Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
           I K R+ +WFEK+ WFISSE  L     DAK                             
Sbjct: 383 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 442

Query: 323 ---------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                                ++ AW          WV   QVSK   +GE++  G++++
Sbjct: 443 KSEGKEIGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVV 502

Query: 352 RGKKNFF--PPCQLAMG 366
            GK+N+    P QLA+G
Sbjct: 503 HGKRNYIHKVPLQLAVG 519


>gi|254166596|ref|ZP_04873450.1| conserved domain protein [Aciduliprofundum boonei T469]
 gi|197624206|gb|EDY36767.1| conserved domain protein [Aciduliprofundum boonei T469]
          Length = 593

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/137 (26%), Positives = 51/137 (37%), Gaps = 61/137 (44%)

Query: 291 INKARKVYWFEKFYWFISSENYLGNVAWDAKV---------------------------- 322
           I K R+ +WFEK+ WFISSE  L     DAK                             
Sbjct: 387 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 446

Query: 323 ---------------------VTNAW----------WVKADQVSKTAPTGEFLTTGSFMI 351
                                ++ AW          WV   QVSK   +GE++  G++++
Sbjct: 447 KSEGKEIGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVV 506

Query: 352 RGKKNFF--PPCQLAMG 366
            GK+N+    P QLA+G
Sbjct: 507 HGKRNYIHKVPLQLAVG 523


>gi|325969240|ref|YP_004245432.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
 gi|323708443|gb|ADY01930.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
          Length = 668

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 31/47 (65%)

Query: 323 VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           V + ++V+ +QV+K AP+GE+L  GSFMI G + +    +L +GI  
Sbjct: 528 VLDVFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWIRNVELRLGIGL 574


>gi|414878086|tpg|DAA55217.1| TPA: hypothetical protein ZEAMMB73_507954 [Zea mays]
          Length = 522

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 350 MIRGKKNFFPPCQLAMGISFLFKLEESSIS 379
           MIRGKKNF PP  L MG   LF+L+ESS++
Sbjct: 1   MIRGKKNFLPPHPLVMGFGILFRLDESSLA 30


>gi|448303302|ref|ZP_21493251.1| fibronectin-binding A domain-containing protein [Natronorubrum
           sulfidifaciens JCM 14089]
 gi|445593087|gb|ELY47265.1| fibronectin-binding A domain-containing protein [Natronorubrum
           sulfidifaciens JCM 14089]
          Length = 716

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 57/268 (21%), Positives = 107/268 (39%), Gaps = 39/268 (14%)

Query: 62  STPSVMRQHQGFPIQEFESFAAAVDEFFSTAE-SHKIDLKAVQQERD---ALKKLENVKR 117
           +TP  + +H     + ++SF AA+D++F   E   + +     Q+ D    + K E +  
Sbjct: 272 ATPFPLEEHAELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIID 331

Query: 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRH 177
             +  +   EQ     +E+AEL+      VD  +  I+   A    W++IEAR ++    
Sbjct: 332 QQQGAIEGFEQQADDLREQAELLYAEYGLVDDILSTIQDARAQDRPWDEIEARFEEGAER 391

Query: 178 NDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
               A  +  + ++ +   + + +  +R ++     + HD  +  AD             
Sbjct: 392 GIEAAEAV--VGIDSSEGIVTVDIDGDRIDL-----VAHDGVEQNAD------------- 431

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK-----SAEKKTKQTLKDVQT----- 287
                Y  AKR  + K  A    + T +  E A +      A+ +  +   D  T     
Sbjct: 432 ---RLYTEAKRVAEKKEGALAAIEDTREDLEDAKRRRDEWDADDEGDEQADDEDTEETNW 488

Query: 288 --MTNINKARKVYWFEKFYWFISSENYL 313
             M +I       WF++F WF +S+ YL
Sbjct: 489 LEMPSIPIRENEPWFDRFRWFHTSDGYL 516



 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG++ +    P  +A+GI
Sbjct: 589 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGERTYHRDTPVGVAVGI 639


>gi|313126151|ref|YP_004036421.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
 gi|448285991|ref|ZP_21477228.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
 gi|312292516|gb|ADQ66976.1| predicted RNA-binding protein, snRNP like protein [Halogeometricum
           borinquense DSM 11551]
 gi|445575584|gb|ELY30057.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
          Length = 702

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 53/292 (18%), Positives = 116/292 (39%), Gaps = 84/292 (28%)

Query: 62  STPSVMRQHQ--GFPIQEFESFAAAVDEFF----STAE--------SHKIDLKAVQQERD 107
           +TP  +++H+  G   + +++F  A+DE+F     +AE        S++ D +A      
Sbjct: 256 ATPFPLKEHEAEGLNSESYDTFNEALDEYFFAFDRSAEDEPEEEPGSNRPDFEA------ 309

Query: 108 ALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
            ++K + +    E  +   EQ    ++E+AEL+  N E VD  +  +R      + W++I
Sbjct: 310 EIEKKKRIIEQQEGAIEGFEQQAERERERAELLYANYELVDEVLSTVRSARDESVPWDEI 369

Query: 168 EARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSE 227
              ++       P A  +  + ++    T+ + +   R                      
Sbjct: 370 RQTLEDGAERGIPAAEAV--VDVDGAEGTVTIEIDGTR---------------------- 405

Query: 228 TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ- 286
                 ++V++D+    NA R +        K+ K ++  ++   +A + T++ L +V+ 
Sbjct: 406 ------IEVEVDMGVEKNADRLY--------KEAKRVEGKKEGAMAAIEDTREELAEVKA 451

Query: 287 -------------------------TMTNINKARKVYWFEKFYWFISSENYL 313
                                    + ++I    +  W+E+F WF +S+ YL
Sbjct: 452 RRDAWEEDDEDDDEEPEEPEDIDWLSRSSIPLKTEEQWYEQFRWFHTSDGYL 503



 Score = 45.1 bits (105), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 27/40 (67%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
           + +   +A+ V  DQVSKT  +GE++  GSF+IRG + +F
Sbjct: 575 EGRYADDAYMVTPDQVSKTPESGEYIEKGSFVIRGDRTYF 614


>gi|156938202|ref|YP_001435998.1| hypothetical protein Igni_1415 [Ignicoccus hospitalis KIN4/I]
 gi|156567186|gb|ABU82591.1| protein of unknown function DUF814 [Ignicoccus hospitalis KIN4/I]
          Length = 644

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 37/57 (64%), Gaps = 3/57 (5%)

Query: 317 AW-DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGISFL 370
           AW + +   + ++VK  QVSK+ P G+++  G+F+I+GK+ +    P +LA+G+  +
Sbjct: 492 AWKEGRGSVDVFYVKGSQVSKSPPPGQYVAKGAFIIKGKREYVRDVPLRLALGVEMV 548


>gi|294496348|ref|YP_003542841.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
           5219]
 gi|292667347|gb|ADE37196.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
           5219]
          Length = 662

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 33/47 (70%), Gaps = 2/47 (4%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGISF 369
           + +W+  +QV+KT  +GE+L  GSF+IRG++N++   P + A+G+  
Sbjct: 542 DCYWIYPEQVTKTPESGEYLKKGSFIIRGERNYYRDVPVRAAVGLEL 588


>gi|327311796|ref|YP_004338693.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
 gi|326948275|gb|AEA13381.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
          Length = 623

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW A +   + ++V   QVSK  P+GE+L  GSFMI G KN+    +L + + 
Sbjct: 485 AWRAGIHAVDVFYVPGRQVSKQPPSGEYLARGSFMIYGSKNYIRHVRLELAVG 537


>gi|387209294|gb|AFJ69115.1| hypothetical protein NGATSA_3044600, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 106

 Score = 48.1 bits (113), Expect = 0.007,   Method: Composition-based stats.
 Identities = 29/101 (28%), Positives = 57/101 (56%), Gaps = 1/101 (0%)

Query: 100 KAVQQERDALKKLE-NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDI 158
           +AV+ + +A++     V+R++E RL  LE T+    + A L+  + ++VD  +L +R  I
Sbjct: 3   QAVRAQEEAVRSRPLRVQRENEARLKELEATEARLLDAARLVECHSDAVDKVLLVLRSAI 62

Query: 159 ANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLL 199
           A    W+ ++  +++ Q   +P+A +I   K + N +TL L
Sbjct: 63  ATGADWQTLDEYIRKEQAGGNPLARMITGSKWSDNKVTLSL 103


>gi|448385151|ref|ZP_21563730.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
           DSM 11522]
 gi|445657436|gb|ELZ10264.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
           DSM 11522]
          Length = 719

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V ADQVSKT  +GE+L  G F IRG + ++   P   A+GI
Sbjct: 592 DGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 642



 Score = 44.7 bits (104), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 56/270 (20%), Positives = 107/270 (39%), Gaps = 43/270 (15%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
           TP  + +H   P + ++SF +A+D++F     AE  + D    + + ++ + K E +   
Sbjct: 274 TPFPLEEHADLPAEGYDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 333

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
            +  +   EQ     +E+AEL+      VD  +  ++Q      +W++I  R ++     
Sbjct: 334 QQGAIEGFEQEAEQLRERAELLYAEYGLVDEILSTVQQAREQDRAWDEIRERFEEGADRG 393

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSN-IKDSWVLHHDNADPFADSSETKKPSLVDVD 237
             +A+    + ++ +  T+ + +  ER   + D  V    NAD                 
Sbjct: 394 --IAAAEAVIDVDGSEGTVTVDLDGERIELVADRGV--EQNADRL--------------- 434

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTL--------------K 283
                Y  AKR  D K  A    + T +  E A +  ++   Q                +
Sbjct: 435 -----YTEAKRVEDKKEGALAAIENTREDLEDAKRRRDEWEAQDAASDDEDEADDEGPKR 489

Query: 284 DVQTMTNINKARKVYWFEKFYWFISSENYL 313
           D     +I       WF++F WF +S++YL
Sbjct: 490 DWLADPSIPIRENEPWFDRFRWFHTSDDYL 519


>gi|433590765|ref|YP_007280261.1| putative RNA-binding protein, snRNP like protein [Natrinema
           pellirubrum DSM 15624]
 gi|448331831|ref|ZP_21521081.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
           15624]
 gi|433305545|gb|AGB31357.1| putative RNA-binding protein, snRNP like protein [Natrinema
           pellirubrum DSM 15624]
 gi|445628400|gb|ELY81707.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
           15624]
          Length = 721

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V ADQVSKT  +GE+L  G F IRG + ++   P   A+GI
Sbjct: 594 DGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 644



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 58/260 (22%), Positives = 110/260 (42%), Gaps = 23/260 (8%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
           TP  + +H   P + ++SF +A+D++F     AE  + D    + + ++ + K E +   
Sbjct: 276 TPFPLEEHADLPAEGYDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 335

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
            +  +   EQ     +E+AEL+      VD  +  ++       +W++I  R ++     
Sbjct: 336 QQGAIEGFEQEAEQLRERAELLYAEYGLVDEILSTVQGAREQDRAWDEIRERFEEGADRG 395

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSN-IKDSWVLHHDNADPF---ADSSETKKPSLV 234
             +A+    + ++ +  T+ + +  ER   + D  V    NAD     A   E KK    
Sbjct: 396 --IAAAEAVIDVDGSEGTVTVDLDGERIELVADRGV--EQNADRLYTEAKRVEDKKEGA- 450

Query: 235 DVDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
                L+A  N +    D KR   + + K   S ++     E   +  L D       N+
Sbjct: 451 -----LAAIENTREDLEDAKRRRDEWEAKDAASDDEDEADDEGPNRDWLADPSIPIRENE 505

Query: 294 ARKVYWFEKFYWFISSENYL 313
                WF++F WF +S++YL
Sbjct: 506 P----WFDRFRWFHTSDDYL 521


>gi|448409564|ref|ZP_21574778.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
 gi|445672910|gb|ELZ25479.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
          Length = 729

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 32/61 (52%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           D K   + + V +DQVSKT  +GEFL  G F IRG + ++   ++ + +    + E   I
Sbjct: 602 DGKFAGDVYEVGSDQVSKTPESGEFLEKGGFAIRGDRTYYESTEVGVAVGITCEPETRVI 661

Query: 379 S 379
            
Sbjct: 662 G 662



 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 53/267 (19%), Positives = 104/267 (38%), Gaps = 27/267 (10%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE--------RDALKKLEN 114
           TP  + ++     + F+SF  A+D++F+ A     +                  + K E 
Sbjct: 274 TPIPLAEYDDRYGEPFDSFNEALDDYFTFASDEDDEGGGDAAGGDRGRPDFESEIAKHER 333

Query: 115 VKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQA 174
           +    +  +   E     ++  AE +  N + VD  +  +++  A   SW+DIE R  + 
Sbjct: 334 IIEQQQGAIEDFEAQAERERANAEALYANYDLVDDILSTVQEARAEDRSWDDIEERFAEG 393

Query: 175 QRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKP 231
            R   P A  +  + L+ +  T+ + +  ER  +  S  +   NAD     A   E KK 
Sbjct: 394 ARQGIPAAEAV--VSLDGSEGTVTIDIDGERVTLAASEGVEK-NADRLYREAKRIEGKKE 450

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM--- 288
                    +  A A+   +L+    +K +       +A    ++       D + +   
Sbjct: 451 G--------AEEAIAQTRSELEAVEERKAEWEAADAGEAGSGGDESEGSDEDDDEPVDWL 502

Query: 289 --TNINKARKVYWFEKFYWFISSENYL 313
              +I   +  +WFE + WF +S+ +L
Sbjct: 503 AEPSIPVRQSDHWFEDYRWFHTSDGFL 529


>gi|452206612|ref|YP_007486734.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
 gi|452082712|emb|CCQ35980.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
          Length = 703

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 35/56 (62%), Gaps = 2/56 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLE 374
           D K   + + V ADQVSKT  +GE++  GSF+IRG + ++    +++G+S   K E
Sbjct: 576 DGKFEGDVYEVDADQVSKTPESGEYVEKGSFVIRGDREYY--HDVSVGVSVGIKCE 629



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 52/278 (18%), Positives = 103/278 (37%), Gaps = 56/278 (20%)

Query: 62  STPSVMRQHQ--GFPIQEFESFAAAVDEFFS--TAESHKIDLKAVQQER----DALKKLE 113
           +TP  +R+H+  G+    FESF  A+DE+F     ES   +      +R      ++K E
Sbjct: 257 ATPLPLREHETEGYDATAFESFNGAIDEYFYRLETESETEEQAGAGTDRPDFESEIEKYE 316

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
            +    E  + + ++    ++ KAE +  N + +D     +R    + + W +IE   + 
Sbjct: 317 RIIEQQEGAIESYDEQADEEQRKAESLYGNYDLIDEICSTVRAAREDGVPWAEIEETFEA 376

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
                   A  +  +      +T+ L                                  
Sbjct: 377 GAERGIEAAEAVVSVDGAEGTVTVDL------------------------------GDGP 406

Query: 234 VDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTK----------- 279
           +++D  +    NA R + + KR   KK+  Q  I+ T + L + E++ +           
Sbjct: 407 IELDPTVGVERNADRLYTEAKRVRGKKEGAQAAIEDTREDLAAVERRREAWEAEDADEGE 466

Query: 280 ----QTLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
                   D   ++++       W E+F WF +S+ +L
Sbjct: 467 DEDDDAETDYLAVSSVPVRYDEKWHERFRWFRTSDGFL 504


>gi|448399812|ref|ZP_21571045.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
           13563]
 gi|445668265|gb|ELZ20895.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
           13563]
          Length = 722

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V ADQV+KT  +GE+L  G F IRG + ++   P   A+GI
Sbjct: 595 DGRYAGDVYAVDADQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 645



 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 54/259 (20%), Positives = 105/259 (40%), Gaps = 19/259 (7%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAE-SHKIDLKAVQQERD---ALKKLENVKRD 118
           TP  + +H     + +E+F  A+D++F   E     + +  +Q  D    + K E +   
Sbjct: 275 TPFPLEEHTDLDSEPYETFLEALDDYFFQLELGEDEEPEPTEQRPDFDSEIAKYERIIEQ 334

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
            +  +   EQ     +E+AEL+      VD  +  I+        W++I  R +      
Sbjct: 335 QQGAIEGFEQEADALREQAELLYAEYGLVDEILSTIQDARVQDRPWDEIRERFEAGAEQG 394

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS----ETKKPSLV 234
              A  +  + ++ +  T+ + +  ER ++     +   NAD     +    E K+ +L 
Sbjct: 395 IEAAEAV--VDVDGSEGTVTVDLDGERIDLVVEQGVEQ-NADRLYTEAKRVEEKKEGALA 451

Query: 235 DVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
            ++       +AKR  D     A +++ T +  E     AE++      D     +I   
Sbjct: 452 AIEDTREDLEDAKRRRD--EWEATEREDTSEDGEDEADEAEQR------DWLAEPSIPIR 503

Query: 295 RKVYWFEKFYWFISSENYL 313
               WF++F WF +S+ YL
Sbjct: 504 ENEPWFDRFRWFHTSDGYL 522


>gi|76801680|ref|YP_326688.1| hypothetical protein NP2070A [Natronomonas pharaonis DSM 2160]
 gi|76557545|emb|CAI49126.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160]
          Length = 699

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 56/282 (19%), Positives = 112/282 (39%), Gaps = 66/282 (23%)

Query: 62  STPSVMRQH--QGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------L 109
           +TP  + +H  +G+    FE F  A+D +F     H++  +A + E D           +
Sbjct: 256 ATPLPLEEHTAEGYDATAFEHFNGAIDAYF-----HRLQAEA-ETETDTGDDKPDFESEI 309

Query: 110 KKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEA 169
            K E +    +  +   E+   V+++KAEL+  N + VD     ++      + W++IE 
Sbjct: 310 AKFERIIEQQQGAIEEYEKQAEVEQQKAELLYGNYDLVDEICSTVQSAREEGVPWDEIET 369

Query: 170 RVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETK 229
             ++        A+ +  +      +T+ L                        D  E  
Sbjct: 370 TFEEGAERGIDAAAAVVGVDAAEGTVTIDL------------------------DDKE-- 403

Query: 230 KPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKALKSAEKKTKQ------ 280
               +D+D  +    NA R + + KR   KK+  Q  I+ T + L+  +++ +Q      
Sbjct: 404 ----IDLDPTMGVEKNADRLYQEAKRVRGKKEGAQAAIEDTREDLEDVKERRRQWEADDD 459

Query: 281 ---------TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
                      +D  +M ++       W+E+F WF +S+++L
Sbjct: 460 EDDDADEESPDRDYLSMASVPVRYDEKWYEQFRWFRTSDDFL 501



 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 30/50 (60%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           D K   + + V  DQVSKT  +GE++  G F+IRG +N++   Q+ + + 
Sbjct: 573 DGKFEGDVYEVDPDQVSKTPESGEYIEKGGFVIRGDRNYYRDMQVGVAVG 622


>gi|448391228|ref|ZP_21566471.1| fibronectin-binding A domain-containing protein [Haloterrigena
           salina JCM 13891]
 gi|445666097|gb|ELZ18766.1| fibronectin-binding A domain-containing protein [Haloterrigena
           salina JCM 13891]
          Length = 723

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P  +A+GI
Sbjct: 596 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVDVAVGI 646



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 69/327 (21%), Positives = 136/327 (41%), Gaps = 37/327 (11%)

Query: 3   SNEDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE-DYIVANMEFHPMILQPG 61
           ++EDV   I+     L + + N N   + YL  + +    SE D    + +  P  +   
Sbjct: 218 ADEDVYDRIYGAIERLALDLRNGNFDPRLYLADEGDDDNESESDENGGDGDSSPDRVV-D 276

Query: 62  STPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKK 111
           +TP  + +H     + ++SF AA+D++F     ++++L   ++E D           + K
Sbjct: 277 ATPFPLEEHVELASEPYDSFLAALDDYF-----YRLELAEDEEETDPTTQRPDFEEEIAK 331

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
            E +    +  +   EQ     +E+AEL+      VD  +  +++  A    WE+IE R 
Sbjct: 332 YERIIEQQQGAIEGFEQEADALREQAELLYAEYGLVDDILSTVQEARAQDRPWEEIEERF 391

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSET 228
            +       +A+    + ++ +  T+ + +  ER ++     +   NAD     A   E 
Sbjct: 392 AEGADRG--IAAAEAVVDVDGSEGTVTVELDGERIDLVAKQGVEQ-NADRLYTEAKRVEE 448

Query: 229 KKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKDVQ 286
           KK         L+A  + +   DL  + A++ +  +   + E      E   +  L +  
Sbjct: 449 KKEGA------LAAIEDTRE--DLAEAKARRDRWEEEDAAAEGDDDEDEDDDRDWLSEPS 500

Query: 287 TMTNINKARKVYWFEKFYWFISSENYL 313
                N+     WF++F WF +S+ YL
Sbjct: 501 VPIRENEP----WFDRFRWFHTSDGYL 523


>gi|448329966|ref|ZP_21519260.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
           10478]
 gi|445613154|gb|ELY66864.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
           10478]
          Length = 720

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQVSKT  +GE+L  G F IRG + ++   P   A+GI
Sbjct: 593 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 643



 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 67/325 (20%), Positives = 139/325 (42%), Gaps = 36/325 (11%)

Query: 3   SNEDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSE------DYIVANMEFHPM 56
           ++EDV   +++    L + + N N   + YL +  E+  + E      D  V ++     
Sbjct: 218 ADEDVYERLYETIERLALDIRNGNFDPRLYLERDDEEADDGEGESEDADANVVDV----- 272

Query: 57  ILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKL 112
                 TP  + +H     + ++SF +A+D++F     AE  + D    + + ++ + K 
Sbjct: 273 ------TPFPLEEHDDLDGEAYDSFLSALDDYFFRLELAEEEESDPTDQRPDFESEIAKQ 326

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    +  +   EQ     +E+AEL+      VD  +  I+       SW++I  R +
Sbjct: 327 ERIIEQQQGAIEGFEQEAEELREQAELLYAEYGLVDDILSTIQGAREQDRSWDEIRERFE 386

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
           +        A  +  + ++ +  T+ + +  ER  +     +   NAD     A   E K
Sbjct: 387 EGAEQGIDAAEAV--VDVDGSDGTVTVDIDGERIGLVAGRGVEQ-NADRLYTEAKRVEEK 443

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQ-KTIQSTEKALKSAEKKTKQTLKDVQTM 288
           K         L+A  N +   DL+ +  ++ + +  +S   A   +++  ++T +D  + 
Sbjct: 444 KEGA------LAAIENTRE--DLEEAKRRRDEWEADESGPAAETESDEDEEETQRDWLSE 495

Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
            +I       WF++F WF +S++YL
Sbjct: 496 PSIPIRENEPWFDRFRWFQTSDDYL 520


>gi|352682802|ref|YP_004893326.1| putative RNA-binding protein [Thermoproteus tenax Kra 1]
 gi|350275601|emb|CCC82248.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Thermoproteus tenax Kra 1]
          Length = 624

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW A +   + ++V   QVSK  P+GE+L  GSFMI G +N+    +L + I 
Sbjct: 486 AWKAGIHSIDVFYVFGSQVSKQPPSGEYLARGSFMIYGTRNYIRHVRLELAIG 538


>gi|448313587|ref|ZP_21503301.1| fibronectin-binding A domain-containing protein [Natronolimnobius
           innermongolicus JCM 12255]
 gi|445597955|gb|ELY52026.1| fibronectin-binding A domain-containing protein [Natronolimnobius
           innermongolicus JCM 12255]
          Length = 723

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P  +A+GI
Sbjct: 596 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 646


>gi|284166116|ref|YP_003404395.1| fibronectin-binding A domain-containing protein [Haloterrigena
           turkmenica DSM 5511]
 gi|284015771|gb|ADB61722.1| Fibronectin-binding A domain protein [Haloterrigena turkmenica DSM
           5511]
          Length = 723

 Score = 47.0 bits (110), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P  +A+GI
Sbjct: 596 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVDVAVGI 646



 Score = 41.6 bits (96), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 53/280 (18%), Positives = 107/280 (38%), Gaps = 60/280 (21%)

Query: 62  STPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKK 111
           +TP  + +H     + ++SF AA+D++F     ++++L   ++E D           + K
Sbjct: 276 ATPFPLEEHVELASEPYDSFLAALDDYF-----YRLELADDEEETDPTTQRPDFEEEIAK 330

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
            E +       +   EQ     +E+AEL+      VD  +  +++  A    W++IE R 
Sbjct: 331 YERIIEQQRGAIEGFEQEADALREQAELLYAEYGLVDDILSTVQEARAQDRPWDEIEERF 390

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
            +       +A+    + ++ +  T+ + +  ER ++     +   NAD           
Sbjct: 391 AEGADRG--IAAAEAVVNVDGSEGTVTVELDGERIDLVAKQGVEQ-NADRL--------- 438

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ----------- 280
                      Y  AKR  + K  A       I+ T + L  A+ +  +           
Sbjct: 439 -----------YTEAKRVGEKKEGAL----AAIEDTREDLGEAKARRDRWEEADAADEGE 483

Query: 281 -------TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
                    +D  +  ++       WF++F WF +S+ YL
Sbjct: 484 DDEDDEGEERDWLSEPSVPIRENEPWFDRFRWFHTSDGYL 523


>gi|336253827|ref|YP_004596934.1| Fibronectin-binding A domain-containing protein [Halopiger
           xanaduensis SH-6]
 gi|335337816|gb|AEH37055.1| Fibronectin-binding A domain protein [Halopiger xanaduensis SH-6]
          Length = 718

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P  +A+GI
Sbjct: 591 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 641



 Score = 44.7 bits (104), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 55/274 (20%), Positives = 109/274 (39%), Gaps = 50/274 (18%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
           TP  + +H+  G   + ++SF  A+D++F   E    +     ++R    + + K E + 
Sbjct: 272 TPFPLEEHEAEGLVGEAYDSFLTALDDYFFRLELEDEEEPDPTEQRPDFEEEIAKHERII 331

Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
              +  +   EQ     +EKAEL+      VD  +  IR        W++IE R ++ + 
Sbjct: 332 EQQQGAIEGFEQEAEQLREKAELLYARYGLVDDILSTIRNAREQDRPWDEIEERFEEGKE 391

Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDV 236
                A  +  + ++ +   + + +  ER +++    +   NAD                
Sbjct: 392 RGIEAAEAV--VGIDGSEGIVTVDIDGERIDLEARQGVEQ-NADRL-------------- 434

Query: 237 DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ---------------- 280
                 Y  AKR  + K  A       I+ T + L+ A+++ +Q                
Sbjct: 435 ------YTEAKRVEEKKEGAL----AAIEDTREDLEEAKRRREQWEAEDAGEDDADDEDE 484

Query: 281 -TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
              KD  +M ++       W+++F WF +S++YL
Sbjct: 485 GEDKDWLSMPSVPIRENEPWYDRFRWFHTSDDYL 518


>gi|124485365|ref|YP_001029981.1| hypothetical protein Mlab_0540 [Methanocorpusculum labreanum Z]
 gi|124362906|gb|ABN06714.1| protein of unknown function DUF814 [Methanocorpusculum labreanum Z]
          Length = 642

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 77/364 (21%), Positives = 137/364 (37%), Gaps = 116/364 (31%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           PS++R+    P + F +F+ A++ F+    + K     V +++  L K E +++  E  +
Sbjct: 248 PSLVRE----PKEHFATFSQALEAFYPKPVAEK-----VIEQKIKLSKEERIRKQQEAAV 298

Query: 124 SALEQTQLVDKEKAELIINN----QESVDAAILAIRQDIANQLSWEDIEARVKQAQRHND 179
              ++      E +E+I ++    QE++D    A     + +LSW+DI A +K++   + 
Sbjct: 299 VNFDKKIAEATEISEIIYSHYGEVQETIDVLAAA-----SQKLSWQDIAAVIKKS---DL 350

Query: 180 PVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD 239
           P A  I  +                             NA    D  E  K   V + + 
Sbjct: 351 PAAKRIISV--------------------------DPKNASVVIDLQEKHK---VTIFVH 381

Query: 240 LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYW 299
            S  AN  R+F + +    K+   +++ E  +  AEKK       +          K  W
Sbjct: 382 ESLEANVGRYFAVVKKFRAKKAGALRAMEAGIVHAEKKKAAGPGRL----------KPKW 431

Query: 300 FEKFYWFISSENYL------------------------------GNVAWDAKVVT----- 324
           + +F W  +S+  L                              G  A   K VT     
Sbjct: 432 YHRFRWMETSDGVLVIGGRNADQNEELVKKYMEGKDTFLHADVFGASAVIVKGVTERMDQ 491

Query: 325 ---------NAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQL 363
                     AW              +QVSKT  +GE++  GSF+IRG++  +   P ++
Sbjct: 492 AVQFAASYSRAWAGGGASVDVIAASPNQVSKTPESGEYVAHGSFVIRGERKIYKDVPLEI 551

Query: 364 AMGI 367
           A+G+
Sbjct: 552 AIGV 555


>gi|269986196|gb|EEZ92508.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
           acidiphilum ARMAN-4]
          Length = 587

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW       + +++K +QV+KT P+GE L  G+F I GK+++    QL + IS
Sbjct: 463 AWRVGATNLDVYFIKPEQVTKTPPSGESLKKGAFYIEGKRDYIKNAQLGIYIS 515


>gi|383621605|ref|ZP_09948011.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
 gi|448702236|ref|ZP_21699890.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
 gi|445777606|gb|EMA28567.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
          Length = 718

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P  +A+GI
Sbjct: 591 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGVAVGI 641



 Score = 45.4 bits (106), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 55/278 (19%), Positives = 110/278 (39%), Gaps = 57/278 (20%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
           TP  + +H+  G   + ++SF  A+D++F   E  + +     ++R    + + K + + 
Sbjct: 271 TPFPLEEHEAEGLSAEAYDSFLTALDDYFFRLELDEEEEPDPTEQRPDFEEEIAKHQRII 330

Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
              E  +   EQ     +E+AE +      VD  +  IRQ       W++IE R ++ + 
Sbjct: 331 EQQEGAIEGFEQQADELREQAESLYAEYGLVDEVLSTIRQARKQDRPWDEIEQRFEEGKE 390

Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDV 236
                A  +  + L+ +  T+ + V  ER                            +D+
Sbjct: 391 RGIEAAETV--VDLDGSEGTVTVEVDGER----------------------------IDL 420

Query: 237 DLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQ------------- 280
            +D     NA R + + KR   KK+     I+ T + L+ A+++  Q             
Sbjct: 421 VVDDGVEQNADRLYTEAKRVEEKKEGALAAIEDTREDLEDAKRRRDQWEAEDAAEDDEDD 480

Query: 281 -----TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
                  ++  +M ++       W+++F WF +S+ YL
Sbjct: 481 DDEDEEERNWLSMPSVPIRENEPWYDRFRWFHTSDGYL 518


>gi|448300325|ref|ZP_21490327.1| fibronectin-binding A domain-containing protein [Natronorubrum
           tibetense GA33]
 gi|445586054|gb|ELY40340.1| fibronectin-binding A domain-containing protein [Natronorubrum
           tibetense GA33]
          Length = 726

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 31/51 (60%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P  +A+GI
Sbjct: 599 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDDTPVGVAVGI 649



 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 64/324 (19%), Positives = 136/324 (41%), Gaps = 28/324 (8%)

Query: 3   SNEDVSSDIHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDY----IVANMEFHPMIL 58
           + EDV   +++    L + + N N   + Y   + E   +SED        N +  P  +
Sbjct: 218 AGEDVYDRLYEAIERLALDLRNSNFDPRLYFADEGEDGDDSEDGGENETGDNGDSSPERV 277

Query: 59  QPGSTPSVMRQHQGFPIQEFESFAAAVDEFF------STAESHKIDLKAVQQERDALKKL 112
              +TP  + +H   P + ++++ +A+D++F         E    D +   +E   + K 
Sbjct: 278 V-DATPFPLEEHVELPSEPYDTYLSALDDYFFRLELEEEGEPDPTDQRPDFEEE--IAKQ 334

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    +  +   EQ   + +E+AE +      VD  +  I++  A    W++IE R +
Sbjct: 335 ERIIEQQQGAIEGFEQEADMLREQAESLYAEYGLVDDILSTIQEARAQDRPWDEIEERFE 394

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
                    A  +  + ++ +   + + V  E  +++ +  +   NAD     A + E K
Sbjct: 395 AGAEQGIEAAEAV--IDVDGSEGVVTVDVDGEYIDLETTQGVEQ-NADRLYTEAKAVEDK 451

Query: 230 KPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMT 289
           K         LSA  N ++  DL+  A +++ +      +       + ++  +D  +M 
Sbjct: 452 KEGA------LSAIENTRK--DLQ-EAKRRRDQWEADDGEDEGDDADEEEREDRDWLSMP 502

Query: 290 NINKARKVYWFEKFYWFISSENYL 313
           ++       W+++F WF +S+ YL
Sbjct: 503 SVPVRENEPWYDRFRWFYTSDGYL 526


>gi|119872023|ref|YP_930030.1| hypothetical protein Pisl_0509 [Pyrobaculum islandicum DSM 4184]
 gi|119673431|gb|ABL87687.1| protein of unknown function DUF814 [Pyrobaculum islandicum DSM
           4184]
          Length = 613

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 79/203 (38%), Gaps = 75/203 (36%)

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKA----------LKSAEKKTKQT 281
           ++V+++ D+       R F+  +   +K  K  Q  EK           +  AE+K K +
Sbjct: 335 TVVELEKDIPVGRQISRMFEHAKELEEKANKAAQVVEKLRRDLAKLEEEIHRAEEKVKSS 394

Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISS-----------------------ENYL---GN 315
           +K V     + +A    WFEKF W I++                       E+YL    +
Sbjct: 395 VKIV-----VKRA----WFEKFRWSITTGKRPIIGGRDASQNETIVRKYLREHYLFFHAD 445

Query: 316 VAWDAKVV--------------------TNAW----------WVKADQVSKTAPTGEFLT 345
           +   + VV                    + AW          +V+ +QVSK AP G++L 
Sbjct: 446 IPGASVVVMPPSEDPLELLQTAQFAAAYSKAWKIGIHSIDVYYVRGEQVSKHAPAGQYLA 505

Query: 346 TGSFMIRGKKNFFPPCQLAMGIS 368
            GSFMI GK+ +    +L + + 
Sbjct: 506 RGSFMIYGKREYIRHVRLELAVG 528


>gi|88601740|ref|YP_501918.1| hypothetical protein Mhun_0437 [Methanospirillum hungatei JF-1]
 gi|88187202|gb|ABD40199.1| protein of unknown function DUF814 [Methanospirillum hungatei JF-1]
          Length = 627

 Score = 46.2 bits (108), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 37/54 (68%), Gaps = 3/54 (5%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           AW A   + + +  + DQVSKTA +GE+L+ GSF++RG++ +F   P ++ +G+
Sbjct: 495 AWRAGFASADVYAARPDQVSKTAESGEYLSRGSFVVRGERQWFHDVPLEVVIGL 548


>gi|448578556|ref|ZP_21643976.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
 gi|445725734|gb|ELZ77354.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
          Length = 702

 Score = 45.8 bits (107), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 55/289 (19%), Positives = 109/289 (37%), Gaps = 81/289 (28%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQE---------RDALKK 111
           TP  +R+HQ  G   + ++SF  A+DE+F      ++DL A +Q+            + K
Sbjct: 258 TPFPLREHQNDGLDEEAYDSFNEALDEYF-----FRLDLTADEQQDVGSNRPDFEAQIAK 312

Query: 112 LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARV 171
            E +    E  +   ++    ++E+AEL+  N + VD  +  +R      + W+DI  ++
Sbjct: 313 QERIIEQQEGAIEGFDERAAAERERAELLYANYDLVDDVLSTVRDAREEGVPWDDIAEKL 372

Query: 172 KQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKP 231
                   P A  +  +      +T+ L                                
Sbjct: 373 DAGAEQGIPAAEAVTNVDGAEGTVTIEL------------------------------DD 402

Query: 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQ----- 286
           S + +D+ +    NA R +         + K IQ  ++   +A + T++ L++V+     
Sbjct: 403 STITLDVSMGVEKNADRLY--------TEAKRIQEKKEGALAAIEDTREELEEVKRRRDE 454

Query: 287 ----------------------TMTNINKARKVYWFEKFYWFISSENYL 313
                                 ++ ++      YW+E+F WF +S+ YL
Sbjct: 455 WEADDDEDDAEDEEEQEETDWLSLQSVPVKSTDYWYEQFRWFHTSDGYL 503



 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   +A+ V+  QVSKT  +GE++  GSF+IRG + ++     ++A+GI
Sbjct: 575 DGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRTYYNDVAAEVAVGI 625


>gi|159040762|ref|YP_001540014.1| hypothetical protein Cmaq_0175 [Caldivirga maquilingensis IC-167]
 gi|157919597|gb|ABW01024.1| protein of unknown function DUF814 [Caldivirga maquilingensis
           IC-167]
          Length = 650

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 30/45 (66%)

Query: 325 NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           + ++VK +QVSK AP+GE+L+ G FMI G++ +    +L + +  
Sbjct: 513 DVYYVKGEQVSKKAPSGEYLSKGGFMIYGQRGWVRGVELIISVGL 557


>gi|448414286|ref|ZP_21577425.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
           14848]
 gi|445682579|gb|ELZ34996.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
           14848]
          Length = 701

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQ--LAMGI 367
           + +    A+ V  DQVSKT  +GE+L  GSF+IRG + +F   +  +A+G+
Sbjct: 575 EGRYAGEAYMVTPDQVSKTPESGEYLEKGSFVIRGDRTYFRDVEAKVAVGV 625



 Score = 38.5 bits (88), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 55/284 (19%), Positives = 118/284 (41%), Gaps = 68/284 (23%)

Query: 62  STPSVMRQHQ--GFPIQEFESFAAAVDEFF----STAE--------SHKIDLKAVQQERD 107
           +TP  + +H+  G   +  ++F  A+DE+F     TAE        S++ D +A      
Sbjct: 256 ATPFPLEEHEAEGLNSESHDTFNDALDEYFFRLDRTAEDEPDEEPGSNRPDFEA------ 309

Query: 108 ALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI 167
            ++K + + +  E  +   E+    ++E+AEL+  + + VD  +  +R      + W+DI
Sbjct: 310 EIEKKKRIIQQQEGAIEGFEEQAQEERERAELLYAHYDLVDEVLTTVRDAREENVPWDDI 369

Query: 168 EARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSE 227
             R+++      P A  +  +      +T+ L                            
Sbjct: 370 RQRLEEGAERGIPAAESVVDVDGAEGTVTVEL---------------------------- 401

Query: 228 TKKPSLVDVDLDLSAYANAKRFF-DLKRSAAKKQQ--KTIQSTEKALKSAEKKTKQTLKD 284
             + + ++V +D     NA R + + KR   KK+     ++ T + L  A+++ ++  ++
Sbjct: 402 --EDTRIEVVVDTGVEKNADRLYTEAKRVEGKKEGALAAVEDTREELAEAKRRREEWEEE 459

Query: 285 VQ---------------TMTNINKARKVYWFEKFYWFISSENYL 313
            +               + ++I    + +WFE+F WF +S+ YL
Sbjct: 460 DEDDEEEDEEPEDIDWLSRSSIPLRTEEHWFERFRWFHTSDGYL 503


>gi|448377770|ref|ZP_21560466.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
           14624]
 gi|445655714|gb|ELZ08559.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
           14624]
          Length = 736

 Score = 45.8 bits (107), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V ADQV+KT  +GE+L  G F IRG + +    P  +A+GI
Sbjct: 609 DGRYAGDVYAVDADQVTKTPESGEYLEKGGFAIRGDRTYHRDTPVDVAVGI 659


>gi|289580546|ref|YP_003479012.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
 gi|448284209|ref|ZP_21475471.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
 gi|289530099|gb|ADD04450.1| Fibronectin-binding A domain protein [Natrialba magadii ATCC 43099]
 gi|445571291|gb|ELY25845.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
          Length = 727

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P   A+GI
Sbjct: 600 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 650



 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 53/258 (20%), Positives = 111/258 (43%), Gaps = 16/258 (6%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
           TP  + +H   GF  + +++F  A+D++F   E    +      +R    + + K E + 
Sbjct: 279 TPFPLEEHDEAGFDGEPYDTFLDALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHERII 338

Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
              +  +   EQ     +E AEL+  N   VD  +  I++  A    W++IEAR +    
Sbjct: 339 EQQQGAIEGFEQEAENLRENAELLYANYGLVDDILSTIQEARAQDRPWDEIEARFEDGAE 398

Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD- 235
                A  +    ++++    ++ V  +   I+   ++ HD  +  AD   T+   + + 
Sbjct: 399 QGIEAAEAV----IDVDGSEGIVTVDVDGEYIE---LVAHDGVEQNADRLYTEAKRVAEK 451

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKAR 295
            +  L+A  + +   DLK +  ++ +   Q  +      ++  +   +D    ++I    
Sbjct: 452 KEGALAAIEDTRE--DLKDAKRRRDEWEEQDGKPGAGDEDEDDEDDDRDWLAESSIPIRE 509

Query: 296 KVYWFEKFYWFISSENYL 313
              WF++F WF +S+ YL
Sbjct: 510 NEPWFDRFRWFHTSDGYL 527


>gi|448317278|ref|ZP_21506835.1| fibronectin-binding A domain-containing protein [Natronococcus
           jeotgali DSM 18795]
 gi|445604315|gb|ELY58265.1| fibronectin-binding A domain-containing protein [Natronococcus
           jeotgali DSM 18795]
          Length = 717

 Score = 45.8 bits (107), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P   A+GI
Sbjct: 590 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 640


>gi|435848081|ref|YP_007310331.1| putative RNA-binding protein, snRNP like protein [Natronococcus
           occultus SP4]
 gi|433674349|gb|AGB38541.1| putative RNA-binding protein, snRNP like protein [Natronococcus
           occultus SP4]
          Length = 712

 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F IRG + ++   P   A+GI
Sbjct: 585 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 635



 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 55/273 (20%), Positives = 104/273 (38%), Gaps = 48/273 (17%)

Query: 62  STPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENV 115
           +TP  + +H+  G   +  +SF  A+D++F   E    +     ++R    + + K E +
Sbjct: 267 ATPFPLEEHEDEGLTAEAADSFLGALDDYFFELELEDEEEPDPTEQRPDFEEEIAKHERI 326

Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
               +  +   EQ     +E AEL+    E VD  +  I++       W++IE R ++ +
Sbjct: 327 IEQQQGAIEGFEQQAQAQRENAELLYARYELVDDILSTIQEARTQDRPWDEIEERFEEGK 386

Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD 235
                 A  +  +      +T+ L   +E   + D  V    NAD               
Sbjct: 387 ERGIEAAEAVVGVDGTEGIVTVELD-GEEIDLVADDGV--EQNADRL------------- 430

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQ--------------- 280
                  Y  AKR  + K  A       I+ T + L+ A+++  +               
Sbjct: 431 -------YTEAKRIEEKKEGAL----AAIEDTREDLEDAKRRRDEWEATDDHEDDDDEED 479

Query: 281 TLKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
             K+   M ++       W+++F WF +S+ YL
Sbjct: 480 EEKNWLEMASVPIRENEPWYDRFRWFHTSDGYL 512


>gi|154150873|ref|YP_001404491.1| hypothetical protein Mboo_1330 [Methanoregula boonei 6A8]
 gi|153999425|gb|ABS55848.1| protein of unknown function DUF814 [Methanoregula boonei 6A8]
          Length = 631

 Score = 45.4 bits (106), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 69/373 (18%), Positives = 133/373 (35%), Gaps = 104/373 (27%)

Query: 54  HPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLE 113
           HP+I +    P V+ ++      +F  F+ A++ F+   ++ K+ + A    R  L + E
Sbjct: 232 HPVISKSSCEPVVLAENAPQDENQFAGFSDALEVFYPMTKAEKVKVAA----RPKLSEGE 287

Query: 114 NVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ 173
            +++  E  +   ++     +E    I  N   +   I ++    + +LSW++IE  +K 
Sbjct: 288 RIRKYQEAAIKKFDEKVAKAEEVVAAIYENYPFISQVITSLAA-ASKRLSWQEIEHHLKD 346

Query: 174 AQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
                            +   IT                        P   + E      
Sbjct: 347 TS-------------STDAKRITAFF---------------------PGEAAVEVDIGKK 372

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
           V + +  +   NA  ++D  +   KK++  +     A+K+ + + K    D+  M  +  
Sbjct: 373 VKIFVHETVEQNAGHYYDQIKKFKKKKEGALL----AMKTVKPRKKVIRHDIVPMKKL-- 426

Query: 294 ARKVYWFEKFYWFISSENY--LGN------------------------------------ 315
                W+ +F WFI+S+    LG                                     
Sbjct: 427 -----WYHRFRWFITSDGVVVLGGRDAGQNEELVKKYMTGGDLFVHADVHGASVVIVKGK 481

Query: 316 ------VAWDAKVVTNAWW----------VKADQVSKTAPTGEFLTTGSFMIRGKKNFFP 359
                 VA  A   + AW            +  QVSKT   GEF+  GSF++RG++ ++ 
Sbjct: 482 TEKMDEVAQFAASYSGAWRSGHFTADVFSAQPTQVSKTPQAGEFVARGSFIVRGERTYYR 541

Query: 360 PCQLAMGISFLFK 372
              L++GI  + +
Sbjct: 542 DVPLSVGIGLVLE 554


>gi|116754828|ref|YP_843946.1| hypothetical protein Mthe_1534 [Methanosaeta thermophila PT]
 gi|116666279|gb|ABK15306.1| protein of unknown function DUF814 [Methanosaeta thermophila PT]
          Length = 641

 Score = 45.4 bits (106), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 51/260 (19%), Positives = 103/260 (39%), Gaps = 59/260 (22%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRL 123
           P  +  ++G   + FE F+ A+DEFF  AE     L A+++  +  +   +  R  ET+L
Sbjct: 249 PFPLEVYKGLEARSFERFSDALDEFF-VAEPEMPKLSALERRLELQRAAIDELRAKETQL 307

Query: 124 SALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPVAS 183
           +++           + I      +D+ + AI       LS+ DI  R++ + +       
Sbjct: 308 ASM----------GDFIYQRYSEIDSILKAIAGARERGLSYTDIWERIQSSGKSAVKSLD 357

Query: 184 IIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAY 243
              ++ + I+ +TL                                     +++  L+  
Sbjct: 358 YSGEMIVEIDGVTL-------------------------------------ELNAGLTVP 380

Query: 244 ANAKRFFD---LKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWF 300
            NA R+++        A   ++ ++ TE  L+  E++ +  +        + +  K  WF
Sbjct: 381 QNAGRYYERAKEAAKKAAGAEEALRRTEDLLQRGEERRRSPV--------LKRRHKPRWF 432

Query: 301 EKFYWFISSENYLGNVAWDA 320
           E+F WF SS+++L     DA
Sbjct: 433 ERFRWFYSSDDFLVIGGRDA 452


>gi|448361523|ref|ZP_21550140.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
           DSM 12278]
 gi|445650542|gb|ELZ03465.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
           DSM 12278]
          Length = 720

 Score = 45.4 bits (106), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F +RG + ++   P   A+GI
Sbjct: 593 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 643


>gi|448348947|ref|ZP_21537792.1| fibronectin-binding A domain-containing protein [Natrialba
           taiwanensis DSM 12281]
 gi|445641664|gb|ELY94739.1| fibronectin-binding A domain-containing protein [Natrialba
           taiwanensis DSM 12281]
          Length = 720

 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F +RG + ++   P   A+GI
Sbjct: 593 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 643


>gi|171186042|ref|YP_001794961.1| hypothetical protein Tneu_1592 [Pyrobaculum neutrophilum V24Sta]
 gi|170935254|gb|ACB40515.1| protein of unknown function DUF814 [Pyrobaculum neutrophilum
           V24Sta]
          Length = 613

 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW   +   + ++V+ +QV+K AP G++L  GSFMI GK+ +    +L + + 
Sbjct: 476 AWKIGIHAVDVYYVRGEQVTKQAPAGQYLARGSFMIYGKREYVRNVRLELAVG 528


>gi|15790499|ref|NP_280323.1| hypothetical protein VNG1508C [Halobacterium sp. NRC-1]
 gi|169236235|ref|YP_001689435.1| hypothetical protein OE3153R [Halobacterium salinarum R1]
 gi|10580999|gb|AAG19803.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
 gi|167727301|emb|CAP14087.1| conserved hypothetical protein [Halobacterium salinarum R1]
          Length = 703

 Score = 45.1 bits (105), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGIS 368
           D +   +A+ V  DQVSKT  +GE+L  G F +RG + +F     + A+GI+
Sbjct: 576 DGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAVRGDRTYFRDLSTEWAVGIT 627


>gi|448353444|ref|ZP_21542220.1| fibronectin-binding A domain-containing protein [Natrialba
           hulunbeirensis JCM 10989]
 gi|445640304|gb|ELY93393.1| fibronectin-binding A domain-containing protein [Natrialba
           hulunbeirensis JCM 10989]
          Length = 736

 Score = 45.1 bits (105), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F +RG + ++   P   A+GI
Sbjct: 609 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 659



 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 55/259 (21%), Positives = 111/259 (42%), Gaps = 15/259 (5%)

Query: 63  TPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVK 116
           TP  + +H   GF  + +++F  A+D++F   E    +      +R    + + K E + 
Sbjct: 285 TPFPLEEHDEAGFDGEPYDTFLDALDDYFFHLELEDEEEPDPTSQRPDFEEEIAKHERII 344

Query: 117 RDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176
              +  +   EQ     +E AEL+  N   VD  +  I++  A    WE IEAR ++   
Sbjct: 345 EQQQGAIEGFEQEAENLRENAELLYANYGLVDDILSTIQEARAQDRPWEAIEARFEEGAE 404

Query: 177 HNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVD- 235
                A  +    ++++    ++ V  +   I+   ++ HD  +  AD   T+   + + 
Sbjct: 405 QGIEAAEAV----IDVDGSEGIVTVDVDGEYIE---LVAHDGVEQNADRLYTEAKRVAEK 457

Query: 236 VDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
            +  L+A  + +    D KR   + ++   +S   +    E + +   +D    ++I   
Sbjct: 458 KEGALAAIEDTREDLEDAKRRRDEWEESDGESGAGSGGGDEDEGEDEDRDWLAESSIPIR 517

Query: 295 RKVYWFEKFYWFISSENYL 313
               WF++F WF +S+ YL
Sbjct: 518 ENEPWFDRFRWFHTSDGYL 536


>gi|448368844|ref|ZP_21555611.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
           DSM 13077]
 gi|445651387|gb|ELZ04295.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
           DSM 13077]
          Length = 722

 Score = 45.1 bits (105), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F +RG + ++   P   A+GI
Sbjct: 595 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 645


>gi|448359396|ref|ZP_21548054.1| fibronectin-binding A domain-containing protein [Natrialba
           chahannaoensis JCM 10990]
 gi|445643534|gb|ELY96581.1| fibronectin-binding A domain-containing protein [Natrialba
           chahannaoensis JCM 10990]
          Length = 727

 Score = 45.1 bits (105), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V +DQV+KT  +GE+L  G F +RG + ++   P   A+GI
Sbjct: 600 DGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 650



 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 55/266 (20%), Positives = 110/266 (41%), Gaps = 30/266 (11%)

Query: 62  STPSVMRQHQ--GFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENV 115
           +TP  + +H   G   + +++F  A+D++F   E    +      +R    + + K E +
Sbjct: 278 ATPFPLEEHDESGLDGEPYDTFLNALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHERI 337

Query: 116 KRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQ 175
               +  +   EQ     +E AEL+  N   VD  +  I++  A    W+DIEAR ++  
Sbjct: 338 IEQQQGAIEGFEQEAENLRENAELLYANYGLVDDILSTIQEARAQDRPWDDIEARFEEGA 397

Query: 176 RHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADS--------SE 227
                 A  +    ++++    ++ V      I+   ++ HD  +  AD         +E
Sbjct: 398 EQGIEAAEAV----IDVDGSEGIVTVDVNGEYIE---LVAHDGVEQNADRLYTEAKRVAE 450

Query: 228 TKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQT 287
            K+ +LV ++       +AKR         ++ +   Q  E      ++  +   +D   
Sbjct: 451 KKEGALVAIEDTREDLEDAKR---------RRDEWEEQDGEPGAGEEDEDDEDDDRDWLA 501

Query: 288 MTNINKARKVYWFEKFYWFISSENYL 313
            ++I       WF++F WF +S+ YL
Sbjct: 502 ESSIPIRENEPWFDRFRWFHTSDGYL 527


>gi|448306550|ref|ZP_21496454.1| fibronectin-binding A domain-containing protein [Natronorubrum
           bangense JCM 10635]
 gi|445597848|gb|ELY51920.1| fibronectin-binding A domain-containing protein [Natronorubrum
           bangense JCM 10635]
          Length = 710

 Score = 45.1 bits (105), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 56/272 (20%), Positives = 108/272 (39%), Gaps = 47/272 (17%)

Query: 62  STPSVMRQHQGFPIQEFESFAAAVDEFFSTAE-SHKIDLKAVQQERD---ALKKLENVKR 117
           +TP  + +H     + ++SF AA+D++F   E   + +     Q+ D    + K E +  
Sbjct: 266 ATPFPLEEHAELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIID 325

Query: 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRH 177
             +  +   EQ     +E+AEL+      VD  +  I+   A    W++IEAR ++    
Sbjct: 326 QQQGAIEGFEQQADELREQAELLYAEYGLVDDILSTIQDARAQDRPWDEIEARFEEGAER 385

Query: 178 NDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVD 237
               A  +  + ++ +   + + +  +R ++     + HD  +  AD             
Sbjct: 386 GIEAAEAV--VGIDSSEGIVTVDIDGDRIDL-----VAHDGVEQNAD------------- 425

Query: 238 LDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK----------------TKQT 281
                Y  AKR  + K  A       I+ T + L+ A+++                 +  
Sbjct: 426 ---RLYTEAKRVAEKKAGAL----AAIEDTREDLEDAKRRRDEWDADDEGDEEADDEEAE 478

Query: 282 LKDVQTMTNINKARKVYWFEKFYWFISSENYL 313
            K+   M +I       WF++F WF +S+ YL
Sbjct: 479 EKNWLEMPSIPIRENEPWFDRFRWFHTSDGYL 510


>gi|433638964|ref|YP_007284724.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
           XH-70]
 gi|433290768|gb|AGB16591.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
           XH-70]
          Length = 847

 Score = 45.1 bits (105), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           D +   + + V ADQV+KT  +GE+L  G F IRG + +    P  +A+GI
Sbjct: 720 DGRYAGDVYSVDADQVTKTPESGEYLEKGGFAIRGDRTYHRDTPVGVAVGI 770


>gi|18313944|ref|NP_560611.1| hypothetical protein PAE3259 [Pyrobaculum aerophilum str. IM2]
 gi|18161516|gb|AAL64793.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
          Length = 614

 Score = 45.1 bits (105), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 42/178 (23%), Positives = 68/178 (38%), Gaps = 57/178 (32%)

Query: 248 RFFDLKRSAAKKQQKTIQSTEKALKSAEK-KTKQTLKDVQTMTNINKARKVYWFEKFYWF 306
           + FD  +   +K QK  Q  EK  K   K   +Q   + +  +++  A K  WFEKF+W 
Sbjct: 352 QLFDEAKELEEKAQKAAQVLEKLKKDLSKLDEEQRRAEEKLKSSVKIATKKSWFEKFHWT 411

Query: 307 ISS-----------------------ENYLG-----------------------NVAWDA 320
           +++                       E+YL                         +A  A
Sbjct: 412 VTTGRKPVIGGRDASQNEVVVKKYLKEHYLFFHADIPGASAVVAPPSEDPLELLQIAQFA 471

Query: 321 KVVTNAW----------WVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
              + AW          +VK  QV+K  P+G++L  GSFMI GK+ +    +L + + 
Sbjct: 472 AAYSKAWKIGIHAVDVYYVKGVQVTKQPPSGQYLARGSFMIYGKREYVRNVRLELAVG 529


>gi|290559894|gb|EFD93216.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
           acidophilus ARMAN-5]
          Length = 587

 Score = 44.7 bits (104), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISF 369
           AW A     + ++V  +QV+KT P+GE L  G+F I GK+ +     L + +SF
Sbjct: 463 AWRAGATNLDVYFVNPEQVTKTPPSGESLKKGAFYIEGKRKYIKNSSLGIYLSF 516


>gi|300711181|ref|YP_003736995.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
 gi|448296718|ref|ZP_21486771.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
 gi|299124864|gb|ADJ15203.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
 gi|445580850|gb|ELY35220.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
          Length = 694

 Score = 44.7 bits (104), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 62/295 (21%), Positives = 112/295 (37%), Gaps = 65/295 (22%)

Query: 48  VANMEFHPMILQPGS-----TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDL 99
           V    F P I +        TP  + +H       ++ F  A+D++F    T+E  + D 
Sbjct: 237 VTEKRFDPRIYRDDGQIVDVTPIALDEHAALEGDSYDRFNEALDDYFFELDTSEDEETDT 296

Query: 100 KAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159
                E   +++ + +    E  +   EQ    ++E+AEL+  N ++VD  +  +R  + 
Sbjct: 297 SPEFDEE--IERKKRIIDQQEGAIEGFEQEATEERERAELVYANYDTVDEVLTTVRGALE 354

Query: 160 NQLSWEDIEARVKQAQRHNDPVASII-------KQLKLNINHITLLLRVHKERSNIKDSW 212
               WE+IEA  +Q        A  +         + +++   T+ L V   RS ++   
Sbjct: 355 EGRGWEEIEATFEQGAEQGIDAAERVTGFDPENGMVSVDLGEATVSLEV---RSGVE--- 408

Query: 213 VLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALK 272
                NAD                      Y  AKR  + K  A    ++ I  T + L 
Sbjct: 409 ----KNAD--------------------RIYTEAKRIEEKKAGA----EEAIADTREELD 440

Query: 273 SAEKKTKQ--TLKDVQ------------TMTNINKARKVYWFEKFYWFISSENYL 313
           +  ++ +Q  T  + Q            +  +I   +   W+E F WF +S+ YL
Sbjct: 441 ALRERKRQWETRDETQDDGGEPEEIDWLSRASIPVRKSEEWYEDFRWFHTSDGYL 495


>gi|448346455|ref|ZP_21535340.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
           12890]
 gi|445632658|gb|ELY85869.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
           12890]
          Length = 715

 Score = 44.3 bits (103), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 55/260 (21%), Positives = 114/260 (43%), Gaps = 21/260 (8%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
           TP  + +H     + ++SF +A+D +F     AE  + D    + + ++ + K E +   
Sbjct: 268 TPFPLEEHDDLEGEPYDSFLSALDAYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 327

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQ-AQRH 177
            +  +   EQ     +E+AEL+      VD  +  I+     + SW+DI  R ++ A++ 
Sbjct: 328 QQGAIEGFEQEAASLREQAELLYAEYGLVDDILSTIQGARERERSWDDIRERFEEGAEQG 387

Query: 178 NDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSS----ETKKPSL 233
            D  A+I+    ++ +  T+ + +  ER ++ D+      NAD     +    E K  +L
Sbjct: 388 IDAAAAIV---DIDGSDGTVTVEIDDERIDL-DAQQGVEQNADRLYTEAKRVEEKKDGAL 443

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINK 293
             ++      A+AKR         + + +  +S        ++      +D  + ++I  
Sbjct: 444 AAIEDTRQDLADAKR--------RRDEWEADESGGGDDDETDEDGDDLPRDWLSESSIPI 495

Query: 294 ARKVYWFEKFYWFISSENYL 313
                WF++F WF +S+ +L
Sbjct: 496 RENEPWFDRFRWFNTSDGFL 515



 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 25/40 (62%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
           D +   + + V +DQVSKT  +GE+L  G F IRG + ++
Sbjct: 588 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYY 627


>gi|374326819|ref|YP_005085019.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
 gi|356642088|gb|AET32767.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
          Length = 621

 Score = 44.3 bits (103), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW   +   + ++V+ +QVSK  P+G++L  GSFMI GK+ +    ++ + + 
Sbjct: 484 AWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMIYGKREYVRNVRMELAVG 536


>gi|145591891|ref|YP_001153893.1| hypothetical protein Pars_1690 [Pyrobaculum arsenaticum DSM 13514]
 gi|145283659|gb|ABP51241.1| protein of unknown function DUF814 [Pyrobaculum arsenaticum DSM
           13514]
          Length = 614

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW   +   + ++V+ +QVSK  P+G++L  GSFM+ GK+ +    +L + + 
Sbjct: 477 AWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYVRHIRLELAVG 529


>gi|379003409|ref|YP_005259081.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
 gi|375158862|gb|AFA38474.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
          Length = 614

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW   +   + ++V+ +QVSK  P+G++L  GSFM+ GK+ +    +L + + 
Sbjct: 477 AWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYVRHIRLELAVG 529


>gi|41615287|ref|NP_963785.1| hypothetical protein NEQ506 [Nanoarchaeum equitans Kin4-M]
 gi|40069011|gb|AAR39346.1| NEQ506 [Nanoarchaeum equitans Kin4-M]
          Length = 255

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 29/42 (69%), Gaps = 1/42 (2%)

Query: 317 AWDAKVVT-NAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNF 357
           AW   + + + ++V   Q+SK AP+GE+L  GSFM+ GKK++
Sbjct: 142 AWQEGLASIDVFYVLGKQISKKAPSGEYLKHGSFMVYGKKHY 183


>gi|448730186|ref|ZP_21712496.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
           5350]
 gi|445793917|gb|EMA44482.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
           5350]
          Length = 699

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 31/61 (50%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLFKLEESSI 378
           + +   +A+    DQVSKT  +GE+L  G F IRG + +F    + + +  + + E   I
Sbjct: 572 NGRFAGDAYMASPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGIVCEPETRVI 631

Query: 379 S 379
            
Sbjct: 632 G 632



 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 50/268 (18%), Positives = 101/268 (37%), Gaps = 41/268 (15%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQER----DALKKLENVKRD 118
           TP  + +      + F+SF AA+D +F   ++ + D +   +ER    D +++ + +   
Sbjct: 257 TPFPLEERADLDSEGFDSFTAALDAYFVALDTTE-DEEGGGRERPDFEDDIERQQRIIEQ 315

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
            E  +   E     ++ KAE +  + + VD  +  +R        W+DIE R  +     
Sbjct: 316 QEGAIEDFEDQADAERAKAESLYAHYDLVDEILSTVRNAREQGTGWDDIEERFAEGADRG 375

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDL 238
              A  +  +  +   +T+ +       + +D       NAD                  
Sbjct: 376 IAAAEAVDGVTPSEGTVTVDIDGRSVELDPRDGV---EQNADRL---------------- 416

Query: 239 DLSAYANAKRFFDLKRSA------AKKQQKTIQSTEKALKSAEKKTKQTLKDVQ------ 286
               Y  AKR    K  A       + + + +Q      +SA++   ++    +      
Sbjct: 417 ----YKEAKRVVGKKEGAEEAVAETRAELEALQRRRDEWESADENETESTDTDEDEDIDW 472

Query: 287 -TMTNINKARKVYWFEKFYWFISSENYL 313
            T  +I   +   W+E+F WF +S+ +L
Sbjct: 473 LTRRSIPVRQNEQWYERFRWFRTSDGFL 500


>gi|448737510|ref|ZP_21719550.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
           13552]
 gi|445803654|gb|EMA53937.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
           13552]
          Length = 695

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGIS 368
           + +   +A+  + DQVSKT  +GE+L  G F IRG + +F      +A+GI+
Sbjct: 569 NGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYFRDTAVGVAVGIT 620



 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 58/296 (19%), Positives = 115/296 (38%), Gaps = 65/296 (21%)

Query: 48  VANMEFHPMILQPGSTPSVMRQHQGFPIQE--------FESFAAAVDEFFSTAESHKIDL 99
           +++ +F+P I +    P  +     FP+ E        F+SF AA+D +F   ++ + + 
Sbjct: 237 LSSGDFNPRIYRDDGDPVDVTP---FPLDERAELDSEGFDSFTAALDAYFVELDTTEDEE 293

Query: 100 KAVQQERDALKKLENVKR---DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQ 156
              ++  D  +++E  +R     E  +   E     ++E AE +  N E VD  +  +R 
Sbjct: 294 SGGRERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYANYELVDEILTTVRN 353

Query: 157 DIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHH 216
                + WE IE R  + +      A  +  ++ +   +T+         +I D      
Sbjct: 354 AREEGIGWEAIEERFAEGEERGIAAAEAVSGIEPSEGTVTV---------DIDDR----- 399

Query: 217 DNADPFADSSETKKPSLVDVDLDL--SAYANAKRFF-DLKRSAAKKQ--QKTIQSTEKAL 271
                             DV+LD       NA R + + KR   KK+  ++ +  T + L
Sbjct: 400 ------------------DVELDPQEGVEQNADRLYREAKRVVEKKEGAEEAVAETREEL 441

Query: 272 KSAEKKTKQTLKDVQ--------------TMTNINKARKVYWFEKFYWFISSENYL 313
           ++ E++  +                    +  +I       W+E+F WF +S+ +L
Sbjct: 442 EAIERQRDEWEAGDVDDDPDEESEDVDWLSQRSIPVRTDEQWYERFRWFHTSDGFL 497


>gi|448339346|ref|ZP_21528374.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
 gi|445620575|gb|ELY74071.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
          Length = 721

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 25/40 (62%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
           D +   + + V +DQVSKT  +GE+L  G F IRG + ++
Sbjct: 594 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYY 633



 Score = 41.2 bits (95), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 56/259 (21%), Positives = 113/259 (43%), Gaps = 19/259 (7%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFF---STAESHKIDLKAVQQERDA-LKKLENVKRD 118
           TP  + +H+    + ++SF +A+D++F     AE  + D    + + ++ + K E +   
Sbjct: 274 TPFPLEEHEDLDGEPYDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQ 333

Query: 119 HETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHN 178
            +  +   EQ     +E+AEL+      VD  +  I        SW+DI  R ++     
Sbjct: 334 QQGAIEGFEQEAESLREQAELLYAEYGLVDDILSTILGARKRDRSWDDIRDRFEEGAEQG 393

Query: 179 DPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETKKPSLVD 235
              A  +  + ++ +  T+ + +  ER ++ D+      NAD     A   E KK     
Sbjct: 394 IDAAEAV--VDVDGSDGTVTVDIDDERISL-DAQQGVEQNADRLYTEAKRVEEKKEGA-- 448

Query: 236 VDLDLSAYANAKRFFDLKRSAAKKQQ-KTIQSTEKALKSAEKKTKQTLKDVQTMTNINKA 294
               L+A  N +   DL+ +  ++ + +  +S       A++  + + +D  +  +I   
Sbjct: 449 ----LAAIENTRD--DLEDAKRRRDEWEDDESGGADEAEADEDEEDSQRDWLSEPSIPIR 502

Query: 295 RKVYWFEKFYWFISSENYL 313
               WF++F WF +S+ YL
Sbjct: 503 ENEPWFDRFRWFHTSDGYL 521


>gi|448340269|ref|ZP_21529242.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
 gi|445630575|gb|ELY83836.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
          Length = 722

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 25/40 (62%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
           D +   + + V +DQVSKT  +GE+L  G F IRG + ++
Sbjct: 595 DGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYY 634



 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 57/265 (21%), Positives = 113/265 (42%), Gaps = 31/265 (11%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
           TP  + +H     + ++SF +A+D++F      +++L A ++E D           + K 
Sbjct: 275 TPFPLEEHDDLEGEPYDSFLSALDDYF-----FRLEL-AAEEEPDPTDQRPDFESEIAKH 328

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    +  +   EQ     +E+AEL+      VD  +  I+     + SW++I  R +
Sbjct: 329 ERIIEQQQGAIEGFEQEAASLREQAELLYAEYGLVDEILSTIQGARERERSWDEIRERFE 388

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
           +        A  I  + ++ +  T+ + +  ER ++ D+      NAD     A   E K
Sbjct: 389 EGAEQGIDAAEAI--VDIDGSDGTVTVEIDDERIDL-DAQQGVEQNADRLYTEAKRVEEK 445

Query: 230 KPSLVDVDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM 288
           K      D  L+A  N ++   D KR   + + +  +S  +     +       +D  + 
Sbjct: 446 K------DGALAAIENTRQDLADAKRR--RDEWEADESGGEDDDETDADGDDLPRDWLSE 497

Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
           ++I       WF++F WF +S+ +L
Sbjct: 498 SSIPIRENEPWFDRFRWFHTSDGFL 522


>gi|110667755|ref|YP_657566.1| hypothetical protein HQ1801A [Haloquadratum walsbyi DSM 16790]
 gi|109625502|emb|CAJ51929.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
          Length = 719

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V  DQVSKT  +GE++  GSF+IRG + +      ++A+GI
Sbjct: 592 EGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGDRTYIRDVAAEIAVGI 642



 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 53/265 (20%), Positives = 112/265 (42%), Gaps = 19/265 (7%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERD---ALKKLENVKRDHE 120
           P   R+ Q   +  ++SF  A+D +F   + +    ++ Q   D    + K + +    E
Sbjct: 260 PLEERKQQNLDVTTYDSFNGALDVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQE 319

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI-EARVKQAQRHND 179
             +   EQ    ++ +AEL+  N E V+  I  I+   A   SW++I E     A+R  D
Sbjct: 320 GAIDDFEQRAEAERSRAELLYANYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGID 379

Query: 180 PVASIIK------QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
             A+++        + + I+ + + + V      ++ +    +  A    +  E    ++
Sbjct: 380 AAAAVVSVDGAEAMVTIEIDDMRVPVNVD---VGVEKNADQRYTEAKRIEEKKEGALTAI 436

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKA---LKSAEKKTKQTLKDVQ--TM 288
            +   +L+A    +  +D +  A    +    +TE     + +  + ++    D +  +M
Sbjct: 437 ENTREELNAVKQRRDAWD-REDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTDDEWLSM 495

Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
           T+I   +   W+E+F WF +S  YL
Sbjct: 496 TSIPLQKNDDWYEQFRWFHTSTGYL 520


>gi|385803199|ref|YP_005839599.1| hypothetical protein Hqrw_1937 [Haloquadratum walsbyi C23]
 gi|339728691|emb|CCC39852.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
          Length = 719

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 32/51 (62%), Gaps = 2/51 (3%)

Query: 319 DAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFP--PCQLAMGI 367
           + +   +A+ V  DQVSKT  +GE++  GSF+IRG + +      ++A+GI
Sbjct: 592 EGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGDRTYIRDVAAEIAVGI 642



 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 54/265 (20%), Positives = 109/265 (41%), Gaps = 19/265 (7%)

Query: 64  PSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERD---ALKKLENVKRDHE 120
           P   R+ Q   +  ++SF  A+D +F   + +    ++ Q   D    + K + +    E
Sbjct: 260 PLEERKQQNLDVTAYDSFNGALDVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQE 319

Query: 121 TRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDI-EARVKQAQRHND 179
             +   EQ    ++ +AEL+  N E V+  I  I+   A   SW++I E     A+R  D
Sbjct: 320 GAIDDFEQRAEAERSRAELLYANYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGID 379

Query: 180 PVASIIK------QLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
             A+++        + + I+ + + + V      ++ +    +  A    +  E    ++
Sbjct: 380 AAAAVVSVDGAEAMVTIEIDDVRVPVNVD---VGVEKNADQRYTEAKRIEEKKEGALTAI 436

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK-----QTLKDVQTM 288
            +   +L+A    +  +D +  A    +    +TE         T+      T  +  +M
Sbjct: 437 ENTREELNAVKQRRDAWD-REDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTNDEWLSM 495

Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
           T+I   +   W+E+F WF +S  YL
Sbjct: 496 TSIPLQKNDDWYEQFRWFHTSTGYL 520


>gi|374850433|dbj|BAL53422.1| hypothetical conserved protein [uncultured crenarchaeote]
          Length = 530

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
            + + VKA+QVS T P+G++L  GSFM+ G K F+
Sbjct: 411 VSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKFY 445


>gi|397781041|ref|YP_006545514.1| hypothetical protein BN140_1875 [Methanoculleus bourgensis MS2]
 gi|396939543|emb|CCJ36798.1| putative protein MJ1625 [Methanoculleus bourgensis MS2]
          Length = 659

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 62/280 (22%), Positives = 111/280 (39%), Gaps = 78/280 (27%)

Query: 155 RQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVL 214
           R+++  Q   E I+   K+ +R+   V  + +        IT L    ++RS  +   +L
Sbjct: 312 REEVIRQRQAEAIKGFEKKIRRYERVVEVLYENYTAVTGVITTLDAASRDRSWQEIEQIL 371

Query: 215 HHDNADPFADSSETKKPSLVDVDLDLSA-----YA------NAKRFFDLKRSAAKKQQKT 263
             ++ +  A       P+   V+LDL+      Y       N  R++D  +   KK+   
Sbjct: 372 KSNSDNAAAKMIRAVHPAEAAVELDLAGERVKVYVHETIEQNIGRYYDQIKKFKKKKAGA 431

Query: 264 IQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYL---------- 313
           + + E+A+ + + + KQ L          K R   W+ +F WF +S+  L          
Sbjct: 432 LAAMERAI-TVKPRRKQHLV-------FQKKR---WYHRFRWFSTSDGVLVIGGRDASQN 480

Query: 314 ---------------------GNV-------------AWDAKVVTNAW----------WV 329
                                G+V             A  A   +NAW            
Sbjct: 481 EELVKKYMEGGDLFIHADVHGGSVVIVKGATEHLDEAAQFAASYSNAWKAGHFSADVYAA 540

Query: 330 KADQVSKTAPTGEFLTTGSFMIRGKKNFF--PPCQLAMGI 367
           + DQVSKTA +GE++  G+F++RG++ +F   P  +A+G+
Sbjct: 541 RPDQVSKTAESGEYVARGAFIVRGERQYFRNVPVGVAIGL 580


>gi|397772651|ref|YP_006540197.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
 gi|397681744|gb|AFO56121.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
          Length = 722

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 57/265 (21%), Positives = 113/265 (42%), Gaps = 31/265 (11%)

Query: 63  TPSVMRQHQGFPIQEFESFAAAVDEFFSTAESHKIDLKAVQQERDA----------LKKL 112
           TP  + +H     + ++SF +A+D++F      +++L A ++E D           + K 
Sbjct: 275 TPFPLEEHDDLEGEPYDSFLSALDDYF-----FRLEL-AAEEEPDPTDQRPDFESEIAKH 328

Query: 113 ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172
           E +    +  +   EQ     +E+AEL+      VD  +  I+     + SW++I  R +
Sbjct: 329 ERIIEQQQGAIEGFEQEAASLREQAELLYAEYGLVDEILSTIQGARERERSWDEIRERFE 388

Query: 173 QAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPF---ADSSETK 229
           +        A  I  + ++ +  T+ + +  ER ++ D+      NAD     A   E K
Sbjct: 389 EGAEQGIDAAEAI--VDIDGSDGTVTVEIDDERIDL-DAQQGVEQNADRLYTEAKRVEEK 445

Query: 230 KPSLVDVDLDLSAYANAKR-FFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTM 288
           K      D  L+A  N ++   D KR   + + +  +S  +     +       +D  + 
Sbjct: 446 K------DGALAAIENTRQDLADAKRR--RDEWEADESGGEDDDETDADGDDLPRDWLSE 497

Query: 289 TNINKARKVYWFEKFYWFISSENYL 313
           ++I       WF++F WF +S+ +L
Sbjct: 498 SSIPIRENEPWFDRFRWFHTSDGFL 522


>gi|355571923|ref|ZP_09043131.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
 gi|354825019|gb|EHF09254.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
          Length = 633

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 81/204 (39%), Gaps = 79/204 (38%)

Query: 231 PSLVDVDLDL----------SAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSA-EKKTK 279
           P    VD+DL          S  ANA  +++      KK ++ I   + A++ A +KK +
Sbjct: 362 PGRGAVDIDLGERVTLTVGESIEANAAAYYE----EIKKYRRKIAGAQAAMERAVQKKER 417

Query: 280 QTLKDVQTMTNINKARKVYWFEKFYWFI------------SSEN------YLGN------ 315
           +T++          A K  W+ +F WFI            +S+N      Y+        
Sbjct: 418 RTVRAA--------AGKKRWYHRFRWFITSDGVLVVGGRDASQNEELVKKYMEGSDLFVH 469

Query: 316 --------------------VAWDAKVVTNAW----------WVKADQVSKTAPTGEFLT 345
                               VA  A   + AW           V   QVSKT  +GE+++
Sbjct: 470 ADVHGASVVIVKGKTGKMDEVATFAASYSGAWKSGHLAADVYCVAPSQVSKTPESGEYVS 529

Query: 346 TGSFMIRGKKNFFP--PCQLAMGI 367
            GSF++RG++ +F   P  +A+G+
Sbjct: 530 RGSFIVRGERRYFRNVPLGIAIGL 553


>gi|296241940|ref|YP_003649427.1| hypothetical protein Tagg_0195 [Thermosphaera aggregans DSM 11486]
 gi|296094524|gb|ADG90475.1| protein of unknown function DUF814 [Thermosphaera aggregans DSM
           11486]
          Length = 666

 Score = 42.7 bits (99), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF-PPCQLAMGI 367
           AW   +     +W     VS + P+GE+L  G+ M+ GKKN+   P +L +G+
Sbjct: 521 AWKTGMGFIEVFWTMGSSVSLSPPSGEYLPKGAIMVYGKKNYLKTPLRLGLGL 573


>gi|315427275|dbj|BAJ48887.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
 gi|343485854|dbj|BAJ51508.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 628

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%)

Query: 324 TNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFF 358
            + + VKA+QVS T P+G++L  GSFM+ G K F+
Sbjct: 509 VSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKFY 543


>gi|424812620|ref|ZP_18237860.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
 gi|339756842|gb|EGQ40425.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
          Length = 628

 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 79/386 (20%), Positives = 136/386 (35%), Gaps = 136/386 (35%)

Query: 57  ILQPGSTPSVMRQHQG-------FPIQ--------EFESFAAAVDEFFSTAESHKIDLKA 101
           +L+ G  P +    +G       FP+Q         FE+F+ A+DE F      K++ K 
Sbjct: 210 VLELGDDPVLYTDPEGEPVRAAPFPLQTYSEHEEERFETFSRALDELFHRRRQQKLESKR 269

Query: 102 VQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINN-------QESVDAAILAI 154
           + + R+  + +E      E +   LEQ     ++ AE I  N       ++ VD+ I   
Sbjct: 270 MDKYRERREGIERQLHQQEQKAEGLEQAARQRRQAAETIYENYQVFHDLKQKVDSVIHEE 329

Query: 155 RQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVL 214
             + A QL   D+E+                      +NH     RV             
Sbjct: 330 GWESAEQLEVSDLES----------------------VNHQERFYRVA------------ 355

Query: 215 HHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSA 274
                    D +E K      +  D S  A A R +D     AK++++  ++T +AL++ 
Sbjct: 356 --------IDGAEVK------LSPDESLEAAASRMYD----EAKEREQKAENTREALQNT 397

Query: 275 EKKTKQTLKDVQTMTNI----NKARKVYWFEKFYWFISSENYL----------------- 313
             K ++  +D   +       +++R   WFEK+ WF + E  L                 
Sbjct: 398 RGKLEELEEDEFEVEEDSMERDESRSKRWFEKYRWFHTPEGRLVICGRGPQTNESLVKNH 457

Query: 314 ----------------------GNVAWDAKV---------VTNAW--WVKADQVSKTAP- 339
                                 G  A + ++          + AW   + AD V    P 
Sbjct: 458 LEGDDLYLHADFDGAPSVALKDGQDASEEEIRQAAKAAVTFSKAWKSGIGADDVYYVEPS 517

Query: 340 -------TGEFLTTGSFMIRGKKNFF 358
                  +GE+L  G+F+IRG + + 
Sbjct: 518 QVTKNPESGEYLEKGAFVIRGDRTYL 543


>gi|359415829|ref|ZP_09208221.1| hypothetical protein HRED_04719, partial [Candidatus Haloredivivus
           sp. G17]
 gi|358033813|gb|EHK02326.1| hypothetical protein HRED_04719 [Candidatus Haloredivivus sp. G17]
          Length = 194

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 50/93 (53%), Gaps = 8/93 (8%)

Query: 234 VDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKK----TKQTLKDVQTMT 289
           + +DL     A A +++D     AK+ +  +++ EKAL+  E +     ++ ++  + M 
Sbjct: 52  IKIDLHQDLEATASQYYD----KAKESESKMENAEKALEKTEDEIESLGEEDIELEEVME 107

Query: 290 NINKARKVYWFEKFYWFISSENYLGNVAWDAKV 322
           + ++ R   WFEK+ WF SS+ YL  +  DA+ 
Sbjct: 108 DKSEKRSKKWFEKYRWFYSSDGYLVCLGRDAQT 140


>gi|126460385|ref|YP_001056663.1| hypothetical protein Pcal_1780 [Pyrobaculum calidifontis JCM 11548]
 gi|126250106|gb|ABO09197.1| protein of unknown function DUF814 [Pyrobaculum calidifontis JCM
           11548]
          Length = 616

 Score = 42.0 bits (97), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 16/53 (30%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 317 AWDAKV-VTNAWWVKADQVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGIS 368
           AW   +   + ++ + +QVSK  P G++L  GSFM+ GK+ +    +L + + 
Sbjct: 479 AWKIGIHAIDVYYARGEQVSKQPPAGQYLARGSFMVYGKREYVRNVRLELAVG 531


>gi|432330923|ref|YP_007249066.1| putative RNA-binding protein, snRNP like protein [Methanoregula
           formicicum SMSP]
 gi|432137632|gb|AGB02559.1| putative RNA-binding protein, snRNP like protein [Methanoregula
           formicicum SMSP]
          Length = 630

 Score = 41.6 bits (96), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 333 QVSKTAPTGEFLTTGSFMIRGKKNFFPPCQLAMGISFLF 371
           QVSKT   GEF++ GSF++RG++ ++    L+ GI  + 
Sbjct: 512 QVSKTPEAGEFVSRGSFIVRGERTYYRNIPLSTGIGLML 550


>gi|152997342|ref|YP_001342177.1| hypothetical protein Mmwyl1_3337 [Marinomonas sp. MWYL1]
 gi|150838266|gb|ABR72242.1| conserved hypothetical protein [Marinomonas sp. MWYL1]
          Length = 232

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 66/129 (51%), Gaps = 11/129 (8%)

Query: 17  TLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPI- 75
           T+ +   +L V+  G    K+E RPN +D+++  +     I+ PG T SV+ ++ G P+ 
Sbjct: 50  TVEIFALDLLVNKNG----KEELRPNDDDFLIIPLT---TIIPPGKTQSVIIRYIGEPVL 102

Query: 76  QEFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKE 135
            E +S+  A+++   T +  +ID   +     + K L NV  ++      ++  + VDKE
Sbjct: 103 SESKSYRIAINQV--TVDLGEIDGSGIGMAL-SFKTLLNVVPNNAKSNLVIKNKEQVDKE 159

Query: 136 KAELIINNQ 144
              +++ N+
Sbjct: 160 TWSVLLENK 168


>gi|290474141|ref|YP_003467018.1| toxin RtxA protein [Xenorhabdus bovienii SS-2004]
 gi|289173451|emb|CBJ80230.1| putative toxin RtxA protein [Xenorhabdus bovienii SS-2004]
          Length = 4716

 Score = 38.9 bits (89), Expect = 4.0,   Method: Composition-based stats.
 Identities = 48/181 (26%), Positives = 73/181 (40%), Gaps = 19/181 (10%)

Query: 118  DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRH 177
            DH  +  A  Q  L DKEKAE      E      LA       QL   D+EA  +  Q  
Sbjct: 1615 DH-VQQDAAAQNALQDKEKAEADRQRLEQERDTQLAAVSGSQAQLEATDLEALNRNGQDQ 1673

Query: 178  NDPV----ASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233
             D +     +I K+L      ++ L    K      + W      A+      +T+  S 
Sbjct: 1674 RDAINHEAQAITKELTEQAKGLSALDSHSKHHGKSGEQWRAQF--AEGLLGQVQTQLNSA 1731

Query: 234  VDV---DLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDV-QTMT 289
             +     LD S +A ++RF DL+ + AK        +E  +   E+  +Q+ K++ QT T
Sbjct: 1732 DETAQQQLDRSQHAASERFGDLRDAVAK--------SEAGVSKGEQNQQQSQKEIEQTRT 1783

Query: 290  N 290
            N
Sbjct: 1784 N 1784


>gi|348664654|gb|EGZ04498.1| hypothetical protein PHYSODRAFT_536255 [Phytophthora sojae]
          Length = 580

 Score = 38.1 bits (87), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 9/84 (10%)

Query: 98  DLKAVQQERDALKK---LENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAI 154
           D +  QQE +A ++       +RD   RL+ L  TQL+ +   ++ +N Q+  DA   A+
Sbjct: 202 DRRHFQQELNASRRDVQTAKCQRDKVERLAQLRHTQLLKESARKVRVNEQKRRDAQ--AV 259

Query: 155 RQDIANQLSWED----IEARVKQA 174
           RQ IA+ L + D    +E++VKQA
Sbjct: 260 RQLIADTLEYRDELSIMESKVKQA 283


>gi|301111582|ref|XP_002904870.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262095200|gb|EEY53252.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 583

 Score = 37.7 bits (86), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 6/61 (9%)

Query: 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWED----IEARVKQ 173
           D   +L+ L  TQLV +   +L IN Q+  DA   A+RQ IA+ L + D    +E+R+KQ
Sbjct: 224 DKAEKLANLRHTQLVKESARKLRINEQKRRDAQ--AVRQTIADTLEYRDELNAMESRLKQ 281

Query: 174 A 174
           A
Sbjct: 282 A 282


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.128    0.364 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,382,614,316
Number of Sequences: 23463169
Number of extensions: 207395567
Number of successful extensions: 799899
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 648
Number of HSP's successfully gapped in prelim test: 1298
Number of HSP's that attempted gapping in prelim test: 795570
Number of HSP's gapped (non-prelim): 4509
length of query: 380
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 236
effective length of database: 8,980,499,031
effective search space: 2119397771316
effective search space used: 2119397771316
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 78 (34.7 bits)