BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 002338
         (934 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225432064|ref|XP_002273922.1| PREDICTED: nuclear export mediator factor Nemf-like [Vitis
           vinifera]
          Length = 1110

 Score = 1422 bits (3681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 713/928 (76%), Positives = 788/928 (84%), Gaps = 32/928 (3%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK  TPSGFTLKLRKHIRTRRLEDVRQLGYDR++LFQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSMTPSGFTLKLRKHIRTRRLEDVRQLGYDRVVLFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSEF V+TLLRSHRDDDKGVAIMSRHRYP EICRVFERT  +KL AA
Sbjct: 121 ILELYAQGNILLTDSEFMVMTLLRSHRDDDKGVAIMSRHRYPVEICRVFERTATTKLQAA 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LTS KE ++NE  + +E GN VS+A +E  G  KG KS + SKN+N    DGARAKQ TL
Sbjct: 181 LTSPKESESNEAVEASEGGNKVSDAPREKQGNNKGVKSSEPSKNTN----DGARAKQATL 236

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           KTVLGEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +V KFE+WL+DVIS
Sbjct: 237 KTVLGEALGYGPALSEHIILDAGLIPNTKVTKDSKFDIDTIQRLAQSVTKFENWLEDVIS 296

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQFRSREFVKFETFDAALD 359
           GD VPEGYILMQNK  GKD PP++    +Q IYDEFCP+LLNQF+SREFVKFETFDAALD
Sbjct: 297 GDQVPEGYILMQNKIFGKDCPPSQPDRGSQVIYDEFCPILLNQFKSREFVKFETFDAALD 356

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIESQR+EQQ KAKE +A  KL KI +DQENRVHTLK+EVD  +KMAELIEYNLED
Sbjct: 357 EFYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENRVHTLKKEVDHCIKMAELIEYNLED 416

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VDAAILAVRVALAN M+WEDLARMVKEE+K+GNPVAGLIDKLYLERNCM+LLLSNNLDEM
Sbjct: 417 VDAAILAVRVALANGMNWEDLARMVKEEKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEM 476

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
           DD+EKTLPV+KVEVDLALSAHANARRWYE KK+QE+KQEKT+ AH KAFKAAEKKTRLQ+
Sbjct: 477 DDDEKTLPVDKVEVDLALSAHANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQL 536

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
            QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADL
Sbjct: 537 SQEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADL 596

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
           HGASSTVIKNH+PE PVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGE
Sbjct: 597 HGASSTVIKNHKPEHPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGE 656

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGH 719
           YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG  DFE++  
Sbjct: 657 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGAQDFEENES 716

Query: 720 HKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSK 779
            K NSD ESEK++TDEK  AES                      + P E++ + NG DS+
Sbjct: 717 LKGNSDSESEKEETDEKRTAES----------------------KIPLEERNMLNGNDSE 754

Query: 780 -IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVR 838
            I DI+    + V PQLEDLIDRAL LGS + S  K+ +ET+Q DL EE  H +R ATVR
Sbjct: 755 HIADISGGHVSSVNPQLEDLIDRALELGSNTASGKKYALETSQVDL-EEHNHEDRKATVR 813

Query: 839 DKPYISKAERRKLKKGQGSSVVDP---KVEREKERGKDASSQPESIVRKTKIEGGKISRG 895
           +KPYISKAERRKLKKGQ +S  D      + E E    ++SQP+  V+ ++  GGKISRG
Sbjct: 814 EKPYISKAERRKLKKGQKTSTSDAGGDHGQEEIEENNVSTSQPDKDVKNSQPAGGKISRG 873

Query: 896 QKGKLKKMKEKYGDQDEEERNIRMALLA 923
           QKGKLKKMKEKY DQDEEER+IRMALLA
Sbjct: 874 QKGKLKKMKEKYADQDEEERSIRMALLA 901


>gi|255556494|ref|XP_002519281.1| conserved hypothetical protein [Ricinus communis]
 gi|223541596|gb|EEF43145.1| conserved hypothetical protein [Ricinus communis]
          Length = 1092

 Score = 1350 bits (3494), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 689/926 (74%), Positives = 763/926 (82%), Gaps = 54/926 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVTESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRI+LFQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIVLFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDS+FTVLTLLRSHRDDDKG AIMSRHRYPTEICRVFER TA KL  +
Sbjct: 121 ILELYAQGNILLTDSDFTVLTLLRSHRDDDKGFAIMSRHRYPTEICRVFERITAEKLQES 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNA-SKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
           LTS KEP+ +EP  VN+  NN+S    KE  G   G KS D SK+++    DG RAKQ T
Sbjct: 181 LTSFKEPEISEP--VNDGENNMSEKLKKEKQGKSTGTKSSDPSKSAS----DGNRAKQTT 234

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VLGEALGYGPALSEH+ILD GLVPN K S+ N+L+DNAIQVLV AVAK EDWLQD+I
Sbjct: 235 LKNVLGEALGYGPALSEHMILDAGLVPNTKFSKSNRLDDNAIQVLVQAVAKLEDWLQDII 294

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           SGD +PEGYILMQNK++GK+HP +ES  + +IYDEFCP+LLNQF+ RE+VKF+TFDAALD
Sbjct: 295 SGDKIPEGYILMQNKNVGKNHPSSES--AFKIYDEFCPILLNQFKMREYVKFDTFDAALD 352

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIESQRAEQQ K KE++A  KLNKI +DQENRV TL++EVD  V+ AELIEYNLED
Sbjct: 353 EFYSKIESQRAEQQQKTKENSAIQKLNKIRLDQENRVLTLRKEVDLCVRKAELIEYNLED 412

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VDAAILAVRVALA  MSWEDL RMVKEE+K GNPVA LIDKL+LERNCM+LLLSNNLD+M
Sbjct: 413 VDAAILAVRVALAKGMSWEDLTRMVKEEKKLGNPVASLIDKLHLERNCMTLLLSNNLDDM 472

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
           DD+EKTLPV+KVE+DLALSAHANARRWYE+KKKQESKQ KT+TAH KAFKAAE+KTRLQ+
Sbjct: 473 DDDEKTLPVDKVEIDLALSAHANARRWYEMKKKQESKQGKTVTAHEKAFKAAERKTRLQL 532

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
            QEK+VA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+L
Sbjct: 533 SQEKSVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAEL 592

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
           HGASSTVIKNHRPEQPVPPLTLNQAGC+TVC SQAWDSK+VTSAWWVYPHQVSKTAPTGE
Sbjct: 593 HGASSTVIKNHRPEQPVPPLTLNQAGCYTVCQSQAWDSKIVTSAWWVYPHQVSKTAPTGE 652

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGH 719
           YLTVGSFMIRGKKNFL PHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM+DFE+SG 
Sbjct: 653 YLTVGSFMIRGKKNFLSPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMNDFEESGP 712

Query: 720 HKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGI-DS 778
             E SD ESEK++  ++ ++ES +            +A  VDS  F  +  T + GI + 
Sbjct: 713 PLEISDSESEKEEIGKEVMSESKTT----------ADAEVVDSINF-LQQGTAAGGISND 761

Query: 779 KIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVR 838
            I DI  N  A  TPQLEDLIDRALGLG A++S   +G+E ++ DLS+E+          
Sbjct: 762 DISDIVGNDVASATPQLEDLIDRALGLGPATVSQKNYGVEISKIDLSKEEI--------- 812

Query: 839 DKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDA-SSQPESIVRKTKIEGGKISRGQK 897
                    RR  K              E+ +  DA  SQ E   +  K   GKISRGQK
Sbjct: 813 ---------RRNXK--------------EESKENDAFVSQREKSSQSNKAGSGKISRGQK 849

Query: 898 GKLKKMKEKYGDQDEEERNIRMALLA 923
            KLKKMKEKY DQDEEER+IRMALLA
Sbjct: 850 SKLKKMKEKYADQDEEERSIRMALLA 875


>gi|449441522|ref|XP_004138531.1| PREDICTED: nuclear export mediator factor Nemf-like [Cucumis
           sativus]
          Length = 1119

 Score = 1343 bits (3475), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 685/928 (73%), Positives = 772/928 (83%), Gaps = 20/928 (2%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCL+RLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLKRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG +AHYV
Sbjct: 61  ESGVRLHTTEYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGASAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSEFTVLTLLRSHRDD+KGVAIMSRHRYPTEI RVFE+TTA+KL  A
Sbjct: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDNKGVAIMSRHRYPTEISRVFEKTTAAKLQEA 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S     +    V  +GNN ++  K+    QK  K+      S+K   DG+R+KQ TL
Sbjct: 181 LTLS-----DNIVNVTGNGNNETDPLKQQADNQKVSKT----SVSSKAQGDGSRSKQSTL 231

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VLGEALGYG ALSEHIIL+ GL+PNMKL   NKL+DN++  L+ AVA FEDWL+DVI 
Sbjct: 232 KAVLGEALGYGTALSEHIILNAGLIPNMKLCNDNKLDDNSLDCLMQAVANFEDWLEDVIF 291

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G  +PEGYILMQ K + K+   +E+ ++ +IYDEFCP+LLNQF SR++ KFETFDAALDE
Sbjct: 292 GTRIPEGYILMQKKDVKKEE--SEAATANEIYDEFCPILLNQFMSRKYTKFETFDAALDE 349

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIESQR+EQQ KAKE +A HKLNKI MDQ NRV  LKQEVD SVKMAELIEYNLEDV
Sbjct: 350 FYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLEDV 409

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           DA ILAVRVALA  MSWEDLARMVKEE+K+GNPVAGLIDKL LERNCM+LLLSNNLDEMD
Sbjct: 410 DAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLSNNLDEMD 469

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           D+EKT PV+KVEVD++LSAHANARRWYELKKKQESKQEKTITAH KAFKAAE+KTRLQ+ 
Sbjct: 470 DDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTITAHEKAFKAAERKTRLQLS 529

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+LH
Sbjct: 530 QEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAELH 589

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASSTVIKNH+PEQ VPPLTLNQAGC+TVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 590 GASSTVIKNHKPEQLVPPLTLNQAGCYTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 649

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE+G++  E++   
Sbjct: 650 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEDGVNGVEENEPL 709

Query: 721 KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKI 780
            E SDIE EK +++E     S +  NS  PA S    +  +S E P ED    NG++   
Sbjct: 710 NEESDIEYEKRESEEV----SNTSANSFIPAISGPEGT--ESLEIPIEDIMTLNGVNKDT 763

Query: 781 FDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDK 840
               RN  + VTPQLEDLID+AL LGSA+ SS  + +ET++ +  +E    ++ AT R+K
Sbjct: 764 QPDVRNNVSLVTPQLEDLIDKALELGSATASSKSYILETSKVNSVDEPCLDDKNATGREK 823

Query: 841 PYISKAERRKLKKGQGSSVVDPKVEREKERGK---DASSQPESIVRKTKIEGGKISRGQK 897
           PYISKAERRKLKKGQ SS  D  +++E E+ +   D+S+  ++ V   K+   KISRGQ+
Sbjct: 824 PYISKAERRKLKKGQNSSSTDGSIKQESEQPRDIDDSSNLLQNKVNNPKLGSVKISRGQR 883

Query: 898 GKLKKMKEKYGDQDEEERNIRMALLAVS 925
           GKLKKMKEKY DQDEEER+IRMALLA S
Sbjct: 884 GKLKKMKEKYADQDEEERSIRMALLASS 911


>gi|449485009|ref|XP_004157045.1| PREDICTED: nuclear export mediator factor NEMF homolog [Cucumis
           sativus]
          Length = 1090

 Score = 1342 bits (3474), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 685/928 (73%), Positives = 772/928 (83%), Gaps = 20/928 (2%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCL+RLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLKRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG +AHYV
Sbjct: 61  ESGVRLHTTEYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGASAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSEFTVLTLLRSHRDD+KGVAIMSRHRYPTEI RVFE+TTA+KL  A
Sbjct: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDNKGVAIMSRHRYPTEISRVFEKTTAAKLQEA 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S     +    V  +GNN ++  K+    QK  K+      S+K   DG+R+KQ TL
Sbjct: 181 LTLS-----DNIVNVTGNGNNETDPLKQQADNQKVSKT----SVSSKAQGDGSRSKQSTL 231

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VLGEALGYG ALSEHIIL+ GL+PNMKL   NKL+DN++  L+ AVA FEDWL+DVI 
Sbjct: 232 KAVLGEALGYGTALSEHIILNAGLIPNMKLCNDNKLDDNSLDCLMQAVANFEDWLEDVIF 291

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G  +PEGYILMQ K + K+   +E+ ++ +IYDEFCP+LLNQF SR++ KFETFDAALDE
Sbjct: 292 GTRIPEGYILMQKKDVKKEE--SEAATANEIYDEFCPILLNQFMSRKYTKFETFDAALDE 349

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIESQR+EQQ KAKE +A HKLNKI MDQ NRV  LKQEVD SVKMAELIEYNLEDV
Sbjct: 350 FYSKIESQRSEQQQKAKESSATHKLNKIRMDQGNRVELLKQEVDHSVKMAELIEYNLEDV 409

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           DA ILAVRVALA  MSWEDLARMVKEE+K+GNPVAGLIDKL LERNCM+LLLSNNLDEMD
Sbjct: 410 DAVILAVRVALAKGMSWEDLARMVKEEKKSGNPVAGLIDKLNLERNCMTLLLSNNLDEMD 469

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           D+EKT PV+KVEVD++LSAHANARRWYELKKKQESKQEKTITAH KAFKAAE+KTRLQ+ 
Sbjct: 470 DDEKTQPVDKVEVDISLSAHANARRWYELKKKQESKQEKTITAHEKAFKAAERKTRLQLS 529

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+LH
Sbjct: 530 QEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAELH 589

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASSTVIKNH+PEQ VPPLTLNQAGC+TVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 590 GASSTVIKNHKPEQLVPPLTLNQAGCYTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 649

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE+G++  E++   
Sbjct: 650 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEDGVNGVEENEPL 709

Query: 721 KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKI 780
            E SDIE EK +++E     S +  NS  PA S    +  +S E P ED    NG++   
Sbjct: 710 NEESDIEYEKRESEEV----SNTSANSFIPAISEPEGT--ESLEIPIEDIMTLNGVNKDT 763

Query: 781 FDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDK 840
               RN  + VTPQLEDLID+AL LGSA+ SS  + +ET++ +  +E    ++ AT R+K
Sbjct: 764 QPDVRNNVSLVTPQLEDLIDKALELGSATASSKSYILETSKVNSVDEPCLDDKNATGREK 823

Query: 841 PYISKAERRKLKKGQGSSVVDPKVEREKERGK---DASSQPESIVRKTKIEGGKISRGQK 897
           PYISKAERRKLKKGQ SS  D  +++E E+ +   D+S+  ++ V   K+   KISRGQ+
Sbjct: 824 PYISKAERRKLKKGQNSSSTDGSIKQESEQPRDIDDSSNLLQNKVNNPKLGSVKISRGQR 883

Query: 898 GKLKKMKEKYGDQDEEERNIRMALLAVS 925
           GKLKKMKEKY DQDEEER+IRMALLA S
Sbjct: 884 GKLKKMKEKYADQDEEERSIRMALLASS 911


>gi|357448763|ref|XP_003594657.1| Serologically defined colon cancer antigen-like protein [Medicago
           truncatula]
 gi|355483705|gb|AES64908.1| Serologically defined colon cancer antigen-like protein [Medicago
           truncatula]
          Length = 1146

 Score = 1321 bits (3419), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 686/955 (71%), Positives = 765/955 (80%), Gaps = 48/955 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDL+PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLTPKTYVFKLMNSSGMTESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG RLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRI+LFQFGLG NA+YV
Sbjct: 61  ESGARLHTTVYMRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIVLFQFGLGENANYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGN++LTDS FTVLTLLRSHRDDDKG+AIMSRHRYP E CRVFERTT +KL  A
Sbjct: 121 ILELYAQGNVILTDSSFTVLTLLRSHRDDDKGLAIMSRHRYPVESCRVFERTTTAKLQTA 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LTSSKE D +E  K N +G +VSN  KE  G +K GKS+                   TL
Sbjct: 181 LTSSKEDDNDEAVKANGNGTDVSNVEKEKQGSKKSGKSY------------------ATL 222

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +LGEALGYGPALSEH+ILD GL+PN K+S+    +D  +Q LV AVAKFEDW+QD+IS
Sbjct: 223 KIILGEALGYGPALSEHMILDAGLIPNEKVSKDKVWDDATVQALVQAVAKFEDWMQDIIS 282

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G+IVPEGYILMQNK LGKD   ++  S  QIYDEFCP+LLNQF+SR+  KFETFD ALDE
Sbjct: 283 GEIVPEGYILMQNKVLGKDSSVSQPESLKQIYDEFCPILLNQFKSRDHTKFETFDLALDE 342

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ----------ENRVHTLKQEVDRSVKMA 410
           FYSKIESQR+EQQH AKE++A  KLNKI  DQ          ENRVHTL++E D  +KMA
Sbjct: 343 FYSKIESQRSEQQHTAKENSALQKLNKIRNDQVGTHVQTSTIENRVHTLRKEADNCIKMA 402

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           ELIEYNLEDVDAAILAVRV+LA  MSW+DLARMVKEE+KAGNPVAGLIDKL+LERNCM+L
Sbjct: 403 ELIEYNLEDVDAAILAVRVSLAKGMSWDDLARMVKEEKKAGNPVAGLIDKLHLERNCMTL 462

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           LLSNNLDEMDD+EKTLP +KVEVDLALSAHANARRWYELKKKQESKQEKTITAH KAFKA
Sbjct: 463 LLSNNLDEMDDDEKTLPADKVEVDLALSAHANARRWYELKKKQESKQEKTITAHEKAFKA 522

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           AE+KTRLQ+ QEKTVA+ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK
Sbjct: 523 AERKTRLQLNQEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 582

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GD+YVHA+LHGASSTVIKNH+P QPVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQ
Sbjct: 583 GDLYVHAELHGASSTVIKNHKPMQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQ 642

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           VSKTAPTGEYLTVGSFMIRGKKN+LPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 
Sbjct: 643 VSKTAPTGEYLTVGSFMIRGKKNYLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEET 702

Query: 711 MDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPN--SAHPAPSH-------------T 755
           +DD  ++G  +E SD ESEK+  D +  A+S    N  +  P PS               
Sbjct: 703 IDDNVETGPVEEQSDSESEKNVADGETAADSERNGNLSADSPIPSEDLLADTSQTSLAAI 762

Query: 756 NASNVDSHEFPAEDKTISNGIDS-KIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTK 814
           NA    S +F A+D +  N +DS K+ D + N  A V+PQLE+++DRALGLGS + S+  
Sbjct: 763 NAKTTVSDDFSAKDPSTKNMLDSEKLSDFSGNGLASVSPQLEEILDRALGLGSVAKSNKS 822

Query: 815 HGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPK--VEREKERGK 872
           +  E TQ DLS E+ +      VRDKPYISKAERRKLK         P     ++K + K
Sbjct: 823 YEAENTQLDLSSENHNESSKPAVRDKPYISKAERRKLKNEPKHGEAHPSDGNGKDKSKLK 882

Query: 873 DASSQPESI-VRKTKIEGG-KISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           D S    +      K  GG KISRGQKGKLKKMKEKY DQDEEER+IRM+LLA S
Sbjct: 883 DISGDLHAKDAENLKTGGGKKISRGQKGKLKKMKEKYADQDEEERSIRMSLLASS 937


>gi|356529076|ref|XP_003533123.1| PREDICTED: nuclear export mediator factor Nemf-like [Glycine max]
          Length = 1131

 Score = 1301 bits (3368), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 683/951 (71%), Positives = 771/951 (81%), Gaps = 51/951 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVR+NTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGV+ESGESEKVLLLM
Sbjct: 1   MVKVRLNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVSESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG NA+YV
Sbjct: 61  ESGVRLHTTLYLRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGENANYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDS FTV+TLLRSHRDDDKG+AIMSRHRYP E CRVFERTT  KL  +
Sbjct: 121 ILELYAQGNILLTDSTFTVMTLLRSHRDDDKGLAIMSRHRYPVESCRVFERTTIEKLRTS 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L SSKE D ++  K + +G+N SN +KE  G  KGGKS                    TL
Sbjct: 181 LVSSKEDDNDDAVKADGNGSNASNVAKEKQGTHKGGKS------------------SATL 222

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VLGEALGYGPALSEHI+LD GL+P+ K+ +    +D  +Q LV AV +FEDW+QDVIS
Sbjct: 223 KIVLGEALGYGPALSEHILLDAGLIPSTKVPKDRTWDDATVQALVQAVVRFEDWMQDVIS 282

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G++VPEGYILMQNK++GKD   ++ GS +Q+YDEFCP+LLNQF+SR++ KFETFDAALDE
Sbjct: 283 GELVPEGYILMQNKNMGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDE 342

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIESQR+EQQ KAKE++A  KLN+I  DQENRVH L++E D  VKMAELIEYNLEDV
Sbjct: 343 FYSKIESQRSEQQQKAKENSASQKLNRIRQDQENRVHALRKEADHCVKMAELIEYNLEDV 402

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           DAAILAVRVALA  M+W+DLARMVKEE+KAGNPVAGLIDKL+L+RNCM+LLLSNNLDEMD
Sbjct: 403 DAAILAVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLDRNCMTLLLSNNLDEMD 462

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           D+EKTLPV+KVEVDLALSAHANARRWYE KKKQESKQ KT+TAH KAFKAAE+KTRLQ+ 
Sbjct: 463 DDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQGKTVTAHEKAFKAAERKTRLQLN 522

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           QEKTVA+ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADLH
Sbjct: 523 QEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLH 582

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASSTVIKNH+P QPVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 583 GASSTVIKNHKPAQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 642

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE  DD+E++G  
Sbjct: 643 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEAADDYEETGPL 702

Query: 721 KENSDIESEKDDTDEKPVAE-----SLS------VPNSAHPAPSHTNASNVD-----SHE 764
           ++ SD ESEKD TD +P  +     +LS      +P      PS T+ +  D     S +
Sbjct: 703 EDKSDSESEKDVTDIEPATDLERNGNLSADSHKPLPEDFPADPSQTSLATTDAETAISQD 762

Query: 765 FPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDL 824
           FPA++ +  N +D +I              LE+L+D+AL LG  + SS K+GIE +Q DL
Sbjct: 763 FPAKETSTLNMVDREILS-----------DLEELLDQALELGPVAKSSKKYGIEKSQIDL 811

Query: 825 SEEDKHVERTAT-VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERG--KDASSQ-PES 880
             E +H E+T T VR+KPYISKAERRKLKK Q     D  VE  K+    KD S+  P  
Sbjct: 812 DTE-QHFEQTKTAVREKPYISKAERRKLKKEQKPGEEDSNVEHGKDESKLKDISANLPVK 870

Query: 881 IVRKTKIEGG-KISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVSTLTCT 930
             +  K  GG KISRGQKGKLKK+KEKY DQDEEER+IRM LLA S  + T
Sbjct: 871 EDQNLKKGGGQKISRGQKGKLKKIKEKYADQDEEERSIRMTLLASSGKSIT 921


>gi|297795761|ref|XP_002865765.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
 gi|297311600|gb|EFH42024.1| EMB1441 [Arabidopsis lyrata subsp. lyrata]
          Length = 1080

 Score = 1300 bits (3363), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 664/933 (71%), Positives = 747/933 (80%), Gaps = 49/933 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +
Sbjct: 121 ILELYAQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQES 180

Query: 181 LT--SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           LT  S K+ +A + ++            KE  GG+KGGKS           ND   AKQ 
Sbjct: 181 LTAFSLKDHEAKQIER------------KEQNGGKKGGKS-----------NDSTGAKQY 217

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TLK +LG+ALGYGP LSEHIILD GL+P  KLSE  KL+DN IQ+LV AV  FEDWL+D+
Sbjct: 218 TLKNILGDALGYGPQLSEHIILDAGLIPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDI 277

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           I+G  VPEGYILMQ + L  D  P+ESG   ++YDEFC +LLNQF+SR + KFETFDAAL
Sbjct: 278 INGQKVPEGYILMQKQILAND-TPSESGGVKKMYDEFCSILLNQFKSRVYEKFETFDAAL 336

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYSKIESQR+EQQ KAKED+A  KLNKI  DQENRV  LK+EV+  V MAELIEYNLE
Sbjct: 337 DEFYSKIESQRSEQQQKAKEDSASQKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNLE 396

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
           DVDAAILAVRVALA  M W+DLARMVKEE+K GNPVAGLIDKLYLE+NCM+LLL NNLDE
Sbjct: 397 DVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGLIDKLYLEKNCMTLLLCNNLDE 456

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           MDD+EKTLPVEKVEVDL+LSAH NARRWYE+KKKQE+KQEKT++AH KAF+AAEKKTR Q
Sbjct: 457 MDDDEKTLPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTVSAHEKAFRAAEKKTRHQ 516

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           + QEK VA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+
Sbjct: 517 LSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAE 576

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           LHGASSTVIKNH+PEQ VPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQV+KTAPTG
Sbjct: 577 LHGASSTVIKNHKPEQNVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVTKTAPTG 636

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSG 718
           EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG+HLNERRVRGEEEGM+D     
Sbjct: 637 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGAHLNERRVRGEEEGMNDVVMET 696

Query: 719 HH-KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGID 777
           H   E+SD+ESE +      V E++S         S T  S  D+  F           D
Sbjct: 697 HAPDEHSDVESENE-----AVNEAVSASGEVDLEESSTILSQ-DTSSF-----------D 739

Query: 778 SKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATV 837
                IA       T QLEDL+DR LGLG+A+++  K  IET++ ++ E+    E+ A V
Sbjct: 740 MNSSGIAEENVESATSQLEDLLDRTLGLGAATVAGKKDTIETSKDEMEEKMTQEEKKAVV 799

Query: 838 RDKPYISKAERRKLKKGQ-GSSVVDPKVEREKE--RGKDAS--SQPESIVRKTKIEGGKI 892
           RDKPY+SKAERRKLK GQ G++ VD    +EK+  + KD S  SQ    +   K  G K+
Sbjct: 800 RDKPYMSKAERRKLKMGQSGNTAVDGNTGQEKQQRKEKDVSSLSQANKSIPDNKPAGEKV 859

Query: 893 SRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           SRGQ+GKLKKMKEKY DQDE+ER IRMALLA S
Sbjct: 860 SRGQRGKLKKMKEKYADQDEDERKIRMALLASS 892


>gi|356558107|ref|XP_003547349.1| PREDICTED: nuclear export mediator factor NEMF homolog [Glycine
           max]
          Length = 1119

 Score = 1299 bits (3361), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 695/948 (73%), Positives = 778/948 (82%), Gaps = 43/948 (4%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGV+ESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVSESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT Y RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG NA+YV
Sbjct: 61  ESGVRLHTTLYMRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGENANYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDS FTV+TLLRSHRDDDKG+AIMSRHRYP E CRVFERTT  KL  +
Sbjct: 121 ILELYAQGNILLTDSTFTVMTLLRSHRDDDKGLAIMSRHRYPVESCRVFERTTIEKLRTS 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L SSKE DA+E  K N +G+N SN +KE    +KGGKS                    TL
Sbjct: 181 LVSSKEDDADEAVKANGNGSNASNVAKEKQETRKGGKS------------------SATL 222

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VLGEALGYGPALSEHIILD GL+P+ K+ +    +D  +Q LV AV KFEDW+QDVIS
Sbjct: 223 KIVLGEALGYGPALSEHIILDAGLIPSTKVPKDRTWDDATVQALVQAVVKFEDWMQDVIS 282

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G+IVPEGYILMQNK+LGKD   ++ GS +Q+YDEFCP+LLNQF+SR++ KFETFDAALDE
Sbjct: 283 GEIVPEGYILMQNKNLGKDSSISQPGSVSQMYDEFCPILLNQFKSRDYTKFETFDAALDE 342

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIESQRAEQQ K+KE++A  KLNKI  DQENRVH L++E D  VKMAELIEYNLEDV
Sbjct: 343 FYSKIESQRAEQQQKSKENSAAQKLNKIRQDQENRVHVLRKEADHCVKMAELIEYNLEDV 402

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           DAAILAVRVALA  M+W+DLARMVKEE+KAGNPVAGLIDKL+LERNCM+LLLSNNLDEMD
Sbjct: 403 DAAILAVRVALAKGMNWDDLARMVKEEKKAGNPVAGLIDKLHLERNCMNLLLSNNLDEMD 462

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           D+EKTLPV+KVEVDLALSAHANARRWYE KKKQESKQEKT+TAH KAFKAAE+KTRLQ+ 
Sbjct: 463 DDEKTLPVDKVEVDLALSAHANARRWYEQKKKQESKQEKTVTAHEKAFKAAERKTRLQLN 522

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           QEKTVA+ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE+IVKRYMSKGD+YVHADLH
Sbjct: 523 QEKTVASISHMRKVHWFEKFNWFISSENYLVISGRDAQQNELIVKRYMSKGDLYVHADLH 582

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASSTVIKNH+P QPVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 583 GASSTVIKNHKPAQPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 642

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE  DD+E++G  
Sbjct: 643 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEAADDYEETGPL 702

Query: 721 KENSDIESEKDDTDEKPVAES-----LSVPNSAHPAP------------SHTNASNVDSH 763
           +  SD E EKD TD K   +S     LS  +S  P P            +  NA    S 
Sbjct: 703 EGKSDSEFEKDVTDIKSATDSERNDNLSA-DSHKPLPEDFPADASQTSLATINAETAISQ 761

Query: 764 EFPAEDKTISNGIDSKIF-DIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQF 822
           +FPA++ +  N +D +I  D++ N  A VTPQLE+L+D+ L LG  + S+ K+GIE +Q 
Sbjct: 762 DFPAKETSTLNVVDREILSDVSGNGLASVTPQLEELLDQVLELGPIAKSNKKYGIEKSQI 821

Query: 823 DLSEEDKHVERTAT-VRDKPYISKAERRKLKKGQGSSVVDPKVEREK--ERGKDASSQPE 879
           DL  E +++E++ T VRDKPYISKAERRKLKK Q     D  VE  K   + KD S+  +
Sbjct: 822 DLDTE-QYLEQSKTAVRDKPYISKAERRKLKKEQKHGEEDLNVEHGKYESKLKDISANLQ 880

Query: 880 SIVRKTKIEGG--KISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           +   +   +GG  KISRGQKGKLKK+KEKY DQDEEER+IRMALLA S
Sbjct: 881 AKEDQNLKKGGGQKISRGQKGKLKKIKEKYADQDEEERSIRMALLASS 928


>gi|15240582|ref|NP_199804.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
 gi|8777424|dbj|BAA97014.1| unnamed protein product [Arabidopsis thaliana]
 gi|332008489|gb|AED95872.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana]
          Length = 1080

 Score = 1293 bits (3347), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 659/933 (70%), Positives = 743/933 (79%), Gaps = 49/933 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +
Sbjct: 121 ILELYAQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQES 180

Query: 181 LTSS--KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           LT+   K+ DA + +             KE  GG+KGGKS           ND   AKQ 
Sbjct: 181 LTAFVLKDHDAKQIE------------PKEQNGGKKGGKS-----------NDSTGAKQY 217

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TLK +LG+ALGYGP LSEHIILD GLVP  KLSE  KL+DN IQ+LV AV  FEDWL+D+
Sbjct: 218 TLKNILGDALGYGPQLSEHIILDAGLVPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDI 277

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           I+G  VPEGYILMQ + L  D   +ESG   ++YDEFC +LLNQF+SR + KFETFDAAL
Sbjct: 278 INGQKVPEGYILMQKQILAND-TTSESGGVKKMYDEFCSILLNQFKSRVYEKFETFDAAL 336

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYSKIESQR+EQQ KAKED+A  KLNKI  DQENRV  LK+EV+  V MAELIEYNLE
Sbjct: 337 DEFYSKIESQRSEQQQKAKEDSASLKLNKIRQDQENRVQILKKEVNHCVNMAELIEYNLE 396

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
           DVDAAILAVRVALA  M W+DLARMVKEE+K GNPVAG+ID+LYLE+NCM+LLL NNLDE
Sbjct: 397 DVDAAILAVRVALAKGMGWDDLARMVKEEKKLGNPVAGVIDRLYLEKNCMTLLLCNNLDE 456

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           MDD+EKT+PVEKVEVDL+LSAH NARRWYE+KKKQE+KQEKT++AH KAF+AAEKKTR Q
Sbjct: 457 MDDDEKTVPVEKVEVDLSLSAHGNARRWYEMKKKQETKQEKTVSAHEKAFRAAEKKTRHQ 516

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           + QEK VA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+YVHA+
Sbjct: 517 LSQEKVVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYVHAE 576

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           LHGASSTVIKNH+PEQ VPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQV+KTAPTG
Sbjct: 577 LHGASSTVIKNHKPEQNVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVTKTAPTG 636

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSG 718
           EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG+HLNERRVRGEEEGM+D     
Sbjct: 637 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGAHLNERRVRGEEEGMNDVVMET 696

Query: 719 HH-KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGID 777
           H   E+SD ESE +  +E   A                 +  VD  E        ++ +D
Sbjct: 697 HAPDEHSDTESENEAVNEVVSA-----------------SGEVDLQESSTALSQDTSSLD 739

Query: 778 SKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATV 837
                I     A  T QLEDL+DR LGLG+A+++  K  IET++ D+ E+ K  E+ A V
Sbjct: 740 MSSSGITEENVASATSQLEDLLDRTLGLGAATVAGKKDTIETSKDDMEEKMKQEEKNAVV 799

Query: 838 RDKPYISKAERRKLKKGQ-GSSVVDPKVEREKE--RGKDAS--SQPESIVRKTKIEGGKI 892
           RDKPY+SKAERRKLK GQ G++  D    +EK+  + KD S  SQ    +   K  G K+
Sbjct: 800 RDKPYMSKAERRKLKMGQSGNTAADGNTGQEKQQRKEKDVSSLSQATKSIPDNKPAGEKV 859

Query: 893 SRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           SRGQ+GKLKKMKEKY DQDE+ER IRMALLA S
Sbjct: 860 SRGQRGKLKKMKEKYADQDEDERKIRMALLASS 892


>gi|296083204|emb|CBI22840.3| unnamed protein product [Vitis vinifera]
          Length = 993

 Score = 1264 bits (3272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 612/781 (78%), Positives = 670/781 (85%), Gaps = 39/781 (4%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FKLMNSSGVTESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKLMNSSGVTESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK  TPSGFTLKLRKHIRTRRLEDVRQLGYDR++LFQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSMTPSGFTLKLRKHIRTRRLEDVRQLGYDRVVLFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSEF V+TLLRSHRDDDKGVAIMSRHRYP EICRVFERT  +KL AA
Sbjct: 121 ILELYAQGNILLTDSEFMVMTLLRSHRDDDKGVAIMSRHRYPVEICRVFERTATTKLQAA 180

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LTS KE ++NE  +    GNN            KG KS + SKN+N    DGARAKQ TL
Sbjct: 181 LTSPKESESNEAKQ----GNN------------KGVKSSEPSKNTN----DGARAKQATL 220

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           KTVLGEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +V KFE+WL+DVIS
Sbjct: 221 KTVLGEALGYGPALSEHIILDAGLIPNTKVTKDSKFDIDTIQRLAQSVTKFENWLEDVIS 280

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           GD VPEGYILMQNK  GKD PP++    +QIYDEFCP+LLNQF+SREFVKFETFDAALDE
Sbjct: 281 GDQVPEGYILMQNKIFGKDCPPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAALDE 340

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIESQR+EQQ KAKE +A  KL KI +DQENRVHTLK+EVD  +KMAELIEYNLEDV
Sbjct: 341 FYSKIESQRSEQQQKAKEGSAMQKLTKIRVDQENRVHTLKKEVDHCIKMAELIEYNLEDV 400

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           DAAILAVRVALAN M+WEDLARMVKEE+K+GNPVAGLIDKLYLERNCM+LLLSNNLDEMD
Sbjct: 401 DAAILAVRVALANGMNWEDLARMVKEEKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEMD 460

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           D+EKTLPV+KVEVDLALSAHANARRWYE KK+QE+KQEKT+ AH KAFKAAEKKTRLQ+ 
Sbjct: 461 DDEKTLPVDKVEVDLALSAHANARRWYEQKKRQENKQEKTVIAHEKAFKAAEKKTRLQLS 520

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           QEKTVA ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADLH
Sbjct: 521 QEKTVATISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLH 580

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASSTVIKNH+PE PVPPLTLNQAGCFTVCHSQAWDSK+VTSAWWVYPHQVSKTAPTGEY
Sbjct: 581 GASSTVIKNHKPEHPVPPLTLNQAGCFTVCHSQAWDSKIVTSAWWVYPHQVSKTAPTGEY 640

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG  DFE++   
Sbjct: 641 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGAQDFEENESL 700

Query: 721 KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKI 780
           K NSD                    +SAH   + +N  +++  E P E++ + NG D K 
Sbjct: 701 KGNSD-------------------SDSAHNELTTSNVGSINLPEVPLEERNMLNGNDKKP 741

Query: 781 F 781
           +
Sbjct: 742 Y 742



 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 59/84 (70%), Gaps = 18/84 (21%)

Query: 840 KPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGK 899
           KPYISKAERRKLKKGQ +S  D  V+         +SQP          GGKISRGQKGK
Sbjct: 740 KPYISKAERRKLKKGQKTSTSDADVK---------NSQP---------AGGKISRGQKGK 781

Query: 900 LKKMKEKYGDQDEEERNIRMALLA 923
           LKKMKEKY DQDEEER+IRMALLA
Sbjct: 782 LKKMKEKYADQDEEERSIRMALLA 805


>gi|242085896|ref|XP_002443373.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
 gi|241944066|gb|EES17211.1| hypothetical protein SORBIDRAFT_08g018400 [Sorghum bicolor]
          Length = 1158

 Score = 1204 bits (3116), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 613/957 (64%), Positives = 741/957 (77%), Gaps = 41/957 (4%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK RM T DVAAEVKCLRRLIGMR +NVYD++PKTY+FKLMNSSG+TESGESE+VLLLM
Sbjct: 1   MVKARMTTTDVAAEVKCLRRLIGMRLANVYDITPKTYLFKLMNSSGITESGESERVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR HTT Y RDK  TPSGFTLKLRKHIR +RLEDVR LGYDRIILFQFGLG NAH++
Sbjct: 61  ESGVRFHTTQYVRDKSTTPSGFTLKLRKHIRNKRLEDVRMLGYDRIILFQFGLGSNAHFI 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSE+TV+TLLRSHRDD+KG+AIMSRHRYP E+CRVF RT  +KL   
Sbjct: 121 ILELYAQGNILLTDSEYTVMTLLRSHRDDNKGLAIMSRHRYPVEVCRVFVRTDFAKLKDM 180

Query: 181 LT-----------SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKN-SNKN 228
           LT           +S   DA EP +   D   ++  S+++L  ++   +    ++ SN  
Sbjct: 181 LTMPDKADDKEEITSGSTDAQEPSQSTNDEVLITEISEKSLSRKEKKAAAKAKQSGSNAK 240

Query: 229 SNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVL 286
           +N+G ++ + TLKT+LGEAL YGPAL+EHIILD GLVP+ K+ +   + ++D+ +Q L+ 
Sbjct: 241 ANNGVQSNKATLKTILGEALAYGPALAEHIILDAGLVPSTKVGKDPESTVDDSTVQALME 300

Query: 287 AVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH--PPTESGSSTQIYDEFCPLLLNQFR 344
           ++ +FEDWL D+ISG  +PEGYILMQNK   K +  P  E+ ++ +IYDE+CP+LLNQF+
Sbjct: 301 SITRFEDWLVDIISGQRIPEGYILMQNKLTAKKNLTPSEEASTNHKIYDEYCPILLNQFK 360

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           SRE+ +F TFDAALDEFYSKIESQ+  QQ KAKE++A  +LNKI +DQENRVHTL++EVD
Sbjct: 361 SREYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENRVHTLRKEVD 420

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             VKMAELIEYNLEDVDAAILAVRV+LAN MSWE L RM+KEERKAGNPVAGLIDKL  E
Sbjct: 421 HCVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVAGLIDKLNFE 480

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
           RNC++LLLSNNLD+MD++EKT PVEKVEVD+ALSAHANARRWYE+KKKQESKQEKTITAH
Sbjct: 481 RNCITLLLSNNLDDMDEDEKTAPVEKVEVDIALSAHANARRWYEMKKKQESKQEKTITAH 540

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+IV
Sbjct: 541 EKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNELIV 600

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           KRYMSKGD+YVHA+LHGASST+IKNH+P+ P+PPLTLNQAGCFTVCHS+AWDSK+VTSAW
Sbjct: 601 KRYMSKGDLYVHAELHGASSTIIKNHKPDTPIPPLTLNQAGCFTVCHSKAWDSKIVTSAW 660

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNERRV
Sbjct: 661 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNERRV 720

Query: 705 RGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHE 764
           RGE+E + + E     K+++    E+  +DE    E+       H   S  N    +S E
Sbjct: 721 RGEDEALQEMEAESRKKQSNPESDEEIGSDEGANKET-------HEDESSGNIGTANSPE 773

Query: 765 FP--AEDKTISNGID---------SKIFDIARNVA------APVTPQLEDLIDRALGLGS 807
            P    ++++ NG             + D   +++      A V+ QL+DL+D+ L LG 
Sbjct: 774 LPEIQAEESLDNGSSISKEETIQAEDLLDNGSSISKEETIEASVSSQLDDLLDKTLRLGP 833

Query: 808 ASISSTKHGIETTQFDLSEEDKHVE-RTATVRDKPYISKAERRKLKKGQGSSVVDPKVER 866
           A +S     + +    L+E+D  +E +  T+RDKPYISKAERRKLKKGQ +       + 
Sbjct: 834 AKVSGKSSLLTSVPSSLAEDDDDLELKRPTIRDKPYISKAERRKLKKGQVNGETATDSQN 893

Query: 867 EKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
            ++  +   SQ E     T+    K+SRGQKGKLKK+KEKY +QDEEER IRMALL+
Sbjct: 894 GEKLSQPGYSQQEKGKGSTQAANAKVSRGQKGKLKKIKEKYAEQDEEEREIRMALLS 950


>gi|115489110|ref|NP_001067042.1| Os12g0564600 [Oryza sativa Japonica Group]
 gi|108862839|gb|ABA98970.2| zinc knuckle family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113649549|dbj|BAF30061.1| Os12g0564600 [Oryza sativa Japonica Group]
          Length = 1159

 Score = 1204 bits (3115), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 638/957 (66%), Positives = 748/957 (78%), Gaps = 38/957 (3%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK RM TADVAAEVKCLRRLIGMR SNVY ++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1   MVKARMTTADVAAEVKCLRRLIGMRLSNVYGITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT Y RDK  TPSGFTLKLRKHIR++RLEDVR LGYDRIILFQFGLG NAH+V
Sbjct: 61  ESGVRLHTTQYVRDKSTTPSGFTLKLRKHIRSKRLEDVRMLGYDRIILFQFGLGSNAHFV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSE+TVLTLLRSHRDD+KG+AIMSRHRYP E CRVFERT  +KL   
Sbjct: 121 ILELYAQGNILLTDSEYTVLTLLRSHRDDNKGLAIMSRHRYPVEACRVFERTDFTKLKDT 180

Query: 181 L---------TSSKEP---DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
           L         +S   P   DA EP     DG  V++ S+E      G KS   +K S+ N
Sbjct: 181 LMMNAVDDKESSQVTPGSIDAQEPSVTPSDGVPVTDKSEEP-STTTGKKSASKNKQSSSN 239

Query: 229 S--NDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVL 284
           +  ++ A + + TLKT+LGEAL YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ L
Sbjct: 240 AKASNNAPSNKSTLKTLLGEALAYGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSL 299

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQ 342
           V +++KFEDWL DV+SG  +PEGYILMQNK   K +  P E  S++Q IYDE+CP+LLNQ
Sbjct: 300 VESISKFEDWLVDVMSGQRIPEGYILMQNKAAAKKNLTPLEGSSASQKIYDEYCPVLLNQ 359

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
           F+SREF +FETFDAALDEFYSKIESQR  QQ K+KED+A  +LNKI +DQENRVHTL++E
Sbjct: 360 FKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRVHTLRKE 419

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
           VD S+KMAELIEYNLEDVDAAI+AVRV+LAN MSW+ LARM+KEE+KAGNPVAGLIDKL 
Sbjct: 420 VDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAGLIDKLS 479

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            ERNC++LLLSNNLD+MD+EEKT PVEKVEVDL+LSAHANARRWYELKKKQESKQEKT+T
Sbjct: 480 FERNCITLLLSNNLDDMDEEEKTAPVEKVEVDLSLSAHANARRWYELKKKQESKQEKTVT 539

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           AH KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+
Sbjct: 540 AHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNEL 599

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           IVKRYMSKGD+YVHA+LHGASST+IKNH+P+ P+PPLTLNQAG FTVCHS+AWDSK+VTS
Sbjct: 600 IVKRYMSKGDLYVHAELHGASSTIIKNHKPDNPIPPLTLNQAGSFTVCHSKAWDSKIVTS 659

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           AWWVYP+QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNER
Sbjct: 660 AWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNER 719

Query: 703 RVRGE-EEGMDDFEDSGHHKE-------NSDIESEKDDTDEKPVAESLSVPNSAHPAPSH 754
           RVRGE EE + D E              +SD E+ K+  D++   ++++V    +P PS+
Sbjct: 720 RVRGEDEEALPDVESQKLESNAELDGELDSDSETGKEKHDDESSLDNINVKKIDNPIPSN 779

Query: 755 TN--ASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISS 812
                 N DS E  +E +T+ N   S      +     V+ QLEDL+D+ LGLG   +  
Sbjct: 780 APYVKDNADSSEQLSEIRTVVNSTTST--SKGQTSDRTVSSQLEDLLDKNLGLGPTKVLG 837

Query: 813 TKHGIETTQFDLSEE-DKHVERTATVRDKPYISKAERRKLKKGQ--GSSVVD-PKVEREK 868
               + +    ++++ D    +  +VRDKPYISKA+RRKLKKGQ  G S  D P  E  K
Sbjct: 838 RSSLLSSNSASVADDIDDLDTKKTSVRDKPYISKADRRKLKKGQNVGDSTSDSPNGEAAK 897

Query: 869 ERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
              K  +SQ E      K    K+SRGQKGKLKK+KEKYG+QDEEER IRMALLA S
Sbjct: 898 ---KPVNSQQEKGKTIEKPANPKVSRGQKGKLKKIKEKYGEQDEEEREIRMALLASS 951


>gi|125579741|gb|EAZ20887.1| hypothetical protein OsJ_36526 [Oryza sativa Japonica Group]
          Length = 1176

 Score = 1192 bits (3085), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 637/974 (65%), Positives = 748/974 (76%), Gaps = 55/974 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK RM TADVA+EVKCLRRLIGMR SNVY ++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1   MVKARMTTADVASEVKCLRRLIGMRLSNVYGITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT Y RDK  TPSGFTLKLRKHIR++RLEDVR LGYDRIILFQFGLG NAH+V
Sbjct: 61  ESGVRLHTTQYVRDKSTTPSGFTLKLRKHIRSKRLEDVRMLGYDRIILFQFGLGSNAHFV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSE+TVLTLLRSHRDD+KG+AIMSRHRYP E CRVFERT  +KL   
Sbjct: 121 ILELYAQGNILLTDSEYTVLTLLRSHRDDNKGLAIMSRHRYPVEACRVFERTDFTKLKDT 180

Query: 181 L---------TSSKEP---DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
           L         +S   P   DA EP     DG  V++ S+E      G KS   +K S+ N
Sbjct: 181 LMMNAVDDKESSQVTPGSIDAQEPSVTPSDGVPVTDKSEEP-STTTGKKSASKNKQSSSN 239

Query: 229 S--NDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVL 284
           +  ++ A + + TLKT+LGEAL YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ L
Sbjct: 240 AKASNNAPSNKSTLKTLLGEALAYGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSL 299

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQ 342
           V +++KFEDWL DV+SG  +PEGYILMQNK   K +  P E  S++Q IYDE+CP+LLNQ
Sbjct: 300 VESISKFEDWLVDVMSGQRIPEGYILMQNKAAAKKNLTPLEGSSASQKIYDEYCPVLLNQ 359

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
           F+SREF +FETFDAALDEFYSKIESQR  QQ K+KED+A  +LNKI +DQENRVHTL++E
Sbjct: 360 FKSREFNEFETFDAALDEFYSKIESQRVNQQQKSKEDSAAQRLNKIKLDQENRVHTLRKE 419

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
           VD S+KMAELIEYNLEDVDAAI+AVRV+LAN MSW+ LARM+KEE+KAGNPVAGLIDKL 
Sbjct: 420 VDHSIKMAELIEYNLEDVDAAIVAVRVSLANGMSWDALARMIKEEKKAGNPVAGLIDKLS 479

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            ERNC++LLLSNNLD+MD+EEKT PVEKVEVDL+LSAHANARRWYELKKKQESKQEKT+T
Sbjct: 480 FERNCITLLLSNNLDDMDEEEKTAPVEKVEVDLSLSAHANARRWYELKKKQESKQEKTVT 539

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           AH KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+
Sbjct: 540 AHEKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNEL 599

Query: 583 IVKRYMSKGDV-----------------YVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           IVKRYMSKGD+                 YVHA+LHGASST+IKNH+P+ P+PPLTLNQAG
Sbjct: 600 IVKRYMSKGDLSLRFSRKLLVYFASLDSYVHAELHGASSTIIKNHKPDNPIPPLTLNQAG 659

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
            FTVCHS+AWDSK+VTSAWWVYP+QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG
Sbjct: 660 SFTVCHSKAWDSKIVTSAWWVYPYQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFG 719

Query: 686 LLFRLDESSLGSHLNERRVRGE-EEGMDDFEDSGHHKE-------NSDIESEKDDTDEKP 737
           +LFRLDESSL SHLNERRVRGE EE + D E              +SD E+ K+  D++ 
Sbjct: 720 ILFRLDESSLASHLNERRVRGEDEEALPDVESQKLESNAELDGELDSDSETGKEKHDDES 779

Query: 738 VAESLSVPNSAHPAPSHTN--ASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQL 795
             ++++V    +P PS+      N DS E  +E +T+ N   S      +     V+ QL
Sbjct: 780 SLDNINVKKIDNPIPSNAPYVKDNADSSEQLSEIRTVVNSTTST--SKGQTSDRTVSSQL 837

Query: 796 EDLIDRALGLGSASISSTKHGIETTQFDLSEE-DKHVERTATVRDKPYISKAERRKLKKG 854
           EDL+D+ LGLG   +      + +    ++++ D    +  +VRDKPYISKA+RRKLKKG
Sbjct: 838 EDLLDKNLGLGPTKVLGRSSLLSSNSASVADDIDDLDTKKTSVRDKPYISKADRRKLKKG 897

Query: 855 Q--GSSVVD-PKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQD 911
           Q  G S  D P  E  K   K  +SQ E      K    K+SRGQKGKLKK+KEKYG+QD
Sbjct: 898 QNVGDSTSDSPNGEAAK---KPVNSQQEKGKTIEKPANPKVSRGQKGKLKKIKEKYGEQD 954

Query: 912 EEERNIRMALLAVS 925
           EEER IRMALLA S
Sbjct: 955 EEEREIRMALLASS 968


>gi|357161759|ref|XP_003579195.1| PREDICTED: nuclear export mediator factor Nemf-like [Brachypodium
           distachyon]
          Length = 1163

 Score = 1184 bits (3064), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 614/958 (64%), Positives = 745/958 (77%), Gaps = 37/958 (3%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK RM TADVAAEVKCLRRLIGMR SNVYD++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1   MVKARMTTADVAAEVKCLRRLIGMRLSNVYDITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT Y RDK  TPSGFTLKLRKH+R++RLEDVR LGYDR+ILFQFGLG NAH++
Sbjct: 61  ESGVRLHTTQYVRDKSTTPSGFTLKLRKHVRSKRLEDVRMLGYDRMILFQFGLGSNAHFI 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNI+LTDSE+TV+TLLRSHRDD+KG+AIMSRHRYP E CR FERT  +KL   
Sbjct: 121 ILELYAQGNIILTDSEYTVMTLLRSHRDDNKGLAIMSRHRYPVEACRTFERTDFTKLKDT 180

Query: 181 L-------------TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSK-NSN 226
           L              +    D++EP +   DG  V++  +E     +   +  + +  SN
Sbjct: 181 LKLSNTVDGEDSSQVTPNSADSHEPSESVNDGVPVTDKLEEPSNRTEKKSAVKIKQPGSN 240

Query: 227 KNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVL 284
             +++G ++ + TLKT+LGEAL YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ L
Sbjct: 241 AKASNGTQSNKSTLKTLLGEALAYGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSL 300

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQ 342
           V +V +FEDWL D+ISG  +PEGYILMQNK   K +  P+E  S+ Q IYDE+CP+LL Q
Sbjct: 301 VESVTRFEDWLVDIISGQRIPEGYILMQNKMSAKKNITPSEVSSTNQKIYDEYCPILLKQ 360

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
           F++RE+ +FETFDAALDEFYSKIESQR  QQ KAKED+A  +LNKI +DQENRVHTL++E
Sbjct: 361 FKAREYDEFETFDAALDEFYSKIESQRVNQQQKAKEDSAVQRLNKIKLDQENRVHTLRKE 420

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
            D  +KMAELIEYNLEDVDAAI+AVRV+LAN MSWE LARM+KEER+AGNPVAGLIDKL 
Sbjct: 421 ADHCIKMAELIEYNLEDVDAAIVAVRVSLANGMSWEALARMIKEERRAGNPVAGLIDKLS 480

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            E NC++LLLSNNLD+MD++EKT PVEKVEVDL+LSAHANARRWYE+KKKQE+KQEKTIT
Sbjct: 481 FENNCITLLLSNNLDDMDEDEKTAPVEKVEVDLSLSAHANARRWYEMKKKQETKQEKTIT 540

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           AH KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL++SGRDAQQNE+
Sbjct: 541 AHDKAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIVSGRDAQQNEL 600

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           +VKRYMSKGD+YVHA+LHGASST+IKNH+P+ P+PPLTLNQAGCFTVCHS+AWDSK+VTS
Sbjct: 601 VVKRYMSKGDLYVHAELHGASSTIIKNHKPDSPIPPLTLNQAGCFTVCHSKAWDSKIVTS 660

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDES L SHLNER
Sbjct: 661 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESCLASHLNER 720

Query: 703 RVRGEEEGMDDFEDSGHHKEN---------SDIESEKDDTDEKPVAESLSVPNSAHPAPS 753
           R+RGE+E + + E     + N         +D E+ K   + +   +  SV  +   +PS
Sbjct: 721 RIRGEDEALPEIEVEPWKRHNISELDDKLANDNETSKGIHENESSRDYTSVQQNYDASPS 780

Query: 754 H--TNASNVDSHEFPAEDKTI-SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASI 810
           +  +N     S E  +E +T+ +NG+ S   +  R+ +  V+ QLEDL+D+ LGLG A +
Sbjct: 781 NQPSNMGTASSSEQLSEAQTVENNGVASTFNEETRDDS--VSSQLEDLLDKNLGLGPAKV 838

Query: 811 SSTKHGIETTQFDLSEEDKHVERTATV-RDKPYISKAERRKLKKGQGS--SVVDPKVERE 867
           S     + ++   L E+   ++   T+ R+KPY+SKAERRKLKKGQ S  S  DP  +  
Sbjct: 839 SGKSSLLISSHSSLPEDTDDLDVKKTIQREKPYVSKAERRKLKKGQNSCESTSDP--QNG 896

Query: 868 KERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           +   K  +SQ E     TK    K SRGQKGKLKK+KEKY +QD+EER IRMALLA S
Sbjct: 897 EAVKKPGNSQQEKGKDNTKTANPKTSRGQKGKLKKIKEKYAEQDDEEREIRMALLASS 954


>gi|125537046|gb|EAY83534.1| hypothetical protein OsI_38746 [Oryza sativa Indica Group]
          Length = 1153

 Score = 1150 bits (2976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 617/951 (64%), Positives = 727/951 (76%), Gaps = 55/951 (5%)

Query: 24  MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFT 83
           MR SNVY ++PKTY+FKLMNSSG+TESGESEKVLLLMESGVRLHTT Y RDK  TPSGFT
Sbjct: 1   MRLSNVYGITPKTYLFKLMNSSGITESGESEKVLLLMESGVRLHTTQYVRDKSTTPSGFT 60

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
           LKLRKHIR++RLEDVR LGYDRIILFQFGLG NAH+VILELYAQGNILLTDSE+TVLTLL
Sbjct: 61  LKLRKHIRSKRLEDVRMLGYDRIILFQFGLGSNAHFVILELYAQGNILLTDSEYTVLTLL 120

Query: 144 RSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL---------TSSKEP---DANE 191
           RSHRDD+KG+AIMSRHRYP E CRVFERT  +KL   L         +S   P   DA E
Sbjct: 121 RSHRDDNKGLAIMSRHRYPVEACRVFERTDFTKLKDTLMMNAVDDKESSQVTPGSIDAQE 180

Query: 192 PDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS--NDGARAKQPTLKTVLGEALG 249
           P     DG  V++ S+E      G KS   +K S+ N+  ++ A + + TLKT+LGEAL 
Sbjct: 181 PSVTPSDGVPVTDKSEEP-STTTGKKSASKNKQSSSNAKASNNAPSNKSTLKTLLGEALA 239

Query: 250 YGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEG 307
           YGPAL+EHIILD GL+P+ K+ +   + ++D+ IQ LV +++KFEDWL DV+SG  +PEG
Sbjct: 240 YGPALAEHIILDAGLLPSTKVGKDPESSIDDHTIQSLVESISKFEDWLVDVMSGQRIPEG 299

Query: 308 YILMQNKHLGKDH-PPTESGSSTQ-IYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
           YILMQNK   K +  P E  S++Q IYDE+CP+LLNQF+SREF +FETFDAALDEFYSKI
Sbjct: 300 YILMQNKAAAKKNLTPLEGSSASQKIYDEYCPVLLNQFKSREFNEFETFDAALDEFYSKI 359

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           ESQR  QQ K+KED+A  +LNKI +DQENRVHTL++EVD S+KMAELIEYNLEDVDAAI+
Sbjct: 360 ESQRVNQQQKSKEDSAAQRLNKIKLDQENRVHTLRKEVDHSIKMAELIEYNLEDVDAAIV 419

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT 485
           AVRV+LAN MSW+ LARM+KEE+KAGNPVAGLIDKL  ERNC++LLLSNNLD+MD+EEKT
Sbjct: 420 AVRVSLANGMSWDALARMIKEEKKAGNPVAGLIDKLSFERNCITLLLSNNLDDMDEEEKT 479

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV 545
            PVEKVEVDL+LSAHANARRWYELKKKQESKQEKT+TAH KAFKAAEKKTRLQ+ QEKTV
Sbjct: 480 APVEKVEVDLSLSAHANARRWYELKKKQESKQEKTVTAHEKAFKAAEKKTRLQLAQEKTV 539

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV------------ 593
           A I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+IVKRYMSKGD+            
Sbjct: 540 AAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNELIVKRYMSKGDLSLRFSRKLLVYF 599

Query: 594 -----YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
                YVHA+LHGASST+IKNH+P+ P+PPLTLNQAG FTVCHS+AWDSK+VTSAWWVYP
Sbjct: 600 ASLDSYVHAELHGASSTIIKNHKPDNPIPPLTLNQAGSFTVCHSKAWDSKIVTSAWWVYP 659

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE- 707
           +QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNERRVRGE 
Sbjct: 660 YQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNERRVRGED 719

Query: 708 EEGMDDFEDSGHHKE-------NSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTN--AS 758
           EE + D E              +SD E+ K+  D++   ++++V    +P PS+      
Sbjct: 720 EEALPDVESQKLESNAELDGELDSDSETGKEKHDDESSLDNINVKKIDNPIPSNAPYVKD 779

Query: 759 NVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIE 818
           N DS E  +E +T+ N   S      +     V+ QLEDL+D+ LGLG   +      + 
Sbjct: 780 NADSSEQLSEIRTVVNSTTST--SKGQTSDRTVSSQLEDLLDKNLGLGPTKVLGRSSLLS 837

Query: 819 TTQFDLSEE-DKHVERTATVRDKPYISKAERRKLKKGQ--GSSVVD-PKVEREKERGKDA 874
           +    ++++ D    +  +VRDKPYISKA+RRKLKKGQ  G S  D P  E  K   K  
Sbjct: 838 SNSASVADDIDDLDTKKTSVRDKPYISKADRRKLKKGQNVGDSTSDSPNGEAAK---KPV 894

Query: 875 SSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           +SQ E      K    K+SRGQKGKLKK+KEKYG+QDEEER IRMALLA S
Sbjct: 895 NSQQEKGKTIEKPANPKVSRGQKGKLKKIKEKYGEQDEEEREIRMALLASS 945


>gi|168034467|ref|XP_001769734.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679083|gb|EDQ65535.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1100

 Score = 1015 bits (2625), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 546/958 (56%), Positives = 671/958 (70%), Gaps = 76/958 (7%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK+RMNTADVAAEV+CLRRLIG RC+NVYDL+PKTY+ KL  SSGVTESGESE+ LLL+
Sbjct: 1   MVKLRMNTADVAAEVRCLRRLIGFRCANVYDLTPKTYVIKLSRSSGVTESGESERSLLLL 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR HTT +ARDK  TPSGFTLKLRKHIRTRRLEDVRQLG DR+I  QFG+G   H++
Sbjct: 61  ESGVRFHTTEFARDKSTTPSGFTLKLRKHIRTRRLEDVRQLGIDRVIDLQFGMGEGTHHI 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTD ++ VLTLLR+H+D+DKG+ +M++H YP   CR+F R +  KL AA
Sbjct: 121 ILELYAQGNILLTDGDYNVLTLLRTHKDEDKGLVMMAKHEYPVNACRLFNRFSLEKLEAA 180

Query: 181 LTSSK-EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
           +   K + DA+E     E     S   KE+ G                           T
Sbjct: 181 MRDQKTQADADEYIDAKEVKVKTSWGKKEDTG--------------------------RT 214

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLS----EVNKLEDNAIQVLVLAVAKFEDWL 295
           LK+VLG  LGYGPAL EHI+LD+GL   MK+S     V  +    +  L+ A+++FEDWL
Sbjct: 215 LKSVLGGCLGYGPALCEHIVLDSGLQSGMKVSLGPDGVLSISKENLGDLMGAISRFEDWL 274

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESG--SSTQIYDEFCPLLLNQFRSREFVKFET 353
             V++GD +PEG++ MQ K++ KD    +       ++YDEF PL L QF  R  ++ ET
Sbjct: 275 DSVVNGDRIPEGFVYMQKKNIKKDKVLLDDQLQEEEKVYDEFSPLHLKQFDDRTVMRMET 334

Query: 354 FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
           +DAALDEF+SKIE QRAEQQ KA+ED+AF KL+KI  DQ  RV  LKQEVD++V+MAELI
Sbjct: 335 YDAALDEFFSKIEGQRAEQQRKAQEDSAFSKLDKIRADQTQRVEVLKQEVDQTVRMAELI 394

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           EYNLEDVD AILAVR  +A+ M W+DLARM+KEE+KAGNPVAGLI  L LE+N ++LLLS
Sbjct: 395 EYNLEDVDNAILAVRSTVASGMDWKDLARMIKEEKKAGNPVAGLIHSLQLEKNQITLLLS 454

Query: 474 NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
           NNLD+MDD+EKT PV KV+VD+ LSAHANARRW+E KKK   KQ+KT  AH KAFKAAEK
Sbjct: 455 NNLDDMDDDEKTQPVSKVDVDIGLSAHANARRWFEQKKKHAVKQDKTKAAHEKAFKAAEK 514

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
           KT  Q+ Q K+VA ISHMRKVHWFEKFNWF+SSENYL+ISGRDAQQNE++VKRYM KGD+
Sbjct: 515 KTLQQLAQAKSVAAISHMRKVHWFEKFNWFVSSENYLIISGRDAQQNELVVKRYMRKGDL 574

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK 653
           YVHADLHGASSTVI+NH P  P+PPLT+NQAG FTVC SQAWDSK+VTSAWWV  HQVSK
Sbjct: 575 YVHADLHGASSTVIQNHNPLYPIPPLTINQAGVFTVCRSQAWDSKIVTSAWWVEAHQVSK 634

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
           TAPTGEYLTVGSFM+RGKKNFLPP+PL+MGFG+LFRLD+SS+ +HLNERRVRGE E  D 
Sbjct: 635 TAPTGEYLTVGSFMVRGKKNFLPPNPLVMGFGVLFRLDDSSIAAHLNERRVRGEVEDDDT 694

Query: 714 FEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS 773
                    N+D+ S+  D       E L             N   V   + P  +  I 
Sbjct: 695 LT---LVTSNNDVYSKTPDA-----IEELDGVGEEEEQDIEFNEDEVADSKCPDVEVEIG 746

Query: 774 NGIDSKI-FDIARNVAAPVTPQLEDLIDRALGLGSA---SISSTKHGIET--TQFDLSEE 827
           N +D K+   I    ++     L+ L+DRAL L +    + +++K+G++T   Q   +E 
Sbjct: 747 N-LDEKVDAGIEGEGSSDDASGLDALLDRALELRAGPKRTDTNSKYGLDTLPAQVSDTEY 805

Query: 828 DKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDA---------SSQP 878
           D  V + A+ R+KPYISKAERRK KKG        KVE+  E+   A         +SQ 
Sbjct: 806 DLPVAK-ASQREKPYISKAERRKAKKG-------GKVEKGSEKDASAETVDGEEEKTSQE 857

Query: 879 ESIVRKTKI-----------EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           E++  K+ I            G K+ RG+KGKLKK+K KY +QDE+ER +RM+LLAVS
Sbjct: 858 ENLKTKSAIFKDDKMSESSPLGEKVGRGRKGKLKKIKAKYAEQDEDERELRMSLLAVS 915


>gi|297736754|emb|CBI25955.3| unnamed protein product [Vitis vinifera]
          Length = 712

 Score =  999 bits (2582), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/745 (68%), Positives = 568/745 (76%), Gaps = 93/745 (12%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           RMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FK MNSSGVTESG SEKVLLLM+SGV
Sbjct: 6   RMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKFMNSSGVTESGGSEKVLLLMKSGV 65

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           RLHTTAY R    TPSGFTLKLRKHI TRRLEDVRQLGYDR+ILFQFGLG NAHYVILEL
Sbjct: 66  RLHTTAYVR---MTPSGFTLKLRKHICTRRLEDVRQLGYDRVILFQFGLGANAHYVILEL 122

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
            AQGNILLTDSEF V+TLL SHRDDDKGVAI+SRH YP EICRVFE TT +KL AALTS 
Sbjct: 123 CAQGNILLTDSEFMVMTLLGSHRDDDKGVAIISRHWYPVEICRVFECTTTTKLQAALTSP 182

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           KE ++NE  +                G +KG KS + SKN+N    DGARAKQ TLKTVL
Sbjct: 183 KESESNEAKQ----------------GNRKGAKSSEPSKNTN----DGARAKQATLKTVL 222

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
           GEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +VAKFE+WL+DVI GD V
Sbjct: 223 GEALGYGPALSEHIILDAGLIPNTKVTKDSKFDFDTIQRLAQSVAKFENWLEDVILGDQV 282

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
           PEGYILMQNK  GKD  P++    +QIYDEFCP+LLNQF+SREFVKFETFDAA DEFYSK
Sbjct: 283 PEGYILMQNKIFGKDCRPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAASDEFYSK 342

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           IE QR+EQQ KAKE              ENRVHTLK+E DR +KMAELIEYNLEDVDAAI
Sbjct: 343 IEGQRSEQQQKAKE--------------ENRVHTLKKEDDRCIKMAELIEYNLEDVDAAI 388

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           LAVRVALAN M+WEDLARMVKE++K+GNPVAGLIDKLYLERNCM+LLLSNNLDEMDD+EK
Sbjct: 389 LAVRVALANGMNWEDLARMVKEKKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEMDDDEK 448

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
           TL V+KVEVDLALSAHANAR+WYE KK+QE+K+EKTI AH K  K  +++         +
Sbjct: 449 TLHVDKVEVDLALSAHANARQWYEQKKRQENKREKTIIAHEKLLKLLKRRL------ASS 502

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY-------MSKGDVYVHA 597
             +   +   +WFEKFNWFISS+NY VISGRDAQ NEMIVKRY       M     + +A
Sbjct: 503 FHSYWPLVLFNWFEKFNWFISSKNYFVISGRDAQLNEMIVKRYIELRRKKMRPNSTHYYA 562

Query: 598 -------------------------------------------DLHGASSTVIKNHRPEQ 614
                                                      D HGASSTVIKNH+PE 
Sbjct: 563 TKKELCKDFEFPTYCNTVISILVKVFLKLIGFSYLSNARYIHADPHGASSTVIKNHKPEH 622

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           PVPPLTLNQAGCFTVCHSQ WDSK+VTSAWWVYPHQVSKTAPTGEYLTVGSFMI GKKNF
Sbjct: 623 PVPPLTLNQAGCFTVCHSQVWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIHGKKNF 682

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHL 699
           LPPHPL+MGFGLLF LDE +   H+
Sbjct: 683 LPPHPLMMGFGLLFCLDERAPWDHI 707


>gi|302768961|ref|XP_002967900.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
 gi|300164638|gb|EFJ31247.1| hypothetical protein SELMODRAFT_60048 [Selaginella moellendorffii]
          Length = 1083

 Score =  969 bits (2505), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 516/948 (54%), Positives = 653/948 (68%), Gaps = 73/948 (7%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK R+N ADVAAEVKCLR LIGMRC+NVYDL+PKTY+ KL  SSG+T SGE E+ L+L+
Sbjct: 1   MVKGRLNVADVAAEVKCLRCLIGMRCANVYDLTPKTYVIKLAKSSGLTSSGEGERALVLL 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLH T ++RDK  TPSGFTLKLRKHIRTRRLE+V+QLG DR++ FQFG G  AH++
Sbjct: 61  ESGVRLHMTEFSRDKSVTPSGFTLKLRKHIRTRRLENVQQLGVDRVVDFQFGTGELAHHI 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHR-------DDDKGVAIMSRHRYPTEICRVFERTT 173
           ILELYAQGN+LLTD+++ VLTLLRSHR       DD KG+A+M+RHRYP E CR F+RTT
Sbjct: 121 ILELYAQGNVLLTDADYNVLTLLRSHRQACRFFLDDYKGIAMMARHRYPVENCRTFQRTT 180

Query: 174 ASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGA 233
              L  A +    PD  + ++                  Q+  ++   ++   K  ++G 
Sbjct: 181 MQDLIRAFS----PDEKKAEQ------------------QEAQQTPQDARLQKKKDDEGF 218

Query: 234 RAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNK---LEDNAIQVLVLAVAK 290
                TLK++L ++  YGPA+ EH+ILD GL PNMK+ + +    + +  +  L+ A+ +
Sbjct: 219 -----TLKSILLDSFSYGPAVFEHVILDAGLQPNMKVCDASNRSMVSEKDLHSLLEAIKR 273

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVK 350
           FEDWL+ V +GD  PEGYI        K        +  +++DEF PLLL Q   RE+VK
Sbjct: 274 FEDWLESVTTGDFTPEGYITFHPNKTAKKK--NAESAEEKMFDEFSPLLLKQSAHREYVK 331

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+TFDAALDEF+SKIE QR +QQ K +ED+A+ KL KI  DQ +RV +LK+EVD++V  A
Sbjct: 332 FDTFDAALDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRADQRSRVESLKREVDQAVHTA 391

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           ELIEYNL DVD AI AVR ALAN M W+DL RM+KEERKAGNPVAGLI  L LE+N ++L
Sbjct: 392 ELIEYNLADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHITL 451

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           LLSNNLD+MDD++KT P +KVEVDL+LSAHANAR+W+++KKKQ  KQEKT+ AH KAFKA
Sbjct: 452 LLSNNLDDMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQALKQEKTVAAHEKAFKA 511

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           AE+KT+ Q+ Q K VA ISH+RKVHWFEKFNWFISSENYL+ISGRDAQQNE IVKRYM K
Sbjct: 512 AERKTQQQLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISGRDAQQNEQIVKRYMKK 571

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GD+YVHADLHGASST+IKNH P QPV PLT+NQAGCFTVC SQAWDSK++TSAWWVY HQ
Sbjct: 572 GDLYVHADLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQAWDSKIITSAWWVYDHQ 631

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE--- 707
           VSKTAPTGEYLTVGSFMIRGKKNFLPP+PL+MGFGL FRLDESS+ +H NERR+R E   
Sbjct: 632 VSKTAPTGEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESSIPAHFNERRIRAEGDN 691

Query: 708 EEGMDDFEDSGHHKENSDIESEKDDTDE-KPVAESLSVPNSAHPAPSHTNASNVDSHEFP 766
           EE   + +D     +++ +E  +D   E K   +  S    A    +    S     E  
Sbjct: 692 EEPEAEIQDD-EEIDDASVEDSQDKVHERKESGDGGSTIEKASVTEAEEARSEEAESEEA 750

Query: 767 AEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGS---ASISSTKHGIETTQFD 823
              +T +  +D +        A      ++ L+D+AL L S   + + + K+G+   Q +
Sbjct: 751 RAPETENAAMDEQ-----EEQAPQSDSDIDSLLDKALELKSVLPSQVDTNKYGLGEVQTE 805

Query: 824 LSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVR 883
              +D   E T   R+KPYISKAERRKLKKG  +  V    E EK+  ++ SS       
Sbjct: 806 DQVDDADQE-TKVAREKPYISKAERRKLKKGGNTQEV--AQENEKDGIEEGSS------- 855

Query: 884 KTKIEGGKISRGQKGKLK------KMKEKYGDQDEEERNIRMALLAVS 925
                G K S G   +++      K  +KY +QD+EER +RM+LL+V+
Sbjct: 856 -----GAKPSEGSNKQVRGKKGKLKKLKKYAEQDDEERELRMSLLSVT 898


>gi|302761200|ref|XP_002964022.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
 gi|300167751|gb|EFJ34355.1| hypothetical protein SELMODRAFT_266749 [Selaginella moellendorffii]
          Length = 1052

 Score =  959 bits (2479), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 507/938 (54%), Positives = 643/938 (68%), Gaps = 104/938 (11%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK R+N ADVAAEVKCLR LIGMRC+NVYDL+PKTY+ KL  SSG+T SGE E+ L+L+
Sbjct: 1   MVKGRLNVADVAAEVKCLRCLIGMRCANVYDLTPKTYVIKLAKSSGLTSSGEGERALVLL 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLH T ++RDK  TPSGFTLKLRKHIRTRRLE+V+QLG DR++ FQFG G  AH++
Sbjct: 61  ESGVRLHMTEFSRDKSVTPSGFTLKLRKHIRTRRLENVQQLGVDRVVDFQFGTGELAHHI 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGN+LLTD+++ VLTLLRS       +A+M+RHRYP E CR F+RTT   L  A
Sbjct: 121 ILELYAQGNVLLTDADYNVLTLLRS-------IAMMARHRYPVENCRTFQRTTMQDLIRA 173

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            +    PD  + ++                  Q+  ++   ++   K  ++G      TL
Sbjct: 174 FS----PDEKKAEQ------------------QEAQQTPQDARLQKKKDDEGF-----TL 206

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNK---LEDNAIQVLVLAVAKFEDWLQD 297
           K++L ++  YGPA+ EH+ILD GL PNMK+ + +    + +  +  L+ A+ +FEDWL+ 
Sbjct: 207 KSILLDSFSYGPAVFEHVILDAGLQPNMKVCDASNRSMVSEKDLHSLLEAIKRFEDWLES 266

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
           V +GD  PEGYI        K        +  +++DEF PLLL Q   RE++KF+TFDAA
Sbjct: 267 VTTGDFTPEGYITFHPNKTAKKK--NAESAEEKMFDEFSPLLLKQSAHREYIKFDTFDAA 324

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           LDEF+SKIE QR +QQ K +ED+AF KL KI  DQ +RV +LK+EVD++V  AELIEYNL
Sbjct: 325 LDEFFSKIEGQRLDQQRKTQEDSAFSKLEKIRADQRSRVESLKREVDQAVHTAELIEYNL 384

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
            DVD AI AVR ALAN M W+DL RM+KEERKAGNPVAGLI  L LE+N ++LLLSNNLD
Sbjct: 385 ADVDLAIDAVRAALANGMDWKDLGRMIKEERKAGNPVAGLIHSLQLEKNHITLLLSNNLD 444

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           +MDD++KT P +KVEVDL+LSAHANAR+W+++KKKQ  KQEKT+ AH KAFKAAE+KT+ 
Sbjct: 445 DMDDDDKTKPADKVEVDLSLSAHANARKWFDMKKKQALKQEKTVAAHEKAFKAAERKTQQ 504

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
           Q+ Q K VA ISH+RKVHWFEKFNWFISSENYL+ISGRDAQQNE IVKRYM KGD+YVHA
Sbjct: 505 QLSQAKAVATISHLRKVHWFEKFNWFISSENYLIISGRDAQQNEQIVKRYMKKGDLYVHA 564

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           DLHGASST+IKNH P QPV PLT+NQAGCFTVC SQAWDSK++TSAWWVY HQVSKTAPT
Sbjct: 565 DLHGASSTLIKNHNPSQPVSPLTINQAGCFTVCRSQAWDSKIITSAWWVYDHQVSKTAPT 624

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE---EEGMDDF 714
           GEYLTVGSFMIRGKKNFLPP+PL+MGFGL FRLDESS+ +H NERR+R E   EE   + 
Sbjct: 625 GEYLTVGSFMIRGKKNFLPPYPLVMGFGLFFRLDESSIPAHFNERRIRAEGDNEEPEAEI 684

Query: 715 EDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISN 774
           +D     +++ +E  +D   E+  +E+ ++      AP   + S++DS            
Sbjct: 685 QDD-EEIDDASVEDSQDKVHERKESENAAMDEQEEQAPQ--SDSDIDS------------ 729

Query: 775 GIDSKIFDIARNVAAPVTPQLEDLIDRALGLGS---ASISSTKHGIETTQFDLSEEDKHV 831
                                  L+D+AL L S   + + + K+G+   Q +   +D   
Sbjct: 730 -----------------------LLDKALELKSVLPSQVDTNKYGLGEVQTEDQVDDADQ 766

Query: 832 ERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGK 891
           E T   R+KPYISKAERRKLKKG  +  V    E EK+  ++ SS            G K
Sbjct: 767 E-TKVAREKPYISKAERRKLKKGGNTQEV--AQENEKDGIEEGSS------------GAK 811

Query: 892 ISRGQKGKLK------KMKEKYGDQDEEERNIRMALLA 923
            S G   +++      K  +KY +QD+EER +RM+LL+
Sbjct: 812 PSEGSNKQVRGKKGKLKKLKKYAEQDDEERELRMSLLS 849


>gi|224101503|ref|XP_002312307.1| predicted protein [Populus trichocarpa]
 gi|222852127|gb|EEE89674.1| predicted protein [Populus trichocarpa]
          Length = 796

 Score =  865 bits (2235), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 452/601 (75%), Positives = 502/601 (83%), Gaps = 24/601 (3%)

Query: 331 IYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHM 390
           IYDEFCPLLLNQFR RE VKF+ FDAALDEFYSKIESQ++E Q K KE +A  KLNKI +
Sbjct: 1   IYDEFCPLLLNQFRMREHVKFDAFDAALDEFYSKIESQKSEHQQKTKEGSAIQKLNKIRL 60

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
           DQENRV  L++EVD SVKMAELIEYNLEDV++AILAVRVALA  M WEDLARMVK+E+KA
Sbjct: 61  DQENRVEMLRKEVDHSVKMAELIEYNLEDVNSAILAVRVALAKGMGWEDLARMVKDEKKA 120

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
           GNPVAGLIDKL+ E+NCM+LLLSNNLDEMDD+EKT PV+KVEVDLALSAHANARRWYELK
Sbjct: 121 GNPVAGLIDKLHFEKNCMTLLLSNNLDEMDDDEKTFPVDKVEVDLALSAHANARRWYELK 180

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           KKQESKQEKT+TAH KAFKAAEKKTRLQ+ QEK+VA ISHMRKVHWFEKFNWFISSENYL
Sbjct: 181 KKQESKQEKTVTAHEKAFKAAEKKTRLQLSQEKSVATISHMRKVHWFEKFNWFISSENYL 240

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VISGRDAQQNEMIVKRY+SKGD+YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC
Sbjct: 241 VISGRDAQQNEMIVKRYVSKGDLYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 300

Query: 631 HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           HSQAWDSK+VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL
Sbjct: 301 HSQAWDSKIVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 360

Query: 691 DESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKP-VAESLSVPNSAH 749
           DESSLGSHLNERRVRGEE+G++D E+S   KE SD ESE+++   K  V ES        
Sbjct: 361 DESSLGSHLNERRVRGEEDGVNDVEESQPLKEISDSESEEEEVAGKELVLES-------- 412

Query: 750 PAPSHTN---ASNVDSHEFPAEDKTISNGID-SKIFDIARNVAAPVTPQLEDLIDRALGL 805
              SH+N    SN   HE   ++ ++ NG++   + D+  N  APVTPQLEDLIDRALGL
Sbjct: 413 --ESHSNDLTVSNTILHESSVQETSL-NGVNIENLSDVVGNDVAPVTPQLEDLIDRALGL 469

Query: 806 GSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVE 865
           G  ++SS  +G+E  Q D++EE  H E     RDKPYISKAERRKLKKGQ SS  D +VE
Sbjct: 470 GPTAVSSKNYGVEPLQVDMTEE--HHEEA---RDKPYISKAERRKLKKGQRSSATDAEVE 524

Query: 866 REKERGKD---ASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           REKE  KD   +  QPE  V+  K  GGKI RGQ+ KLKKMKEKY +QDEEER+IRMALL
Sbjct: 525 REKEELKDNVVSVDQPEKHVQNNKQGGGKIIRGQRSKLKKMKEKYANQDEEERSIRMALL 584

Query: 923 A 923
           A
Sbjct: 585 A 585


>gi|414878087|tpg|DAA55218.1| TPA: hypothetical protein ZEAMMB73_985047 [Zea mays]
          Length = 608

 Score =  862 bits (2227), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/608 (71%), Positives = 502/608 (82%), Gaps = 15/608 (2%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK RM T DVAAEVKCLRRLIGMR +NVYD++PKTY+FKLMNSSG+TESGESEKVLLLM
Sbjct: 1   MVKARMTTTDVAAEVKCLRRLIGMRLANVYDITPKTYLFKLMNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR HTT Y RDK  TPSGFTLKLRKHIR +RLEDVR LGYDRIILFQFGLG NAH++
Sbjct: 61  ESGVRFHTTQYVRDKSTTPSGFTLKLRKHIRNKRLEDVRMLGYDRIILFQFGLGSNAHFI 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNILLTDSE+TVLTLLRSHRDD+KG+AIMSRHRYP E CRVF RT  +KL   
Sbjct: 121 ILELYAQGNILLTDSEYTVLTLLRSHRDDNKGLAIMSRHRYPVEACRVFGRTDFAKLKDM 180

Query: 181 LT-----------SSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSK-NSNKN 228
           LT           +S   DA E  +   D   V+  S+++L  ++   +    +  SN  
Sbjct: 181 LTKPDKADDKEEITSGSTDAQETSQSTNDEVLVTEISEKSLSKKEKKAAAKAKQFGSNAK 240

Query: 229 SNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVL 286
            N+GA++ + TLKT+LGEAL YGPAL+EHIILD GLVP+ K+ +   + + D+ +Q L+ 
Sbjct: 241 VNNGAQSNKATLKTILGEALAYGPALAEHIILDAGLVPSTKVGKDPESTINDSTVQSLME 300

Query: 287 AVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGS-STQIYDEFCPLLLNQFRS 345
           ++ +FEDWL D+ISG  +PEGYILMQNK   K+  P E  S + +IYDE+CP+LLNQF+S
Sbjct: 301 SITRFEDWLVDIISGQRIPEGYILMQNKMTAKNITPLEEASINHKIYDEYCPVLLNQFKS 360

Query: 346 REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
           RE+ +F TFDAALDEFYSKIESQ+  QQ KAKE++A  +LNKI +DQENRVHTL++EVD 
Sbjct: 361 REYNEFATFDAALDEFYSKIESQKVNQQQKAKEESAAQRLNKIKLDQENRVHTLRKEVDH 420

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
            VKMAELIEYNLEDVDAAILAVRV+LAN MSWE L RM+KEERKAGNPVAGLIDKL  ER
Sbjct: 421 CVKMAELIEYNLEDVDAAILAVRVSLANEMSWEALTRMIKEERKAGNPVAGLIDKLNFER 480

Query: 466 NCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
           NC++LLLSNNLD+MD++EKT PVEKVEVD+ALSAHANARRWYE+KKKQESKQEKTITAH 
Sbjct: 481 NCITLLLSNNLDDMDEDEKTAPVEKVEVDIALSAHANARRWYEMKKKQESKQEKTITAHD 540

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           KAFKAAEKKTRLQ+ QEKTVA I+HMRKVHWFEKFNWFISSENYL+ISGRDAQQNE+IVK
Sbjct: 541 KAFKAAEKKTRLQLAQEKTVAAITHMRKVHWFEKFNWFISSENYLIISGRDAQQNELIVK 600

Query: 586 RYMSKGDV 593
           RYMSKGD+
Sbjct: 601 RYMSKGDL 608


>gi|384249421|gb|EIE22903.1| hypothetical protein COCSUDRAFT_16391 [Coccomyxa subellipsoidea
           C-169]
          Length = 1029

 Score =  659 bits (1701), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/723 (47%), Positives = 459/723 (63%), Gaps = 78/723 (10%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGE-SEKVLL 58
           MVK RM+TADV  EV CLR  ++GMR +NVYD + KTYI KL      ++SGE  EK LL
Sbjct: 1   MVKQRMSTADVVGEVACLRHSVLGMRVANVYDANAKTYIIKL------SKSGEEGEKALL 54

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++ESGVR HTT Y +DK +TPS FTLKLRKH+RTRRL+DVRQLG DR++ F FG G   +
Sbjct: 55  VLESGVRFHTTRYLKDKADTPSNFTLKLRKHLRTRRLDDVRQLGVDRVVDFSFGTGEACY 114

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++ILELYAQGN++L D+ +++LTLLRSHRDDDKG+AIM+RH YP    R+    T ++L 
Sbjct: 115 HLILELYAQGNVILADANYSILTLLRSHRDDDKGLAIMARHAYPVHAIRLRSALTQAQLD 174

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           AAL S+ +                                                 KQ 
Sbjct: 175 AALASADD-------------------------------------------------KQ- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL+  L   + YGPALSEH  L  GL P  K  + + L +     L+  V  +E WL   
Sbjct: 185 TLRGALASVVPYGPALSEHCTLLAGLRPTRK-PKADPLCEEERTALLGGVRHWEAWLDAC 243

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +    PEG+I ++    G +     +     +YD F PL+L Q   +E ++F T++AAL
Sbjct: 244 ETA--APEGFISLKRPADGSE--AASASGDCLVYDSFDPLILQQNSGQEVLRFPTYNAAL 299

Query: 359 DEFYSK-----------IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           DEFY+K           +E Q+AEQ     E AA  KL++I +DQ  R   L +E   + 
Sbjct: 300 DEFYAKARPAPLCLTMSVEGQKAEQARLQAEQAALSKLDRIRIDQTGRAEALDREAKEAE 359

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
             A+LIE N E VD AI AVRVALA  +SW +L R++++E  AGN VAGL+  L+L+RN 
Sbjct: 360 AKAQLIEANAEAVDQAINAVRVALAQGLSWAELERLIRDEAAAGNQVAGLVHALHLDRNA 419

Query: 468 MSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
           ++LL SN   E +DE  T +P   VEVDL L+A  NAR W+  +K + +KQ KT+ A+ +
Sbjct: 420 VTLLDSNA--ESNDETGTDVPTALVEVDLDLNAQQNARAWHSDRKARSAKQAKTLDANKR 477

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           A   A+KK ++Q+ + K VA +  +RK  WFEKFNWF++SENYLV+SGRDAQQNE++VKR
Sbjct: 478 ALVEADKKVQVQLSKVKAVAAVQQLRKPAWFEKFNWFVTSENYLVVSGRDAQQNELLVKR 537

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           Y+ K D+YVHA+LHGAS+TV++NH P +P   L ++QAG   VC SQAWD+K+VTSAWWV
Sbjct: 538 YLRKDDLYVHAELHGASTTVVRNHNPSRPGMAL-VSQAGTACVCRSQAWDAKIVTSAWWV 596

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
           + HQVSK+AP+GEYL  GSFMIRG+KNFLPPHPLIMG   LF+LDES +  HL ER  + 
Sbjct: 597 HAHQVSKSAPSGEYLPTGSFMIRGRKNFLPPHPLIMGLTFLFKLDESCIAGHLGERAPKS 656

Query: 707 EEE 709
            E+
Sbjct: 657 AED 659


>gi|428183447|gb|EKX52305.1| hypothetical protein GUITHDRAFT_65529, partial [Guillardia theta
           CCMP2712]
          Length = 703

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 314/716 (43%), Positives = 441/716 (61%), Gaps = 79/716 (11%)

Query: 21  LIGMRCSNVYDLSPKTYIFK------------LMNSSGVTES---GESEKVLLLMESGVR 65
           L+G R +N+YDL  KTY+ K             + S  +TE       EK L+L+ESG+R
Sbjct: 1   LLGARLANIYDLDAKTYLLKTNKVRHALAGGAWLLSPWMTERFPLQSGEKCLVLLESGIR 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            HTT + RDK N PSGFTLKLRKHIR +R+E+V+QLG DR+++F FG    A ++ILEL+
Sbjct: 61  FHTTEFMRDKSNMPSGFTLKLRKHIRMKRIEEVKQLGVDRVVIFTFGAADEAFHLILELF 120

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSK 185
           A GNI+L D ++T+L LLR++ D+                       T +K+    T   
Sbjct: 121 AGGNIILVDHQYTILALLRTYTDE----------------------ATNTKVAVKETYQL 158

Query: 186 EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLG 245
           + + NE  K++ D           L  +  GK              GA+ ++ T++ VL 
Sbjct: 159 DSNQNENRKISVD-----------LLMEAFGK--------------GAKNEKATMRDVLI 193

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK-FEDWLQDVISGDIV 304
           + L YGPAL EH +L T L   MK+SE+    D+ +   +  V K  +D + ++  G  +
Sbjct: 194 KELDYGPALVEHALLGTSLDGKMKVSEMEITRDSPVVSTLFGVFKEVDDMIANLTDGGKM 253

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
            EG ++   K  G+D P          YD+F P++L Q+  ++   F++FD A+D ++S 
Sbjct: 254 IEGVLV--RKGAGEDSP----------YDDFGPVVLRQYAGKKLDMFDSFDKAMDAYFSI 301

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
            E ++ EQQ   ++ AA  K+ ++    E  +  L++E   +   A LIE NL DVD AI
Sbjct: 302 AEDKKLEQQKVQQKKAAVSKVERVKRAHEASIQALQEEEAENYHRATLIEANLSDVDNAI 361

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           L +   L+  M W  L ++VKEE + GNP+A +I  L L+ N ++LLL+  LD M++EE+
Sbjct: 362 LVINSMLSQGMDWASLKKLVKEEGRKGNPIAQMIHGLKLDSNQITLLLTFGLDAMEEEEQ 421

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
           TLPV  V+VDL ++A+ NA+ +Y  KKK   K EKT+ A  KA K AE+K +  + +  T
Sbjct: 422 TLPVVAVDVDLGMNAYQNAQSYYSSKKKVALKAEKTMQAAGKAIKGAERKAKEDLKKADT 481

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
            A+I  +RK HWFEKF WFISSEN+LV+ GRDAQQNE++VKR+M KGD+Y+HAD+HGA++
Sbjct: 482 KASIQQIRKTHWFEKFIWFISSENFLVLCGRDAQQNELLVKRHMEKGDIYLHADIHGAAT 541

Query: 605 TVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            +IKNH  +  VPPLTL QAG   VC SQAWD+KMVTSA+WV+P QVSK+APTGEYL+ G
Sbjct: 542 HIIKNHTKD-AVPPLTLAQAGLSCVCRSQAWDAKMVTSAYWVHPEQVSKSAPTGEYLSTG 600

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG---EEEGMDDFEDS 717
           SFMIRGKKN+LPP+ LIMGFGLLFR+DES L  H+ ER++RG   +EE M    DS
Sbjct: 601 SFMIRGKKNYLPPNSLIMGFGLLFRIDESCLAHHVGERKIRGLGEQEEEMGKAGDS 656


>gi|302854251|ref|XP_002958635.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
           nagariensis]
 gi|300256024|gb|EFJ40301.1| hypothetical protein VOLCADRAFT_69736 [Volvox carteri f.
           nagariensis]
          Length = 744

 Score =  563 bits (1452), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 333/743 (44%), Positives = 426/743 (57%), Gaps = 94/743 (12%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGE-SEKVLL 58
           MVK RM++ADVAAEV CLR R++G+R +N+YDL+PKTY+ KL        SGE  EKV L
Sbjct: 1   MVKQRMSSADVAAEVACLRQRILGLRVANIYDLTPKTYVIKL------ARSGEDGEKVYL 54

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           L+ESG R HTT       + PS FTLKLRKH RTRR+E VRQLG DR +    G G  A 
Sbjct: 55  LLESGSRFHTTKVGEKSSDLPSNFTLKLRKHCRTRRVEAVRQLGVDRCMELTLGSGPAAV 114

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++ILE+YAQGN++LTD ++ VLTLLRSHRDD KG+ IM+RH YP    R+     ASK+ 
Sbjct: 115 HLILEMYAQGNVVLTDYKYEVLTLLRSHRDDAKGLVIMARHPYPMSAMRL-----ASKV- 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                                GK  D        +   A   Q 
Sbjct: 169 ------------------------------------TGKQLD----EAAAAAAAAGGAQA 188

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
             + +L   L YGP ++EH+ +D G  PN  +    +  +   +    A           
Sbjct: 189 NYRALLSAVLPYGPTIAEHVAMDAGFDPNAAVPLEGEEVEEEGEGAATAATAAAAAAAPP 248

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
             G  +P          + +        +   ++ EF PL L  +  +  ++  TFD AL
Sbjct: 249 GGGGALP--------ADVRRSLLAALVAAGELVFAEFSPLPLLPYSGQPCLELSTFDDAL 300

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYSKIE QRA       E AA  KL+KI +DQ  R   L ++ +     A+LI YNLE
Sbjct: 301 DEFYSKIEGQRAGIARADAERAALSKLDKIKLDQGTRAEALLRQAEECELKAQLITYNLE 360

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            VDA +LAV   LA  M W  LA +V+ ER+AGNPVA LI  L LE N +S+LL+N LD+
Sbjct: 361 MVDAVLLAVNQMLATGMDWSALADLVRNERRAGNPVAALIASLELENNRVSVLLANTLDD 420

Query: 479 MDD---------------EEKTLPVEK-------------VEVDLALSAHANARRWYELK 510
             +                E+  P                V VDL+LSA ANA  ++E +
Sbjct: 421 TGEEGEEEAMTRKAVKVASEECFPQHTQRHTQRHTHTHILVFVDLSLSAAANASTYFEAR 480

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVA-NISHMRKVHWFEKFNWFISSENY 569
           ++  +K  KT+ A+  A  AAEKK   Q+ Q +     +  +RK  WFE+F+WFISSENY
Sbjct: 481 RRHLAKHAKTLAANEAALAAAEKKVEAQLKQVRAAPPALQPVRKPMWFERFHWFISSENY 540

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LV+SGRDAQQNE++VKRY  KGDVYVHA+LHG  +T+    R   P+PPLTL QAGC  V
Sbjct: 541 LVVSGRDAQQNELLVKRYFRKGDVYVHAELHG--TTICVRWRSGGPIPPLTLQQAGCACV 598

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C S+AWDSK+VTSAWWV+  QVSKTAPTGEYLT GSFMIRGKKNFLPP PL+MGFG LF+
Sbjct: 599 CRSRAWDSKLVTSAWWVHHQQVSKTAPTGEYLTTGSFMIRGKKNFLPPQPLVMGFGFLFK 658

Query: 690 LDESSLGSHLNERRVRG-EEEGM 711
           LD+SS+ +HL ER VRG + +GM
Sbjct: 659 LDDSSIPAHLGERAVRGLDPDGM 681


>gi|147771936|emb|CAN75697.1| hypothetical protein VITISV_035984 [Vitis vinifera]
          Length = 431

 Score =  554 bits (1427), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 316/589 (53%), Positives = 360/589 (61%), Gaps = 163/589 (27%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           RMNTADVAAE+KCLRRLIGMRC+NVYDLSPKTY+FK MNSSGVTESG             
Sbjct: 6   RMNTADVAAEIKCLRRLIGMRCANVYDLSPKTYMFKFMNSSGVTESG------------- 52

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
                                                G +++ILFQFGLG NA YVILEL
Sbjct: 53  -------------------------------------GSEKVILFQFGLGANAXYVILEL 75

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
            AQGNILLTDSEF V+TLL SHR+    +  M + R P E                    
Sbjct: 76  CAQGNILLTDSEFMVMTLLGSHRN----LRAMKQSR-PVE-------------------- 110

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
                         GN VS+A +E  G +KG KS + SKN+N    DGARAKQ TLKTVL
Sbjct: 111 -------------GGNKVSDAPREKQGNRKGAKSSEPSKNTN----DGARAKQATLKTVL 153

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
           GEALGYGPALSEHIILD GL+PN K+++ +K + + IQ L  +VAKFE+WL+DVI GD V
Sbjct: 154 GEALGYGPALSEHIILDAGLIPNTKVTKDSKFDXDTIQRLAQSVAKFENWLEDVILGDQV 213

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
           PEGYILMQNK  GKD  P++    +QIYDEFCP+LLNQF+SREFVKFETFDAA DEFYSK
Sbjct: 214 PEGYILMQNKIFGKDCRPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAASDEFYSK 273

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           IE QR+EQQ KAKE  A  KL+KI MDQENRVHTLK+E DR +KMAELIEYNLEDVDAAI
Sbjct: 274 IEGQRSEQQQKAKEVXAMQKLSKICMDQENRVHTLKKEDDRCIKMAELIEYNLEDVDAAI 333

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           LAVRVALAN M+WEDLARM                                         
Sbjct: 334 LAVRVALANGMNWEDLARM----------------------------------------- 352

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
                 VEVDLALSAHANAR WYE KK+QE+K+EKTI AH K  K  +++          
Sbjct: 353 ------VEVDLALSAHANARXWYEQKKRQENKREKTIIAHEKLLKLLKRRLA-------- 398

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
                           + F SS+NY VISGRDAQ NEMIVKRYMSKGD+
Sbjct: 399 ----------------SSFHSSKNYFVISGRDAQLNEMIVKRYMSKGDL 431


>gi|443707183|gb|ELU02895.1| hypothetical protein CAPTEDRAFT_151175 [Capitella teleta]
          Length = 1023

 Score =  550 bits (1417), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 304/705 (43%), Positives = 425/705 (60%), Gaps = 72/705 (10%)

Query: 2   VKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K +  T D+ A V +  RR IGMR +NVYD+  KTY+ KL        +   +K LL++
Sbjct: 1   MKTKFTTVDIRASVLEVKRRWIGMRVTNVYDIDNKTYLVKL--------AKPDQKALLVL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H+T +   K N+PSGF++KLRKH+R RRLE V+QLG DR++  QFG    A+++
Sbjct: 53  ESGSRFHSTEFDWPKNNSPSGFSMKLRKHLRGRRLESVQQLGADRVVDMQFGSNEAAYHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LELY +GN++LTD E+ +L LLR   D+ + V +     YP +  R  +     KLH+A
Sbjct: 113 VLELYDRGNLVLTDHEYNILNLLRVRTDESQDVKLAVHESYPLQTARQ-DTVDHDKLHSA 171

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L  +KE D                                                   L
Sbjct: 172 LLEAKEGD--------------------------------------------------HL 181

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YGPAL EH +   GL  N ++ +   ++++   +L  A+ + +  L+++  
Sbjct: 182 KRILNPLLPYGPALIEHSLRAAGLPENCRMGKEFIVQEHMASLLA-ALVEAQRILENM-- 238

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G    +GYI+ + +   K    TE G     Y+EF P L  Q  S   ++FE+F  A+DE
Sbjct: 239 GSESSKGYIIQKKE---KKASSTE-GDELITYNEFHPYLYKQHESCPHLEFESFSKAVDE 294

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+SKIESQ+ + +   +E +   KL  +  D   R+  L  E ++     +LIE NL  V
Sbjct: 295 FFSKIESQKLDMKTLQQEKSVLRKLENVRKDHAQRLQALANEQEKDNIKGQLIEMNLPLV 354

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           + AIL V+ ALAN++ W D+ ++VKE +  G+PVA  I  L L+ N  +++L +  +   
Sbjct: 355 ERAILVVQSALANQLDWADINQLVKEAQAQGDPVASSISSLQLQSNHFTMMLRDCYE--G 412

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
           DEE  LP +KV++DL LSA+ANAR++Y+ KK    K++KT+ A +KA K+AEKKT+  + 
Sbjct: 413 DEEDMLPAQKVQIDLGLSAYANARKYYDKKKHAAQKEQKTVAASTKALKSAEKKTKQTLK 472

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           + +  A I   RK HWFEKF WFISSENYLVI GRD QQNE++VKR++  GD+YVHADLH
Sbjct: 473 EVQVAATIRKQRKTHWFEKFLWFISSENYLVIGGRDQQQNELLVKRHLRPGDLYVHADLH 532

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           GASS +IKN      VPP TLN+AG   +CHS AWD+K+VTSAWWV+ HQVSKTAPTGEY
Sbjct: 533 GASSVIIKN---PSGVPPKTLNEAGTMALCHSAAWDAKVVTSAWWVHHHQVSKTAPTGEY 589

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           LT GSFMIRGKKNFLPP  LI GFG LF++D++S+  H +ER+VR
Sbjct: 590 LTTGSFMIRGKKNFLPPSYLIYGFGFLFKVDDTSIFRHQDERKVR 634


>gi|148704665|gb|EDL36612.1| mCG3169, isoform CRA_a [Mus musculus]
          Length = 1083

 Score =  547 bits (1410), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 310/745 (41%), Positives = 431/745 (57%), Gaps = 100/745 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 20  MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 71

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 72  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 131

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 132 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 178

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 179 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 202

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 203 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 258

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 259 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 314

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 315 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 374

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-- 475
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 375 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRNPYL 434

Query: 476 LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWY 507
           L E +D +    +E                             V+VDL+LSA+ANA+++Y
Sbjct: 435 LSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 494

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 495 DHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 554

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 555 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 613

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 614 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 673

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 674 FKVDESCVWRHRGERKVRVQDEDME 698


>gi|32130521|ref|NP_079717.2| nuclear export mediator factor Nemf [Mus musculus]
 gi|47606756|sp|Q8CCP0.2|NEMF_MOUSE RecName: Full=Nuclear export mediator factor Nemf; AltName:
           Full=Serologically defined colon cancer antigen 1
           homolog
          Length = 1064

 Score =  547 bits (1410), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 310/745 (41%), Positives = 431/745 (57%), Gaps = 100/745 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-- 475
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRNPYL 415

Query: 476 LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWY 507
           L E +D +    +E                             V+VDL+LSA+ANA+++Y
Sbjct: 416 LSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 475

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 476 DHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 535

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 536 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 594

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 595 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 654

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 655 FKVDESCVWRHRGERKVRVQDEDME 679


>gi|431893718|gb|ELK03539.1| Serologically defined colon cancer antigen 1 [Pteropus alecto]
          Length = 1077

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 308/747 (41%), Positives = 437/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +  R  E          
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVREHYPVDHARAVE--------PL 164

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  +             ++NA K  L                             L
Sbjct: 165 LTLERLTEV------------IANAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYIK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFSGKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -------------NLDEMDDEEKTLPVEK----------------VEVDLALSAHANARR 505
                        N+++++ E      +K                V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDINVEKIETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H +ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRSERKVRVQDEDME 681


>gi|291403822|ref|XP_002718277.1| PREDICTED: serologically defined colon cancer antigen 1
           [Oryctolagus cuniculus]
          Length = 1076

 Score =  544 bits (1401), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 309/748 (41%), Positives = 435/748 (58%), Gaps = 104/748 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLCAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   RQLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSARQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIILTDYEYLILNILRFRTDEADDVKFAVRERYPLDHAR------------- 159

Query: 181 LTSSKEPDANEP----DKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
                   A EP    +++ E    +S+A K  L                          
Sbjct: 160 --------AAEPLLSLERLTE---VISSAPKGEL-------------------------- 182

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
              LK VL   L YGPAL EH ++++G   N+K+ E  KLE   I+ ++  + K ED+++
Sbjct: 183 ---LKRVLNPLLPYGPALIEHCLMESGFPGNVKVDE--KLESKDIEKVLTCLQKAEDYMK 237

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
              + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD 
Sbjct: 238 --TTSNFRGKGYII-QKREIKPSLEVDKPSEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L  N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLHTNHVTMLLRNPY 414

Query: 475 ------------NLDEMDDEEKTLPVEK------------------VEVDLALSAHANAR 504
                       ++    +E + L  +K                  V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVTVEKNENEPLKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRIQDEDME 681


>gi|340713692|ref|XP_003395373.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
           terrestris]
          Length = 971

 Score =  543 bits (1400), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 350/935 (37%), Positives = 519/935 (55%), Gaps = 132/935 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+A  +  L++ IGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNSYDIACTICELQKFIGMRVNQIYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTTA+   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A++VI
Sbjct: 53  SGNRIHTTAFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGIDRMIDLQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E T+L +LR H + DK +    + +YP              +  A 
Sbjct: 113 LELYDRGNIVLTDHEMTILNILRPHTEGDK-IRFAVKEKYP--------------MDRAH 157

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
            ++  P  N                +++L   K G+S                     LK
Sbjct: 158 QNTMPPIEN---------------IQQHLQNAKAGES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            +L   L +G +L +H++L  G     K+ +   + ++  + L+LA+ ++ + + D    
Sbjct: 182 KLLNPLLEFGSSLIDHVLLKHGFTLGCKIGKDFNVAEHMPK-LILAL-EYANEMMDFARK 239

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
           + V +GYI+ +     K+  PT  G    IY   EF P L  Q+    + +F++FD A+D
Sbjct: 240 N-VSKGYIIQK-----KESKPTTDGKENFIYTNIEFHPFLFEQYADYPYKEFDSFDVAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
           E++S +E Q+ + +   +E  A  KL  +  D + R+  L+  QE+D+  + AELI  N 
Sbjct: 294 EYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDK--QKAELISRNQ 351

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
             VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352 ALVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           + D+E +  P+  +++DLA +A  NA ++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412 DSDEESELKPM-LIDIDLAHTAFGNATKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471 TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKSGDIYVHA 530

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           DL GASS VIKN   +  VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531 DLTGASSVVIKNPGNDS-VPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
           GEYLT GSFMIRGKKN+LPP  L+MG G LFRL+ESS+  H NERRVR         +D 
Sbjct: 590 GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIERHKNERRVRV-------IDDE 642

Query: 718 GHH-----KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTI 772
             H     +E+ +IE   D  +++               P + N  N    E   +    
Sbjct: 643 SEHTDSLIEEDREIELIGDSEEDE--------------QPENKNNLNPIQEESKIDMIME 688

Query: 773 SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVE 832
            N ++  + D   N+A    P  +  ID        S S  K  ++  Q  +  + K ++
Sbjct: 689 ENNVNQDVSDEENNLAQ--FPDTQIRID-------VSGSKVKLHVDNNQSTVIPQ-KDLD 738

Query: 833 RTATVRDKPYISKA----ERRKLKKGQGSSVVDPKVERE-KERGKDASSQPESIVRKTKI 887
                 DKP I  A    +R ++K+        P ++++ KER +    + + +V K   
Sbjct: 739 VIYLGDDKPVIINAVNMQKRSEIKQ-------KPPLKKDNKERIETEPKKNDQVVLK--- 788

Query: 888 EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
                 RGQKG+LKKMKEKY DQDEE+R + M +L
Sbjct: 789 ------RGQKGRLKKMKEKYKDQDEEDRRLSMQVL 817


>gi|240978882|ref|XP_002403060.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215491284|gb|EEC00925.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 651

 Score =  543 bits (1398), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 317/719 (44%), Positives = 430/719 (59%), Gaps = 92/719 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  LR RL+GMR   VYD   KTY+FKL        +   EK +LL+
Sbjct: 1   MKSRFSTVDIVAMICELRQRLVGMRVIQVYDADSKTYLFKL--------NRHDEKAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTT +A  K  +PSGF++KLRKH+R +R+E V QLG DRI+  QFG+   A++V
Sbjct: 53  ESGVRLHTTDFAWPKNLSPSGFSMKLRKHLRNKRVESVSQLGADRIVDIQFGVNEAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           ILELY +GN++LTD ++ +L +LR     DD  V  + R RYP +              +
Sbjct: 113 ILELYDRGNLVLTDGDYMILNILRPRTGKDDDDVKFVVRERYPVQ--------------S 158

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP- 238
           AL+ + + +A                                         D  R  +P 
Sbjct: 159 ALSPALDAEA---------------------------------------LTDILRFAKPA 179

Query: 239 -TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
            TL+ +L   + YGPAL EH++   GL    K+++V+   D         VA   + LQD
Sbjct: 180 DTLRKLLTPKVSYGPALLEHVLRARGLSTGAKVADVDASRD---------VATLLECLQD 230

Query: 298 VIS----GDIVP-EGYILMQNKHLGKDHPPTESGSSTQI--YDEFCPLLLNQFRSREFVK 350
             +        P +GYIL++   + K   P + GS T+I  Y EF P L  Q      V+
Sbjct: 231 AEALMERARTEPSKGYILVR---VEKRVTPADDGS-TEITSYQEFHPFLWRQHEKERVVE 286

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
             +F AA+D+F+S +E QR   +   KE  A  KL  I MD E R+  L+Q        A
Sbjct: 287 LASFSAAVDQFFSSLEMQRISLKAHQKEKEALKKLENIRMDHEKRIVALEQVQREDKHKA 346

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           ELIE NL+ V+ A+L +R ALAN++ W ++  +++E ++ G+PVA  I +L L+ N  ++
Sbjct: 347 ELIEINLDLVERALLVLRSALANQIGWAEITELLREAQEQGDPVAQSIKQLKLDTNHFAM 406

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           LL +  +E  D   TL    V++DL LSA+ANARR+Y+ K+    KQ+KT+ + +KA+K+
Sbjct: 407 LLRDPYEE--DARDTL----VDIDLDLSAYANARRYYDQKRHAAGKQQKTLESSTKAYKS 460

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           AEKKT+  + Q    +NI+  RK  WFEKF WFISSE+YLVI GRDAQQNEMIVKR+++ 
Sbjct: 461 AEKKTKEALKQVALTSNIARARKAFWFEKFFWFISSEDYLVIGGRDAQQNEMIVKRHLNP 520

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GDVYVHADLHGASS VIKN      VPP TLN+AG   +C+S AWD+K+VTSAWWV+ HQ
Sbjct: 521 GDVYVHADLHGASSIVIKNP-GGGSVPPKTLNEAGTMAICYSAAWDAKVVTSAWWVHHHQ 579

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           VSKTAPTG+YLT G+FMIRGKKN+LPP  LIMGFG L++LDE S+  H  ERRVR  EE
Sbjct: 580 VSKTAPTGQYLTPGAFMIRGKKNYLPPSYLIMGFGFLYKLDEDSVERHSGERRVRTAEE 638


>gi|189239405|ref|XP_001813943.1| PREDICTED: similar to CG11847 CG11847-PA [Tribolium castaneum]
 gi|270010510|gb|EFA06958.1| hypothetical protein TcasGA2_TC009916 [Tribolium castaneum]
          Length = 972

 Score =  543 bits (1398), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 367/933 (39%), Positives = 513/933 (54%), Gaps = 130/933 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++ +GMR +NVYD+  KTY+ +L  S         EK ++L+E
Sbjct: 1   MKTRFNTFDIICTVTELQKCVGMRVNNVYDIDNKTYLIRLQRSE--------EKAVILLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R H T +   K   PSGF++KLRKH++ +RLE + QLG DRI+ FQFG G  A++VI
Sbjct: 53  SGNRFHETGFEWPKNVAPSGFSMKLRKHLKNKRLESLAQLGTDRIVDFQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD EFT+L +LR H + D+    + R +YP +  R     T  +L   L
Sbjct: 113 LELYDKGNIILTDFEFTILNVLRPHTEGDR-FKFVVREKYPQDRARQSSLITRDELVQLL 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
            ++K  D                                                   LK
Sbjct: 172 KAAKNGDQ--------------------------------------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            VL   L YGP L EH++L  G   + K+ +   +E +  +VL  A+ + E+   +    
Sbjct: 182 KVLVPNLEYGPPLIEHVLLKQGFSNSTKIGKTFNIESDVDKVLC-ALEEAENLFSEAKKA 240

Query: 302 DIVPEGYILMQNKH--LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
               +GYI+ + +   +  D+P  E   S Q   EF P+L  Q +S    +F +F++A+D
Sbjct: 241 GF--KGYIIQKKEERVVSADNPEKEYYYSNQ---EFHPVLYEQHKSSISKEFPSFNSAVD 295

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
           EF+S +ESQ+ E +   +E  A  KL  +  D   R+  L+  QE+D+  + AELI  N 
Sbjct: 296 EFFSSLESQKLELKALQQEREALKKLENVKKDHSQRLLALEKTQEIDK--QKAELITRNQ 353

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           E VD AILAV+ ALA ++SW DLA ++KE    G+ +A  I +L LE N +SL L++   
Sbjct: 354 ELVDKAILAVQTALATQISWSDLADLIKEAASQGDEIAQRIKELKLETNHISLYLTDPYA 413

Query: 475 ---NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
              +  + +D +  +P   V+VDL LSA AN RR+Y+ K+    KQ+KTI + SKAFK+A
Sbjct: 414 EDDSESDDEDNDDKIPPMVVDVDLDLSAFANGRRYYDQKRNAAKKQQKTIESQSKAFKSA 473

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           EKKT+  +   +T+ NI+  RKV+WFEKF WFISSENYLVI+GRD QQNE+IVKRYM   
Sbjct: 474 EKKTKQTLKDVQTITNINKARKVYWFEKFFWFISSENYLVIAGRDQQQNELIVKRYMKST 533

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           DVYVHAD+HGASS VIKN    Q VPP TLN+AG   +C+S AWD+K+VT+A+WV+  QV
Sbjct: 534 DVYVHADVHGASSVVIKNPSG-QAVPPKTLNEAGTMAICYSVAWDAKVVTNAYWVWGEQV 592

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           SKTAPTGEYL+ GSFMIRGKKNFLP   LI+G   LF+L+ES +  H +ERRV     G 
Sbjct: 593 SKTAPTGEYLSTGSFMIRGKKNFLPLSHLILGLSFLFKLEESCIEKHKDERRVIA--PGE 650

Query: 712 DDFEDSGHHKENSDIESEK-DDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDK 770
           +DF ++   +   ++E E  D++DE+                   N   V S    A DK
Sbjct: 651 EDFVETVESENKDEVEVEVLDESDEE-------------------NKEEVKS----AADK 687

Query: 771 TISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKH 830
            I N  +S   +   +   P T            +       TK  I T     ++E   
Sbjct: 688 EIENEENSSSSEDEESSKFPDT-----------QIKIQHFEGTKINILTEPVIRNDETDE 736

Query: 831 VERTATVRD-KPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEG 889
            E    + D KP + K  +R  +    S    PK + + ER ++ ++      ++TK   
Sbjct: 737 NETVVYLGDNKPVVVKPNQRS-RNTSESKTKQPKNDAKNERKEETNN------KQTK--- 786

Query: 890 GKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
               RGQK KLKK+KEKY DQDEEER +RM +L
Sbjct: 787 ----RGQKSKLKKIKEKYKDQDEEERKLRMEIL 815


>gi|355778566|gb|EHH63602.1| hypothetical protein EGM_16603 [Macaca fascicularis]
 gi|380817886|gb|AFE80817.1| nuclear export mediator factor NEMF [Macaca mulatta]
 gi|383422753|gb|AFH34590.1| nuclear export mediator factor NEMF [Macaca mulatta]
 gi|384950256|gb|AFI38733.1| nuclear export mediator factor NEMF [Macaca mulatta]
          Length = 1077

 Score =  542 bits (1397), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 308/744 (41%), Positives = 431/744 (57%), Gaps = 96/744 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A++    
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR------AAEPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L S  E  A+ P                                           K   L
Sbjct: 167 LESLTEIVASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 --------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWYE 508
                         N  E    +K     K            V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSEN
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSEN 538

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG   
Sbjct: 539 YLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMA 597

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF
Sbjct: 598 LCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLF 657

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
           ++DES +  H  ER+VR ++E M+
Sbjct: 658 KVDESCVWRHRGERKVRVQDEDME 681


>gi|426233096|ref|XP_004010553.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Ovis
           aries]
          Length = 1076

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 311/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDIEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVRVQDEDME 681


>gi|329664770|ref|NP_001192434.1| nuclear export mediator factor NEMF [Bos taurus]
          Length = 1076

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 310/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVRVQDEDME 681


>gi|348572143|ref|XP_003471853.1| PREDICTED: nuclear export mediator factor NEMF-like [Cavia
           porcellus]
          Length = 1076

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 307/744 (41%), Positives = 430/744 (57%), Gaps = 96/744 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDV--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E          
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPVDHARAAE--------PL 164

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  D             +++A K  L                             L
Sbjct: 165 LTLERLTDV------------IASAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ + K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLESKEIEKVLVCMQKAEDYVK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFSGKGYII-QKREIKPSLEVDKPAEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D E R+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHETRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 -------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANARRWYE 508
                      ++  E  LP  K                   V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVDGDVSVEKNETELPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSEN
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSEN 538

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E  VPP TL +AG   
Sbjct: 539 YLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-VVPPRTLTEAGTMA 597

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF
Sbjct: 598 LCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLF 657

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
           ++DES +  H  ER+VR ++E M+
Sbjct: 658 KVDESCVWRHRGERKVRVQDEDME 681


>gi|440907236|gb|ELR57405.1| Serologically defined colon cancer antigen 1 [Bos grunniens mutus]
          Length = 1077

 Score =  541 bits (1394), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 310/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGVPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVRVQDEDME 681


>gi|350409527|ref|XP_003488770.1| PREDICTED: nuclear export mediator factor NEMF homolog [Bombus
           impatiens]
          Length = 971

 Score =  541 bits (1394), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 346/930 (37%), Positives = 519/930 (55%), Gaps = 122/930 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+A  +  L++ IGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNSYDIACTICELQKFIGMRVNQIYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTTA+   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A++VI
Sbjct: 53  SGNRIHTTAFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRMIDLQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E T+L +LR H + DK +    + +YP +  R  + T         
Sbjct: 113 LELYDRGNIVLTDHEMTILNILRPHTEGDK-IRFAVKEKYPMD--RAHQNTMP------- 162

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                P  N                +++L   K G+S                     LK
Sbjct: 163 -----PIEN---------------IQQHLQSAKAGES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            +L   + +G ++ +H++L  G     K+ +   + ++  + L+LA+ ++ + + D    
Sbjct: 182 KLLNPLVEFGASVIDHVLLKHGFTLGCKIGKDFNVAEHMPK-LILAL-EYANEMMDFARK 239

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
           + V +GYI+ +     K+  PT  G    IY   EF P L  Q+ +  + +F++FD A+D
Sbjct: 240 N-VSKGYIIQK-----KESKPTADGKEDFIYTNIEFHPFLFEQYTNYPYKEFDSFDVAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
           E++S +E Q+ + +   +E  A  KL  +  D + R+  L+  QE+D+  + AELI  N 
Sbjct: 294 EYFSTMEGQKLDLKALQQERDALKKLENVKKDHDQRLINLEKTQELDK--QKAELISRNQ 351

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
             VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352 TLVDNAILAIQSALANQMAWPDIKVLLKEAESRGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           + D+E +  P+  +++DLA +A  NA ++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412 DSDEESELKPM-LIDIDLAHTAFGNATKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471 TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKSGDIYVHA 530

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           DL GASS VIKN   +  VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531 DLTGASSVVIKNPGSDS-VPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
           GEYLT GSFMIRGKKN+LPP  L+MG G LFRL+ESS+  H +ERRVR         +D 
Sbjct: 590 GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIERHKDERRVRI-------IDDE 642

Query: 718 GHHKENSDIESEKD-----DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTI 772
             H  +S IE +++     D++E   +E+    N+ +P    +    +            
Sbjct: 643 SEHT-DSLIEEDREIELIGDSEEDEQSEN---KNNLNPIQEESKVDIIMEE--------- 689

Query: 773 SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVE 832
            N ++  + D   N+     P  +  ID        S S  K  ++  Q  +  + K ++
Sbjct: 690 -NNVNQDVSDEENNLVQ--FPDTQIRID-------VSGSKVKLHVDNNQLTVMPQ-KDLD 738

Query: 833 RTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKI 892
                 DKP I  A        Q SS +  K+  +K+  +    +P+      K +   +
Sbjct: 739 VIYLGDDKPVIINAVNM-----QKSSEIKQKLPLKKDNKEKIEIEPK------KNDQVVL 787

Query: 893 SRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
            RGQKG+LKKMKEKY DQDEE+R + M +L
Sbjct: 788 KRGQKGRLKKMKEKYKDQDEEDRRLSMQVL 817


>gi|242018711|ref|XP_002429817.1| Serologically defined colon cancer antigen, putative [Pediculus
           humanus corporis]
 gi|212514835|gb|EEB17079.1| Serologically defined colon cancer antigen, putative [Pediculus
           humanus corporis]
          Length = 1024

 Score =  540 bits (1391), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 308/728 (42%), Positives = 431/728 (59%), Gaps = 90/728 (12%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T D+   V   ++ IG+R + VYD+  KTY+ +L  +         EKV++L+E
Sbjct: 1   MKTRFSTFDIVCSVAEFQKYIGLRVNQVYDIDHKTYLIRLQKTD--------EKVVILLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF +KLRKH+R +RLE ++QLG+DRI+  QFG G  A++V 
Sbjct: 53  SGTRIHTTDFEWPKNVAPSGFCMKLRKHLRNKRLESLKQLGFDRIVHLQFGTGDAAYHVF 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICR-VFERTTASKLHAA 180
           LELY +GNI+LTD +  +L +LR H + DK +    R +YP    R V    T  ++   
Sbjct: 113 LELYDKGNIVLTDCDLIILNILRPHTEGDK-IRFAVREKYPINRARDVCNFPTEEQIKNI 171

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             S+K                                           SND        L
Sbjct: 172 FASAK-------------------------------------------SNDN-------L 181

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YGPAL EH++L   L    K+ +   L++N +  ++ A+ + +D +++   
Sbjct: 182 KKILNFNLDYGPALIEHVLLGVDLRGTEKIGQGFDLQNN-LSKIINALKEAQDIVENASL 240

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESG-SSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
              V +GYI+ +      +  PTESG S   +  EF P L  Q     F + ETF  A+D
Sbjct: 241 S--VSKGYIIQK-----VEKRPTESGMSDFHVNTEFHPFLFRQHVKNPFNECETFLKAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNL 417
            F+S +ESQ+ + +   +E  A  K+  +  D   R+  L   QE+DR +K AELI  NL
Sbjct: 294 SFFSSLESQKIDMKAINQEKEALKKIENVRRDHNQRLQQLFETQELDR-IK-AELITTNL 351

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             VD A+LA+R A+AN++SW D+  +VKE + AG+PVA  I KL L+ N ++L LS+   
Sbjct: 352 TLVDQAVLAIRTAIANQISWPDIDILVKEGKNAGDPVASSIKKLKLDINHITLQLSDPYR 411

Query: 475 ----------NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTI 521
                       +  DD+   + V K   V++DL L+A ANAR++Y++K+    KQ+KTI
Sbjct: 412 SDSSSSEEEEEEETNDDKPIKIKVPKIIDVDIDLDLTAFANARKYYDMKRSAAKKQQKTI 471

Query: 522 TAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
            +  KA K+AEKKT+  + + KT+ NI+ +RK  WFEKF WFISSENYLVI+GRD  QNE
Sbjct: 472 ESQDKALKSAEKKTKQALKEMKTIVNITKVRKTFWFEKFFWFISSENYLVIAGRDMMQNE 531

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT 641
           ++VKRYM  GD+YVHAD+HGASS +IKN   E PVPP TLN+AG   + +SQAW++K+VT
Sbjct: 532 LLVKRYMKSGDLYVHADIHGASSVIIKNPSNE-PVPPKTLNEAGVMAISYSQAWEAKVVT 590

Query: 642 SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNE 701
           SAWWV+  QVSKTAPTGEYL  GSFMIRGKKN+LPP  LIMGF  LF+LD++SL  H ++
Sbjct: 591 SAWWVHNTQVSKTAPTGEYLGTGSFMIRGKKNYLPPANLIMGFSFLFKLDDNSLSRHKDD 650

Query: 702 RRVRGEEE 709
           R+VR  EE
Sbjct: 651 RKVRSLEE 658


>gi|296483277|tpg|DAA25392.1| TPA: hypothetical protein BOS_10863 [Bos taurus]
          Length = 1076

 Score =  540 bits (1391), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 309/747 (41%), Positives = 435/747 (58%), Gaps = 102/747 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNEPEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 654

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+V+ ++E M+
Sbjct: 655 FLFKVDESCVWRHRGERKVKVQDEDME 681


>gi|417405795|gb|JAA49597.1| Putative rna-binding protein [Desmodus rotundus]
          Length = 1081

 Score =  539 bits (1389), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 306/742 (41%), Positives = 426/742 (57%), Gaps = 106/742 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER-TTASKLHA 179
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP    R  E   T  +L  
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVGHARAVEPLPTLERLTE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            +TS+ E +                                                   
Sbjct: 173 VITSAAEGE--------------------------------------------------L 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK  L   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 183 LKRALNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  E       Y+EF P L +Q     +++FE+FD 
Sbjct: 239 ASNFSGKGYIIQKREVKPSLEVDKPAEE----ILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+    D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNFRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L  VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LPVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 --------------NLDEMDDE-----------------EKTLPVEKVEVDLALSAHANA 503
                         N+++ + E                 +K  P+  V+VDL+LSA+ANA
Sbjct: 415 LLSEEEDDDVDGEINVEKSETEPPKGKKKKQKNKQLQRPQKNRPL-LVDVDLSLSAYANA 473

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWF 563
           +++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WF
Sbjct: 474 KKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWF 533

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
           ISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +
Sbjct: 534 ISSENYLIIGGRDQQQNEVIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTE 592

Query: 624 AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MG
Sbjct: 593 AGTMALCYSAAWDARIITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMG 652

Query: 684 FGLLFRLDESSLGSHLNERRVR 705
           F  LF+++ES    H  ERRVR
Sbjct: 653 FSFLFKVEESCAWRHRGERRVR 674


>gi|119586145|gb|EAW65741.1| serologically defined colon cancer antigen 1, isoform CRA_a [Homo
           sapiens]
          Length = 828

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 312/748 (41%), Positives = 434/748 (58%), Gaps = 104/748 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNN-VSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
                   A EP    E     V++A K  L                             
Sbjct: 160 --------AAEPLLTLERLTEIVASAPKGEL----------------------------- 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREIKPCLEADKPVEDILT----YEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHQGERKVRVQDEDME 681


>gi|405952718|gb|EKC20496.1| Serologically defined colon cancer antigen 1-like protein
           [Crassostrea gigas]
          Length = 1084

 Score =  537 bits (1383), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 295/724 (40%), Positives = 426/724 (58%), Gaps = 84/724 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +  D+A  +K L+R  GMR  NVYD+  KTY+ KL            +K ++L+E
Sbjct: 1   MKSRFSKVDIAVVIKELKRFYGMRVVNVYDVDSKTYLIKL--------GKPDDKAVILIE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+R+H T Y   K   PSGF++KLRKHI+ RRLE++ QLG DRI+  QFG G  A++VI
Sbjct: 53  SGIRIHGTEYDWPKNMAPSGFSMKLRKHIKGRRLENINQLGMDRIVDLQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD EFT+L +LR   D  + V    R  YP    +     +  KL   +
Sbjct: 113 LELYDRGNVVLTDFEFTILNILRPRTDTCQDVKFAVRETYPVSAAKQHSVPSNEKLREVI 172

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
            ++K  D                                                   LK
Sbjct: 173 LAAKVGD--------------------------------------------------VLK 182

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSE---VNKLEDNAIQVLVLAVAKFEDWLQDV 298
            VL   L YGPA++EH +   G   N+K+ +   V +  D     + LA +  +   ++ 
Sbjct: 183 KVLLPHLDYGPAVTEHCLQCIGFPENVKVGKGFSVTEDMDKLTSAIELAESLLKTLSEEP 242

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--YDEFCPLLLNQFRSREFVKFETFDA 356
             G       +++Q K   +     + G + ++  Y+EF P+L  QF ++    F+ F+ 
Sbjct: 243 CQG-------VIVQKK---EKRAAVKEGENAELLTYEEFHPMLFKQFENKPHSIFDNFNK 292

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           ++DEF+S+IESQ+ + +   +E +A  KL+ I  D E R+  L++E +  +    LIE N
Sbjct: 293 SVDEFFSQIESQKLDMKALQQEKSALKKLDNIKKDHEKRIEGLQKEQETDINKGRLIELN 352

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL---- 472
           L  VD A+L VR ALAN++ W ++  +V E +  G+PVA  I  L L+ N ++LLL    
Sbjct: 353 LPLVDQALLIVRSALANQIDWTEIENLVHEAQLQGDPVASCITGLKLDSNMITLLLRDPY 412

Query: 473 --SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
             S++  + DD++  L   K+++D+++SA+ N+R++++ KK    K++KTI A +KA K+
Sbjct: 413 RYSDDEYDDDDDDDVLKPTKIDIDISMSAYGNSRKYFDKKKTAAKKEQKTIDASAKALKS 472

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           AE+KT+  + +  T A I+  RK +WFEKF WFI+SENYLVI GRD QQNEMIVKRY+  
Sbjct: 473 AERKTKETLKEVATAATINKARKTYWFEKFLWFITSENYLVIGGRDQQQNEMIVKRYLRP 532

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GD+YVHADLHGASS V+KN   E PVPP +LN+AG   +C+S AWD+K+VTSAWWVY  Q
Sbjct: 533 GDLYVHADLHGASSCVLKNPSGE-PVPPKSLNEAGTMAICNSVAWDAKVVTSAWWVYHDQ 591

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           VSKTAP+GEYLT GSFMIRGKKN+LPP  L+ GFGLLF+L++ S+  H  ER+V     G
Sbjct: 592 VSKTAPSGEYLTTGSFMIRGKKNYLPPTHLVYGFGLLFKLEDDSIERHKGERKVH----G 647

Query: 711 MDDF 714
           +DD+
Sbjct: 648 VDDY 651


>gi|384489957|gb|EIE81179.1| hypothetical protein RO3G_05884 [Rhizopus delemar RA 99-880]
          Length = 1044

 Score =  536 bits (1382), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 300/720 (41%), Positives = 426/720 (59%), Gaps = 111/720 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R N  DV A V  L+ RLIG+R  NVYD++ KT++FK         +   +K L+L 
Sbjct: 1   MKQRFNALDVRATVSNLKERLIGIRLQNVYDVNAKTFLFKF--------AKPDDKELVL- 51

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA--H 118
                                    +RKH+RTRRL +VRQLG DRI+ F+F  G  +  +
Sbjct: 52  -------------------------IRKHLRTRRLTNVRQLGVDRIVDFEFAGGEKSIGY 86

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++I E YA GNI+LTD E+ +L LLR+ +  +     +        +   F++  A +L 
Sbjct: 87  HIICEFYASGNIILTDHEYRILALLRAVQPTETLKMAVGEIYNIQSVLNDFQKVEAEQLR 146

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            AL+++   D                                                  
Sbjct: 147 NALSAAGPKD-------------------------------------------------- 156

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA--IQVLVLAVAKFEDWLQ 296
            LK +L     YGPA+ EHIIL++ L PNMK++      +N+  +Q L+    K +D ++
Sbjct: 157 NLKKILNIKFEYGPAMIEHIILESELDPNMKVASDFDTSENSPMMQALLEGFKKADDMIE 216

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
              +G+ VP+GYI++QN    +     +     +IYDEF P L  QF +R+F +F TFD 
Sbjct: 217 S--TGNSVPKGYIILQND--TRQTKNEKEEEEMEIYDEFHPHLYKQFSNRKFKEFSTFDQ 272

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEF+S IE+Q+ E + + +E+AA  KL  + ++QE RV +L  +   + + A+LIE N
Sbjct: 273 AVDEFFSSIEAQKLELKTRRQEEAALKKLEAVKLEQEKRVESLLNQQLTNTRKAQLIELN 332

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
           L+ VDAAI  +R A+A++M W+DL  +VKEE++ GNP+A +ID L LE N ++LLL++  
Sbjct: 333 LQFVDAAITIIRNAVASQMDWQDLNDLVKEEKRRGNPIALIIDTLKLETNQVTLLLTDPE 392

Query: 477 DEMDDEEKTLP--------------VEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
           +  + E                   + K++VD+ L+A ANAR++YE KK   SK EKTI 
Sbjct: 393 EHEESESDDEEEEEEEEEKEEKPKEIFKIDVDIGLTAFANARKYYEQKKTTASKHEKTIE 452

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           A +KA K+AE+K R  + + K  A I+ +RK  WFEKF WFIS+E YLVI+GRD QQNEM
Sbjct: 453 ASTKALKSAERKIRKDLKETKITATINKIRKPFWFEKFQWFISTEGYLVIAGRDMQQNEM 512

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPE---QPVPPLTLNQAGCFTVCHSQAWDSKM 639
           +V+RY+SK DVYVHADLHGA+S ++KN +P+   QP+ P TL QAG  +VC S+AWDSK+
Sbjct: 513 LVRRYLSKDDVYVHADLHGAASVIVKN-KPQANGQPISPSTLYQAGIMSVCQSKAWDSKI 571

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           VTSA+WVYP QVSK+AP+GEYLT GSFMIRGKKNFLPP  L+ GFG LF+LDESS+G+H+
Sbjct: 572 VTSAYWVYPDQVSKSAPSGEYLTTGSFMIRGKKNFLPPVQLVYGFGYLFKLDESSIGNHI 631


>gi|330841435|ref|XP_003292703.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
 gi|325077022|gb|EGC30763.1| hypothetical protein DICPUDRAFT_40970 [Dictyostelium purpureum]
          Length = 1084

 Score =  535 bits (1378), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 302/753 (40%), Positives = 456/753 (60%), Gaps = 94/753 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+   V  L++ LIG+R +N+YDLSP+ ++ K         S    K  L++
Sbjct: 1   MKTRFSSIDIRTTVFNLQKSLIGLRLANLYDLSPRVFLLKF--------SRPDFKKNLII 52

Query: 61  ESGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           ESG+R+H+T + RDK + TP+ F+L LRK+++T+RLE V+QLG DR++ F FG G+   +
Sbjct: 53  ESGIRIHSTNFIRDKGDHTPAPFSLTLRKYLKTKRLESVKQLGVDRVVDFTFGSGVAVQH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           +I+ELY+ GNI+LTD ++ +L             AI+  H+Y  +     E      ++ 
Sbjct: 113 LIIELYSIGNIILTDGDYRIL-------------AILRTHQYNQD-----ESVAVGDVYP 154

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
                     N+  K  E    + ++  EN                        + K+ T
Sbjct: 155 V---------NKAKKPTEFTTELIDSIIEN-----------------------TQDKKET 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK V  ++L +GP L EH IL  GL P++KL +     D+++    L ++ F++  Q + 
Sbjct: 183 LKQVFNKSLDFGPELIEHCILSAGLQPSLKLEQY----DHSVSSQAL-ISAFKEG-QKIY 236

Query: 300 SGDIVPEGYILMQNKHLGKD--------------HPPTESGSSTQIYDEFCPLLLNQFRS 345
              +  +GYI++++    K                PP E      +Y+EF P L  Q+ S
Sbjct: 237 DQSVASKGYIVLKDPKQQKPQQQKKQQQQTSTTAEPPKE----IVMYEEFVPFLYKQYES 292

Query: 346 REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
           ++++++++FD A+D+F+S+IESQ+ EQQ   +E     KL+K+  DQ+ R+ +L      
Sbjct: 293 KKYIEYDSFDGAVDQFFSEIESQKLEQQRIQQEQTVLKKLDKVKEDQQRRIDSLFANEAE 352

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKLYL 463
           +V+ AELIE NL++VD  IL +R  +AN M+W+ L +++KEE+K  NP  VA  I +L L
Sbjct: 353 NVRKAELIEANLQEVDQCILIIRSGVANSMNWDTLNQLLKEEKKK-NPYSVATKIQRLKL 411

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKT 520
           E N ++L L++     DDEE     +K   ++VD++LSA ANAR++Y+ KK+   K +KT
Sbjct: 412 ESNQITLALTDGFLYDDDEEVNKTNKKPTLIDVDISLSAFANARKYYDTKKQSHEKAQKT 471

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           I+    A KAAE KTR Q+ + K+  ++  MRKV WFEKF+WFISS+NY+V+SGRDAQQN
Sbjct: 472 ISQAEFALKAAESKTRQQLSEVKSKHSMIQMRKVFWFEKFHWFISSDNYIVVSGRDAQQN 531

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           E++ K+Y+ K DVYVHAD+ G++S VIKN    + +PP TL QAG  T+C+S AW +K+V
Sbjct: 532 ELLFKKYLEKDDVYVHADIFGSTSCVIKNPNGGE-IPPNTLIQAGTMTMCYSNAWSAKVV 590

Query: 641 TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN 700
           TSA+WVY HQVSKTAP+GEYLT GSFMIRGKKN+LP   L+MGFG +F++DES +G+HLN
Sbjct: 591 TSAYWVYSHQVSKTAPSGEYLTTGSFMIRGKKNYLPHSQLVMGFGFMFKIDESCIGNHLN 650

Query: 701 ERRVRGEEEGMDDFEDSGHHKEN-SDIESEKDD 732
           ER+      G ++ ED G    N S+I +  DD
Sbjct: 651 ERKPLL--SGSNNHEDDGDASNNSSEIVTTNDD 681


>gi|356640194|ref|NP_001239258.1| serologically defined colon cancer antigen 1 [Gallus gallus]
          Length = 1071

 Score =  533 bits (1372), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 367/982 (37%), Positives = 522/982 (53%), Gaps = 149/982 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A V  LR  L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDIRALVAELRLSLLGMRVNNVYDVDSKTYLIRLQKPDC--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH++TRRL  VRQLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKTRRLVSVRQLGIDRIVDFQFGSNEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +              +A
Sbjct: 113 IIELYDRGNIVLTDHEYLILNILRFRTDEADDVRFAVRERYPVD--------------SA 158

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +  P      ++      +SNA K   G Q                          L
Sbjct: 159 KAPTPLPTLERLTEI------ISNAPK---GEQ--------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YG  L EH +++ G    +K+ +  + ++N I+ ++ A+ K E ++   ++
Sbjct: 184 KRVLNPHLPYGATLIEHCLIEAGFSGYVKIDQHMESKEN-IEKVLSALEKAEGYM--TLT 240

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            D   +GYI+ Q K       P +       Y+EF P L +Q     +++F++F+ A DE
Sbjct: 241 EDFNGKGYII-QKKEKKPSLEPDKPAEDIYTYEEFHPFLFSQHSKCPYLEFDSFNKAADE 299

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ--EVDRSVKMAELIEYNLE 418
           FYSK+E Q+ + +   +E  A  KL  +  D E R+  L+Q  EVD+ +K  ELIE NLE
Sbjct: 300 FYSKLEGQKIDLKALQQEKQALKKLENVRRDHEQRLEALQQAQEVDK-IK-GELIEMNLE 357

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN---- 474
            V  AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N    
Sbjct: 358 IVSRAIQVVRSALANQIDWTEIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLRNPYVL 417

Query: 475 ---------------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
                                         ++   +K  P   V+VDL+LSA+ANA+++Y
Sbjct: 418 SEEEEEGEDADLEKEETEEPKGKKKKNKSKQLKKPQKNKP-SLVDVDLSLSAYANAKKYY 476

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSE
Sbjct: 477 DHKRHAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSE 536

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYLVI+GRD QQNE+IVKRY+  GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 537 NYLVIAGRDQQQNELIVKRYLKPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 595

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD+++VTSAWWV  +QVSKTAPTGEYLT GSFMIRGKKNFL P  L+MGF  L
Sbjct: 596 ALCYSAAWDARVVTSAWWVSHNQVSKTAPTGEYLTTGSFMIRGKKNFLQPSYLMMGFSFL 655

Query: 688 FRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH--KENSDIESEKDDTDEKPVAESLSVP 745
           F++DES +  H  ER+++ ++E ++    S      E  ++    D + E+  AE     
Sbjct: 656 FKVDESCVWRHREERKIKVQDEDLETVSSSASELVAEEVELLEGGDSSSEEDKAE----- 710

Query: 746 NSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARN-VAAPVTPQLEDLIDRALG 804
              H AP    A             T  N  D  + D+ ++ V+ P  P  E + D   G
Sbjct: 711 --CHEAPEDVEA-------------TPENNGDENVADLDQDRVSTPPVP--EGVSDEDDG 753

Query: 805 LGSASISSTKHGIET-------TQFDLS--EEDKHVERTATVRDKPYISKAE---RRKL- 851
                    K  ++        T  DLS  +  + +++T    ++P +S ++   RR L 
Sbjct: 754 ESEVEQPEPKSEVKEEEVNYPDTTIDLSHLQSQRSLQKTIPKEEEPNLSDSKSQGRRHLS 813

Query: 852 ----------KKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLK 901
                     K+   S  +DP  ER+K+   +    P     K       I RGQK K+K
Sbjct: 814 AKERREMKKKKQQSDSENLDPPEERQKD--TETQRPPPPNTNKGVPAPQPIKRGQKSKMK 871

Query: 902 KMKEKYGDQDEEERNIRMALLA 923
           KMKEKY DQDEE+R + M LL 
Sbjct: 872 KMKEKYKDQDEEDRELIMKLLG 893


>gi|125858778|gb|AAI29514.1| LOC733300 protein [Xenopus laevis]
          Length = 906

 Score =  531 bits (1369), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 306/781 (39%), Positives = 442/781 (56%), Gaps = 108/781 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  L   L+GMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKSRFNTIDIRAVIAELTDSLLGMRVHNVYDIDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT-TASKLHA 179
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +  +  E   +  +L  
Sbjct: 113 IVELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVREHYPIDHAKAPEPLLSVERLKE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L ++K+ D                                                   
Sbjct: 173 VLDNAKKGD--------------------------------------------------Q 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YG  L EH +LDTGL  N+K+ +++  ED  ++ +  A+ K E ++   +
Sbjct: 183 LKKVLNPHLPYGATLIEHCLLDTGLSSNVKVDQISGPED--LEKVHTALRKAEGYMD--L 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +G+I+ Q +       P ++       +EF P L  Q  +  +++ ++F+  +D
Sbjct: 239 TQNFNGKGFII-QKREKKPSLEPDKASEDIFTNEEFHPFLFAQHANSTYIELDSFNKTVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EF+SK+E Q+ + +   +E  A  KL+ +  D E+R+ +L+   D      ELIE NL+ 
Sbjct: 298 EFFSKLEGQKIDIKALQQEKQALKKLDNVRKDHEHRLESLQYAQDADKAKGELIEMNLDI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N ++++L N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMMLKNPYVLS 417

Query: 475 ----------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
                                    +    +K  PV  V+VDL+LSA+ANA+++Y+ K+ 
Sbjct: 418 EEESEDEEDEKEEEPKGKKKKAKNKQPKKVQKNKPV-LVDVDLSLSAYANAKKYYDHKRH 476

Query: 513 QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
              K +KTI A  KAFK+AEKKT+  + + +TV+ I   RKV+WFEKF WFISSENYL+I
Sbjct: 477 AAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWFEKFLWFISSENYLII 536

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +GRD QQNE+IVKRY++ GDVYVHADLHGA+S VIKN   E PVPP TL +AG   VC+S
Sbjct: 537 AGRDQQQNELIVKRYLNPGDVYVHADLHGATSCVIKNPTGE-PVPPRTLTEAGTMAVCYS 595

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            AWD++++TSAWWV+ +QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGFG LF++DE
Sbjct: 596 AAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFGFLFKVDE 655

Query: 693 SSLGSHLNERRVRGEEEGMDDFE--------------DSGHHKENSDIESEKDDTDEKPV 738
           + +  H  ER+V+  +E M+                 D+     NS  + EK DT E+P 
Sbjct: 656 TCVWRHKGERKVKQLDEDMESVTSSNIELAAEENIPLDAPEEDSNSSEDDEKSDTQEQPF 715

Query: 739 A 739
           +
Sbjct: 716 S 716


>gi|301617501|ref|XP_002938173.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Xenopus (Silurana) tropicalis]
          Length = 951

 Score =  530 bits (1365), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 313/802 (39%), Positives = 447/802 (55%), Gaps = 111/802 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  L   L+G+R  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKSRFNTIDIRAVIAELSDSLLGLRVHNVYDVDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH+++RRL  ++QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKSRRLVSIKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT-TASKLHA 179
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +  +  E   +  KL  
Sbjct: 113 IVELYDRGNIVLTDHEYLILNILRFRTDEADDVKFAVREHYPIDHAKAPEPLLSVEKLKE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L  +                            QKG +                      
Sbjct: 173 ILEKA----------------------------QKGDQ---------------------- 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YG  L EH +LDTGL  N+K+ +++  ED  ++ +  A+ K E+++   +
Sbjct: 183 LKRVLNPHLPYGATLIEHCLLDTGLSSNVKVDQISGPED--LEKVHTALRKAEEYMD--V 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           +     +G+I+ Q +       P +        +EF P L  Q  +  +++ ++F+ A+D
Sbjct: 239 TQHFKGKGFII-QKREKKPSLEPDKPSEDIFTNEEFHPFLFAQHCNNTYIELDSFNKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EF+SK+E QR + +   +E  A  KL  +  D E R+ +L+   D      ELIE NL+ 
Sbjct: 298 EFFSKMEGQRIDLKALQQEKQALKKLENVRKDHEERLESLQHAQDADKAKGELIEMNLDI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W+++  +VKE +  G+ VA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWKEIGLIVKEAQIQGDSVALAIKELKLQTNHITMLLKNPYTLS 417

Query: 475 ----------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
                                    +    +K  PV  V+VDL+LSA+ANA+++Y+ K+ 
Sbjct: 418 EEGSEDEEEEKEEEPKGKKKKSKNKQPKKVQKNKPV-LVDVDLSLSAYANAKKYYDHKRH 476

Query: 513 QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
              K +KTI A  KAFK+AEKKT+  + + +TV+ I   RKV+WFEKF WFISSENYLVI
Sbjct: 477 AAKKSQKTIEAAEKAFKSAEKKTKQTLKEVQTVSTIQKARKVYWFEKFLWFISSENYLVI 536

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E PVPP TL +AG   VC+S
Sbjct: 537 AGRDQQQNELIVKRYLNPGDLYVHADLHGATSCVIKNPTGE-PVPPRTLTEAGTMAVCYS 595

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            AWD++++TSAWWV+ +QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGFG LF++DE
Sbjct: 596 AAWDARVITSAWWVHHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFGFLFKVDE 655

Query: 693 SSLGSHLNERRVRGEEEGMDDFE--------------DSGHHKENSDIESEKDDTDEK-- 736
             +  H  ERRV+  +E M+                 D+     NS  E EK DT E+  
Sbjct: 656 PCVWRHKGERRVKQLDEDMESVTSSNTELAAEENIPLDAAEEDSNSSEEDEKLDTQEEQR 715

Query: 737 -PVAESLSVPNSAHPAPSHTNA 757
            P  +S+ +    +  P+  N+
Sbjct: 716 GPCTDSMGLEQKEYMVPADQNS 737


>gi|348544245|ref|XP_003459592.1| PREDICTED: nuclear export mediator factor Nemf-like [Oreochromis
           niloticus]
          Length = 1074

 Score =  529 bits (1362), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 315/748 (42%), Positives = 429/748 (57%), Gaps = 107/748 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +    IGMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFTTVDIRAVIAEINANYIGMRVYNVYDIDNKTYLIRLQKPDS--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+H+T +   K   PSGF +K RKH++TRRL  ++QLG DRI+  QFG    A+++
Sbjct: 53  ESGTRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTQIKQLGIDRIVDIQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+L D E+T+L LLR    + + V I  R RYP E                
Sbjct: 113 IIELYDRGNIILADHEYTILNLLRFRTAEAEDVKIAVRERYPVE---------------- 156

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             S++ P   EP    E    +                  LSK  N             +
Sbjct: 157 --SARPP---EPLISLERLTEI------------------LSKAPNGEQ----------V 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKL-SEVNKLED-----NAIQVLVLAVAKFEDW 294
           K VL   L YG  L EH +++ GL  ++K+ S+V+  +       A+Q+    + K E++
Sbjct: 184 KRVLNPHLPYGATLIEHSLIEAGLSGSIKIDSQVDSAQVAPKILEALQIAETYMEKTENF 243

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETF 354
                SG    +GYI+ Q         P +       YDEF P L  Q     +++F+TF
Sbjct: 244 -----SG----KGYII-QKTEKKPSLTPGKPSEELLTYDEFHPFLFAQHAKSPYLEFDTF 293

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAEL 412
           D A+DEF+SK+ESQ+ + +   +E  A  KL  +  D E R+  L   QEVDR +K  EL
Sbjct: 294 DKAVDEFFSKMESQKIDLKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDR-IK-GEL 351

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           +E NL  VD A+  VR ALAN++ W ++  +VKE + AG+PVA  I +L L+ N +++LL
Sbjct: 352 VEMNLPVVDRALQVVRSALANQVDWTEIGVLVKEAQAAGDPVACAIKELKLQTNHITMLL 411

Query: 473 SNNLDEMDDE---------------------------EKTLPVEKVEVDLALSAHANARR 505
            N     +D+                           ++  P+  V+VDL LSA+ANA++
Sbjct: 412 KNPYISEEDQEEEEKKEIVETKGKKNKNKEKGQNKKLQRNKPM-LVDVDLGLSAYANAKK 470

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+  E K++KTI A  KA K+AEKKT+  + + +TV  I   RKV+WFEKF WFIS
Sbjct: 471 YYDSKRSAEKKEQKTIEAADKAMKSAEKKTQQTLKEVQTVTTIQKARKVYWFEKFLWFIS 530

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYLVI+GRD QQNEMIVKRY+  GD+YVHADLHGA+S VIKN     P+PP TL +AG
Sbjct: 531 SENYLVIAGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSG-NPIPPRTLTEAG 589

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              VC+S AWD+K+VTSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  LIMGFG
Sbjct: 590 TMAVCYSAAWDAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFG 649

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMDD 713
            LF++D+ S+  H  ER+VR  EE M++
Sbjct: 650 FLFKVDDQSVFRHQGERKVRTVEEDMEE 677


>gi|71679669|gb|AAI00005.1| Zgc:153813 protein [Danio rerio]
          Length = 881

 Score =  528 bits (1359), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 357/960 (37%), Positives = 514/960 (53%), Gaps = 128/960 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H T +   K   PSGF +K R H+++RRL  VRQLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRMHLKSRRLVHVRQLGVDRIVDLQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTA-SKLHA 179
           ILELY +GNI+LTD +F +L LLR    + + V I  R RYP E  R  E   +  +L  
Sbjct: 113 ILELYDRGNIILTDHQFMILNLLRFRTAEAEDVKIAVRERYPVENARAEEPIISLQRLTQ 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L+                            G Q G +                      
Sbjct: 173 VLS----------------------------GAQTGDQ---------------------- 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK +L   L YG  L EH +   G+    K+     L   +++VL  A+   E+++Q   
Sbjct: 183 LKRILNPHLPYGGPLIEHCLASVGMSGLYKVDSQTDLTQVSLKVLE-ALQMAEEYMQK-- 239

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +G+I+ +++      P   +G + +    Y+EF P L  Q     +V+FE+F+ 
Sbjct: 240 TANFSGQGFIIQKSEQ----KPNVCAGDAAEELLTYEEFHPFLFCQHVKSRYVEFESFNK 295

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEF+S++ESQ+ + +   +E  A  KL  +  D + R+  L Q  +      EL+E N
Sbjct: 296 AVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELVELN 355

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L  V  A+  VR ALAN++ W ++ RMV E + AG+PVA  I +L L+ N ++LLL N  
Sbjct: 356 LPVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPE 415

Query: 475 -----NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
                   E+   +K+   EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  K
Sbjct: 416 ACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQK 475

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           AFK+AEKKT+  +   +TV +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKR
Sbjct: 476 AFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKR 535

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           Y+  GD+YVHADLHGA+S VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV
Sbjct: 536 YLRAGDLYVHADLHGATSCVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWV 594

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR- 705
              QVSKTAP+GEYLT GSFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++ 
Sbjct: 595 QHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMKT 654

Query: 706 -----------------GEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSA 748
                            GE E +   EDSG+ +EN+D  +  DD +E+ V +S       
Sbjct: 655 LEEEEEEEDTTSTAEILGEGEEL-LAEDSGNEEENTDSRT-ADDDEEQQVCKSDEDDEED 712

Query: 749 HPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSA 808
                     + D  E       + +  DS+       ++ P         D  + L   
Sbjct: 713 QRVCREDEDEDEDEDEDALSAADVEDAADSEEEHPGAQISFP---------DTCISLSHL 763

Query: 809 SISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREK 868
            I+ T H  +TT     +E + V     V  K +++  +RR +KK Q       K E  +
Sbjct: 764 QINRTAH-TDTTD---PQESQQVNTDTQV--KKHLTAKQRRDMKKKQ-------KQENTE 810

Query: 869 ERGKDASSQPESIVRK-TKIEGGK----ISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
           +  +  + QPE+  R  T   GG     + RGQ+ KLKKMK+KY DQDEE+R + M +L 
Sbjct: 811 DLEEGDAKQPETASRTPTSKSGGAAAAPLKRGQRNKLKKMKDKYKDQDEEDREMMMKILG 870


>gi|194742419|ref|XP_001953700.1| GF17891 [Drosophila ananassae]
 gi|190626737|gb|EDV42261.1| GF17891 [Drosophila ananassae]
          Length = 999

 Score =  527 bits (1358), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 327/803 (40%), Positives = 453/803 (56%), Gaps = 117/803 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F+L  +  V      EKV LL+E
Sbjct: 1   MKTRFNTYDIICGVAELQKLVGWRVNQIYDVDNKTYLFRLQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE ++QLG DRI+ FQFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNVAPSGFSMKLRKHLKNKRLEKIQQLGADRIVDFQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTDSE T L +LR H + +  +    R +YP E  +              
Sbjct: 115 LELYDRGNLILTDSELTTLYILRPHTEGEH-LRFAMREKYPVERAK-------------- 159

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                                   S E L  +   +  + +KN +             L+
Sbjct: 160 -----------------------QSSEGLKAEALEQLLENAKNGD------------NLR 184

Query: 242 TVLGEALGYGPALSEHIILDTGL----VPNMKLSE----------------------VNK 275
            +L   L  GP++ EH++L+ GL    +   K SE                        K
Sbjct: 185 QILMPNLDCGPSVIEHVLLEQGLENRIIEKEKSSEDAQESEEKPEKGGKKQKKGRNQQTK 244

Query: 276 LED------NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSST 329
           +E       N + +L  AV   ED L +  SG    +GYI+       K+  PTE+G   
Sbjct: 245 VEQKPFDVANDLPLLQQAVKSAEDLLTEGASGKT--KGYIVQV-----KEEKPTENGKVE 297

Query: 330 QIYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             +   EF P    QF+  E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ 
Sbjct: 298 FFFRNIEFHPYQFVQFKDFECATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSN 357

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
           +  D   R+  L +  D   K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358 VKNDHAKRLEELTKVQDEDRKKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 417

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
           +  G+ VA  I +L LE N +SL+LS+   E +DE+   P V  V+VDLALSA ANARR+
Sbjct: 418 QANGDAVASSIKQLKLETNHISLILSDPYGENEDEDLDTPEVTVVDVDLALSAWANARRY 477

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y+LK+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFISS
Sbjct: 478 YDLKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISS 537

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG 
Sbjct: 538 ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIRNPTGEE-IPPKTLLEAGS 596

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   LIMG  L
Sbjct: 597 MAISYSVAWDAKVVTNSYWVTSEQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLIMGLSL 656

Query: 687 LFRLDESSLGSHLNERRVRG-EEEGMD-DFEDSGHHKENSDIESE-KDDTDEKPVAESLS 743
           LF+L++S +  HL ER+VR  ++E  D DF++S      +D+ SE  DD++  PVA    
Sbjct: 657 LFKLEDSFIARHLGERKVRSIDDEPTDQDFKESDVA---NDLLSEPSDDSEATPVA---- 709

Query: 744 VPNSAHPAPSHTNASNVDSHEFP 766
             N + P      +SN D   FP
Sbjct: 710 --NMSEP------SSNTDITAFP 724


>gi|328781799|ref|XP_395865.4| PREDICTED: serologically defined colon cancer antigen 1 homolog
           isoform 1 [Apis mellifera]
          Length = 970

 Score =  527 bits (1357), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 296/720 (41%), Positives = 430/720 (59%), Gaps = 78/720 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+   +  L++LIGMR + VYD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNSYDITCTINELQKLIGMRVNQVYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A+++I
Sbjct: 53  SGNRIHTTVFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRMIDLQFGSGEAAYHII 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E T+L +LR H + DK +    + +YP +           + H  +
Sbjct: 113 LELYDRGNIVLTDYEMTILNILRPHTEGDK-IRFAVKEKYPMD-----------RAHQNI 160

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
               E              N+    +++L   K G+S                     LK
Sbjct: 161 MPPIE--------------NI----QQHLQNAKIGES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +      
Sbjct: 182 KILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSARQN 240

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
             + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + KF +FD A+D
Sbjct: 241 --ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKKFASFDVAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
           E++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI  N 
Sbjct: 294 EYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELISRNQ 351

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
             VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352 SLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           + D+E +  P+  +++DLA +A  NAR++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412 DSDEESELKPM-LIDIDLAHTAFGNARKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471 TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKTGDIYVHA 530

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           DL GASS +IKN      VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531 DLTGASSVIIKNPGGST-VPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR---GEEEGMDDF 714
           GEYLT GSFMIRGKKN+LPP  L+MG G LFRL+ESS+  H +ER+VR    E E M+ F
Sbjct: 590 GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFRLEESSIERHKDERKVRIIDDENEHMESF 649



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 21/31 (67%), Positives = 26/31 (83%)

Query: 892 ISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           + RGQKG+LKKMKEKY DQDEE+R + M +L
Sbjct: 786 LKRGQKGRLKKMKEKYKDQDEEDRKLSMQVL 816


>gi|307209071|gb|EFN86238.1| Serologically defined colon cancer antigen 1-like protein
           [Harpegnathos saltator]
          Length = 989

 Score =  526 bits (1356), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 358/942 (38%), Positives = 515/942 (54%), Gaps = 129/942 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L+RLIGMR + +YD+  +TY+ +   S         EK +LL+E
Sbjct: 1   MKTRFNTYDLVCSVTELQRLIGMRVNQIYDIDHRTYLIRFQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A+++I
Sbjct: 53  SGNRIHTTGFEWPKNIAPSGFSMKMRKHLKNKRLESLMQVGIDRIIDLQFGSGEAAYHII 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E  +L +LR H + DK +    R +YP                   
Sbjct: 113 LELYDRGNIILTDHEMVILYILRPHTEGDK-IRFAVREKYPL------------------ 153

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                      D+ + +     +   E+L   K G+S                     LK
Sbjct: 154 -----------DRAHNEAMPPIDEIHEHLQKAKTGES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            VL   L +G A+ +H++L        K+S + N  ED  +  L+LA+    + + +   
Sbjct: 182 KVLNPILEFGSAVIDHVLLKATFALGCKISKDFNITED--MPKLILALEDANNIMDNAKK 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAAL 358
                +GYI+ +     K+  PT+ G    I+   EF PLL  Q++ + + +F++FDA +
Sbjct: 240 S--ASKGYIIQK-----KEARPTQDGKEEFIFANIEFHPLLFEQYKDQPYKEFDSFDATV 292

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYN 416
           DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QEVD+  + AELI  N
Sbjct: 293 DEYFSTMEGQKLDLKALQQEREALKKLENVRKDHDQRLITLEKTQEVDK--QKAELISRN 350

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
              VD AILA++ ALAN+MSW D+  ++KE +   +PVA  I +L LE N +SLLL +  
Sbjct: 351 QTLVDNAILAIQSALANQMSWPDIQVLLKEAQARSDPVASAIKQLKLETNHISLLLHDPY 410

Query: 477 DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           +E D+E +  P+  ++VDLA +A  NAR++Y  K+    KQ+KTI +H KA K+AEKKT+
Sbjct: 411 EESDEESELKPM-IIDVDLAHTAFGNARKYYSQKRSAAKKQQKTIESHGKALKSAEKKTK 469

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
             + + +T+ +I  +RKV+WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YVH
Sbjct: 470 QTLKEVQTIHSIIKLRKVYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLRAGDLYVH 529

Query: 597 ADLHGASSTVIKNHRPEQP----VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           ADL GASS VIKN     P    VPP +L +AG   + +S AWD+K+V +AWWV+  QVS
Sbjct: 530 ADLTGASSVVIKN-----PTGGFVPPKSLAEAGTMAIAYSVAWDAKVVANAWWVHHDQVS 584

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           K+APTGEYLT GSFMIRGKKN+LPP  LIMG G++FRL+E+S+  H +ER+V+       
Sbjct: 585 KSAPTGEYLTTGSFMIRGKKNYLPPSQLIMGLGIMFRLEENSIERHKDERKVKA------ 638

Query: 713 DFEDSGHHKENSD--IESEKD-----DTDEKPVAESLSVPNSAHP----APSHTNASNVD 761
                G   EN D  IE +K+     D+DE    E  +  N  H      P      N D
Sbjct: 639 ----VGEESENVDSVIEDDKEIELEGDSDEDENLEDKNALNPIHEEDHLEPESCATDNKD 694

Query: 762 SHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQ 821
           +      +K   N  + +  +       P T    DL          S    K  ++  Q
Sbjct: 695 A------NKDEGNDEEEEEEEDDTKCQFPDTQIKLDL----------SGPKVKLHVDNNQ 738

Query: 822 FDLSEEDKHVERTATV-RDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPES 880
             ++ +    E    +  DKP I     ++ K+ +    V PK  +EK    D +     
Sbjct: 739 PLIATQKDAEENVVYLGDDKPVIVNLPIKE-KRAKTKQKVQPKEPKEKIEKSDKTE---- 793

Query: 881 IVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
            +   KIE   + RGQKGKLKKMKEKY DQDEE+R + M +L
Sbjct: 794 -IDNKKIEQPVLKRGQKGKLKKMKEKYKDQDEEDRRLSMLVL 834


>gi|410898599|ref|XP_003962785.1| PREDICTED: nuclear export mediator factor Nemf-like [Takifugu
           rubripes]
          Length = 1029

 Score =  526 bits (1356), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 306/740 (41%), Positives = 427/740 (57%), Gaps = 105/740 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +    +GMR +NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFTTVDIKAVIAEINSNYMGMRVNNVYDIDTKTYLIRLQKPDS--------KAILLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+H+T +   K   PSGF +K RKH++TRRL  V+QLG DRI+  QFG    A+++
Sbjct: 53  ESGTRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTQVKQLGNDRIVDIQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GN++L D E+T+L LLR    +   V I  R RYP E  R  E   + +    
Sbjct: 113 IVELYDRGNVILADHEYTILNLLRFRTAEVDDVKIAVRERYPVESARPPEPLISLQRLTE 172

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L S+                            Q+G +                      +
Sbjct: 173 LLSA---------------------------AQQGDQ----------------------I 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKL---SEVNKLEDNAIQVLVLA---VAKFEDW 294
           K VL   L YG  L EH +++ GL  + K+   + V ++    ++ L +A   +AK E++
Sbjct: 184 KRVLNPHLSYGATLIEHSLIEVGLPGSAKVDSQASVAQVASKILEALTVAEAYMAKTENF 243

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKF 351
                      +GYI+ +++      P    G  ++    YDEF P L  Q     +++F
Sbjct: 244 ---------TGKGYIIQKSEK----KPSVTPGKPSEELLTYDEFHPFLFAQHSKSPYLEF 290

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKM 409
           ++FD A+DEF+SK+ESQ+ + +    E  A  KL  +  D E R+  L   QE+DR +K 
Sbjct: 291 DSFDKAVDEFFSKMESQKIDMKALQLEKHAMKKLENVKKDHEQRLEALHQAQEIDR-IK- 348

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
            ELIE NL  V+ A+  V  ALAN++ W ++  +VKE + AG+PVA  I +L L+ N ++
Sbjct: 349 GELIEMNLAIVERALQVVCSALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHIT 408

Query: 470 LLLSNNLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYEL 509
           LLL N     DDE++   VE+                    V+VDL+LSA+ANA+++Y+ 
Sbjct: 409 LLLKNPYVSEDDEQEDDVVEETGRKNKNKKSKKFQKNKPMLVDVDLSLSAYANAKKYYDN 468

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K+  + K+ KTI A  KA K+AEKKT+  + + +TV  I   RKV+WFEKF WFIS+ENY
Sbjct: 469 KRSAKRKEFKTIEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFISAENY 528

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LVI+GRD QQNEMIVKRY+  GD+YVHADLHGA+S VIKN   + PVPP TL +AG   V
Sbjct: 529 LVIAGRDQQQNEMIVKRYLRAGDIYVHADLHGATSCVIKNPSGD-PVPPRTLTEAGTMAV 587

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C+S AW++K+VTSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKN+LPP  LIMGFG LF+
Sbjct: 588 CYSAAWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFGFLFK 647

Query: 690 LDESSLGSHLNERRVRGEEE 709
           +DE S+  H  ER+V+  EE
Sbjct: 648 VDEHSVFRHRGERKVKTVEE 667


>gi|345306303|ref|XP_001515044.2| PREDICTED: nuclear export mediator factor NEMF [Ornithorhynchus
           anatinus]
          Length = 1076

 Score =  524 bits (1349), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 298/727 (40%), Positives = 427/727 (58%), Gaps = 98/727 (13%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN 77
           L  L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K  
Sbjct: 20  LNSLLGMRVNNVYDVDNKTYLIRLQKPDV--------KATLLLESGIRIHTTEFEWPKNM 71

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEF 137
            PS F +K RKH+++RRL  V+QLG DRI+ FQFG    A+++I+ELY +GNI+LTD E+
Sbjct: 72  MPSSFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEY 131

Query: 138 TVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNE 197
            +L +LR   D+   V    R RYP ++ +                     A EP     
Sbjct: 132 LILNILRFRTDEADDVKFAVRERYPVDLAK---------------------APEP----- 165

Query: 198 DGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEH 257
                                F L + +   SN  A   +P LK VL   L YG  L EH
Sbjct: 166 --------------------LFTLERLTEIISN--APKGEP-LKRVLNPHLPYGATLIEH 202

Query: 258 IILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLG 317
            ++++G   N+K+    +++D  I+ +++ + K E++++  I+ +   +GYI+ Q +   
Sbjct: 203 CLIESGFPGNVKVDPQFEIKD--IEKVLVCLQKAEEYMK--ITTNFSGKGYII-QKREKK 257

Query: 318 KDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAK 377
               P +       Y+EF P L +Q     +V+FE+FD A+DEFYSK+E Q+ + +   +
Sbjct: 258 PSLEPDKPAEDILTYEEFHPFLFSQHSKYPYVEFESFDKAVDEFYSKLEGQKIDLKALQQ 317

Query: 378 EDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRM 435
           E  A  KL  +  D E+R+  L   QE+D+ VK  ELIE NL+ VD AI  VR ALAN++
Sbjct: 318 EKQALKKLENVRKDHEHRLEALHQAQEIDK-VK-GELIEMNLQIVDRAIQVVRSALANQI 375

Query: 436 SWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--------------------- 474
            W ++  +VKE +  G+PVA  I +L L+ N +++LL N                     
Sbjct: 376 DWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLKNPYVMSEEEDDDGEDIEKEETE 435

Query: 475 ---------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
                       ++   +K  P+  V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  
Sbjct: 436 EPKGKKKKQKDKQLKKPQKNKPL-VVDVDLSLSAYANAKKYYDHKRHAARKTQKTVEAAE 494

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSENYL+I GRD QQNEMIVK
Sbjct: 495 KAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIVK 554

Query: 586 RYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWW 645
           RY++ GD+YVHADLHGA+S VIKN   E  +PP TL +AG   +C+S AWD++++TSAWW
Sbjct: 555 RYLNSGDIYVHADLHGATSCVIKNPTGEA-IPPRTLTEAGTMALCYSAAWDARVITSAWW 613

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           V+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF+++E+ +  H  ER+V+
Sbjct: 614 VHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVEETCVWRHRGERKVK 673

Query: 706 GEEEGMD 712
            ++E M+
Sbjct: 674 VQDEDME 680


>gi|383852746|ref|XP_003701886.1| PREDICTED: nuclear export mediator factor NEMF homolog [Megachile
           rotundata]
          Length = 970

 Score =  524 bits (1349), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 298/711 (41%), Positives = 425/711 (59%), Gaps = 81/711 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+   +  L++LIGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNSYDIVCTITELQKLIGMRVNQIYDIDHRTYLIRLQRSE--------EKSVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A++VI
Sbjct: 53  SGNRIHTTVFEWPKNVAPSGFSMKMRKHLKNKRLESLTQVGVDRIIDLQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E T+L +LR H + DK +    + +YP +  R  + T         
Sbjct: 113 LELYDRGNIVLTDHEMTILNILRPHTEGDK-IRFAVKQKYPMD--RAHQNTMPP------ 163

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                        + E  N++ NA        K G+S                     LK
Sbjct: 164 -------------IEEIQNHLQNA--------KAGES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLE--DNAIQVLVLAVAKFEDWLQDV 298
            +L   L +G A+ +H++L  G     K+  + N +E   N I  L  A    E   ++V
Sbjct: 182 KILNPLLEFGSAVIDHVLLKHGFSLGCKIGKDFNIVEHMPNLISALQCADEMMETAKKNV 241

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                  +GYI+ +     K+  P   G+   IY   EF P L  Q++   F +F++FDA
Sbjct: 242 ------SKGYIIQK-----KEVKPVVDGTEEFIYTNIEFHPYLFEQYKDYPFKEFDSFDA 290

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           ++DE++S +E Q+ + +   +E  A  KL+ +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 291 SVDEYFSTMEGQKLDMKVLQQEREALKKLDNVKKDHDQRLITLEKTQELDK--QKAELIS 348

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L L+ N +SLLL +
Sbjct: 349 RNQMLVDNAILAIQSALANQMAWPDIKILLKEAESRGDPVASAIKQLKLDTNHISLLLHD 408

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK 534
             +E D+E +  P+  +++DLA +A  NAR++Y  K+    KQ+KTI +  KA K+AEKK
Sbjct: 409 PYEESDEESELKPM-LIDIDLAHTAFGNARKYYNQKRSAAKKQQKTIESQDKALKSAEKK 467

Query: 535 TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
           T+  + + + + +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+Y
Sbjct: 468 TKQTLKEVQAIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKSGDIY 527

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
           VHADL GASS VIKN     PVPP TL +AG   V +S AWD+K+V  AWWV   QVSKT
Sbjct: 528 VHADLTGASSVVIKNPGG-GPVPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKT 586

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           APTGEYLT GSFMIRGKKN+L P  L+MG G LFRL+ESS+  H +ERR+R
Sbjct: 587 APTGEYLTTGSFMIRGKKNYLSPCQLVMGLGFLFRLEESSIERHKDERRIR 637



 Score = 47.4 bits (111), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 21/31 (67%), Positives = 26/31 (83%)

Query: 892 ISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           + RGQKG+LKKMKEKY DQDEE+R + M +L
Sbjct: 786 LKRGQKGRLKKMKEKYKDQDEEDRRLFMQVL 816


>gi|345495372|ref|XP_001603770.2| PREDICTED: nuclear export mediator factor NEMF homolog [Nasonia
           vitripennis]
          Length = 972

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 289/711 (40%), Positives = 426/711 (59%), Gaps = 72/711 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+GMR + +YD+  +TY+ +   S         EK +LL+E
Sbjct: 1   MKNRFNTYDLVCSVTELQKLVGMRVNQIYDIDHRTYLIRFQRSE--------EKSILLIE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DR++  QFG    A++++
Sbjct: 53  SGNRIHTTEFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRVVDLQFGSNEAAYHIV 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTDSE T+L +LR H + DK + +  + RYP           A + H  +
Sbjct: 113 LELYDRGNIVLTDSEMTILNILRPHTEGDK-IRLAVKERYP-----------AFRAHTKV 160

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             ++E                              +  D+ KN+ +           +LK
Sbjct: 161 IPTRE------------------------------ELQDIIKNAKQGE---------SLK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            +L   L  G A+ +H++L+ G     K+  E +  +D  +  L  A+   E  L +   
Sbjct: 182 KILNPHLEVGAAVIDHVLLEVGFQLGCKIGKEFDVAKD--VDKLYSALENAEKMLNNAKK 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAAL 358
              V +GYI+ +     K+  P + G    +Y   EF P L  Q +++ + ++ETFD A+
Sbjct: 240 D--VSKGYIIQK-----KEEKPIKDGEEEFMYANIEFHPFLFEQCKNQHYKEYETFDKAV 292

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DE++S +E Q+ + +   +E  A  KL+ +  D + R+ TL +  +   + AELI  N E
Sbjct: 293 DEYFSTMEGQKLDLKVLQQERDALKKLDNVKKDHDQRLVTLGKTQEADKQKAELITRNQE 352

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            VD AILA++ ALAN+MSW+D+  ++KE +  G+PVA  I  L LE N +++LLS+  ++
Sbjct: 353 LVDNAILAMQSALANQMSWQDIQTLLKEAQAKGDPVASAIKHLKLESNHITMLLSDPYED 412

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
            DD+E  L    V++DLA SA +NA R+Y+ K+    KQ+KTI +  KA K+AE+KT+  
Sbjct: 413 SDDDEPELKPMTVDIDLAHSAFSNATRYYDQKRSAAKKQQKTIESQGKALKSAERKTKQT 472

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           + + + + +I+  RKV+WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GDVYVHAD
Sbjct: 473 LKEVQAIHSINKARKVYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLRSGDVYVHAD 532

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           L GASS V+KN     PVPP +L +AG   V +S AW++K++  ++WV   QVSKTAPTG
Sbjct: 533 LTGASSVVVKNPNG-GPVPPKSLAEAGTMAVAYSIAWEAKVIAGSYWVNSDQVSKTAPTG 591

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           EYLT GSFMIRGKKN+LPP  LIMG G LFRL++SS+  H +ERRVR  EE
Sbjct: 592 EYLTTGSFMIRGKKNYLPPCQLIMGLGFLFRLEDSSIERHKDERRVRTLEE 642


>gi|391330989|ref|XP_003739933.1| PREDICTED: nuclear export mediator factor NEMF homolog [Metaseiulus
           occidentalis]
          Length = 956

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 302/750 (40%), Positives = 432/750 (57%), Gaps = 79/750 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K +  +AD+ A V  L+ L+GMR   VYD+  KTY+FKL+         + EK +L+ E
Sbjct: 1   MKAKFTSADIVAMVGELKALVGMRVKQVYDVDSKTYLFKLVR--------QEEKAVLIFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+R+HTT Y   K   PSGF+ KLRKH++ +RL  + QLG DRI+  QFG+   A++VI
Sbjct: 53  SGIRIHTTEYDWPKGMAPSGFSSKLRKHLKNKRLATISQLGVDRIVDLQFGINEAANHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GN++LTD+ F +L +LR  +   + V    R +YP              +  A+
Sbjct: 113 VELYDRGNVVLTDNNFIILNILRPRQAGSEDVRFAVREKYP--------------IAGAI 158

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
               EP         +D      A+KE                              T+K
Sbjct: 159 QEVPEPS-------QQDVIEWLTAAKET----------------------------DTVK 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            ++   + +GPA+ EH++L   +  N KL +   L  +  + +  ++ +   +L+ +   
Sbjct: 184 KIIVPKVFFGPAVLEHVLLSREISANTKLRKA-VLTPDFFKSIHSSIVEGNAFLEKLKQP 242

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGS-STQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           D+   G I ++ +   K     E GS     Y+EF P L  Q        F TF  A+D 
Sbjct: 243 DL-STGIISLKVEPRVK---AAEDGSMEIASYNEFHPFLFKQLEGSRVEHFATFGQAVDA 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+S  E Q+ + +    E  A  KL  + +D E R++ L+      ++ A LIE NLE V
Sbjct: 299 FFSMQEQQKIDLRAHNLEKEAVKKLENVKLDHEKRLNALEGTQRTDLEKAMLIENNLELV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           + A+ AVR  +A++ SW+++  M+KE +  G+PVA  I  L+L+RN   +LLSN+     
Sbjct: 359 EKALYAVRSFVASQYSWDEIGHMIKEAQHMGDPVACTIKALHLDRNQFGMLLSNSF---- 414

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
             E  L    V++D+ LSA+ANARR++++KK    KQ+KTI + +KA K+A+KKT+  + 
Sbjct: 415 --ENDLSPSVVDIDIDLSAYANARRYFDMKKHAARKQQKTIESSAKALKSAQKKTKEILK 472

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           Q +   NI+  RK +WFEKF WFISSENYLVI GRDAQQNE+IVK+YM+KGD+YVHADLH
Sbjct: 473 QVELTTNIARTRKSYWFEKFFWFISSENYLVIGGRDAQQNEVIVKKYMTKGDIYVHADLH 532

Query: 601 GASSTVIKN----HR----PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           GASS VIKN    HR        +PP TLN+AG   +C+S AW++K+VTSAWWV+ HQV+
Sbjct: 533 GASSVVIKNPSVTHRFLSVSGGEIPPKTLNEAGTMAICYSAAWEAKVVTSAWWVHHHQVT 592

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KTAP+GEYLT GSFMIRGKKN+LPP  LIMGFG +FRLDE S+ +H N+R+V   +E   
Sbjct: 593 KTAPSGEYLTAGSFMIRGKKNYLPPLYLIMGFGFMFRLDEESVPAHQNDRKVWTADE-TT 651

Query: 713 DFEDSGHHKENSDIESEKD-DTDEKPVAES 741
             ED+    E  D ++E D  T E    ES
Sbjct: 652 AVEDNAIEPEGVDEQNEIDVSTSEDEAGES 681



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 42/61 (68%), Gaps = 1/61 (1%)

Query: 863 KVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           +V+R+K +G+   + P +  + ++ E  +  +  K K++K+K++YGDQD+EER +RM +L
Sbjct: 725 QVDRKKVKGQKKGAPPPA-AKASEGEQKQPKKLSKAKMRKIKQRYGDQDDEERELRMKIL 783

Query: 923 A 923
           A
Sbjct: 784 A 784


>gi|159155700|gb|AAI54741.1| Zgc:153813 protein [Danio rerio]
          Length = 883

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 357/959 (37%), Positives = 513/959 (53%), Gaps = 132/959 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H T +   K   PSGF +K RKH+++RRL  VRQLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRKHLKSRRLVHVRQLGVDRIVDLQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTA-SKLHA 179
           ILELY +GNI+LTD +F +L LLR    + + V I  R RYP E  R  E   +  +L  
Sbjct: 113 ILELYDRGNIILTDHQFMILNLLRFRTAEAEDVKIAVRERYPVENARAEEPIISLQRLTQ 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L+                            G Q G +                      
Sbjct: 173 VLS----------------------------GAQTGDQ---------------------- 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK +L   L YG  L EH +   G+    K+     L   +++VL  A+   ED++Q   
Sbjct: 183 LKRILNPHLPYGGPLIEHCLASVGMSGLYKVDSQTDLTQVSLKVLE-ALQMAEDYMQK-- 239

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +G+I+ +++      P   +G + +    Y+EF P L  Q     +V+FE+F+ 
Sbjct: 240 TANFSGQGFIIQKSEQ----KPNVCAGDAAEELLTYEEFHPFLFCQHVKSRYVEFESFNK 295

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEF+S++ESQ+ + +   +E  A  KL  +  D + R+  L Q  +      EL+E N
Sbjct: 296 AVDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELVELN 355

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L  V  A+  VR ALAN++ W ++ +MV E + AG+PVA  I +L L+ N ++LLL N  
Sbjct: 356 LPVVQRALQVVRSALANQVDWVEIGQMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPE 415

Query: 475 -----NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
                   E+   +K+   EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  K
Sbjct: 416 ACPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQK 475

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           AFK+AEKKT+  +   +TV +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKR
Sbjct: 476 AFKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKR 535

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           Y+  GD+YVHADLHGA+S VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV
Sbjct: 536 YLRAGDLYVHADLHGATSCVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWV 594

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR- 705
              QVSKTAP+GEYLT GSFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++ 
Sbjct: 595 QHDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMKT 654

Query: 706 ----------------GEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAH 749
                            EE      EDSG+ +E++D  +  DD +++       V  S  
Sbjct: 655 LEEEEEEEDTTSTAEILEEGEELLAEDSGNEEEDTDSRTADDDEEQQ-------VCKSDE 707

Query: 750 PAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSAS 809
                      D  E   ED+   +  DS+       ++ P         D  + L    
Sbjct: 708 DDEKDQRVCREDEDEDEDEDEDAVSAADSEEEHPGAQISFP---------DTCISLSHLQ 758

Query: 810 ISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKE 869
           I+ T H  +TT     +E + V     V  K +++  +RR +KK Q       K E  ++
Sbjct: 759 INRTAH-TDTTD---PQESQQVNTDTQV--KKHLTAKQRRDMKKKQ-------KQENTED 805

Query: 870 RGKDASSQPESIVRK-TKIEGGK----ISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
             +  + QPE+  R  T   GG     + RGQ+ KLKKMK+KY DQDEE+R + M +L 
Sbjct: 806 LEEGDAKQPETASRTPTSKSGGAAAAPLKRGQRNKLKKMKDKYKDQDEEDREMMMKILG 864


>gi|452822547|gb|EME29565.1| RNA-binding protein [Galdieria sulphuraria]
          Length = 1067

 Score =  521 bits (1341), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 347/1005 (34%), Positives = 518/1005 (51%), Gaps = 184/1005 (18%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLM-----------NSSGVT 48
           M + R +  D+ AEVK LRR  IG R  N+YD++P TY+ K+              S V 
Sbjct: 1   MPRNRFSLLDLQAEVKYLRRRFIGARVVNIYDVTPTTYLLKISVPSRNQISVEETISVVE 60

Query: 49  ESGES--EKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI 106
           ES  S  EK  +L+ESG+R+H T + RDK N PSGF++KLRKHIR+R+++++R LG DR+
Sbjct: 61  ESSNSNWEKTFVLIESGIRIHETRFYRDKANIPSGFSVKLRKHIRSRKIQEIRTLGADRV 120

Query: 107 I-------LFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRD----DDKGVAI 155
           +       +F+         +I+E Y+ GNI+LTD E+T+L+ LRS++       + V I
Sbjct: 121 VELVFSSRVFEGSTIERPCRLIVEFYSSGNIVLTDEEYTILSALRSYKGPFGVTKEPVHI 180

Query: 156 MSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKG 215
            +R++YP  + R                                +N+S +    L   K 
Sbjct: 181 FTRNKYPVHLLR--------------------------------SNISLSKNSVLALLKN 208

Query: 216 GKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN- 274
           G   D+ +N                   L   L  GP + EH ++ +G  P  K+ E+  
Sbjct: 209 GSQTDIVRN------------------FLSTRLYCGPQVIEHALVASGFEPKTKIKELFL 250

Query: 275 KLEDNAIQV------LVLAVAKFEDWLQD---VISGDIVPEGYILMQNKHLGKDHPPTE- 324
             EDN   V       + ++  FE  L D         +  GY+  +     KD   T+ 
Sbjct: 251 NAEDNEEGVSHKTLSFLQSLESFESSLCDNDSTCESLSLERGYLFYR-----KDAHTTDV 305

Query: 325 --SGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
             S S   +Y++F P LL    +   ++F TF+ A+D +++ +E +RA+     +E    
Sbjct: 306 SMSNSERLLYEDFSPFLLCHLSNTSHIEFPTFNEAVDIYFANLEKERAQIVASKQESVVS 365

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K++ +  D E R+  L++  + + K+AE IE N ++VD AI  VR  +AN ++W++L +
Sbjct: 366 KKVDSLRKDLERRIDELERAKEENFKIAEAIELNADEVDKAIWVVRAMIANGVAWDELDK 425

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLL------------------SNNLDEMDDEEK 484
           M++EE++ GNPVA  I  L+L+RN ++L+L                  S ++   DD ++
Sbjct: 426 MLEEEKEKGNPVAETIHSLHLDRNEITLMLPIDPILEDEFVNENFQYQSEDITYYDDTDE 485

Query: 485 T----------------LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
           T                 PV   +VDL+LSA ANA R++E +K+ + K+EKT+ A  +A 
Sbjct: 486 TEEHFQTERMVAELNASKPVVLADVDLSLSAFANAARYFESRKRAQEKKEKTMEATKRAL 545

Query: 529 KAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             AEKK   Q+ +      K    I  +RK  WFEKF+WFISSEN+LVI+G+DAQQNE +
Sbjct: 546 NVAEKKASKQMERSQQRSLKPAVAIREIRKPAWFEKFDWFISSENFLVIAGKDAQQNEQV 605

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
           VKRYM   DVYVHAD+HGASS V+KN   ++PVP  TL +AG F +CHS AW SK+V+SA
Sbjct: 606 VKRYMKTFDVYVHADIHGASSVVVKNRFRDKPVPLQTLIEAGAFAMCHSSAWSSKIVSSA 665

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERR 703
           WWV+  QVSKTAP+GEYLT GSFMIRGKKN+LPP  L+MG+G+LF++D S    H NER+
Sbjct: 666 WWVHASQVSKTAPSGEYLTTGSFMIRGKKNYLPPSQLVMGYGILFKMDPSCTRDHENERQ 725

Query: 704 VRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSH 763
            R   E ++     GH K N D    + D D        + P SA      T  ++   H
Sbjct: 726 RRPLNEAVE-----GHLKTNEDCAENEPDFDNLE-----TFPTSA------TGNADQFYH 769

Query: 764 EFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFD 823
           E   ++  +++  D     +  N+             + L L S  + +TK   E  QF 
Sbjct: 770 ENNLQEADVAHLFDKYHESLPDNL-------------KTLQLDSTGMLATKED-ELDQFR 815

Query: 824 LSEEDKHV---ERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPES 880
            SEE+  +    RT   RD                 S+ V    + + E  K+  + P  
Sbjct: 816 -SEENLELIKYSRTKKARDH----------------STQVGHTKQAQPETFKEKKTSPVD 858

Query: 881 IVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           ++    +   K+ RG++ K+K+ K+KY +Q  EERN+ MALL  S
Sbjct: 859 LIENVDV--SKLPRGKRSKMKRAKKKYAEQTLEERNLAMALLGSS 901


>gi|321467512|gb|EFX78502.1| hypothetical protein DAPPUDRAFT_305191 [Daphnia pulex]
          Length = 997

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 290/715 (40%), Positives = 426/715 (59%), Gaps = 81/715 (11%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  + D+ A +  +  +LIGMR + +YD+  KTY+ +L  S         EK +LL 
Sbjct: 1   MKARFTSIDIVAAIAEINLKLIGMRVNQIYDVDHKTYLIRLHRSE--------EKAMLLF 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF++KLRKH+  +RLE   Q+G DRII  QFG G  A++V
Sbjct: 53  ESGIRIHTTDFQWPKNPAPSGFSMKLRKHLNNKRLEMASQVGQDRIINLQFGTGEAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHA 179
           I+ELY +GNI+L D E+ +L +LR  R + + V  + + +YP E   V +  T ++ L  
Sbjct: 113 IIELYDRGNIVLCDFEYVILNILRP-RTEGEDVRFLVKEKYPLEGTSVEDCITNTEVLEN 171

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L+S+K             G+N                                      
Sbjct: 172 WLSSAKT------------GDN-------------------------------------- 181

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKL-SEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           LK +L     YGPAL EH++L+ G  PN ++ ++ +   D  +  L LA+   +  +Q++
Sbjct: 182 LKKILVPKTNYGPALIEHVLLEFGFPPNSRIGTQFDITRD--LPKLHLALKSADSIMQNI 239

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIY--DEFCPLLLNQFRSREFVKFETFDA 356
            S   + +G ++ +     ++  PT SG +   +   EF P+L  Q  S  F++  +F+ 
Sbjct: 240 GS---ISKGIVVQK-----RESRPTPSGENQDFFTNQEFHPMLYKQHESHPFIELPSFNQ 291

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DEF+SK+ESQ+ + +   +E  A  KL  I  D E R+  L   QE+D     A LIE
Sbjct: 292 AVDEFFSKMESQKLDLKVVQQERDAMKKLANIRQDHEKRLANLHHVQEIDEL--KARLIE 349

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   +D AI  VR ALAN++SW+++  +V+E  + G+PVA +I KL L  N +SL+LS+
Sbjct: 350 MNQPLIDHAIQVVRSALANQVSWKEIDELVEEATRKGDPVAKIIKKLKLSTNHISLMLSH 409

Query: 475 NLDEMDDE---EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
              E D +   +++   + V++DL L+A ANAR+++  KK    K++KTI +  KAFK+A
Sbjct: 410 PYAEQDSDSESDESYKPQLVDIDLDLTAFANARKYFGEKKNASKKEQKTIESSHKAFKSA 469

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           EKK +  + +   +A I   RKV WFEKF WFISS+NY+V+ GRD QQNE++VKRY+  G
Sbjct: 470 EKKAKQTLKESAAIATIRKARKVLWFEKFYWFISSDNYIVVGGRDRQQNELLVKRYLKAG 529

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           D+YVHADLHGASS ++KN      +PP TL +AG   V +S AW++K++T+AWWV   QV
Sbjct: 530 DIYVHADLHGASSVIVKNVSASNRIPPRTLQEAGLMAVGYSAAWEAKVMTTAWWVESSQV 589

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
           SKTAP+GEYLT GSFMIRGKKNFLPP P+++GFGLLFRL+ESS+  HLN+R+ + 
Sbjct: 590 SKTAPSGEYLTTGSFMIRGKKNFLPPLPIVLGFGLLFRLEESSIARHLNDRKPKA 644


>gi|340374096|ref|XP_003385574.1| PREDICTED: nuclear export mediator factor Nemf-like [Amphimedon
           queenslandica]
          Length = 1137

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 295/718 (41%), Positives = 419/718 (58%), Gaps = 81/718 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A ++ L RRL GMR +N+YD+  KTY+ KL  S         EK++LL+
Sbjct: 1   MKERFTTVDLLASIEYLNRRLTGMRVANIYDVDHKTYLLKLARSE--------EKIVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG RLHTT +   K   PSGF +KLRKH+RT+RL  + QLG DR+I   FG G  AH++
Sbjct: 53  ESGCRLHTTEFEWPKHLQPSGFAMKLRKHLRTKRLISITQLGVDRVIDMVFGSGEYAHHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD  + +L+LLR+  D D  V    R  +  +  +  +   + +  A 
Sbjct: 113 IIELYDRGNIILTDHTYLILSLLRTRTDADADVRFAVREHFSMDTIKQEQILPSIEQVAG 172

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           +  S +P                       G Q                          L
Sbjct: 173 ILGSAKP-----------------------GDQ--------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           + +L     YG +L  H ++  GL  N KL   N    +  QVL  A+ +  +  Q   S
Sbjct: 184 RHILNPHFVYGTSLLTHCLIGIGLTENTKLPATNDSPIDPDQVLK-ALLEAHEIFQSFRS 242

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD---------EFCPLLLNQFRSREFVKF 351
             +  +GY++ +     KD  PT   +++             EF PLL  Q  S  + + 
Sbjct: 243 --MPSKGYLIQK-----KDVAPTVGVATSDTPTTSTEVTTNIEFHPLLYRQHLSSCYKEV 295

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           ETFD A+DEF+S   SQ+ + +    + +A  KL  I  D E R+  L++  D     AE
Sbjct: 296 ETFDRAVDEFFSSKSSQKQDVKVIQLQKSAVKKLENIKQDHEKRIEALRKSQDEDRYKAE 355

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE+N + V+ A L +R A+A+ M W D+  +V + +  G+PVA  I  L L  N ++L 
Sbjct: 356 LIEWNTDLVERACLVIRSAVASSMDWGDIELLVHDAQGRGDPVANSIQGLKLHSNLITLW 415

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           L    +E DD+       KV++DL LS +ANARR+Y++KK+   K++KT  + +KA K+A
Sbjct: 416 LKAPYEEDDDDSI-----KVDIDLGLSVYANARRYYDMKKQAAKKEQKTSESSNKALKSA 470

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           E+KT+  + +   ++ I+  RKVHWFEKF WFISSEN++VI GRD QQNE++VK+Y+++ 
Sbjct: 471 ERKTKQTLKEAAVISRITKARKVHWFEKFYWFISSENFVVIGGRDQQQNELLVKKYLNEH 530

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           DVYVHADLHGA+S ++KNH    PVPP TLN+AG   VC+S AW++K+VTSAWWVY +QV
Sbjct: 531 DVYVHADLHGATSVIVKNHSG-GPVPPKTLNEAGVMAVCYSSAWEAKIVTSAWWVYANQV 589

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           SKTAP+GEYLT GSFMIRGKKNFLPP  L++GF ++F++DESSL +H+NERRVR  +E
Sbjct: 590 SKTAPSGEYLTTGSFMIRGKKNFLPPCHLVLGFSIMFKVDESSLANHINERRVRSADE 647



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 50/90 (55%), Gaps = 18/90 (20%)

Query: 842 YISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPES------IVRKTKIEGGKISRG 895
           +IS  ER+ LKK Q SS           +G +ASS P S           + +  +  RG
Sbjct: 754 HISAKERKLLKK-QSSS-----------KGHEASSTPASSKPHPKPQPLPQPQSQQYKRG 801

Query: 896 QKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           QK K KK+K+KYGDQDEEER +RM LLA S
Sbjct: 802 QKSKQKKIKDKYGDQDEEEREMRMNLLASS 831


>gi|322784867|gb|EFZ11647.1| hypothetical protein SINV_03144 [Solenopsis invicta]
          Length = 985

 Score =  518 bits (1333), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 295/713 (41%), Positives = 430/713 (60%), Gaps = 77/713 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L+RLIGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNTYDLVCSVTELQRLIGMRVNQIYDIDNRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT++   K   PS F++K+RKH++ +RLE + Q+G DRII  QFG G  A+++I
Sbjct: 53  SGNRIHTTSFEWPKNVAPSSFSMKMRKHLKNKRLESLMQVGTDRIIKLQFGSGEAAYHII 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LE+Y +GNI+LTD E  +L +LR H + DK +    + +YP +            +H  +
Sbjct: 113 LEVYDRGNIILTDHEMVILYVLRPHTEGDK-IRFAVKEKYPLDRAHSTTMPPIDVIHEHI 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             +KE                             G+S                     LK
Sbjct: 172 QKAKE-----------------------------GES---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            VL   L +G A+ +H++L  G     K+  + N  ED  +  L+LA+    + +   ++
Sbjct: 182 KVLNPLLEFGSAVIDHVLLKAGFNFGCKIGKDFNIAED--MPKLILALEDANNMMD--LA 237

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAAL 358
              V +GYIL +     K+   T+ G    I+   EF P L +Q+ ++ + +F++FDAA+
Sbjct: 238 KKTVSKGYILQK-----KESKLTQDGKEDFIFANIEFHPFLFDQYNNQPYKEFDSFDAAV 292

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYN 416
           DE+YS +E Q+ + +   +E  A  KL ++  D   R+ TL+  QE+D+  + AELI  N
Sbjct: 293 DEYYSTMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDK--QKAELISRN 350

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
              VD AILA++ ALAN+MSW D+  ++KE +  G+PVA  I +L LE N ++LLL +  
Sbjct: 351 QALVDNAILAIQSALANQMSWPDIQVLLKEAQARGDPVASAIKQLKLETNHIALLLHDPY 410

Query: 477 DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           ++ D+E +  P+  +++DLA +A +NA+++Y  KK    KQ+KTI +H KA K+AEKKT+
Sbjct: 411 EDSDEESELKPM-IIDIDLAHTAFSNAKKYYSQKKSAAKKQQKTIESHGKALKSAEKKTK 469

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
             + + +T+  I+ +RK++WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YVH
Sbjct: 470 QTLKEVQTIHTINKLRKMYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLKAGDLYVH 529

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           ADL GASS VIKN     PVPP +L +AG   V +S AWDSK++ SAWWV+  QVSK+AP
Sbjct: 530 ADLTGASSVVIKNPSG-NPVPPKSLAEAGTMAVAYSIAWDSKVIASAWWVHHDQVSKSAP 588

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           TGEYLT GSFMIRGKKN+L    LIMG GL+FRL++SS+  H NERRV+  +E
Sbjct: 589 TGEYLTTGSFMIRGKKNYLTQSQLIMGLGLMFRLEDSSIERHKNERRVKAVDE 641


>gi|115529351|ref|NP_001070202.1| uncharacterized protein LOC767767 [Danio rerio]
 gi|115313121|gb|AAI24465.1| Zgc:153813 [Danio rerio]
          Length = 694

 Score =  517 bits (1331), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 295/718 (41%), Positives = 416/718 (57%), Gaps = 79/718 (11%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H T +   K   PSGF +K RKH+++RRL  VRQLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRKHLKSRRLVHVRQLGVDRIVDLQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNI+LTD +F +L LLR    + + V I  R RYP E  R             
Sbjct: 113 ILELYDRGNIILTDHQFMILNLLRFRTAEAEDVKIAVRERYPVENAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    +    V +      G Q G +                      L
Sbjct: 160 --------AEEPIISLQRLTQVLS------GAQTGDQ----------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +L   L YG  L EH +   G+    K+     L   +++VL  A+   E+++Q   +
Sbjct: 184 KRILNPHLPYGGPLIEHCLASVGMSGLYKVDSQTDLTQVSLKVLE-ALQMAEEYMQK--T 240

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +G+I+ +++      P   +G + +    Y+EF P L  Q     +V+FE+F+ A
Sbjct: 241 ANFSGQGFIIQKSEQ----KPNVCAGDAAEELLTYEEFHPFLFCQHVKSRYVEFESFNKA 296

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEF+S++ESQ+ + +   +E  A  KL  +  D + R+  L Q  +      EL+E NL
Sbjct: 297 VDEFFSQMESQKLDMRALQQEKQALKKLENVRKDHQQRLEALHQAQEVERLKGELVELNL 356

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             V  A+  VR ALAN++ W ++ RMV E + AG+PVA  I +L L+ N ++LLL N   
Sbjct: 357 PVVQRALQVVRSALANQVDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPEA 416

Query: 475 ----NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  E+   +K+   EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  KA
Sbjct: 417 CPEGGAAELQSGKKSRSREKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKA 476

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           FK+AEKKT+  +   +TV +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKRY
Sbjct: 477 FKSAEKKTKQTLKDVQTVTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRY 536

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +  GD+YVHADLHGA+S VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV 
Sbjct: 537 LRAGDLYVHADLHGATSCVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQ 595

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
             QVSKTAP+GEYLT GSFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++
Sbjct: 596 HDQVSKTAPSGEYLTTGSFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMK 653


>gi|291190355|ref|NP_001167106.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
 gi|223648156|gb|ACN10836.1| Serologically defined colon cancer antigen 1 homolog [Salmo salar]
          Length = 1069

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 304/751 (40%), Positives = 431/751 (57%), Gaps = 104/751 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFNTVDIRAVIAEINANYLGMRVNNVYDIDTKTYLIRLQKPDT--------KSILLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H+T +   K   PSGF +K RKH+++RRL  V+QLG DRI+  QFG    A+++
Sbjct: 53  ESGLRIHSTDFEWPKNMMPSGFAMKCRKHLKSRRLTQVKQLGVDRIVDIQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           I+ELY +GNI+L D E+T+L LLR    + ++ V I  R RYP E               
Sbjct: 113 IVELYDRGNIILADHEYTILNLLRFRTAEGEEDVKIAVRERYPVE--------------- 157

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
                   +A  P+ +          S E L       +  LSK +N             
Sbjct: 158 --------NARPPEPL---------ISLERL-------TEVLSKATNGEQ---------- 183

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +K VL   L YG  L EH +++ GL   +K+        +A ++L  A+   ED+++   
Sbjct: 184 VKRVLNPHLPYGATLIEHCLMEVGLPGFIKVDSQYDAARDAPKILD-ALQMAEDYMEKTA 242

Query: 300 SGDIVPEGYILMQNKHLGKDHPPT---ESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           S D   +GYI+ +      D  P+   E       Y+EF P L  Q  +  +V+F+TFD 
Sbjct: 243 SFD--GKGYIIQKC-----DKKPSLAPEKPEELLTYEEFHPFLFAQHANSHYVEFDTFDK 295

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ--EVDRSVKMAELIE 414
           A+DE+YSK+ESQR + +   +E  A  KL+ +  D   R+  L Q  EVDR     EL+E
Sbjct: 296 AVDEYYSKMESQRIDVKALQQEKQALKKLDNVKRDHVQRLEALHQLQEVDRL--RGELVE 353

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            NL  V+ A+  VR ALAN++ W ++  +VKE + AG+PVA  I +L L+ N +++LL N
Sbjct: 354 MNLPIVERALQVVRSALANQVDWAEIGLIVKEAQAAGDPVACAIKELKLQTNHITMLLKN 413

Query: 475 ----------------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRW 506
                                       +  +    +K  P+  V+VDL+LSA+ANA+++
Sbjct: 414 PYIVPDEVEEEDVAEVAEEKKGKKNKNKDKGQKGKPKKDQPM-LVDVDLSLSAYANAKKY 472

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y+ K+    K++KT+ A  KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISS
Sbjct: 473 YDHKRTAAKKEQKTVEAAQKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISS 532

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYL+I+GRD QQNE+IVKRY+  GD+YVHADLHGA+S VIKN     P+PP TL +AG 
Sbjct: 533 ENYLIIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNASG-VPIPPRTLTEAGT 591

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             VC+S AWD+K++TSAWWV+ HQV+K+APTGEYLT GSFMIRGKKNF+PP  L+MGF  
Sbjct: 592 MAVCYSAAWDAKVITSAWWVHHHQVTKSAPTGEYLTTGSFMIRGKKNFMPPSYLMMGFSF 651

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
           LF++DE  +  H  ER+V+  +E M D   S
Sbjct: 652 LFKVDEQCVFRHRGERKVKTIDEDMADVTSS 682


>gi|198422494|ref|XP_002122733.1| PREDICTED: similar to serologically defined colon cancer antigen 1
           [Ciona intestinalis]
          Length = 1103

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 293/719 (40%), Positives = 421/719 (58%), Gaps = 87/719 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  +  +++GMR  NVYD+  KTY+FKL        +    K +LL+
Sbjct: 1   MKSRFSTLDICAVLTEINEKVVGMRLVNVYDIDHKTYLFKL--------AKPDHKAMLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H + +   K   PS F++KLRKH+R RRL    QLG DRI+  QFG    +++V
Sbjct: 53  ESGIRIHLSEFDWPKNPMPSNFSMKLRKHLRGRRLVSASQLGIDRIVDLQFGSEDASYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRD---------DDKGVAIMSRHRYPTEICRVFER 171
            +ELY +GNI L+D    +L LLR  +D         ++  V +     YP        R
Sbjct: 113 FVELYDRGNIALSDCNDVILNLLRFRKDLHKPDAEQQENSDVKVAVHEPYP--------R 164

Query: 172 TTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSND 231
            TA ++   ++  K                     KE L   K G               
Sbjct: 165 NTARQVEPFISIEK--------------------LKEILQSAKNGS-------------- 190

Query: 232 GARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
                   +K +L   L YG A  EH I++ G  P++KL    + E +  + L  ++   
Sbjct: 191 -------LVKRILNPHLPYGAACIEHAIINAGFSPDVKLGGEFQFERDC-EKLHESLKSC 242

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
           E+ +Q   S  +  +GYI+ + +        T+S    +   EF P + NQ + R   +F
Sbjct: 243 EEMMQTAKS--LQCKGYIVQKIE--------TKSDGELKTNVEFHPFVFNQHKHRNLQEF 292

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E+F+ A+DEF+  +ESQ+ + +   +E AA  KL  +  D E+R+  L+ E +     A 
Sbjct: 293 ESFNKAVDEFFGSLESQKNDMKSLQRERAAMRKLENVRKDHESRLSGLRSEQESDEMKAA 352

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE NL  VD +IL VR A+AN++ W+++  +VKE +  G+PVA  I  L LE N M + 
Sbjct: 353 LIETNLHLVDQSILVVRSAIANQVDWDEIKLLVKEAQGRGDPVASCIKTLKLETNSMVMA 412

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           L ++ D  DD++ T    K+E+DL+LSA+ANAR++Y  K+    K++KTI A +KAFK+A
Sbjct: 413 LRSHDD--DDQKPT----KIEIDLSLSAYANARKYYGRKRNAAKKEQKTIDASTKAFKSA 466

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           EKKT+  + +   V NI   RKV+WFEKF WFISSENYLVI GR+AQQNE++VK+Y+++G
Sbjct: 467 EKKTKQTLKEAAAVRNILKARKVYWFEKFLWFISSENYLVIGGREAQQNEVLVKKYLNQG 526

Query: 592 DVYVHADLHGASSTVIKNHRPE-QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           D+YVHADLHGA+S +IKN  P  QP+PP TLN+AG    CHS AWD+K+VTSAWWV+  Q
Sbjct: 527 DIYVHADLHGATSCIIKN--PSGQPIPPKTLNEAGTMATCHSAAWDAKVVTSAWWVHHDQ 584

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           VSKTAP+GEYLT GSF+IRGKKN+LPP  L+ GFG LF++DE+ +  H  ERRVR  ++
Sbjct: 585 VSKTAPSGEYLTTGSFLIRGKKNYLPPSYLVYGFGFLFKVDETCVWKHKGERRVRTNDD 643


>gi|66804841|ref|XP_636153.1| DUF814 family protein [Dictyostelium discoideum AX4]
 gi|60464500|gb|EAL62645.1| DUF814 family protein [Dictyostelium discoideum AX4]
          Length = 1268

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 298/785 (37%), Positives = 448/785 (57%), Gaps = 156/785 (19%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+   V  L++ LIG+R +N+YDLSP+ ++ K         S    K  L++
Sbjct: 1   MKTRFSSIDIRTTVVNLQKSLIGLRLANLYDLSPRVFLLKF--------SKPDCKKNLII 52

Query: 61  ESGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           ESG+R+H+T + RDK + TP+ F+L LRK+++T+RLE V+QLG DR++ F FG G+   +
Sbjct: 53  ESGIRIHSTNFVRDKGDHTPAPFSLNLRKYLKTKRLESVKQLGVDRVVDFTFGSGVAVQH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           +I+ELY+ GNI+LTD E+ +L +LR+H+ + D+ VA+     YP +  +V    T S + 
Sbjct: 113 LIVELYSIGNIILTDGEYRILAILRTHQYNQDESVAVGDV--YPIDKVKVPTEFTESLI- 169

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                         D++ E                              N+ D    K+ 
Sbjct: 170 --------------DQIIE------------------------------NTVD----KKE 181

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN------KLEDNAIQVLVLAVAKFE 292
           TLK V  ++L +GP L EH +L  GL P+ KL + +       L D+ IQ          
Sbjct: 182 TLKQVFNKSLDFGPELIEHCLLSAGLQPSTKLEQYDHSKFSKSLRDSFIQG--------- 232

Query: 293 DWLQDVISGDIVPEGYILMQNK-----------------------HLGKDHPPT------ 323
              Q +    I  +GYI++++                         +  D          
Sbjct: 233 ---QKIFDNSIQSKGYIVLKDPKQLKPQQQQKQQKQQQQQQSNTLKISNDLSSNNNNNNN 289

Query: 324 -----ESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
                E      IY+EF P L  Q+  ++F++FE+FDAA+D+F+S+IESQ+ EQQ  A+E
Sbjct: 290 NNNNLEEKKEMVIYEEFVPYLYKQYELKKFIEFESFDAAVDQFFSEIESQKVEQQRIAQE 349

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
                KL+K+  DQ+ R+ +L      +++ A+LIE NL++VD  IL +R  +A+ M WE
Sbjct: 350 QVVLKKLDKVKEDQQRRIDSLFANEVENIRKAQLIEANLQEVDQCILIIRSGVASSMDWE 409

Query: 439 DLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLLSNNL-------------------- 476
            L +++KEE+K  NP  VA  I +L LE N ++L L++N                     
Sbjct: 410 TLNQLLKEEKKK-NPYSVATKIHRLKLESNQITLSLTDNFLYDDNDGDDDDDDEESDEES 468

Query: 477 -------------DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
                         +  +++  L    ++VD++LSA ANAR++Y+ KK+   K +KTI+ 
Sbjct: 469 DEEDQNTKKSIKKSKTSNQKPNL----IDVDISLSAFANARKYYDTKKQSHEKAQKTISQ 524

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
              A KAAEKKTR Q+ + K+  ++  MRK+ WFEKF+WFISS+NY+V+SGRDAQQNE++
Sbjct: 525 AEFALKAAEKKTRQQLSETKSKNSMIAMRKIFWFEKFHWFISSDNYIVVSGRDAQQNELL 584

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
            K+Y+ K D+YVHAD+ G++S VIKN    + +PP TL QAG  T+C+S AW +K+VTSA
Sbjct: 585 YKKYLEKDDIYVHADIFGSTSCVIKNPNGGE-IPPNTLIQAGTMTMCYSNAWSAKVVTSA 643

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERR 703
           +WVY HQVSKTAP+GE+LT GSFMIRGKKN+LP   L+MGFG +F++D+S LG+HLNER+
Sbjct: 644 YWVYSHQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVMGFGFMFKIDDSCLGNHLNERK 703

Query: 704 -VRGE 707
            + GE
Sbjct: 704 PIYGE 708


>gi|390331684|ref|XP_003723334.1| PREDICTED: nuclear export mediator factor Nemf-like
           [Strongylocentrotus purpuratus]
          Length = 1116

 Score =  513 bits (1321), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 287/713 (40%), Positives = 428/713 (60%), Gaps = 75/713 (10%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +  +L+G+R  NVYD++ KTY+ +L         G  +KV+LL 
Sbjct: 1   MKSRFTTIDLRAILYEIGSKLLGLRVLNVYDVNNKTYLIRL--------GGTDQKVVLLF 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+HTT++   K   PS F++KLRKH+++RRL +++QLG DR++  QFG    A++V
Sbjct: 53  ESGTRMHTTSFDWPKSQMPSNFSMKLRKHLKSRRLTEIKQLGVDRVVDLQFGSDEAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GN+ LTD E+T+LTLLR+ R D + V    R RYP +                
Sbjct: 113 IVELYDRGNVALTDHEYTILTLLRT-RKDSEDVRFAVRERYPVDT--------------- 156

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A  PD +           +E L   K G +                     +
Sbjct: 157 --------ARHPDPIPS-----LERIQEILAAGKPGDN---------------------I 182

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           + +L     YGPAL EH +L+ G   N K +    ++ +  +V+  ++++ E +++   S
Sbjct: 183 RKLLNPHFIYGPALIEHCLLNQGFPSNAKGNNGFDIQQDMSRVMT-SLSEGEQYVEK--S 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G    +GYI+ + +   K    ++ G   ++  EF  L  N   S+ +++F+TFD A DE
Sbjct: 240 GSEC-KGYIVQKRE---KKPAASQDGEDAELLTEFI-LYTN---SQPYLEFDTFDQAADE 291

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+SK+ESQ+ + +   +E  A  KL+ +  D E R+ +L+Q  + + K   LIE NL  V
Sbjct: 292 FFSKMESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLV 351

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           + A+  VR A+AN++ W+++  ++KE +  G+PVA  I  L L+ N   +LL +   + D
Sbjct: 352 EQALRVVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIKSLRLDTNHFQMLLRDPYKQYD 411

Query: 481 D----EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           D    EE       V++D+A SA+ANAR+++  KK  + K++KT+ + SKA K+AEKKT 
Sbjct: 412 DADEGEEDVARPMLVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSSKAIKSAEKKTM 471

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
             +    TVA+I+  RK +WFEK+ W ISSENY++I+GRD QQNE++VK+Y+S GD+YVH
Sbjct: 472 QALKDVATVASINKSRKTYWFEKYYWCISSENYIIIAGRDQQQNEIVVKKYLSPGDIYVH 531

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           AD+HGASS +IKN +   PVPP TL +AG   VC+S AWD+K++TSAWWV   QVSKTAP
Sbjct: 532 ADIHGASSVIIKNPKG-GPVPPKTLQEAGTMAVCYSVAWDAKVITSAWWVRHDQVSKTAP 590

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           TGE+LT GSFM+RGKKNFLPP  L+MGFG L ++DES    H +ERR+RG +E
Sbjct: 591 TGEFLTTGSFMVRGKKNFLPPTQLVMGFGFLMKIDESCAWRHKDERRIRGTDE 643


>gi|449504623|ref|XP_002200475.2| PREDICTED: nuclear export mediator factor Nemf [Taeniopygia
           guttata]
          Length = 1213

 Score =  513 bits (1321), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 299/731 (40%), Positives = 416/731 (56%), Gaps = 98/731 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+H T +   K   PS
Sbjct: 158 LLGMRVNNVYDVDNKTYLIRLQKPEC--------KATLLLESGIRIHLTEFEWPKNMMPS 209

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F +K RKH+RTRRL  VRQLG DR++  QFG    A+++ILELY +GN++LTD E+ +L
Sbjct: 210 SFAMKCRKHLRTRRLVSVRQLGVDRVVDLQFGSEQAAYHLILELYDRGNVVLTDHEYLIL 269

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            +LR   D+   V    R RYP E         ++K    L +         D++ E   
Sbjct: 270 NILRFRTDEADDVRFAVRERYPVE---------SAKAAVPLPTL--------DRLTE--- 309

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIIL 260
            +SNA K   G Q                          LK VL   L YG +L EH ++
Sbjct: 310 IISNAPK---GEQ--------------------------LKRVLNPLLPYGSSLIEHCLI 340

Query: 261 DTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW--LQDVISGDIVPEGYILMQNKHLGK 318
           + G    +K+ +  + ++N  +VL  A+ K E++  L D  SG    +GY++ Q +    
Sbjct: 341 EAGFSGAVKIDQHLEKKENLEKVLS-ALEKAEEYMALTDNFSG----KGYVI-QKREKKP 394

Query: 319 DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
              P +       Y+EF P L +Q     +++F++F+ A DEFYSK+E Q+ + +   +E
Sbjct: 395 SLEPDKPAEDIYTYEEFHPFLFSQHSKCPYLEFDSFNKATDEFYSKLEGQKIDLKALQQE 454

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
             A  KL  +  D E+R+  L+Q  +      ELIE NL  VD AI  VR ALAN++ W 
Sbjct: 455 KQALKKLENVRRDHEHRLEALQQAQEADKLKGELIEMNLAVVDRAIQVVRSALANQIDWT 514

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------------------------ 474
           ++  +VKE +  G+PVA  I +L L+ N +++LL N                        
Sbjct: 515 EIGAIVKEAQAQGDPVATAIKELKLQTNHITMLLRNPYVLSEEEEEEDDADIEKEETEEP 574

Query: 475 -------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                     ++   +K  P   V+VDL LSA+ANA+++Y+ K+    K +KT+ A  KA
Sbjct: 575 KGKKKKNKTKQLKKPQKNKP-SLVDVDLNLSAYANAKKYYDHKRHAAKKTQKTVEAAEKA 633

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           FK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSENYLVI+GRD QQNE+IVKRY
Sbjct: 634 FKSAEKKTKQTLREVQTVTTIQKARKVYWFEKFLWFISSENYLVIAGRDQQQNELIVKRY 693

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +  GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S AWD+++VTSAWWV 
Sbjct: 694 LKPGDIYVHADLHGATSCVIKNPSGE-PIPPRTLTEAGTMALCYSAAWDARVVTSAWWVS 752

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
             QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+V+ +
Sbjct: 753 HSQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHREERKVKVQ 812

Query: 708 EEGMDDFEDSG 718
           +E +D    S 
Sbjct: 813 DEDLDTVSSSA 823



 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 8/65 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+H T +   K   PS
Sbjct: 92  LLGMRVNNVYDVDNKTYLIRLQKPEC--------KATLLLESGIRIHLTEFEWPKNMMPS 143

Query: 81  GFTLK 85
            F +K
Sbjct: 144 SFAMK 148


>gi|449681046|ref|XP_002157080.2| PREDICTED: nuclear export mediator factor NEMF-like, partial [Hydra
            magnipapillata]
          Length = 1467

 Score =  512 bits (1318), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 288/710 (40%), Positives = 422/710 (59%), Gaps = 79/710 (11%)

Query: 18   LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN 77
            L   IG+R +NVYD+  KT++ +L +       GE  K  +L+ESG R+H T Y   K  
Sbjct: 477  LNSSIGLRVANVYDIDNKTFLVRLTH-------GEI-KSTILVESGNRIHLTEYDWPKSM 528

Query: 78   TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEF 137
             PSGF++K RKH++ RRL  + QLG DRI+   FG    A+++I+ELY +GNI+L D E+
Sbjct: 529  MPSGFSMKCRKHLKGRRLASINQLGVDRIVDMTFGYDEAAYHLIVELYDRGNIVLADFEY 588

Query: 138  TVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHAALTSSKEPDANEPDKVN 196
             +L LLR   D++  V    R +YP E+ R  E   + +KL   +               
Sbjct: 589  NILQLLRVRTDENADVKFAVREKYPVELARKEEPLLSINKLEEII--------------- 633

Query: 197  EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
                             K GKS D                  +LK VL   L +GP+L E
Sbjct: 634  -----------------KSGKSTD------------------SLKQVLNPLLIFGPSLLE 658

Query: 257  HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
            H +L+ G  P+ KLS++N  +   I  L  ++   ++ L+++ S +   EGY++ +    
Sbjct: 659  HCLLEGGFSPSTKLSQINTSDKQEISKLYSSLQIGDNILKNISSKE--GEGYLIQK---- 712

Query: 317  GKDHPPTESG-SSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHK 375
             K+      G     IY EF P L +Q +S  F+ F +F+  +DEF+SK+ESQ+ + +  
Sbjct: 713  -KESNANAVGEKDLLIYTEFHPFLYHQHKSLPFIHFHSFNKCVDEFFSKLESQKIDLKAL 771

Query: 376  AKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRM 435
             +E AA  +L  +  D E R+H+LK+  D+  + A+LIE NL  ++ AI+ V  A+AN++
Sbjct: 772  QQEKAALKRLENVREDHEKRIHSLKETQDKEARRAKLIELNLPLIERAIIIVNSAIANQL 831

Query: 436  SWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDL 495
             WE++  ++KE +  G+PVA +I  L L+ N +++ + N+ +E + +E  L    + +DL
Sbjct: 832  DWEEIEDLLKEAKLKGDPVANIIKSLQLKTNQITISV-NDEEETESDEDDLDEVDIIIDL 890

Query: 496  ALSAHANARRWYEL----KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM 551
             L+A  NARR+Y +    ++   +K+EKTI A  KA K+AE KT+  + + +    I+  
Sbjct: 891  GLTAFGNARRYYYILHDKRRNAATKEEKTIQASKKALKSAEYKTKETLKEVQNAKIINKT 950

Query: 552  RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
            RK  WFEKF WFISSENYLVI GRD QQNE++VKRY+  GD+YVHADLHGASS +IKN  
Sbjct: 951  RKTFWFEKFYWFISSENYLVIGGRDQQQNEILVKRYLKAGDLYVHADLHGASSVIIKNST 1010

Query: 612  PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
                VPP TLN+AG   +C+S AW+++++TSAWWVY +QVSKTAP+GEYLT GSFMIRGK
Sbjct: 1011 G-LDVPPKTLNEAGTMAICYSAAWEARVITSAWWVYHNQVSKTAPSGEYLTTGSFMIRGK 1069

Query: 672  KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG------MDDFE 715
            KNFLPP  LIMGF +LF+LDES +  H+N+RRV+  ++       ++DFE
Sbjct: 1070 KNFLPPSYLIMGFSVLFKLDESCISRHVNDRRVKSNDDQENKSIEVEDFE 1119


>gi|307173031|gb|EFN64173.1| Serologically defined colon cancer antigen 1-like protein
           [Camponotus floridanus]
          Length = 988

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 284/710 (40%), Positives = 423/710 (59%), Gaps = 71/710 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++LIGMR + +YD+  +TY+ +   S         EK +LL+E
Sbjct: 1   MKTRFNTYDLVCSVTELQKLIGMRVNQIYDIDHRTYLIRFQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG RLH T +   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A+++I
Sbjct: 53  SGNRLHMTNFEWPKNVAPSGFSMKMRKHLKNKRLESLTQVGMDRIINLQFGSGEAAYHII 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LE+Y +GNI+LTD E  +L +LR H + DK +    R +YP +          + +H  +
Sbjct: 113 LEVYDRGNIILTDYEMVILYVLRPHTEGDK-IRFAVREKYPLDRAHSTTMPPINVIHEHI 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             +KE            G+N                                      LK
Sbjct: 172 QKAKE------------GHN--------------------------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            VL   L +G A+ +H++L  G     K+ +   +  +  + L+LA+   ++ +    + 
Sbjct: 182 KVLNPLLEFGSAVIDHVLLKAGFTLGCKIGKDFHITKDMPK-LILALEDADNIMDH--AK 238

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
             + +GYI+ +     K+   T+ G    I+   EF P L  Q++++ + +F++FDAA+D
Sbjct: 239 KHISKGYIIQK-----KEAKMTQDGKEDFIFANIEFHPFLFEQYKNQPYKEFDSFDAAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           E++S +E Q+ + +   +E  A  KL ++  D + R+ TL++  +   + AELI  N   
Sbjct: 294 EYFSTMEGQKLDLKVLQQEREALQKLERVKKDHDQRLVTLEKSQELDKQKAELISRNQIL 353

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VD AILA++ ALAN+MSW D+  ++KE +  G+PVA  I +L LE N ++LLL +  ++ 
Sbjct: 354 VDNAILAIQSALANQMSWPDIQILLKEAQVIGDPVASAIKQLKLETNHITLLLHDPYEDS 413

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
           D+E +  P+  +++DLA +A +NA+ +Y  KK    K +KTI +  KA K+AEKKT+  +
Sbjct: 414 DEESELKPM-LIDIDLAHTAFSNAKNYYSQKKSAARKHQKTIESQGKALKSAEKKTKQTL 472

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
            + +T+  I+ +RK +WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YVHADL
Sbjct: 473 KEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLKAGDLYVHADL 532

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
            GASS VIKN   + PVPP +L +AG   V +S AWDSK++ SAWWV+  QVSK+APTGE
Sbjct: 533 TGASSVVIKNPSGD-PVPPKSLAEAGTMAVAYSIAWDSKVIASAWWVHHDQVSKSAPTGE 591

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           YLT GSFMIRGKKN+L    LIMG G++FRL+ESS+  H NERRV+  +E
Sbjct: 592 YLTTGSFMIRGKKNYLTQSQLIMGLGVMFRLEESSIERHKNERRVKTIDE 641


>gi|312384850|gb|EFR29482.1| hypothetical protein AND_01485 [Anopheles darlingi]
          Length = 1109

 Score =  509 bits (1312), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 353/982 (35%), Positives = 523/982 (53%), Gaps = 169/982 (17%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M K R NT DV   V  L++LIGMR + +YD+  KTY+ +L  +         EKV+LL+
Sbjct: 1   MTKTRFNTYDVVCSVTELQKLIGMRVNQIYDIDNKTYLIRLARNE--------EKVVLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R HTT++   K   PSGFT+KLRKH++ +RLE ++QLG DRI+ FQFG G  A+++
Sbjct: 53  ESGLRFHTTSFEWPKNMAPSGFTMKLRKHLKNKRLESLQQLGVDRIVDFQFGSGEAAYHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNILLTD E  +L +LR H + ++ +    R +YP                  
Sbjct: 113 ILELYDRGNILLTDCELRILNILRPHVEGEE-LRFAVREKYPK----------------- 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                       D+  +D               +G  S +  K + + ++ G      TL
Sbjct: 155 ------------DRAKQD---------------QGPPSVEQIKGAIEKAHPGD-----TL 182

Query: 241 KTVLGEALGYGPALSEHIILDTGL--------------VPNM----------KLSEVNKL 276
           +T L   L YG ++ +H++ + GL              +P            + ++V +L
Sbjct: 183 RTALNPVLEYGASVIDHVLHEHGLFGCRIGGELPVDANLPKKAKRKQKNICKEFTKVFEL 242

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--- 333
           E N  + L+ A+   E  LQ+    +  P GYI+ +     K+  P + G   + Y    
Sbjct: 243 E-NDFEPLISALNDAETMLQNA-RKEPSP-GYIIQK-----KEVRPAKEGEKEEYYFTNL 294

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           E+ P + +Q++     +F++F +A+DEFYS +E+        A+E  A  KL+ +  D  
Sbjct: 295 EYQPYMYSQYQGEPCKEFDSFTSAVDEFYSSLETL-------AQEREALKKLSNVKTDHA 347

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
            R+  L +      K AELI  N + VD A+LAV+ ALA +MSW D+  +VK  +   +P
Sbjct: 348 KRIEELTKAQLGDRKKAELITRNQDLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDP 407

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-------------LPVEKVEVDLALSAH 500
           VA  I +L LE N +SL LS+    +D+ E               L    V+VDLALSA 
Sbjct: 408 VASCIRQLKLEINHISLYLSDPYAFLDENESDNEEDSDREEDEEKLEPMVVDVDLALSAF 467

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE-KTVANISHMRKVHWFEK 559
           ANARR+Y+ ++    K++KTI + SKA K AE+KT +Q L++ +T   IS +RKV+WFEK
Sbjct: 468 ANARRYYDQRRFAARKEQKTIESSSKALKNAERKT-IQTLKDVRTQTTISKVRKVYWFEK 526

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F WFISSENYL+I GRD QQNE+IVKRYM   D+YVHA++ GASS +IKN    + +PP 
Sbjct: 527 FYWFISSENYLIIGGRDQQQNELIVKRYMRPNDIYVHAEIQGASSVIIKNPAGGE-IPPK 585

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL +AG   + +S AWD+K+VTSA+WV+  QVSKTAPTGEYLT GSFMIRG+KNFLPP  
Sbjct: 586 TLLEAGTMAISYSVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFLPPCH 645

Query: 680 LIMGFGLLFRLDESSLGSHLNERRVRG--EEEGMDDFEDSGHHKENSDIESEKDDTDEKP 737
           L++G   LF+L++SS+  H  ER+VR   EE  +   E+     E+ D E + DD  ++ 
Sbjct: 646 LVLGLSFLFKLEDSSVERHRGERKVRNFDEESVISKEEERSEISESVDQEIKLDDESDQE 705

Query: 738 VAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVA-----APVT 792
             E              TN           ED+   N +  K+  ++ + +     +P T
Sbjct: 706 EQEP------------ETN-----------EDQQPDNSLSQKVAGLSVSESQETEKSPST 742

Query: 793 PQLEDLIDRALGLGSASIS----STKHGIETTQFDLSEEDKHVERTATV---RDKPYISK 845
            Q +D  ++        I     + K  + T    L   +   +R A +    +KPYI +
Sbjct: 743 GQSDDEPEQGPQFPDTHIKVEHDTGKVSVRTDPI-LQRLNSETDRKAEIFLGDEKPYIIQ 801

Query: 846 AERRKLKKGQGSSVVDPKVEREKERGKDASSQP-ESIVRKTKIEG----GKISRGQKGKL 900
               +LK+          + + K++ KD   +  E  V   K EG    G++ RGQ+ K+
Sbjct: 802 PAAPRLKQ----------ISKSKQKAKDKEQKAKEKQVAPQKDEGQQKQGQLKRGQRAKM 851

Query: 901 KKMKEKYGDQDEEERNIRMALL 922
           +K+KEKY DQDE++R + M +L
Sbjct: 852 RKIKEKYKDQDEDDRKMIMEIL 873


>gi|332016223|gb|EGI57136.1| Serologically defined colon cancer antigen 1 [Acromyrmex
           echinatior]
          Length = 990

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 346/930 (37%), Positives = 505/930 (54%), Gaps = 104/930 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L+RLIGMR + +YD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNTYDLVCSVTELQRLIGMRVNQIYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+H TA+   K   PSGF++K+RKH++ +RLE + Q+G DRII  QFG G  A++VI
Sbjct: 53  SGNRIHITAFEWPKNVAPSGFSMKMRKHLKNKRLESLMQVGTDRIIKLQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LE+Y +GNI+LTD E  +L +LR H + DK +    + +YP +            +H  +
Sbjct: 113 LEVYDRGNIILTDHEMVILYVLRPHTEGDK-IRFAVKEKYPLDRAHSTTMPHIDVIHDHI 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             +KE D                                                   LK
Sbjct: 172 QKAKEGD--------------------------------------------------NLK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAK-FEDWLQDVI 299
            VL   L +G A+ +H++L  G     K+  + +  ED    +L L  A    D+ +  +
Sbjct: 182 KVLNPLLEFGSAVIDHVLLKAGFNLGCKIGKDFHITEDMPRLILALEDANNIMDYAKKNV 241

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAA 357
           S     +GYI+ +     K+   T+ G    I+   EF P L  Q+ ++ + +F +FDAA
Sbjct: 242 S-----KGYIIQK-----KESKLTQDGKEDFIFANIEFHPFLFEQYNNQPYKEFNSFDAA 291

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEY 415
           +DE++S +E Q+ + +   +E  A  KL ++  D   R+ TL+  QE+D+  + AELI  
Sbjct: 292 VDEYFSMMEGQKIDLKALQQEREALQKLERVRKDHSQRLITLEKTQELDK--QKAELISR 349

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N   VD AILA++ ALAN+MSW D+  ++KE +  G+PVA  I +L LE N ++L+L + 
Sbjct: 350 NQVLVDNAILAIQSALANQMSWPDIQVLLKEAQTRGDPVASAIKQLKLETNHIALMLHDP 409

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
            ++ D+E K  P+  +++DLA +A +NA+++Y  KK    KQ+KTI +  KA K+AEKKT
Sbjct: 410 YEDSDEESKLKPM-MIDIDLAHTAFSNAKKYYSQKKSAAKKQQKTIESQGKALKSAEKKT 468

Query: 536 RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  + + +T+  I+ +RK +WFEKF WFI+SENYLVI GRD QQNE+IVKRY+  GD+YV
Sbjct: 469 KQTLKEVQTIHTINKLRKTYWFEKFYWFITSENYLVIGGRDQQQNELIVKRYLKAGDLYV 528

Query: 596 HADLHGASSTVIKNHRPE-QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
           HADL GASS VIKN  P   PVPP +L +AG   V +S AWDSK++ SAWWV+  QVSK+
Sbjct: 529 HADLTGASSVVIKN--PSGNPVPPKSLAEAGTMAVAYSIAWDSKVIASAWWVHHDQVSKS 586

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDF 714
           APTGEYLT GSFMIRGKKN+L    LIMG G++FRL++SS+  H +ERRV+  +E  +  
Sbjct: 587 APTGEYLTTGSFMIRGKKNYLTHSQLIMGLGIMFRLEDSSIERHKDERRVKTVDEESEKA 646

Query: 715 EDSGHHKENSDIESEKD-DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS 773
           +         ++E + D D + +   ++L   N+ HP     +    +SH      K   
Sbjct: 647 DSIVEDDREIELEGDSDEDENLEKQEQNLENKNTLHPI-QEEDQEKSESHTTDYSVKKDI 705

Query: 774 NGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVER 833
            G D K  D       P T    DL          S    K  ++  Q  +  +    E 
Sbjct: 706 YGEDEKDTDEDTKYQFPDTQIKIDL----------SGPKVKIHVDNNQPLMQSQKNTKEN 755

Query: 834 TATV-RDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKI 892
              +  DKP I  A   +    Q +     K+E++ +   D            K E   +
Sbjct: 756 VVYLGDDKPIIINASTMEKHAKQKTKESTKKIEKDDKNEND----------NKKGEQPTL 805

Query: 893 SRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
            RGQKGKLKK+KEKY DQDEE+R + M +L
Sbjct: 806 KRGQKGKLKKIKEKYKDQDEEDRRLSMLVL 835


>gi|380024993|ref|XP_003696268.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           homolog [Apis florea]
          Length = 970

 Score =  506 bits (1304), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 301/744 (40%), Positives = 438/744 (58%), Gaps = 84/744 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+A  +  L++LIGMR + VYD+  +TY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNSYDIACTINELQKLIGMRVNQVYDIDHRTYLIRLQRSE--------EKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+HTT +   K   PSGF++K+RKH++ +RLE + Q+G DR+I  QFG G  A+++I
Sbjct: 53  SGNRIHTTVFEWPKNVAPSGFSMKMRKHLKNKRLESLTQIGVDRMIDLQFGSGEAAYHII 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E   +T+L   R   +G  I    R+      V E+    + H  +
Sbjct: 113 LELYDRGNIVLTDYE---MTILNILRPHTEGDKI----RFA-----VKEKYPMDRAHQNI 160

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
               E              N+    +++L   K G++                     LK
Sbjct: 161 MPPIE--------------NI----QQHLQNAKIGEN---------------------LK 181

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            +L   L +G A+ +H++L  G     K+     +E++ +  L+LA+    D +    + 
Sbjct: 182 KILNPLLEFGSAIIDHVLLKHGFTLGCKIGRDFNIEED-MSKLILALEYANDMMN--FAR 238

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALD 359
             V +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD A+D
Sbjct: 239 QNVSKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDVAVD 293

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNL 417
           E++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI  N 
Sbjct: 294 EYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELISRNQ 351

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
             VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +  +
Sbjct: 352 TLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHDPYE 411

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           + D+E +  P+  +++DLA +A  NAR++Y  K+    KQ+KTI +  KA K+AEKKT+ 
Sbjct: 412 DSDEESELKPM-LIDIDLAHTAFGNARKYYNQKRSAAKKQQKTIESQDKALKSAEKKTKQ 470

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            + + +T+ +I+ +RK++WFEKF WFISSENYLVI GRD QQNE+IVKRY+  GD+YVHA
Sbjct: 471 TLKEVQTIHSINKLRKIYWFEKFYWFISSENYLVIGGRDQQQNELIVKRYLKTGDIYVHA 530

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           DL GASS +IKN      VPP TL +AG   V +S AWD+K+V  AWWV   QVSKTAPT
Sbjct: 531 DLTGASSVIIKNPGG-GSVPPKTLAEAGTMAVAYSIAWDAKVVAGAWWVNNDQVSKTAPT 589

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR---GEEEGMDDF 714
           GEYLT GSFMIRGKKN+LPP  L+MG G LF L+ESS+  H +ER+VR    E E  + F
Sbjct: 590 GEYLTTGSFMIRGKKNYLPPCQLVMGLGFLFXLEESSIERHKDERKVRIIDDENEHTESF 649

Query: 715 EDSGHHKENSDIE-SEKDDTDEKP 737
            +     E+ +IE  E  + DE+P
Sbjct: 650 IE-----EDKEIELIEDSEEDEQP 668



 Score = 47.8 bits (112), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 21/31 (67%), Positives = 26/31 (83%)

Query: 892 ISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           + RGQKG+LKKMKEKY DQDEE+R + M +L
Sbjct: 786 LKRGQKGRLKKMKEKYKDQDEEDRRLSMQVL 816


>gi|224101505|ref|XP_002312308.1| predicted protein [Populus trichocarpa]
 gi|222852128|gb|EEE89675.1| predicted protein [Populus trichocarpa]
          Length = 309

 Score =  505 bits (1301), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 249/326 (76%), Positives = 269/326 (82%), Gaps = 22/326 (6%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTY+FKLMNSSGVTESGESEKVLLLMESGVR
Sbjct: 1   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYVFKLMNSSGVTESGESEKVLLLMESGVR 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           LHTTAY RDK NTPSGFTLKLRKHIR RRLEDVRQLGYDRI+LFQFGLG NAHYVILELY
Sbjct: 61  LHTTAYVRDKSNTPSGFTLKLRKHIRARRLEDVRQLGYDRIVLFQFGLGANAHYVILELY 120

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSK 185
           +QGNI+L DSEF VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER+TA KL  ALTS K
Sbjct: 121 SQGNIILADSEFMVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERSTAEKLQKALTSLK 180

Query: 186 EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLG 245
           E +  +  K     ++V                       +KN+N+G R KQ TLKTVLG
Sbjct: 181 ELENKKQGKNKGGKSSV----------------------PSKNTNEGNRVKQATLKTVLG 218

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP 305
           E LGYGPALSEHIILD GLVPN K S+ NKL+D  IQVLV AVAKFE+WLQD+ISGD VP
Sbjct: 219 EVLGYGPALSEHIILDAGLVPNTKFSKDNKLDDETIQVLVKAVAKFENWLQDIISGDKVP 278

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQI 331
           EGYILMQNK+LGKD PP++SGSS Q+
Sbjct: 279 EGYILMQNKNLGKDCPPSDSGSSVQV 304


>gi|291230458|ref|XP_002735180.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 834

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 285/718 (39%), Positives = 417/718 (58%), Gaps = 69/718 (9%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +R RLIG+R  NVYDL  KTY+ +L        +    K  LL 
Sbjct: 8   MKARFTTFDILAIIPEIRARLIGLRVLNVYDLDNKTYLIRL--------AKPDVKDALLF 59

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+  T +   K   PSGF++KLRKH+R RRL  V QLG DRI+  QFG    A+++
Sbjct: 60  ESGQRIQCTDFDWPKNAMPSGFSMKLRKHLRGRRLVKVEQLGVDRIVDLQFGEEEAAYHL 119

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT-TASKLHA 179
           I+ELY +GN++LTD ++T+L LLR   D  + V    R  YP E  +  E   +  KLH 
Sbjct: 120 IVELYDRGNVVLTDHQYTILNLLRVRTDQSQDVKFAVREPYPLESAKQPEPVLSIEKLHD 179

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L ++K+ D                                                   
Sbjct: 180 ILVAAKDGD--------------------------------------------------Q 189

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L  GP++ EH +L  G     K+ +   +  +  +++  A+   E+ L+ ++
Sbjct: 190 LKRVLNPHLVCGPSVIEHCLLKQGFDDGCKVGQNVDISTDLPRIMA-ALQDMENVLKKIV 248

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
                 +GY++ Q K         +       Y E+ P+L  Q +   +++ E+F  A+D
Sbjct: 249 ESP--SKGYVI-QKKEKKTSKLSGDVPEELITYAEYHPMLFEQHQKSLYIELESFGKAVD 305

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EF+S++ +Q+ + +   +E +A  KL  +  D E R+  L+   +  +  A+LIE NL  
Sbjct: 306 EFFSQMGTQKLDIKALQQEKSAIKKLENVKKDHEKRIQQLQASQNVDMVKAQLIEINLPL 365

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL--D 477
           VD AI  V+ A+AN++ W ++  +VKE +  G+ VA  I  L L++N ++LLL +     
Sbjct: 366 VDRAIQVVQSAIANQIDWAEIWDIVKEAQTQGDEVAKSIKSLKLDKNHITLLLRDPFVSS 425

Query: 478 EMDDEEKTLPVE--KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
           ++DDE+K   +   K+++DL LSA+ANAR++YE KK    K++KT+ A  KA K+AE KT
Sbjct: 426 DVDDEDKHSGIGPLKIDIDLDLSAYANARKYYEAKKHSAVKEQKTLAASQKALKSAEIKT 485

Query: 536 RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  +    TV +I+  RK +WFEKF WFISSENYL+I GRD QQNE++V++Y++KGD+YV
Sbjct: 486 KQTLKDVATVTSINKARKTYWFEKFIWFISSENYLIIGGRDQQQNEIVVRKYLNKGDIYV 545

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
           HADLHGASS +IKN      +PP TLN+AG   +C+S AW +++VTSAWWVY +QVSKTA
Sbjct: 546 HADLHGASSVIIKNPTGAD-IPPKTLNEAGSMAICYSAAWQARVVTSAWWVYHNQVSKTA 604

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
           PTGEYLT GSFM+RGKKN+LPP  L+MGFG LF++DE SL  H +ER+V+  EE ++D
Sbjct: 605 PTGEYLTTGSFMVRGKKNYLPPSYLVMGFGFLFKVDEDSLWRHKDERKVKSLEEELED 662


>gi|297736763|emb|CBI25964.3| unnamed protein product [Vitis vinifera]
          Length = 403

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 248/328 (75%), Positives = 282/328 (85%), Gaps = 4/328 (1%)

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
           E GN VS+A +E  G +KG KS + SKN+N    DGARAKQ TLKTVLGEALGYGPALSE
Sbjct: 63  EGGNKVSDAPREKQGNRKGAKSSEPSKNTN----DGARAKQATLKTVLGEALGYGPALSE 118

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
           HIILD GL+PN K+++ +K + + IQ L  +VAKFE+WL+DVI GD VPEGYILMQNK  
Sbjct: 119 HIILDAGLIPNTKVTKDSKFDIDTIQRLAQSVAKFENWLEDVILGDQVPEGYILMQNKIF 178

Query: 317 GKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
           GKD PP++    +QIYDEFCP+LLNQF+SREFVKFETFDAA DEFYSKIE QR+EQQ KA
Sbjct: 179 GKDCPPSQPDRGSQIYDEFCPILLNQFKSREFVKFETFDAASDEFYSKIEGQRSEQQQKA 238

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE  A  KL+KI MDQENRVHTLK+E DR +KMAELIEYNLEDVDAAILAVRVALAN M+
Sbjct: 239 KEVTAMQKLSKICMDQENRVHTLKKEDDRCIKMAELIEYNLEDVDAAILAVRVALANGMN 298

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLA 496
           WEDLARMVKE++K+GNPVAGLIDKLYLERNCM+LLLSNNLDEMDD+EKTL V+KVEVDLA
Sbjct: 299 WEDLARMVKEKKKSGNPVAGLIDKLYLERNCMTLLLSNNLDEMDDDEKTLHVDKVEVDLA 358

Query: 497 LSAHANARRWYELKKKQESKQEKTITAH 524
           LSAHANARRWYE KK+QE+K+EKTI AH
Sbjct: 359 LSAHANARRWYEQKKRQENKREKTIIAH 386


>gi|195038845|ref|XP_001990823.1| GH19576 [Drosophila grimshawi]
 gi|193895019|gb|EDV93885.1| GH19576 [Drosophila grimshawi]
          Length = 983

 Score =  504 bits (1297), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 350/956 (36%), Positives = 517/956 (54%), Gaps = 153/956 (16%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+   V  L+RL+G+R + +YD+  KTY+F+L  S      G SEK  LL+E
Sbjct: 1   MKTRFNSYDIICGVAELQRLVGLRVNQIYDIDNKTYLFRLHGS------GASEKATLLLE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTTA+   K   PSGF++KLRKH++ +RL+ VRQLG DRI+ FQFG G  A++V+
Sbjct: 55  SGTRFHTTAFEWPKNVAPSGFSMKLRKHLKNKRLQHVRQLGADRIVDFQFGTGEAAYHVL 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T+L +LR H + +  V    R +YP                  +
Sbjct: 115 LELYDRGNVILTDYEQTILYILRPHTEGE-SVRFAMREKYP------------------I 155

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             +KE +    + ++ED      A ++ +   KGG+S                     L+
Sbjct: 156 DRAKEGNC---ETMSED------AMRQRIENSKGGES---------------------LR 185

Query: 242 TVLGEALGYGPALSEHIILDTGL---------------------VPNMKLSEVN------ 274
           ++L   L  GPA+ EH++++ G+                       N K ++ N      
Sbjct: 186 SILMPILDCGPAVIEHVLVEHGIENCIVNSAPDADEPAKEEMTKTQNPKKNKRNQKTCKT 245

Query: 275 KLED--NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIY 332
           KL D    +Q L++A+    D ++   SG+    GYI+       K+  P ++ ++   Y
Sbjct: 246 KLFDLVTDLQKLMMAIKDARDIIEIGQSGN--SNGYIIQV-----KEEKPLDTENTEHFY 298

Query: 333 D--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHM 390
              EF P L  Q + + F K+ETF  A+DEF+S  ESQ+ + +   +E  A  KL+ +  
Sbjct: 299 RNVEFHPYLFVQNKDQPFKKYETFMEAVDEFFSTQESQKIDIKTLQQEREALKKLSNVKN 358

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
           D   R+  L +  D   + AELI  N   VD AILA++ A+A+++SW D+  +VKE +  
Sbjct: 359 DHTKRLDELNKLQDIDKRKAELITSNQSLVDKAILAIQSAIASQLSWPDIQELVKEAQTN 418

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
           G+ VA  I +L LE N +SLLL++       E        V+VDLALSA ANARR+Y+ K
Sbjct: 419 GDVVASSIKQLKLEINHISLLLTDPY-----ECNDDDSIIVDVDLALSAWANARRYYDQK 473

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           +    K++KTI A  KA K+AE+KT+  + + +T++NI+  RKV WFEKF WF+SSENYL
Sbjct: 474 RSAALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFYWFVSSENYL 533

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI GRDAQQNE+IVKRYM   D+YVHAD+ GASS +I+N      +PP TL +AG   + 
Sbjct: 534 VIGGRDAQQNELIVKRYMRPKDIYVHADIQGASSVIIRNATGGD-IPPKTLLEAGTMAIS 592

Query: 631 HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           +S AWD+K+VT+++WVY +QVSKTAP+GEYL  GSFMIRGKKNFLP   LIMG  LLF+L
Sbjct: 593 YSVAWDAKVVTNSYWVYSNQVSKTAPSGEYLGTGSFMIRGKKNFLPSCHLIMGLSLLFKL 652

Query: 691 DESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIE-SEKDDTDEKPVAESLSVPNSAH 749
           +E  +  H  ER++R      DD  D     + ++I  +E D+  E   A+++    +A 
Sbjct: 653 EEGFVQRHAGERKIR----NTDDVADEDDKAQQAEITYTELDEISESNEADNVCANANAF 708

Query: 750 PAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSAS 809
           P             E   E  T    + +++    R  + P T ++              
Sbjct: 709 P-----------DTEVKVEHDTGRITVKTELL---REDSKPKTVEI-------------- 740

Query: 810 ISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKE 869
             S ++ I      ++EE+  +      R K   +  +RR+ K     + +        E
Sbjct: 741 --SQENNI------INEEETVIIEAGPSRKKTQTTNKKRREAKVRSDKADI--------E 784

Query: 870 RGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           R + + ++    +  +K++     RGQK KLKKMK KY DQDEEER +RM +L  S
Sbjct: 785 RSQASVTEMLEPINASKVK-----RGQKAKLKKMKSKYRDQDEEERKMRMLILNSS 835


>gi|195504496|ref|XP_002099104.1| GE23561 [Drosophila yakuba]
 gi|194185205|gb|EDW98816.1| GE23561 [Drosophila yakuba]
          Length = 996

 Score =  504 bits (1297), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 312/779 (40%), Positives = 438/779 (56%), Gaps = 112/779 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1   MKTRFNTYDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE ++QLG DRI+  QFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKIQQLGSDRIVDLQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T L +LR H + +  +    R +YP E                 
Sbjct: 115 LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE----------------- 156

Query: 182 TSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             +K+P    EPD + +   N  N                                   L
Sbjct: 157 -RAKQPTKELEPDALVKLLENARNGD--------------------------------YL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN------------------------KL 276
           + +L   L  GPA+ EH++L  GL  ++   E                          KL
Sbjct: 184 RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKEATEETPEADDKPEKGGKKQRKKQQNTKL 243

Query: 277 EDNA------IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
           E         + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G    
Sbjct: 244 EQKPFDMVKDLPILQQAVKDAQELIAEGSSGK--SKGYIIQV-----KEEKPTENGKVEF 296

Query: 331 IYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
            +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ +
Sbjct: 297 FFRNIEFHPYLFTQFKNFETATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNV 356

Query: 389 HMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
             D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE
Sbjct: 357 KNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKE 414

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANA 503
            +  G+ VA  I +L LE N +SL+LS+  D  +D++  L   +   V+VDLALSA ANA
Sbjct: 415 AQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDDDLKAPELTVVDVDLALSAWANA 474

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWF 563
           RR+Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WF
Sbjct: 475 RRYYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWF 534

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
           ISSENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +
Sbjct: 535 ISSENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLE 593

Query: 624 AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           AG   + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG
Sbjct: 594 AGSMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMG 653

Query: 684 FGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVA 739
             LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +D D  P A
Sbjct: 654 LSLLFKLEDSFIERHLGERKVR----SLDDDQIDQNVKETEVEHDLLSDNEDADTNPNA 708


>gi|281200297|gb|EFA74518.1| DUF814 family protein [Polysphondylium pallidum PN500]
          Length = 1134

 Score =  504 bits (1297), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 291/724 (40%), Positives = 426/724 (58%), Gaps = 111/724 (15%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R ++ D+   V  L+R +IG+R +NVYDLSP+ ++FKL        S    K  L+
Sbjct: 1   MPKTRFSSVDIRTTVSNLQRTVIGLRLANVYDLSPRVFLFKL--------SKPELKKQLI 52

Query: 60  MESGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +ESG+R+H+T + RDK + TP+ F++ ++          V+QLG DRII F FG G+   
Sbjct: 53  IESGIRVHSTNFTRDKGDHTPAPFSITVK---------SVKQLGVDRIIDFTFGSGVATQ 103

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHR---DDDKGVAIMSRHRYPTEICRVFERTTAS 175
           ++I+EL++ GNI+LTD ++ V+ +LR+H+   +D+  V  +    YP E           
Sbjct: 104 HLIIELFSIGNIILTDGDYKVIAILRTHQFTENDNIAVGDV----YPVE----------- 148

Query: 176 KLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA 235
                   +K+P    P+ +NE                       L + S K  N     
Sbjct: 149 -------KAKKPTTFTPELINE-----------------------LLEKSEKKDN----- 173

Query: 236 KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDW 294
               LK +  +AL +GP L EH +LD GL PN KL   ++  +   IQ  V         
Sbjct: 174 ----LKQIFNKALDFGPELIEHCLLDAGLSPNQKLESYDRANNEKLIQAFVEG------- 222

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETF 354
            Q + +  +   GYI+++        PP     +T IY+EF P L  Q+ S+   ++++F
Sbjct: 223 -QKIFNVTMQSRGYIVLR--------PPKTPTDTTVIYEEFVPFLYKQYHSKPNQEYDSF 273

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           D A+D+F+S+IE+QR EQQ  A+E     KL+K+  DQ+ R+ +L      +V+ A+LIE
Sbjct: 274 DQAVDQFFSEIEAQRVEQQRIAQEQTVLKKLDKVREDQQRRIDSLFAAEADNVRKAQLIE 333

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLL 472
            NL++VD  I  ++  +   M W  L++++KEE+K  NP  VA +I KL LE N + L L
Sbjct: 334 ANLQEVDQCITIIKSGVNASMDWTALSQLLKEEKKK-NPYSVANIIHKLKLESNQIQLAL 392

Query: 473 SNNLDEMDDEEKTLPVEK--------------VEVDLALSAHANARRWYELKKKQESKQE 518
           ++N D+  DE++    E+              V+V++AL+A+ANAR +Y+ KK    K  
Sbjct: 393 NDNYDDDYDEDEDDDEEEEKKQQKKDKKKPTLVDVNIALTAYANAREYYDSKKHANEKAN 452

Query: 519 KTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           KTI     A KAAEKKTR Q+ + K  + +  MRKV WFEKF+WF+SS+NYLVISG+DAQ
Sbjct: 453 KTIQQAEFAMKAAEKKTRQQLSEVKAKSAMIQMRKVFWFEKFHWFLSSDNYLVISGKDAQ 512

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           QNEM+ K+Y+ K D+YVHAD+ G++S VIKNH     +PP TL QAG  T+C+S AW +K
Sbjct: 513 QNEMLFKKYLEKDDIYVHADIFGSTSCVIKNHGG-GAIPPNTLIQAGTMTMCYSNAWSAK 571

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           +VTSA+WVY +QVSKTAP+GE+LT GSFMIRGKKN+LP   L+MGFG +F+LDES + +H
Sbjct: 572 VVTSAYWVYANQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVMGFGFMFKLDESCIANH 631

Query: 699 LNER 702
           + ER
Sbjct: 632 IGER 635


>gi|157116544|ref|XP_001658543.1| hypothetical protein AaeL_AAEL007639 [Aedes aegypti]
 gi|108876416|gb|EAT40641.1| AAEL007639-PA [Aedes aegypti]
          Length = 995

 Score =  503 bits (1294), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 305/759 (40%), Positives = 426/759 (56%), Gaps = 110/759 (14%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M K R NT DV   V  L++LIGMR + +YD+  KTY+ +L+ +         EKV+LL+
Sbjct: 1   MTKTRFNTYDVVCSVTELQKLIGMRVNQIYDIDNKTYLIRLVRNE--------EKVVLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R HTTA+   K   PSGFT+KLRKH++ +RLE ++QLG DRI+ FQFG G  A++V
Sbjct: 53  ESGNRFHTTAFEWPKNVAPSGFTMKLRKHLKNKRLESMKQLGVDRIVDFQFGTGEAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNILLTD +  +L +LR H + ++ V    R +YPT                 
Sbjct: 113 ILELYDRGNILLTDCDLKILNILRPHVEGEE-VRFAVREKYPT----------------- 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                       D+  ED               KG  + +  K +   ++ G      TL
Sbjct: 155 ------------DRAKED---------------KGPPAMEKVKETIAKAHPGD-----TL 182

Query: 241 KTVLGEALGYGPALSEHIILDTGL----------------VPNM----------KLSEVN 274
           +T L   L YG ++ +H++   GL                VP            + S+V 
Sbjct: 183 RTALNPILEYGASVIDHVLHKYGLYGCRIGGELPAEAMAEVPKKAKKKQKAIAKEFSKVF 242

Query: 275 KLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD- 333
            +E++ +  L+ A+   E  L+  +     P    ++Q K L     P +     + Y  
Sbjct: 243 NIEED-MTALMCAINDAETMLRKAMKE---PSRGFIIQKKELK----PAKDKEQEEFYFT 294

Query: 334 --EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
             E+ P L NQ++     +F++F AA+DEFYS +E Q+ + +  A+E  A  KL+ +  D
Sbjct: 295 NLEYHPFLYNQYKEDPVKEFDSFTAAVDEFYSTLEGQKIDLKAFAQEREALKKLSNVRTD 354

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
              R+  L +      K AELI  N   VD+AILAV+ ALA++MSW D+  +VK  +   
Sbjct: 355 HAKRLEDLTKAQLEDRKKAELITRNQNLVDSAILAVQSALASQMSWSDIQDLVKAAQANN 414

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-------------LPVEKVEVDLALS 498
           +PVA  I +L LE N +SL+L +    +D++ +              L    V+VDLA++
Sbjct: 415 DPVASCIKQLKLEINHISLMLKDPYGALDEDFEDDDDEEEREDGEGKLEPMVVDVDLAMT 474

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
           A ANARR+Y+ ++    K++KTI + SKA K AEKKT   +   +T   IS  RKV+WFE
Sbjct: 475 AFANARRYYDQRRFAARKEQKTIESSSKALKNAEKKTMQTLKDVRTQTTISKARKVYWFE 534

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KF WFISSENYLVI GRD QQNE+IVKRYM   D+YVHA++ GASS +IKN   E  +PP
Sbjct: 535 KFYWFISSENYLVIGGRDQQQNELIVKRYMRPSDIYVHAEIQGASSVIIKNPSGED-IPP 593

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL +AG   + +S AWD+K+VTSA+WV   QVSKTAPTGEYLT GSFMIRGKKNFLPP 
Sbjct: 594 KTLLEAGTMAISYSVAWDAKVVTSAYWVKSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPC 653

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRG-EEEGMDDFED 716
            L++G   +F+L+ESS+  H  ER+VR  +EE +   ED
Sbjct: 654 HLVLGLSFMFKLEESSIERHKGERKVRTFDEESIMSKED 692



 Score = 47.8 bits (112), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 20/33 (60%), Positives = 27/33 (81%)

Query: 890 GKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           G++ RGQK K++K+KEKY DQDEEER + M +L
Sbjct: 814 GQLKRGQKAKMRKIKEKYKDQDEEERKLMMEIL 846


>gi|395838618|ref|XP_003792209.1| PREDICTED: nuclear export mediator factor NEMF [Otolemur garnettii]
          Length = 1056

 Score =  503 bits (1294), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 288/744 (38%), Positives = 414/744 (55%), Gaps = 117/744 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPVDHART------------ 160

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A  P  +      ++NA K  L                             L
Sbjct: 161 --------AEPPLTLERLTEIIANAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   ++K+ E  KLE   I+ +++ + K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFASSVKVDE--KLESKDIEKVLVCLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFNGKGYII-QKREIKPSLEADKPAEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVATAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 ------NLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYE 508
                   D   ++ +T P +                     V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVVDDVSVEKNETEPSKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSEN
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSEN 538

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG   
Sbjct: 539 YLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTMA 576

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF
Sbjct: 577 LCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLF 636

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
           ++DES +  H  ER+VR ++E ++
Sbjct: 637 KVDESCVWRHRGERKVRVQDEDVE 660


>gi|328864957|gb|EGG13343.1| DUF814 family protein [Dictyostelium fasciculatum]
          Length = 1244

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 296/773 (38%), Positives = 449/773 (58%), Gaps = 134/773 (17%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN- 77
           + +IG+R +N+YDLSP+ ++FKL        S    K  L++ESG+R+H+T + RDK + 
Sbjct: 69  KNVIGLRLANIYDLSPRVFLFKL--------SRPDFKKTLIIESGIRIHSTNFIRDKGDH 120

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEF 137
           TP+ F++ LRK+++T+RLE VRQLG DRI+ F FG G+   +VI+EL++ GNI+LTD ++
Sbjct: 121 TPAPFSITLRKYLKTKRLESVRQLGVDRIVDFTFGSGVATQHVIVELFSIGNIILTDGDY 180

Query: 138 TVLTLLRSHR-DDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVN 196
            VL +LR+H+  ++  +A+     YP +  R                        P  V 
Sbjct: 181 KVLAILRTHQYTENDNIAVGDV--YPVDKAR------------------------PPSV- 213

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
                 + A  +N+  Q                   A  K+ TLK V  ++L +GP L E
Sbjct: 214 -----FTEALVDNIIQQ-------------------AADKKDTLKQVFNKSLDFGPELIE 249

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ---- 312
           H IL  GL P++K+   N  E++A ++    +  F++  Q +    +  +G+I+++    
Sbjct: 250 HCILMAGLSPSLKIESYNH-EEHASKL----IEAFKEG-QKIFDVAVQSKGFIVLKPPKV 303

Query: 313 --------------NKHLGKDHPPTESGSSTQ----------IYDEFCPLLLNQFRSREF 348
                          + L KD     +GS  +          +Y+EF P L  Q++ +++
Sbjct: 304 ESKQQQQQKKKAAEQQQLKKD---AIAGSGEEAATEEKKELVVYEEFVPYLYKQYQDKKY 360

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           +++++FD A+D+F+S+IESQ+ EQQ  ++E     KL+K+  DQ+ R+ +L      ++K
Sbjct: 361 LEYDSFDLAVDQFFSEIESQKVEQQRMSQEQTVLKKLDKVREDQQRRIDSLYASEGENIK 420

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKLYLERN 466
            A+LIE NL+DVD  IL +R  +A  M W +L +++KEE+K  NP  VA  I KL L+ N
Sbjct: 421 KAQLIESNLQDVDQCILIIRSGVAASMDWGNLNQLLKEEKKK-NPYSVANKIHKLKLDTN 479

Query: 467 CMSLLLSN------------------------NLDEMDDEEKTLPVEKVEVDLALSAHAN 502
            ++L L++                           +   +    PV  ++VD++LSA+AN
Sbjct: 480 QITLSLTDLHLDDDEDEEDEDENSDDDSEDEEKKKKNQKKNAKKPVF-IDVDISLSAYAN 538

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNW 562
           AR +Y+ KK+   K EKTI     A KAAEKK R Q+ + KT +++  MRKV WFEKF+W
Sbjct: 539 ARNFYDSKKQSHEKAEKTIQQADFALKAAEKKARQQLSEVKTKSSMQQMRKVFWFEKFHW 598

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+NY+VISG+DAQQNE++ K+Y+ K DVYVHAD+ G++S VIKN +  + +PP TL 
Sbjct: 599 FISSDNYIVISGKDAQQNELLFKKYLDKDDVYVHADIFGSTSCVIKNPKGGE-IPPNTLI 657

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG  T+C+S AW +K+VTSA+WVY HQVSKTAP+GE+LT GSFMIRGKKN+LP   L+M
Sbjct: 658 QAGTMTMCYSNAWSAKVVTSAYWVYSHQVSKTAPSGEFLTTGSFMIRGKKNYLPHSQLVM 717

Query: 683 GFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS--GHHKENSDIESEKDDT 733
           GFG +F++D+S + +HL ER       G     DS  G H E+  +E   DD+
Sbjct: 718 GFGFMFKIDDSCIANHLGER-----SSGSSLLRDSMDGDHDEDMRMEELPDDS 765



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 30/42 (71%), Gaps = 1/42 (2%)

Query: 883 RKTKIEGGK-ISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
           +K+ +E  K ++  QK KLKK++E+YG QDEEER + M LL 
Sbjct: 919 KKSTVEPQKHMTMSQKNKLKKIQERYGHQDEEERKLAMELLG 960


>gi|426233098|ref|XP_004010554.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Ovis
           aries]
          Length = 1055

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 295/747 (39%), Positives = 418/747 (55%), Gaps = 123/747 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDIEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
            + +E DD +  +  EK                            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDISTEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAG 573

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 574 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSHLMMGFS 633

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 634 FLFKVDESCVWRHRGERKVRVQDEDME 660


>gi|357620683|gb|EHJ72794.1| hypothetical protein KGM_20428 [Danaus plexippus]
          Length = 1001

 Score =  500 bits (1288), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 294/728 (40%), Positives = 426/728 (58%), Gaps = 84/728 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L+RL+GMR + VYD+  KTY+ +L  S         EK +LL+E
Sbjct: 1   MKTRFNTYDIVCMVSELQRLVGMRVNQVYDIDNKTYVIRLQRSE--------EKAVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGFT+KLRKH++ +RLE + QLG DRI+  QFG G  A++VI
Sbjct: 53  SGNRFHTTQFEWPKNVAPSGFTMKLRKHLKNKRLEKLSQLGIDRIVELQFGSGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E+T+L +LR H + DK V    + +YP              L  A 
Sbjct: 113 LELYDRGNIVLTDCEWTILNVLRPHVEGDK-VRFAVKEKYP--------------LDRAK 157

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           T    P+                A KE LG  K G +                     LK
Sbjct: 158 TDYAAPN--------------EGALKEILGKSKPGDN---------------------LK 182

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSE-VNK--LEDNAIQVLVLAVAKFEDWLQDV 298
            +L   L YG ++ +H++L  GL  N+K+S+  NK    +  +  L  A+ + E  +++ 
Sbjct: 183 KILNPNLEYGASIIDHVLLQNGLSGNLKISQDPNKGFYVERDLGTLANALRQAETMIEN- 241

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD-EFCPLLLNQFRSREFVKFETFDAA 357
              + + +GYI+ +     +D P  + G    + + EF PLL  Q + + +V++ETFD A
Sbjct: 242 -GKNQMAKGYIIQKR----EDRPNQDGGPDFFLTNQEFHPLLYLQNKDQVYVEYETFDRA 296

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYS +E Q+ + +    E  A  KL  I  D E R+  L++      + AE+I  N 
Sbjct: 297 VDEFYSALEGQKIDLKTIQVEREAMKKLQNIRTDHEKRLSNLEKVQLEDRRAAEMIARNE 356

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
             V+ A LA++ A+AN+MSW+D+  +VK  +   +PVA  I +L L  N ++LLL +   
Sbjct: 357 PLVEQARLAIQTAIANQMSWDDIKLLVKAAQDNKDPVASAIKQLKLNTNHITLLLKDPYD 416

Query: 475 ----------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                     + D   D+E+  P+  V++DL+L+A ANARR+Y+ K+    KQ+KT+ + 
Sbjct: 417 DDDDDDDDDDDNDGGGDKERLEPM-MVDIDLSLTAFANARRYYDQKRSAAKKQQKTLESA 475

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            KA K+AEKKT+  + + + +++IS  R+ +WFEKF WFISS+NYLVI+GRD QQNE++V
Sbjct: 476 DKALKSAEKKTKQTLKEAQAISSISKARRNYWFEKFYWFISSDNYLVIAGRDQQQNELLV 535

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           KRYM   DVYVHAD+ GASS VIK   P  P PP TL++AG   V +S AW++K++T AW
Sbjct: 536 KRYMRSTDVYVHADVSGASSVVIKC--PSGPPPPRTLSEAGQAAVAYSVAWEAKVLTRAW 593

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           WV+ HQVSK+APTGEYL+ GSFMIRGKKN+L P  L  GF  +FRL++SS+  H ++R+ 
Sbjct: 594 WVHGHQVSKSAPTGEYLSTGSFMIRGKKNYLLPEHLQFGFSFMFRLEDSSIDRHRDDRKA 653

Query: 705 RGEEEGMD 712
              ++  D
Sbjct: 654 VQADDASD 661


>gi|194908933|ref|XP_001981863.1| GG11364 [Drosophila erecta]
 gi|190656501|gb|EDV53733.1| GG11364 [Drosophila erecta]
          Length = 994

 Score =  499 bits (1286), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 314/785 (40%), Positives = 444/785 (56%), Gaps = 111/785 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1   MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE ++QLG DRI+ FQFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLERIQQLGSDRIVDFQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T L +LR H + +  +    R +YP E                 
Sbjct: 115 LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE----------------- 156

Query: 182 TSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             +K+P    EP+ + +   N  N                                   L
Sbjct: 157 -RAKQPTKELEPEALVKLLENARNGD--------------------------------YL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN------------------------KL 276
           + +L   L  GPA+ EH++L  GL  ++   E                          KL
Sbjct: 184 RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKEATEETPEADDKPEKGGKKQRKKQQNTKL 243

Query: 277 EDNA------IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
           E         + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G    
Sbjct: 244 EQKPFDMIKDLPILQQAVKDAQELITEGSSGK--SKGYIIQV-----KEEKPTENGKVEF 296

Query: 331 IYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
            +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ +
Sbjct: 297 FFKNIEFHPYLFIQFKNFEKATFESFMDAVDEFYSTQESQKIDIKTLQQEREALKKLSNV 356

Query: 389 HMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
             D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE
Sbjct: 357 KNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKE 414

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARR 505
            +  G+ VA  I +L LE N +SL+LS+  D  +D++   P +  V+VDLALSA ANARR
Sbjct: 415 AQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKAPELTVVDVDLALSAWANARR 474

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFIS
Sbjct: 475 YYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFIS 534

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG
Sbjct: 535 SENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAG 593

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  
Sbjct: 594 SMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLS 653

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESL 742
           LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +D D   +  +L
Sbjct: 654 LLFKLEDSFIERHLGERKVR----NLDDDQIDPNVKETEVEHDLLSDNEDADAN-LNGNL 708

Query: 743 SVPNS 747
           S P+S
Sbjct: 709 SEPSS 713


>gi|326921280|ref|XP_003206889.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Meleagris gallopavo]
          Length = 1080

 Score =  496 bits (1277), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 346/964 (35%), Positives = 507/964 (52%), Gaps = 152/964 (15%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+GMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS
Sbjct: 32  LLGMRVNNVYDVDNKTYLIRLQKPDC--------KATLLLESGIRIHTTEFEWPKNMMPS 83

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           GF +K RKH++TRRL  VRQLG DRI+ FQFG    A+++I+ELY               
Sbjct: 84  GFAMKCRKHLKTRRLVSVRQLGIDRIVDFQFGSNEAAYHLIIELY--------------- 128

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
                    D+G  +++ H Y   I  +    T                       ++ +
Sbjct: 129 ---------DRGNIVLTDHEY--LILNILRFRT-----------------------DEAD 154

Query: 201 NVSNASKEN--LGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHI 258
           +V  A +E   +   K        +   +  +D  + +Q  LK VL   L YG  L EH 
Sbjct: 155 DVRFAVRERYPVDSAKAPTPLPSLERLTEIISDAPKGEQ--LKRVLNPHLPYGATLIEHC 212

Query: 259 ILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGK 318
           +++ G    +K+ +  + ++N I+ ++ A+ K E+++   ++ D   +GYI+ Q K    
Sbjct: 213 LIEAGFSGYVKIDQHMESKEN-IEKVLSALEKAEEYM--TLTEDFNGKGYII-QKKEKKP 268

Query: 319 DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
              P +       Y+EF P L +Q     +++F++F+ A DEFYSK+E Q+ + +   +E
Sbjct: 269 SLEPDKPAEDIYTYEEFHPFLFSQHSKCPYLEFDSFNKAADEFYSKLEGQKIDLKALQQE 328

Query: 379 DAAFHKLNKIHMDQENRVHTLKQ--EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
             A  KL  +  D E R+  L+Q  EVD+ +K  ELIE NLE V+ AI  VR ALAN++ 
Sbjct: 329 KQALKKLENVRRDHEQRLEALQQAQEVDK-IK-GELIEMNLEIVNRAIQVVRSALANQID 386

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN---------------------- 474
           W ++  +VKE +  G+PVA  I +L L+ N +++LL N                      
Sbjct: 387 WTEIGAIVKEAQAQGDPVANAIKELKLQTNHITMLLRNPYVLSEEEEEGEDADLEKEETE 446

Query: 475 ---------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
                       ++   +K  P   V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  
Sbjct: 447 EPKGKKKKNKNKQLKKPQKNKP-SLVDVDLSLSAYANAKKYYDHKRHAAKKTQKTVEAAE 505

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSENYLVI+GRD QQNE+IVK
Sbjct: 506 KAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSENYLVIAGRDQQQNELIVK 565

Query: 586 RYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWW 645
           RY+  GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S AWD+++VTSAWW
Sbjct: 566 RYLKPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYSAAWDARVVTSAWW 624

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           V  +QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+++
Sbjct: 625 VSHNQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHREERKIK 684

Query: 706 GEEEGMDDFEDSGHHKENSDIE--SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSH 763
            ++E ++    S     + ++E     D + E+  AE        H AP    A      
Sbjct: 685 VQDEDLETVSSSASELVSEEVELLEGGDSSSEEDKAE-------CHEAPEDVEA------ 731

Query: 764 EFPAEDKTISNGIDSKIFDIARN-VAAPVTPQLEDLIDRALGLG-------SASISSTKH 815
                  T  N  D  + D+ ++ V+ P  P  E + +   G          + +   + 
Sbjct: 732 -------TAENNGDENVADLDQDRVSTPPVP--EGVSEEDDGESEVEHPEPQSEVKEEEV 782

Query: 816 GIETTQFDLS--EEDKHVERTATVRDKPYISKAE---RRKL-----------KKGQGSSV 859
               T  DLS  +  + +++T    ++P +S ++   RR L           K+   S  
Sbjct: 783 NYPDTTIDLSHLQSQRSLQKTVPKEEEPNLSDSKSQGRRHLSAKERREMKKKKQQNDSEN 842

Query: 860 VDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRM 919
           +DP  ER+K+   +    P     K       I RGQK K+KKMKEKY DQDEE+R + M
Sbjct: 843 LDPPEERQKD--TETQRPPPPNTTKGVPAPQPIKRGQKSKMKKMKEKYKDQDEEDRELIM 900

Query: 920 ALLA 923
            LL 
Sbjct: 901 KLLG 904


>gi|347968346|ref|XP_312244.5| AGAP002680-PA [Anopheles gambiae str. PEST]
 gi|333468048|gb|EAA08148.6| AGAP002680-PA [Anopheles gambiae str. PEST]
          Length = 1053

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 298/750 (39%), Positives = 430/750 (57%), Gaps = 114/750 (15%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M K R NT DV   V  L++LIGMR + +YD+  KTY+ +L  +         EKV+LL+
Sbjct: 1   MTKTRFNTYDVVCSVTELQKLIGMRVNQIYDIDNKTYLIRLARNE--------EKVVLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R HTT++   K   PSGFT+K+RKH++ +RLE ++QLG DRI+ FQFG G  A+++
Sbjct: 53  ESGLRFHTTSFEWPKNVAPSGFTMKMRKHLKNKRLESLQQLGVDRIVDFQFGTGEAAYHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNILLTD E  +L +LR H + ++ +    R +YP                  
Sbjct: 113 ILELYDRGNILLTDCELRILNILRPHVEGEE-LRFAVREKYPK----------------- 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                       D+  +D                G  S +  K + + +  G      TL
Sbjct: 155 ------------DRAKQDN---------------GPPSMEQIKEAIQKAQPGD-----TL 182

Query: 241 KTVLGEALGYGPALSEHIILDTGL--------VPN----------------MKLSEVNKL 276
           +T L   L YG ++ +H++   GL        +PN                 + ++V  +
Sbjct: 183 RTALNPILEYGASVIDHVLHRQGLFGCRIGGELPNDPALPKKVKKKQKNIAKEFAKVFDM 242

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--- 333
           E + +  L+ A+ + E  L++  +      GYI+ +     K+  PT+ G   + Y    
Sbjct: 243 ETD-LGPLMSAINEAETMLRE--AQKRPSPGYIIQK-----KEVKPTKQGDEEEYYFTNL 294

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           E+ P + NQ++   F  F++F  A+DEFYS +ESQ+ + +  A+E  A  KL+ +  D  
Sbjct: 295 EYQPYMYNQYQGEPFKAFDSFTTAVDEFYSSLESQKIDLKAFAQEREALKKLSNVKTDHA 354

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
            R+  L +      K AELI  N + VD A+LAV+ ALA +MSW D+  +VK  +   +P
Sbjct: 355 KRIEELTKAQLEDRKRAELITRNQDLVDKALLAVQSALAAQMSWTDIQDLVKAAQANKDP 414

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEE-----------------KTLPVEKVEVDLA 496
           VA  I +L LE N +SL L++    +D++                  K +P+  V+VDLA
Sbjct: 415 VASCIRQLKLEINHISLHLTDPYASLDEQASDEEEEEEDSEREDDEAKLVPM-VVDVDLA 473

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE-KTVANISHMRKVH 555
           LSA ANARR+Y+ ++    K++KTI + SKA K AE+KT +Q L++ +T   IS +RKV+
Sbjct: 474 LSAFANARRYYDQRRFAARKEQKTIESSSKALKNAERKT-IQTLKDVRTQTTISKVRKVY 532

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WFEKF WF+SSENYLVI GRD QQNE+IVKRYM   D+YVHA++ GASS +IKN    + 
Sbjct: 533 WFEKFYWFVSSENYLVIGGRDQQQNELIVKRYMRPTDIYVHAEIQGASSVIIKNPAGGE- 591

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL +AG   + +S AWD+K+VTSA+WV+  QVSKTAPTGEYLT GSFMIRG+KNFL
Sbjct: 592 IPPKTLLEAGTMAISYSVAWDAKVVTSAYWVHSEQVSKTAPTGEYLTTGSFMIRGRKNFL 651

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           PP  L++G   LF+L++SS+  H  ERRVR
Sbjct: 652 PPCHLVLGLSFLFKLEDSSVERHRGERRVR 681


>gi|119586147|gb|EAW65743.1| serologically defined colon cancer antigen 1, isoform CRA_c [Homo
           sapiens]
          Length = 1001

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 297/746 (39%), Positives = 413/746 (55%), Gaps = 109/746 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I        I L D    VLT        D    I++  R+ T+                
Sbjct: 113 I--------IELYDRGNIVLT--------DYEYVILNILRFRTD---------------- 140

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                            + ++V  A +E         +  L           +  K   L
Sbjct: 141 -----------------EADDVKFAVRERYPLDHARAAEPLLTLERLTEIVASAPKGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV-- 298
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++    
Sbjct: 184 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMKTTSN 241

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG + P    +      G              Y+EF P L +Q     +++FE+FD A+
Sbjct: 242 FSGKVAPCILTIYCCDLFG--------------YEEFHPFLFSQHSQCPYIEFESFDKAV 287

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+
Sbjct: 288 DEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQ 347

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN---- 474
            VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N    
Sbjct: 348 IVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLL 407

Query: 475 ----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRW 506
                           N  E    +K     K            V+VDL+LSA+ANA+++
Sbjct: 408 SEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKY 467

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISS
Sbjct: 468 YDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISS 527

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG 
Sbjct: 528 ENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGT 586

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  
Sbjct: 587 MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 646

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMD 712
           LF++DES +  H  ER+VR ++E M+
Sbjct: 647 LFKVDESCVWRHQGERKVRVQDEDME 672


>gi|26333303|dbj|BAC30369.1| unnamed protein product [Mus musculus]
          Length = 641

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 287/703 (40%), Positives = 400/703 (56%), Gaps = 100/703 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-- 475
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRNPYL 415

Query: 476 LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWY 507
           L E +D +    +E                             V+VDL+LSA+ANA+++Y
Sbjct: 416 LSEEEDGDGDASIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 475

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 476 DHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 535

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 536 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 594

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRG
Sbjct: 595 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 637


>gi|170055538|ref|XP_001863626.1| serologically defined colon cancer antigen 1 [Culex
           quinquefasciatus]
 gi|167875449|gb|EDS38832.1| serologically defined colon cancer antigen 1 [Culex
           quinquefasciatus]
          Length = 995

 Score =  494 bits (1271), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 301/747 (40%), Positives = 423/747 (56%), Gaps = 111/747 (14%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M K R NT DV   V  L+RL+GMR + +YD+  KTY+ +L+ +         EKV+LL+
Sbjct: 1   MTKTRFNTYDVVCSVTELQRLVGMRVNQIYDIDNKTYLIRLVRNE--------EKVVLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R HTTA+   K   PSGFT+K+RKH++ +RLE +RQLG DRI+ FQFG G  A+++
Sbjct: 53  ESGNRFHTTAFEWPKNVAPSGFTMKMRKHLKNKRLESLRQLGVDRIVDFQFGSGEAAYHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELY +GNILLTD E  +L +LR H + ++ +    R +YP                  
Sbjct: 113 ILELYDRGNILLTDCELKILNILRPHVEGEE-LRFAVREKYPE----------------- 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                       D+  +D               +G    +  + +   +N G      TL
Sbjct: 155 ------------DRAKQD---------------RGPPPMEKVRETIAKANPGD-----TL 182

Query: 241 KTVLGEALGYGPALSEHIILDTGL-----------VP--------------NMKLSEVNK 275
           +T L   L YG ++ +H +   GL           VP                + ++V  
Sbjct: 183 RTALNPILEYGASVIDHALTKYGLFGCRIGGKLNPVPPEVSKKVKKKQKAIAKEFAKVFN 242

Query: 276 LEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD-- 333
            E++ +  L+ A+   E  L+    G   P    ++Q K L     P + G   + Y   
Sbjct: 243 PEED-MTALMCAINDAETMLR---QGMREPSKGFIIQKKELR----PAKEGEPEEYYLTN 294

Query: 334 -EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ 392
            E+ P L NQ++   + +F +F AA+DEFYS +E Q+ + +  A+E  A  KL+ +  D 
Sbjct: 295 LEYQPYLYNQYKDEPYQEFASFTAAVDEFYSTLEGQKIDLKSFAQEREALKKLSNVRTDH 354

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
             R+  L +      K AELI  N   VD+A+LAV+ ALA++M+W D+  +VK  +   +
Sbjct: 355 AKRLDDLIKAQLEDRKKAELITRNQNLVDSALLAVQSALASQMAWSDIQDLVKAAQANND 414

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDE-------------EKTLPVEKVEVDLALSA 499
           P+A  I +L LE N +SLLL +    +D+E             +K  P+  V+VDLALSA
Sbjct: 415 PIASCIRQLKLEINHISLLLKDPYAVLDEEEEEEEDSDREDDEQKLEPM-VVDVDLALSA 473

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE-KTVANISHMRKVHWFE 558
            ANAR++Y+ ++    K++KTI + SKA K AEKKT LQ L++ +T   IS  RKV+WFE
Sbjct: 474 FANARKYYDQRRFAARKEQKTIESSSKALKNAEKKT-LQTLKDVRTQTTISKARKVYWFE 532

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KF WFISSENYLVI GRD QQNE++VKRYM   D+YVHA++ GASS VIKN    + +PP
Sbjct: 533 KFYWFISSENYLVIGGRDQQQNELLVKRYMRPADIYVHAEIQGASSVVIKNPSGAE-IPP 591

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL +AG   + +S AWD+K+VTSA+WV   QVSKTAPTGEYLT GSFMIRGKKNFLPP 
Sbjct: 592 KTLLEAGTMAISYSVAWDAKVVTSAYWVRSEQVSKTAPTGEYLTTGSFMIRGKKNFLPPC 651

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVR 705
            L++G   +F+L+ESS+  H  ER+VR
Sbjct: 652 HLVLGLSFMFKLEESSVERHKGERKVR 678



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/33 (66%), Positives = 27/33 (81%)

Query: 890 GKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           G + RGQK KL+K+KEKYGDQDEEER + M +L
Sbjct: 813 GPLKRGQKAKLRKIKEKYGDQDEEERKLMMDIL 845


>gi|284005983|gb|ADB57053.1| MIP15468p [Drosophila melanogaster]
          Length = 939

 Score =  490 bits (1261), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 359/965 (37%), Positives = 513/965 (53%), Gaps = 161/965 (16%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1   MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E   LT L   R   +G  +    R    + R  + T   +L A +
Sbjct: 115 LELYDRGNVILTDYE---LTTLYILRPHTEGENLRFAMREKYPVERAKQPTKELELEALV 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                                              K  + ++N +             L+
Sbjct: 172 -----------------------------------KLLENARNGD------------YLR 184

Query: 242 TVLGEALGYGPALSEHIILDTGL------------------------------VPNMKLS 271
            +L   L  GPA+ EH++L  GL                                N KL 
Sbjct: 185 QILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKLE 244

Query: 272 EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
           +      N + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G+    
Sbjct: 245 QKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQV-----KEEKPTENGTVEFF 297

Query: 332 YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ + 
Sbjct: 298 FRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVK 357

Query: 390 MDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
            D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358 NDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 415

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
           +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR+
Sbjct: 416 QANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKDPEVTVVDVDLALSAWANARRY 475

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFISS
Sbjct: 476 YDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISS 535

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG 
Sbjct: 536 ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAGS 594

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  L
Sbjct: 595 MAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSL 654

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESLS 743
           LF+L++S +  HL ER+VR     ++D +   + KEN    D+ S+ +D D        S
Sbjct: 655 LFKLEDSFIERHLGERKVR----SLEDDQIDPNVKENEVEHDLLSDNEDAD--------S 702

Query: 744 VPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLIDR 801
             N + P      +SN +   FP  +  I +       D  R +  +  V P++E+  + 
Sbjct: 703 NINLSEP------SSNTEITAFPNTEVKIEH-------DTGRIIVRSDSVNPEIEETKES 749

Query: 802 ALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
            + L                      DK +++T        ++   R+K        V  
Sbjct: 750 EVVL----------------------DKILKKTDDEETTIILAGPSRKK-------QVSA 780

Query: 862 PKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMA 920
            K + +K R K +A+ Q    V        ++ RGQKGKLKKMK+KY DQD+EER IRM 
Sbjct: 781 KKTKEDKARAKQEAAKQEVPPVSSEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRMM 840

Query: 921 LLAVS 925
           +L  S
Sbjct: 841 ILKSS 845


>gi|324502310|gb|ADY41017.1| Serologically defined colon cancer antigen 1 [Ascaris suum]
          Length = 958

 Score =  489 bits (1260), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 285/707 (40%), Positives = 395/707 (55%), Gaps = 80/707 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T DV A V  LR L G+R +NVYD+  KTY+ ++            EK  ++ME
Sbjct: 1   MKSRFSTLDVFAVVHDLRALEGLRVTNVYDVDSKTYLIRMHIPD--------EKCFIMME 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+RLH T++   K   PS F++KLRKHI+ +RL  V QLG DR++  QFG    A +VI
Sbjct: 53  SGMRLHKTSFEWPKAQFPSSFSMKLRKHIKQKRLTKVEQLGVDRVVDLQFGTDDRASHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GNILLTD ++ +L +LR   D +  V    R  YP E  R            A+
Sbjct: 113 VELYDRGNILLTDHQYVILNVLRPRTDKNTDVRFSVRETYPIENAR----------QEAM 162

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
             SK                      E L   K G+S                     ++
Sbjct: 163 VPSKA------------------RLIEMLATTKKGES---------------------VR 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLV----LAVAKFEDWLQD 297
             L     YGPAL EH +   G+  N ++       +  IQ L+    +A   F++  Q+
Sbjct: 184 RALAPLTQYGPALIEHSLRLAGICSNAQIGVNISNSEEDIQKLLNAMDIAQIVFDELRQN 243

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
              G I+   Y L               G S + Y EF P    QF S    +FE F   
Sbjct: 244 RSHGFII---YKL----------DTRADGHSFESYQEFHPYRFKQFESENLREFENFSEC 290

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DE++SKIESQRA+Q+    E  A  KL  +  DQ+ R+ +L+       +MAELIE N 
Sbjct: 291 VDEYFSKIESQRADQRALNAEREALKKLENVKRDQQERIESLELAQVEKRQMAELIELNS 350

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           + VD A+L +R A+AN++SWE +  M  +  +AG+P+A  I  L L  N M+L L +   
Sbjct: 351 DLVDKALLIIRSAIANQLSWEMIEEMRIKASEAGDPIASSIVGLNLNSNEMTLSLRDPYH 410

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           + D   K +P+     D+ALSA+ N+R+++  KK    K++KTI++ +KA K+A+ K + 
Sbjct: 411 D-DSSPKKVPI-----DIALSAYQNSRKFHSEKKAAVDKKQKTISSSAKALKSAQLKAKE 464

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            +   +  A++   R+  WFEKF WF+SSENYLVI GRDAQQNE++VKRY+  GD+YVHA
Sbjct: 465 TLATVRAKADVVKSRRQMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRTGDIYVHA 524

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
           D+ GASS VI+N      +PP TLN+AG   VC+S +W++K++ +AWWVY HQVS+TAPT
Sbjct: 525 DVRGASSVVIRNKVNGGEIPPKTLNEAGSMAVCYSSSWEAKVIAAAWWVYHHQVSRTAPT 584

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           GEYLT GSFMIRGKKNFLP   L MGFGL+F+LDE S+  H  ERRV
Sbjct: 585 GEYLTPGSFMIRGKKNFLPSCQLQMGFGLMFKLDEDSVERHRGERRV 631


>gi|167516076|ref|XP_001742379.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779003|gb|EDQ92617.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1051

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 335/970 (34%), Positives = 501/970 (51%), Gaps = 136/970 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+  ++  L+ RL GMR +N+YD+  KTY+ +L  +         EK +LL+
Sbjct: 1   MKNRFSTLDLQVQLAELKPRLTGMRVANIYDIDNKTYLIRLQQTP--------EKAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R HTT Y   K + PSGFT+K RKH+RTRRL D++QLG DR+I   FG    A+++
Sbjct: 53  ESGIRFHTTEYDWPKGDAPSGFTMKCRKHLRTRRLTDMKQLGVDRVIDLTFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LT+S + +L LLR  R D + V      RYP E  +     T  +L AA
Sbjct: 113 IIELYDRGNIILTESTYNILALLR-RRTDSEDVKFAVGERYPIEASKQPSPITRERLEAA 171

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             SSK+ D                                                 P  
Sbjct: 172 FASSKKGD-------------------------------------------------PAR 182

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWL-QDVI 299
           K  L   +  GP   EH +   G   N K+ +   + D+  +VL  A+ + ED L + + 
Sbjct: 183 KA-LNPIMECGPQAIEHCMQLHGFPNNAKVGKGLAIPDDLDRVLA-AMKQAEDLLFEKLK 240

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESG--SSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
           +GDI     ++   ++L  D      G  +   + D+  P ++ QF  R  +   +FD A
Sbjct: 241 AGDISVSATVV---QYLPIDTIRLAEGDEAPVLVLDDVIPFMMKQFEDRPHIHLPSFDRA 297

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +D ++S++E+Q+ + +   +E AA  KL  +    E  V   +   + + + A+++E NL
Sbjct: 298 IDRYFSELETQKLQMRAMQQEAAALKKLEAVKASHEKHVEGYRLAQEANERKAQVLEANL 357

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           E VD AI  +R  +AN++ W ++A +VKE ++ G+P A +ID L L++N M++ L N   
Sbjct: 358 EQVDRAIEIIRSMVANKLDWVEIAELVKEAQQQGDPDARIIDGLKLDKNHMTIRLPNPEA 417

Query: 475 -----------------------------NLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
                                           +      T P   +++DLAL+A+ANA  
Sbjct: 418 HAESSESDSSSASDSEEEEEEEEQKAIAAASKKRGTSSATDPFLTIDLDLALTAYANACN 477

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
            Y+ KK    K++K   A   A ++AE+KT+ Q+ Q      ++  RK++WFEKF WFIS
Sbjct: 478 MYQHKKISAVKEQKARDATELAIQSAERKTQQQLQQNNVTTAVNKQRKIYWFEKFLWFIS 537

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYLVI GRD QQNE++V+RY+ KGDVYVHADLHGA+S ++KN R    VPP+TL +AG
Sbjct: 538 SENYLVIGGRDRQQNEILVRRYLKKGDVYVHADLHGAASVIVKNPRGGD-VPPITLQEAG 596

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              V +S +W+++M TSAWWV+  QVSKTAP GEYL+ GSFMIRGKKN+LP   L+MGF 
Sbjct: 597 HMAVIYSGSWEARMPTSAWWVHHDQVSKTAPAGEYLSTGSFMIRGKKNYLPKVELVMGFA 656

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVP 745
           +LF++DE S+  H+NERR RG  E              S+  S       +PV  S S  
Sbjct: 657 ILFKVDEGSVARHVNERRPRGLGEA-------------SEASSPAVSRPPEPVEASSSGA 703

Query: 746 NSAHPAPSHTNASNVDSHE-------FPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
             A P  + + A +  + +        PA    ++  + ++        AA   P  E  
Sbjct: 704 GDASPVAAESEAGDSTATQNKNKAESQPAGTAVVAPEVPAESSSAMSTAAAMAFPDTEIS 763

Query: 799 IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKK----- 853
           +D A    SAS+S T    +      SE D  V R+     K  +S  ++R+LKK     
Sbjct: 764 VDYASATPSASVSRTVSHAQ------SEADTAV-RSRMQGSKARLSAKQKRQLKKKGYTP 816

Query: 854 GQGSSVVDPKV-EREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDE 912
            Q SS+   ++ E   E G+D  S+ E   R    +   +   ++GK KK ++KY +QDE
Sbjct: 817 AQMSSLTAAELQELTGESGED--SEGEDDQRNEHAQQPAVRG-KRGKKKKKQQKYAEQDE 873

Query: 913 EERNIRMALL 922
           +ER +R+ LL
Sbjct: 874 DERQLRLDLL 883


>gi|432938285|ref|XP_004082515.1| PREDICTED: nuclear export mediator factor Nemf-like [Oryzias
           latipes]
          Length = 1089

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 292/746 (39%), Positives = 412/746 (55%), Gaps = 103/746 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +    +GMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFTTVDIRAAIAEINANYVGMRVYNVYDIDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H+T +   K   PSGF +K RKH++TRRL  ++QLG DRI+  QFG    A+++
Sbjct: 53  ESGIRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTHIKQLGIDRIVDMQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY                        D+G  I++ H Y       F    A  +  A
Sbjct: 113 IVELY------------------------DRGNIILADHEYTILNLLRFRNAEAEDVKIA 148

Query: 181 LTSSKEP--DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           +   + P  +A  P+ +          SK   G Q                         
Sbjct: 149 V-RERYPVENARSPEPLISLEQLTEILSKAPKGEQ------------------------- 182

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV---LVLAVAKFEDWL 295
            +K +L   L YG  L EH  ++ GL  ++K+      ++NA +V   +  A+   E ++
Sbjct: 183 -VKRILNPHLSYGATLIEHSFIEAGLPGSIKVDS----QENAAEVAPKIREALQIAESYM 237

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
           +   + +   +G+I+ Q         P +       YDEF P L  Q     F++ ++F+
Sbjct: 238 EK--TENFNGKGFII-QKSEKKPSVAPGKPAEELLTYDEFHPFLFVQHAKSPFLELDSFN 294

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELI 413
            A+DEF+SK+E Q+ + +   +E  A  KL  +  D E R+  L   QEVDR     EL+
Sbjct: 295 KAVDEFFSKMEGQKIDMKALQQEKQALKKLENVKKDHEQRLEALHQAQEVDRL--KGELV 352

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E NL  V+ A+  VR ALAN++ W ++  +VKE + AG+PVA  I +L L  N +++LL 
Sbjct: 353 EINLAVVERALQVVRSALANQVDWAEIGHIVKEAQAAGDPVACAIKELKLHSNHITMLLK 412

Query: 474 N-----------------------NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWY 507
           N                       N +    ++K L   K   V+VDL LSA+ANA+++Y
Sbjct: 413 NPYISEEEQEDEEMKDAVEEKGKKNKNRDKGQKKKLQRNKPMLVDVDLGLSAYANAKKYY 472

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+  E KQ+KT+ A  KA K+AEKKT+  + + +TV  I   RKV+WFEKF WFIS+E
Sbjct: 473 DHKRSAEKKQQKTLEAADKAMKSAEKKTQKTLKEVQTVTTIQKARKVYWFEKFLWFISAE 532

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYLVI+GRD QQNE+IVKRY+  GD+YVHADLHGA+S VIKN   + P+PP TL +AG  
Sbjct: 533 NYLVIAGRDQQQNEIIVKRYLRAGDIYVHADLHGATSCVIKNPSGD-PIPPRTLTEAGTM 591

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC+S AWD+K++TSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  LIMGFG L
Sbjct: 592 AVCYSAAWDAKIITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLIMGFGFL 651

Query: 688 FRLDESSLGSHLNERRVRGEEEGMDD 713
           F+++E S+  H  ER+V+  EE MDD
Sbjct: 652 FKVEEQSVFRHRGERKVKSVEEEMDD 677


>gi|281362528|ref|NP_001163721.1| caliban, isoform B [Drosophila melanogaster]
 gi|281362530|ref|NP_651341.2| caliban, isoform C [Drosophila melanogaster]
 gi|332319785|sp|Q9VBX1.2|NEMF_DROME RecName: Full=Nuclear export mediator factor NEMF homolog; AltName:
           Full=Protein Caliban
 gi|157816462|gb|ABV82224.1| IP12923p [Drosophila melanogaster]
 gi|272477156|gb|ACZ95015.1| caliban, isoform B [Drosophila melanogaster]
 gi|272477157|gb|AAF56406.2| caliban, isoform C [Drosophila melanogaster]
          Length = 992

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 359/965 (37%), Positives = 513/965 (53%), Gaps = 161/965 (16%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1   MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E   LT L   R   +G  +    R    + R  + T   +L A +
Sbjct: 115 LELYDRGNVILTDYE---LTTLYILRPHTEGENLRFAMREKYPVERAKQPTKELELEALV 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                                              K  + ++N +             L+
Sbjct: 172 -----------------------------------KLLENARNGD------------YLR 184

Query: 242 TVLGEALGYGPALSEHIILDTGL------------------------------VPNMKLS 271
            +L   L  GPA+ EH++L  GL                                N KL 
Sbjct: 185 QILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKLE 244

Query: 272 EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
           +      N + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G+    
Sbjct: 245 QKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQ-----VKEEKPTENGTVEFF 297

Query: 332 YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ + 
Sbjct: 298 FRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVK 357

Query: 390 MDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
            D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358 NDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 415

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
           +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR+
Sbjct: 416 QANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKDPEVTVVDVDLALSAWANARRY 475

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFISS
Sbjct: 476 YDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFISS 535

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG 
Sbjct: 536 ENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAGS 594

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  L
Sbjct: 595 MAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLSL 654

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESLS 743
           LF+L++S +  HL ER+VR     ++D +   + KEN    D+ S+ +D D        S
Sbjct: 655 LFKLEDSFIERHLGERKVR----SLEDDQIDPNVKENEVEHDLLSDNEDAD--------S 702

Query: 744 VPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLIDR 801
             N + P      +SN +   FP  +  I +       D  R +  +  V P++E+  + 
Sbjct: 703 NINLSEP------SSNTEITAFPNTEVKIEH-------DTGRIIVRSDSVNPEIEETKES 749

Query: 802 ALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
            + L                      DK +++T        ++   R+K        V  
Sbjct: 750 EVVL----------------------DKILKKTDDEETTIILAGPSRKK-------QVSA 780

Query: 862 PKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMA 920
            K + +K R K +A+ Q    V        ++ RGQKGKLKKMK+KY DQD+EER IRM 
Sbjct: 781 KKTKEDKARAKQEAAKQEVPPVSSEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRMM 840

Query: 921 LLAVS 925
           +L  S
Sbjct: 841 ILKSS 845


>gi|339236819|ref|XP_003379964.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
 gi|316977305|gb|EFV60421.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
          Length = 789

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 295/757 (38%), Positives = 423/757 (55%), Gaps = 82/757 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +  D+ A V+ LR+ IGMR + VYD++PKTY+ KL        S   +KV+++ E
Sbjct: 1   MKGRFSLIDLLAVVQELRQYIGMRLNLVYDINPKTYLLKL--------SKPDKKVMIIFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG+RLH+T Y   K   PSGFT+KLRKH+R +RLED+  +G DRI+  +FG G  A ++I
Sbjct: 53  SGIRLHSTEYGWSKNIMPSGFTMKLRKHLRDKRLEDISVVGLDRIVDMRFGNGPTACHLI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE--RTTASKLHA 179
           +ELY +GN++LTDSE+ +L +LR+   +   V    R  Y  E+ R FE  R TA +   
Sbjct: 113 IELYDRGNVVLTDSEYVILNILRARTIETDNVRYAVRETYLVEV-REFEEYRRTADE--- 168

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP- 238
                            E  N + +A                               QP 
Sbjct: 169 -----------------EMANRLLHAC------------------------------QPG 181

Query: 239 -TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
            TL   L     YGP L EH +L+  L   MK+  V   +     + +     FE  L +
Sbjct: 182 DTLHKCLVPHFPYGPLLLEHCLLENKLSLRMKVQAVIGDQSLVSALALSLSLAFE--LFE 239

Query: 298 VISGDIVPE-GYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           +I  +  P  GY+ M  +          +G   +I+ EF P   +QF + E  +F+TF+ 
Sbjct: 240 MIRKE--PSCGYLKMTVEE-------NAAGERIEIFHEFHPYFFSQFANSECKQFDTFNG 290

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DE++SK++SQ+ +Q+   +E AA  +L  +  D E R+  L+ +     +MA  +E N
Sbjct: 291 AVDEYFSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERMAVAVELN 350

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
            E V+ A+  +R A+A ++ W  +  M+++ R  G+PVAG I  L LERN   + L  ++
Sbjct: 351 SETVEQALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFVMRLPVDV 410

Query: 477 DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
            + D E        VE+DLALS+H N+RRW+   K+   KQ+KTI A  KA K+AE +T+
Sbjct: 411 FDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAAGGKALKSAELRTK 470

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            Q+   +   NI  +RK+ WFEKF+WF SS+  LVI+GRDA+QNE++VKRY+  GD+YVH
Sbjct: 471 EQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEILVKRYLKPGDLYVH 530

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           ADL GA+S VIK    + P+PP TLN+A    VC S AW+SK+VTSAWWV   QVSK+AP
Sbjct: 531 ADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSAWWVKHDQVSKSAP 590

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER-----RVRGEEEGM 711
           +GEYL  G FMIRGKKN+L    L+MGFGLLFRLD  S   HL +R      + GEE   
Sbjct: 591 SGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAARHLEKRCQAEDELDGEEANC 650

Query: 712 DDFEDSGHHKENSDIESEKDDTDEKPV-AESLSVPNS 747
           D+ +D    K+   + SE  +     V +E  S P++
Sbjct: 651 DNLQDE-QKKQKKLVRSELSEQSFNSVNSEEFSYPDN 686


>gi|312082754|ref|XP_003143575.1| serologically defined colon cancer antigen 1 [Loa loa]
          Length = 899

 Score =  486 bits (1250), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 287/737 (38%), Positives = 407/737 (55%), Gaps = 81/737 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T DV A V  L+ L G R +NVYD+  KTY+ ++            EK  +++E
Sbjct: 1   MKNRFSTLDVFAVVHDLKELTGQRVANVYDVDSKTYLIRIQKPD--------EKCFIMLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+H T +   K   PS FT+KLRKHIR +RLE V QLG DRII  QFG   +A +VI
Sbjct: 53  SGCRIHRTTFDWPKAQFPSSFTMKLRKHIRHKRLECVTQLGVDRIIDMQFGFDEHACHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            ELY +GN++LTD+ +T+L +LR   D +  +    + RYP E  R              
Sbjct: 113 AELYDRGNVVLTDNNYTILNVLRPRTDKETDMRFSVQERYPLEAAR-------------- 158

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                              NVS  +K+ L         +  K + K           ++K
Sbjct: 159 ------------------QNVSCPTKDEL--------MERLKTAKKGE---------SVK 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             L     YGP L EH +   G+  N ++     +E++    L  A+ +  D + +VI  
Sbjct: 184 RFLAPLTQYGPTLIEHSLRTVGVAQNAQIGVNIGMEESGAMKLFEAL-QLADQIFNVIRC 242

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           +   +G+++ +             G   + Y EF P + +QF   +   F++F   +DEF
Sbjct: 243 N-AAQGFLVYR-------EDARMDGVIVETYQEFHPFMFSQFSDMQTKHFDSFSECVDEF 294

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           +SK+E Q+A+ +    E  A  KLN +  DQ++R+  LK       +MAELIE N + VD
Sbjct: 295 FSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNSDLVD 354

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
            A+L +R A+AN++SWE +  M     +AGNP+A  I  L L  N M+LLL       D 
Sbjct: 355 KALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR------DP 408

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQI 539
               +  +KV +D+ALS++ NAR+ +  KK  + K++KTI A SKA K+ + K +  L +
Sbjct: 409 YRPEIDPKKVTIDIALSSYQNARKLHTEKKAAQQKEQKTICASSKALKSTKVKIKETLNV 468

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +  K  A +   R+V WFEKF WF+SSENYLVI GRDAQQNE++VKRY+  GD+Y+HAD 
Sbjct: 469 VHSK--AEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRPGDIYMHADT 526

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
            GASS +I+N      +PP TLN+A    V +S AW++K+ ++AWWV+ HQVS+TAPTGE
Sbjct: 527 RGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTSAAWWVHQHQVSRTAPTGE 586

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV-----RGEEEGMDDF 714
           YLT GSFMIRGKKN+LP   L MGFG++F+LDE SL  H  ER+V     + +    DD 
Sbjct: 587 YLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLERHAEERKVAPVVTKDDTVNQDDG 646

Query: 715 EDSGHHKENSDIESEKD 731
           ED G     S  E EKD
Sbjct: 647 EDDGISLTGSGSEDEKD 663


>gi|290975413|ref|XP_002670437.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
 gi|284083996|gb|EFC37693.1| hypothetical protein NAEGRDRAFT_81846 [Naegleria gruberi]
          Length = 1146

 Score =  486 bits (1250), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 294/816 (36%), Positives = 439/816 (53%), Gaps = 147/816 (18%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K RM+  D+   V  LR +LIGMR +N+YD++ KTY+ K   +         EK+++L+
Sbjct: 1   MKNRMSVVDIRCIVAELREQLIGMRLANLYDINKKTYLLKFAKTD--------EKIVVLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+H+TA+ RDK   PS F LK+RKHIRTRRLE + QLG DR++ F FG    A+++
Sbjct: 53  ESGIRIHSTAFERDKSKMPSPFVLKMRKHIRTRRLEKLEQLGVDRVVDFTFGAEEKAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+E +A+GN++LTD ++ ++++LR+H  + +         YP  I    +    SK   A
Sbjct: 113 IVEFFAKGNVVLTDYQYKIISILRTHSKEAEAGLFAVGETYP--ITTRLQSDGISKPTLA 170

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP-- 238
            T            + ++ N     ++EN             +N         + K P  
Sbjct: 171 QTIKT--------AIEKERNAALAPTEEN------------PENPQPTQKKKQQKKAPAV 210

Query: 239 ---TLKTVLGEALGYGPALSEHIILDTG-LVPNMKL------------SEVNKL------ 276
              T+K +L   L YG    EH +L    L  N+ L            SEV+ +      
Sbjct: 211 PTLTVKNLLNNYLDYGTGFVEHCLLTADVLASNLNLLDNAHPDTLKLISEVDNVIASSNV 270

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--------------QNKHLGKDHPP 322
           E   +  LV A  + +D++  + +      GYIL+              +N  L     P
Sbjct: 271 ETPILDKLVSAFKQVDDFIMRIKTEK--QRGYILLKEIVQQQVLDEVTVENPFLPPKKEP 328

Query: 323 TESGSST------------------QI---------------YDEFCPLLLNQFRSR--- 346
           TE+G  +                  QI               YD+F P L  Q R +   
Sbjct: 329 TENGEPSSEEPVVEPEIVLNDLQLKQIELMKQEKRLSIKRDQYDDFTPFLFEQVRRKIPA 388

Query: 347 -----EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
                + ++F++FD + DEF+S IE+++ E Q  + E+    K++K+  +QE ++  L+ 
Sbjct: 389 DKNQIKVIEFDSFDRSADEFFSAIEAKKIESQKSSIENTVEKKMSKVKREQELKLQELQA 448

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL 461
             D+   +A LIE + E VD AI  +  ALA   SWE + +++KE R   +P+A +I KL
Sbjct: 449 SFDKYETIATLIETHYEIVDQAIQVICSALAQSQSWETIKQIIKEHRDV-DPIAAMIHKL 507

Query: 462 YLERNCMSLLL--------------------------------SNNLDEMDDEEKTLPVE 489
            LE + +++ L                                     + D ++K  P+ 
Sbjct: 508 KLESSQITVTLPPPSIDDDDEDEFEYEESDEENDDEDEESDDEEKKEKKSDKKKKEEPM- 566

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANIS 549
           ++++D++L+AHANA ++Y L+KK    +EK   A  KA K  E+KT     + +  + I+
Sbjct: 567 RIDIDISLTAHANAAKYYSLRKKSGENKEKAAFASKKAIKKTEQKTLESAKKSQIKSEIT 626

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
             RK  WFEKF WFI+SENYLV+ GRDAQQNE++VKRYM KGD+Y+HAD+HGASS +IKN
Sbjct: 627 IRRKRFWFEKFYWFITSENYLVLGGRDAQQNELVVKRYMRKGDIYIHADVHGASSCIIKN 686

Query: 610 HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
              E P+PPL+L +AG F VC S AWD+K+++SA+WVY HQVSKTAPTGEYLTVGSFMIR
Sbjct: 687 PTGE-PIPPLSLQEAGMFCVCRSVAWDNKVMSSAYWVYDHQVSKTAPTGEYLTVGSFMIR 745

Query: 670 GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           GKKNFLPP PL+MGF ++F++DES + +H+ ER+ R
Sbjct: 746 GKKNFLPPSPLVMGFAVMFKVDESCIPNHIQERKPR 781


>gi|393907053|gb|EJD74501.1| serologically defined colon cancer antigen 1 [Loa loa]
          Length = 1568

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 287/737 (38%), Positives = 407/737 (55%), Gaps = 81/737 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T DV A V  L+ L G R +NVYD+  KTY+ ++            EK  +++E
Sbjct: 1   MKNRFSTLDVFAVVHDLKELTGQRVANVYDVDSKTYLIRIQKPD--------EKCFIMLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R+H T +   K   PS FT+KLRKHIR +RLE V QLG DRII  QFG   +A +VI
Sbjct: 53  SGCRIHRTTFDWPKAQFPSSFTMKLRKHIRHKRLECVTQLGVDRIIDMQFGFDEHACHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            ELY +GN++LTD+ +T+L +LR   D +  +    + RYP E  R              
Sbjct: 113 AELYDRGNVVLTDNNYTILNVLRPRTDKETDMRFSVQERYPLEAAR-------------- 158

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                              NVS  +K+ L         +  K + K           ++K
Sbjct: 159 ------------------QNVSCPTKDEL--------MERLKTAKKGE---------SVK 183

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             L     YGP L EH +   G+  N ++     +E++    L  A+ +  D + +VI  
Sbjct: 184 RFLAPLTQYGPTLIEHSLRTVGVAQNAQIGVNIGMEESGAMKLFEAL-QLADQIFNVIRC 242

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           +   +G+++ +             G   + Y EF P + +QF   +   F++F   +DEF
Sbjct: 243 N-AAQGFLVYRED-------ARMDGVIVETYQEFHPFMFSQFSDMQTKHFDSFSECVDEF 294

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           +SK+E Q+A+ +    E  A  KLN +  DQ++R+  LK       +MAELIE N + VD
Sbjct: 295 FSKLELQKADVKALNAEKEAMKKLNNVIKDQQDRIAALKVAQLEREEMAELIELNSDLVD 354

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
            A+L +R A+AN++SWE +  M     +AGNP+A  I  L L  N M+LLL       D 
Sbjct: 355 KALLVIRSAIANQLSWEAIEEMRVNACEAGNPIAASIVGLNLNSNQMTLLLR------DP 408

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQI 539
               +  +KV +D+ALS++ NAR+ +  KK  + K++KTI A SKA K+ + K +  L +
Sbjct: 409 YRPEIDPKKVTIDIALSSYQNARKLHTEKKAAQQKEQKTICASSKALKSTKVKIKETLNV 468

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +  K  A +   R+V WFEKF WF+SSENYLVI GRDAQQNE++VKRY+  GD+Y+HAD 
Sbjct: 469 VHSK--AEVMKKRRVMWFEKFFWFVSSENYLVIGGRDAQQNELLVKRYLRPGDIYMHADT 526

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
            GASS +I+N      +PP TLN+A    V +S AW++K+ ++AWWV+ HQVS+TAPTGE
Sbjct: 527 RGASSIIIRNKLGGGDMPPRTLNEAATMAVSYSSAWEAKVTSAAWWVHQHQVSRTAPTGE 586

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV-----RGEEEGMDDF 714
           YLT GSFMIRGKKN+LP   L MGFG++F+LDE SL  H  ER+V     + +    DD 
Sbjct: 587 YLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLERHAEERKVAPVVTKDDTVNQDDG 646

Query: 715 EDSGHHKENSDIESEKD 731
           ED G     S  E EKD
Sbjct: 647 EDDGISLTGSGSEDEKD 663


>gi|440797731|gb|ELR18808.1| isoform 2 of serologically defined colon cancer antigen 1 family
           protein [Acanthamoeba castellanii str. Neff]
          Length = 1138

 Score =  484 bits (1245), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 291/773 (37%), Positives = 410/773 (53%), Gaps = 143/773 (18%)

Query: 5   RMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG 63
           R  + D++A  + LR +++G+R +NVYDL  KTY  KL             K  L+ ESG
Sbjct: 3   RFTSLDISAITRELREKVVGLRIANVYDLGKKTYQLKLAKPD--------HKQYLVFESG 54

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
           VRLHTT + R+++  PS F LKLR+++RT+R+EDVRQLG DR+I    G G   H++I+E
Sbjct: 55  VRLHTTKFQRERQTVPSVFCLKLRRYLRTKRIEDVRQLGIDRVIDITIGSGEAQHHLIIE 114

Query: 124 LYAQGNILLTDSEFTVLTLLRSHR-----DDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           LYA GNI+L D  + + TL+RS++     DD+  VA+ +R  YP +  R    TT  +L 
Sbjct: 115 LYASGNIILVDKNYAIETLIRSYKTGEGTDDEVSVAVGTR--YPVDKARQLVPTTVDRLR 172

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
             L S  E                                               RAK+ 
Sbjct: 173 EVLHSVPEEQ---------------------------------------------RAKE- 186

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +K VL   L  GP L EH +L   L P+ K+SE ++ +  A+   +           + 
Sbjct: 187 AVKDVLNRHLDLGPTLFEHCLLCADLKPHAKVSEYDEAKTEALHRAIQHA--------ES 238

Query: 299 ISGDIVPEGYILMQNKHL--------------GKDH------------------------ 320
           +  D   +GYI++++                 GKD                         
Sbjct: 239 LYSDPTLKGYIVLKDAKPDAAPAASAKALQGKGKDKETQPQPPPQQQQQQQEGRAEEEAQ 298

Query: 321 ---------------PPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
                          P  E    ++++  F P +  QF  R  ++F +FD A+D F+SK 
Sbjct: 299 SPVVPATPAPQDAAKPDGEEDYDSRLFMMFVPYVYKQFEGRPRLEFPSFDEAVDIFFSKA 358

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           + Q+ E + + +E         +  D E R+  L +  +  +K A LIE N+ DVDAAI 
Sbjct: 359 QEQQVEVKKEQQE-------KTVKKDHETRIAALTKAEEECIKKAHLIETNVSDVDAAIK 411

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT 485
                LA  M W  L R+VKE +KAG+P+A LI  L    N ++LLL + L+   D    
Sbjct: 412 VTCSELARGMDWAQLTRVVKEAKKAGDPIANLIHSLDFANNRITLLLVDPLEAAADASGA 471

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV 545
           +  +KVEVD+  +A+ANA+ +Y   K++  K  KT+ +   A KAAEKK R +I      
Sbjct: 472 M--QKVEVDIGQTAYANAQEFYAEAKRRAHKHAKTVASSQMAVKAAEKKARREIKDVGVK 529

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
           A I  +RK +WFEKF+WFISSENY+VISGRDAQQNE+IVKRY+ KGD YVHADLHGA++ 
Sbjct: 530 AAIQKVRKAYWFEKFHWFISSENYVVISGRDAQQNELIVKRYLRKGDAYVHADLHGAATC 589

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           V+KN  P++P+P LTL +AG  T+           TSAWWV+P QVSKTAP+GEYL  GS
Sbjct: 590 VVKNPHPDKPIPALTLAEAGSMTI----------PTSAWWVHPEQVSKTAPSGEYLVTGS 639

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSG 718
           FMIRGKKNFLPP  L+MGF  +F++D +S+ +H+NER VR   E + + E +G
Sbjct: 640 FMIRGKKNFLPPSQLVMGFAYMFKVDPTSVANHVNERAVRTLVE-LSELEGAG 691


>gi|195107152|ref|XP_001998180.1| GI23827 [Drosophila mojavensis]
 gi|193914774|gb|EDW13641.1| GI23827 [Drosophila mojavensis]
          Length = 962

 Score =  477 bits (1227), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 343/971 (35%), Positives = 499/971 (51%), Gaps = 205/971 (21%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R ++ D+   +  L+RLIG+R + +YD+  KTY+F+L         G SEK +    
Sbjct: 1   MKTRFSSYDIICGIAELQRLIGLRVNQIYDIDNKTYLFRLHGG------GSSEKNM---- 50

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                            PSGF +K RKH++ +RLE + QLG DRI+ FQFG G  A++V 
Sbjct: 51  ----------------APSGFCMKFRKHLKNKRLEHINQLGADRIVDFQFGSGEAAYHVF 94

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T+L +LR H + +  +    R +YP +  ++             
Sbjct: 95  LELYDRGNVILTDYEKTILYILRPHTEGE-SIRFAVREKYPVDRAKI------------- 140

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                            GN     S+            ++ +NSN+           +LK
Sbjct: 141 -----------------GNCELRESEMR----------EIIENSNEGD---------SLK 164

Query: 242 TVLGEALGYGPALSEHIILDTGL------------------------VPNMKLSEVN--- 274
            +L   L  GPA+ EH++++ GL                          N K SE+N   
Sbjct: 165 RILMPILDCGPAVIEHVLIEHGLENHLIRGSVDQEKGQVESSKKQSTKKNRKSSEINPSD 224

Query: 275 -KLEDNAIQV--LVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
            +  D A  +  L+LA+    D +   I  +   +G+I+       K+   T + ++   
Sbjct: 225 IQFFDLAADLPQLMLAIKSAYDIM--AIGRNGSSKGFIIQV-----KEEKLTNAENTEHF 277

Query: 332 YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           Y   EF P L +Q++   F ++ETF  A+DEF+S  ESQ+ + +   +E  A  KL+ + 
Sbjct: 278 YRNIEFHPYLFSQYKKLPFKEYETFMEAVDEFFSSQESQKIDIKTLQQEREALKKLSNVK 337

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
            D   R+  L +  D   K AELI  N   VD AILA++ A+A+++SW D+  +VKE + 
Sbjct: 338 KDHTKRLEELNRVQDDDKKKAELITSNQCLVDKAILAIQSAIASQLSWPDIQELVKEAQA 397

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSN---NLDEMDDEEKTLPVEKVEVDLALSAHANARRW 506
            G+ VA  I +L LE N +SLLLS+   N +E D+ +  +    V++DLALSA ANARR+
Sbjct: 398 NGDIVASSIKQLKLEINHISLLLSDPYKNENENDNADSVI----VDIDLALSAWANARRY 453

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           Y+LK+    K++KTI A  KA K+AE+KT+  + + +T++NI+  RK+ WFEKF WFISS
Sbjct: 454 YDLKRSAALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKIFWFEKFFWFISS 513

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           ENYLVI GRDAQQNE+IVKRYM   D+YVHAD+ GASS +I+N   E+ +PP TL +AG 
Sbjct: 514 ENYLVIGGRDAQQNELIVKRYMRPKDIYVHADIQGASSVIIRNTTGEE-IPPKTLLEAGT 572

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             + +S AWD+K+VT+++WVY HQVSKTAPTGEYL  GSFMIRGKKNFLP   LIMG  L
Sbjct: 573 MAISYSVAWDAKVVTNSYWVYSHQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMGLSL 632

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESL---- 742
           LF+L++S L  H  ER++R  E+ ++     G   E  +I S  D  +     ES+    
Sbjct: 633 LFKLEDSFLQRHAGERKIRTTEDIIN-----GDKIEQPEI-SSTDLNEINEACESINEYG 686

Query: 743 --SVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLID 800
             S PN+       T    V +      +KT  + +D +  DI                 
Sbjct: 687 KNSFPNTEVKIEHDTGRITVKTDLLDETNKT--DAVDQQSLDI----------------- 727

Query: 801 RALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVV 860
                                  +++ED  + + A  R K   +K  R            
Sbjct: 728 -----------------------INDEDTVIIQPAPSRKKNQSTKKRR------------ 752

Query: 861 DPKVEREKERGKDAS------SQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEE 914
                 +KER + A+        PE+    +K++     RGQKGKLKK+K KY DQD+EE
Sbjct: 753 -----EDKERSEKANIEMVYVGSPETDKSSSKVK-----RGQKGKLKKIKLKYRDQDDEE 802

Query: 915 RNIRMALLAVS 925
           R IRM +L  S
Sbjct: 803 RKIRMMILNSS 813


>gi|403374308|gb|EJY87098.1| DUF3441 multi-domain protein [Oxytricha trifallax]
          Length = 1126

 Score =  476 bits (1224), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 275/744 (36%), Positives = 441/744 (59%), Gaps = 88/744 (11%)

Query: 27  SNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKL 86
           +NVYD+S + Y+ KL        S  + K  LL+ESG+R+HTT + R+KK+ PSGF++KL
Sbjct: 23  ANVYDVSGRLYLLKL--------SKANRKEHLLIESGIRIHTTEFLRNKKDVPSGFSMKL 74

Query: 87  RKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSH 146
           RKH+RT++L ++ QLG DR+I  QFG G NA+++++ELYA GN++LTD E+T+L+LLRSH
Sbjct: 75  RKHLRTKKLCNITQLGVDRVIDLQFGQGENAYHILVELYASGNVILTDFEYTILSLLRSH 134

Query: 147 RDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD------ANEPDKVNEDGN 200
           + D+    I  + +YP      F       + +   S   PD        EP+K  E G 
Sbjct: 135 KFDETS-KIQIKEKYP------FTAAAGMTIDSIFVS---PDDIKRFIEGEPEK--EQGQ 182

Query: 201 NVSNASKENLGGQKGGKSFDL------SKNSNKNS----------------NDGARAKQP 238
              N +K  + GQ+     +       +++ NK                  +   + K+ 
Sbjct: 183 KEDNLNK--IEGQENNNEENAAAQPKPAEDKNKKGLSEKQQQKQDKKQKNQDKKDKKKEV 240

Query: 239 TLKTVLGEALGY-GPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
            +K++L + + Y     +EH++   G  PN K +++ + +     VL+ A  + +  ++D
Sbjct: 241 NMKSILTKMVPYINFPYAEHVLKLLGQDPNAK-AQIEQSD-----VLIQAAMQCQQLVRD 294

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSST---------------------QIYDEFC 336
           + + + + +G+++   K + +   P  + ++                      ++  +F 
Sbjct: 295 LETSEEI-KGFLIYSEKPIEEKKVPVLTTTTAVALPQVEQLEQETEQDIKFKGKLLKDFG 353

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
           P+ L QF S   +++ +FD  +DE++S+++ QR + ++  KED  + K+++I  DQ  R+
Sbjct: 354 PIPLAQFASDPCLEYASFDQCVDEYFSQLDKQREQSKYSNKEDEIWKKMSRIKDDQAKRI 413

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
             L++E D S   A+L++  + +V A I  ++V   + +SW D+ RMVKEE+KAGNP+A 
Sbjct: 414 QGLQKEQDLSEFKAQLLQKYIYEVQALIDILQVMQTSGISWNDIQRMVKEEKKAGNPLAD 473

Query: 457 LIDKLYLERNCMSLLL-SNNLDEMDDE-------EKTLPVEKVEVDLALSAHANARRWYE 508
           LI K+  E+N ++L+L + N ++ ++E       E   PV +V+VDL +SA  N R+++E
Sbjct: 474 LIYKMNFEKNSVTLMLDACNEEDAENEFAVDEKFENFDPVVRVDVDLHISAQMNIRKYFE 533

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           +KKK   K+ KT TA   AFK AE     +I++ +    I  MRKV+WFEKF+WFISSEN
Sbjct: 534 IKKKSYEKEVKTKTAADIAFKDAETNALKEIVKHRQTQKIDRMRKVYWFEKFDWFISSEN 593

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YL ISG++AQ NE++VKRYM KGD+++H D+ GA+ T+IKN      VPP+TLN+A  F 
Sbjct: 594 YLCISGKNAQLNEVLVKRYMDKGDLFMHTDMPGAAVTIIKNPSG-LIVPPITLNEAAIFE 652

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +CHS+AW+ K+VTS +WV+  QVSKT PTG Y+  GSFMIRGK+N + P  L +GF ++F
Sbjct: 653 LCHSKAWEGKIVTSVYWVHADQVSKTPPTGLYIPTGSFMIRGKRNIMTPSKLELGFTIMF 712

Query: 689 RLDESSLGSHLNERRVRGEEEGMD 712
            L+E S+ +H+ ERR R  +E MD
Sbjct: 713 TLNEESIANHMGERRPRLLQEEMD 736


>gi|195573753|ref|XP_002104856.1| GD21177 [Drosophila simulans]
 gi|194200783|gb|EDX14359.1| GD21177 [Drosophila simulans]
          Length = 972

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 352/966 (36%), Positives = 503/966 (52%), Gaps = 183/966 (18%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V              
Sbjct: 1   MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV-------------- 46

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                       +K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 47  ------------EKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 94

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T L +LR H + +  +    R +YP E                 
Sbjct: 95  LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE----------------- 136

Query: 182 TSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             +K+P    EP+ + +   N  N                                   L
Sbjct: 137 -RAKQPTKELEPEALVKLLENARNGD--------------------------------YL 163

Query: 241 KTVLGEALGYGPALSEHIILDTGL------------------------------VPNMKL 270
           + +L   L  GPA+ EH++L  GL                                N KL
Sbjct: 164 RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKL 223

Query: 271 SEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
            +      N + +L  AV   ++ + +  SG    +GYI+       K+  P E+G+   
Sbjct: 224 EQKPFDMVNDLPILQQAVKDAQELIAEGNSGK--GKGYIIQV-----KEEKPAENGTVEF 276

Query: 331 IYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
            +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ +
Sbjct: 277 FFRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNV 336

Query: 389 HMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
             D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE
Sbjct: 337 KNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKE 394

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARR 505
            +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLA+SA ANARR
Sbjct: 395 AQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKAPEVTVVDVDLAMSAWANARR 454

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFIS
Sbjct: 455 YYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFIS 514

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG
Sbjct: 515 SENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAG 573

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  
Sbjct: 574 SMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLS 633

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESL 742
           LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +DTD   +  +L
Sbjct: 634 LLFKLEDSFIERHLGERKVR----SLDDDQIDPNVKETEVEHDLLSDNEDTD---LNTNL 686

Query: 743 SVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLID 800
           S P           +SN +   FP  +  I +       D  R    +  V P++E+  +
Sbjct: 687 SEP-----------SSNTEITAFPNTEVKIEH-------DTGRITVRSDSVNPEIEETKE 728

Query: 801 RALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVV 860
             + L                      DK + + A V +   I  A  RK        V 
Sbjct: 729 SEVVL----------------------DK-ILKKADVEETTIILAAPSRK------KQVS 759

Query: 861 DPKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRM 919
             K + +K R K +A+ Q  + V        ++ RGQKGKLKKMK+KY DQD+EER IRM
Sbjct: 760 AKKTKEDKARAKQEAAKQEVAPVSTEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRM 819

Query: 920 ALLAVS 925
            +L  S
Sbjct: 820 MILKSS 825


>gi|110735863|dbj|BAE99907.1| hypothetical protein [Arabidopsis thaliana]
          Length = 329

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 237/333 (71%), Positives = 265/333 (79%), Gaps = 26/333 (7%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVKVRMNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLM
Sbjct: 1   MVKVRMNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLM 60

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVRLHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYV
Sbjct: 61  ESGVRLHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYV 120

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           ILELYAQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +
Sbjct: 121 ILELYAQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQES 180

Query: 181 LTSS--KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           LT+   K+ DA + +             KE  GG+KGGK           SND   AKQ 
Sbjct: 181 LTAFVLKDHDAKQIE------------PKEQNGGKKGGK-----------SNDSTGAKQY 217

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TLK +LG+ALGYGP LSEHIILD GLVP  KLSE  KL+DN IQ+LV AV  FEDWL+D+
Sbjct: 218 TLKNILGDALGYGPQLSEHIILDAGLVPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDI 277

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
           I+G  VPEGYILMQ + L  D   +ESG   ++
Sbjct: 278 INGQKVPEGYILMQKQILAND-TTSESGGVKKV 309


>gi|195354790|ref|XP_002043879.1| GM17806 [Drosophila sechellia]
 gi|194129117|gb|EDW51160.1| GM17806 [Drosophila sechellia]
          Length = 972

 Score =  471 bits (1211), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 353/966 (36%), Positives = 504/966 (52%), Gaps = 183/966 (18%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V              
Sbjct: 1   MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV-------------- 46

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                       +K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 47  ------------EKNMAPSGFSMKLRKHLKNKRLEQVQQMGSDRIVDFQFGTGDAAYHVI 94

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T L +LR H + +  +    R +YP       ER          
Sbjct: 95  LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPV------ER---------- 137

Query: 182 TSSKEP-DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             +K+P +  EP+ + +   N  N                                   L
Sbjct: 138 --AKQPTNELEPEALVKLLENARNGD--------------------------------YL 163

Query: 241 KTVLGEALGYGPALSEHIILDTGL------------------------------VPNMKL 270
           + +L   L  GPA+ EH++L  GL                                N KL
Sbjct: 164 RQILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKL 223

Query: 271 SEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
                   N + +L  AV   ++ + +  SG    +GYI+       K+  P E+G+   
Sbjct: 224 EHKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQ-----VKEEKPAENGTVEF 276

Query: 331 IYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI 388
            +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ +
Sbjct: 277 FFRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNV 336

Query: 389 HMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
             D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE
Sbjct: 337 KNDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKE 394

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARR 505
            +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR
Sbjct: 395 AQANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKAPEVTVVDVDLALSAWANARR 454

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y++K+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF WFIS
Sbjct: 455 YYDMKRSAAQKEKKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKFYWFIS 514

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYLVI GRDAQQNE+IVKRYM   D+YVHA++ GASS +I+N   E+ +PP TL +AG
Sbjct: 515 SENYLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVIIQNPTGEE-IPPKTLLEAG 573

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L MG  
Sbjct: 574 SMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHLTMGLS 633

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENS---DIESEKDDTDEKPVAESL 742
           LLF+L++S +  HL ER+VR     +DD +   + KE     D+ S+ +D D   +  +L
Sbjct: 634 LLFKLEDSFIERHLGERKVR----SLDDDQIDPNVKETEVEHDLLSDNEDAD---LNTNL 686

Query: 743 SVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV--AAPVTPQLEDLID 800
           S P           +SN +   FP  +  I +       D  R    +  V P++E+  +
Sbjct: 687 SEP-----------SSNTEITAFPNTEVKIEH-------DTGRITVRSDSVNPEIEETKE 728

Query: 801 RALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVV 860
             + L                      DK +++T  V +   I  A  RK        V 
Sbjct: 729 SEVVL----------------------DKILKKT-DVEETTIILAAPSRK------KQVS 759

Query: 861 DPKVEREKERGK-DASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRM 919
             K + +K R K +A+ Q  + V        ++ RGQKGKLKKMK+KY DQD+EER IRM
Sbjct: 760 AKKTKEDKARAKQEAAKQEVAPVSTEPKNPSQVKRGQKGKLKKMKQKYKDQDDEEREIRM 819

Query: 920 ALLAVS 925
            +L  S
Sbjct: 820 MILKSS 825


>gi|313211850|emb|CBY15998.1| unnamed protein product [Oikopleura dioica]
          Length = 699

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 277/763 (36%), Positives = 427/763 (55%), Gaps = 101/763 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A +  +R  L+     N+YD+  KTY+ KL   +         K +LL 
Sbjct: 1   MKTRFTVLDIKAALAEIRDNLLHHYVLNIYDIDSKTYLLKLRKCAS--------KHVLLF 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG--MNAH 118
           ESG R+H T     K   PSGF++KLRKH++ +RL +  QLG+DRII  QFG    ++  
Sbjct: 53  ESGNRVHPTEMEWPKNTAPSGFSMKLRKHLKGKRLINATQLGFDRIIDLQFGTSACLDEF 112

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           ++I+ELY +GNI+L D E+T+L LLR+  D         R  YP           A  L 
Sbjct: 113 HLIIELYDRGNIILCDQEYTILNLLRARTDKTTDERFAVRESYPV--------GQAQPLK 164

Query: 179 AALTSSKEPDAN-EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
               S++E + N +P ++               G +K  K+  ++K              
Sbjct: 165 EPFLSTEELEENIKPPQIQ--------------GNKKKNKNLTIAKQ------------- 197

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
                 L   LGYG  L EH +++ GL   +  + V++  D  ++ L       ++  + 
Sbjct: 198 ------LNSCLGYGTDLIEHFLIEEGL--EVATASVSQDADEILECL-------QNCYEF 242

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
           + SG    +G+I             T++  +   Y ++ P L NQ +    ++ E F  A
Sbjct: 243 LNSGKTKFQGFI------------STKTNDNVLQYVDYQPFLFNQSQLDSTIELEKFSLA 290

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +D+FY +I+SQ+AEQ+    E +A  KL  + +D   R+ +LK     +V+ A+LIE NL
Sbjct: 291 VDKFYGEIQSQKAEQKMMQAEKSAMKKLENVKLDHMKRLESLKLAQADNVRKAQLIEMNL 350

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           + VD+A+  VR A+A+++ WE++   ++E +  G+PV+  I +L L+ N + ++LS  + 
Sbjct: 351 DLVDSALNQVRSAVASQIGWEEIEDFLEEGQDEGDPVSIAIRELKLKTNQIVMMLSEPMY 410

Query: 478 EMDDE--------------------EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ 517
           +  D                     E +  +  + +DL+LSA  NA+ +Y+ K+    K+
Sbjct: 411 DDSDSSSEEEENPSESEYTKSARVTEGSEIIIYIFLDLSLSAFGNAKAFYDSKRAAADKE 470

Query: 518 EKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDA 577
            KTI A  KA K+AEKKT   +   +TV  ++ +RK  WFEKF WFISSENYLVI+G+DA
Sbjct: 471 SKTIDASKKALKSAEKKTNESLKNIQTVRQVTKVRKQMWFEKFFWFISSENYLVIAGKDA 530

Query: 578 QQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
           QQNE IVK+Y+  GDVYVHAD+HGASS ++KN  P +PV P+TL++ G   VCHS AW++
Sbjct: 531 QQNETIVKKYLKNGDVYVHADIHGASSCIVKNIDPSKPVSPVTLHEVGHAAVCHSAAWNA 590

Query: 638 KMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           K++TSAWWV+ +QVSKTAP+GEYL+ GSFMIRGKKN+LPP  L++GFG LF+LD++ +  
Sbjct: 591 KVLTSAWWVHANQVSKTAPSGEYLSTGSFMIRGKKNYLPPSQLVLGFGFLFKLDDACVAR 650

Query: 698 HLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
           H  ER+++G    ++D E+    KE S++   K++ + +P  E
Sbjct: 651 HAGERKIKG---LVNDVEE----KEQSELGEIKEENENEPQLE 686


>gi|28416669|gb|AAO42865.1| At5g49930 [Arabidopsis thaliana]
          Length = 324

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 232/328 (70%), Positives = 260/328 (79%), Gaps = 26/328 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           MNTADVAAEVKCL+RLIGMRCSNVYD+SPKTY+FKL+NSSG+TESGESEKVLLLMESGVR
Sbjct: 1   MNTADVAAEVKCLKRLIGMRCSNVYDISPKTYMFKLLNSSGITESGESEKVLLLMESGVR 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           LHTTAY RDK NTPSGFTLKLRKHIRTRRLEDVRQLGYDRII+FQFGLG NAHYVILELY
Sbjct: 61  LHTTAYVRDKSNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIVFQFGLGANAHYVILELY 120

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS- 184
           AQGNI+LTDSE+ ++TLLRSHRDD+KG AIMSRHRYP EICRVFERTT SKL  +LT+  
Sbjct: 121 AQGNIILTDSEYMIMTLLRSHRDDNKGFAIMSRHRYPIEICRVFERTTVSKLQESLTAFV 180

Query: 185 -KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
            K+ DA + +             KE  GG+KGGK           SND   AKQ TLK +
Sbjct: 181 LKDHDAKQIE------------PKEQNGGKKGGK-----------SNDSTGAKQYTLKNI 217

Query: 244 LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
           LG+ALGYGP LSEHIILD GLVP  KLSE  KL+DN IQ+LV AV  FEDWL+D+I+G  
Sbjct: 218 LGDALGYGPQLSEHIILDAGLVPTTKLSEDKKLDDNEIQLLVQAVIVFEDWLEDIINGQK 277

Query: 304 VPEGYILMQNKHLGKDHPPTESGSSTQI 331
           VPEGYILMQ + L  D   +ESG   ++
Sbjct: 278 VPEGYILMQKQILAND-TTSESGGVKKV 304


>gi|195388566|ref|XP_002052950.1| GJ23608 [Drosophila virilis]
 gi|194151036|gb|EDW66470.1| GJ23608 [Drosophila virilis]
          Length = 966

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 350/969 (36%), Positives = 504/969 (52%), Gaps = 197/969 (20%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R N+ D+   V  L+RLIG+R + VYD+  KTY+F+L         G SEK ++   
Sbjct: 1   MKTRFNSYDITCGVAELQRLIGLRVNQVYDIDNKTYLFRLHGG------GASEKNVV--- 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                            PSGF++KLRKH++ +RLE + QL  DRI+ FQFG G  A++V+
Sbjct: 52  -----------------PSGFSMKLRKHLKNKRLERISQLATDRIVDFQFGTGEAAYHVL 94

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GNI+LTD E  +L +LR H + +  +    R +YP+   +V             
Sbjct: 95  LELYDRGNIILTDYEQIILYILRPHTEGE-CLRFAVREKYPSGRAQV------------- 140

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                            GN     S+E L            +   + SN G   K+  L 
Sbjct: 141 -----------------GN--IELSEEAL------------REIIEQSNVGEGLKR-ILL 168

Query: 242 TVLGEALGYGPALSEHIILDTGL--------------------VPNMKLSEVN----KLE 277
            VLG     GPA+ EH++++ G+                      N + S+++    KL 
Sbjct: 169 PVLG----CGPAVIEHVLIEHGIENCVVSAQQEQTETSKANRCKKNRRSSQISRADTKLF 224

Query: 278 DNA--IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD-- 333
           D A  + +LV A+    D +     G+   +G+I+       K+  P+ + S+   Y   
Sbjct: 225 DFATDLPLLVKAIQSARDIMDLGQKGNC--KGFIIQI-----KEEKPSSTESTDHFYRNV 277

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           EF P L +Q +   F ++ TF  A+DEF+S  ESQ+ + +   +E  A  KL+ +  D  
Sbjct: 278 EFHPYLFSQHKKMPFKEYNTFMEAVDEFFSTQESQKIDMKTLQQEREALKKLSNVKNDHT 337

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
            R+  L +  D   K AELI  N   VD AILA++ A+A+++SW D+  +VKE +  G+ 
Sbjct: 338 RRLEELNKVQDLDKKKAELITCNQSLVDKAILAIQSAIASQLSWPDIQELVKEAQANGDI 397

Query: 454 VAGLIDKLYLERNCMSLLLS----------NNLDEMDDEEKTLPVEKVEVDLALSAHANA 503
           VA  I KL LE N +SLLL+          N+ +  D+++  L    ++VDLALSA ANA
Sbjct: 398 VARSIKKLKLEINHISLLLTDPYKCGNEYLNDENGADNDDSLL----IDVDLALSAWANA 453

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWF 563
            R+Y+LK+    K++KTI A  KA K+AE+KT+  + + +T++NI+  RKV WFEKF WF
Sbjct: 454 CRYYDLKRSAALKEKKTIDASQKALKSAERKTQQTLKEVRTISNIAKARKVFWFEKFFWF 513

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
           +SSENYL+I GRDAQQNE+IVKRYM   DVYVHAD+ GASS +I+N      +PP TL +
Sbjct: 514 VSSENYLIIGGRDAQQNELIVKRYMRPKDVYVHADIQGASSVIIRNSTGGD-IPPKTLLE 572

Query: 624 AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           AG   + +S AWD+K+VT+++WVY  QVSKTAPTGEYL  GSFMIRGKKNFLP   LIMG
Sbjct: 573 AGTMAISYSVAWDAKVVTNSYWVYSDQVSKTAPTGEYLGTGSFMIRGKKNFLPSCHLIMG 632

Query: 684 FGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLS 743
             +LF+L++S +  H+ ER++R  E+ +D                           E++ 
Sbjct: 633 LSILFKLEDSFIQRHVGERKIRSTEDAIDQ--------------------------ENVK 666

Query: 744 VPNSAHPAPSHT---NASNVDS--HEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
            P   +  P+     N +N DS  + FP  +  + +       D  R     +T + E  
Sbjct: 667 QPEITYTDPNQITELNDANSDSAINVFPNTEVKVEH-------DTGR-----ITIKTE-- 712

Query: 799 IDRALGLGSASISSTKHGIETTQFD--LSEEDKHVERTATVRDKPYISKAERRKLKKGQG 856
               LG         K  I  +Q D  ++EED  + + A  R K   +K  +RK  KG  
Sbjct: 713 ---LLG------EDIKTNIIESQHDNPINEEDAVIIKAAPSRKKNQQTK--KRKECKGHM 761

Query: 857 SSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERN 916
                 K + E+ +      QP        I   K+ RGQKGKLKKMK+KY DQD+EER 
Sbjct: 762 E-----KADLERLQNNSPEIQP--------INSSKVKRGQKGKLKKMKQKYKDQDDEERE 808

Query: 917 IRMALLAVS 925
           IRM +L  S
Sbjct: 809 IRMMILNSS 817


>gi|339260826|ref|XP_003368211.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
 gi|316964832|gb|EFV49764.1| serologically defined colon cancer antigen 1-like protein
           [Trichinella spiralis]
          Length = 749

 Score =  457 bits (1175), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 274/704 (38%), Positives = 390/704 (55%), Gaps = 72/704 (10%)

Query: 54  EKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGL 113
           +KV+++ ESG+RLH+T Y   K   PSGFT+KLRKH+R +RLED+  +G DRI+  +FG 
Sbjct: 5   KKVMIIFESGIRLHSTEYGWSKNIMPSGFTMKLRKHLRDKRLEDISVVGLDRIVDMRFGN 64

Query: 114 GMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE--R 171
           G  A ++I+ELY +GN++LTDSE+ +L +LR+   +   V    R  Y  E+ R FE  R
Sbjct: 65  GPTACHLIIELYDRGNVVLTDSEYVILNILRARTIETDNVRYAVRETYLVEV-REFEEYR 123

Query: 172 TTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSND 231
            TA +                    E  N + +A                          
Sbjct: 124 RTADE--------------------EMANRLLHAC------------------------- 138

Query: 232 GARAKQP--TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
                QP  TL   L     YGP L EH +L+  L   MK+  V   +     + +    
Sbjct: 139 -----QPGDTLHKCLVPHFPYGPLLLEHCLLENKLSLRMKVQAVIGDQSLVSALALSLSL 193

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
            FE  L + I  +    GY+ M  +          +G   +I+ EF P   +QF S E  
Sbjct: 194 AFE--LFEKIRKE-PSRGYLKMTVEE-------NAAGERIEIFHEFHPYFFSQFASSECK 243

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F+TF+ A+DE++SK++SQ+ +Q+   +E AA  +L  +  D E R+  L+ +     +M
Sbjct: 244 QFDTFNGAVDEYFSKLDSQKCQQKQLQQERAALKRLENVRQDHEQRLANLQADQMLKERM 303

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
           A  +E N E V+ A+  +R A+A ++ W  +  M+++ R  G+PVAG I  L LERN   
Sbjct: 304 AVAVELNSETVEQALAVLRSAIAMKLEWFQINEMIQDARDLGDPVAGKIVGLCLERNAFV 363

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           + L  ++ + D E        VE+DLALS+H N+RRW+   K+   KQ+KTI A  KA K
Sbjct: 364 MRLPVDVFDNDQELGDAETVDVEIDLALSSHQNSRRWFSQMKESALKQKKTIAAGGKALK 423

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
           +AE  T+ Q+   +   NI  +RK+ WFEKF+WF SS+  LVI+GRDA+QNE++VKRY+ 
Sbjct: 424 SAELHTKEQLKSTRQKTNIGKVRKMFWFEKFHWFFSSDRLLVIAGRDAKQNEILVKRYLK 483

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            GD+YVHADL GA+S VIK    + P+PP TLN+A    VC S AW+SK+VTSAWWV   
Sbjct: 484 PGDLYVHADLRGAASVVIKQSEDKGPIPPKTLNEAAALAVCLSAAWESKVVTSAWWVKHD 543

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER-----RV 704
           QVSK+AP+GEYL  G FMIRGKKN+L    L+MGFGLLFRLD  S   HL +R      +
Sbjct: 544 QVSKSAPSGEYLKTGGFMIRGKKNYLTASQLVMGFGLLFRLDSESAARHLEKRCQAEDEL 603

Query: 705 RGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPV-AESLSVPNS 747
            GEE   D+ +D    K+   + SE  +     V +E  S P++
Sbjct: 604 DGEEANCDNLQDE-QKKQKKLVRSELSEQSFNSVNSEEFSYPDN 646


>gi|219109751|ref|XP_002176629.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411164|gb|EEC51092.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1238

 Score =  454 bits (1167), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 336/989 (33%), Positives = 499/989 (50%), Gaps = 123/989 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDL-SPKTYIFKLMNSSGVTESGESE----- 54
           VKVR +  DV A V  + RRL+G +  NVYD  + +TY+FKL +S G T S  +      
Sbjct: 12  VKVRFDGLDVTAMVSHVQRRLLGRKIINVYDGDNGETYVFKLDSSGGTTISNNNNNTSNS 71

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           K  LL+ESG+R H   +       P+ F  KLRKH+R  RLE + Q+G DR+IL QFG G
Sbjct: 72  KEFLLLESGIRFHPLEHFESNLPMPTPFCAKLRKHLRGLRLEQISQIGTDRVILLQFGSG 131

Query: 115 MNAHYVILELYAQGNILLTDS-EFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTT 173
            + H +ILELYA+GNI+LT+   +T+L LLRSH  +   VA+     YP       ++  
Sbjct: 132 ASRHALILELYAKGNIILTEGIHYTILALLRSHVYEKDQVAVQVGQVYPVTYATSVQKDN 191

Query: 174 ASKLHAALTSSKEPDANEPDKVN---------EDGNNVSNASKENLGGQKGGKSFDLSKN 224
            +  +A   +  +P+ N+P   +         ++ N + N S E +  Q           
Sbjct: 192 QTVANAVAATDTQPE-NDPSPTSRIMDTACAAKNKNGILNMSIEEI--QASLALLLEPAP 248

Query: 225 SNKNSNDGARAKQPTLKTVLGE----ALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA 280
            +  +  G +     LKT+L +       YGPAL EH IL   L+P+  + E        
Sbjct: 249 VSATTKKGKKGSPLNLKTLLLQPQWGVSQYGPALLEHCILQANLLPHASIKET------- 301

Query: 281 IQVLVLAVAKFEDW-----------LQDVISGDIVPEGYILMQNK---HLGKDHPPTESG 326
               VL  A +E             + ++ S  I   GYIL Q +    +    P +E+ 
Sbjct: 302 ----VLQAADWERLQTSLSEQGPAIMYNLHSAAIDTPGYILYQPRVEEDIVNGKPHSENL 357

Query: 327 SST------------QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQH 374
           SS             ++  EF P LL Q ++   ++++ F AA+ +F++ + +Q+   + 
Sbjct: 358 SSAVAVVAKELAHADKVLLEFQPHLLAQHQNCPRLEYKHFGAAVADFFAHMVAQKRLLKV 417

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANR 434
           +A E A   KL K+  DQ +RV  L+++       A++++ N E+VD A+L +  AL + 
Sbjct: 418 QASEMAVQEKLRKVQQDQADRVMALERDQQTLQAYAQVVKNNAENVDKALLVINSALDSG 477

Query: 435 MSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN-LDEMDDEEKTLPVEKVEV 493
           M W+ L  +V  E+   NP+A LI +L LE   M L L  +  DE+ D      V  V V
Sbjct: 478 MDWDQLIELVSVEQANRNPIANLIVRLELENEIMILRLPRDPFDELSD------VLNVNV 531

Query: 494 DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ-----EKTVANI 548
            L  SAHANA   +   +  + K +KT+ + SKA +AAE+  + Q+++     ++TVA +
Sbjct: 532 SLKDSAHANASALFAKYRASKEKTQKTLESSSKALQAAEESAQRQLIEAQRRTKQTVAAV 591

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
              RK  W+EKF+WF++S+NYLV+ G+DA QNE++VKRY+  GD Y+HA++HGA+S +++
Sbjct: 592 K--RKPAWYEKFHWFVTSDNYLVLGGKDAHQNELLVKRYLRAGDAYLHAEVHGAASCILR 649

Query: 609 NHRPEQP-----VPPLT---LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
             R   P       PL+   L +AG FT+C S AW S+MVTSAWWV  HQVSKTAP+GE+
Sbjct: 650 AKRRRLPNGATQSIPLSDQALREAGNFTICRSSAWASRMVTSAWWVESHQVSKTAPSGEF 709

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNERRVRGEEEGMDDFEDSGH 719
           LTVGSFM+RGKKNFLPP PL MG  +LFRL D+ S+  H  ERR         DF     
Sbjct: 710 LTVGSFMVRGKKNFLPPSPLEMGLAVLFRLGDDDSIARHKTERR---------DF----- 755

Query: 720 HKENSDIESEKDDTDEKPVAESLSV-PNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDS 778
               + IE E    D      S  + P +       T   +   HE        S+ +  
Sbjct: 756 ----ALIELENSSVDVLDAVSSFQMEPKTNIEGQEATTHRDTTEHEG-------SDLVSD 804

Query: 779 KIF-DIARNVAAPVTPQLEDLIDRAL----GLGSASISSTKHGIETTQFDLSEEDKHVER 833
           +++  + + + +  T   E+LI+         GS      K G  T + +     K +  
Sbjct: 805 EVWMTLPKVIVSNSTSSAENLINDPTRDDGSCGSDGNEEAKKGSTTNEGNGRRTKKGLSV 864

Query: 834 TATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKIS 893
               + K Y S  E RKL     S+V   K   E   G+    QP        I+  K+ 
Sbjct: 865 KERKQMKKYGSLGEARKLH----STVAVDKSSTEDTHGQ----QPVLPSLDGLIDASKLK 916

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           RG++ K K+   KY DQD+E+R + M  L
Sbjct: 917 RGKRAKAKRAMLKYMDQDDEDRELAMLAL 945


>gi|119586149|gb|EAW65745.1| serologically defined colon cancer antigen 1, isoform CRA_e [Homo
           sapiens]
          Length = 628

 Score =  446 bits (1148), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 270/686 (39%), Positives = 386/686 (56%), Gaps = 102/686 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E          
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE--------PL 164

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  +             V++A K  L                             L
Sbjct: 165 LTLERLTEI------------VASAPKGEL-----------------------------L 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFSGKGYIIQKREIKPCLEADKPVEDILT----YEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYL 415

Query: 475 -----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARR 505
                            N  E    +K     K            V+VDL+LSA+ANA++
Sbjct: 416 LSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 475

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 476 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 535

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 536 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 594

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQV 651
              +C+S AWD++++TSAWWVY HQV
Sbjct: 595 TMALCYSAAWDARVITSAWWVYHHQV 620


>gi|256080624|ref|XP_002576579.1| hypothetical protein [Schistosoma mansoni]
 gi|353229334|emb|CCD75505.1| hypothetical protein Smp_052790 [Schistosoma mansoni]
          Length = 1009

 Score =  444 bits (1142), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 262/725 (36%), Positives = 403/725 (55%), Gaps = 69/725 (9%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K+   T DV   V  ++ R++G R +N+YD+  KTY+ KL ++         +K +LL+
Sbjct: 1   MKLLYTTFDVMVSVSEIKNRILGYRVNNIYDVDNKTYLLKLASTKS------DDKTILLL 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG RLH T +   K   PSGF++KLRKHIR +++ D+ Q+G DR++    G   +A+++
Sbjct: 55  ESGSRLHITDFDWPKNIMPSGFSMKLRKHIRNKKIVDISQIGADRVVDIHIGYESSAYHL 114

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICR-VFERTTASKLHA 179
           I+ELY +GN+LLTD  FT+L LLR   D ++ +   +  +YPT  CR + E     K   
Sbjct: 115 IVELYDRGNMLLTDESFTILHLLRPRTDKNQNIRFAAHEKYPTTSCRQILECFRDLKDQK 174

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
           +L                   ++ N         KG  +      SN  + D      P+
Sbjct: 175 SL------------------KDIENFLIPLFQSSKGPWT------SNPQTCDS-----PS 205

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEV-NKLEDNAIQ-----VLVLAVAKFED 293
           +   L   L YG  + EH +     V   K+ ++ N  ED  +Q     ++ L V  F  
Sbjct: 206 INKTLSSELPYGNVIIEHCMR----VAQNKIKQMRNHKEDFQLQSEKTDLIELYVEHFAV 261

Query: 294 WLQDVISGDIV------PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE 347
            L+D++    +      P GYI       GK +  ++ G     Y+EF P +  Q+R + 
Sbjct: 262 VLRDILLEPFLCDRQATPHGYIF------GKSYQSSDEGLRN--YEEFHPFMFEQYRDKP 313

Query: 348 FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
            + F++F+ A+D ++SKIESQ+  +Q    E  A  K+  I  DQE R+  LK E +  +
Sbjct: 314 HLAFDSFNKAVDAYFSKIESQKTLEQISRNEQKASRKVENIKKDQERRLMLLKTEQELDM 373

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + A L+E N   VD  I+ +  AL+N++ W++L  +V++ ++  +P+A  I +L L+ + 
Sbjct: 374 RKAYLLEANRRLVDNIIIMINHALSNQIDWKELELIVEDAKQRDDPLACHIVELKLQTSQ 433

Query: 468 MSLLLSNNLDEMDDEEKTL-------PVEKVEVDLALSAHANARRWYELKKKQESKQEKT 520
             + L +  +   D ++TL          +V VD+ ++A  NAR++Y+ K+    K+EKT
Sbjct: 434 AVIRLKDPFESSSDVDETLVRSGNKDEYTEVVVDIDVNALTNARKYYDKKRAASKKEEKT 493

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           I    K  K+A     +++   KTVA I+ +RK  WFEKF WFISSENYLV++G D+QQN
Sbjct: 494 INVSRKVLKSAIHNAEIKMKTAKTVAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQQN 553

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIK-NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           E++VKRY+  GD++VHAD+HGAS+ +IK  H   + V      +AG   V  S AW S +
Sbjct: 554 EVLVKRYLKPGDLFVHADIHGASTVIIKARHLTSEEVDSPNHQEAGNMAVVLSSAWQSHV 613

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           +T AWWV+  QVSKTAP+GEYLT G+FMIRGKKN+LPP P   GFG++F+L E S+  H 
Sbjct: 614 LTRAWWVHHDQVSKTAPSGEYLTSGAFMIRGKKNYLPPCPFDYGFGIMFKLHEDSIAKHK 673

Query: 700 NERRV 704
            ERR+
Sbjct: 674 GERRI 678


>gi|341901167|gb|EGT57102.1| hypothetical protein CAEBREN_19463 [Caenorhabditis brenneri]
          Length = 920

 Score =  443 bits (1140), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 264/706 (37%), Positives = 383/706 (54%), Gaps = 81/706 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R    DV A    L++L GMR +NVYD+  KTY+ KL        S   EK ++L E
Sbjct: 1   MKNRFTLVDVIAATTELKKLQGMRVNNVYDIDNKTYLIKL--------SRTDEKAVILFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T +   K  TPS F++KLRKHI  +RL  +R +G+DR++   FG     + + 
Sbjct: 53  SGVRLHQTFHDWPKSQTPSSFSMKLRKHINQKRLTSIRVVGFDRLVELVFGTEDRENRLY 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GN++LTD E T+L +LR   D D  V    R +Y                    
Sbjct: 113 VELYDRGNVVLTDHELTILNILRVRTDKDTSVRWAVREKY-------------------- 152

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           T ++E                   S+E    + G   F+    +     DG   K+  L 
Sbjct: 153 TFTEE------------------ISEETANSRHGKFKFEDFAKAVSAIPDG---KEEQLG 191

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ----- 296
            ++ +    G  +++ I+   GL    K+S  NK + + I        KFED L+     
Sbjct: 192 RIVSQFTRCGNPVTKEILCKCGLKAEQKIS--NKSDLSGI------TEKFEDILKATEEI 243

Query: 297 -DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
            +++  +  P+G I            P+ + +  Q+Y EF P+ +    S+   +  +F 
Sbjct: 244 WEMVEEN--PKGVI-------SYTEVPSPTSAPIQLYQEFNPIPM-PLTSKFTKELPSFC 293

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            ++DEFYS+IE+Q+ EQ+    E  A  KL  +  DQ++R+  L+   ++   MA  I  
Sbjct: 294 ESVDEFYSRIETQKQEQKAINMEKQALKKLENVEKDQKDRIEALQMTQEQREHMANRIIL 353

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E V+ A+L +R ALAN+ SW+ +  M K   K G+PVA  ID    E N   + L   
Sbjct: 354 NQELVEKALLLIRSALANQFSWQTIEEMKKTAAKNGDPVAKSIDSFKFESNEFVMTLG-- 411

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
            D  DDE + L   KV +D++++A  NA+R +  KK    K +KT+ +  KA K A++K 
Sbjct: 412 -DPYDDEAEIL---KVPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKA 467

Query: 536 RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  + Q K V  +   RK  WFEKF WFISSE Y+V++GRDAQQNE++VK+Y+   D+Y+
Sbjct: 468 KSTLEQVKIVTEVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYM 527

Query: 596 HADLHGASSTVIKNHRPE--QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK 653
           HAD+ GASS VI+N   E  Q +PP TL +A    VC+S AW++ +  SAWWV P QVS+
Sbjct: 528 HADVRGASSVVIRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVRPEQVSR 587

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           TAPTGEYL  GSFMIRGKKNF+PP  L+MG G+LFR+DE S+  H+
Sbjct: 588 TAPTGEYLPSGSFMIRGKKNFMPPSQLVMGLGVLFRMDEESIERHV 633


>gi|348681953|gb|EGZ21769.1| hypothetical protein PHYSODRAFT_557667 [Phytophthora sojae]
          Length = 1063

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 283/770 (36%), Positives = 425/770 (55%), Gaps = 116/770 (15%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL-------SPKTYIFKLMNSSGVTESGE 52
           M K RM+  D+ A V  +R  +  MR +N+YD+       + KTYI KL           
Sbjct: 1   MKKTRMSIDDIRAMVGSIRANVQNMRVTNIYDVQGQGESGAAKTYILKLHQPP------- 53

Query: 53  SEKVLLLMESGVRLHTTAYARDKKN---TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
             KV LL+ESGVR HT+ YARD K     PS FT+KLRKH+R +RL  +RQL  DR++ F
Sbjct: 54  FPKVFLLLESGVRFHTSKYARDAKAGSALPSQFTMKLRKHLRGKRLSGLRQLEGDRVVDF 113

Query: 110 QFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF 169
            FG      ++ILELYA GNI+LTD ++ +L+LLR+HR D+  V +  +  YP ++    
Sbjct: 114 TFGQDALQCHLILELYASGNIVLTDGDYRILSLLRTHRFDE-NVKMAVKQVYPVQLLGDQ 172

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           E+  A +  A L                       A  +    ++  K+        +  
Sbjct: 173 EKQRAIQTPAQLA----------------------AFVDKWFVEQEAKAAVALPGKTQKK 210

Query: 230 NDGARAKQPTL--KTVLGEALGYGPALSEHIILDTGLVPNMKL---SEVNKLEDNAIQVL 284
                 KQ  L  ++  G   G GP + EH ++  G+ P +KL   +E + L D+ +  L
Sbjct: 211 KKAQTIKQLLLVKESTFG---GLGPVIIEHCLVRAGISPTLKLKNAAEFSALGDDKLAAL 267

Query: 285 VLAVAKFEDW-----LQD----------------VISGDIVPE----------------- 306
           +  +   E W     LQD                V +GD   E                 
Sbjct: 268 LAEIQ--EGWKLLERLQDEQTSVNGPVPVQNDDTVDAGDSDEEEAAPVAKAPSSASSQKC 325

Query: 307 GYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRS--REFVKFETFDAALDEFYSK 364
           G+I++++         +   ++ + ++EF P L  Q +   ++   F+TFD A+DE++S+
Sbjct: 326 GFIILKD---------SADENAPEQFEEFTPFLYAQHQQAHKKVKSFDTFDEAVDEYFSR 376

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
            E+  AE   ++ + AA +KL K+  +Q+ ++  L++  ++S + A+LIE N +DV+  +
Sbjct: 377 FEADTAEVAKQSAQLAAENKLAKLKKNQQQQLAQLREVQEQSFQHAQLIEANQQDVENVL 436

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
           L +R ALA+ M W  L  +V+ E+K GNPVA LI KL LE N +++LL +  D+ + E+ 
Sbjct: 437 LVIRSALASGMDWRGLEELVRYEQKNGNPVASLIHKLDLEHNRVAILLCDEEDDDEGEDG 496

Query: 485 TLPVEK-------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
                +       + +DL+LSA ANAR  Y  KKK   K +K   A  KA   AEK T+ 
Sbjct: 497 GDGTGEEDKQAHVIWIDLSLSALANAREIYTKKKKAGEKVKKATEATDKAIALAEKNTKK 556

Query: 538 QILQEKTVANISHMR-KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            + +++T  N+ + R K  WFEKF+WF+++E YLV++G+DA QNE++VKRY+ KGDVYVH
Sbjct: 557 TLEKQQTKRNVIYQRRKTLWFEKFHWFLTNEKYLVVAGKDAHQNELLVKRYLRKGDVYVH 616

Query: 597 ADLHGASSTVIKNH-----RPEQPVPPL---TLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
           ADLHGA++ +++NH     +  Q +PP+   TL QAGC +VC S AW S+++  A+WV+ 
Sbjct: 617 ADLHGAATCIVRNHATVKDKKTQELPPIPVATLEQAGCMSVCRSNAWTSQVIAGAYWVHA 676

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
            QVSKTAP GEYLT GSFMIRGKKN++ P  L MG  +LFR+DESS+ +H
Sbjct: 677 DQVSKTAPAGEYLTTGSFMIRGKKNYIQPSRLEMGLAVLFRIDESSISNH 726


>gi|354506443|ref|XP_003515270.1| PREDICTED: nuclear export mediator factor Nemf, partial [Cricetulus
           griseus]
          Length = 699

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 229/500 (45%), Positives = 321/500 (64%), Gaps = 42/500 (8%)

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP 305
           EA  YGPAL EH +++ G   N+K+ E  KLE   I+ +++ V K ED++++  + +   
Sbjct: 18  EAESYGPALIEHCLIENGFSGNVKVDE--KLESKDIEKILVCVQKAEDYMKE--TANFHG 73

Query: 306 EGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A+DEFY
Sbjct: 74  KGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKAVDEFY 129

Query: 363 SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDA 422
           SKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ VD 
Sbjct: 130 SKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDR 189

Query: 423 AILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN--LDEMD 480
           AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   L E +
Sbjct: 190 AIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEE 249

Query: 481 DEEKTLPVEK----------------------------VEVDLALSAHANARRWYELKKK 512
           D++    VE                             V+VDL+LSA+ANA+++Y+ K+ 
Sbjct: 250 DDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRY 309

Query: 513 QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
              K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENYL+I
Sbjct: 310 AAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLII 369

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
            GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S
Sbjct: 370 GGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYS 428

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DE
Sbjct: 429 AAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDE 488

Query: 693 SSLGSHLNERRVRGEEEGMD 712
           S +  H  ER+VR ++E ++
Sbjct: 489 SCIWRHRGERKVRAQDEDIE 508


>gi|149051344|gb|EDM03517.1| rCG61611 [Rattus norvegicus]
          Length = 899

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 228/503 (45%), Positives = 321/503 (63%), Gaps = 36/503 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ V + ED+L+   
Sbjct: 17  LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLESKDIEKILVCVQRAEDYLEK-- 72

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 73  TANFNGKGYII-QKREVKPSLDANKPAEDILTYEEFHPFLFSQHLQCPYIEFESFDKAVD 131

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 132 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 191

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN--LD 477
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   L 
Sbjct: 192 VDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLLRNPYLLS 251

Query: 478 EMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWYEL 509
           E +D +    +E                             V+VDL+LSA+ANA+++Y+ 
Sbjct: 252 EEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDH 311

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENY
Sbjct: 312 KRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENY 371

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           L+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   + P+PP TL +AG   +
Sbjct: 372 LIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGD-PIPPRTLTEAGTMAL 430

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C+S AWD++++TSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF+
Sbjct: 431 CYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFK 490

Query: 690 LDESSLGSHLNERRVRGEEEGMD 712
           +DES +  H  ER+VR ++E M+
Sbjct: 491 VDESCVWRHRGERKVRVQDEDME 513


>gi|73962860|ref|XP_851229.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Canis
           lupus familiaris]
          Length = 1077

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 231/508 (45%), Positives = 321/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREV----KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E  LP  K                   V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  140 bits (352), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   LIGMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAILAELNASLIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHARAAE 162


>gi|281604208|ref|NP_001164057.1| serologically defined colon cancer antigen 1 [Rattus norvegicus]
          Length = 1065

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 228/503 (45%), Positives = 321/503 (63%), Gaps = 36/503 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ V + ED+L+   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLESKDIEKILVCVQRAEDYLEK-- 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TANFNGKGYII-QKREVKPSLDANKPAEDILTYEEFHPFLFSQHLQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN--LD 477
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   L 
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHITMLLRNPYLLS 417

Query: 478 EMDDEEKTLPVEK----------------------------VEVDLALSAHANARRWYEL 509
           E +D +    +E                             V+VDL+LSA+ANA+++Y+ 
Sbjct: 418 EEEDGDGDGSIENSDAEAPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDH 477

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENY
Sbjct: 478 KRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENY 537

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           L+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   + P+PP TL +AG   +
Sbjct: 538 LIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGD-PIPPRTLTEAGTMAL 596

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           C+S AWD++++TSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF+
Sbjct: 597 CYSAAWDARVITSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFK 656

Query: 690 LDESSLGSHLNERRVRGEEEGMD 712
           +DES +  H  ER+VR ++E M+
Sbjct: 657 VDESCVWRHRGERKVRVQDEDME 679



 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 103/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L N           K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNANLLGMRVNNVYDVDNKTYLIRLQNPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHARAAE 162


>gi|344273431|ref|XP_003408525.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor
           NEMF-like [Loxodonta africana]
          Length = 1000

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 231/511 (45%), Positives = 325/511 (63%), Gaps = 50/511 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++ ++
Sbjct: 183 LKRVLNPLLPYGPALIEHCLMENGFSGNVKVGE--KFESKDIEKVLVCLQKAEDYMKTML 240

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
             +   +GYI+ + +      P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 241 --NFSGKGYIIQKREV----KPSLEIDKPTEDILTYEEFHPFLFSQHLQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGSIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 --------------NLDEMDDE-------------------EKTLPVEKVEVDLALSAHA 501
                         ++++ + E                    K LPV+   VDL+LSA+A
Sbjct: 415 LLSEEEDDDVDGDISIEKNETEPLKGKKKKQKNKQLQKPQKNKPLPVD---VDLSLSAYA 471

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF 
Sbjct: 472 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 531

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL
Sbjct: 532 WFISSENYLIIGGRDQQQNEIIVKRYLTAGDIYVHADLHGATSCVIKNPTGE-PIPPRTL 590

Query: 622 NQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
            +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+
Sbjct: 591 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 650

Query: 682 MGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 651 MGFSFLFKVDESCVWRHWGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHARAAE 162


>gi|297736760|emb|CBI25961.3| unnamed protein product [Vitis vinifera]
          Length = 321

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 231/363 (63%), Positives = 260/363 (71%), Gaps = 50/363 (13%)

Query: 488 VEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN 547
           V+KVEVDLALSAHANARRWYE KK+QE+KQEKTI AH KAFKAAEKK+ +Q+ Q      
Sbjct: 8   VDKVEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQVGE--- 64

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
                 +HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD+Y+HADLHGAS    
Sbjct: 65  ----HYIHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDLYIHADLHGASR--- 117

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
                             CFTVCHSQAWDSK+VTSAWWVYPHQVSKTA TGEYLTVGSFM
Sbjct: 118 ------------------CFTVCHSQAWDSKIVTSAWWVYPHQVSKTASTGEYLTVGSFM 159

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIE 727
           IRGK NFLPPHPL+MGFGLLF LDESSLGSHLNERRVRGEEEG  DFE++   K NSD E
Sbjct: 160 IRGK-NFLPPHPLMMGFGLLFCLDESSLGSHLNERRVRGEEEGAQDFEENESLKGNSDSE 218

Query: 728 SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV 787
           SEK++TDEK  AES S+ +                   P+  + I  G  S+I DI+   
Sbjct: 219 SEKEETDEKRTAESKSIMD-------------------PSTHQPILEGF-SEINDISGIH 258

Query: 788 AAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAE 847
            + V PQLEDLIDRAL LGS + S  K+ +ET+Q DL EE  H +R A VR+KPY S   
Sbjct: 259 VSSVNPQLEDLIDRALELGSNTASGKKYALETSQVDL-EEHNHEDRKAKVREKPYTSYQS 317

Query: 848 RRK 850
           +RK
Sbjct: 318 QRK 320


>gi|403277932|ref|XP_003930596.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 1077

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/508 (45%), Positives = 322/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G + N+K+ E  KLE   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFLGNVKVDE--KLETKDIEKILVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E+    +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKRE----TKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHVRAAE 162


>gi|55640675|ref|XP_509934.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
           troglodytes]
 gi|410223614|gb|JAA09026.1| nuclear export mediator factor [Pan troglodytes]
 gi|410263654|gb|JAA19793.1| nuclear export mediator factor [Pan troglodytes]
 gi|410263656|gb|JAA19794.1| nuclear export mediator factor [Pan troglodytes]
 gi|410299008|gb|JAA28104.1| nuclear export mediator factor [Pan troglodytes]
 gi|410354861|gb|JAA44034.1| nuclear export mediator factor [Pan troglodytes]
          Length = 1076

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/508 (45%), Positives = 323/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREIKPSLEADKPVEDIFT----YEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|269849764|sp|O60524.4|NEMF_HUMAN RecName: Full=Nuclear export mediator factor NEMF; AltName:
           Full=Antigen NY-CO-1; AltName: Full=Serologically
           defined colon cancer antigen 1
          Length = 1076

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|426376840|ref|XP_004055190.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Gorilla
           gorilla gorilla]
          Length = 1077

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRADEADDVKFAVRERYPLDHARAAE 162


>gi|397523542|ref|XP_003831788.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
           paniscus]
          Length = 1076

 Score =  432 bits (1112), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|194375658|dbj|BAG56774.1| unnamed protein product [Homo sapiens]
          Length = 999

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/508 (45%), Positives = 322/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 141 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 196

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +       Y+EF P L +Q     +++FE+FD 
Sbjct: 197 TSNFSGKGYIIQKREIKPCLEADKPVED----ILTYEEFHPFLFSQHSQCPYIEFESFDK 252

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 253 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 312

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 313 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 372

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 373 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 432

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 433 KYYDHKRYAAKKTQKTVEAAGKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 492

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 493 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 551

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 552 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 611

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 612 SFLFKVDESCVWRHQGERKVRVQDEDME 639



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 71/170 (41%), Gaps = 51/170 (30%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG+R+HTT +   K   PS F +K                                   
Sbjct: 53  KSGIRIHTTEFEWPKNMMPSSFAMK----------------------------------- 77

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
                  GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 78  -------GNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 120


>gi|32130516|ref|NP_004704.2| nuclear export mediator factor NEMF [Homo sapiens]
 gi|119586148|gb|EAW65744.1| serologically defined colon cancer antigen 1, isoform CRA_d [Homo
           sapiens]
 gi|148922399|gb|AAI46282.1| Serologically defined colon cancer antigen 1 [synthetic construct]
 gi|151556560|gb|AAI48733.1| Serologically defined colon cancer antigen 1 [synthetic construct]
          Length = 1076

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/508 (45%), Positives = 323/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E+    +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYII-QKREI---KPCLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|311245467|ref|XP_001924665.2| PREDICTED: nuclear export mediator factor NEMF [Sus scrofa]
          Length = 1076

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 232/508 (45%), Positives = 321/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K E+ +Q   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEECMQTTS 240

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           S +   +GYI+ + +    L  D P  +       Y+EF P L +Q     +++FE+FD 
Sbjct: 241 SFN--GKGYIIQKREVKPSLEVDKPTVD----ILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 --------------NLDEMDDEEKTLPVEK----------------VEVDLALSAHANAR 504
                         N ++ + E      +K                V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDINTEKNESEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E MD
Sbjct: 654 SFLFKVDESCVWRHRGERKVRVQDEDMD 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 101/170 (59%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLCAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH++ RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKGRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHARAAE 162


>gi|355693257|gb|EHH27860.1| hypothetical protein EGK_18167 [Macaca mulatta]
          Length = 1077

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 230/505 (45%), Positives = 320/505 (63%), Gaps = 38/505 (7%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 597 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 657 FKVDESCVWRHRGERKVRVQDEDME 681



 Score =  139 bits (350), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|296214948|ref|XP_002753922.1| PREDICTED: nuclear export mediator factor NEMF isoform 1
           [Callithrix jacchus]
          Length = 1077

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/513 (45%), Positives = 323/513 (62%), Gaps = 39/513 (7%)

Query: 233 ARA-KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
           ARA K   LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ + K 
Sbjct: 175 ARAPKGELLKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKILVCLQKA 232

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
           ED+++   + +   +GYI+ Q + +       +       Y+EF P L +Q     +++F
Sbjct: 233 EDYMK--TTSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEF 289

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      E
Sbjct: 290 ESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGE 349

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++L
Sbjct: 350 LIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTML 409

Query: 472 LSN--------------------NLDEMDDEEKTLPVEK------------VEVDLALSA 499
           L N                    N  E    +K     K            V+VDL+LSA
Sbjct: 410 LRNPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSA 469

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEK 559
           +ANA+++Y+ K+    K +KT+ A  KAF++AEKKT+  + + +TV +I   RKV+WFEK
Sbjct: 470 YANAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEK 529

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP 
Sbjct: 530 FLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPR 588

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  
Sbjct: 589 TLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSY 648

Query: 680 LIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 649 LMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|301773240|ref|XP_002922036.1| PREDICTED: serologically defined colon cancer antigen 1-like
           [Ailuropoda melanoleuca]
          Length = 1077

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 230/508 (45%), Positives = 321/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + + ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLKQAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYII-QKREI---KPSLEVDKPTEDIFTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E   P  K                   V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHRGERKVRVQDEDME 681



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 101/170 (59%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   LIGMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP    R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVGHARAGE 162


>gi|119586150|gb|EAW65746.1| serologically defined colon cancer antigen 1, isoform CRA_f [Homo
           sapiens]
          Length = 1010

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/508 (45%), Positives = 323/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E+    +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYII-QKREI---KPCLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 593

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 594 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 653

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 654 SFLFKVDESCVWRHQGERKVRVQDEDME 681



 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|281343421|gb|EFB19005.1| hypothetical protein PANDA_010972 [Ailuropoda melanoleuca]
          Length = 1058

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 230/508 (45%), Positives = 321/508 (63%), Gaps = 44/508 (8%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + + ED+++   
Sbjct: 164 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLKQAEDYMK--T 219

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 220 TSNFSGKGYII-QKREI---KPSLEVDKPTEDIFTYEEFHPFLFSQHSQCPYIEFESFDK 275

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 276 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 335

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 336 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 395

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E   P  K                   V+VDL+LSA+ANA+
Sbjct: 396 LLSEEEDDDVDGDLGVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 455

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 456 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 515

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +A
Sbjct: 516 SSENYLIIGGRDQQQNEIIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEA 574

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 575 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 634

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 635 SFLFKVDESCVWRHRGERKVRVQDEDME 662



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 67/150 (44%), Positives = 91/150 (60%), Gaps = 8/150 (5%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           LIGMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS
Sbjct: 2   LIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLLESGIRIHTTEFEWPKNMMPS 53

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F +K RKH+++RRL   +QLG DRI+ FQFG    A+++I+ELY +GNI+LTD E+ +L
Sbjct: 54  SFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEYLIL 113

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
            +LR   D+   V    R RYP    R  E
Sbjct: 114 NILRFRTDESDDVKFAVRERYPVGHARAGE 143


>gi|297736751|emb|CBI25952.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 232/388 (59%), Positives = 277/388 (71%), Gaps = 28/388 (7%)

Query: 488 VEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN 547
           V+KVEVDLALSAHANARRWYE KK+QE+KQEKTI AH KAFKAAEKK+ +Q+ Q      
Sbjct: 8   VDKVEVDLALSAHANARRWYEQKKRQENKQEKTIIAHEKAFKAAEKKSCVQLSQVGE--- 64

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH--ADLHGASST 605
                 +HWFEKFNWFISSENYLVISGRDAQQN+MIVKRYMSKGD+++H  +  + +SST
Sbjct: 65  ----HYIHWFEKFNWFISSENYLVISGRDAQQNKMIVKRYMSKGDLFIHFKSTNNNSSST 120

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
            +   R       + LN +  F VCHSQAWDSK+VTSAWWVYPHQVSKTA TGEYLTVGS
Sbjct: 121 FLFFQRHLNTCCRIPLNYSSLFIVCHSQAWDSKIVTSAWWVYPHQVSKTASTGEYLTVGS 180

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSD 725
           FMIRGK NFLPPHPL+MGFGLLF LDESSLGSHLN+RRVRGEEEG  DFE++   K NSD
Sbjct: 181 FMIRGK-NFLPPHPLMMGFGLLFCLDESSLGSHLNDRRVRGEEEGAQDFEENESLKGNSD 239

Query: 726 IESEKDDTDEK---------------PVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDK 770
            ESEK++TDEK               P+ E  S  +SAH   + +N  +++  E P E++
Sbjct: 240 SESEKEETDEKRTAESKSIMDPSTHQPILEGFSEISSAHNELTTSNVGSINLPEVPLEER 299

Query: 771 TISNGIDSK-IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDK 829
            + NG DS+ I DI+    + V PQLED IDRAL LGS + S  K+ +ET+Q DL EE  
Sbjct: 300 NMLNGNDSEHIDDISGIHVSSVNPQLEDFIDRALELGSNTASGKKYALETSQVDL-EEHN 358

Query: 830 HVERTATVRDKPYIS-KAERRKLKKGQG 856
           H +R A VR+KPY S + E   +  GQG
Sbjct: 359 HEDRKAKVREKPYTSYQREVIYISHGQG 386


>gi|410962212|ref|XP_003987668.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Felis catus]
          Length = 1080

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 233/511 (45%), Positives = 324/511 (63%), Gaps = 47/511 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDLEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
           A+DEFYSKIE Q+ +    Q       A  KL+ +  D ENR+  L+Q  +      ELI
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQVWYXKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELI 354

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL 
Sbjct: 355 EMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLR 414

Query: 474 N----NLDEMDDEEKTLPVEK----------------------------VEVDLALSAHA 501
           N    + +E DD +  + VEK                            V+VDL+LSA+A
Sbjct: 415 NPYLLSEEEDDDVDGDITVEKNETEAPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYA 474

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF 
Sbjct: 475 NAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFL 534

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           WF+SSENYL+I GRD QQNEMIVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL
Sbjct: 535 WFVSSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCVIKNPTGE-PIPPRTL 593

Query: 622 NQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
            +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+
Sbjct: 594 TEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLM 653

Query: 682 MGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 654 MGFSFLFKVDESCIWRHRGERKVRVQDEDME 684



 Score =  139 bits (350), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHARAAE 162


>gi|255083452|ref|XP_002504712.1| predicted protein [Micromonas sp. RCC299]
 gi|226519980|gb|ACO65970.1| predicted protein [Micromonas sp. RCC299]
          Length = 1219

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 280/782 (35%), Positives = 401/782 (51%), Gaps = 103/782 (13%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPK-TYIFKLMNSSGVTESGESEKVLL 58
           M K + N+ D+AA    LR +++G   +N++DL  K T + K   S G TESGE EK  +
Sbjct: 1   MPKQKFNSHDIAASCATLRAKVLGAWLANIFDLDDKRTLLLKFTRSGGATESGEGEKTTV 60

Query: 59  LMESGVRLHTTAYARDKK-NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L+ESG R HTT+YAR++K + PS F  KLR H+R +RL  V Q+G DR + F FG G   
Sbjct: 61  LLESGARFHTTSYARERKADQPSKFNAKLRMHLRGKRLNGVNQMGADRAVAFTFGAGDTE 120

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           H+++LELYAQGNI+L D E+ +LTLLR HRDD + + ++  H YP E  R   R  A+ L
Sbjct: 121 HHLVLELYAQGNIVLCDREWRILTLLRPHRDDARSLVLLGNHPYPRERFRSHVRVDAAAL 180

Query: 178 HAALTS--SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA 235
            AAL      +P   +P +           ++E                         R 
Sbjct: 181 VAALEGRHDDDPLGPKPIEGEGVEGEGIEGAREK------------------------RR 216

Query: 236 KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWL 295
              T++  L +A G+GP + +      G+V     +    L+D  +  L  ++   +DW 
Sbjct: 217 APGTVREALCKAFGFGPPVVDRAARMAGIVDGS--AAKTPLDDAQVTALGASLGAIDDWF 274

Query: 296 QDVISGDIVPEGYILMQNKH--LGKDHPPTESGSSTQIYDEFCPLLLN------QFRSRE 347
           + V  G + P G +  + K    G+D     S S    +++F P   +      QF  + 
Sbjct: 275 EGVTDGRVEPRGVVTWRIKEGESGEDGATASSPSLDADFEDFSPFPADDVPPPAQFDPKV 334

Query: 348 FVKFET---FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           F   E    FDAALD F++  E++R   + +   +AA  KL K+  DQE RV  L++E +
Sbjct: 335 FRTTEISGGFDAALDLFFASFEARRDRSRREKSANAAAKKLEKVRRDQEARVRALEKERE 394

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
                A LIEYNL  VDA + AV  ALA  M+W+DL  M+KEE +AGNPVA L+  L L 
Sbjct: 395 SQELAATLIEYNLTQVDAVLAAVNGALAGGMAWDDLTLMIKEEARAGNPVARLVKTLDLP 454

Query: 465 RNCMSLLLSNNLDEMDDEEKTL------------------PVEK---------VEVDLAL 497
           +N +++ L N+LD  DDE                      P  +         VE+DLAL
Sbjct: 455 KNKVTVTLKNHLDVDDDEGDDDGDDGDGGDADDVGEGDAKPRSRRLKRDGGVSVELDLAL 514

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL---QILQEKTVANISHMRKV 554
            AHANAR  ++ KKK ++K  KT+  + +A  AAEKK +    ++  + T   I+  R  
Sbjct: 515 GAHANAREHFDRKKKHDAKHGKTLAQNKRAVAAAEKKAKEAGARMASKGTGMGIARARVP 574

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ 614
            WFEKF+WFI++EN LV+S RDA Q + +V +Y+   D +VHAD  GA  T++K      
Sbjct: 575 EWFEKFHWFITTENCLVLSARDAAQADALVVKYLGPDDAFVHADSPGAPVTIVKAPPVRS 634

Query: 615 P------------------------------VPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           P                              VPP++L QAG   +C S AWDS+ V SA+
Sbjct: 635 PALPEAEASMSRLSLSATRVVGSSADGWCGGVPPVSLIQAGAACLCRSAAWDSRHVVSAF 694

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNERR 703
           W+ P  V K  P G+ L  G     G K +LPP PL+MGFG +F L DE  + +H+ +R 
Sbjct: 695 WIPPENVRKVTPDGDPLAPGVVWHVGAKTYLPPAPLVMGFGCVFLLRDEDGVRAHVGDRT 754

Query: 704 VR 705
           V+
Sbjct: 755 VK 756


>gi|195151655|ref|XP_002016754.1| GL21904 [Drosophila persimilis]
 gi|194111811|gb|EDW33854.1| GL21904 [Drosophila persimilis]
          Length = 966

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 295/823 (35%), Positives = 428/823 (52%), Gaps = 140/823 (17%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T D+   V  L++L+G R + +YD+  KTY+F+L  +                 
Sbjct: 1   MKTRFSTYDIICGVAELQKLVGWRVNQIYDIDNKTYLFRLQGNG---------------- 44

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
                     A  K   PSGF++KLRKH++ +RLE + QLG DRI+ FQFG G       
Sbjct: 45  ----------AWPKNVAPSGFSMKLRKHLKNKRLEKISQLGVDRIVDFQFGSG------- 87

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
                       D+ + VL  L      D+G  I++           FE TT   L    
Sbjct: 88  ------------DAAYHVLLELY-----DRGNLILTD----------FELTTLYIL---- 116

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT-- 239
                       + + +G N+  A +E    ++     D  + S  +  D      P   
Sbjct: 117 ------------RPHTEGENIRFAVREKYPIERAKHQDD--EFSLDHLADLLEKAPPGVH 162

Query: 240 LKTVLGEALGYGPALSEHIIL----DTGLVPNMKLSEVNKLED----------------- 278
           L+ +L   L  GPA+ EH++L    +  ++P    S V+  E                  
Sbjct: 163 LRQILMPVLNCGPAVVEHVLLLHDLENRVMPQGTTSNVDGPEQPLKKAQNSKKQRKERNL 222

Query: 279 -------------NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTES 325
                        N +  L +AV +  + ++D  +G+   +GYI+    H+ K+  P E 
Sbjct: 223 QNAKSEVKVFDMVNDLPTLKMAVKRALNLIKDGNNGE--SKGYII----HV-KEEKPIED 275

Query: 326 GSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
           G         EF P L  QF+  EF  FE+F  A+DEFYS  ESQ+ + +   +E  A  
Sbjct: 276 GKIEYFLRNIEFQPFLFAQFKDNEFSMFESFLEAVDEFYSTQESQKIDMKTLQQEREALK 335

Query: 384 KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
           KL+ +  D   R+  L +  D   K AELI  N   VD AI AV+ A+A++++W D+  +
Sbjct: 336 KLSNVKKDHAKRLEELTKVQDDDKKKAELITSNQSLVDNAIRAVQSAIASQLTWPDIHEL 395

Query: 444 VKEERKAGNPVAGLIDKLYLERNCMSLLLSN---NLDEMDDEEKTLPVEKVEVDLALSAH 500
           VKE +  G+ VA  I +L LE N +SL+LS+   + +E D E+ T+    V+VDLALSA 
Sbjct: 396 VKEAQTNGDVVASSIKQLKLEINHISLILSDPYVSQNEKDCEDLTV----VDVDLALSAW 451

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKF 560
           ANARR+Y+LK+    K++KT+ A  KA K+AE+KT+  + + +T++NI   RKV WFEKF
Sbjct: 452 ANARRYYDLKRSAAQKEQKTVDASQKALKSAERKTQQTLKEVRTISNIVKARKVFWFEKF 511

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WFISSEN+LVI GRDAQQNE+IVKRYM   D+YVHA++ GASS VI+N   E  +PP T
Sbjct: 512 YWFISSENFLVIGGRDAQQNELIVKRYMRPKDIYVHAEIQGASSVVIRNTTGED-IPPKT 570

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L +AG   + +S AWD+K+VT+++WV   QVSKTAPTGEYL  GSFMIRGKKNFLP   L
Sbjct: 571 LVEAGSMAISYSVAWDAKVVTNSYWVTSDQVSKTAPTGEYLATGSFMIRGKKNFLPSCHL 630

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEG-------MDDFEDSGHHKENSDIESEK--D 731
            MG  LLF+L+ES +  HL ER+VR  ++         +D  D   ++ N D+E+++   
Sbjct: 631 TMGLSLLFKLEESFVARHLGERKVRSIDDAPFENSFKQNDLTDMLLNEVNEDLETQQVVS 690

Query: 732 DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISN 774
             +E    ++   PN+       T    V  +    EDK I++
Sbjct: 691 IPEEDHRNDNSDFPNTEVKIEHDTGRITVKPNSLNVEDKPITD 733


>gi|119586146|gb|EAW65742.1| serologically defined colon cancer antigen 1, isoform CRA_b [Homo
           sapiens]
          Length = 1067

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 231/507 (45%), Positives = 315/507 (62%), Gaps = 51/507 (10%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV- 298
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMKTTS 240

Query: 299 -ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
             SG + P    +      G              Y+EF P L +Q     +++FE+FD A
Sbjct: 241 NFSGKVAPCILTIYCCDLFG--------------YEEFHPFLFSQHSQCPYIEFESFDKA 286

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 287 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 346

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N   
Sbjct: 347 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYL 406

Query: 475 -----------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARR 505
                            N  E    +K     K            V+VDL+LSA+ANA++
Sbjct: 407 LSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 466

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 467 YYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 526

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 527 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 585

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 586 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 645

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E M+
Sbjct: 646 FLFKVDESCVWRHQGERKVRVQDEDME 672



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|308480173|ref|XP_003102294.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
 gi|308262220|gb|EFP06173.1| hypothetical protein CRE_05887 [Caenorhabditis remanei]
          Length = 917

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 258/714 (36%), Positives = 380/714 (53%), Gaps = 73/714 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R    DV A    L++L GMR +NVYD+  KTY+ KL        S   EK ++L E
Sbjct: 1   MKNRFTLVDVIAATTELKKLQGMRVNNVYDIDNKTYLIKL--------SRTDEKAVILFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T +   K  TPS F++KLRKHI  +RL  +R +G+DR++   FG     + + 
Sbjct: 53  SGVRLHQTFHEWPKSQTPSSFSMKLRKHINQKRLTSIRVVGFDRLVELVFGTDDRENRLY 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           +ELY +GN++LTD+E  +L +LR   D D  V    R +Y                    
Sbjct: 113 VELYDRGNVVLTDNELIILNILRVRTDKDTSVRWAVREKY-------------------- 152

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           T ++E +        E G    +     +GG   GK   L +                  
Sbjct: 153 TFNEEAE-------RERGGVTMDDVTRAIGGIPEGKEEQLGR------------------ 187

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
            V+ +    G  +++ I+   G+   MK+S    +E      L     + E   + V   
Sbjct: 188 -VMSQLTKCGNPITKEILAACGMKAEMKVSRKTDVETEFRGKLEEIRKETEHVWEQV--- 243

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           +  P G+I        +   PT      Q+Y+EF P+ +  F S+   +  +F  ++DEF
Sbjct: 244 EEQPRGFI-----SYTEILSPT--SQPIQLYNEFNPIPM-PFTSKLQKELPSFCESVDEF 295

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           YS+IE+Q+ EQ+    E  A  KL  +  DQ+ R+  L+   ++   MA  I  N + V+
Sbjct: 296 YSRIETQKQEQKAVNMEKQALKKLENVEKDQKERIEALQLTQEQREHMANRIILNQDLVE 355

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
            A+L +R ALAN+ SW+ +  M K     G+ VA  ID    E N   + L    D  D+
Sbjct: 356 KALLLIRSALANQFSWQTIEEMRKNAAMNGDLVAKSIDSFRFENNEFFMNLG---DPYDE 412

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
           E + L   KV +D++++A  NA+R +  KK    K +KT+ +  KA K A++K +  + Q
Sbjct: 413 EAELL---KVPIDISMNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQ 469

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
            K V  +   RK  WFEKF WFISSE Y+V++GRDAQQNE++VK+Y+   D+Y+HAD+ G
Sbjct: 470 VKIVTEVKKSRKAMWFEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRG 529

Query: 602 ASSTVIKNHRPE--QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
           ASS +I+N   E  Q +PP TL +A    VC+S AW++ +  SAWWV+P+QVS+TAPTGE
Sbjct: 530 ASSVIIRNKSFEESQEIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPNQVSRTAPTGE 589

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
           YL  GSFMIRGKKNF+PP  L+MG G+LFR+D+ S+  H    + +  EE  D+
Sbjct: 590 YLPSGSFMIRGKKNFMPPSQLVMGLGVLFRMDDESIERHAALEKAKKSEENPDE 643


>gi|268571229|ref|XP_002640975.1| Hypothetical protein CBG11722 [Caenorhabditis briggsae]
          Length = 894

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 254/685 (37%), Positives = 370/685 (54%), Gaps = 86/685 (12%)

Query: 24  MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFT 83
           MR +NVYD+  KTY+ KL            EK ++L ESGVRLH T +   K  TPS F+
Sbjct: 1   MRVNNVYDIDNKTYLIKLTRPD--------EKAVILFESGVRLHQTFHDWPKSQTPSSFS 52

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
           +KLRKHI  +RL  +R +G+DRI+   FG     + + +ELY +GN++LTD E T+L +L
Sbjct: 53  MKLRKHINQKRLTSIRVVGFDRIVELIFGTEDRENRLYVELYDRGNVILTDHEMTILNIL 112

Query: 144 RSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVS 203
           R   D D  V    R +Y                    T S + +  +P           
Sbjct: 113 RVRTDKDTSVRWAVREKY--------------------TCSGDAEQQDP----------- 141

Query: 204 NASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTG 263
                     +G KS D+ +   ++  DG   K   L  +L      G  +++ I+   G
Sbjct: 142 ----------RGFKSDDVIRRI-QSIPDG---KDEQLGRILSGFTKCGNPITKEILSKIG 187

Query: 264 LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPT 323
           L    KL+  + + + + +   +  A  E W  D +  D  P+G+I     +L  + P  
Sbjct: 188 LKWEQKLNAKSDVAEISAKFEEIKKATEEIW--DTVEHD--PKGFI----SYL--EIPSA 237

Query: 324 ESGSSTQIYDEFCPL-------LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
            S +  +IY EF P+       L  + RS        F  ++DEFYS+IE+Q+ EQ+   
Sbjct: 238 TSSTPIEIYSEFNPISMPLTLKLQKELRS--------FCESVDEFYSRIETQKQEQKAVN 289

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
            E  A  KL  +  DQ+ R+  L+   ++   MA  I  N E V+ A+L +R ALAN+ S
Sbjct: 290 MEKQALKKLENVEKDQKERIEALQLTQEQREHMANRIILNQELVEKALLLIRSALANQFS 349

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLA 496
           W+ +  M K     G+PVA  ID    E N   + L    D  D+E + L   KV +D++
Sbjct: 350 WQTIEEMRKSAAANGDPVAKSIDSFKFENNEFFMKLG---DPYDEEAELL---KVPIDIS 403

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHW 556
           ++A  NA+R +  KK    K +KT+ +  KA K A++K +  + Q K V  +   RK  W
Sbjct: 404 MNASKNAQRHFVDKKSAAEKVKKTVASSEKAIKNAQEKAKCTLEQVKIVTEVKKSRKTMW 463

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP- 615
           FEKF WFISSE Y+V++GRDAQQNE++VK+Y+   D+Y+HAD+ GASS +I+N   E+  
Sbjct: 464 FEKFRWFISSEGYIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVIIRNKSFEESM 523

Query: 616 -VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            +PP TL +A    VC+S AW++ +  SAWWV+P QV++TAPTGEYL  GSFMIRGKKNF
Sbjct: 524 EIPPKTLTEAAQMAVCYSNAWEATVTASAWWVHPSQVTRTAPTGEYLPSGSFMIRGKKNF 583

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHL 699
           +PP  L+MG G+LFR+DE S+  H+
Sbjct: 584 MPPSQLVMGLGILFRMDEESIERHV 608


>gi|328723949|ref|XP_001945685.2| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Acyrthosiphon pisum]
          Length = 987

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 270/742 (36%), Positives = 398/742 (53%), Gaps = 71/742 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T D+   V  +++  GMR   VYD+  KTY+FK            +EK +LL+E
Sbjct: 1   MKTRFSTLDIMCVVNEIQKYKGMRLQRVYDIDHKTYLFKF--------QLNNEKCVLLLE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T Y   K   PS F++KLRKH+  +RLE + Q+G+DRII  QFG+G  A++VI
Sbjct: 53  SGVRLHVTNYEWTKNEAPSSFSMKLRKHLSNKRLEKLTQMGFDRIIDLQFGVGEAAYHVI 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY                        DKG  I++   Y   +  +    T  +     
Sbjct: 113 LELY------------------------DKGNIILADKDYI--MINILRPHTEDEKQKFF 146

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                P++   +++N    +      +        K F  S   N               
Sbjct: 147 VKEVYPNSRPKNRLNPPTEDSLIQILKTAKHSTNLKKFIFSNFPN--------------- 191

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
                 L YG  L EH+++  G   N ++  E N   D  IQ L+      E +L ++ +
Sbjct: 192 -----CLDYGNCLLEHMLISGGFPTNTRIGIEFNI--DTDIQKLMNCFCIAEKFLDNITT 244

Query: 301 GDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
              + EG+I+ + ++ L  D    E  ++     E+ P L  Q +      +E+F+ A+D
Sbjct: 245 ---LKEGFIIQKIDQQLLPDGIMKELCTN----QEYHPFLFAQHQKLPSKTYESFNEAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYS +ESQ+ + +   +E  A  KL  I  D E R+  L+   D     AELI  NL+ 
Sbjct: 298 EFYSNLESQKYDVKCMQQEKGAVKKLQNIVKDHEERLKKLQDTQDEHKFKAELITNNLDL 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKE-ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
           VD  I  VR A+A ++ W+++  M+++   +       ++  L L  N ++L L +  +E
Sbjct: 358 VDNTIQFVRQAVAKQLHWDEIWDMIRQLNFEDDGCTYAIVKNLKLSVNHITLQLFDPYNE 417

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
            +  E+    + +++DL  SA  NA R+Y  KK+   K++KTI + S   K AEKKT+  
Sbjct: 418 ENKNEEN--SQLIDIDLGQSAFGNAERYYGSKKQSAIKEKKTIDSSSTVLKMAEKKTKQT 475

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           +   + VA+I+ +RK +WFEKF WFISSENYLVI+GRDA QNE+IVKRYM   DVYVHA 
Sbjct: 476 LKDMQVVASINKVRKTYWFEKFYWFISSENYLVIAGRDAHQNEVIVKRYMKSSDVYVHAG 535

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPT 657
             GA++ +IKN    QPVPP TLN+A    + +S +W  K+ + +A+WV P QVSKTAPT
Sbjct: 536 FSGATTVIIKN-PINQPVPPATLNEAAVMAISYSVSWTMKINLQNAFWVKPEQVSKTAPT 594

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG-EEEGMDDFED 716
           GEYLT GSFMIRGKKN+LP   LI+G   LF+L++SS+  H NER+++G E EG+D+ E 
Sbjct: 595 GEYLTTGSFMIRGKKNYLPATHLILGLSFLFKLEDSSIPRHANERKIKGIECEGLDNIEQ 654

Query: 717 SGHHKENSDIESEKDDTDEKPV 738
           +    EN   E++ D+  EK +
Sbjct: 655 NNDEFENIPSENDSDEDLEKNI 676



 Score = 48.1 bits (113), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query: 892 ISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           + RGQ+GKLK++KEKY DQDE+ER +R+ LL  S
Sbjct: 775 LKRGQRGKLKRIKEKYKDQDEDEREMRIKLLQSS 808


>gi|452981583|gb|EME81343.1| hypothetical protein MYCFIDRAFT_114319, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 1087

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 320/962 (33%), Positives = 468/962 (48%), Gaps = 128/962 (13%)

Query: 14  EVKCL-----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VKC+       L  +R +NVYDLS + ++ K              +  LL++SG R H 
Sbjct: 9   DVKCIAHELSNSLTTLRLANVYDLSTRIFLLKFQKPE--------HREQLLVDSGFRCHL 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T +AR     PS F  +LRK ++TRR   V+Q+G DR+I  QF  G  A+ + LE YA G
Sbjct: 61  TKFARATAAAPSPFVARLRKFLKTRRCTAVKQIGTDRVIELQFSDG--AYRLFLEFYAGG 118

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
           NI+LT                D  + I++  R  +E     +     K + +L  + E  
Sbjct: 119 NIVLT----------------DNELTILALLRSVSEGAEHEQYRQGLKYNLSLRQNHE-- 160

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG--------ARAKQPTL 240
                        V + +KE L          L K   K   +          +A  P  
Sbjct: 161 ------------GVPSLTKEWL-------KESLQKTVEKQQAEAQKPGKKIKKKAGDPLR 201

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K +      + P L +H +  +G+   ++   V  LE + +   VL   K  + +   I+
Sbjct: 202 KALAVTTTQFPPVLLDHALHVSGVDRELQPERV--LEHDELLEKVLQALKQAESVVAEIT 259

Query: 301 GDIVPEGYILMQNKHLGK--DHPPTESGSSTQIYDEFCPLLLNQF---RSREFVKFETFD 355
              V +GYIL + K   K  D   T       +Y+ F P    Q    +S  F++++ F+
Sbjct: 260 SQPVAKGYILGKRKQSSKQEDTDGTADEGKDVMYEHFHPFKPAQLAEDQSFVFLEYDGFN 319

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+DEF+S IE Q+ E + + +ED A  ++     +QE R+  L+Q  +  V+ A+ IE 
Sbjct: 320 VAVDEFFSSIEGQKLESRLQEREDNAKKRIEHARKEQEQRIEGLQQVQELHVRKAQAIEA 379

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN 474
           N+E V+ A  AV   +A  M W D+  +++ E+   NPVA LI   L L  N ++LLLS 
Sbjct: 380 NVERVEEATAAVNGLIAQGMDWADIGSLIENEQARHNPVAELIKLPLKLHENTITLLLSE 439

Query: 475 ---------------NLDEMDDEEKTLPVEK------VEVDLALSAHANARRWYELKKKQ 513
                            D  D + +T P         V++DLA SA +NAR++Y+ K+  
Sbjct: 440 IGRDADEEMDVTDSEPSDSEDGDAETAPARAEDKRLTVDIDLAASAWSNARQYYDQKRTA 499

Query: 514 ESKQEKTITAHSKAFKAAEK----KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
            SKQE+T  A  KA K+ E+    K +  + QEK V  +  +RK  WFEKF +FISS+ Y
Sbjct: 500 ASKQERTEAASKKALKSTEQNVMAKLKKDLKQEKDV--LRPVRKQFWFEKFIYFISSDGY 557

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCF 627
           LV++GRD  QNEM+ +R++ KGDVYVHADL+GASS VIKN  H P  P+PP TL QAG  
Sbjct: 558 LVLAGRDDLQNEMLYRRHLRKGDVYVHADLNGASSVVIKNSPHTPCAPIPPSTLAQAGDL 617

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AWDSK V SAWWV   QVSKTA TGEYL VGSF+IRGKKNFLPP  L++GFG++
Sbjct: 618 VVCRSSAWDSKAVMSAWWVNAEQVSKTADTGEYLAVGSFIIRGKKNFLPPARLLLGFGVM 677

Query: 688 FRLDESSLGSHLNERRVRGEE-EGMDDFEDSGHHKENS-----DIESEKDDTDEKPVAES 741
           F++ E S   H+  R +R +  +   D  D+    E+S       +   DD  + P A  
Sbjct: 678 FQISEESKARHVKHRLLRQDSYQATPDLTDAETIAESSAAGEPSDDGSDDDFPDAPPAPR 737

Query: 742 LSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDR 801
           +   +   P  ++      D  E     ++  N + S  FD A +         +D  D 
Sbjct: 738 IEDEDDGFPDRTYGTPDYNDDEE--EHSRSQRNPLQSSAFD-AHDNDDHEDEDGDDEKDE 794

Query: 802 ALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
             G    S +  + G E T+  ++  D+  +   +      +S  ER+ L K +      
Sbjct: 795 ETGSVEGSTNGAELGREDTESTVTPADQEEQPETSA----PLSNKERKALAKFE------ 844

Query: 862 PKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMAL 921
                     KD   QP    +  +I+   + RG++GK KK+ EKY DQDEE+R I M L
Sbjct: 845 ----------KDKKPQPSQKAKAKQIKA--LVRGKRGKAKKLAEKYADQDEEDREIAMRL 892

Query: 922 LA 923
           L 
Sbjct: 893 LG 894


>gi|145509741|ref|XP_001440809.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408037|emb|CAK73412.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1071

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 249/719 (34%), Positives = 393/719 (54%), Gaps = 89/719 (12%)

Query: 3   KVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           K+R+   D+ A V  L+ +LIG R SN+Y++  KTY+FK         S +  K  L++E
Sbjct: 5   KIRLTALDIMALVTELKQKLIGTRLSNIYNIDAKTYVFKF--------SLQESKSYLVIE 56

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G+R + +    +K   PSGFT+K RK +R+RRLE + Q+G +R+++F FG   + +Y+I
Sbjct: 57  NGLRFNLSD-TIEKNKVPSGFTMKFRKFLRSRRLESIEQIGVERVVVFTFGREDHTYYLI 115

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY+QGNI+L D ++ ++ L R H +  + V +     YP      FE T  + L    
Sbjct: 116 LELYSQGNIILADKDYRIIQLTRQH-EFSENVKVAPNEIYP------FEYTATNYLEKFD 168

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           TS +       +K                                     G + K+   K
Sbjct: 169 TSMERIQKVISEK------------------------------------QGQKLKEVVFK 192

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVI 299
            V        P L + +  D     NM  +E  VN+ +         +V K  D+  D I
Sbjct: 193 LV--------PCLHQSLTDDIIQQLNMNQNEKIVNQFD---------SVKKVVDFAMDYI 235

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           +       Y      +L     P ++    + +D F       ++ +  V+  TF+ A+ 
Sbjct: 236 NKYRAQTQY----KGYLCAKEAPKDAEQKPKFFD-FAADQPAYYQGKYVVETPTFNQAVH 290

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           +++  ++  R E+  ++ ED A+ K   I  DQ +R+  L++E D  +  A LI+ N+ D
Sbjct: 291 QYFLVVD--RQEENKQSIEDIAWKKFENIKQDQMSRIQKLQEEQDEYIMKAGLIQENIND 348

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           V A I  ++  + N + W+ + RM+ + +K GNP++ +I  + L++N +++LL N  DE 
Sbjct: 349 VQAIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMNLKQNKVTILLGNKDDEY 408

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
            D      + ++E+D+  SA+ NAR++YE KKK   K+ KT  A  +A K AEK    +I
Sbjct: 409 SD------LIQIEIDITQSAYQNARKYYESKKKNRDKEIKTKEAVEQALKQAEKTALKEI 462

Query: 540 LQEKT-VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            +EK  +  + + RK +WFEKF WFISS+ YLVISG+D QQNEMIVKRYM+K D+Y+HAD
Sbjct: 463 EREKNKIQKVQNQRKKYWFEKFFWFISSDGYLVISGKDVQQNEMIVKRYMNKDDIYMHAD 522

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           ++G++ST++KN   E P+P  T+ QA   T+C S++WD+K+V SAWWV+  QVSK+APTG
Sbjct: 523 IYGSASTIVKNP-SEGPIPEATIMQAATATICRSKSWDAKIVVSAWWVHASQVSKSAPTG 581

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER--RVRGEEEGMDDFE 715
             +  GSFMI GKKNF+ P  L MG  +L++LD+ S+  H  ER  ++R E+  +D+ E
Sbjct: 582 MNIPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQDSIKRHEEERKKKLREEQSQVDESE 640


>gi|397618049|gb|EJK64734.1| hypothetical protein THAOC_14501 [Thalassiosira oceanica]
          Length = 1217

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 293/784 (37%), Positives = 426/784 (54%), Gaps = 94/784 (11%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPK----------TYIFKLMNSSG----- 46
           K R +  DVA+    L+R ++G + +N+YD S             Y+FKL + SG     
Sbjct: 5   KTRFDGLDVASMCSHLKRTMMGFKLANIYDGSSLGVSGGSDSKGVYMFKLADPSGGSAAT 64

Query: 47  ------VTESG---ESEKVLLLMESGVRLHTTAY---ARDKKNTPSGFTLKLRKHIRTRR 94
                  TE G   ES++ +LL+ESGVR H T +   +      PS F +KLRKH+R  R
Sbjct: 65  GKSNTSSTEDGGEAESKRAMLLIESGVRFHPTTHFSQSSSSSAMPSPFAMKLRKHLRNLR 124

Query: 95  LEDVRQLG-YDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSH------- 146
           LE+V QLG  DR++ F+FG G   H++ILELY+QGN++LTD E+ +L LLR+H       
Sbjct: 125 LENVTQLGNLDRVVDFRFGSGSYTHHLILELYSQGNLVLTDGEYRILALLRTHEYEVKDG 184

Query: 147 -RDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA-LTSSKEPDANEPDKVNEDGNNVSN 204
            +D+ +GV    +      +  V+  T A+ L     T +   D N+   ++    N   
Sbjct: 185 KKDEREGV---EKEEVKVRVGNVYPVTLATTLSMDDRTENSGEDGNKSGLLSMSAENAFE 241

Query: 205 ASKENL-GGQKGGKSFDLSKNSNKNSNDGARAK---QPTLKTVL---GEAL-GYGPALSE 256
            +K  L   Q+  ++ +  ++  K    G + +      LK +L   G  +  YGP+L E
Sbjct: 242 WAKSELVATQQRARTVNSQQHGGKGKKKGKKKQLDENLVLKALLLRPGSGVYHYGPSLVE 301

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ-----------DVISGDIVP 305
           H IL  GL P +KL+  N +E        L    + D L+           ++ S D   
Sbjct: 302 HCILFAGLEPTLKLNADN-IE------YTLPSGSWGDLLESLRDEGSVVLGNLQSPDSAG 354

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQIYD--------EFCPLLLNQFRSREFVKFETFDAA 357
            GYIL + K   +     ++ + T   +        EF P LL Q +++  + + TF  A
Sbjct: 355 SGYILYKPKETKESLQEQKNDAQTAPQNPHSDKTLLEFQPHLLIQHKNQPHLTYSTFATA 414

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
            DEF+S + SQ+A  +  A E AA  +L KIH DQ  RV  L +E D+    A L+E + 
Sbjct: 415 TDEFFSNLSSQKAAARADAAESAARERLAKIHADQARRVDGLVREQDKFRDAARLVELHA 474

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           +DVD A+  +  AL + M W+ L ++V  E+   NP+A LI KL L+++ + L L + +D
Sbjct: 475 DDVDRALGVINGALQSGMDWDQLEQLVTVEQGNENPIALLIHKLVLDKDEIMLALPD-ID 533

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
             +DE +  P+  V V++  SAH NAR  Y + +  + K+ KTI A   A KAAE K + 
Sbjct: 534 NWEDESEAPPIVIVTVNIKESAHGNARAKYAVYRASKEKERKTIEASETALKAAEAKAKQ 593

Query: 538 QILQ---EKTVANISHMRKVH------WFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
           Q+ +    K    +S   +V+         KF WFI+S+NYLV++G+DAQQNE +VKRY+
Sbjct: 594 QLAEAQKRKARKQLSVNSQVYQGNLQFCLNKFAWFITSDNYLVVAGKDAQQNEQLVKRYL 653

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQ--------PVPPLTLNQAGCFTVCHSQAWDSKMV 640
             GD Y+HA++HGA++ V++  R  +        P+    L +AG FT+C S AW SKMV
Sbjct: 654 RPGDAYLHAEIHGAATCVLRAKRRRRKDGKTQVMPLSDQALREAGTFTICRSSAWSSKMV 713

Query: 641 TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHL 699
           TSA+WV  HQVSKTAPTGEYLTVGSFMIRG+KNFLP   L MG G+LFRL D+ S+  H 
Sbjct: 714 TSAYWVESHQVSKTAPTGEYLTVGSFMIRGRKNFLPASTLEMGVGVLFRLGDDVSVARHA 773

Query: 700 NERR 703
           NERR
Sbjct: 774 NERR 777


>gi|332237024|ref|XP_003267700.1| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Nomascus leucogenys]
          Length = 1058

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 267/746 (35%), Positives = 390/746 (52%), Gaps = 119/746 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNVMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I        I L D    VLT        D    I++  R+ T+                
Sbjct: 113 I--------IELYDRGNIVLT--------DYEYVILNILRFRTD---------------- 140

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                            + ++V  A +E         +  L           + AK   L
Sbjct: 141 -----------------EADDVKFAVRERYPLDHARAAEPLLTLERLTEIVASTAKGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+DE
Sbjct: 240 SNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           FYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ V
Sbjct: 299 FYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN------ 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N      
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSE 418

Query: 475 --------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWYE 508
                         N  E    +K     K            V+VDL+LSA+ANA+++Y+
Sbjct: 419 EEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYD 478

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHW--FEKFNWFISS 566
            K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+W  F K    +S 
Sbjct: 479 HKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWXVFSKLLGRLSQ 538

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           EN+L   G D Q+ E+++                     VI    P +P+PP TL +AG 
Sbjct: 539 ENHLNPGGEDLQRTEVLI------------------LCIVI----PGEPIPPRTLTEAGT 576

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  
Sbjct: 577 MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 636

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMD 712
           LF++DES +  H  ER+VR ++E M+
Sbjct: 637 LFKVDESCVWRHRGERKVRVQDEDME 662


>gi|145494650|ref|XP_001433319.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400436|emb|CAK65922.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1070

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 248/717 (34%), Positives = 391/717 (54%), Gaps = 85/717 (11%)

Query: 3   KVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           K+R+   D+ A V  L+ +LIG R SN+Y++  KTY+FK         S +  K  L++E
Sbjct: 5   KIRLTALDIMALVTELKQKLIGTRLSNIYNIDAKTYVFKF--------SLQESKSYLVIE 56

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G+R + +    +K   PSGFT+K RK +R+RRLE + Q+G +R+++F FG   + +Y+I
Sbjct: 57  NGLRFNLSD-TIEKNKVPSGFTMKFRKFLRSRRLESIEQIGVERVVVFTFGREDHTYYLI 115

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY+QGNI+L D ++ ++ L R H +  +   +     YP      FE T  + L    
Sbjct: 116 LELYSQGNIILADKDYRIIQLTRQH-EFSENAKVAPNEIYP------FEYTATNYLEKFD 168

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
           TS +                +     E  G +     F L                P L 
Sbjct: 169 TSMER---------------IQKVVSEKAGQKLKEVVFKLV---------------PCLH 198

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
                      +L++ II    +  N K+  VN+ E+         V K  D+  + I+ 
Sbjct: 199 Q----------SLTDDIIQQLQMNQNEKI--VNQFEN---------VKKVVDYAMEYINK 237

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
                 Y      +L     P ++    + +D F       ++ +  ++  TF+ A+ ++
Sbjct: 238 YRAQTQY----KGYLCAKEAPKDAEQKPKFFD-FAADQPAYYQGKYVIETPTFNEAVHQY 292

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           +  ++  R E   ++ ED A+ K   I  DQ +R+  L+ E D  +  A LI+ N+ DV 
Sbjct: 293 FLVVD--RQEDNKQSIEDIAWKKFENIKQDQMSRIQKLQSEQDEYIMKAGLIQENINDVQ 350

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
           A I  ++  + N + W+ + RM+ + +K GNP++ +I  + L++N +++LL N  DE  D
Sbjct: 351 AIIDIIQKMIENGIPWDKIQRMINDSKKEGNPLSNMIGGMNLKQNKVTILLGNKEDEYSD 410

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
                 + ++E+D+  SAH NAR++YE KKK   K+ KT  A  +A K AEK    +I +
Sbjct: 411 ------LIQIEIDITQSAHQNARKYYESKKKNRDKEIKTKEAVEQALKQAEKTALKEIER 464

Query: 542 EKT-VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           EK  +  + + RK +WFEKF WFISS+ YLVISG+D QQNEMIVKRYM+K D+Y+HAD++
Sbjct: 465 EKNKIQKVQNQRKKYWFEKFFWFISSDGYLVISGKDVQQNEMIVKRYMNKDDIYMHADIY 524

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           G++ST++KN   E P+P  T+ QA   T+C S++WD+K+V SAWWV+  QVSK+APTG  
Sbjct: 525 GSASTIVKNPN-EGPIPEATIMQAATATICRSKSWDAKIVVSAWWVHASQVSKSAPTGMN 583

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER--RVRGEEEGMDDFE 715
           +  GSFMI GKKNF+ P  L MG  +L++LD+ S+  H  ER  ++R E+  +D+ E
Sbjct: 584 IPAGSFMIYGKKNFIYPPRLEMGCTILYQLDQDSIKRHEEERKKKLREEQSQVDESE 640


>gi|301106825|ref|XP_002902495.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262098369|gb|EEY56421.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1051

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 272/775 (35%), Positives = 420/775 (54%), Gaps = 95/775 (12%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL-------SPKTYIFKLMNSSGVTESGE 52
           M K RM+  D+ A V  +R  ++ MR +N+YD+       + KTYI KL           
Sbjct: 1   MKKTRMSIDDIHAMVGSIRANVVNMRVTNIYDVQGQGDSGAAKTYILKLHQPP------- 53

Query: 53  SEKVLLLMESGVRLHTTAYARDKKN---TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
             KV LL+ESGVR HT+ YARD K     PS FT+KLRKH+R +RL  + QL  DR++ F
Sbjct: 54  FPKVFLLLESGVRFHTSKYARDAKAGNALPSQFTMKLRKHLRGKRLSALTQLEGDRVVDF 113

Query: 110 QFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF 169
            FG      ++ILELYA GNI+LTD              D + ++++  HR+   +    
Sbjct: 114 TFGQDALKCHLILELYASGNIILTDG-------------DYRILSLLRTHRFDENVKMAV 160

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           ++    +L       K+      +++ E  N            Q+  K+        +  
Sbjct: 161 KQEYPVQLLG--DQEKQRGIQTTEQLTEFVNRWFE--------QQEAKAAIALPGKTQKK 210

Query: 230 NDGARAKQPTL--KTVLGEALGYGPALSEHIILDTGLVPNMKL---SEVNKLEDNAIQVL 284
                 KQ  L  ++  G   G GP + EH ++   + P +K+   +E   L ++ +  L
Sbjct: 211 KKAQTIKQLLLVKESTFG---GLGPVIIEHCLVRAAISPTLKIKNAAEFTTLGEDKLAAL 267

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLG-------KDHPPTESGSST-------- 329
           +  + +    L+ +        G + +Q+           +D  P     ST        
Sbjct: 268 LAEIQEGWKLLERLQDEQTSVNGPVPLQSDDTADTGDSDEEDAAPVAKDPSTTSQKCGFI 327

Query: 330 -----------QIYDEFCPLLLNQ-FRSREFVK-FETFDAALDEFYSKIESQRAEQQHKA 376
                      + ++EF P L  Q  ++ + VK F+TFD A+DE++S+ E++ AE   ++
Sbjct: 328 ILKDVAGENAPEQFEEFTPYLYAQHLQAYKKVKSFDTFDEAVDEYFSRFEAETAEVAKQS 387

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
            + AA +KL K+  +Q+ ++  L++  ++S + A+LIE N +DV+  +L +R ALA+ M 
Sbjct: 388 AQLAAENKLAKLKKNQQQQLAQLREVQEQSFQDAQLIEANQQDVENVLLVIRSALASGMD 447

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK------ 490
           W  L  +V+ E+K GNPVA LI +L LE N +++LL ++ ++  ++      E+      
Sbjct: 448 WRGLEELVRYEQKNGNPVASLIHQLDLEHNRVAILLCDSDEDDYEDGGDGTGEEDKKAHV 507

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           + +DL+LSA ANAR  Y  KKK  +K +K   A  KA   AEK T+  + +++T  N+ +
Sbjct: 508 IWIDLSLSALANAREIYTKKKKAGAKVKKATEATDKAIALAEKNTKKTLEKQQTKRNVIY 567

Query: 551 MR-KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
            R K  WFEKF+WF+++E YLV++G+DA QNE++VKRY+ KGDVYVHADLHGA++ +++N
Sbjct: 568 QRRKTLWFEKFHWFLTNEKYLVVAGKDAHQNELLVKRYLRKGDVYVHADLHGAATCIVRN 627

Query: 610 HRP-------EQP-VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           H         E P +P  TL QAGC +VC S AW S+++  A+WV+  QVSKTAP GEYL
Sbjct: 628 HATVKDKKTQELPSIPVATLEQAGCMSVCRSNAWTSQVIAGAYWVHADQVSKTAPAGEYL 687

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNE---RRVRGEEEGMDD 713
           T GSFMIRGKKN++ P  L MG  +LFR+D+S +G+H  +   R +R  E   DD
Sbjct: 688 TTGSFMIRGKKNYIQPSRLEMGLAILFRIDDSCIGNHARQGEGRDLRVAEGPEDD 742


>gi|453084374|gb|EMF12418.1| hypothetical protein SEPMUDRAFT_149103 [Mycosphaerella populorum
           SO2202]
          Length = 1130

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 263/729 (36%), Positives = 384/729 (52%), Gaps = 88/729 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L  +R SNVYDLS + ++ K      +          L+++SG R H T + R     PS
Sbjct: 21  LTSLRLSNVYDLSSRIFLLKFQKPDQIRHQ-------LIVDSGFRCHLTQFVRATAAQPS 73

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK +RTRR   VRQ+G DR+I   F      + + LE YA GN++LTD E+ +L
Sbjct: 74  PFVARLRKFLRTRRCVSVRQIGTDRVIELCFSHAEGVYRLFLEFYAGGNVILTDHEYHIL 133

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLRS  + ++        +Y  E                             + N  G 
Sbjct: 134 GLLRSVNEGEEHEQYRVGLKYDLE----------------------------KRQNYAGE 165

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP---TLKTVLGEALGYGP-ALSE 256
            V + +K  L       +  L + +N+ ++     K+    +L+  L       P  L +
Sbjct: 166 GVPDLTKVWLKEALQRTATKLVEQANREASKKKVVKKKKGDSLRKALAVTTTQFPPVLLD 225

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVL-VLAVAKFEDWLQDVISGDIVPEGYILMQNKH 315
           H I    +   ++  +V   E+   QVL  L +A  E  ++D+ S  I  +GYIL Q K 
Sbjct: 226 HAIFVAKVDRELEAQQVVDSEELLDQVLSALRIA--EGVMEDITSQPIA-KGYILAQRKK 282

Query: 316 LGKDHPPTES-----------GSSTQIYDEFCPLLLNQFR---SREFVKFETFDAALDEF 361
            G   P                +S  +YD+F P    Q     +  F++ E F+ A+DEF
Sbjct: 283 -GMATPEKAEEEGEEEGRDADSTSGLMYDDFHPFKPAQLAEDPANVFLEHEGFNIAVDEF 341

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           +S IE Q+ E +   K+++A  ++     +QE R++ L+Q  +  V+ A+ IE N+E V+
Sbjct: 342 FSSIEGQKLESKLAEKQESARKRIEHAKKEQEQRINGLQQVQELHVRKAQAIEANVERVE 401

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLS----NNL 476
            A  AV   +A  M WED+ R++++E+K  NPVA LI   L L  N M+LLLS    ++ 
Sbjct: 402 EATAAVNGLIAQGMDWEDIGRLIEQEQKRHNPVAELIKLPLKLHENTMTLLLSELGADDE 461

Query: 477 DEMDDEEKTLPVEK------------------VEVDLALSAHANARRWYELKKKQESKQE 518
           DE  +E  + P +                   +++DLA SA  NAR++Y+ K+    KQE
Sbjct: 462 DEEANETDSEPSDSEDEGTNAAQVKHDAKRLTIDIDLAGSAWVNARQYYDQKRTAAVKQE 521

Query: 519 KTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           KT+ A  KA K+ E+K     +  + QEK V  +  +RK  WFEKF +F+SS+ YLV++G
Sbjct: 522 KTVLASKKAIKSTEQKVMATLKKDLKQEKDV--LRPVRKQFWFEKFIYFVSSDGYLVLAG 579

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH-RPEQPVPPLTLNQAGCFTVCHSQ 633
           +DAQQNE++ +RY+ KGDVY+HADL GA+S +IKN   PE P+PP TL Q G   VC S 
Sbjct: 580 KDAQQNEILYRRYLKKGDVYIHADLDGAASVIIKNKLNPEDPIPPSTLAQGGDLAVCTSS 639

Query: 634 AWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           AWDSK V SAWWV   QVSKTAPTGEYL  G F++RGKKNFLPP  L++GFG++F++ E 
Sbjct: 640 AWDSKAVMSAWWVNADQVSKTAPTGEYLAAGGFIVRGKKNFLPPAKLLLGFGVMFQISEE 699

Query: 694 SLGSHLNER 702
           S   H+  R
Sbjct: 700 SKAQHVKHR 708


>gi|378733722|gb|EHY60181.1| translation factor [Exophiala dermatitidis NIH/UT8656]
          Length = 1147

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 277/798 (34%), Positives = 425/798 (53%), Gaps = 110/798 (13%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R ++ DV   AAE+     L  +R SN+YDLS + ++FK        + G  E+  L
Sbjct: 1   MKQRFSSLDVKVIAAELAA--SLTSLRVSNIYDLSSRIFLFKF------AKPGRREQ--L 50

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           L++SG R H T+++R     PS F  +LRK++++RR+ +V Q+G DR+I   F  G   +
Sbjct: 51  LVDSGFRCHLTSFSRTAATAPSAFVSRLRKYLKSRRVTNVAQIGTDRVIEITFSEGQ--Y 108

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            + LE +A GNI++TD++  VL L R   + D+ V +    +Y  +  + F         
Sbjct: 109 RMFLEFFAAGNIIVTDADLNVLALQRQVSEGDEDVDVKLGGKYILDAKQNFHGI------ 162

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKE--NLGGQKGGKSFDLSKNSNKNSNDGARAK 236
           A +T         P++V E        +K+   +GG+K  ++        K  +D     
Sbjct: 163 APVT---------PERVKETLEKAVQRAKDAKEVGGKKAKRA--------KGGDD----- 200

Query: 237 QPTLKTVLGEALGYG-PALS----EHIILDTGLVPNMKLSEVNKLEDNAIQVLVL-AVAK 290
                  L +AL +G P  S    +H+  + G+    K  +V  L D  +   V+ A+ +
Sbjct: 201 -------LRKALSFGFPEFSAHLLDHVFNEIGIDAAAKAEDV--LNDGQLMEAVMKALNR 251

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHP--------PTESGSSTQIYDEFCPLLLNQ 342
            ++  + + +G    +GYI+ + K    + P        P+ SG    +Y++F P   +Q
Sbjct: 252 AKEIFESLGTGQ--SKGYIIAKIKSPSSEAPQEAEAQTQPS-SGRDNLLYEDFHPFRPSQ 308

Query: 343 FRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL 399
           F  +     ++F+ F+  +DEFYS IESQ+ E +   +E+AA  KL     +QE R+  L
Sbjct: 309 FEGKPDLRILEFDGFNRTVDEFYSSIESQKLESRLTEREEAARKKLQAAKEEQEKRLGAL 368

Query: 400 KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           +   +  V+ A+ IE N   V+ A  AV   +   M W D+ ++++ E+K GN VA +I 
Sbjct: 369 QHVQELHVRKAQAIEANTHRVEEACAAVNGLIGQGMDWVDIGKLIENEQKRGNVVAQMIK 428

Query: 460 -KLYLERNCMSLLLSN-------------------NLDEMDDEEKTLPVE----KVEVDL 495
             L LE N ++LLL                     N DE   ++ T P       +++DL
Sbjct: 429 LPLKLEENTVTLLLDEPGFNEESEEDEPDETDEEENSDEDTRKKPTKPATDKRLAIDIDL 488

Query: 496 ALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHM 551
            LS  ANAR++YE KK    K+++T+ A + A K+AE+K     +  + QEK  A +   
Sbjct: 489 GLSPWANARQYYEQKKNAAVKEKRTLEAATMALKSAERKIEADLKRGLKQEK--AALRPA 546

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH- 610
           RK  WFEKF +FISS+ YLVI G+DAQQNE++ +RY+ +GDVYVHADL GASS ++KN+ 
Sbjct: 547 RKQFWFEKFLYFISSDGYLVIGGKDAQQNELLYRRYLKRGDVYVHADLQGASSVIVKNNP 606

Query: 611 -RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
             P+ P+PP TL+QAG  TVC S AWDSK V  AWWV   QVSKTAP+GEYLT G F+IR
Sbjct: 607 RTPDAPIPPSTLSQAGALTVCTSSAWDSKAVMGAWWVNAEQVSKTAPSGEYLTTGGFIIR 666

Query: 670 GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESE 729
           G KN LPP  L++GFG+L+ + E S  +H   R  R E     + E   +      +E +
Sbjct: 667 GHKNLLPPSQLLLGFGVLWLISEESKVNHGKHRLERTESMLPGEAEALANDARGLSLEEQ 726

Query: 730 KDDTDEKPVAE-SLSVPN 746
           + D    P++E S +VP+
Sbjct: 727 EQDL---PISEQSRAVPD 741


>gi|145351275|ref|XP_001420008.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580241|gb|ABO98301.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1069

 Score =  407 bits (1047), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 272/722 (37%), Positives = 377/722 (52%), Gaps = 88/722 (12%)

Query: 23  GMRCSNVYDLSP----KTYIFKLMNSSGVTE--------SGESEKVLLLMESGVRLHTTA 70
           G   +N YD+      K ++ KL   SG           + ESEK+L+ +ESG R+HTT 
Sbjct: 24  GCWLANAYDVDATSGNKKFLLKLNKPSGAVARDARADATTAESEKILVFIESGTRVHTTR 83

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM-NAHYVILELYAQGN 129
           Y R K   P+ FT KLR   + +RL D RQLG DR I F FG G  N  ++I+ELY+QGN
Sbjct: 84  YERGKTTAPTAFTAKLRARAKGKRLTDARQLGRDRAIDFTFGGGGENECHLIVELYSQGN 143

Query: 130 ILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDA 189
           ++L D  +TV+ LLRS+RD    V I+  H+YP E  + F+    ++             
Sbjct: 144 VILCDGNYTVVALLRSYRDGGD-VNILPNHQYPLERLKGFQLGGYTR------------- 189

Query: 190 NEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG 249
              D V+     V    +E +GG                    AR    TL+  L  A G
Sbjct: 190 --EDVVSALARGVLATEEETMGGD-------------------ARRAPATLREALCRAFG 228

Query: 250 YGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV- 304
           Y PA+++H+ L      G   ++ LSE        +  L  AV   E W + V +GD+V 
Sbjct: 229 YSPAIADHVALTASIEHGSNASLPLSEA------CVDRLTAAVRDLESWFEGVTTGDVVA 282

Query: 305 -PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE---------TF 354
            P     M     G D          +I+D+F P  L Q   R   KFE          F
Sbjct: 283 VPNVCTKMDANADGTDE--------IEIFDDFSPFSLKQNEGRPTRKFELPKGLDPVCAF 334

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           D A+DE++  +E+Q      +  E  A  KL K   DQ++RV  L++E ++  + A LIE
Sbjct: 335 DHAVDEYFIALEAQSQILARRKAEAQALAKLEKSLKDQKSRVEQLEREREKEEQRAVLIE 394

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           YN E VD AI AV  ALA+ MSW +L  M+ EER+ GNPVAG+I  L L  N +++ L+N
Sbjct: 395 YNHEAVDTAIDAVNSALASGMSWPELEAMINEERRLGNPVAGMIKSLDLANNQITITLAN 454

Query: 475 NLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           +LDE+D+ +      K   V VDL LSAHANA   +  KKK   K  KT+ A SKA  AA
Sbjct: 455 HLDEVDEVDAASGKRKRVAVGVDLGLSAHANASMRFAAKKKHAEKFSKTVDAQSKAVAAA 514

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           E K +  + +    ++I+  R+  WFEKFNWFI+SEN LV+  +DA Q EM++ RYM  G
Sbjct: 515 EAKAKAAMEKAANGSSIARARQPLWFEKFNWFITSENCLVLQAKDATQAEMLITRYMLPG 574

Query: 592 DVYVHADLHGASSTVIKNHRPE----QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           D +VHA++  A  T++K   P     + VP  +L QAG   +C S AW+S+ V SAWW  
Sbjct: 575 DAFVHAEVPQAPVTLVKP--PPGVDVRAVPAYSLVQAGAAVMCRSSAWNSRAVKSAWWTS 632

Query: 648 PHQVSKTAPT-GEYLTVG-SFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
             +VSK +P  G+ L  G + +    K FLP   L+MGFGL+F + E +  +H NER VR
Sbjct: 633 SERVSKISPVAGDALPPGVTHVAHADKQFLPHAQLVMGFGLMFVVSEKNAEAHKNERLVR 692

Query: 706 GE 707
            +
Sbjct: 693 SD 694


>gi|407928362|gb|EKG21221.1| protein of unknown function DUF814 [Macrophomina phaseolina MS6]
          Length = 1094

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 306/978 (31%), Positives = 482/978 (49%), Gaps = 155/978 (15%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L  +R +N+YDLS + ++FK    +         +  LL++SG R H T++AR    TPS
Sbjct: 21  LCSLRVANIYDLSTRIFLFKFQKPN--------HREQLLIDSGFRCHLTSFARSTPATPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F ++LRK ++TRR+  + Q+G DRII  QF  G+  + + LE YA GNI+LTD+E  +L
Sbjct: 73  PFVVRLRKFLKTRRVTSITQIGTDRIIELQFSDGL--YRLYLEFYAGGNIILTDNELNIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
           +LLRS    D+G                +ER      +           N  ++ N  G 
Sbjct: 131 SLLRSV---DEGPE--------------YERVKVGIKY-----------NLTERQNYGG- 161

Query: 201 NVSNASKENL--GGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG-YGPALSEH 257
            V   +KE +  G QK      L +          +  +  L+  L  ++    P L +H
Sbjct: 162 -VPELTKERVREGLQKA-----LDRQQEATDKKAKKRGKDALRKALAVSITELPPMLVDH 215

Query: 258 IILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
               TG   ++K  +V  LED ++   L+ A+A+ ++   ++ S +I  +GYI+   K  
Sbjct: 216 AFASTGFDSSLKPEQV--LEDESLLDNLMKALAEAKNVDAEITSAEIA-KGYIVA--KKT 270

Query: 317 GKDHPP--TESGSSTQ------IYDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKI 365
           G+  P   +E GS  +      +Y++F P    QF +     F++FE F+  +DEF+S I
Sbjct: 271 GQPAPTEVSEEGSEEKAPAEKLLYEDFHPFKPKQFEADPTLTFLEFEGFNKTVDEFFSSI 330

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           E Q+ E + + +E+ A  KL +   +   R+  L++  + +++ AE I+ N++ V  A++
Sbjct: 331 EGQKLESRLQEREENAKRKLEQAKQEHLKRLGGLQRAQELNIRKAEAIQANVDRVQEAVM 390

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDE-- 482
           AV   +   M W ++ R+++ E+  GNPVA +I   L L  N ++LLL     E D+E  
Sbjct: 391 AVNGLIDKGMDWIEIDRLIEREQTHGNPVAQMIKVPLKLRENTVTLLLDEPGVEEDEEDF 450

Query: 483 -----------------EKTLPVEK-------VEVDLALSAHANARRWYELKKKQESKQE 518
                            ++  P  K       +++DL LS  ANA+ +++ KK   +K+E
Sbjct: 451 EGSETESEPSDDEEEQQQRKKPAVKPQDNRLTIDIDLGLSPWANAKTYFDQKKTAAAKEE 510

Query: 519 KTITAHSKAFKAAEKKTRLQIL----QEKTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           +T+ A  KA K+ +KK    +     QEK +  +  +RK  WFEKF +FISS+ YLVI G
Sbjct: 511 RTLEASQKALKSTQKKIEADLKKGLKQEKEL--LRPVRKQFWFEKFIYFISSDGYLVIGG 568

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHS 632
           +DAQQNE++ +R++ KGD+YVHADL  A+  +IKN    P+ P+PP TL+QAG  +V  S
Sbjct: 569 KDAQQNEILYRRHLKKGDIYVHADLSAAAVVIIKNRPSTPDDPIPPSTLSQAGNLSVSTS 628

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            AWDSK V SAWWV   QVSKT  +GEYL  G F+I GKKNFLPP  L++GF ++F++ E
Sbjct: 629 TAWDSKAVMSAWWVNADQVSKTTSSGEYLAAGGFVINGKKNFLPPAQLLLGFAVMFQITE 688

Query: 693 SSLGSH-------------------LNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
            S  +H                    +E   + E  G DD  DS     ++ +ES  D++
Sbjct: 689 ESKKNHNKHRLAEANMASKPAAPQPTHEEASKEETVGQDDASDSDEDFPDAKLESASDES 748

Query: 734 DEKPVAESLSV-PNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVT 792
           D +    S  +  N    A    + S  +  E   E    S G++       +    P+ 
Sbjct: 749 DNEQHQRSNPLQSNGVADAADEGSGSGSELEEAAEEQPQTSEGVEG-----VKEEPLPLA 803

Query: 793 PQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLK 852
           P                       +E     + +E K V +      K ++S  ERR L+
Sbjct: 804 P-----------------------VEEAGEQIHQEPKKV-KQEKAGGKRHLSARERRLLR 839

Query: 853 KGQGSSVVDP------KVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEK 906
           KG   S +        + + E +    A ++  + V   K +   + RG++GK KKM  K
Sbjct: 840 KGVNPSELTTAGGSANESDDEDDAVSVAPTEATTQVSSQKSKQTPLPRGKRGKAKKMALK 899

Query: 907 YGDQDEEERNIRMALLAV 924
           Y +QDEEER + + LL  
Sbjct: 900 YAEQDEEERELALRLLGA 917


>gi|449017191|dbj|BAM80593.1| unknown RNA-binding protein, conserved [Cyanidioschyzon merolae
           strain 10D]
          Length = 1371

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 257/731 (35%), Positives = 391/731 (53%), Gaps = 119/731 (16%)

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA------------------HYV 120
           PSGFTLKLRKH+RTRRL +V QLG DR++ F+F  G  +                  +++
Sbjct: 167 PSGFTLKLRKHLRTRRLAEVTQLGIDRVVDFRFVGGSQSASAYKASANGQPSRAALENHL 226

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEIC----RVFERTTASK 176
           I+EL++ GNI+LTD ++ +L +LR  R + + +A  +  R P        R+ ++     
Sbjct: 227 IVELHSGGNIILTDGDYQILAVLRVFRAEPRPLADSADQRDPPATGPGSRRMQQQDAVVG 286

Query: 177 LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
               ++ +++      D+++E         + + G Q      DL +N            
Sbjct: 287 ARYDISLARQFAPLTYDRLHEIFQECYQKRQRSGGDQL----RDLQRN------------ 330

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
                  LG ALG+GP L EH++L+ G      L E    E    +VL  A A   +  +
Sbjct: 331 -------LGRALGWGPELIEHVLLEVGAPSPDPLPE---YEQRLYRVLCEAAAFLSESPR 380

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPP-TESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
                    EGYIL++    G       +S   +  Y EF P LL Q +  E   F +FD
Sbjct: 381 ---------EGYILLRPVAEGASQASGADSEDVSDRYCEFTPRLLRQHQHLEPRMFPSFD 431

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+DE+++++E  R  Q+ + ++  A   L ++  + E RV TLKQ+ +R ++ A LIE 
Sbjct: 432 EAVDEYFARMEELRYRQEIENRQRQAQGTLERMRRELETRVLTLKQQEERCLRKAALIET 491

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           NL DVD A+  +R ALA+ + W++L +M+  ER+ GNPVA LI  L L+ N M+L+L+++
Sbjct: 492 NLVDVDNALQVIRAALASGIDWKELDQMLVLERRRGNPVAQLIHSLQLQENQMTLMLADD 551

Query: 476 LDEMDD---------------EEKTLP--------------------------VEKVEVD 494
              +D+               E + L                           VE V+VD
Sbjct: 552 SGSVDNTDAETGSSSRQRRPAETRDLSNEDSASSVESASEDESGDSTSVCSSRVELVQVD 611

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN------- 547
           L+LSA ANARR+YE +KK   K  KT+ A ++A +AAEKK  L++L      N       
Sbjct: 612 LSLSAFANARRYYEQRKKAAEKGTKTMEASAQALRAAEKKA-LEVLAGTASKNKRKKATP 670

Query: 548 ---ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK--GDVYVHADLHGA 602
              +  +RK  WFEKF +FI+SENYLVI+G+D+QQNE +V+RY+ +  GD+Y+HAD+HGA
Sbjct: 671 LNTLKAIRKPLWFEKFRYFITSENYLVIAGKDSQQNEQLVRRYLEENTGDLYMHADVHGA 730

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
           +S +IK  +  +P PPL++ +A  F    S AWD+K+  +A+WVYP QVS+TAP+G YL 
Sbjct: 731 ASVIIKGKK-NRPAPPLSIQEAAIFAAACSSAWDAKVAVNAYWVYPEQVSRTAPSGMYLQ 789

Query: 663 VGSFMIRGKKNFLP-----PHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
            GSF+IRG +N++P       PL+MGFG LFRL   S+  H+ ER VR   + + + + +
Sbjct: 790 QGSFVIRGSRNYVPVTTSGSGPLVMGFGFLFRLAPESVWRHIGERPVRSGPDSLQEAQAA 849

Query: 718 GH-HKENSDIE 727
           G   K+   +E
Sbjct: 850 GAPQKQQQQVE 860



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 48/88 (54%), Gaps = 17/88 (19%)

Query: 3  KVRMNTADVAAEVKCLRRLIGM--RCSNVYDLSPKTYIFKL----MNSSGVTESGESEKV 56
          K + +  D+ AEV  L+  +G   R  NVY+L  KTY+ KL    +N+SG   + E+E+ 
Sbjct: 8  KTKFSLLDLRAEVSVLQERLGSGSRVLNVYNLGRKTYLLKLSVPPLNASGRIPATETEEA 67

Query: 57 -----------LLLMESGVRLHTTAYAR 73
                      LL+ESGVRLHTT + R
Sbjct: 68 WATGDSSWRREYLLIESGVRLHTTRFTR 95


>gi|395745874|ref|XP_002824790.2| PREDICTED: LOW QUALITY PROTEIN: nuclear export mediator factor NEMF
           [Pongo abelii]
          Length = 1061

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/515 (44%), Positives = 317/515 (61%), Gaps = 43/515 (8%)

Query: 231 DGARAKQPTLKT-VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
           D ARA +P L    L E +   P      +L   L P + L E  KLE   I+ +++++ 
Sbjct: 156 DHARAAEPLLTLERLTEIVASAPKGE---LLKRVLNP-LLLDE--KLETKDIEKVLVSLQ 209

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
           K ED+++   + +   +GYI+ Q + +       +       Y+EF P L +Q     ++
Sbjct: 210 KAEDYMK--ATSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYI 266

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +FE+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +     
Sbjct: 267 EFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLK 326

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
            ELIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N ++
Sbjct: 327 GELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVT 386

Query: 470 LLLSN--------------------NLDEMDDEEKTLPVEK------------VEVDLAL 497
           +LL N                    N  E    +K     K            V+VDL+L
Sbjct: 387 MLLRNPYLLSEEEDDDVDDDVNVEKNETEPPKGKKKKQKSKQLQKPQKNKPLLVDVDLSL 446

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWF 557
           SA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WF
Sbjct: 447 SAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWF 506

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVP 617
           EKF WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+P
Sbjct: 507 EKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIP 565

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP
Sbjct: 566 PRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPP 625

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
             L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 626 SYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  139 bits (350), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|327287378|ref|XP_003228406.1| PREDICTED: serologically defined colon cancer antigen 1 homolog
           [Anolis carolinensis]
          Length = 635

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 256/683 (37%), Positives = 374/683 (54%), Gaps = 100/683 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ + +  LR+ L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFNTVDIRSVIAELRQSLLGMRVNNVYDVDNKTYLIRLQKPDV--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH++TRRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKTRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +I +             
Sbjct: 113 IIELYDRGNIVLTDHEYLILNILRFRTDEADDVRFAVREHYPVDIAK------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                 P A  P             S E L         ++   S K            +
Sbjct: 160 ------PAAPLP-------------SLERLT--------EIITTSPKTEQ---------I 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YG  L EH +++TG   N ++ +++  +   I+ L+ A+ K E++++  ++
Sbjct: 184 KRVLNPHLPYGATLIEHCLIETGFSGNTRIEQIDSKD---IERLLAALQKAEEYME--VT 238

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +GYI+ Q +       P +       Y+EF P L +Q+    FV+F++F+ A+DE
Sbjct: 239 DNFDGKGYII-QKREKKPSLEPEKPAEEILTYEEFHPFLFSQYTKCPFVEFDSFNKAVDE 297

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNLE 418
           FYSK+E Q+ + +   +E  A  KL  +  D E+R+  L   QE+D+ VK  EL+E NLE
Sbjct: 298 FYSKLEGQKIDLKALQQEKQALKKLENVRKDHEHRLEALHQAQEIDK-VK-GELVEMNLE 355

Query: 419 DVDAAILAVRVALANRMSWED--------------LARMVKEERKAGNPVAGLIDKLYL- 463
            VD AI  VR ALAN++ W +              LA  +KE +   N +  L+   Y+ 
Sbjct: 356 MVDRAITVVRSALANQIDWTEIGALVKEAQAQGDPLASAIKELKLQTNHITMLLKNPYVF 415

Query: 464 ----------------ERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
                                      N  +   + +      V++DL+LSA+ANA+++Y
Sbjct: 416 SEEEEEEEDGEVEEEVGEETKGKRKKKNKAKQPKKPQKNKPLLVDLDLSLSAYANAKKYY 475

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV  I   RKV+WFEKF WFISSE
Sbjct: 476 DHKRFAARKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTTIQKARKVYWFEKFLWFISSE 535

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYLVI+GRD QQNEMIVKRY+  GD+YVHADLHGA+S VIKN   + P+PP TL +AG  
Sbjct: 536 NYLVIAGRDQQQNEMIVKRYLRPGDIYVHADLHGATSCVIKNPTGD-PIPPRTLTEAGAM 594

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQ 650
            +C+S AWD++++TSAWWV+ HQ
Sbjct: 595 ALCYSAAWDARVITSAWWVHHHQ 617


>gi|449299546|gb|EMC95559.1| hypothetical protein BAUCODRAFT_71160 [Baudoinia compniacensis UAMH
           10762]
          Length = 1052

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 266/723 (36%), Positives = 385/723 (53%), Gaps = 88/723 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +N+YDLS + ++ K              +  LL++SG R H T +AR     PS
Sbjct: 21  LVTLRLANIYDLSTRIFLLKFAKPD--------HREQLLVDSGFRCHLTDFARATAAAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK +RTRR+  V Q+G DR+I  QF  G+  + + LE YA GN++LTDS+ T+L
Sbjct: 73  PFVARLRKFLRTRRVTKVEQIGTDRVIEIQFSEGL--YRLFLEFYAGGNVVLTDSDLTIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR+       VA  + H                KL      S          + ++  
Sbjct: 131 ALLRT-------VAEGAEHEQ-------------YKLGLKYDLS----------LRQNYG 160

Query: 201 NVSNASKENL--GGQKG--GKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
            V   +KE +  G QK    +  +  K   K    G  A +  L     E   + P L +
Sbjct: 161 GVPPLTKERVRDGLQKAIQKQEAEAQKPGKKIKRKGGDALRKALAVTTTE---FPPILLD 217

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ--NK 314
           H +  TG     +  +V       +  LV ++ + ++ +Q++ S      GYIL +    
Sbjct: 218 HALHVTGYDREAQPEQVVA-SGELLNKLVESLQEAQNVVQEITSA-ATARGYILAKPGKS 275

Query: 315 HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFDAALDEFYSKIESQRAE 371
              +D     +  +  +YD+F P    Q  S     F++ E F+   DEF+S +E Q+ E
Sbjct: 276 SAHQDANGLVNSDAGLLYDDFHPFKPAQLASDPSITFLEHEGFNKTCDEFFSSLEGQKLE 335

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            + + +ED A  K+ +   +Q  R+  L+   + +V+ A+ IE N+E V+ A+ AV   +
Sbjct: 336 SRLQEREDNAKRKIEQARQEQAKRIDGLQHVQELNVRKAQAIEANVERVEEAVAAVNGLI 395

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN------NLDEMDDEEK 484
           A  M W D+ R+++ E+   N VA +I   L L  N ++LLLS       + D+M DE +
Sbjct: 396 AQGMDWMDIGRLIENEQSRHNAVAEMIKLPLKLYENTVTLLLSEYAGLEEDYDDMADETE 455

Query: 485 ----------------TLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
                           + P EK   V+VDLALS  +NAR++Y+ K+    KQE+T  A  
Sbjct: 456 SEESEDEADTQAPRHTSKPEEKRLAVDVDLALSPWSNARQYYDQKRTAAEKQERTAQASQ 515

Query: 526 KAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
           KA K+ E+K     +  + QEK V  +  +RK  +FEKFN+FISS+ YLV++GRDAQQNE
Sbjct: 516 KALKSTEQKVMADLKKGLKQEKDV--LRPVRKQMYFEKFNYFISSDGYLVLAGRDAQQNE 573

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           M+ +RY+ KGDVY+HADLHGA+S ++KN    PE P+PP TL QAG   VC S AWDSK 
Sbjct: 574 MLYRRYLKKGDVYIHADLHGAASVIVKNDPQTPEAPIPPSTLGQAGNLAVCTSTAWDSKA 633

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           V SAWWV   QVSKTAPTGEYLT G F+IRGKKN+LPP  L++GF +LFR+ E S   HL
Sbjct: 634 VMSAWWVGSEQVSKTAPTGEYLTTGGFVIRGKKNYLPPAQLLLGFAVLFRISEESKARHL 693

Query: 700 NER 702
             R
Sbjct: 694 KHR 696


>gi|336276025|ref|XP_003352766.1| hypothetical protein SMAC_01600 [Sordaria macrospora k-hell]
 gi|380094654|emb|CCC08036.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1086

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 322/977 (32%), Positives = 478/977 (48%), Gaps = 147/977 (15%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R +N+YDL+ K  + K        +        LL+
Sbjct: 1   MKQRFSSLDVRVVAHELSEALVSLRLANIYDLNSKILLLKFAKPDNRQQ--------LLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H T + R     PS F  +LRK+++TRR   V Q+G DRII FQF  G  A  +
Sbjct: 53  ESGFRCHLTDFVRTASPAPSQFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTDS+  +L LL   R+  +G A     + P  I   +           
Sbjct: 111 YLEFFASGNIILTDSDLKILALL---RNVPEGEA-----QEPQRIGLTYTLENRQNFGGV 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            T +KE               + +A +  +  QK        K   K   D  R    T 
Sbjct: 163 PTLTKE--------------RLRDALQSTV--QKVAADQAAGKKIKKKGADELRRGLATT 206

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            T L       P L +H+   T   P+ K +E+  LED+++   +    +    + D ++
Sbjct: 207 ITELP------PILVDHVFRLTSFDPSTKPAEI--LEDDSLLDRLFDTLQKAREILDEVT 258

Query: 301 GDIVPEGYIL------MQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKF 351
              V  GYI+       ++  +  D PP E   +  +Y++F P L  QF + +    + F
Sbjct: 259 DSSVANGYIIAKPRPGFEDAEVVVDAPPAEKAKNL-LYEDFQPFLPKQFENNKDYRILPF 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
             ++  +DEF+S +E QR E +   +E AA  KL    MDQ  R+  L++    + + A 
Sbjct: 318 VGYNKTVDEFFSSLEGQRLESKLSEREAAAKRKLEAARMDQAKRIEGLQEMEMLNYRKAA 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            I+ N+E V  A+ AV   L   M W D+ +++++E+K GNPVA +I   + L+ N ++L
Sbjct: 378 TIQANIERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPVAEIIKLPMKLKENTITL 437

Query: 471 LL---------------------SNNLDEMD--DEEKTLPVEKVEVD--LALSAHANARR 505
           LL                     S++ DE D  + +  +PV ++E+D  L LS   NAR 
Sbjct: 438 LLGEGVEEEEEGDEDKEDDEFDYSDDEDEGDVGEPKDKVPVNRLEIDINLTLSVWNNARE 497

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFN 561
           +Y+ K+    K +KT+     A K+AE+K     R  + QEK V  +  +RK  WFEKF 
Sbjct: 498 YYDQKRTAAHKAQKTVQQSVIALKSAEQKISEDLRKGLKQEKPV--LQPIRKAMWFEKFT 555

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPL 619
           WFISS+ YLV+ GRDAQQNEM+ KRY+ KGDVYVHAD+HGA+S +IKN+   P+ P+PP 
Sbjct: 556 WFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAASVIIKNNPKTPDAPIPPS 615

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL QAG  +VC S AWDSK    AWWV   QVSK+APTGEYL VGSFM+RGK+N LPP  
Sbjct: 616 TLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPTGEYLPVGSFMVRGKRNLLPPAL 675

Query: 680 LIMGFGLLFRLDESSLGSH-------LNERRVRGEEEGMDDFEDSG----HHKENSDIES 728
           L +GFGLLFR+ + S   H         E + R   + +D   + G      K  +  +S
Sbjct: 676 LTLGFGLLFRISDDSKSKHTRNRVYDFGEAKTRDRADSLDVLSEHGESLHEQKPEAGQKS 735

Query: 729 EKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVA 788
           E DD DE       +        P H+  S         +DK++ +         A   A
Sbjct: 736 ESDDEDED------AANQKGRSNPLHSQRS--------VQDKSVESD--------AGQGA 773

Query: 789 APVTPQLEDL-IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAE 847
            P T +L DL I++       S+S           +L E++K     +    +P +++ E
Sbjct: 774 EPPTEELADLEINK-----DESVS-----------NLDEDNK-----SPAEPEPAVAQDE 812

Query: 848 RRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKY 907
           + +    +      P     K+ G  +SS   +  ++  ++     RGQ+GK KK+  KY
Sbjct: 813 KEEGDDDEDEDSHQPS---SKQAGTPSSST--APQKQQPLKKAPAKRGQRGKQKKIAAKY 867

Query: 908 GDQDEEERNIRMALLAV 924
            DQDEE+R +   L+ V
Sbjct: 868 KDQDEEDRALMEELMGV 884


>gi|380483775|emb|CCF40411.1| hypothetical protein CH063_10996 [Colletotrichum higginsianum]
          Length = 1087

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 278/794 (35%), Positives = 404/794 (50%), Gaps = 106/794 (13%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L  +R +NVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSIDVKVIAHELQESLTTLRLANVYDLSSKILLLKFAKPDN--------KKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T + R     PS F  +LRK ++TRRL  VRQ+G DRI+ FQF  G   + +
Sbjct: 53  DSGFRCHLTDFTRTTAAAPSAFVTRLRKFLKTRRLTSVRQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
            LE +A GN++LTD++  +LTLLR+  + +          Y  E  + +      T  ++
Sbjct: 111 FLEFFASGNVILTDADLKILTLLRNVSEGEGQEPQRVGMNYSLENRQNYNGVPDLTKERV 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
            AAL SS                 VS  S     G+K                D  R   
Sbjct: 171 RAALESS-----------------VSKTSVAATAGKK----------IKVKPGDELRR-- 201

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQ 296
            +L T + E     P L +H    TG    MK +++  LED + +  L+ A+ +    ++
Sbjct: 202 -SLATTITE---LPPILVDHSFQLTGFDGKMKPADI--LEDESLLDALLKALTQARSIVE 255

Query: 297 DVISGDIVPEGYILMQNKH--------LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
           D  S     +GYI  + +                 E+  S  +YD+F P L ++F +   
Sbjct: 256 DATSS-ATAKGYIFAKYRSKPDHAPEAAPPAAEDEETKRSNLLYDDFHPFLPSKFANDPT 314

Query: 349 VK---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
           VK   F+ ++  +DEF+S +E Q+ E +   +E AA  KL+    DQE R+  L+     
Sbjct: 315 VKVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRKLDAARSDQEKRIEGLRGAQSI 374

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
           +V+ A  IE N+E V  A+ AV   L   M W D++++++ E+K  NPVA +I   L L 
Sbjct: 375 NVQKATAIEANVERVQEAMDAVNGLLQQGMDWVDISKLIEREQKRRNPVAEIIKLPLNLA 434

Query: 465 RNCMSLLL----------SNNLDEMD------------DEEKTLPVEKVEVDLALSAHAN 502
            N ++LLL          SN   + D            +++K     ++EVD+ LS  AN
Sbjct: 435 ENKITLLLGEEEDIEDDESNYETDSDASDSENEESSNNNKQKNDKRLEIEVDITLSPWAN 494

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ----EKTVANISHMRKVHWFE 558
           +R ++E K+    K EKT+     A K AE+K + ++ +    EK V  +  +RK  WFE
Sbjct: 495 SRGYHEQKRSAAKKAEKTVQQSQMALKNAEQKIQAELKKGLKTEKAV--LQPIRKQSWFE 552

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPV 616
           KF WF+SS+ YLV+ G+DAQQNEM+ KRY+ KGDVYVHAD+HGA++ +IKN    P+ P+
Sbjct: 553 KFIWFVSSDGYLVLGGKDAQQNEMLYKRYLRKGDVYVHADMHGAATVIIKNSPSTPDAPI 612

Query: 617 PPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
           PP TL QAG   VC S AWDSK    AWWV  +QVSK+APTGEYL  GSFM+RG+KNFLP
Sbjct: 613 PPSTLAQAGTLAVCSSSAWDSKAGMGAWWVNANQVSKSAPTGEYLPTGSFMVRGQKNFLP 672

Query: 677 PHPLIMGFGLLFRLDESSLGSHLNERRVRG------------EEEGMDDFEDSGHHKEN- 723
           P  L++G G++F++ E S   H+  R   G            EE   D  +      ++ 
Sbjct: 673 PAQLLLGIGIMFKISEESKARHVKHRLYDGAGLQAPSADKGPEESAADAAQARDEDPDDV 732

Query: 724 SDIESEKDDTDEKP 737
           SDI SE +D DE P
Sbjct: 733 SDIGSENNDEDEDP 746


>gi|402876104|ref|XP_003901819.1| PREDICTED: nuclear export mediator factor NEMF [Papio anubis]
          Length = 1048

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 210/464 (45%), Positives = 296/464 (63%), Gaps = 36/464 (7%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLL 340
           I+ +++++ K ED+++   + +   +GYI+ Q + +       +       Y+EF P L 
Sbjct: 193 IEKVLVSLQKAEDYMK--TTSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLF 249

Query: 341 NQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
           +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+
Sbjct: 250 SQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQ 309

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDK 460
           Q  +      ELIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +
Sbjct: 310 QAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKE 369

Query: 461 LYLERNCMSLLLSN--------------------NLDEMDDEEKTLPVEK---------- 490
           L L+ N ++++L N                    N  E    +K     K          
Sbjct: 370 LKLQTNHVTMMLRNPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKP 429

Query: 491 --VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
             V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I
Sbjct: 430 LLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSI 489

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
              RKV+WFEKF WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIK
Sbjct: 490 QKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIK 549

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
           N   E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMI
Sbjct: 550 NPTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMI 608

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           RGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 609 RGKKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 652



 Score =  139 bits (350), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|310791286|gb|EFQ26815.1| hypothetical protein GLRG_02635 [Glomerella graminicola M1.001]
          Length = 1073

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 263/749 (35%), Positives = 393/749 (52%), Gaps = 92/749 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L  +R +NVYDLS K  +FK              K  L++
Sbjct: 1   MKQRFSSIDVKVIAHELQESLTTLRLANVYDLSSKILLFKFAKPDN--------KKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T + R     PSGF  +LRK+++TRRL  V+Q+G DRI+ FQF  G   + +
Sbjct: 53  DSGFRCHLTDFTRTTAAAPSGFVARLRKYLKTRRLTSVKQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
            LE +A GN++LTD++  +LTLLR+  + +         +Y  +  + +      T  ++
Sbjct: 111 FLEFFASGNVILTDTDLRILTLLRNVPEGEGQEPQRVGLKYSLDNRQNYNGVPDLTKERV 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
            AAL SS                      K++      GK   +     K  ++  R   
Sbjct: 171 RAALESS---------------------VKKSAATATAGKKIKV-----KPGDELRR--- 201

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQ 296
            +L T + E     P L +H    TG     K +E+  LED+++   L+ A+ +    ++
Sbjct: 202 -SLATTITE---LPPILVDHSFQITGFDGKTKPAEI--LEDDSLLDALLKALTRARSIVE 255

Query: 297 DVISGDIVPEGYILMQNKH-------LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
           D  S     +GYI  + +                E+  S  +YD+F P L  +F     V
Sbjct: 256 DATSS-ATSKGYIFAKYRSKADAASDAAPTAEGEETKRSDLLYDDFHPFLPKKFADDPTV 314

Query: 350 K---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           K   F+ ++  +DEF+S +E Q+ E +   +E AA  KL+    DQE R+  L+     +
Sbjct: 315 KVLEFDGYNKTVDEFFSSLEGQKLESKLTEREAAARRKLDAARSDQEKRIEGLRGAQSIN 374

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
           V+ A  IE N+E V  A+ A+   L   M W D++++++ E+K  NPVA +I   L L  
Sbjct: 375 VQKATAIEANVERVQEAMDAMNGLLQQGMDWVDISKLIEREQKRHNPVAEIIKLPLNLAE 434

Query: 466 NCMSLLL----------------SNNLDEMDDE------EKTLPVEKVEVDLALSAHANA 503
           N ++LLL                S+  D  D++      +K+    +V+V++ALS  AN+
Sbjct: 435 NTITLLLGEEEDIEDDESNYETDSDASDSEDEDNGNSNKQKSDKRLEVDVNIALSPWANS 494

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ----EKTVANISHMRKVHWFEK 559
           R ++E K+    K EKT+     A K AE+K + ++ +    EK V  +  +RK  WFEK
Sbjct: 495 REYHEQKRSAAKKAEKTVQQSVIALKNAEQKIQAELKKGLKTEKAV--LQPIRKQIWFEK 552

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F WF+SS+ YLV+ G+DAQQNEM+ KRY+ KGDVYVHAD+HGA++ +IKN    P+ P+P
Sbjct: 553 FIWFVSSDGYLVLGGKDAQQNEMLYKRYLRKGDVYVHADMHGAATVIIKNSPSTPDAPIP 612

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL QAG   VC S AWDSK    AWWV   QVSK+APTGEYL  GSFM+RG+KNFLPP
Sbjct: 613 PSTLAQAGTLAVCSSSAWDSKAGMGAWWVNADQVSKSAPTGEYLPTGSFMVRGQKNFLPP 672

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRG 706
             L++G G++F++ E S   H+  R   G
Sbjct: 673 AQLLLGIGIMFKISEESKARHVKHRLYDG 701


>gi|320169195|gb|EFW46094.1| serologically defined colon cancer antigen 1 [Capsaspora owczarzaki
           ATCC 30864]
          Length = 1151

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 214/519 (41%), Positives = 305/519 (58%), Gaps = 52/519 (10%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSE---VNKLEDNAIQVLVLAVAKFEDWLQ 296
           LK  L   L +GPA+ EH IL  GL P+  +S            I  L   +   +  L 
Sbjct: 183 LKKFLNSQLAFGPAVVEHCILKAGLKPDGSVSSQLPCTAEHSEPIDKLYAEILNTQQLLI 242

Query: 297 DVISGDIVPEGYILM----------QNKHLGKDHPPTESGSSTQ--------IYDEFCPL 338
           DV +   VP GYI+           +NK +G +     +  ++         ++DE+ P 
Sbjct: 243 DVGASSEVP-GYIIQRKESRATAANKNKGVGDEQAAVAAALASASGDASDIFVFDEYHPF 301

Query: 339 LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHT 398
           L  Q ++R  V F TFD A+DEFYS+IE QR + +H   E     KL K  ++QE ++  
Sbjct: 302 LFEQHKARPVVHFPTFDRAVDEFYSRIEGQRLDMKHIGDERNVLKKLEKFKLEQERKLVG 361

Query: 399 LKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI 458
           L+   +      +LI   L++   A+L +R ALA+ + W +++ MV+  ++  +PVA +I
Sbjct: 362 LRTTQEEEALRGQLI---LDNQTKALLVIRSALAHAVDWSEISDMVEAAKEQKDPVASII 418

Query: 459 DKLYLERNCMSLLLSN---------------NLDEMDDEEKTLPVE------------KV 491
            KL L+ N ++L+L++                 D+    +     +            KV
Sbjct: 419 HKLKLDSNIITLMLTSPDAVEEEEDDNSEDEGADQAVSSKGKGSAKGGKKGHHQQTRMKV 478

Query: 492 EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM 551
           ++D+  S HANA  ++  KK+  +K+++TI A SKA K+AE++T+ Q+ Q    A ++ +
Sbjct: 479 DIDITASVHANAESYFSRKKQAAAKEQRTIDASSKALKSAERQTKQQLKQVAVKATVNKV 538

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           RKV WFEKF WFI+SENYLVI GRD QQNE++VKR++  GD YVHADLHGASS ++KN  
Sbjct: 539 RKVLWFEKFLWFITSENYLVIGGRDMQQNELLVKRHLRNGDAYVHADLHGASSVIVKNPT 598

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+QPVP  +L +AG F VC+S AWD+K++TSAWWV  +QVSKTAPTGEYLT GSFMIRG+
Sbjct: 599 PDQPVPIRSLCEAGTFAVCYSSAWDAKVITSAWWVAANQVSKTAPTGEYLTTGSFMIRGR 658

Query: 672 KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           KNFLPP PLI+GFG L+RLDES +  HL ER+V  E E 
Sbjct: 659 KNFLPPSPLILGFGFLYRLDESCIAKHLQERKVVSEGEA 697



 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 76/168 (45%), Positives = 110/168 (65%), Gaps = 10/168 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+ A +  LR RLIG+R +NVYD++ KTY+FKL             K +LL+
Sbjct: 1   MKQRFSSLDIIASIALLRSRLIGLRVTNVYDINFKTYLFKLAKPGF--------KAILLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R HTT +   K N+PS F +KLRKH+RTRRL  +RQ+G DR+I  +FG G+ A++V
Sbjct: 53  ESGIRFHTTEFDWPKNNSPSNFAMKLRKHLRTRRLNSIRQVGADRVIDLEFGSGVAAYHV 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYPTEICR 167
           I+ELY +GNI+LTD E+ +L+LLR    + D+ V      ++P    R
Sbjct: 113 IVELYDRGNIILTDFEYNILSLLRVRTVEGDEDVRFAVGEKFPEAAVR 160


>gi|171684415|ref|XP_001907149.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942168|emb|CAP67820.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1070

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 259/731 (35%), Positives = 373/731 (51%), Gaps = 96/731 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +N+YDL+ K  +FK        +        LL+ESG R H T +AR     PS
Sbjct: 23  LVSLRLANIYDLNSKILLFKFAKPDNRQQ--------LLIESGFRCHLTDFARSTAPAPS 74

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII F+F  G  A+ + LE +A GN++LTD++ T++
Sbjct: 75  AFVARLRKFLKTRRVTSVSQIGTDRIIEFRFSDG--AYRLYLEFFASGNVILTDADLTII 132

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKLHAALTSSKEPDANEPDKVNE 197
            LLR+  + +         +Y  E  + F      T  +L AAL ++ E           
Sbjct: 133 ALLRNVPEGEGQEPQRVGLKYTLENRQNFGGVPELTKERLRAALKTAAE----------- 181

Query: 198 DGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEH 257
                                  ++K + K   D  R    T  T L       P L +H
Sbjct: 182 ---------------------HAVTKKAKKKGADELRRGLATTITEL------PPVLVDH 214

Query: 258 IILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLG 317
           +   T      K  E+ + E   +  L  A+ K    L +V S      GYI+ +     
Sbjct: 215 VFRLTEFNSAAKPLEILESE-TLLDSLFRALEKARAVLDEVTSSPRA-TGYIIAKPNPRA 272

Query: 318 KDHPPTESGSSTQ-------IYDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKIES 367
            + PP E+   TQ       +Y++F P L  QF   +    + F+ ++  +DEF+S IE 
Sbjct: 273 VEQPPAETEGETQKEKPRGLLYEDFQPFLPKQFEDDQGLTTLSFDGYNKTVDEFFSSIEG 332

Query: 368 QRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAV 427
           Q+ E + + +E  A  KL+    DQ  R+  L      +++ A  IE N+E V  A+ AV
Sbjct: 333 QKLESKLQEREATAKRKLDAARQDQAKRIEGLVGFQTLNLRKAAAIEANIERVQEAMDAV 392

Query: 428 RVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNN----------- 475
              L   M W ++ ++V+ E+  GNPVA +I   + L  + ++LLL              
Sbjct: 393 NGLLEQGMDWVNINKLVEREQAQGNPVAEIIKLPVNLAESTITLLLGEEEEEEAGEDEDM 452

Query: 476 ----------LDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                     +D   + EK    +K   ++++L LS   NAR +YE K+    K++KT+ 
Sbjct: 453 EFNYDTDEEVVDAAPEPEKAKGPDKRLAIDINLKLSVWNNAREYYEQKRTAADKEKKTVA 512

Query: 523 AHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
               A K+AE+K     R  + QEK V  +  +RK  WFEKF WFISS+ YLV+ GRDAQ
Sbjct: 513 QSVIALKSAEQKITEDLRKGLKQEKPVLQL--IRKQMWFEKFVWFISSDGYLVLGGRDAQ 570

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           QNE++ KRY+ KGDVYVHAD+HGAS+ +IKN    P+ P+PP TL QAG  +VC S AWD
Sbjct: 571 QNEILYKRYLKKGDVYVHADMHGASTVIIKNSPKTPDAPIPPSTLAQAGSLSVCCSSAWD 630

Query: 637 SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
           SK    AWWV   QVSK+APTGEYL  GSFM+RGKKN LPP  L++GFGL+FR+ E S  
Sbjct: 631 SKAAMGAWWVNADQVSKSAPTGEYLPAGSFMVRGKKNPLPPALLMLGFGLMFRISEESKA 690

Query: 697 SHLNERRVRGE 707
            H+  R   G+
Sbjct: 691 KHVKHRLYDGD 701


>gi|330929686|ref|XP_003302734.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
 gi|311321722|gb|EFQ89181.1| hypothetical protein PTT_14667 [Pyrenophora teres f. teres 0-1]
          Length = 1133

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 255/748 (34%), Positives = 394/748 (52%), Gaps = 94/748 (12%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R ++ DV A  +   +L  +R +NVYDLS + ++ K              +  LL++
Sbjct: 1   MKQRFSSLDVKATHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLID 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R H T YAR    TPSGF  KLRK+++TRR+  V Q+G DRI+ FQF  G+  + + 
Sbjct: 53  SGFRCHLTEYARTTAGTPSGFVAKLRKYLKTRRITSVAQIGTDRILEFQFSDGL--YRLY 110

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKLH 178
           LE YA GNI+LTD+E  VL+LLR+  + ++   +    +Y   I + +      T  ++ 
Sbjct: 111 LEFYAGGNIVLTDAELNVLSLLRNVDEGEEHEKLRVGLKYNLTIRQNYGGAPELTKERVR 170

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            AL  + +   N+P+   +     S                                   
Sbjct: 171 QALQKAVDRQQNQPEATGKKAKKASKD--------------------------------- 197

Query: 239 TLKTVLGEALGYGPA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
           +L+  L  ++   P  L +H +        +K  EV  + D+++   +++V +    + D
Sbjct: 198 SLRKALAVSITECPPLLVDHALHVANFDSTLKPEEV--IADDSLMEKLVSVLQDARKITD 255

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--FVKFETFD 355
            I+     +GYIL +       +    S  S  +YD+F P    QF + +  F++F+ F+
Sbjct: 256 EITTADQIKGYILAKPNPSAPTNVDESSDKSRLLYDDFHPFRPQQFENSDYTFLEFDGFN 315

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+DEF+S IE Q+ E +   +E  A  KL K   + E+R+  L+Q  + + + AE I  
Sbjct: 316 KAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIGGLQQVQELNFRKAEAILA 375

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL-- 472
           N+  V  A  AV   +   M W D+AR+++ E+ +GN VA LI   L L  N ++LLL  
Sbjct: 376 NVHRVTEATEAVNGLIRQGMDWVDIARLIEREQNSGNAVAQLIKLPLKLNENTITLLLDE 435

Query: 473 ---------------SNNLDEMDDEE----------KTLPVE-------KVEVDLALSAH 500
                          ++++ E  DEE          K+ PV+        +++DL+L+A 
Sbjct: 436 TNWEEGQEVEDEGNETSSVSEDSDEEAAGEEDGAKKKSAPVKVSARPQLAIDIDLSLTAW 495

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHW 556
           AN+  +++ KK   +K+++T+ A ++A K+ EKK     +  + QEK V  +  +RK HW
Sbjct: 496 ANSTEYFDQKKTAANKEDRTLQASTRALKSHEKKVAEDLKKGLKQEKEV--LRPVRKQHW 553

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
           FEKF +FISS+ YLV+ G+DAQQNE+I +R++ KGDVYVHADL GA   +IKN    P+ 
Sbjct: 554 FEKFIYFISSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDA 613

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG   +C S AWDSK V SAWWV   QVSKT  TGE+L  G F ++GKK F
Sbjct: 614 PIPPSTLSQAGNLCICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNVKGKKEF 673

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHLNER 702
           LPP  L++G  ++F + ESS  +H   R
Sbjct: 674 LPPAQLVVGLAVMFEISESSKANHQKHR 701


>gi|47230001|emb|CAG10415.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 582

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 259/717 (36%), Positives = 363/717 (50%), Gaps = 169/717 (23%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R  T D+ A +  +    +GMR +NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKTRFTTVDIKAVIAEINANYMGMRVNNVYDIDNKTYLIRLQKPDS--------KAILLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+H+T +   K   PSGF +K RKH++TRRL  V+QLG DRI+  QFG    A+++
Sbjct: 53  ESGTRIHSTDFEWPKNMMPSGFAMKCRKHLKTRRLTRVQQLGNDRIVDIQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHA 179
           I+ELY +GNI+L D E+T+L LLR    +   V I  R RYP E  R  E   +  +L  
Sbjct: 113 IVELYDRGNIILADHEYTILNLLRFRTAEVDDVKIAVRERYPVESARPPEPLISLERLTE 172

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            L+++++ D                                                   
Sbjct: 173 ILSTAQQGD--------------------------------------------------Q 182

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVL-VLAVAKFEDWLQDV 298
           +K VL   L YG  L EH +++ GL  + K+     +   A ++L  L VA  E +++  
Sbjct: 183 VKRVLNPHLSYGATLIEHSLIEVGLPGSAKVDSQTDVAQVAPKILEALKVA--ETYMEK- 239

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFD 355
            S     +GYI+ +++      P    G   +    YDEF P L  Q     +++F++FD
Sbjct: 240 -SEHFTGKGYIIQKSE----KKPSVTPGKPCEELLTYDEFHPFLFAQHSKSPYLEFDSFD 294

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELI 413
            A+DEF+SK+ESQ+ + +    E  A  KL  +  D E R+  L   QE+DR +K  ELI
Sbjct: 295 KAVDEFFSKMESQKIDMKALQLEKHALKKLENVKKDHEQRLEALHQAQEIDR-IK-GELI 352

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E NL  V+ A+  V  ALAN++ W ++  +VKE + AG+PVA  I +L L+ N ++LLL 
Sbjct: 353 EMNLAIVERALQVVCGALANQVDWTEIGILVKEAQAAGDPVACAIKELKLQANHITLLLK 412

Query: 474 NNLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYELKKKQ 513
           N     DDE++   +E+                    V+VDL+LSA+ANA++        
Sbjct: 413 NPYISEDDEQEDDVLEETGRKNKNKKNKKFHKNKPVLVDVDLSLSAYANAKK-------- 464

Query: 514 ESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVIS 573
                                                      FEKF WFIS+ENYLVI+
Sbjct: 465 -------------------------------------------FEKFLWFISAENYLVIA 481

Query: 574 GRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQ 633
           GRD QQNEMIVKRY+  G                      +P+PP TL +AG   VC+S 
Sbjct: 482 GRDQQQNEMIVKRYLRAG----------------------EPIPPRTLTEAGTMAVCYSA 519

Query: 634 AWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           AW++K+VTSAWWV+ HQVSKTAPTGEYLT GSFMIRGKKN+LPP  LIMGFG LF++
Sbjct: 520 AWEAKIVTSAWWVHHHQVSKTAPTGEYLTTGSFMIRGKKNYLPPSYLIMGFGFLFKV 576


>gi|156059014|ref|XP_001595430.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980]
 gi|154701306|gb|EDO01045.1| hypothetical protein SS1G_03519 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 1063

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 260/732 (35%), Positives = 396/732 (54%), Gaps = 94/732 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDLS K ++ K              K  +L++SG R H T ++R     PS
Sbjct: 19  LVTLRVSNIYDLSSKIFLVKFAKPDN--------KQQILIDSGFRCHLTDFSRATAAAPS 70

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK+++TRR+  V Q+G DRII FQF  G    Y  LE YA GNI+LTD E  +L
Sbjct: 71  VFVQRLRKYLKTRRVTQVSQVGTDRIIEFQFSDGQYRLY--LEFYAGGNIILTDKELNIL 128

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
           TLLR     D G A                     +L   L  S E      ++ N  G 
Sbjct: 129 TLLRVV---DPGEA-------------------QEELRVGLKYSLE------NRQNYGG- 159

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSND-GARAKQP--TLKTVLGEALG-YGPALSE 256
            + + ++E L          L K ++K  +D G + K+P   L+  L  ++  + P L +
Sbjct: 160 -IPDLTRERLKEA-------LQKGADKGEDDSGKKKKKPGDALRKALAVSITEFAPMLVD 211

Query: 257 HIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK-- 314
           H +  T    ++K SEV + ED  +  L+ ++ + +  +Q++ S +   +GYI+ + K  
Sbjct: 212 HAMRITNFNHSLKPSEVLQSED-LLDHLMRSLQEAQRVVQEITSSE-TSKGYIIAKKKDS 269

Query: 315 HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKIESQRAE 371
            +  D    E      +YD+F P    QF       F++FE F+  +DEF+S IE Q+ E
Sbjct: 270 QVTSDDNQAEDRKGL-LYDDFHPFKPRQFEDDPTLVFLEFEGFNKTVDEFFSSIEGQKLE 328

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            + + +E  A  K+     +Q  R+  L++    + + A  ++ N+E V  A  AV   +
Sbjct: 329 SRLEERELNAKKKIQAARNEQAKRLGGLQEIQALNERKASALQANVERVQEARDAVNGLI 388

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL-------------- 476
           A  M W ++ R+++ E+K  NPVA +I   L L++N ++LLL   +              
Sbjct: 389 AQGMDWFEIGRLIELEQKRKNPVASMIKLPLKLDQNTVTLLLDEEVFNDDEDSSYETDSD 448

Query: 477 --DEMDDEEKTLPVEK----------VEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
             D  D+E+   PVEK          ++++L+LS  ANAR +++ K+   SK++KT+ + 
Sbjct: 449 VSDSEDEEKAAKPVEKEEKATETRLAIDINLSLSPWANARNYFDQKRSAASKEDKTLQSS 508

Query: 525 SKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           SKA K+ E K     +  + QEKT+  +  +RK  WFEKF WFISS+ YLV++G+DAQQ+
Sbjct: 509 SKALKSTEAKIAQDLKKGLKQEKTI--LRPVRKQIWFEKFVWFISSDGYLVLAGKDAQQS 566

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           E++ KRY+ KGD+Y+HAD+ GA+S +++N+   P+ P+PP TL+QAG   V  S AWDSK
Sbjct: 567 EILYKRYLRKGDMYLHADISGAASVIVRNNPKTPDAPIPPQTLSQAGTLVVATSSAWDSK 626

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
              SAWWV   QVSK APTGE+L  G F I+GKKNFLPP  L++GFG+LF++ + S   H
Sbjct: 627 AGMSAWWVNADQVSKAAPTGEFLPAGKFTIQGKKNFLPPAQLLLGFGILFQISDESKARH 686

Query: 699 LNERRVRGEEEG 710
           +  R   GE  G
Sbjct: 687 VKHRFQDGEPVG 698


>gi|346976277|gb|EGY19729.1| DUF814 domain-containing protein [Verticillium dahliae VdLs.17]
          Length = 1086

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 256/741 (34%), Positives = 374/741 (50%), Gaps = 117/741 (15%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +NVYDLS K  + K              K  +L++SG R H T +AR     PS
Sbjct: 71  LVTLRLANVYDLSSKILLLKFAKPD--------NKKQILIDSGFRCHLTDFARTTAAAPS 122

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRRL  V Q+G DRII F F  G   + + LE +A GN++LTD+E  +L
Sbjct: 123 AFVARLRKFLKTRRLTAVSQVGTDRIIEFTFSDGQ--YRLFLEFFASGNVILTDAELRIL 180

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
           TLLR+                                        E +  EP +V   G 
Sbjct: 181 TLLRN--------------------------------------VPEGEGQEPQRV---GL 199

Query: 201 NVSNASKENLGGQK--------------GGKSFDLSKNSNKNSNDGARAKQPTLKTVLGE 246
             S  +++N GG                  K+ +      K    G + ++  L T + E
Sbjct: 200 GYSLDNRQNFGGVPPLTRERLQDALRVMAAKAANAPTTGKKKVKPGDQLRK-GLATTITE 258

Query: 247 ALGYGPALSEHIILDTGLVPNMKLSEV---NKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
                P L +H    TG  P    +E+   + L D+ +  L +A    ED      +   
Sbjct: 259 ---LPPMLVDHAFQVTGFDPTKTPAELLDSDALLDSLLHALTVARKVVEDATSSATT--- 312

Query: 304 VPEGYILMQNKHLGKD-HPPTESGSSTQ----IYDEFCPLLLNQFRSREFVKFETFDA-- 356
              GY++ + +   ++     + G+ T+    +YD+F P L  +F     VK  TFD   
Sbjct: 313 --TGYVIAKYRQKSEETEEKPDDGAETKREDLLYDDFHPFLPQKFADDPSVKVLTFDGFN 370

Query: 357 -ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
             +DEF+S +E Q+ E +   +E AA  KL     D   R+  L++    + + A  IE 
Sbjct: 371 KTVDEFFSSLEGQKLESKLTEREAAAKKKLEATRQDHAQRIEGLQEAQSLNEQKAAAIEA 430

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL--YLERNCMSLLLS 473
           N+E V  A+ AV   +   M W ++ ++++ E+K  NPVA  I KL   L  N M+LLL 
Sbjct: 431 NVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRRNPVAETI-KLPRKLGENLMTLLLG 489

Query: 474 NNLDEMDDEEKTLPVE--------------------KVEVDLALSAHANARRWYELKKKQ 513
               E +DE      +                    ++E++L LS  ANAR +Y+ ++  
Sbjct: 490 TEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEINLGLSPWANAREYYDQRRTA 549

Query: 514 ESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
             K++KT+   + A + AEKK     +  + QEK V  +  +RK  WFEKF WFISS+ Y
Sbjct: 550 AVKEQKTVQHSTMALRNAEKKITEDLKKGLKQEKAV--LQPIRKQMWFEKFIWFISSDGY 607

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVPPLTLNQAGCF 627
           LV+ G+DAQQNE + KRY+ KGDVY HAD+HGA++ ++KN +  P+ P+PP TL QAG  
Sbjct: 608 LVLGGKDAQQNETLYKRYLRKGDVYCHADMHGAATVIVKNRQDTPDAPIPPATLAQAGML 667

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +VC S AWDSK    AWWV   QVSK+APTGEYL  GSFM+RG+KNFLPP PL++G G++
Sbjct: 668 SVCSSSAWDSKAGMGAWWVRADQVSKSAPTGEYLPAGSFMVRGQKNFLPPAPLVLGLGIM 727

Query: 688 FRLDESSLGSHLNERRVRGEE 708
           FR+ E S   H+ + R+RG+E
Sbjct: 728 FRISEESKAKHV-KHRLRGDE 747


>gi|345804334|ref|XP_863447.2| PREDICTED: nuclear export mediator factor NEMF isoform 6 [Canis
           lupus familiaris]
          Length = 1056

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 215/508 (42%), Positives = 304/508 (59%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E    T+    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREV----KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPY 414

Query: 475 -----------NLDEMDDEEKTLPVEK-------------------VEVDLALSAHANAR 504
                          ++  E  LP  K                   V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDISVEKNETELPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTTG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  140 bits (352), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   LIGMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAILAELNASLIGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPVDHARAAE 162


>gi|367042422|ref|XP_003651591.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
 gi|346998853|gb|AEO65255.1| hypothetical protein THITE_2086741 [Thielavia terrestris NRRL 8126]
          Length = 1094

 Score =  394 bits (1013), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 260/724 (35%), Positives = 372/724 (51%), Gaps = 96/724 (13%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDL+ K  + K        +        LL+ESG R H T +AR     PS
Sbjct: 21  LVSLRLSNIYDLNSKLLLLKFAKPDNRQQ--------LLIESGFRCHLTDFARAAAPAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII FQF  G  A+ + LE +A GN++LTD++  +L
Sbjct: 73  QFVSRLRKFLKTRRVTGVSQIGTDRIIEFQFSNG--AYRLYLEFFASGNVILTDADLKIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR+                      V +          LT + E      ++ N  G 
Sbjct: 131 ALLRN----------------------VPQGEGQEPQRVGLTYTLE------NRQNFGG- 161

Query: 201 NVSNASKENL-GGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII 259
            V   +KE L G  K       +K + +  +D  R    T  T L       P L +H+ 
Sbjct: 162 -VPALTKERLRGALKTASEQAATKKAKRKGSDELRRGLATTITELP------PVLVDHVF 214

Query: 260 LDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGK 318
             T   P  K +++  LE+ A+   L  ++ K    L +V S      GYI+ +      
Sbjct: 215 RLTSFDPTTKPADI--LENEALLDALFQSLEKARSILDEVTSSPSA-RGYIIAKRNPRAA 271

Query: 319 DH-----PPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQRA 370
           D        T+  +   +Y++F P L  QF    + + + F+ F   +DEF+S +E Q+ 
Sbjct: 272 DQVADGEETTKEKAQNLLYEDFQPFLPKQFEDDPTCQVLSFDGFSKTVDEFFSSLEGQKL 331

Query: 371 EQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVA 430
           E + + +E  A  KL     DQ  R+  L++    +++ A  IE N+E V  A+ AV   
Sbjct: 332 ESRLQEREATAKRKLEAARRDQAQRIEGLQEAQLLNLRKAAAIEANVERVQEAMDAVNGL 391

Query: 431 LANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN---------NLD--- 477
           L   M W D+ ++V+ E++  NPVA +I   + LE + ++LLL           N+D   
Sbjct: 392 LQQGMDWVDINKLVEREQRLHNPVAEIIKLPMRLEESIITLLLGEEEEEAEAEANMDFDY 451

Query: 478 -------------EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                        +    +K L    ++++L LS   NAR +YE K+    KQ+KTI   
Sbjct: 452 DTDEEAAEETAAGKAKGPDKRL---AIDINLKLSPWNNAREYYEQKRTAADKQQKTIQQS 508

Query: 525 SKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
             A + AEKK     +  + QEK V  +  +RK  WFEKF WFISS+ YLV+ GRDAQQN
Sbjct: 509 EIALRNAEKKISEDLKKGLKQEKPVLQL--IRKQMWFEKFLWFISSDGYLVLGGRDAQQN 566

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           E++ KRY+ KGDVYVHAD+HGA S +IKN+   P+ P+PP TL QAG  +VC S AWDSK
Sbjct: 567 EILYKRYLRKGDVYVHADMHGAPSVIIKNNPKTPDAPIPPSTLAQAGSLSVCCSSAWDSK 626

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
            V  AWWV   QVSK+APTGEYL  GSFM+RGK+N LPP  L +GFGL+F++ E S   H
Sbjct: 627 AVMGAWWVNADQVSKSAPTGEYLPAGSFMVRGKRNALPPALLTLGFGLMFKISEDSKSKH 686

Query: 699 LNER 702
           +  R
Sbjct: 687 VKHR 690


>gi|325185450|emb|CCA19934.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1061

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 261/741 (35%), Positives = 395/741 (53%), Gaps = 101/741 (13%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL-------------SPKTYIFKLMNSSG 46
           M K RM   D+ A +  +R+ ++ MR +N+Y+L             + +TYIFKL     
Sbjct: 1   MPKTRMLIDDIHAMMGSVRKNILNMRVTNIYNLQNEAEVEGIDNKSNQRTYIFKLHQPP- 59

Query: 47  VTESGESEKVLLLMESGVRLHTTAYARDKKNT---PSGFTLKLRKHIRTRRLEDVRQLGY 103
                   KV LL+ESGVR H++ YAR+  ++   P+ FT+KLRKHIR +RL  + QL  
Sbjct: 60  ------FPKVYLLIESGVRFHSSNYARNISSSSTLPNQFTMKLRKHIRGKRLMQLEQLKG 113

Query: 104 DRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           DR+I F FG   +  ++ILELYA GNI+LTD+++ +L+LLR+HR D+  V +  R  YP 
Sbjct: 114 DRVIDFTFGSDQSQCHLILELYASGNIILTDNQYNILSLLRTHRIDE-NVKVAVRQVYPI 172

Query: 164 EIC--RVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDL 221
           +I   R  E   + ++     S    D ++ D                            
Sbjct: 173 QILSNRALESQVSGQILRQRLSDWFSDQSDDD---------------------------- 204

Query: 222 SKNSNKNSNDGARAKQPTL-KTVLGEALGYGP---ALSEHIILDTGLVPNMKL---SEVN 274
              + KN+  G + K  TL + +L +++G+G    A+ EH I+ TG +PN K+    +V 
Sbjct: 205 ---TTKNTARGGKKKFQTLEQLLLTKSVGFGGLGRAIVEHCIVSTG-IPNSKIKSYQDVR 260

Query: 275 KLEDN----------AIQVLVLAVAKFEDWLQDVISGDIVPE-------GYILMQNKHLG 317
            LED+           I++L       E +++D  S +I+ E       GYI++ N    
Sbjct: 261 TLEDHLGKLAEELNKGIKLLQWLENNQEQYMKDEQSTEILSESEKKPKGGYIILGN---- 316

Query: 318 KDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAK 377
                 ++G+ T  Y+ F P+L  Q R + +V F+TFD  +DE++S  E+++ +   +A 
Sbjct: 317 -----AQTGTKTDTYESFTPVLYAQHREKAYVSFDTFDQTVDEYFSYHEARKTQTGSQAA 371

Query: 378 EDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSW 437
           + AA  KL K+  +Q  ++  L    + ++K A+LIE +  D++  +  +R ALA+ M W
Sbjct: 372 QQAASSKLEKMRKNQIQQLDELHHSEEINLKHAQLIELHQLDIEKVLSVIRSALASGMDW 431

Query: 438 EDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLAL 497
           + L  +VK E+   NPVA +I +  L +N +S+LLS   D+   E+    V  + +DL+L
Sbjct: 432 KALKDLVKYEQTNANPVASMIHEFDLSKNRVSVLLS---DDPYFEDAEPAVHAIWLDLSL 488

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT-RLQILQEKTVANISHMRKVHW 556
           SA  NA   Y  KK    K +K   A  KA K A  KT +    Q      I   RK  W
Sbjct: 489 SALGNAAELYAKKKTSAEKAKKAEVATEKAIKLAASKTEKFMKTQLIKPTPIHQRRKTFW 548

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ-- 614
           FEKF+WF+SSEN LVISG+DAQQNE++V RY+ K DV+VH+DL GAS  +++        
Sbjct: 549 FEKFHWFLSSENILVISGKDAQQNELLVNRYVRKNDVFVHSDLQGASPCIVRVRAARTFD 608

Query: 615 ---PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
               +P  TL QA C  VC S AW ++++T A+WV    VSK+  +GE L  G+F+I GK
Sbjct: 609 QALSIPITTLEQAACMCVCRSNAWKNQVITGAYWVKAECVSKSTSSGELLPPGTFLILGK 668

Query: 672 KNFLPPHPLIMGFGLLFRLDE 692
           KNFL    L MG  +L+  +E
Sbjct: 669 KNFLQALRLEMGLAILYHTEE 689


>gi|194379038|dbj|BAG58070.1| unnamed protein product [Homo sapiens]
          Length = 782

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 203/414 (49%), Positives = 272/414 (65%), Gaps = 34/414 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y+EF P L +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D
Sbjct: 41  YEEFHPFLFSQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKD 100

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
            ENR+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G
Sbjct: 101 HENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQG 160

Query: 452 NPVAGLIDKLYLERNCMSLLLSN--------------------NLDEMDDEEKTLPVEK- 490
           +PVA  I +L L+ N +++LL N                    N  E    +K     K 
Sbjct: 161 DPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQ 220

Query: 491 -----------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
                      V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  +
Sbjct: 221 LQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTL 280

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK-GDVYVHAD 598
            + +TV +I   RKV+WFEKF WFISSENYL+I GRD QQNE+IVKRY++  GD+YVHAD
Sbjct: 281 KEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPVGDIYVHAD 340

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
           LHGA+S VIKN   E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTG
Sbjct: 341 LHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTG 399

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           EYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 400 EYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDEDME 453


>gi|50555916|ref|XP_505366.1| YALI0F13277p [Yarrowia lipolytica]
 gi|49651236|emb|CAG78173.1| YALI0F13277p [Yarrowia lipolytica CLIB122]
          Length = 1134

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 261/758 (34%), Positives = 399/758 (52%), Gaps = 98/758 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R +  D+      LR+ ++  R  N+YDL  S + ++ K      V ES    K L+
Sbjct: 1   MKQRFSQLDLKVIASELRKSILNYRLQNIYDLLSSSRHFLLKF----AVPES----KQLV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G R+HT+ + R    TPS F  KLRKH+RTRRL  + Q   DR+++  F  G   +
Sbjct: 53  VIDPGFRIHTSNFQRPTSQTPSNFVAKLRKHLRTRRLSAITQPVGDRVLVLTFSDGQ--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR--SHRDDDKGVAIMSRHRYPTEIC------RVFE 170
           ++ILE +A GN++L D +F +L L R  S   +++ VA+   + +  E+       +V  
Sbjct: 111 HLILEFFAGGNLILVDQDFKILALQRVVSEGANNQRVAVGVIYEFDKELLNNTDPLQVSR 170

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               + L     ++  PD  E D+VN     V                     N  K   
Sbjct: 171 TEITADLLQQWVATVSPD--EDDEVNAISGGV---------------------NKKKTRR 207

Query: 231 DGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
              +AK P+LK +L   +    PAL E  +   G+  N+ + +V+   ++ +  +  AV 
Sbjct: 208 ---KAKLPSLKKLLYSNMSELSPALLEQYLEKEGVDGNLSIKDVD-FSESTVTSIAAAVK 263

Query: 290 KFEDWLQDVISGDIVPEGYILMQ-NKHLGK-DHPPTESGSSTQ------IYDEFCPLLLN 341
             ED +Q+++  D+V  GYI  + N +  K D   T    S        +Y+ F P  + 
Sbjct: 264 GCEDRVQELLDADLV-TGYIACEKNPNWKKPDEEKTYIPGSIDPSDIEYLYESFEPFEIT 322

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
                +   FE ++  +D ++S +ES R   +  A+E  A  +LN    + + RV  L+Q
Sbjct: 323 -VADGKVDTFEGYNLTVDRYFSTVESTRYSLRVNAQEQIAEKRLNAARNETKKRVDGLQQ 381

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL 461
             DRS+ M   ++     V+ AI AV+      M W+D+  ++  E+K GNPVA ++  +
Sbjct: 382 VQDRSILMGTALQTYAGRVEEAIAAVKQLQDQGMDWKDMEHLIDLEKKKGNPVAQMVSSM 441

Query: 462 YLERNCMSLLLSNNLDEM-----------------------------DDEEKTLPVEKVE 492
            LE+N ++L+L N   E                               +E KTL   KVE
Sbjct: 442 NLEKNRVTLILPNPDVEDESDSDSDSDMDETDSEGESEESGSESDSNKNESKTL---KVE 498

Query: 493 VDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH-- 550
           V+L L+A+ANA  ++++KK    KQEKT    + A K+AE+K +L +  ++++A   H  
Sbjct: 499 VNLDLTAYANANNYFDIKKVAAQKQEKTEKNSATALKSAEQKVKLDL--KRSLAQEQHAL 556

Query: 551 --MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             MR  +WFEKF WF SS+ YLVI G+DAQQNEM+ KRY  KGD YVHA++ GAS+ ++K
Sbjct: 557 RPMRPSYWFEKFWWFFSSDGYLVIGGKDAQQNEMLYKRYFRKGDAYVHAEIQGASTVIVK 616

Query: 609 NHR-PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           NH  P  P+PP TL+QAG  ++C S+AWDSK++ SAWWV   QVSK+AP+GE+L  GSFM
Sbjct: 617 NHLGPTAPLPPSTLSQAGSLSICTSKAWDSKVLISAWWVEHGQVSKSAPSGEFLPTGSFM 676

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           IRGKKNFLPP  L +G  +L+  DE S   ++ +R  R
Sbjct: 677 IRGKKNFLPPTSLDVGLAILWIADEDSTAKYVKQRLER 714


>gi|332842178|ref|XP_003314363.1| PREDICTED: nuclear export mediator factor NEMF isoform 1 [Pan
           troglodytes]
          Length = 1055

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 217/508 (42%), Positives = 306/508 (60%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKREIKPSLEADKPVEDIFT----YEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|426376842|ref|XP_004055191.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Gorilla
           gorilla gorilla]
          Length = 1056

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/505 (42%), Positives = 303/505 (60%), Gaps = 59/505 (11%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTM 575

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 576 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 635

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 636 FKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRADEADDVKFAVRERYPLDHARAAE 162


>gi|397523544|ref|XP_003831789.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Pan
           paniscus]
          Length = 1055

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/505 (42%), Positives = 303/505 (60%), Gaps = 59/505 (11%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++Y
Sbjct: 418 EEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYY 477

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           + K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSE
Sbjct: 478 DHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSE 537

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +AG  
Sbjct: 538 NYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTM 575

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  L
Sbjct: 576 ALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFL 635

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           F++DES +  H  ER+VR ++E M+
Sbjct: 636 FKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|403277934|ref|XP_003930597.1| PREDICTED: nuclear export mediator factor NEMF isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 1056

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/508 (42%), Positives = 305/508 (60%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G + N+K+ E  KLE   I+ +++ + K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFLGNVKVDE--KLETKDIEKILVCLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E+    +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYIIQKRE----TKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQVVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHVRAAE 162


>gi|451850505|gb|EMD63807.1| hypothetical protein COCSADRAFT_182004 [Cochliobolus sativus
           ND90Pr]
          Length = 1128

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 317/995 (31%), Positives = 484/995 (48%), Gaps = 139/995 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L  +L  +R +NVYDLS + ++ K              +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T YAR     PSGF  KLRK+++TRR+  + Q+G DRI+ FQF  G+  + +
Sbjct: 53  DSGFRCHLTEYARTTAAAPSGFVAKLRKYLKTRRVTSISQIGTDRILEFQFSDGL--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKL 177
            LE YA GNI+LTD++  VL+LLR+  + ++   +    +Y   + + +      T  ++
Sbjct: 111 YLEFYAGGNIILTDADLNVLSLLRNVDEGEEHEKLRVGLKYNLTLRQNYGGAPELTKERV 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             AL  + +   ++P      G     A K++L                          +
Sbjct: 171 CQALQKAVDKQQDQPVAA---GRKAKKAGKDSL--------------------------R 201

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
             L   + E     P L +H +       ++K  EV  L D+ +   ++ V +    + D
Sbjct: 202 KALAVSITEC---PPLLVDHALHVASYDSSLKPEEV--LADDGLVKRLVEVLQDARKITD 256

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--FVKFETFD 355
            I+     +GYIL +            S  S  +YD+F P    QF + +  F++F+ F+
Sbjct: 257 EITKTDQIKGYILAKPNPSASKPDDESSDKSRLLYDDFHPFRPQQFENTDYTFLEFDGFN 316

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+DEF+S IE Q+ E +   +E  A  KL K   + E+R+  L+Q  + + + AE I  
Sbjct: 317 KAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIGGLQQVQELNFRKAEAILA 376

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL-- 472
           N+  V  A  AV   +   M W D+ R+++ E+ +GN VA LI   L L  N ++LLL  
Sbjct: 377 NVHRVTEATEAVNGLIRQGMDWVDIERLIEREQNSGNAVAQLIRLPLKLHENTITLLLNE 436

Query: 473 -------------------SNNLDEMDDE-EKTLPVEKV-------EVDLALSAHANARR 505
                              S + D+ DD   KT P + V       ++DL LSA AN+  
Sbjct: 437 TNWEKGGEEEDEGNETSSVSEDTDDEDDRPRKTSPPKPVARPQLAIDIDLGLSAWANSTE 496

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFN 561
           +++ KK    K+ +T+ A SKA K+ EKK     +  + QEK V  +  +RK HWFEKF 
Sbjct: 497 YFDQKKTAADKEGRTLQASSKALKSHEKKVAEDLKKGLKQEKEV--LRPVRKQHWFEKFI 554

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPL 619
           +FISS+ YLV+ G+DAQQNE+I +R++ KGDVYVHADL GA   +IKN    P+ P+PP 
Sbjct: 555 YFISSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPS 614

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL+QAG  ++C S AWDSK V SAWWV   QVSKT  TGE+L  G F I+GKK FLPP  
Sbjct: 615 TLSQAGNLSICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNIKGKKEFLPPAQ 674

Query: 680 LIMGFGLLFRLDESSLGSH----LNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDE 735
           L++G  ++F + +SS  +H    + E  V   E  M D E +   KE + +++++ D DE
Sbjct: 675 LVVGLAVMFEISDSSKANHHKHRVQETAVSAAE--MTD-EPTNESKEAAAMKTDESDDDE 731

Query: 736 KPVAESLSVPNSAHPAP--SHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTP 793
            P A+  S      P     HT  S+ +S    +     SN + S      RN       
Sbjct: 732 FPDAKINSDSEDDFPDAKMEHTEESDAESEAAASR----SNPLQSST----RNAKEDSDE 783

Query: 794 QLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKA------- 846
           + E L+ +            +H        +  +++  E   ++ D   ISK+       
Sbjct: 784 EEEPLVGK---------RGAEHAKPGENNGVVAKEEPPENEGSIADSESISKSMGRGKLS 834

Query: 847 --ERRKLKKGQ----------GSSVVDPKVEREKERGKDASSQPESIVRKT------KIE 888
             ERR  +KGQ             VVD   + E++  +  S++  + V +T      K +
Sbjct: 835 ARERRLARKGQLPELPQVPSDTVPVVDGADQDERDSTEGGSTKAATKVDETVTSQMNKQK 894

Query: 889 GGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
              + RG++ K KK   KY  QDEE+R + M LL 
Sbjct: 895 NPPLPRGKRAKAKKQAAKYAAQDEEDRELAMRLLG 929


>gi|194388162|dbj|BAG65465.1| unnamed protein product [Homo sapiens]
          Length = 1055

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 217/508 (42%), Positives = 306/508 (60%), Gaps = 65/508 (12%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ Q + +    P  E+    +    Y+EF P L +Q     +++FE+FD 
Sbjct: 239 TSNFSGKGYII-QKREI---KPCLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDK 294

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE N
Sbjct: 295 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMN 354

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-- 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N  
Sbjct: 355 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPY 414

Query: 475 ------------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANAR 504
                             N  E    +K     K            V+VDL+LSA+ANA+
Sbjct: 415 LLSEEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAK 474

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           ++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFI
Sbjct: 475 KYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFI 534

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SSENYL+I GRD QQNE+IVKRY++ G                      +P+PP TL +A
Sbjct: 535 SSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEA 572

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF
Sbjct: 573 GTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGF 632

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMD 712
             LF++DES +  H  ER+VR ++E M+
Sbjct: 633 SFLFKVDESCVWRHQGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|390469065|ref|XP_003734045.1| PREDICTED: nuclear export mediator factor NEMF isoform 2
           [Callithrix jacchus]
          Length = 1056

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 217/513 (42%), Positives = 306/513 (59%), Gaps = 60/513 (11%)

Query: 233 ARA-KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
           ARA K   LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++ + K 
Sbjct: 175 ARAPKGELLKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKILVCLQKA 232

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
           ED+++   + +   +GYI+ Q + +       +       Y+EF P L +Q     +++F
Sbjct: 233 EDYMK--TTSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEF 289

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E+FD A+DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      E
Sbjct: 290 ESFDKAVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQTQEIDKLKGE 349

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LIE NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++L
Sbjct: 350 LIEMNLQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTML 409

Query: 472 LSN--------------------NLDEMDDEEKTLPVEK------------VEVDLALSA 499
           L N                    N  E    +K     K            V+VDL+LSA
Sbjct: 410 LRNPYLLSEEEDDDVDGDVSVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSA 469

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEK 559
           +ANA+++Y+ K+    K +KT+ A  KAF++AEKKT+  + + +TV +I   RKV+WFEK
Sbjct: 470 YANAKKYYDHKRYAAKKTQKTVEAAEKAFRSAEKKTKQTLKEVQTVTSIQKARKVYWFEK 529

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F WFISSENYL+I GRD QQNE+IVKRY++ G                      +P+PP 
Sbjct: 530 FLWFISSENYLIIGGRDQQQNEIIVKRYLTPG----------------------EPIPPR 567

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  
Sbjct: 568 TLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSY 627

Query: 680 LIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 628 LMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 660



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|367021400|ref|XP_003659985.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
           42464]
 gi|347007252|gb|AEO54740.1| hypothetical protein MYCTH_2297656 [Myceliophthora thermophila ATCC
           42464]
          Length = 1085

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 271/768 (35%), Positives = 386/768 (50%), Gaps = 122/768 (15%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDL+ K  + K    +   +        LL+ESG R H T +AR     PS
Sbjct: 21  LVSLRLSNIYDLNSKILLLKFAKPNSRQQ--------LLIESGFRCHLTDFARAAAPAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII  QF  G  A+ + LE +A GNI+LTD+E  +L
Sbjct: 73  QFVSRLRKFLKTRRVTAVSQIGTDRIIEIQFSDG--AYRLYLEFFASGNIILTDAELKIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR+                                        E +  EP +V   G 
Sbjct: 131 ALLRN--------------------------------------VPEGEGQEPQRV---GL 149

Query: 201 NVSNASKENLGGQKGGKSFDL------------SKNSNKNSNDGARAKQPTLKTVLGEAL 248
             +  +++N GG        L            SK + K ++D  R    T  T L    
Sbjct: 150 TYTLENRQNFGGVPPLTKERLRDALRTALAQAESKKAKKKTSDELRRGLVTTITELP--- 206

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEG 307
              P L +H        P +K +E+  LED ++   L  ++ +    L DVIS     +G
Sbjct: 207 ---PVLIDHAFRLANFDPAIKPAEI--LEDESLLDALFQSLERGRSILDDVISSSTT-KG 260

Query: 308 YILMQNKHLGKDHPPTESGSSTQI-------YDEFCPLLLNQFR---SREFVKFETFDAA 357
           YI+ +     ++  P   G   QI       Y++F P L  QF    S + + F+ ++  
Sbjct: 261 YIIAKPNPRAQE--PVAEGEDAQISRPRNLLYEDFQPFLPKQFEDDPSCQVLSFDGYNKT 318

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEF+S +E Q+ E + + +E  A  KL     DQE R+  L++    +++ A  IE N+
Sbjct: 319 VDEFFSSLEGQKLESRLQEREAIAKRKLEAARRDQEQRIEGLQEAQMLNLRKAAAIEANI 378

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL 476
           E V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I   + L  N ++LLL    
Sbjct: 379 ERVQEAMDAVNGLLQQGMDWVDVNKLVEREQKLHNPVAEIIQLPMRLHENVITLLLGEEE 438

Query: 477 ------DEMD-----DEEKT---------LPVEKVEVD--LALSAHANARRWYELKKKQE 514
                 D++D     DEE            P +++ +D  L LS   NAR +YE K+   
Sbjct: 439 EEGEAEDKLDFDYDTDEEAADDGVPDKAKGPAKRLAIDINLKLSPWNNAREYYEQKRTAA 498

Query: 515 SKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
            KQ+KT+     A K AE+K     +  + QEK V  +  +RK  WFEKF WFISS+ YL
Sbjct: 499 EKQQKTVQQSEIALKNAEQKIAEDLKKGLKQEKPV--LQPIRKQLWFEKFIWFISSDGYL 556

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFT 628
           V+ GRDAQQNE++ KRY+ KGDVYVHAD+HGA + ++KN+   P+ P+PP TL QAG  +
Sbjct: 557 VLGGRDAQQNEILYKRYLRKGDVYVHADMHGAPTVIVKNNPKTPDAPIPPSTLAQAGSLS 616

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           VC S AWDSK    A+WV   QVSK+AP GEYL VGSFM+RGK+N LPP  L++GFGL+F
Sbjct: 617 VCCSNAWDSKAAMGAYWVNADQVSKSAPAGEYLPVGSFMVRGKRNPLPPALLMLGFGLMF 676

Query: 689 RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEK 736
           ++ E S   H+  R          D   +G    +   E E D T EK
Sbjct: 677 KVSEESKARHVKHRLYDA------DVGTAGAAPVSVATEVEADATSEK 718


>gi|299471369|emb|CBN79324.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1380

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 196/384 (51%), Positives = 255/384 (66%), Gaps = 10/384 (2%)

Query: 323 TESGSSTQIYDEFCPLLLNQFRSREFV-KFETFDAALDEFYSKIESQRAEQQHKAKEDAA 381
           TE G    +Y+EF P LL Q      +  F +FD A+D F+ +I  Q+ +Q   A E A 
Sbjct: 440 TEEGGDHVVYEEFLPQLLAQHEGGAVIHSFASFDQAVDAFFGRIVEQKLKQTAMAAEAAV 499

Query: 382 FHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLA 441
             K+  I  DQE RV  L++  ++ ++ A+L E   ++V+ A++ VR ALAN M W+DL 
Sbjct: 500 ERKVAWIRNDQERRVLALEERQEKMLRHAQLAEAWADEVEKALMVVRSALANGMDWQDLE 559

Query: 442 RMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
            +VK E   GNP+A LI +L L+RN + L L    D  DD+        VEVD+ LSAHA
Sbjct: 560 DLVKAETANGNPIASLIHELRLDRNQVVLSLPTAEDGEDDQ-------LVEVDIMLSAHA 612

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NAR  YE KK   +K+ KT+TA  K  K AE++    + ++    ++   RKV+WFEKFN
Sbjct: 613 NARVMYENKKLARAKELKTLTASEKVLKIAEQQAERTLQRQAHKRSLQVARKVYWFEKFN 672

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP--EQPVPPL 619
           WFISSENYLVISGR+AQQNE++VK+Y+  GD+YVHADLHGASS V++N  P  ++ V PL
Sbjct: 673 WFISSENYLVISGRNAQQNEVVVKKYLRPGDIYVHADLHGASSCVVRNKDPSGKRAVSPL 732

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
            L +AGC TVC S AW +KMVTSAWWVY  QVSKTAPTGEYL  GSFM+RG+K+FLPP  
Sbjct: 733 ALEEAGCMTVCRSGAWGAKMVTSAWWVYADQVSKTAPTGEYLVTGSFMVRGRKHFLPPRA 792

Query: 680 LIMGFGLLFRLDESSLGSHLNERR 703
           L MGF LLF+LD+S L +H  ERR
Sbjct: 793 LEMGFALLFKLDDSCLAAHAGERR 816



 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/134 (48%), Positives = 86/134 (64%), Gaps = 16/134 (11%)

Query: 57  LLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
           +LL+ESGVR HTT +   K + PSGF++KLRKHIRT+RLEDVRQ+G DR++ F+FG G  
Sbjct: 1   MLLLESGVRFHTTKFTHTKSDMPSGFSMKLRKHIRTQRLEDVRQVGMDRVVDFKFGSGKA 60

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSH----------------RDDDKGVAIMSRHR 160
           +++VILELYA GNI+LTDS++ +L LLR+H                      V +  R  
Sbjct: 61  SNHVILELYASGNIILTDSKYEILDLLRTHIYEGQGGGAAGGSGATGGAGDNVRVAVRQI 120

Query: 161 YPTEICRVFERTTA 174
           YP E+    E TTA
Sbjct: 121 YPMELATTQEGTTA 134


>gi|189211034|ref|XP_001941848.1| serologically defined colon cancer antigen 1 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187977941|gb|EDU44567.1| serologically defined colon cancer antigen 1 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 1151

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 305/980 (31%), Positives = 480/980 (48%), Gaps = 141/980 (14%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L  +R +NVYDLS + ++ K              +  LL++SG R H T YAR    TP
Sbjct: 38  KLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLIDSGFRCHLTEYARTTAGTP 89

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           SGF  KLRK+++TRR+  V Q+G DRI+ FQF  G+  + + LE YA GNI+LTD+E  V
Sbjct: 90  SGFVAKLRKYLKTRRITSVAQIGTDRILEFQFSDGL--YRLYLEFYAGGNIVLTDAELNV 147

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKLHAALTSSKEPDANEPDKVN 196
           L+LLR+  + ++   +    RY   + + +      T  ++  AL  + +   N+P    
Sbjct: 148 LSLLRNVDEGEEHEKLRVGLRYNLTLRQNYGGAPELTKERVRQALQKAMDRQQNQPAATG 207

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPA-LS 255
           +        S                                 L+  L  ++   P  L 
Sbjct: 208 KKAKKAGKDS---------------------------------LRKALAVSITECPPLLV 234

Query: 256 EHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKH 315
           +H +        +K  EV  + D+ +   +++V +    + D I+     +GYIL +   
Sbjct: 235 DHALHVADFDSTLKPEEV--IADDGLMEKLVSVLRDARKITDEITTTNQIKGYILAKPNP 292

Query: 316 LGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--FVKFETFDAALDEFYSKIESQRAEQQ 373
               +    S  +  +YD+F P    QF + +  F++F+ F+ A+DEF+S IE Q+ E +
Sbjct: 293 SAPTNEDESSDKARLLYDDFHPFRPQQFENSDYTFIEFDGFNKAVDEFFSSIEGQKLESK 352

Query: 374 HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALAN 433
              +E  A  KL K   + E+R+  L+Q  + + + AE I  N+  V  A  AV   +  
Sbjct: 353 LTEREQQAKRKLEKARKEHEDRIGGLQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQ 412

Query: 434 RMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL-----------------SNN 475
            M W D+AR+++ E+ +GN VA LI   L L  N ++LLL                 +++
Sbjct: 413 GMDWVDIARLIEREQNSGNAVAQLIKLPLKLNENTITLLLDETNWEEGEEVEDEGNETSS 472

Query: 476 LDEMDDEE---------KTLPVE-------KVEVDLALSAHANARRWYELKKKQESKQEK 519
           + E  DE+         K+ PV+        +++DL+L+A AN+  +++ KK   +K+++
Sbjct: 473 VSEDSDEDAGEEDGAKKKSAPVKVSARPQLAIDIDLSLTAWANSTEYFDQKKTAANKEDR 532

Query: 520 TITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGR 575
           T+ A ++A K+ EKK     +  + QEK V  +  +RK  WFEKF +FISS+ YLV+ G+
Sbjct: 533 TLQASTRALKSHEKKVAEDLKKGLKQEKEV--LRPVRKQQWFEKFIYFISSDGYLVLGGK 590

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQ 633
           DAQQNE+I +R++ KGDVYVHADL GA   +IKN    P+ P+PP TL+QAG   +C S 
Sbjct: 591 DAQQNEIIYRRFLRKGDVYVHADLKGAMPMIIKNKPDTPDAPIPPSTLSQAGNLCICTSD 650

Query: 634 AWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           AWDSK V SAWWV   QVSKT  TGE+L  G F ++GKK FLP   L++G  ++F + ES
Sbjct: 651 AWDSKAVMSAWWVRSDQVSKTGQTGEFLPAGMFNVKGKKEFLPLAQLVVGLAVMFEISES 710

Query: 694 SLGSH----LNERRVRGEE---EGMDDFEDSGHHK-ENSD---------IESEKDD---- 732
           S  +H    + E  V   E   E  D+ + + H K +NSD         IES+ +D    
Sbjct: 711 SKANHHKHRIQETAVSAAEMVDEPTDETKAADHTKTDNSDDDEDFPDAKIESDSEDDFPD 770

Query: 733 ----TDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARN-V 787
                 E+  AES +    ++P  S T  +  +S +   ++ +++   D       RN  
Sbjct: 771 AKMGQTEESDAESEAAAPRSNPLQSRTTDARDESDD--GDEPSVAQKDDEFAMSGGRNRS 828

Query: 788 AAPVTPQLEDLIDRALGLGSASISST-KHGIETTQFDLSEEDKHVERTATVRDKPYISKA 846
           +A   PQ +D           S++ T K    T +  LS  ++ + R   + + P +   
Sbjct: 829 SANEEPQEDD----------GSVADTEKTSKSTGRRQLSARERRLARKGQLPELPQVPSN 878

Query: 847 ERRKLKKG---QGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKM 903
                      +GSS  +   +   +    A+SQ       TK +   + RG++ K KK 
Sbjct: 879 AAPADDDAAHEEGSSAEEGSAKTPGKVPGTATSQ------GTKQKNTPLPRGKRAKAKKQ 932

Query: 904 KEKYGDQDEEERNIRMALLA 923
             KY  QDEE+R + M LL 
Sbjct: 933 AAKYAAQDEEDRELAMRLLG 952


>gi|389646873|ref|XP_003721068.1| nuclear export mediator factor [Magnaporthe oryzae 70-15]
 gi|351638460|gb|EHA46325.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           70-15]
          Length = 1074

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 268/768 (34%), Positives = 384/768 (50%), Gaps = 127/768 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D  A  + L  +L G+R SN+YDLS K  + K             +K  L++
Sbjct: 1   MKQRFSSVDCKAISQELHAQLPGLRLSNIYDLSSKILLLKFAKPD--------QKAQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T +AR     PS F  +LRK ++TRRL  V Q+G DRII FQF  G   + +
Sbjct: 53  DSGFRCHLTDFARTTAPAPSPFVARLRKFLKTRRLTSVSQIGTDRIIEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GN++LTD+E  +L                                      A 
Sbjct: 111 FLEFFAGGNVILTDNELKIL--------------------------------------AI 132

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGG----QKGGKSFDLSKNSNKNSNDG---- 232
           L + KE +  EP ++   G + S  +++N GG     K      L+K + K +N      
Sbjct: 133 LRNVKEGEGQEPQRI---GLSYSLDNRQNYGGVPEFTKQRLRDALTKTAEKAANTSGATR 189

Query: 233 -ARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLED------------ 278
            AR     L+  L   +    P + +H    +      + +++ + +D            
Sbjct: 190 KARKSGADLRRGLASTITELPPIVVDHAFRSSNFDAQAQAADILQNDDTFDALFEALEEA 249

Query: 279 --------NAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ 330
                   +A Q+    VAK  D    V + D V EG ++          P     S   
Sbjct: 250 RKTLAGITSAAQITGYIVAKTRDGAASVQNEDRVSEGALV---------KPFVPGSSKDL 300

Query: 331 IYDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
           +Y++F P L  QF S      ++FE F+  +DEFYS +E Q+ E +   +E+AA  KL+ 
Sbjct: 301 LYEDFQPFLPKQFSSDPTNVILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDA 360

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              +Q  R+  L++    + + A  IE N+E V  A+ AV   L N M W D+ ++V+ E
Sbjct: 361 AREEQAKRIEGLEESQLLNFRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVERE 420

Query: 448 RKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------------------EMDDEEK 484
           +K  NPVA +I+  + L  N ++L +    +                      E D + +
Sbjct: 421 QKRNNPVAAIIELPMDLANNTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQ 480

Query: 485 TLPVEKVEVD--LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK-TR-LQ-- 538
                ++EVD  L LS  +NA  +Y+ K+    K+EKTI   S A K+A +K TR LQ  
Sbjct: 481 QPSKRELEVDIKLNLSPWSNAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKG 540

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           + QEK V  I  +R   WFEK+ WF+SS+ YLV+ GRDAQQNEMI +R++ +GDVYVHAD
Sbjct: 541 LKQEKPV--IQPIRHQVWFEKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHAD 598

Query: 599 LHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           L GA S +IKN+   PE P+PP TL+QAG  TVC S AWD K    A+WV   QVSK AP
Sbjct: 599 LKGAPSVIIKNNPRTPEAPIPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAP 658

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           TGE+L  GSFMI+GKKN LPP  L++GFGLLFR+ E S   H  + RV
Sbjct: 659 TGEFLPAGSFMIKGKKNELPPATLVIGFGLLFRISEESKAKHAKQHRV 706


>gi|169612956|ref|XP_001799895.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
 gi|111061751|gb|EAT82871.1| hypothetical protein SNOG_09606 [Phaeosphaeria nodorum SN15]
          Length = 1132

 Score =  384 bits (987), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 333/1026 (32%), Positives = 477/1026 (46%), Gaps = 195/1026 (19%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L + L  +R +NVYDLS  T   ++     +       +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSKSLTSLRVTNVYDLSSLTLSQRIFL---IKFHKPDHREQLLI 57

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T YAR     PS F  KLRK+++TRR+  + Q+G DRI+ FQF  G+  + +
Sbjct: 58  DSGFRCHLTEYARTTAAAPSTFVAKLRKYLKTRRVTSIAQIGTDRILEFQFSDGL--YRL 115

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE YA GNI+LTD +  VL LLR+                      V E     +L   
Sbjct: 116 YLEFYAGGNIVLTDGDLKVLALLRN----------------------VDEGEEHERLRVG 153

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L      + N   + N  G       +   G QK      + +   + +  G +AK+   
Sbjct: 154 L------EYNLSMRQNYGGAPELTKDRIRKGLQKA-----VDRQQAQPAATGKKAKK-VG 201

Query: 241 KTVLGEALGYG-----PALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
           K  L +AL        P L +H +     D+ L P   L+    LE       +L+V K 
Sbjct: 202 KDALRKALAVSITECPPLLVDHALHVAKYDSALKPEEILANDELLEK------LLSVLKD 255

Query: 292 EDWLQDVISGDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE--F 348
              + D I+     +GYIL + N +   D    E   S  +YD+F P    QF   +  F
Sbjct: 256 ARKITDEINSQEQTKGYILAKPNPNATTDEEGAEK--SKHMYDDFHPFRPQQFEESDYTF 313

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F+ F+ A+DEF+S IE Q+ E +   +E  A  KL K   + E R+  L+Q  + + +
Sbjct: 314 LEFDGFNKAVDEFFSSIEGQKLESRLTEREQQAKKKLEKARREHEERLGGLQQVQEVNFR 373

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNC 467
            AE I  N+  V  A  AV   +   M W D+A +++ E+  GN VA  I   L L  N 
Sbjct: 374 KAEAILANVHRVAEATEAVNGLIRQGMDWGDIASLIEREQSHGNAVAETIKLPLKLHENT 433

Query: 468 MSLLLS----NNLDEMDDE-EKTLPVEK-------------------------VEVDLAL 497
           ++LLL     ++ +E DDE  +T  V +                         +++DLAL
Sbjct: 434 ITLLLDETDFDHAEEDDDEGNETSSVSEDSEDEDEGPKKKAAPAKPAARPKLAIDIDLAL 493

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRK 553
           S  AN+  +Y+ KK   SK+++T+ A +KA K+ EKK     +  + QEK +  +  +RK
Sbjct: 494 SPWANSTEYYDQKKTAASKEDRTLQASTKALKSHEKKVAEDLKKGLKQEKDI--LRPVRK 551

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--R 611
             WFEKF +FISS+ YLV+ G+DAQQNE+I +RY  KGDVYVHADL GA   +IKN    
Sbjct: 552 QQWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRYFRKGDVYVHADLKGAVPMIIKNKPTT 611

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+ P+PP TL+QAG  +VC S AW+SK V SAWWV   QVSKT  TGE+L  G F I+GK
Sbjct: 612 PDAPIPPSTLSQAGHLSVCSSDAWESKAVMSAWWVLADQVSKTGQTGEFLPPGLFNIKGK 671

Query: 672 KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV----------------------RGEEE 709
           K +LPP  LI+G  ++F + E+S   H N+ RV                      +G +E
Sbjct: 672 KEYLPPAQLIVGLAVMFEISEASKARH-NKHRVLDGVNISAVEMAPDSEEQPKATQGSKE 730

Query: 710 GM----------------DDFEDSG-HHKENSDIESEKDDTDEKPVAESLSVPNSAHPAP 752
                             DDF D+   H E SD ESE         A   + P  +  A 
Sbjct: 731 DDSDDDEFPDAKLASDSDDDFPDAKMEHTEESDAESE---------AAGHANPLQSSKAD 781

Query: 753 SHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISS 812
           +H N+S+ D      ED    NG    +    R+ A+       D  D    LG      
Sbjct: 782 AHENSSDEDED----EDVKSVNGKSGHVMSGGRDGAS----HQGDAQDDTGSLGD----- 828

Query: 813 TKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQ-GSSVVDPKVEREKE-- 869
                       SE+ K   R        ++S  ERR LKKGQ  +SV  P  +   +  
Sbjct: 829 ------------SEQTKGASRR-------HLSAKERRLLKKGQLPASVQVPSQKTPADGS 869

Query: 870 -RGKDASSQPESIVRKTKIEG-----------GKISRGQKGKLKKMKEKYGDQDEEERNI 917
             G +++S  E   + TK  G             + RG++ K KK+  KY  QDEE+R +
Sbjct: 870 VDGDESASAGEEAQQPTKPAGTVTSQASKATSSPLPRGKRSKQKKLAAKYAAQDEEDREL 929

Query: 918 RMALLA 923
            M LL 
Sbjct: 930 AMRLLG 935


>gi|86196391|gb|EAQ71029.1| hypothetical protein MGCH7_ch7g436 [Magnaporthe oryzae 70-15]
          Length = 1095

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 263/749 (35%), Positives = 375/749 (50%), Gaps = 126/749 (16%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L G+R SN+YDLS K  + K             +K  L+++SG R H T +AR     P
Sbjct: 41  QLPGLRLSNIYDLSSKILLLKFAKPD--------QKAQLIIDSGFRCHLTDFARTTAPAP 92

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           S F  +LRK ++TRRL  V Q+G DRII FQF  G   + + LE +A GN++LTD+E  +
Sbjct: 93  SPFVARLRKFLKTRRLTSVSQIGTDRIIEFQFSDGQ--YRLFLEFFAGGNVILTDNELKI 150

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L                                      A L + KE +  EP ++   G
Sbjct: 151 L--------------------------------------AILRNVKEGEGQEPQRI---G 169

Query: 200 NNVSNASKENLGG----QKGGKSFDLSKNSNKNSNDG-----ARAKQPTLKTVLGEALG- 249
            + S  +++N GG     K      L+K + K +N       AR     L+  L   +  
Sbjct: 170 LSYSLDNRQNYGGVPEFTKQRLRDALTKTAEKAANTSGATRKARKSGADLRRGLASTITE 229

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLED--------------------NAIQVLVLAVA 289
             P + +H    +      + +++ + +D                    +A Q+    VA
Sbjct: 230 LPPIVVDHAFRSSNFDAQAQAADILQNDDTFDALFEALEEARKTLAGITSAAQITGYIVA 289

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE-- 347
           K  D    V + D V EG ++          P     S   +Y++F P L  QF S    
Sbjct: 290 KTRDGAASVQNEDRVSEGALV---------KPFVPGSSKDLLYEDFQPFLPKQFSSDPTN 340

Query: 348 -FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
             ++FE F+  +DEFYS +E Q+ E +   +E+AA  KL+    +Q  R+  L++    +
Sbjct: 341 VILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAKRIEGLEESQLLN 400

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
            + A  IE N+E V  A+ AV   L N M W D+ ++V+ E+K  NPVA +I+  + L  
Sbjct: 401 FRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPVAAIIELPMDLAN 460

Query: 466 NCMSLLLSNNLD----------------------EMDDEEKTLPVEKVEVD--LALSAHA 501
           N ++L +    +                      E D + +     ++EVD  L LS  +
Sbjct: 461 NTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRELEVDIKLNLSPWS 520

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKK-TR-LQ--ILQEKTVANISHMRKVHWF 557
           NA  +Y+ K+    K+EKTI   S A K+A +K TR LQ  + QEK V  I  +R   WF
Sbjct: 521 NAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKPV--IQPIRHQVWF 578

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQP 615
           EK+ WF+SS+ YLV+ GRDAQQNEMI +R++ +GDVYVHADL GA S +IKN+   PE P
Sbjct: 579 EKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVIIKNNPRTPEAP 638

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL+QAG  TVC S AWD K    A+WV   QVSK APTGE+L  GSFMI+GKKN L
Sbjct: 639 IPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAGSFMIKGKKNEL 698

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           PP  L++GFGLLFR+ E S   H  + RV
Sbjct: 699 PPATLVIGFGLLFRISEESKAKHAKQHRV 727


>gi|440634980|gb|ELR04899.1| hypothetical protein GMDG_00158 [Geomyces destructans 20631-21]
          Length = 1072

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 250/729 (34%), Positives = 376/729 (51%), Gaps = 103/729 (14%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +NVYDL+ K ++ +             +K  ++++SG R H T+++R    +PS
Sbjct: 21  LVTLRLANVYDLASKIFLLRFTKPD--------DKKQMIIDSGFRCHLTSFSRATTASPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  KLRK ++TRR+  V Q+G DRII FQF  G    Y  LE YA GNI+LTD E  +L
Sbjct: 73  VFVTKLRKFLKTRRVTAVSQIGTDRIIEFQFSEGQYRLY--LEFYAGGNIILTDKELNIL 130

Query: 141 TLLRS------HRDDDKGV--AIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEP 192
           TLLR+        +   G+  ++ +R  Y           T  +L AAL  + E   N P
Sbjct: 131 TLLRTVPPGEGQEEQRIGLKYSLENRQNYLG-----IPPLTKDRLQAALRKAAEQSENAP 185

Query: 193 DKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGP 252
                                         K   KN  D  R     L   + E   + P
Sbjct: 186 ----------------------------AEKKQGKNGIDSLRR---ALAVSITE---FPP 211

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +H +  T   P +K +++ K  D  +  L+ ++ + +  +++ I+G  V  GYI+ +
Sbjct: 212 LLVDHAMKVTDFDPTLKPADIAK-NDTLLDHLLRSLEEADRVVKE-ITGSDVATGYIIAK 269

Query: 313 -----NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSK 364
                +K   +D    E+     +Y++F P    QF +     FV FE F+  +DEF+S 
Sbjct: 270 KQERTDKVASRDE---ETERQALLYEDFHPFKPRQFENDPACTFVPFEGFNNTVDEFFSS 326

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           IE QR E +   +E  A  KL     DQ+ R+  L++    + + A  IE N++ V  A 
Sbjct: 327 IEGQRLESRLYEREVTAKKKLQAAKDDQQKRLGGLQEIQTLNERKAGAIETNVQRVQEAT 386

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL------- 476
            AV   +A  M W ++ +++  E+K GNPVA +I   L L  N ++LLL   +       
Sbjct: 387 DAVNGLIAQGMDWIEIGKLIDIEQKRGNPVASIIKLPLKLHENTVTLLLDEEIFVEDLND 446

Query: 477 ------DEMDDEEKTLPVEK-----------VEVDLALSAHANARRWYELKKKQESKQEK 519
                  ++ D E   P+++           ++++L  S  +NAR +Y  ++    K++K
Sbjct: 447 EAYETGSDVSDSEDEAPIKEAVKKVVDKRLAIDINLGASPWSNAREYYGQRRSAAEKEKK 506

Query: 520 TITAHSKAFKAA----EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGR 575
           T+ + +KA K+     E+  +  + QEK +  +  +RK  WFEKF WFISS+ YLV+ GR
Sbjct: 507 TLESSTKALKSTSHKIEQDLKKGLKQEKAI--LRPVRKHMWFEKFMWFISSDGYLVLGGR 564

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQ 633
           DAQQNE++ KRY+ KGDVYVHADL GA+S  I+NH  R + P+PP TL+QAG   V  S 
Sbjct: 565 DAQQNEILYKRYLRKGDVYVHADLDGATSVFIRNHESRVDAPIPPSTLSQAGILAVSSSS 624

Query: 634 AWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           AW+SK    AWW    QVSK+APTG+Y   GSF +RGKKNFLPP PL++GFG++F +   
Sbjct: 625 AWESKAGMPAWWANADQVSKSAPTGDYFKPGSFDVRGKKNFLPPAPLLLGFGVMFHVSNE 684

Query: 694 SLGSHLNER 702
           S  +H   R
Sbjct: 685 SKANHTKYR 693


>gi|350296215|gb|EGZ77192.1| hypothetical protein NEUTE2DRAFT_99766 [Neurospora tetrasperma FGSC
           2509]
          Length = 1095

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 273/778 (35%), Positives = 384/778 (49%), Gaps = 129/778 (16%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R +N+YDL+ K  + K        +        LL+
Sbjct: 1   MKQRFSSLDVRVVAHELSEALVSLRLANIYDLNSKILLLKFAKPDNRQQ--------LLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H T + R     PS F  +LRK+++TRR   V Q+G DRII FQF  G  A  +
Sbjct: 53  ESGFRCHLTDFVRTASPAPSQFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTDS+  +L L                                      
Sbjct: 111 YLEFFASGNIILTDSDLKILAL-------------------------------------- 132

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGG-----------------QKGGKSFDLSK 223
           L +  E +  EP ++   G   +  +++N GG                 QK        K
Sbjct: 133 LRNVPEGEGQEPQRI---GLTYTLENRQNFGGVPALTKERLRDALQSTVQKAAADQAAGK 189

Query: 224 NSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV 283
              K   D  R    T  T L       P L EH+   T   P  K +E+   +    ++
Sbjct: 190 KIKKKGADELRRGLATTITELP------PILVEHVFRLTSFDPATKPAEILDDDSLLDKL 243

Query: 284 LVLAVAKFEDWLQDVISGDIVPEGYIL------MQNKHLGKDHPPTESGSSTQIYDEFCP 337
                   E  + D ++   V  GYI+       ++  L  D PP E  + T +Y++F P
Sbjct: 244 FDTLQQARE--ILDEVTDSSVSNGYIIAKPRSGFEDTELDVDAPPAEK-AKTLLYEDFQP 300

Query: 338 LLLNQF---RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
            L  QF   ++   + F  ++  +DEF+S +E QR E +   +E AA  KL    MDQ  
Sbjct: 301 FLPKQFEDDKAYRILPFVGYNKTVDEFFSSLEGQRLESKLSEREAAAKRKLEAARMDQAK 360

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           R+  L++    + + A  I+ N E V  A+ AV   L   M W D+ +++++E+K GNPV
Sbjct: 361 RIEGLQEMEMLNYRKAATIQANTERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPV 420

Query: 455 AGLID-KLYLERNCMSLLL---------------------SNNLDEMDDEEKT---LPVE 489
           A +I   + L+ N ++LLL                     S++ D+ D  E T    PV+
Sbjct: 421 AEIIKLPMKLKENTITLLLGEGVEEEDEGDQDKEDDEFDYSDSEDDADGAETTKHKAPVK 480

Query: 490 KVEVD--LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEK 543
           ++EVD  L LS   NAR +Y+ K+    K +KT+     A K AE+K     R  + QEK
Sbjct: 481 RLEVDINLTLSVWNNAREYYDQKRTAADKAQKTVQQSVIALKNAEQKIAEDLRKGLKQEK 540

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
            V  +  +RK  WFEKF WFISS+ YLV+ GRDAQQNEM+ KRY+ KGDVYVHAD+HGA+
Sbjct: 541 PV--LQPIRKQMWFEKFTWFISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAA 598

Query: 604 STVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           S +IKN+   P+ P+PP TL QAG  +VC S AWDSK    AWWV   QVSK+AP GEYL
Sbjct: 599 SVIIKNNPKTPDAPIPPSTLAQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPAGEYL 658

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN-------ERRVRGEEEGMD 712
            VGSFM+RGK+N LPP  L +GFGLLFR+ + S   H         ER+ +G  + +D
Sbjct: 659 PVGSFMVRGKRNLLPPALLTLGFGLLFRVSDDSKSKHTRHRVYDFVERKTKGRADSLD 716


>gi|440466993|gb|ELQ36234.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           Y34]
 gi|440486785|gb|ELQ66618.1| serologically defined colon cancer antigen 1 [Magnaporthe oryzae
           P131]
          Length = 1095

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 263/749 (35%), Positives = 374/749 (49%), Gaps = 126/749 (16%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L G+R SN+YDLS K  + K             +K  L+++SG R H T +AR     P
Sbjct: 41  QLPGLRLSNIYDLSSKILLLKFAKPD--------QKAQLIIDSGFRCHLTDFARTTAPAP 92

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           S F  +LRK ++TRRL  V Q+G DRII FQF  G   + + LE +A GN++LTD+E  +
Sbjct: 93  SPFVARLRKFLKTRRLTSVSQIGTDRIIEFQFSDGQ--YRLFLEFFAGGNVILTDNELKI 150

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L                                      A L + KE +  EP ++   G
Sbjct: 151 L--------------------------------------AILRNVKEGEGQEPQRI---G 169

Query: 200 NNVSNASKENLGG----QKGGKSFDLSKNSNKNSNDG-----ARAKQPTLKTVLGEALG- 249
            + S  +++N GG     K      L+K + K +N       AR     L+  L   +  
Sbjct: 170 LSYSLDNRQNYGGVPEFTKQRLRDALTKTAEKAANTSGATRKARKSGADLRRGLASTITE 229

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLED--------------------NAIQVLVLAVA 289
             P + +H    +      + +++ + +D                    +A Q+    VA
Sbjct: 230 LPPIVVDHAFRSSNFDAQAQAADILQNDDTFDALFEALEEARKTLAGITSAAQITGYIVA 289

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE-- 347
           K  D    V + D V EG ++          P     S   +Y++F P L  QF S    
Sbjct: 290 KTRDGAASVQNEDRVSEGALV---------KPFVPGSSKDLLYEDFQPFLPKQFSSDPTN 340

Query: 348 -FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
             ++FE F+  +DEFYS +E Q+ E +   +E+AA  KL+    +Q  R+  L++    +
Sbjct: 341 VILEFEGFNKTVDEFYSSLEGQKLESRLTEREEAAKKKLDAAREEQAKRIEGLEESQLLN 400

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
            + A  IE N+E V  A+ AV   L N M W D+ ++V+ E+K  NPVA +I   + L  
Sbjct: 401 FRKAAAIEANVERVQEAMDAVIGLLENGMDWVDINKLVEREQKRNNPVAAIIKLPMDLAN 460

Query: 466 NCMSLLLSNNLD----------------------EMDDEEKTLPVEKVEVD--LALSAHA 501
           N ++L +    +                      E D + +     ++EVD  L LS  +
Sbjct: 461 NTITLRIGEEEEDDSKDDVDAGYETDSTVSDDDDEADAKSQQPSKRELEVDIKLNLSPWS 520

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKK-TR-LQ--ILQEKTVANISHMRKVHWF 557
           NA  +Y+ K+    K+EKTI   S A K+A +K TR LQ  + QEK V  I  +R   WF
Sbjct: 521 NAGEYYDQKRSAAEKREKTIAQSSLALKSATQKITRELQKGLKQEKPV--IQPIRHQVWF 578

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQP 615
           EK+ WF+SS+ YLV+ GRDAQQNEMI +R++ +GDVYVHADL GA S +IKN+   PE P
Sbjct: 579 EKYLWFVSSDGYLVLGGRDAQQNEMIYRRHLGRGDVYVHADLKGAPSVIIKNNPRTPEAP 638

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL+QAG  TVC S AWD K    A+WV   QVSK APTGE+L  GSFMI+GKKN L
Sbjct: 639 IPPSTLSQAGQLTVCASNAWDGKAAMGAYWVNADQVSKAAPTGEFLPAGSFMIKGKKNEL 698

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           PP  L++GFGLLFR+ E S   H  + RV
Sbjct: 699 PPATLVIGFGLLFRISEESKAKHAKQHRV 727


>gi|327348881|gb|EGE77738.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 1166

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 313/1001 (31%), Positives = 476/1001 (47%), Gaps = 162/1001 (16%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L + L+G+R SN+YDLS + Y+FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVISRELSQALVGLRISNIYDLSSRIYLFKLAKPDTRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T Y+R     PS F ++LRK ++TRR+  V Q+G DRII  +   G N H V
Sbjct: 53  DTGFRCHLTEYSRTTAAAPSPFIVRLRKFLKTRRVTAVTQVGTDRIIDIELSDG-NFH-V 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE YA GNI+LTD E+ ++ L   HR   +G           E  RV        L   
Sbjct: 111 LLEFYAGGNIILTDKEYKIVAL---HRIVPEG--------NDQEEVRV-------GLQYV 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT+ +  +   P  +      +  A      G+  G       N+ +     A A +  +
Sbjct: 153 LTNKQNYNGVPPLSIERLRETLEQAKDVAGSGEGAG-------NTKRAKKKQAEALRRAV 205

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVI 299
                E   Y P L EH+   TG+ P++K  +V  L DN  ++ L+LA+ + E     + 
Sbjct: 206 SLGFPE---YPPLLLEHVFHITGVDPSLKPEQV--LGDNELVEKLMLALVEAESVNSSLS 260

Query: 300 SGDIVPEGYILMQNKHLG-KDHPPTESG---SSTQIYDEFCPLLLNQFRSRE---FVKFE 352
           + D  P GYI+ + +    +D   T +    S    Y +F P    QF ++     +KF+
Sbjct: 261 TADDTP-GYIVSKTEIKSVEDSEVTATDPFKSKNLQYVDFHPFEPKQFENQADMAILKFD 319

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           TF+ A+DE++S +E Q+ E +   +E+ A  KL     DQE RV  LK+  +  V+ A+ 
Sbjct: 320 TFNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRKAQA 379

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL 471
           IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L  N ++LL
Sbjct: 380 IEANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTVTLL 439

Query: 472 LSNNL------------------------------DEMDDEEKTLPVEKVEVDLALSAHA 501
           L                                   +  +++    +  +++DL +S  A
Sbjct: 440 LGEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGISPWA 499

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANISHMRKVHWFEK 559
           NAR++YE KK    K+EKT+ +  KA K+ EKK    + Q  ++    +  +R   WFEK
Sbjct: 500 NARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFWFEK 559

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVP 617
           F +FISS+ YL + GRDAQQ E++ +R++ KGDVYVHAD+ GA    +KN    P+ P+P
Sbjct: 560 FIYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDAPIP 619

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL+QAG   V  S AW SK V  AWWV   QVSKT P+GEYL  G F+IRG+KN LPP
Sbjct: 620 PGTLSQAGNLCVATSSAWHSKAVMGAWWVNADQVSKTTPSGEYLETGGFVIRGEKNQLPP 679

Query: 678 HPLIMGFGLLFRLDESSLGSHLNER------RVRG--EEEGMDDF--------------- 714
             L++GF ++F++   S+ +H   R         G  E +GM++                
Sbjct: 680 AQLLLGFAVMFQISSESIKNHTKHRVQDDSSTTTGVKETQGMEELPSRLDQQTPRESENK 739

Query: 715 -------EDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPA 767
                  ++    +EN +IE   DD    P           H     +++ + D      
Sbjct: 740 ETYHQPEQNDSSDEENGEIEENTDDKRTNPF---------LHEKAESSDSDSEDGESKIG 790

Query: 768 EDKTIS-NGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSE 826
           ED+    +  D + +D A + A         + + ALG    S    + G E        
Sbjct: 791 EDRPQDVDAKDEREYDHAESKA---------VEEAALGGKETSSQEEQAGSEP------- 834

Query: 827 EDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTK 886
              H + +A  R    +S  E  +LKK  G S+         E+     + PES  R T 
Sbjct: 835 ---HTD-SAAARPAKRLSATENGQLKK--GVSI---------EQASTPPTDPES--RLTP 877

Query: 887 IEGGKIS----RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
            E  + S    RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 878 NEPSRSSTPNIRGKRGKNKKIATKYQHQDEEDRELALRLLG 918


>gi|342879256|gb|EGU80511.1| hypothetical protein FOXB_08971 [Fusarium oxysporum Fo5176]
          Length = 1060

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 258/743 (34%), Positives = 377/743 (50%), Gaps = 107/743 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKIIAHELQERLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  VRQ+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTKFARTTAAAPSAFVARLRKFLKTRRLTSVRQVGTDRVLEFEFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD++  +L L R                                    
Sbjct: 111 FLEFFASGNIILTDADLKILALAR------------------------------------ 134

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             +  E +  EP +V   G   S  +++N GG        L++   +++   A  K  T 
Sbjct: 135 --TVSEGEGQEPQRV---GLQYSLENRQNFGGIP-----PLTRERVQDALRTAVEKAATA 184

Query: 241 KTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
                +     P L +H +     DT + P+  L+    L D     LV ++ +    ++
Sbjct: 185 TASSKKQKELPPVLVDHWLHTNNFDTTIKPDEILANETLLAD-----LVKSLQEARQSVE 239

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLL---LNQFRSREFV 349
           ++ S +    GYI  + +   +    T+  S TQ    +Y++F P +   L +  + E +
Sbjct: 240 ELTSSEAC-TGYIFAKRRERTEGAEATDE-SKTQRDNLLYEDFHPFVPYKLKKDPTIEVL 297

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F  ++  +DEF+S +E QR E +   +E AA  KL     +Q  R+  L++    + + 
Sbjct: 298 EFTGYNETVDEFFSSLEGQRLESRLSEREAAAKRKLEAARNEQSKRIEGLQEAQALNFRK 357

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A  IE N E V  A+ AV   L+  M W D+ ++V+ E+K  NPVA +I   L L  N +
Sbjct: 358 AAAIEANAERVQEAMDAVNGLLSQGMDWVDVGKLVEREKKRHNPVAEIIKLPLNLAENLI 417

Query: 469 S--------------------LLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANARR 505
           +                       +   DE     K     K   VE++L LS  +NAR 
Sbjct: 418 TLELAEEEFEPEEDDPYETDDDDSALGDDEGTSAAKGKQANKALSVEINLGLSPWSNARE 477

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFN 561
           +++ +K    K+EKT    S+A K AE+K     +  + QEK +  +  +RK  WFEKF 
Sbjct: 478 YFDQRKTAAVKEEKTQQQASRALKNAEQKITEDLKKGLKQEKAL--LQPIRKPMWFEKFV 535

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPL 619
           WFISS+ YLVI G+DAQQNEMI K+Y+ KGDVY HADLHGASS +IKN+   P+ P+PP 
Sbjct: 536 WFISSDGYLVIGGKDAQQNEMIYKKYLRKGDVYCHADLHGASSVIIKNNPKTPDAPIPPA 595

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           TL+QAG   VC S AWDSK   SAWWV   QVSK+APTGE+L  GSFMIRGKKNFLPP  
Sbjct: 596 TLSQAGSLAVCSSNAWDSKAGMSAWWVNADQVSKSAPTGEFLPAGSFMIRGKKNFLPPAQ 655

Query: 680 LIMGFGLLFRLDESSLGSHLNER 702
           L++G G+ F++ E S   H+  R
Sbjct: 656 LLLGLGVAFKISEESKAKHVKHR 678


>gi|336464133|gb|EGO52373.1| hypothetical protein NEUTE1DRAFT_71883 [Neurospora tetrasperma FGSC
           2508]
          Length = 1095

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 264/741 (35%), Positives = 370/741 (49%), Gaps = 121/741 (16%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +N+YDL+ K  + K        +        LL+ESG R H T + R     PS
Sbjct: 21  LVSLRLANIYDLNSKILLLKFAKPDNRQQ--------LLIESGFRCHLTDFVRTASPAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK+++TRR   V Q+G DRII FQF  G  A  + LE +A GNI+LTDS+  +L
Sbjct: 73  QFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRLYLEFFASGNIILTDSDLKIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            L                                      L +  E +  EP ++   G 
Sbjct: 131 AL--------------------------------------LRNVPEGEGQEPQRI---GL 149

Query: 201 NVSNASKENLGG-----------------QKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
             +  +++N GG                 QK        K   K   D  R    T  T 
Sbjct: 150 TYTLENRQNFGGVPALTKERLRDALQSTVQKAAADQAAGKKIKKKGADELRRGLATTITE 209

Query: 244 LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
           L       P L EH+   T   P  K +E+   +    ++        E  + D ++   
Sbjct: 210 LP------PILVEHVFRLTSFDPATKPAEILDDDSLLDKLFDTLQQARE--ILDEVTDSS 261

Query: 304 VPEGYIL------MQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQF---RSREFVKFETF 354
           V  GYI+       ++  L  D PP E  + T +Y++F P L  QF   ++   + F  +
Sbjct: 262 VSNGYIIAKPRSGFEDTELDVDAPPAEK-AKTLLYEDFQPFLPKQFEDDKAYRILPFVGY 320

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           +  +DEF+S +E QR + +   +E AA  KL    MDQ  R+  L++    + + A  I+
Sbjct: 321 NKTVDEFFSSLEGQRLKSKLSEREAAAKRKLEAARMDQAKRIEGLQEMEMLNYRKAATIQ 380

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL- 472
            N+E V  A+ AV   L   M W D+ +++++E+K GNPVA +I   + L+ N ++LLL 
Sbjct: 381 ANIERVQEAMDAVNGLLQEGMDWVDITKLIEKEQKQGNPVAEIIKLPMKLKENTITLLLG 440

Query: 473 --------------------SNNLDEMDDEEKT---LPVEKVEVD--LALSAHANARRWY 507
                               S++ D+ D  E T    PV+++EVD  L LS   NAR +Y
Sbjct: 441 EGVEEEEEGDQDKEDDEFDYSDSEDDADGAETTKDKAPVKRLEVDINLTLSVWNNAREYY 500

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWF 563
           + K+    K +KT+     A K AE+K     R  + QEK V  +  +RK  WFEKF WF
Sbjct: 501 DQKRTAADKAQKTVQQSVIALKNAEQKIAEDLRKGLKQEKPV--LQPIRKQMWFEKFTWF 558

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTL 621
           ISS+ YLV+ GRDAQQNEM+ KRY+ KGDVYVHAD+HGA+S +IKN+   P+ P+PP TL
Sbjct: 559 ISSDGYLVLGGRDAQQNEMLYKRYLRKGDVYVHADVHGAASVIIKNNPKTPDAPIPPSTL 618

Query: 622 NQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
            QAG  +VC S AWDSK    AWWV   QVSK+AP GEYL VGSFM+RGK+N LPP  L 
Sbjct: 619 AQAGNLSVCCSSAWDSKAGMGAWWVNADQVSKSAPAGEYLPVGSFMVRGKRNLLPPALLT 678

Query: 682 MGFGLLFRLDESSLGSHLNER 702
           +GFGLLFR+ + S   H   R
Sbjct: 679 LGFGLLFRVSDDSKSKHTRHR 699


>gi|322693747|gb|EFY85597.1| serologically defined colon cancer antigen 1 [Metarhizium acridum
           CQMa 102]
          Length = 1063

 Score =  381 bits (978), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 311/980 (31%), Positives = 476/980 (48%), Gaps = 168/980 (17%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L + L+ +R +NVYDLS K  +FK              K  L++
Sbjct: 1   MKQRFSSLDVKVIAHELNQSLVTLRLANVYDLSSKILLFKFAKPDN--------KKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T ++R     PS F  +LRK ++TRRL  V Q+G DRI+  QF  G   + +
Sbjct: 53  DTGFRCHLTKFSRTTAAAPSAFVARLRKLLKTRRLTSVSQVGTDRILQLQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
            LE +A GNI+LTD++  +L+L R+  + D         +Y  E  + F      T  ++
Sbjct: 111 FLEFFASGNIILTDADLKILSLARNVSEGDGQEPQRVGLQYSLENRQNFHGIPPLTRERV 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             AL S+ +  A  P            ASK+   G+ GG   DL K              
Sbjct: 171 QVALQSAVDKAAATP------------ASKKP-KGKPGG---DLRK-------------- 200

Query: 238 PTLKTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
             L   + E     P L +HI+     DT + P  ++ E   L D  +++L  A +  E 
Sbjct: 201 -CLAVSITE---LPPVLVDHILQSNNFDTAVNP-AEILENEVLLDELVKLLSEAKSSVEG 255

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ--------IYDEFCPLLLNQFRS 345
                I+   +  GYI  + +    D  P +    ++        +Y++F P + ++ + 
Sbjct: 256 -----ITSSEICTGYIFAKRR----DGNPIKEAQGSEAATNRGELLYEDFHPFIPHKLQR 306

Query: 346 REFVK---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
              +K   F+ ++  +DEF+S +E Q+ E +   +E AA  KL+    DQ  R+  L+  
Sbjct: 307 DPSIKALEFKGYNQTVDEFFSSLEGQKLETRLNEREAAAKRKLDAAKADQAKRIEGLQDA 366

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KL 461
              +++ A  IE N+E V  A+ AV   LA  M W D+ ++V+ E+K  NPVA +I   L
Sbjct: 367 QTLNMRKAAAIEANVEWVQEAMDAVNGLLAQGMDWVDIGKLVEREKKRKNPVADIIVLPL 426

Query: 462 YLERNCMSLLL---------------SNNLDEMDDEEKTLPVEK---------VEVDLAL 497
            L  N ++L L               +++ D  D+ E +   +K         VE++L L
Sbjct: 427 NLAENLITLSLAEEEEEEAEEADPFETDDSDSEDENEASTISKKSEKPAKGLNVEINLKL 486

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRK 553
           S  +NAR +YE ++    K+EKT    S+A K AE+K     +  + QEK +  +  +RK
Sbjct: 487 SPWSNAREYYEQRRTAVVKEEKTQQQASRALKNAEQKIVEDLKKGLKQEKAL--LQPIRK 544

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--R 611
             WFEKF WFISS+ YLV+ G+DAQQNE++ KRY+ KGDVY HADL GA S +IKN+   
Sbjct: 545 QLWFEKFLWFISSDGYLVLGGKDAQQNEILYKRYLRKGDVYCHADLRGAPSVIIKNNPST 604

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+ P+PP TL QAG  +VC S+AWD K    AWWV   QVSK+ P G++L  G+FM+RG+
Sbjct: 605 PDAPIPPATLAQAGNLSVCASEAWDQKAGMGAWWVKADQVSKSGPAGDFLPTGNFMVRGQ 664

Query: 672 KNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKD 731
           KNFL P  L++G G++F++ E S   H+  R        + D + +      SD+ + ++
Sbjct: 665 KNFLAPAQLLLGLGIMFKISEESKARHVKHR--------IHDVDSA----LGSDVATSRN 712

Query: 732 DTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEF-PAEDKTISN--GIDSKIFDIARN-- 786
           D                      + AS  DS E  P +D T S+    D +  + AR   
Sbjct: 713 DM--------------------QSLASVADSQEKEPEDDVTQSDNESDDGREQEDARANP 752

Query: 787 VAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKA 846
           + AP   + +D +D A G  S S++ T+    T + D   ED+  E T T RD+  ++  
Sbjct: 753 LQAPDAAE-DDEVDEATGAVS-SLNLTEQ--PTGEGD--GEDEAAE-TGTSRDESELATE 805

Query: 847 ERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEK 906
                 K   S+                ++ P S  +K     G   RGQ+GK KK+  K
Sbjct: 806 ASEAPTKTSDSTT-------------QTAATPSSHSKK-----GPPKRGQRGKAKKIALK 847

Query: 907 YGDQDEEERNIRMALLAVST 926
           Y DQDEE+R    AL+  + 
Sbjct: 848 YKDQDEEDRAAAEALIGATV 867


>gi|303312187|ref|XP_003066105.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240105767|gb|EER23960.1| hypothetical protein CPC735_053300 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 1125

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 300/978 (30%), Positives = 479/978 (48%), Gaps = 144/978 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++G+R SN+YDLS +TY+FK+       +         ++
Sbjct: 1   MKQRFSSLDVKVICRELSAAVVGLRVSNIYDLSSRTYLFKIAKPDVRKQ--------FIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  +LR  +++RR+  V Q+G DRI+  +F  G   +++
Sbjct: 53  DSGFRCHITEYSRVTAPAPSHFVSRLRGFLKSRRITAVSQIGTDRIVHIEFSDGY--YHL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
            LE +A GNI+LTD+E+ ++ LLR   + +    +    +Y  +  + +E     +  +L
Sbjct: 111 FLEFFASGNIILTDNEYKIVALLRIVPEGEDQDEVRLGLKYRLDNKQNYEGVPPPSVDRL 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             AL   KE DA+                              +S+ +NK +    + ++
Sbjct: 171 KTALQKGKERDAS------------------------------ISEPANKRAK---KKQE 197

Query: 238 PTLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
             L+  L  +LG   Y P L EH +     D+ L P+  L   +++ D      ++ V +
Sbjct: 198 EALRRAL--SLGFPEYPPVLLEHALHVTGFDSSLRPDQILETGDRVND------LMRVLR 249

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHPP--TESGSSTQIYDEFCPLLLNQFRSR-- 346
             + + + +S      GYI+ +N++   ++P    E+      Y ++ P    QF     
Sbjct: 250 EVESVSNELSTTEQTRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPFAPRQFADGND 309

Query: 347 -EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
              + F++F+ A+DE+YS +E+Q+ E +   +E+    KL     D E RV  L+Q  + 
Sbjct: 310 ISILTFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEI 369

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
             + AE I  NL  V+  + AV   +A  M W ++AR+++ E+   NPVA LI   L L 
Sbjct: 370 HTRKAEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLY 429

Query: 465 RNCMSLLLSNN---------------LDEMDDEEKTLP----VEKVEVDLALSAHANARR 505
            N +++LL                   +E D E KT P    V  V++DL L+  ANA +
Sbjct: 430 ENTVTVLLPEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQ 489

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKK--TRLQ--ILQEKTVANISHMRKVHWFEKFN 561
           +Y+ KK    K+EKTI A  +A K+AEKK  T L+  + QEK V  +   R   WFEKF 
Sbjct: 490 YYDQKKTAAVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPV--LRPARIPFWFEKFY 547

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPP 618
           +FISS+ YLV+ G D +QNE++  R++ KGDVYVHAD+ GA   ++KN +P   + P+PP
Sbjct: 548 FFISSDGYLVLGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKN-KPGASDAPIPP 606

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL QAG FTV  S+AW+SK +  AWWV   QVSKT P+GEYL  G  +IRG KN L P 
Sbjct: 607 GTLAQAGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPG 666

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPV 738
            LI+GF ++F++   S+ +H    R R EE    +      H+  +   SE +  +E P 
Sbjct: 667 QLILGFAVMFQISPESVRNHT---RHRLEEPVSSEMTVKNDHRNGTHEPSEMEKLEESP- 722

Query: 739 AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
                          +T   N    +   E K   N  D     + ++    + PQ+++ 
Sbjct: 723 ---------------NTAVDNCSIGKVGMEQKPRENTWD---LPVEQSAQTGIAPQVKE- 763

Query: 799 IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQG-- 856
                  G A +S            LS+ D   +  A      ++S  ERR +K+G G  
Sbjct: 764 -----PQGEAGLSREDKDT------LSDPDLQQQLAAFGATTKHVSAQERRLMKRGAGLH 812

Query: 857 -SSVVDPKVEREKERGKDASSQPESI---------VRKTKIEGGKIS-RGQKGKLKKMKE 905
            S++ +  ++ E E  ++  S P +          ++ T     ++  RG++GK KK+  
Sbjct: 813 ASALPELGLDEEDEDEEENQSTPSTFKPSGTPTLSIQSTSTSKSQLPVRGKRGKAKKLAS 872

Query: 906 KYGDQDEEERNIRMALLA 923
           KY DQDEE+R + + LL 
Sbjct: 873 KYKDQDEEDRELALRLLG 890


>gi|398396540|ref|XP_003851728.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
 gi|339471608|gb|EGP86704.1| hypothetical protein MYCGRDRAFT_43818 [Zymoseptoria tritici IPO323]
          Length = 1060

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 258/739 (34%), Positives = 390/739 (52%), Gaps = 79/739 (10%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R SNVYDLS + ++ K              +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSNTLVSLRLSNVYDLSSRIFLLKFAKPD--------HREQLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T++AR     P+ F  +LRK ++TRR+  VRQ+G DR+I  +F  G  A+ +
Sbjct: 53  DSGFRCHLTSFARATAAAPTPFVARLRKFLKTRRVTAVRQVGTDRVIELEFSDG--AYRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE YA GNI+LT                DK   I++  R   E     +    +  + +
Sbjct: 111 YLEFYAGGNIVLT----------------DKESTILALLRSVGEGAEHEQYRAGATYNLS 154

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENL-GGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
           L            + N DG  V + S E L  G +      L ++         +A    
Sbjct: 155 L------------RQNFDG--VPDLSTERLRDGLQAAIQKQLIESQKPGKKIKKKAGDAL 200

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
            + +      + P L +H +  +G+  N++  +V  LE + +   VLA  +  + + D I
Sbjct: 201 RRALAITTTEFPPILLDHALHVSGIDRNVQPEQV--LESDELLDKVLAALQQANIVIDDI 258

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKFETFDA 356
           +   V  GYIL +     K+     +     +Y++F P    Q  + E   F +F  F+ 
Sbjct: 259 TQAEVATGYILAKRNGAVKESDGEATDERGLMYEDFHPFKPAQLTAEETIVFREFSGFNK 318

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
            +DEF+S IE Q+ E + + +ED A  ++ +   +Q  R+  L++  + +++ A+ IE N
Sbjct: 319 TVDEFFSSIEGQKLESKLQEREDHAKRRIEQAREEQAKRIDGLQEVQELNIRKAQAIEAN 378

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLS-- 473
           +E V+ A  AV   +A  M W D+ ++++ E+K  N VA LI   L L  N ++LLLS  
Sbjct: 379 VERVEEATAAVNGLIAQGMDWVDIGKLIENEQKRHNAVAELIKLPLKLHENTVTLLLSEL 438

Query: 474 NNLDEMDDEEKTLPVEK---------------------VEVDLALSAHANARRWYELKKK 512
           +  D  DDE      E                      +++DLA S  ANAR++Y+ K+ 
Sbjct: 439 DAADGGDDEANETDSEPDDSDDEDAAPAAKGGEDKRLTIDIDLAASGWANARQYYDQKRS 498

Query: 513 QESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
             +KQEKT  A  KA K+ E++     +  + QEK V  +  +RK  WFEKF +F+SS+ 
Sbjct: 499 AATKQEKTAQASQKALKSTEQRVMADLKKGLKQEKDV--LRPVRKQFWFEKFIYFLSSDG 556

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGC 626
           YLV++G+DAQQNE++ +RY+ KGDVYV+ADL GA+S +IKN+   PE P+PP TL+QAG 
Sbjct: 557 YLVLAGKDAQQNEILYRRYLKKGDVYVNADLQGAASVIIKNNPATPEAPIPPSTLSQAGN 616

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             VC S AW+SK V SAWWV   QVSKTAPTGEYLT G F+IRGKKN LPP  L++GFG+
Sbjct: 617 LAVCTSSAWESKAVMSAWWVNADQVSKTAPTGEYLTNGGFVIRGKKNHLPPAQLLLGFGV 676

Query: 687 LFRLDESSLGSHLNERRVR 705
           +F++ E S  +H+  R  R
Sbjct: 677 MFQISEESKANHVKHRLQR 695


>gi|358398026|gb|EHK47384.1| hypothetical protein TRIATDRAFT_238226 [Trichoderma atroviride IMI
           206040]
          Length = 1068

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 254/748 (33%), Positives = 392/748 (52%), Gaps = 93/748 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L+ +R +NVYDLS K  + K              K  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELQASLVTLRLANVYDLSSKILLLKFAKPDN--------KQQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R H T +AR     PS F  +LRK+++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  ENGFRCHLTDFARTTAAAPSAFVARLRKYLKTRRLTAVTQVGTDRILEFQFSDGQ--YRM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF---ERTTASKL 177
            LE +A GNI+LTD++  +L + R+  + +   A     +Y  E  + +      T  ++
Sbjct: 111 FLEFFASGNIILTDADLKILAISRNVGEGEGQEAQQVGLQYSLENRQNYGGIPALTKERI 170

Query: 178 HAAL-TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
             AL T++++ +ANE                       G  +F   K   K+  D  +A 
Sbjct: 171 RDALKTAAEKAEANE-----------------------GANTFSGKKAKGKSGGDLRKAL 207

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
             ++  +        P L E+I+         KL++V   E + +  LV  +++  D ++
Sbjct: 208 AVSITEL-------PPTLVENILQANSFDVTAKLADVIDNE-SLLDALVRYLSEARDIVE 259

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-----IYDEFCPLLLNQFR---SREF 348
           + I+      GYI  + K           G+++Q     +YD+F P + ++F+   S E 
Sbjct: 260 N-ITASATCTGYIFAKKKATSSSG--LVEGNASQKREGLLYDDFHPFIPHKFKKDSSFEI 316

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++FE ++  +DEF+S +E Q+ E +   +E+AA  KL     +Q  R+  L+     +++
Sbjct: 317 LEFEGYNRTVDEFFSSLEGQKLESRLTGREEAAKKKLEDARHEQGKRIQGLQDAQAMNLR 376

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNC 467
            A  IE N+E V  A+ AV   +A  M W D+ ++++ E+K  NPVA  I+  L L  N 
Sbjct: 377 KAAAIEANVERVQEAMDAVNGLIAQGMDWIDIGKLIEREKKRQNPVAETINLPLKLSENT 436

Query: 468 MSLLLSN----------------NLDEMDDEE---------KTLPVEKVEVDLAL--SAH 500
           ++LLL+                   DE D EE          T P + + VD+ L  S  
Sbjct: 437 ITLLLAEEEFDEDEDEAQEANPYETDESDSEEGLSEANATKDTKPAKLLTVDIVLNVSPW 496

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHW 556
           +NAR +YE ++    K+EKT    +KA K+ E K     +  + QEK +  +  +RK  W
Sbjct: 497 SNAREYYEQRRSAAIKEEKTQQQATKALKSTEHKIAEDLKKGLKQEKAL--LQPIRKQLW 554

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
           FEKF WFISS+ YLV+ G+D QQ+E++ +RY+ KGD+Y HAD+ GA++ VIKN+   P+ 
Sbjct: 555 FEKFLWFISSDGYLVLGGKDPQQSEILYRRYLRKGDIYCHADIRGAANIVIKNNPNTPDA 614

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG  +VC S+AWDSK    AWWV   QVSK+A TGE +  G+F+I GKKN+
Sbjct: 615 PIPPATLSQAGSLSVCSSEAWDSKAGMGAWWVNTDQVSKSASTGEIMPAGNFIIEGKKNY 674

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHLNER 702
           LPP  L++G G  FR+ E S GSHL  R
Sbjct: 675 LPPTQLLLGLGFAFRISEQSKGSHLKHR 702


>gi|406864313|gb|EKD17358.1| serologically defined colon cancer antigen 1 [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 1052

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/726 (34%), Positives = 390/726 (53%), Gaps = 94/726 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDLS K ++ K              K  +L++SG R H T ++R     P+
Sbjct: 21  LVTLRVSNIYDLSSKIFLIKFAKPD--------HKQQILIDSGFRCHLTEFSRATAAAPT 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK+++TRR+  +  +G DRII FQF  G   + + LE YA GNI+LTD E  +L
Sbjct: 73  AFVTRLRKYLKTRRVTSIAPVGTDRIIEFQFSDGQ--YRLFLEFYAGGNIILTDKELNIL 130

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR                       V E     +L   L  S E      ++ N  G 
Sbjct: 131 ALLRI----------------------VGEGEGQEELRVGLKYSLE------NRQNYAG- 161

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA-KQPTLKT--VLGEALG-----YGP 252
            V   +KE L         D  + S    +DG  A KQP  K    L  AL      Y P
Sbjct: 162 -VPPLTKERLQ--------DALQKSVDRGDDGLVAGKQPKKKASDALRRALAVSITEYPP 212

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +H +  T    ++K ++V + +D  +  L+ ++ + +  +Q++ S ++  +GYI+ +
Sbjct: 213 MLVDHAMRVTDFDASLKPADVLQSQD-LLDHLMRSLQEAQSVVQEITSSEVA-KGYIIAK 270

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQR 369
            K   ++  P +      IY++F P    QF    +  F++F+ F+   D+F+S IE Q+
Sbjct: 271 KKEGYEEASPEDQARKFVIYEDFHPFRPRQFENDPATVFLEFQGFNKTADQFFSSIEGQK 330

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
            E + + +E  A  K+     DQ  R+  L++  + +++ A  ++ N E V  A+ AV  
Sbjct: 331 LESRLQEREQMAKRKIEAARQDQAKRLGGLQEVQELNIRKAGALQANAERVQEAMDAVNG 390

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL----------------- 471
            +A  M W ++ ++V+ E+K  NPVA +I   L L+ N +SLL                 
Sbjct: 391 LVAQGMDWVEIGKLVEIEQKRNNPVASIIKLPLKLQENTISLLLDEEEDADDDESNYETD 450

Query: 472 --LSNNLDEMDDEE-KTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
             +S++ DE   +E K   VEK   ++V+LALS  ANAR +Y+ K+    K++KT+ + +
Sbjct: 451 SDVSDSEDEAPKKEPKQKTVEKRLTIDVNLALSPWANAREYYDQKRTAAEKEQKTLQSST 510

Query: 526 KAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
           KA K+ E K     R  + QEK V  +  +R+  WFEKF WFISS+ YLV++G+D QQ E
Sbjct: 511 KALKSQEAKIAHDLRKGLKQEKAV--LRPVRRQMWFEKFTWFISSDGYLVLAGKDPQQKE 568

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
            + +RY+ KGDVYVHA++ GA+S VI+N+   P+ P+PP TL+QAG  ++  S AW++K 
Sbjct: 569 TLYRRYLKKGDVYVHAEVQGAASVVIRNNPKTPDAPIPPSTLSQAGTLSISCSSAWEAKA 628

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
             SAWWV   QVSK A TGE+L  GSF I+GKKNFLPP  L++GFG++F + E S  +H 
Sbjct: 629 GMSAWWVNADQVSKAASTGEFLPAGSFNIKGKKNFLPPAVLLLGFGVIFLISEESKVNH- 687

Query: 700 NERRVR 705
           N+ R++
Sbjct: 688 NKHRLQ 693


>gi|317033383|ref|XP_001395552.2| hypothetical protein ANI_1_620104 [Aspergillus niger CBS 513.88]
          Length = 1108

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 290/973 (29%), Positives = 466/973 (47%), Gaps = 123/973 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + P+ F  ++RK +++RR+  + Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATASAPTPFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI++TD E+ +L L R               +  T +   +  T     H  
Sbjct: 111 FLEFFAGGNIIITDREYNILALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                      PD   E        +K  L  Q+G    +  K S K + D        L
Sbjct: 163 -----------PDITRERVKETVEKAK-ALFAQEG----NAPKKSKKKNAD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H      L P   L EV  L+D A+ + V+ V +      D +
Sbjct: 200 RKALSQGFPEYPPLLLDHAFAVKELDPATPLDEV--LQDEALLLKVVDVLEEAKVETDKL 257

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-----IYDEFCPLLLNQFRSREFV---KF 351
           + +    GYI+ ++        P +           +Y++F P    QF  +  V   ++
Sbjct: 258 ATEKSHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPFKPKQFEGKPGVTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F+A +DE++S IE+Q+ E +   +E+AA  KL+ +  +   R+  LK+  +  ++ A 
Sbjct: 318 PSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRKAG 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N ++L
Sbjct: 378 AIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITL 437

Query: 471 LLSNNLDEMDDEEKTLPVEK----------------------VEVDLALSAHANARRWYE 508
           +L  + +E D+ E     +                       +++DL LS  ANA ++YE
Sbjct: 438 MLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQYYE 497

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 498 QKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFI 555

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLN 622
           SSE YLV+ GRD  Q+E++ +RY+ KGDV+VHADL GA+  ++KN  + P  P+PP TL+
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AWDSK + SA+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675

Query: 683 GFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESL 742
           GFG++F++ + SL +H   R         D+   +    E  + + E DD   KPV +  
Sbjct: 676 GFGVMFQVSKESLRNHKLHR--------FDEPVATEAPVEGQEADKEADD---KPVEQEA 724

Query: 743 SVPNSAHP--------APSHTNASNVDSHEFPAEDKTISNGID-SKIFDIARNVAAPVTP 793
            +  S  P          S +     D    PA +       + ++   IA N +    P
Sbjct: 725 QITKSERPAEAEQEQEQSSESEGEQEDDAVIPARNPLQRGSSEPTQTESIAANESQNAQP 784

Query: 794 QLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT---VRDKPYISKAERRK 850
             +D  +      +   +      ++ Q + +E++K  + + T     D   +S  ERR 
Sbjct: 785 --DDAAEEEKEEEAEEPNGNNEDEQSAQEEPAEDEKDEDESGTSPQTYDDRQLSARERRM 842

Query: 851 LKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQ 910
            +KG+ S +  P       +  ++   P              +RG++GK KK  +KY DQ
Sbjct: 843 ARKGRASELDGPAANGTSAKSTNSKQAP--------------TRGKRGKAKKAAQKYADQ 888

Query: 911 DEEERNIRMALLA 923
           DEE+R + + LL 
Sbjct: 889 DEEDRELALRLLG 901


>gi|134080270|emb|CAK97173.1| unnamed protein product [Aspergillus niger]
          Length = 1180

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 285/953 (29%), Positives = 457/953 (47%), Gaps = 122/953 (12%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           ++ +R SN+YDLS + ++FK+             +  L+++SG R H T Y+R   + P+
Sbjct: 93  IVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVVDSGFRCHVTQYSRATASAPT 144

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  ++RK +++RR+  + Q+G DRII F F  GM  +++ LE +A GNI++TD E+ +L
Sbjct: 145 PFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHMFLEFFAGGNIIITDREYNIL 202

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            L R               +  T +   +  T     H             PD   E   
Sbjct: 203 ALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI-----------PDITRERVK 243

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALG-YGPALSEHII 259
                +K  L  Q+G    +  K S K + D        L+  L +    Y P L +H  
Sbjct: 244 ETVEKAKA-LFAQEG----NAPKKSKKKNAD-------VLRKALSQGFPEYPPLLLDHAF 291

Query: 260 LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKD 319
               L P   L EV  L+D A+ + V+ V +      D ++ +    GYI+ ++      
Sbjct: 292 AVKELDPATPLDEV--LQDEALLLKVVDVLEEAKVETDKLATEKSHPGYIVAKDDTRPSA 349

Query: 320 HPPTESGSSTQ-----IYDEFCPLLLNQFRSREFV---KFETFDAALDEFYSKIESQRAE 371
             P +           +Y++F P    QF  +  V   ++ +F+A +DE++S IE+Q+ E
Sbjct: 350 DSPAQGEEEAARKPGYLYEDFHPFKPKQFEGKPGVTILEYPSFNATVDEYFSSIETQKLE 409

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            +   +E+AA  KL+ +  +   R+  LK+  +  ++ A  IE N+  V  A+ AV   +
Sbjct: 410 SRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRKAGAIEDNVYRVQEAMDAVNGLI 469

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDEEKTLPVEK 490
           A  M W ++AR+++ E+  GNPVA +I   L L  N ++L+L  + +E D+ E     + 
Sbjct: 470 AQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITLMLGESGEEQDEGEDLFSDDD 529

Query: 491 ----------------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
                                 +++DL LS  ANA ++YE KK    K++KT  + +KA 
Sbjct: 530 SESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQYYEQKKMAAVKEQKTTQSSTKAL 589

Query: 529 KAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           K+ EKK     +  + QEK V  +   RK  WFEKF +FISSE YLV+ GRD  Q+E++ 
Sbjct: 590 KSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFISSEGYLVLGGRDVMQSEILY 647

Query: 585 KRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           +RY+ KGDV+VHADL GA+  ++KN  + P  P+PP TL+QAG   V  S AWDSK + S
Sbjct: 648 RRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLSQAGNLCVATSSAWDSKAIMS 707

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           A+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++GFG++F++ + SL +H   R
Sbjct: 708 AYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVLGFGVMFQVSKESLRNHKLHR 767

Query: 703 RVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHP--------APSH 754
                    D+   +    E  + + E DD   KPV +   +  S  P          S 
Sbjct: 768 --------FDEPVATEAPVEGQEADKEADD---KPVEQEAQITKSERPAEAEQEQEQSSE 816

Query: 755 TNASNVDSHEFPAEDKTISNGID-SKIFDIARNVAAPVTPQLEDLIDRALGLGSASISST 813
           +     D    PA +       + ++   IA N +    P  +D  +      +   +  
Sbjct: 817 SEGEQEDDAVIPARNPLQRGSSEPTQTESIAANESQNAQP--DDAAEEEKEEEAEEPNGN 874

Query: 814 KHGIETTQFDLSEEDKHVERTAT---VRDKPYISKAERRKLKKGQGSSVVDPKVEREKER 870
               ++ Q + +E++K  + + T     D   +S  ERR  +KG+ S +  P       +
Sbjct: 875 NEDEQSAQEEPAEDEKDEDESGTSPQTYDDRQLSARERRMARKGRASELDGPAANGTSAK 934

Query: 871 GKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
             ++   P              +RG++GK KK  +KY DQDEE+R + + LL 
Sbjct: 935 STNSKQAP--------------TRGKRGKAKKAAQKYADQDEEDRELALRLLG 973


>gi|307109165|gb|EFN57403.1| hypothetical protein CHLNCDRAFT_57209 [Chlorella variabilis]
          Length = 1158

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 233/513 (45%), Positives = 295/513 (57%), Gaps = 88/513 (17%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TLK  L + + YGP  +EH +L  GL P  +      L       L+  V ++E WL   
Sbjct: 203 TLKGCLADLVPYGPLTAEHCVLLAGLEPQRQ-PAAAPLSALEAAALLGGVRQWEAWLDAC 261

Query: 299 ISGDIVPEGYILMQNKHL------------------GKDHPPTESGSSTQ-IYDEFCPLL 339
                 PEG+IL +                      G++        +   +YDEF PLL
Sbjct: 262 EDSATPPEGFILTKPAAAAAAAAVAAVAAAPPAPAAGQEDGGDGGAPAAAGVYDEFQPLL 321

Query: 340 LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL 399
           L                        I+ QR+  Q  AKE AA  KL  I  D E R+ +L
Sbjct: 322 L------------------------IQGQRSAHQQAAKEKAAVGKLEAIRRDHEKRLGSL 357

Query: 400 KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
            QE + +   A LIEYNLE VDAA+ AVR ALA+ M W DLARMVKEER+AGNPVAGLID
Sbjct: 358 GQEAEAAELKAALIEYNLEAVDAALNAVREALASGMDWRDLARMVKEERRAGNPVAGLID 417

Query: 460 KLYLERNCMSLLL-------------------SNNLDEMDDEEK--TLPVEKVEVDLALS 498
            L LER+ ++LLL                    N LDE D +E+  T P  KVEVDL LS
Sbjct: 418 SLQLERSRVTLLLRRARVCAWGGGGVAGGVRGGNWLDEEDGDEEAATRPATKVEVDLGLS 477

Query: 499 AHANARRWYELKKKQES-----KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM-R 552
           AHANAR +Y+ ++K ++     KQ+KT+ A+ KA KAAEKK + Q+ Q ++ A    + R
Sbjct: 478 AHANARTYYDSRRKHQARGAGVKQQKTLDANQKALKAAEKKAQQQLKQVRSAAAAPAITR 537

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP 612
           K  WFEKF WFISSENYLV+SGRDAQQNE++VKRY+ +GD YVHADLHGASST+++N  P
Sbjct: 538 KPFWFEKFFWFISSENYLVLSGRDAQQNELLVKRYLRRGDAYVHADLHGASSTIVRNSDP 597

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
             P+PPLTL+QAG   VC SQAWD+K+VTSAWWV+P QVSKTAP+GEYL           
Sbjct: 598 GAPIPPLTLSQAGQACVCRSQAWDAKIVTSAWWVHPEQVSKTAPSGEYL----------- 646

Query: 673 NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
                 PL+MG+G +F L E S+ +H+ ER  R
Sbjct: 647 ------PLVMGYGYMFGLAEESIPAHMGERAPR 673



 Score =  202 bits (513), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 104/187 (55%), Positives = 127/187 (67%), Gaps = 24/187 (12%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           MVK R ++ADVAAEV CL+R +GMR +NVYD++PKTY+ KL  S    E GE  KVLLL+
Sbjct: 1   MVKQRFSSADVAAEVSCLQRCLGMRVANVYDINPKTYVLKLARSG---EDGE--KVLLLI 55

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR HT     +K +TPS FTLKLRKHIRTRRLE V+QLG DRI+   FG G  + ++
Sbjct: 56  ESGVRFHTVQAMPEKADTPSNFTLKLRKHIRTRRLEAVKQLGVDRIVQLSFGSGPASCHL 115

Query: 121 ILELYAQ-------------------GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           +LE YAQ                   GN++L D +F VLTLLRSHRDD KGVAIM+RH Y
Sbjct: 116 LLEFYAQASGRRQGELCFGTCMHPCAGNVILADDKFEVLTLLRSHRDDAKGVAIMARHPY 175

Query: 162 PTEICRV 168
           P +  R+
Sbjct: 176 PIQTIRL 182


>gi|358379255|gb|EHK16935.1| hypothetical protein TRIVIDRAFT_10609, partial [Trichoderma virens
           Gv29-8]
          Length = 1079

 Score =  374 bits (960), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 259/798 (32%), Positives = 395/798 (49%), Gaps = 138/798 (17%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L+ +R +NVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKVIAHELQGSLVTLRLANVYDLSSKILLLKFAKPDN--------KQQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK+++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DNGFRCHLTDFARTTAAAPSAFVARLRKYLKTRRLTSVAQVGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS-------------------HRDDDKGVAIMSRHRY 161
            L+ +A GNI+LTD++  +L + R+                   +R +  G+  +++ R 
Sbjct: 111 FLKFFASGNIILTDADLKILAISRNVSEGEGQEPQGVGLQYSLENRQNFGGIPALTKERI 170

Query: 162 PTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDL 221
              +    E+  ASK+ A  + SK                          G+ GG   DL
Sbjct: 171 RDALKTAAEKAEASKVAATFSGSK------------------------AKGKSGG---DL 203

Query: 222 SKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI 281
            K                L   + E     PAL E+I+       + K ++V    DN +
Sbjct: 204 RK---------------ALAVSITE---LPPALVENILQANSFDVSAKPADVV---DNEL 242

Query: 282 QV--LVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL---GKDHPPTESGSSTQIYDEFC 336
            +  LV  +++  D ++++I+     +GYI  + K     G D           +Y++F 
Sbjct: 243 LLDELVKHLSEARDIVENIIASATC-KGYIFAKKKTAPSSGPDETDQAQKHEGLLYEDFH 301

Query: 337 PLLLNQFR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           P +  +F+   S + ++FE ++  +DEF+S +E Q+ E +   +E+AA  KL     +Q 
Sbjct: 302 PFVPQKFKNDPSIQVLEFEGYNRTVDEFFSSLEGQKLESRLSGREEAAKKKLEAARHEQA 361

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
            R+  L+     +++ A  IE N+E V  A+ AV   LA  M W D+ ++++ E+K  NP
Sbjct: 362 KRIEGLQDAQAMNLRKAAAIEANVERVQEAMDAVNGLLAQGMDWVDIGKLIEREKKRQNP 421

Query: 454 VAGLID-KLYLERNCMSLLLSN-----------NLDEMDDEEKTLPVEKVE--------- 492
           VA +I   L L  N ++LLL+            N  E DD +      +V          
Sbjct: 422 VAEIISLPLKLAENTITLLLAEEEFDEDEAAEDNPFETDDSDSEAEASEVTPTKDKKADK 481

Query: 493 ---VDLAL--SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEK 543
              VD+ L  S  +NAR +YE ++    K+EKT    +KA K+ E+K     +  + QEK
Sbjct: 482 LLTVDIVLNTSPWSNAREYYEERRSAAMKEEKTQLQANKALKSTEQKIAEDLKKGLKQEK 541

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
            +  +  +RK  WFEKF WFISS+ YLV+ G+D QQ+EM+ +RY+ KGDVY HAD+ GA+
Sbjct: 542 AL--LQPIRKQMWFEKFIWFISSDGYLVLGGKDPQQSEMLYRRYLRKGDVYCHADIRGAA 599

Query: 604 STVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
             VIKN+   P+ P+PP TL+QAG  +VC S AWDSK     WWV   QVSK+ PTG+ L
Sbjct: 600 HIVIKNNPNTPDAPIPPATLSQAGSLSVCTSDAWDSKAGMGGWWVNADQVSKSTPTGDIL 659

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER------------RVRGEE- 708
             G+F I+GKKN+LPP  L++G G  F++ E S G HL  R               GE+ 
Sbjct: 660 PAGNFTIQGKKNYLPPTQLLLGLGFTFKISEQSKGKHLKHRVHDERSSLATETATTGEDE 719

Query: 709 ----EGMDDFEDSGHHKE 722
               E +D+ EDSG   E
Sbjct: 720 LQNAEEVDNSEDSGDESE 737


>gi|225681027|gb|EEH19311.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 1161

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 308/1001 (30%), Positives = 474/1001 (47%), Gaps = 157/1001 (15%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L R L+G+R SN+YDLS +  +FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVISQELSRALVGLRISNIYDLSSRICLFKLAKPDTRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G R H T Y+R     PS F  +LRK ++TRR+  V QLG DRII     L     ++
Sbjct: 53  DIGFRCHLTEYSRTTAAAPSPFISRLRKFLKTRRVTAVSQLGTDRII--DIALSDGNFHL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLR-SHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           +LE Y  GNI+LTD ++ ++ L R  H   ++            E  RV        L  
Sbjct: 111 LLEFYVGGNIILTDKDYKIVALHRIVHGGGER------------EEVRV-------GLQY 151

Query: 180 ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPT 239
            +T+ +  +   P  +      +  A  E   G+ G         SNK    G + +   
Sbjct: 152 DITNKQNYNGVPPLSIERLRETLQRA--EEAEGESGAVE---GPGSNKR---GKKRQTEA 203

Query: 240 LKTVLGEALGYGPALS-EHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQD 297
           LK  +       PAL  +H     G   N++  +   LED+ + + L+L + + E+ +  
Sbjct: 204 LKRAISMGFPEYPALLLDHSFHAAGFDANLEPKQA--LEDSELMKRLMLVLTEAENVIAR 261

Query: 298 VISGDIVPEGYILMQNK-HLGKDHPPTESGS---STQIYDEFCPLLLNQFRS---REFVK 350
           + + +  P GYI+++ +   G+     ++ S      +Y +F P    QF +      + 
Sbjct: 262 LSTLEDTP-GYIILKGESKTGEAITEADTDSPKPKNMLYHDFHPFKPKQFENVPGMTILT 320

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F TF+ A+DE++S +ESQ+ + +   +E+ A  KL     DQENRV  LK+  +  V+ A
Sbjct: 321 FNTFNKAVDEYFSSVESQKLKYRLTEREEVARRKLEAAQKDQENRVGALKEVQELHVRKA 380

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
           + IE NL  V+ AI AV   +A  M W ++AR+++ E+ + NPVA +I   L L  N ++
Sbjct: 381 QAIEANLLRVEEAINAVNGLIAQGMDWVEIARLIEMEKSSQNPVAKVIKLPLKLYENTVT 440

Query: 470 LLLS---------------------------NNLDEMDDEEKTLPVEKVEVDLALSAHAN 502
           LLL                            N +     ++    +  +++DL +S  AN
Sbjct: 441 LLLGEPTEDEEPADESDEEEEDSESGDEDGGNKVKLEGSKKAQQQLLSIDIDLGISPWAN 500

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFE 558
           AR++YE +K    K+EKT+ +  KA K+ EKK     +  + QEK +  +   R   WFE
Sbjct: 501 ARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPI--LRPTRTPFWFE 558

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPV 616
           KF +F+SS+ YLV+ GRD QQ E++ +RY+ KGDVYVHAD+ GA+   +KN    P+ P+
Sbjct: 559 KFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPGTPDAPI 618

Query: 617 PPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
           PP TL+QAG   V  S AWDSK V  AWWV   QVSKTAP+GE++  G F+IRG+K+ LP
Sbjct: 619 PPGTLSQAGNLCVATSSAWDSKAVMGAWWVNAGQVSKTAPSGEFVGTGGFVIRGEKHQLP 678

Query: 677 PHPLIMGFGLLFRLDESSLGSHLNER----------------------RVRGEEEGMDDF 714
           P  L++GF ++F++ E S+ +H   R                      +   E  G D  
Sbjct: 679 PAQLLLGFAVMFQISEDSIKNHTKFRVQDEPSIVGIAKEVQANEVLHSKQDSEAPGADGN 738

Query: 715 EDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTN---ASNVDSHEFPAEDKT 771
           ++     E  D   E+D+  + P    L +   + P  S  N    S+    + P++D  
Sbjct: 739 KEISLASEEHDSSDEQDEETDNP----LLIGMESEPDDSGGNENKGSDNGEEKLPSDDTD 794

Query: 772 ISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHV 831
                D K ++   +V    T  LE   D  +    A +S  + GI   Q       KH 
Sbjct: 795 -----DEKEYN---SVVTKETVVLESGGDEPITQPEADVSEQQPGITKRQ-----ALKH- 840

Query: 832 ERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQ---------PESIV 882
                      +S  ERR+LKKG    V+   +E+   R  DA SQ         P    
Sbjct: 841 -----------LSARERRQLKKG----VL---IEQTSVRVADAESQSSSPTPSVAPSVTT 882

Query: 883 RKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
                      RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 883 TTNTNTLNSNIRGKRGKSKKLATKYQHQDEEDRELALRLLG 923


>gi|259479735|tpe|CBF70228.1| TPA: DUF814 domain protein, putative (AFU_orthologue; AFUA_2G09170)
           [Aspergillus nidulans FGSC A4]
          Length = 1100

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 253/748 (33%), Positives = 393/748 (52%), Gaps = 90/748 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    K L   L+G+R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRYSSLDVQVISKELASELVGLRVSNIYDLSTRIFLFKVAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R    TPSGF  +LRK++++RR+  V Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATAATPSGFVSRLRKYLKSRRITSVTQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GNI++TD ++T++ LLR               + P       E    +K+   
Sbjct: 111 LLEFFASGNIIITDRDYTIIALLR---------------QVPGG-----EGMEEAKVGLK 150

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            T + + + +    +  D    +    + L  Q+     D  K S K S D        L
Sbjct: 151 YTVTNKQNYSGIPPITRDRIRETLEKAKALFAQEN----DAPKKSKKKSTD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H        P M L +V  L D  +  +VL V +    +   +
Sbjct: 200 RRALSQGFPEYPPLLLDHAFATRAADPAMPLDQV--LGDAGLIDVVLGVLEEAQNVTKDL 257

Query: 300 SGDIVPEGYILMQNKHLGKD-HPPTESGSSTQ----IYDEFCPLLLNQFRSRE---FVKF 351
           S D    G+I+ +     K   P +E   S      +Y++F P    QF  ++    +++
Sbjct: 258 SADKAHPGFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPFKPRQFEGKDGFTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            + +A +DE++S IESQ+ E +   +E AA  KL+ +  + E R+  L+Q  +  ++ A 
Sbjct: 318 PSMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKAS 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            I+ N++ V  A+ AV   +A  M W ++AR+V+ E+K GNPVA LI   L L  N ++L
Sbjct: 378 AIQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITL 437

Query: 471 LLSNNLDEMDDEEKTL------------------PVEK-----VEVDLALSAHANARRWY 507
           LL    DE  + E+                    P +K     +++DL LS  ANA ++Y
Sbjct: 438 LLREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYY 497

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWF 563
           E KK    K EKT  + +KA K+ E+K     +  + QEK V  +   RK  WFEKF +F
Sbjct: 498 EQKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQV--LRPARKPFWFEKFLFF 555

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPPLT 620
           +SSE YLV+ GRD+ Q+EM+ +RY+ KGDV+VHADL GA+  ++KN +P      + P T
Sbjct: 556 VSSEGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKN-KPGALSSSISPTT 614

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L+QAG   V  S AWDSK + SA+WV   QVSKT+  G+ L VG F+++G+KNFL P  L
Sbjct: 615 LSQAGNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQL 674

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEE 708
           ++GF +++++   S GS +N +  R EE
Sbjct: 675 VLGFAVMWQI---SKGSLVNHKSFRSEE 699


>gi|320040092|gb|EFW22026.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 1136

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 301/989 (30%), Positives = 480/989 (48%), Gaps = 155/989 (15%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++G+R SN+YDLS +TY+FK+       +         ++
Sbjct: 1   MKQRFSSLDVKVICRELSAAVVGLRVSNIYDLSSRTYLFKIAKPDVRKQ--------FIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  +LR  +++RR+  V Q+G DRI+  +F  G   +++
Sbjct: 53  DSGFRCHITEYSRVTAPAPSHFVSRLRGFLKSRRITAVSQIGTDRIVHIEFSDGY--YHL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT---TASKL 177
            LE +A GNI+LTD+E+ ++ LLR   + +    +    +Y  +  + +E     +  +L
Sbjct: 111 FLEFFASGNIILTDNEYKIVALLRIVPEGEDQDEVRLGLKYRLDNKQNYEGVPPPSVDRL 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             AL   KE DA+                              +S+ +NK +    + ++
Sbjct: 171 KTALQKGKERDAS------------------------------ISEPANKRAK---KKQE 197

Query: 238 PTLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
             L+  L  +LG   Y P L EH +     D+ L P+  L   +++ D      ++ V +
Sbjct: 198 EALRRAL--SLGFPEYPPVLLEHALHVTGFDSSLRPDQILETGDRVND------LMRVLR 249

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHPP--TESGSSTQIYDEFCPLLLNQFRSR-- 346
             + + + +S      GYI+ +N++   ++P    E+      Y ++ P    QF     
Sbjct: 250 EVESVSNELSTTEQTRGYIVARNENKPSENPSFSGEAKPDKSNYIDYHPFAPRQFADGND 309

Query: 347 -EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
              + F++F+ A+DE+YS +E+Q+ E +   +E+    KL     D E RV  L+Q  + 
Sbjct: 310 ISILTFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEI 369

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
             + AE I  NL  V+  + AV   +A  M W ++AR+++ E+   NPVA LI   L L 
Sbjct: 370 HTRKAEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLY 429

Query: 465 RNCMSLLLSNN---------------LDEMDDEEKTLP----VEKVEVDLALSAHANARR 505
            N +++LL                   +E D E KT P    V  V++DL L+  ANA +
Sbjct: 430 ENTVTVLLPEGQLDEEDDDSEESDEEDEENDGEAKTKPQRPEVLSVDIDLGLTPWANASQ 489

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKK--TRLQ--ILQEKTVANISHMRKVHWFEKFN 561
           +Y+ KK    K+EKTI A  +A K+AEKK  T L+  + QEK V  +   R   WFEKF 
Sbjct: 490 YYDQKKTAAVKEEKTIKASKQALKSAEKKLTTDLKRGLKQEKPV--LRPARIPFWFEKFY 547

Query: 562 WFISSENYLVI-----------SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
           +FISS+ YLV+           SG D +QNE++  R++ KGDVYVHAD+ GA   ++KN 
Sbjct: 548 FFISSDGYLVLGIDSVMLITRSSGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKN- 606

Query: 611 RP---EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           +P   + P+PP TL QAG FTV  S+AW+SK +  AWWV   QVSKT P+GEYL  G  +
Sbjct: 607 KPGASDAPIPPGTLAQAGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVV 666

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIE 727
           IRG KN L P  LI+GF ++F++   S+ +H    R R EE    +      H+  +   
Sbjct: 667 IRGGKNHLAPGQLILGFAVMFQISPESVRNHT---RHRLEEPVSSEMTVKNDHRNGTHEP 723

Query: 728 SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNV 787
           SE +  +E P                +T   N    +   E K   N  D     + ++ 
Sbjct: 724 SEMEKLEESP----------------NTAVDNCSIGKVGMEQKPRENTWD---LPVEQSA 764

Query: 788 AAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAE 847
              + PQ+++        G A +S            LS+ D   +  A      ++S  E
Sbjct: 765 QTGIAPQVKE------PQGEAGLSREDKDT------LSDPDLQQQLAAFGATTKHVSAQE 812

Query: 848 RRKLKKGQG---SSVVDPKVEREKERGKDASSQPESI---------VRKTKIEGGKIS-R 894
           RR +K+G G   S++ +  ++ E E  ++  S P +          ++ T     ++  R
Sbjct: 813 RRLMKRGAGLHASALPELGLDEEDEDEEENQSTPSTFKPSGTPTLSIQSTSTSKSQLPVR 872

Query: 895 GQKGKLKKMKEKYGDQDEEERNIRMALLA 923
           G++GK KK+  KY DQDEE+R + + LL 
Sbjct: 873 GKRGKAKKLASKYKDQDEEDRELALRLLG 901


>gi|119193306|ref|XP_001247259.1| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
 gi|392863500|gb|EAS35746.2| hypothetical protein CIMG_01030 [Coccidioides immitis RS]
          Length = 1125

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 300/978 (30%), Positives = 478/978 (48%), Gaps = 144/978 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++G+R SN+YDLS +TY+FK+       +        L++
Sbjct: 1   MKQRFSSLDVKVICRELSAAVVGLRVSNIYDLSSRTYLFKIAKPDVRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  +LR  +++RR+  V Q+G DRI+  +F  G   +++
Sbjct: 53  DSGFRCHITEYSRVTAPAPSHFVSRLRGFLKSRRITAVSQVGTDRIVHIEFSDGY--YHL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLR--SHRDDDKGVAIMSRHRYPT-EICRVFERTTASKL 177
            LE +A GNI+LTD+E+ ++ LLR     +D   V +  ++R    +        +  +L
Sbjct: 111 FLEFFASGNIILTDNEYKIVALLRIVPEGEDQDEVRLGLKYRLDNKQNYEGVPPPSVDRL 170

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             AL   KE DA+                              +S+ +NK +    + ++
Sbjct: 171 KTALQKGKERDAS------------------------------ISEPANKRAK---KKQE 197

Query: 238 PTLKTVLGEALG---YGPALSEH----IILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
             L+  L  +LG   Y P L EH    I  D+ L P+  L   +++ D      ++ V +
Sbjct: 198 EALRRAL--SLGFPEYPPVLLEHALHVIGFDSSLRPDQILETGDRVND------LMRVLR 249

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHP--PTESGSSTQIYDEFCPLLLNQF---RS 345
             + + + +S      GYI+ +N++   ++P    E+      Y ++ P    QF     
Sbjct: 250 EVESISNELSTTEQTRGYIVARNENKPPENPSFSGEAKPDKSNYIDYHPFAPRQFVDGND 309

Query: 346 REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
              + F++F+ A+DE+YS +E+Q+ E +   +E+    KL     D E RV  L+Q  + 
Sbjct: 310 TSILTFDSFNKAVDEYYSSVETQKLESRLTEREETMKRKLEATKRDHEKRVGALQQVQEI 369

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
             + AE I  NL  V+  + AV   +A  M W ++AR+++ E+   NPVA LI   L L 
Sbjct: 370 HTRRAEAIATNLRKVEEVMNAVNGLIAQGMDWVEIARLIEMEQSRQNPVAKLIKLPLKLY 429

Query: 465 RNCMSLLLSNN---------------LDEMDDEEKTLP----VEKVEVDLALSAHANARR 505
            N +++LL                   +E D E K  P    V  V++DL L+  ANA +
Sbjct: 430 ENTVTVLLPEGQPDGEDDDSEESGEEDEENDGEAKKKPQRPEVLSVDIDLGLTPWANASQ 489

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKK--TRLQ--ILQEKTVANISHMRKVHWFEKFN 561
           +Y+ KK    K++KTI A  +A K+AEKK  T L+  + QEK V  +   R   WFEKF 
Sbjct: 490 YYDQKKTAAIKEDKTIKASKQALKSAEKKLTTDLKRGLKQEKPV--LRPARIPFWFEKFY 547

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPP 618
           +FISS+ YLV+ G D +QNE++  R++ KGDVYVHAD+ GA   ++KN +P   + P+PP
Sbjct: 548 FFISSDGYLVLGGSDDRQNEILYHRHLRKGDVYVHADMEGAIPLIVKN-KPGASDAPIPP 606

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL QAG FTV  S+AW+SK +  AWWV   QVSKT P+GEYL  G  +IRG KN L P 
Sbjct: 607 GTLAQAGTFTVATSRAWESKALMGAWWVNADQVSKTTPSGEYLATGGVVIRGGKNHLAPG 666

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPV 738
            LI+GF ++F++   S+ +H    R R EE    +      H+  +   SE +  +E P 
Sbjct: 667 QLILGFAVMFQISPESVRNHT---RHRLEEPVSSEMTVKNDHRNGTHEPSEMEKLEESP- 722

Query: 739 AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL 798
                          +T   N    +   E K   N  D     + ++    + PQ+++ 
Sbjct: 723 ---------------NTAVDNCSIGKVGMEQKPRENTTD---LPVEQSAQTGIAPQVKE- 763

Query: 799 IDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQG-- 856
                  G A +S            L++ D   +  A      ++S  ERR +K+G G  
Sbjct: 764 -----PQGEAGLSREDKDA------LADPDLQQQLAAFGATTKHVSAQERRLMKRGAGLH 812

Query: 857 -SSVVDPKVEREKERGKDASSQPESI---------VRKTKIEGGKIS-RGQKGKLKKMKE 905
            S++ +  ++ E E  ++  S P +          ++ T     ++  RG++GK KK+  
Sbjct: 813 ASALSELGLDEEDEDEEENQSTPSTFKPSGTQTLSIQSTSTSKSQLPVRGKRGKAKKLAS 872

Query: 906 KYGDQDEEERNIRMALLA 923
           KY DQDEE+R + + LL 
Sbjct: 873 KYKDQDEEDRELALRLLG 890


>gi|67539818|ref|XP_663683.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
 gi|40738864|gb|EAA58054.1| hypothetical protein AN6079.2 [Aspergillus nidulans FGSC A4]
          Length = 1588

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 253/748 (33%), Positives = 393/748 (52%), Gaps = 90/748 (12%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    K L   L+G+R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRYSSLDVQVISKELASELVGLRVSNIYDLSTRIFLFKVAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R    TPSGF  +LRK++++RR+  V Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATAATPSGFVSRLRKYLKSRRITSVTQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GNI++TD ++T++ LLR               + P       E    +K+   
Sbjct: 111 LLEFFASGNIIITDRDYTIIALLR---------------QVPGG-----EGMEEAKVGLK 150

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            T + + + +    +  D    +    + L  Q+     D  K S K S D        L
Sbjct: 151 YTVTNKQNYSGIPPITRDRIRETLEKAKALFAQEN----DAPKKSKKKSTD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H        P M L +V  L D  +  +VL V +    +   +
Sbjct: 200 RRALSQGFPEYPPLLLDHAFATRAADPAMPLDQV--LGDAGLIDVVLGVLEEAQNVTKDL 257

Query: 300 SGDIVPEGYILMQNKHLGKDH-PPTESGSSTQ----IYDEFCPLLLNQFRSRE---FVKF 351
           S D    G+I+ +     K   P +E   S      +Y++F P    QF  ++    +++
Sbjct: 258 SADKAHPGFIVAKEDTRPKPPGPESEKNDSPSKPALLYEDFHPFKPRQFEGKDGFTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            + +A +DE++S IESQ+ E +   +E AA  KL+ +  + E R+  L+Q  +  ++ A 
Sbjct: 318 PSMNATVDEYFSSIESQKLESRLTERESAAKKKLDSLRSEHEKRIGALEQAQELHIRKAS 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            I+ N++ V  A+ AV   +A  M W ++AR+V+ E+K GNPVA LI   L L  N ++L
Sbjct: 378 AIQDNMDRVQEAMDAVNGLVAQGMDWVEIARLVEMEQKRGNPVASLIKLPLKLHENTITL 437

Query: 471 LLSNNLDEMDDEEKTL------------------PVEK-----VEVDLALSAHANARRWY 507
           LL    DE  + E+                    P +K     +++DL LS  ANA ++Y
Sbjct: 438 LLREAGDEGYEVEELFSSDESEDSDEEEGKGAASPQKKPEGLTIDIDLGLSPWANASQYY 497

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWF 563
           E KK    K EKT  + +KA K+ E+K     +  + QEK V  +   RK  WFEKF +F
Sbjct: 498 EQKKVAAVKAEKTSQSSAKALKSHERKVQDDLKRNLKQEKQV--LRPARKPFWFEKFLFF 555

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP---EQPVPPLT 620
           +SSE YLV+ GRD+ Q+EM+ +RY+ KGDV+VHADL GA+  ++KN +P      + P T
Sbjct: 556 VSSEGYLVLGGRDSMQSEMLYRRYLRKGDVFVHADLEGATPMIVKN-KPGALSSSISPTT 614

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L+QAG   V  S AWDSK + SA+WV   QVSKT+  G+ L VG F+++G+KNFL P  L
Sbjct: 615 LSQAGNLCVATSTAWDSKAIMSAYWVDAAQVSKTSAVGDLLPVGEFLVKGEKNFLAPSQL 674

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEE 708
           ++GF +++++   S GS +N +  R EE
Sbjct: 675 VLGFAVMWQI---SKGSLVNHKSFRSEE 699


>gi|63054438|ref|NP_588145.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe
           972h-]
 gi|48475020|sp|Q9USN8.2|YJY1_SCHPO RecName: Full=Uncharacterized protein C132.01c
 gi|157310510|emb|CAA22870.2| nuclear export mediator factor NEMF [Schizosaccharomyces pombe]
          Length = 1021

 Score =  370 bits (951), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 253/748 (33%), Positives = 389/748 (52%), Gaps = 89/748 (11%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +  D+AA    LR +++G R +N YDL+ +T++ K           +  K  +++
Sbjct: 1   MKQRFSALDIAAIAAELREQVVGCRLNNFYDLNARTFLLKF--------GKQDAKYSIVI 52

Query: 61  ESGVRLHTTAYARDKKNTP-SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN--- 116
           ESG R H T +  D++N P SGF  KLRKHI++RRL  V QLG DR+++F FG G N   
Sbjct: 53  ESGFRAHLTKF--DRENAPLSGFVTKLRKHIKSRRLTGVSQLGTDRVLVFTFGGGANDQD 110

Query: 117 ---AHYVILELYAQGNILLTDSEFTVLTLLRSHR-DDDKGVAIMSRHRYP-------TEI 165
               +Y++ E +A GN+LL D  + +L+LLR    D D+  A+  ++           + 
Sbjct: 111 PDWTYYLVCEFFAAGNVLLLDGHYKILSLLRVVTFDKDQVYAVGQKYNLDKNNLVNDNKS 170

Query: 166 CRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNS 225
                  TA +L+  L       A+ P  +NE                       L    
Sbjct: 171 QSTIPHMTAERLNILLDEISTAYAS-PTSINEP----------------------LPDQQ 207

Query: 226 NKNSNDGARAKQP-TLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV 283
             +S    +  +P +L+  L   LG YG AL EH +  + L P     ++    D   + 
Sbjct: 208 LSSSTKPIKVPKPVSLRKALTIRLGEYGNALIEHCLRRSKLDPLFPACQL--CADETKKN 265

Query: 284 LVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQF 343
            +LA  +  D +   ++   V +GYI    + L     P      T +Y++F P    Q 
Sbjct: 266 DLLAAFQEADSILAAVNKPPV-KGYIFSLEQALTNAADPQHPEECTTLYEDFHPFQPLQL 324

Query: 344 --RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
              +R+ ++F T++  +DEF+S IE+Q+ +++   +   A  +L     DQ  ++ +L+ 
Sbjct: 325 VQANRKCMEFPTYNECVDEFFSSIEAQKLKKRAHDRLATAERRLESAKEDQARKLQSLQD 384

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
                   A+ IE N E V+A I  +   L   M W D+ ++++ +++  +PVA  I   
Sbjct: 385 AQATCALRAQAIEMNPELVEAIISYINSLLNQGMDWLDIEKLIQSQKRR-SPVAAAIQIP 443

Query: 461 LYLERNCMSLLLSN--NLDEMDDEEKTLPVEK--------------------VEVDLALS 498
           L L +N +++ L N  ++D  D+  +T   +                     VE+DL+L 
Sbjct: 444 LKLIKNAVTVFLPNPESVDNSDESSETSDDDLDDSDDDNKVKEGKVSSKFIAVELDLSLG 503

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM---RKVH 555
           A ANAR+ YEL+++   K+ KT  A SKA K+ ++K   Q L+  T A+   +   RK  
Sbjct: 504 AFANARKQYELRREALIKETKTAEAASKALKSTQRKIE-QDLKRSTTADTQRILLGRKTF 562

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           +FEKF+WFISSE YLV+ GRDAQQNE++ ++Y + GD++V ADL  +S  ++KN  P  P
Sbjct: 563 FFEKFHWFISSEGYLVLGGRDAQQNELLFQKYCNTGDIFVCADLPKSSIIIVKNKNPHDP 622

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL QAG   +  S+AWDSK V SAWWV   +VSK APTGE L  GSF IR KKN+L
Sbjct: 623 IPPNTLQQAGSLALASSKAWDSKTVISAWWVRIDEVSKLAPTGEILPTGSFAIRAKKNYL 682

Query: 676 PPHPLIMGFGLLFRLDESSLGSHLNERR 703
           PP  LIMG+G+L++LDE S     +ERR
Sbjct: 683 PPTVLIMGYGILWQLDEKS-----SERR 705


>gi|195451571|ref|XP_002072981.1| GK13887 [Drosophila willistoni]
 gi|194169066|gb|EDW83967.1| GK13887 [Drosophila willistoni]
          Length = 1004

 Score =  370 bits (950), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 255/633 (40%), Positives = 362/633 (57%), Gaps = 75/633 (11%)

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDAALDEFYS 363
           +GYI+       K+  PT+ G     +   EF P L +Q +  E  + ETF  A+DEF+S
Sbjct: 284 KGYIMQV-----KEEKPTDGGDVDYFFRNVEFHPFLFSQLKHLEVEEHETFMTAVDEFFS 338

Query: 364 KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVD 421
           K ESQR + +   +E  A  KL+ I  D   R+  L   Q VD+  + AELI  N   VD
Sbjct: 339 KQESQRIDMKTLGQERDALKKLSNIKNDHAQRLEDLNKVQSVDK--RKAELITCNQSLVD 396

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS-----NNL 476
            AILAV+ A+A+++ W D+ ++VKE +  G+ VA  I +L LE N +SL+LS     N+ 
Sbjct: 397 KAILAVQSAIASQLPWPDIRQLVKEAQANGDIVANSIKQLKLETNHISLILSDPYSANDS 456

Query: 477 DEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           DE DDEE   P+  V+VDLALSA ANARR+Y+LK+    K++KT+ A  KA K+AE+KT+
Sbjct: 457 DEDDDEESEEPM-IVDVDLALSAWANARRYYDLKRSAAKKEQKTVDASEKALKSAERKTQ 515

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
             + + +T++NI+  RKV WFEKF WFISSENYLVI GRDAQQNE+IVKRYM   D+YVH
Sbjct: 516 QTLKEVRTISNIAKARKVFWFEKFYWFISSENYLVIGGRDAQQNELIVKRYMRPKDIYVH 575

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           A++ GASS +I+N   ++ +PP TL +AG   + +S AWD+K++T+A+WV   QVSKTAP
Sbjct: 576 AEIQGASSVIIRNPNADE-IPPKTLLEAGTMAISYSVAWDAKVITNAYWVTSDQVSKTAP 634

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED 716
           TGEYL  GSFMIRGKKNFLP   LIMG   LF+L++S +  H  ER++R  +E  +D + 
Sbjct: 635 TGEYLGTGSFMIRGKKNFLPSCHLIMGLSFLFKLEDSFVQRHAGERKIRSTDEDPNDIDL 694

Query: 717 SGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGI 776
                 N  +    +D       ESL       P+ +  N  N D+              
Sbjct: 695 KQCDIANDGLPEISED------GESL-------PSQNVNNIENADN-------------- 727

Query: 777 DSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT 836
                           P  E  I+   G  +    S   G E      +E +  + + AT
Sbjct: 728 --------------AFPDTEVKIEHDTGRVTIRTDSYPQGSEPA----TEPENDLTKNAT 769

Query: 837 VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIE----GGKI 892
             ++  I  A   + K+ + ++      +R+ ++G+   ++P++ V + ++E     G +
Sbjct: 770 EDEETTIIAAAPARQKQQKSNN------KRKDDKGR--KNKPQNQVTEVEVEPKPNTGVL 821

Query: 893 SRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
            RGQK KLKKMK KY DQDEEER +RM +L  S
Sbjct: 822 KRGQKSKLKKMKLKYKDQDEEERKLRMMILNSS 854



 Score =  157 bits (397), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 109/163 (66%), Gaps = 6/163 (3%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R +T D+   V  L+RL+G+R + +YD+  KTY+ +L  + G     E+EKV LL+E
Sbjct: 1   MKTRFSTYDIICGVAELQRLVGLRVNQIYDIDNKTYLIRLQGTGG-----ETEKVTLLIE 55

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTTA+   K   PSGF++KLRKH++ +RLE + QLG DRI+  QFG G  A++VI
Sbjct: 56  SGTRFHTTAFEWPKNVAPSGFSMKLRKHLKNKRLEHIHQLGADRIVDLQFGTGDAAYHVI 115

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           LELY +GN++LTD E T+L +LR H + +  +    R +YP +
Sbjct: 116 LELYDRGNVILTDYEQTILYILRPHTEGE-ALRFAVREKYPID 157


>gi|350636898|gb|EHA25256.1| hypothetical protein ASPNIDRAFT_49657 [Aspergillus niger ATCC 1015]
          Length = 1515

 Score =  368 bits (945), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 254/809 (31%), Positives = 402/809 (49%), Gaps = 93/809 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FK+             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + P+ F  ++RK +++RR+  + Q+G DRII F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATASAPTPFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI++TD E+ +L L R               +  T +   +  T     H  
Sbjct: 111 FLEFFAGGNIIITDREYNILALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                      PD   E        +K  L  Q+G    +  K S K + D        L
Sbjct: 163 -----------PDITRERVKETVEKAKA-LFAQEG----NAPKKSKKKNAD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H      L P   L EV  L+D A+ + V+ V +      D +
Sbjct: 200 RKALSQGFPEYPPLLLDHAFAVKELDPATPLDEV--LQDEALLLKVVDVLEEAKVETDKL 257

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-----IYDEFCPLLLNQFRSREFV---KF 351
           + +    GYI+ ++        P +           +Y++F P    QF  +  V   ++
Sbjct: 258 ATEKSHPGYIVAKDDTRPSADSPAQGEEEAARKPGYLYEDFHPFKPKQFEGKPGVTILEY 317

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F+A +DE++S IE+Q+ E +   +E+AA  KL+ +  +   R+  LK+  +  ++ A 
Sbjct: 318 PSFNATVDEYFSSIETQKLESRLTEREEAAKKKLDAVRQEHAKRIGALKEVQELHIRKAG 377

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N ++L
Sbjct: 378 AIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITL 437

Query: 471 LLSNNLDEMDDEEKTLPVEK----------------------VEVDLALSAHANARRWYE 508
           +L  + +E D+ E     +                       +++DL LS  ANA ++YE
Sbjct: 438 MLGESGEEQDEGEDLFSDDDSESEDEQEEVAKAQKQSNNMLTIDIDLGLSPWANATQYYE 497

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 498 QKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFI 555

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLN 622
           SSE YLV+ GRD  Q+E++ +RY+ KGDV+VHADL GA+  ++KN  + P  P+PP TL+
Sbjct: 556 SSEGYLVLGGRDVMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSPNAPIPPSTLS 615

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AWDSK + SA+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++
Sbjct: 616 QAGNLCVATSSAWDSKAIMSAYWVNASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 675

Query: 683 GFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDD-TDEKPVAES 741
           GFG++F++ + SL +H   R         D+   +    E  + + E DD  + +P   +
Sbjct: 676 GFGVMFQVSKESLRNHKLHR--------FDEPVATEAPVEGQEADKEADDKPNAQPDDAA 727

Query: 742 LSVPNSAHPAPSHTNASNVDSHEFPAEDK 770
                     P+  N     + E PAED+
Sbjct: 728 EEEKEEEAEEPNGNNEDEQSAQEEPAEDE 756


>gi|46128721|ref|XP_388914.1| hypothetical protein FG08738.1 [Gibberella zeae PH-1]
          Length = 1077

 Score =  367 bits (942), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 275/816 (33%), Positives = 403/816 (49%), Gaps = 115/816 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKIIAHELQERLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  VRQ+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTKFARTTAAAPSIFVARLRKFLKTRRLTAVRQVGTDRVLEFEFSDGQ--YRM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD++  +L L R+  + +         + P  +   +           
Sbjct: 111 FLEFFASGNIILTDADLNILALARTVSEGE--------GQEPQRVGLQYSLENRQNYGEI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              +KE   N      E     + +SK+    QKG    DL K               +L
Sbjct: 163 PALTKERVQNALKAAVEKAAADATSSKK----QKGKPGGDLRK---------------SL 203

Query: 241 KTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
              + E     P L +H +     DT + P+  L+    L++     LV ++ +    ++
Sbjct: 204 AVSITE---LPPVLVDHWLHTNNFDTTVKPHEVLANETLLDE-----LVKSLQEARKIVE 255

Query: 297 DVISGDIVPEGYILMQNKHLGKD---HPPTESGSSTQIYDEFCPLLLNQFRSR---EFVK 350
           ++ S +    GYI  + +   +       T++     +YD+F P +  + ++    E ++
Sbjct: 256 ELTSSETC-TGYIFAKRRERPEGTEVDEETKTKRDNLLYDDFHPFIPYKLKNDPAIEVLE 314

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ ++  +DEF+S +E QR E +   +E  A  KL     +Q  R+  L++    + + A
Sbjct: 315 FQGYNETVDEFFSSLEGQRLESKLTEREATAKRKLEAAKNEQNKRIEGLQEAQSLNFRKA 374

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             IE N+E V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I   L L  N ++
Sbjct: 375 AAIEANVERVQEAMDAVNGLLNQGMDWVDVGKLVEREKKRHNPVAEIIKLPLNLAENLIT 434

Query: 470 LLL---------------------------SNNLDEMDDEEKTLPVEKVEVDLALSAHAN 502
           L L                           + +  +     K L    VE++L LS  +N
Sbjct: 435 LELAEEEFEPEEDDPYETDDDDDSALGDDEATSAAKGKQSNKAL---NVEINLGLSPWSN 491

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFE 558
           AR +++ +K    K+EKT    SKA K AE+K     +  + QEK +  +  +RK  WFE
Sbjct: 492 AREYFDQRKTAAVKEEKTQQQASKALKNAEQKITEDLKKGLKQEKAL--LQPIRKQMWFE 549

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPV 616
           KF WFISS+ YLVI G+DAQQNE I K+Y+ KGD+Y HADLHGASS +IKN+   P+ P+
Sbjct: 550 KFTWFISSDGYLVIGGKDAQQNETIYKKYLRKGDIYCHADLHGASSVIIKNNPKTPDAPI 609

Query: 617 PPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
           PP TL+QAG   VC S AWDSK    AWWV   QVSK+APTGE+L  GSFMIRGKKNFLP
Sbjct: 610 PPATLSQAGSLAVCSSNAWDSKAGMPAWWVNADQVSKSAPTGEFLQAGSFMIRGKKNFLP 669

Query: 677 PHPLIMGFGLLFRLDESSLGSHLNER-----RVRGEEE----------GMDDFEDSGHHK 721
           P  L++G GL FR+ E S   H+  R        G+E           G  D  D+GH  
Sbjct: 670 PAQLLLGLGLAFRISEESKAKHVKHRLHDVDSAIGDEGSGAPQSVGMMGDADEPDAGH-- 727

Query: 722 ENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNA 757
             SD+ S+ +  DEKP  ES   P  A       NA
Sbjct: 728 --SDVPSDYETEDEKPDEESRDNPLQAFKKGEGRNA 761


>gi|358369883|dbj|GAA86496.1| DUF814 domain protein [Aspergillus kawachii IFO 4308]
          Length = 1157

 Score =  366 bits (940), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 240/736 (32%), Positives = 377/736 (51%), Gaps = 84/736 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FK+             +  L++
Sbjct: 51  MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKVAKPD--------HRKQLVV 102

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + P+ F  ++RK +++RR+  + Q+G DRII F F  GM  +++
Sbjct: 103 DSGFRCHVTQYSRATASAPTPFVTRMRKFLKSRRITSIEQIGTDRIIDFSFSDGM--YHM 160

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI++TD E+ +L L R               +  T +   +  T     H  
Sbjct: 161 FLEFFAGGNIIITDREYNILALFRQ--------VPAGEGQDETRVGVKYTVTNKQNYHGI 212

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                      PD   E        +K  L  Q+G       K S K + D        L
Sbjct: 213 -----------PDITRERVQETVEKAK-ALFSQEGS----APKKSKKKNAD-------VL 249

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H      L P   L EV  L+D A+   V+ V +      D +
Sbjct: 250 RKALSQGFPEYPPLLLDHAFAVKELDPATPLDEV--LQDEALLTKVVDVLEAAKVETDKL 307

Query: 300 SGDIVPEGYILM-QNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSREFV---KF 351
           + +    GYI+  ++     D P      + +    +Y++F P    QF  +  V   ++
Sbjct: 308 ATEKSHPGYIVAKEDTRPSADSPAQGEEDAARKPGYLYEDFHPFKPKQFEGKPGVTILEY 367

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F+A +DE++S IE+Q+ E +   +E+ A  KL  +  +   R+  LK+  +  ++ A 
Sbjct: 368 PSFNATVDEYFSSIETQKLESRLTEREETAKRKLEAVRQEHAKRIGALKEVQELHIRKAG 427

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N ++L
Sbjct: 428 AIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVANIIKLPLKLYENTITL 487

Query: 471 LLSNNLDEMDDEEKTLPVEK----------------------VEVDLALSAHANARRWYE 508
           +L  + +E D+ E     ++                      +++DL LS  ANA ++YE
Sbjct: 488 MLGESGEEQDEGEDLFSDDESESEDEQEEAAKAQKQSNNMLTIDIDLGLSPWANATQYYE 547

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 548 QKKMAAVKEQKTTQSSTKALKSHEKKVTQDLKKGLKQEKQV--LRPARKTFWFEKFLFFI 605

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLN 622
           SSE YLV+ GRDA Q+E++ +RY+ KGDV+VHADL GA+  ++KN  +    P+PP TL+
Sbjct: 606 SSEGYLVLGGRDAMQSEILYRRYLKKGDVFVHADLQGATPMIVKNRSNSSNAPIPPSTLS 665

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AWDSK + SA+WV   QVSKTA  G  L  G F+I+G+KNFL P  L++
Sbjct: 666 QAGNLCVATSSAWDSKAIMSAYWVTASQVSKTADAGGLLPTGEFLIKGEKNFLAPSQLVL 725

Query: 683 GFGLLFRLDESSLGSH 698
           GFG++F++ + SL +H
Sbjct: 726 GFGVMFQVSKESLRNH 741


>gi|361131825|gb|EHL03460.1| putative Nuclear export mediator factor Nemf [Glarea lozoyensis
           74030]
          Length = 1063

 Score =  366 bits (940), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 300/956 (31%), Positives = 455/956 (47%), Gaps = 160/956 (16%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDLS K ++ K              K  ++++SG R H T YAR   +  S
Sbjct: 21  LLTLRVSNIYDLSSKIFLIKFAKPE--------HKQQIIIDSGFRCHLTDYARATASDQS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  KLRK ++TRR+  V Q+G DRII FQF  G+   Y  LE YA GNI        +L
Sbjct: 73  DFVKKLRKVLKTRRVTSVCQIGTDRIIEFQFSDGLYKLY--LEFYAAGNI--------IL 122

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
           T        DK + I++  R P       E     +L   L  S E   N         +
Sbjct: 123 T--------DKELNILALLR-PVPAGEGQE-----ELRVGLQYSLENRQNY--------H 160

Query: 201 NVSNASKENLGG------QKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPAL 254
            V   +KE L         KG +     K + K   D  R      K +      + P +
Sbjct: 161 GVPGLTKERLQNALQRAVDKGDEGLVAGKKAKKKGADALR------KALAVSITEFPPMV 214

Query: 255 SEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK 314
            +H +  T     +K + V + +D+ +  L+ A+ + +  +++V S ++   G+I+ + K
Sbjct: 215 VDHAMRVTSFDSTLKPAGVLQ-KDSLVDDLMKALQEAQKVMEEVTSCEVA-TGFIIAKKK 272

Query: 315 HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKIESQRAE 371
              +++   E  S   +YD+F P    QF S     F+++E F+  +DEF+S IE QR E
Sbjct: 273 EGYEENSDPEHSSKNVLYDDFHPFRPAQFESDPATVFLQYEGFNKTVDEFFSSIEGQRLE 332

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            + + +E  A  K+     DQE R+  L+     + + A  I+ N+E V  A+ AV   +
Sbjct: 333 SKLEERELNAQRKIQAARQDQERRLDGLQAVQSLNERKASAIQANVERVQEAMDAVNGLV 392

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL------------------ 472
           A  M W ++ ++++ E+K  NPVA +I   L LE N ++LLL                  
Sbjct: 393 AQGMDWVEIGKLIEVEKKRSNPVASMIKLPLKLEENTITLLLDEEVFDEDEDSAYETDDA 452

Query: 473 -SNNLDEM--DDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
            S++ DE+    E K   VEK   V++DL L+   NAR +++ K++  +K++KT+ + +K
Sbjct: 453 PSDSEDEVTKQKEPKEKGVEKRLTVDIDLGLTPWKNAREYFDEKRQAATKEQKTLESSTK 512

Query: 527 AFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           A K+ E K     +  + QEK V  +  +R++ WFEKF WFISS+ YLV+ G+DAQQNEM
Sbjct: 513 ALKSQEAKIAHDLKKGLQQEKAV--LRPVRRLMWFEKFIWFISSDGYLVLGGKDAQQNEM 570

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           + K+YM KGD ++HAD+ GA++ V++N    P+ P+PP TL+QAG   V  S AWDSK  
Sbjct: 571 LYKKYMKKGDAFLHADIQGAATVVVRNDPRTPDAPIPPSTLSQAGSLVVSCSVAWDSKAG 630

Query: 641 TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN 700
            SAWW    Q+SK AP+G++L  GSF + GKKNFLPP  L++GFG++FR+ ESS   HL 
Sbjct: 631 MSAWWASATQISKAAPSGDFLPPGSFSVNGKKNFLPPSQLLLGFGVIFRISESSKSKHLK 690

Query: 701 ERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPA--------P 752
            R        + D  D   H     +E    DT E  +AES     SA P          
Sbjct: 691 HR--------VSDDRDQNRHS----VEEPNQDTPE--IAESEVASESAVPEIDDGQDSDD 736

Query: 753 SHTNASNVDSHEFPAEDKTI---SNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSAS 809
             +NAS+ +  E       +   S   + KI +++ +          DL D         
Sbjct: 737 GTSNASDAEEEEQNTPSNPLQRQSTATEPKIAEVSND----------DLTD--------- 777

Query: 810 ISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKE 869
                 GIE  + D + +  H   TAT  D    S+++       Q +    P    +  
Sbjct: 778 ------GIEALEIDDTPKIPH---TATPNDIDSNSESDD-DTDFNQTTGTRTPNTVADNR 827

Query: 870 RGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           +G  A+ +     +  KI                  KY DQDEE+R     L+  S
Sbjct: 828 KGGPATKKRGKRGKAKKIAN----------------KYKDQDEEDRLAAQQLIGAS 867


>gi|169783790|ref|XP_001826357.1| hypothetical protein AOR_1_1306054 [Aspergillus oryzae RIB40]
 gi|83775101|dbj|BAE65224.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1103

 Score =  365 bits (937), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 259/833 (31%), Positives = 408/833 (48%), Gaps = 134/833 (16%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKLAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + PS F  ++RK +R+RR+  V+Q+G DRII   F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATASMPSPFVTRMRKFLRSRRITSVKQIGTDRIIDISFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS---HRDDDKGVAIM-------SRHRYPTEICRVFE 170
            LE +A GNI++TD E  +L L R       ++  V I        + H  P EI     
Sbjct: 111 FLEFFAGGNIIITDREHNILALYRQVSVSEGEEARVGIQYTVTNKQNYHGIP-EITLDRI 169

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           R T  K  A                 EDG                       K S K + 
Sbjct: 170 RETLEKAKALF-------------AREDG---------------------APKKSKKKNA 195

Query: 231 DGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
           D        L+  L +    Y P L +H  +   + P   L +V  L+D ++   V  V 
Sbjct: 196 D-------VLRKALSQGFPEYPPLLLDHAFVTKEVDPTTPLDKV--LQDESLLQEVNGVL 246

Query: 290 KFEDWLQDVISGDIVPEGYILMQ------NKHLGKDHPPTESGSSTQIYDEFCPLLLNQF 343
           +        +S      GYI+ +      ++   ++  P+E+G+   +Y++F P    QF
Sbjct: 247 QEAQNENTRLSTQESHPGYIVAKEDNRSVSQSANENEKPSETGNL--LYEDFHPFKPRQF 304

Query: 344 RSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
             +     ++F + +A +DE++S IE+Q+ E +   +E+AA  KL  +  + E ++  LK
Sbjct: 305 EGKPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGALK 364

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID- 459
           ++ +  ++ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I  
Sbjct: 365 EQQELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARIIKL 424

Query: 460 KLYLERNCMSLLLSNNLDEMDD--------------------EEKTLP-VEKVEVDLALS 498
            L L  N ++LLL    DE D+                    E +  P V  +++DL +S
Sbjct: 425 PLKLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLGIS 484

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISHMRKVHW 556
             ANA+++YE KK+   K+++T  + +KA K+ EKK    L+   +K    +   R+  W
Sbjct: 485 PWANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQPFW 544

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQ 614
           FEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P  
Sbjct: 545 FEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDPTA 604

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG   V  S AWDSK V SAWWV   Q++KTA  G  L +G F+++G+KNF
Sbjct: 605 PIPPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEKNF 664

Query: 675 LPPHPLIMGFGLLFRLDESSLGSHLNE----------RRVRGEEEGMDDFEDSGHHKE-- 722
           L P  L++GFG+ F++ + SL +H              R  G E+  +  + S   +E  
Sbjct: 665 LAPSQLVLGFGVTFQISKDSLKNHKTHFVDEPEAPEATREGGHEQAGESTQRSEQQQETE 724

Query: 723 ----------------NSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASN 759
                           +SD E+E+D+ D  P    L    S  P   HT A+ 
Sbjct: 725 EAHKPSLDPKEQAEEQSSDSENEQDNADSLPARNPLQRGPSESP---HTEAAQ 774


>gi|212529000|ref|XP_002144657.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
 gi|210074055|gb|EEA28142.1| DUF814 domain protein, putative [Talaromyces marneffei ATCC 18224]
          Length = 1117

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 241/744 (32%), Positives = 385/744 (51%), Gaps = 92/744 (12%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   +IG+R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSIDVKIICQELSTSIIGLRVSNIYDLSSRIFLFKLAKPD--------HRKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   +TPSGF  +LRK ++TRR+  V+QLG DR+I   F  G+   ++
Sbjct: 53  DSGFRCHLTEYSRTTASTPSGFVSRLRKCLKTRRVTSVQQLGTDRVIDIVFSDGL--FHI 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD+E  +L L R+           +  +   +I   +    A   H  
Sbjct: 111 YLEFFAGGNIILTDAENKILALFRT--------VAAAGEQDEVKIGLTYAVEKAQYYHGI 162

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGG--QKGGKSFDLSKNSNKNSNDGARAKQP 238
              S+E       ++      V++A +++ G   +K  K  D+ +               
Sbjct: 163 PPLSEE-------RLRTTIQKVADADQQSAGSAQKKSKKKVDVFR--------------- 200

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
             K +      + P L E     TG   ++ L +V  LED +     + V +     Q +
Sbjct: 201 --KAISSGFPEFPPLLLEDAFAATGFDSSVTLKQV--LEDESTFQKAMNVLR---EAQKI 253

Query: 299 ISG--DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFET 353
           I+G  +   +GYI+ + +   +D     +     ++++F P    QF  +     +++++
Sbjct: 254 IAGLSEGEKKGYIVAKERAKKEDQQVDSTSKENLLFEDFHPFRPRQFEGKPGYHILEYDS 313

Query: 354 FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
           F+  +DE++S IESQ+ E +    E+ A  KL     D +NR   LKQ  +  ++ AE I
Sbjct: 314 FNKTVDEYFSSIESQKLESRLAEHEETAKRKLETARADHQNRAGALKQAQELHIRKAEAI 373

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL 472
           + N+  V  A  AV   +A  M W ++AR+++ E++  NPVA  I   L L  N ++LLL
Sbjct: 374 QANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQQRNNPVAQTIKLPLKLYENTITLLL 433

Query: 473 --------------------------SNNLDEMDDEEKTLPVE--KVEVDLALSAHANAR 504
                                     S N  E D+  K    E   +++DL+LS  +NA 
Sbjct: 434 SEENTEVEEEQEEFSESEPEVSEDSDSENEIEKDEGPKQKIAEPLAIDIDLSLSPWSNAT 493

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKF 560
           ++YE K+    K++KTI +  KA K+ EKK     +  + QEK V   S  RK  WFEK+
Sbjct: 494 QYYEQKRTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPS--RKPFWFEKY 551

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVPP 618
            +FISSE YLV+ GRD+ Q E++ +RY+ KGDV+VHADL GA+  ++KN    P+ P+PP
Sbjct: 552 LYFISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTPDAPIPP 611

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL QAG  +V  S+AW++K +  +WWV+ HQVS+T   GE L  G+FM++G+KN+L P 
Sbjct: 612 GTLTQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLANGAFMVKGEKNYLAPG 671

Query: 679 PLIMGFGLLFRLDESSLGSHLNER 702
             I+GF +LF++ + S+ +H   R
Sbjct: 672 QPILGFAVLFQISKESVQNHRKHR 695


>gi|119480773|ref|XP_001260415.1| hypothetical protein NFIA_084700 [Neosartorya fischeri NRRL 181]
 gi|119408569|gb|EAW18518.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 1116

 Score =  363 bits (931), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 246/736 (33%), Positives = 381/736 (51%), Gaps = 88/736 (11%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVNLRVSNIYDLSSRIFLFKLAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RRL  + Q+G DR+I F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRLTSIEQIGTDRVIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRV-FERTTASK--L 177
            LE +A GNI++TD ++ +LTL R       GV          E  RV F+ T  +K   
Sbjct: 111 FLEFFAGGNIIITDRDYNILTLFRQV---PAGVG--------EEEMRVGFKYTVTNKQNY 159

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
           H        P+    D++ E       AS      Q+G       K S K + D      
Sbjct: 160 HGV------PEITL-DRIKETLEKAKEAS-----AQEG----TAPKKSKKKNVD------ 197

Query: 238 PTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
             L+  L +    Y P L +H      + P   L +V  L D+A+   V  V K    + 
Sbjct: 198 -VLRKALSQGFPEYPPLLLDHAFAVKEVDPATPLEKV--LGDDALMEQVNGVLKEAQSVT 254

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSRE---FV 349
             +S      GYI+ +           ++G  +Q    +Y++F P    QF  +     +
Sbjct: 255 IKLSAKEDHPGYIIAKEDKRPTAESTADTGDPSQKAGLLYEDFHPFRPRQFEGKPEVTIL 314

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F TF+A +DE++S +E+Q+ E +   +E+AA  KL+ +  + E R+  LK+  +  V+ 
Sbjct: 315 EFSTFNATVDEYFSSLETQKLESRLTEREEAAKRKLDAVRQEHEKRLGALKEAQEIHVRK 374

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A  IE N+  V   + AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N +
Sbjct: 375 AAAIEDNVYRVQEVMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLPLKLYENTI 434

Query: 469 SLLLSNNLDEMDDEE--------------------KTLPVEKVEVDLALSAHANARRWYE 508
           +L+L    +E D  +                    K   +  +++DL LS  ANA ++YE
Sbjct: 435 TLVLGEASEEQDAADDLFSDESEEESESEEQEAARKAPEMLTIDIDLGLSPWANATQYYE 494

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 495 QKKMAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQV--LRPARKPFWFEKFLFFI 552

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLN 622
           SSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P+ P+PP TL+
Sbjct: 553 SSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLS 612

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG   V  S AW+SK V +AWWV  +QV+KT  TG  L  G F ++G+KNFL P  L++
Sbjct: 613 QAGNLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEVKGEKNFLAPSQLVL 671

Query: 683 GFGLLFRLDESSLGSH 698
           GF ++F++ + SL +H
Sbjct: 672 GFAVMFQISKESLKNH 687


>gi|338717943|ref|XP_001496390.3| PREDICTED: nuclear export mediator factor NEMF [Equus caballus]
          Length = 827

 Score =  363 bits (931), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 195/441 (44%), Positives = 270/441 (61%), Gaps = 61/441 (13%)

Query: 307 GYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAALDEFYS 363
           GYI+ + +      P  E    TQ    Y+EF P L +Q     +++FE+FD A+DEFYS
Sbjct: 17  GYIIQKREM----KPSLEVDKPTQDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDEFYS 72

Query: 364 KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAA 423
           KIE Q+ + +   +E  A  KL+ +  D E+R+  L+Q  +      ELIE NL+ VD A
Sbjct: 73  KIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELIEMNLQIVDRA 132

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----NLDEM 479
           I  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N    + +E 
Sbjct: 133 IQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRNPYLLSEEED 192

Query: 480 DDEEKTLPVEK----------------------------VEVDLALSAHANARRWYELKK 511
           DD +  + VEK                            V+VDL+LSA+ANA+++Y+ K+
Sbjct: 193 DDVDGDISVEKNETEPPKGKKKKQKNKQLQKPQKNRPLLVDVDLSLSAYANAKKYYDHKR 252

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
               K +KT+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENYL+
Sbjct: 253 YAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLI 312

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           I GRD QQNE+IVKRY++ G                      +P+PP TL +AG   +C+
Sbjct: 313 IGGRDQQQNEIIVKRYLTPG----------------------EPIPPRTLTEAGTMALCY 350

Query: 632 SQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++D
Sbjct: 351 SAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVD 410

Query: 692 ESSLGSHLNERRVRGEEEGMD 712
           ES +  H  ER+VR ++E M+
Sbjct: 411 ESCVWRHRGERKVRVQDEDME 431


>gi|351702906|gb|EHB05825.1| Serologically defined colon cancer antigen 1, partial
           [Heterocephalus glaber]
          Length = 762

 Score =  362 bits (928), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 186/371 (50%), Positives = 247/371 (66%), Gaps = 33/371 (8%)

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ 
Sbjct: 1   KEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQVVDRAIQVVRSALANQID 60

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----NLDEMDDEEKTLPVEK-- 490
           W ++  +VKE +  G+PVA  I +L L+ N +++LL N    + +E DD +  + VEK  
Sbjct: 61  WTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLSEEEDDDADGDVSVEKNE 120

Query: 491 --------------------------VEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                                     V+VDL+LSA+ANA+++Y+ K+    K +KT+ A 
Sbjct: 121 TEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAA 180

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFISSENYL+I GRD QQNEMIV
Sbjct: 181 EKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIV 240

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           KRY++ GD+YVHADLHGA+S VIKN   E P+PP TL + G   +C+S AWD++++TSAW
Sbjct: 241 KRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEVGTMALCYSAAWDARVITSAW 299

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           WVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+V
Sbjct: 300 WVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKV 359

Query: 705 RGEEEGMDDFE 715
           R ++E ++  E
Sbjct: 360 RVQDEDVETLE 370


>gi|159129335|gb|EDP54449.1| DUF814 domain protein, putative [Aspergillus fumigatus A1163]
          Length = 1116

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 243/744 (32%), Positives = 374/744 (50%), Gaps = 104/744 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVNLRVSNIYDLSSRIFLFKLAKPDNRKQ--------LVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RRL  + Q+G DR+I F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRLTSIEQIGTDRVIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVA-----------IMSRHRYPTEICRVF 169
            LE +A GNI++TD E+ +LTL R       GV            + ++  Y       F
Sbjct: 111 FLEFFAGGNIIITDREYNILTLFRQV---PAGVGEEEMRVGLKYTVTNKQNYHGVPEITF 167

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           ER     +   L  +KE  A E       G     + K+N+                   
Sbjct: 168 ER-----IKETLEKAKEASAQE-------GTAPKKSKKKNVD------------------ 197

Query: 230 NDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
                     L+  L +    Y P L +H      + P   L   N L D+ +   V  V
Sbjct: 198 ---------VLRKALSQGFPEYPPLLLDHAFAVKEVDPATPLE--NVLGDDTLMEQVNGV 246

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFR 344
            K    +   +S      GYI+ +           ++G  ++     Y++F P    QF 
Sbjct: 247 LKEAQSVTIKLSAKEDHPGYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHPFRPRQFE 306

Query: 345 SREFVK---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
               VK   F TF+A +DE++S +E+Q+ E +   +E+AA  KL+ +  + E R+  LK+
Sbjct: 307 GNPEVKILEFSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKE 366

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
             +  V+ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   
Sbjct: 367 AQEIHVRKAAAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLP 426

Query: 461 LYLERNCMSLLLSNNLDEMDDEE--------------------KTLPVEKVEVDLALSAH 500
           L L  N ++L+L    +E D  +                    K   +  +++DL LS  
Sbjct: 427 LKLYENTITLVLGEASEEQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPW 486

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHW 556
           ANA ++YE KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  W
Sbjct: 487 ANATQYYEQKKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQV--LRPARKPFW 544

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
           FEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P+ 
Sbjct: 545 FEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDA 604

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG   V  S AW+SK V +AWWV  +QV+KT  TG  L  G F I+G+KNF
Sbjct: 605 PIPPSTLSQAGNLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNF 663

Query: 675 LPPHPLIMGFGLLFRLDESSLGSH 698
           L P  L++GF ++F++ ++SL +H
Sbjct: 664 LAPSQLVLGFAVMFQISKNSLKNH 687


>gi|238493615|ref|XP_002378044.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
 gi|220696538|gb|EED52880.1| DUF814 domain protein, putative [Aspergillus flavus NRRL3357]
          Length = 1105

 Score =  360 bits (923), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 241/746 (32%), Positives = 381/746 (51%), Gaps = 105/746 (14%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLL 58
           +K R ++ DV    + L   ++ +R SN+YDLS   + ++FKL             +  L
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSVCRIFLFKLAKPD--------HRKQL 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++SG R H T Y+R   + PS F  ++RK +R+RR+  V+Q+G DRII   F  GM  +
Sbjct: 53  IVDSGFRCHVTQYSRATASMPSPFVTRMRKFLRSRRITSVKQIGTDRIIDISFSDGM--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRS---HRDDDKGVAIM-------SRHRYPTEICRV 168
           ++ LE +A GNI++TD E  +L L R       ++  V I        + H  P EI   
Sbjct: 111 HMFLEFFAGGNIIITDREHNILALYRQVSVSEGEEARVGIQYTVTNKQNYHGIP-EITLD 169

Query: 169 FERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
             R T  K  A                 EDG                       K S K 
Sbjct: 170 RIRETLEKAKALF-------------AREDG---------------------APKKSKKK 195

Query: 229 SNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
           + D        L+  L +    Y P L +H  +   + P   L +V  L+D ++   V  
Sbjct: 196 NAD-------VLRKALSQGFPEYPPLLLDHAFVTKEVDPTTPLDKV--LQDESLLQEVNG 246

Query: 288 VAKFEDWLQDVISGDIVPEGYILMQN------KHLGKDHPPTESGSSTQIYDEFCPLLLN 341
           V +        +S      GYI+ ++      +   ++  P+E+G+   +Y++F P    
Sbjct: 247 VLQEAQNENTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNL--LYEDFHPFKPR 304

Query: 342 QFRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHT 398
           QF  +     ++F + +A +DE++S IE+Q+ E +   +E+AA  KL  +  + E ++  
Sbjct: 305 QFEGKPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEKKIGA 364

Query: 399 LKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI 458
           LK++ +  ++ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I
Sbjct: 365 LKEQQELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPVARII 424

Query: 459 D-KLYLERNCMSLLLSNNLDEMDD--------------------EEKTLP-VEKVEVDLA 496
              L L  N ++LLL    DE D+                    E +  P V  +++DL 
Sbjct: 425 KLPLKLHENTITLLLGEAGDEQDEGDELFSSDESEESEDEQDNGESQQPPSVLTIDIDLG 484

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISHMRKV 554
           +S  ANA+++YE KK+   K+++T  + +KA K+ EKK    L+   +K    +   R+ 
Sbjct: 485 ISPWANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQTRQP 544

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--P 612
            WFEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P
Sbjct: 545 FWFEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRSKDP 604

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
             P+PP TL+QAG   V  S AWDSK V SAWWV   Q++KTA  G  L +G F+++G+K
Sbjct: 605 TAPIPPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLVKGEK 664

Query: 673 NFLPPHPLIMGFGLLFRLDESSLGSH 698
           NFL P  L++GFG+ F++ + SL +H
Sbjct: 665 NFLAPSQLVLGFGVTFQISKDSLKNH 690


>gi|71001140|ref|XP_755251.1| DUF814 domain protein [Aspergillus fumigatus Af293]
 gi|66852889|gb|EAL93213.1| DUF814 domain protein, putative [Aspergillus fumigatus Af293]
          Length = 1116

 Score =  360 bits (923), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 245/744 (32%), Positives = 374/744 (50%), Gaps = 104/744 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVNLRVSNIYDLSSRIFLFKLAKPDNRKQ--------LVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RRL  + Q+G DR+I F F  GM  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRLTSIEQIGTDRVIDFSFSDGM--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVA-----------IMSRHRYPTEICRVF 169
            LE +A GNI++TD E+ +LTL R       GV            + ++  Y       F
Sbjct: 111 FLEFFAGGNIIITDREYNILTLFRQV---PAGVGEEEMRVGLKYTVTNKQNYHGVPEITF 167

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           ER     +   L  +KE  A E       G     + K+N+                   
Sbjct: 168 ER-----IKETLEKAKEASAQE-------GTAPKKSKKKNVD------------------ 197

Query: 230 NDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
                     L+  L +    Y P L +H      + P   L   N L D+ +   V  V
Sbjct: 198 ---------VLRKALSQGFPEYPPLLLDHAFAVKEVDPATPLE--NVLGDDTLMEQVNGV 246

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFR 344
            K    +   +S      GYI+ +           ++G  ++     Y++F P    QF 
Sbjct: 247 LKEAQSVTIKLSAKEDHPGYIVAKEDKRPSAESTADAGDPSEKAGLFYEDFHPFRPRQFE 306

Query: 345 SREFVK---FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
               VK   F TF+A +DE++S +E+Q+ E +   +E+AA  KL+ +  + E R+  LK+
Sbjct: 307 GNPEVKILEFSTFNATVDEYFSSLETQKLEARLTEREEAAKRKLDAVRQEHEKRLGALKE 366

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
             +  V+ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   
Sbjct: 367 AQEIHVRKAAAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQGRGNPVARIIKLP 426

Query: 461 LYLERNCMSLLL---SNNLDEMDD-----------------EEKTLPVEKVEVDLALSAH 500
           L L  N ++L+L   S   D  DD                   K   +  +++DL LS  
Sbjct: 427 LKLYENTITLVLGEASREQDAADDLFWDESEEESESEEQEAARKASEMLTIDIDLGLSPW 486

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHW 556
           ANA ++YE KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  W
Sbjct: 487 ANATQYYEQKKIAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQV--LRPARKPFW 544

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
           FEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P+ 
Sbjct: 545 FEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDA 604

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG   V  S AW+SK V +AWWV  +QV+KT  TG  L  G F I+G+KNF
Sbjct: 605 PIPPSTLSQAGNLCVATSSAWESKAVMAAWWVNANQVTKTT-TGGLLPTGEFEIKGEKNF 663

Query: 675 LPPHPLIMGFGLLFRLDESSLGSH 698
           L P  L++GF ++F++ ++SL +H
Sbjct: 664 LAPSQLVLGFAVMFQISKNSLKNH 687


>gi|340059520|emb|CCC53907.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 1048

 Score =  357 bits (915), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 238/754 (31%), Positives = 379/754 (50%), Gaps = 125/754 (16%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  N+YD+ PK ++FK  +       GE +K LLL
Sbjct: 1   MVKQRMTALDVRATVEEMRTELLGLRLMNIYDIPPKIFLFKFGH-------GEKKKTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            E+G+RLH T + R+K   P+ FTL+LRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ENGLRLHLTQFVREKPKVPTQFTLRLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L  LRSHRD+  GV I  R  YP              + 
Sbjct: 113 HIIVELFSKGNVILTDHEYRILLPLRSHRDE--GVNIFVRELYP--------------VT 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            +   ++  D  E + + E                       L +  +   + GA  +  
Sbjct: 157 PSFDQNRLRDMQESECIEE-----------------------LRREWSVVFSRGADYE-- 191

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T K++L     +GP+L++H+++ TG V N+K S +    D   + L+  +   E W    
Sbjct: 192 TTKSMLSGTHHFGPSLADHVLVVTG-VKNVKKSSMTCSGDELFEALLPGL--LEAWR--- 245

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI----------------------YDEFC 336
           I+   +  G  L++N   GK    ++SG++ +                       YD+F 
Sbjct: 246 IAISPLSSGGFLIKNCKSGKPRCDSQSGTAGEQENSAVDTVSASGPGKRNLQGEGYDDFT 305

Query: 337 PLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
           P+LL Q+      K    +F +  D F+   E  + EQQ + K  A   K  +   D + 
Sbjct: 306 PVLLAQYDGENVTKSYLPSFGSVCDTFFLHTEEGKIEQQKEKKTVAVMSKKERCERDHQR 365

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           R+  L++    + +  EL+  N E +DAAI  +  ALA+ + W+ L R++K+    G+PV
Sbjct: 366 RIEALERMELENARKGELLIQNAEKIDAAIGLINGALASGIQWDALRRLLKQRHAEGHPV 425

Query: 455 AGLIDKLYLERNCMSLLL-SNNLDEMDDEEKTLPVEK-----------VEVDLALSAHAN 502
           A ++ +L+L+RN MS+L+ +N+ D+  DE  ++  E            +EVDL+ +AHAN
Sbjct: 426 AYMVHELFLDRNNMSVLVETNDDDDCIDEGGSVSYESKVDDCNKPPWVIEVDLSKTAHAN 485

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNW 562
           A  ++  KK   +K ++T+ A ++A + AEKK      + +TV +I+  R   W+EKFNW
Sbjct: 486 AAAYFSQKKANRAKLDRTVAATAQAMRGAEKKGERMAARHQTVKDIATERHRCWWEKFNW 545

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ-------- 614
           F +S   LV+ G D Q  E++V+R M  GD++VH D+ GA   ++++ R           
Sbjct: 546 FRTSCGDLVLLGHDVQSTELLVRRVMCLGDLFVHCDVDGALPCILRSGRSVWCAAASGSQ 605

Query: 615 ------------------PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
                              V   +L +A  + V  S AW+ K    AWWVY  Q+     
Sbjct: 606 CVDNWMEKNIGSTRSDMLAVHVTSLREAAAWCVSRSSAWEGKFNVGAWWVYASQIIGGTA 665

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           TG YL        G+K+ + P PL +G GLLFR+
Sbjct: 666 TGCYL------FSGEKHHVLPQPLALGCGLLFRV 693


>gi|242764776|ref|XP_002340841.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
 gi|218724037|gb|EED23454.1| DUF814 domain protein, putative [Talaromyces stipitatus ATCC 10500]
          Length = 1111

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 255/797 (31%), Positives = 402/797 (50%), Gaps = 108/797 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   +IG+R SN+YDLS + ++FKL       +        L++
Sbjct: 1   MKQRFSSIDVKIICQELNTSIIGLRVSNIYDLSSRIFLFKLAKPDYRKQ--------LII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   NTPSGF  +LRK ++TRR+  V+QLG DRII      G+   ++
Sbjct: 53  DSGFRCHLTEYSRTTANTPSGFVSRLRKCLKTRRVTAVKQLGTDRIIDIVISDGL--FHI 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GNI+LTD+E  +L L R+     +   +     Y  E  + +           
Sbjct: 111 YLEFFAGGNIILTDAENKILALFRTVAAAGEQDEVKIGLTYAVEKAQYY----------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                              N +   S+E L      K+ D  ++   N+    + K    
Sbjct: 160 -------------------NGIPPVSEERLRATI-QKAIDAEQSPGGNAQRKPKKKVDVF 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK-FEDWLQDV 298
           +  +      + P L E     TG   ++ L EV  LED +I    +AV +  E  +  +
Sbjct: 200 RRAVSSGFPEFPPLLLEDAFAATGFDSSITLKEV--LEDESIFQKAMAVLREAEKIVAGL 257

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSR---EFVKF 351
             G+   +GYI+ + +   KD    +S  S      ++++F P    QF  +     +++
Sbjct: 258 SEGET--KGYIVAKER-AKKDTDFDQSNDSASKENLLFEDFHPFRPRQFEGKPGYHILEY 314

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           + F+  +DE++S IESQ+ E +    E+ A  KL     D  +R   LKQ  +  ++ AE
Sbjct: 315 DNFNKTVDEYFSSIESQKLESRLAEHEETAKRKLEAARADHLDRAGALKQAQELHIRKAE 374

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            I+ N+  V  A  AV   +A  M W ++AR+++ E++  NPVA  I   L L  N ++L
Sbjct: 375 AIQANIYRVQEATDAVNGLIAQGMDWVEIARLIEMEQERNNPVAKTIKLPLKLFENTITL 434

Query: 471 LL---------------------SNNLDEMDDEEKTLPVEK------VEVDLALSAHANA 503
           LL                     S++  E + E+   P  K      +++DL+LS  +NA
Sbjct: 435 LLSEESAKGEGDKEEFSESEPEGSDSNSESEFEKDGGPKRKNAEPLAIDIDLSLSPWSNA 494

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEK 559
            ++YE KK    K++KTI +  KA K+ EKK     +  + QEK V   S  RK  WFEK
Sbjct: 495 TQYYEQKKTAAVKEQKTIQSSEKALKSQEKKVTEDLKKHLKQEKQVLRPS--RKPFWFEK 552

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVP 617
           + +FISSE YLV+ GRD+ Q E++ +RY+ KGDV+VHADL GA+  ++KN       P+P
Sbjct: 553 YLYFISSEGYLVLGGRDSHQVEILYQRYLKKGDVFVHADLEGATPMIVKNKEGTSNAPIP 612

Query: 618 PLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           P TL QAG  +V  S+AW++K +  +WWV+ HQVS+T   GE L  G FM++G+KN+L P
Sbjct: 613 PGTLTQAGSISVATSKAWETKALMPSWWVHAHQVSRTNERGELLASGGFMVKGEKNYLAP 672

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFE-----DSGHHKENSDIESE--- 729
              ++GF +LF++ + S+ +H   R+ R EE    D +     ++   + +SD++S    
Sbjct: 673 GQPVLGFAVLFQISKESVHNH---RKHRIEEYSELDTKETVSAETSAQEASSDVKSTVKE 729

Query: 730 -----KDDTDEKPVAES 741
                 DDT E+P  E+
Sbjct: 730 DVLAVADDTVEQPETET 746


>gi|255941192|ref|XP_002561365.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211585988|emb|CAP93725.1| Pc16g10550 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1160

 Score =  356 bits (913), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 240/734 (32%), Positives = 380/734 (51%), Gaps = 94/734 (12%)

Query: 4   VRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG 63
           V++ T ++A+E  C    + +R SN+YDLS + ++FKL             +  L+++SG
Sbjct: 63  VKVITQELASE--C----VNLRVSNIYDLSSRIFLFKLAKPD--------HRRQLIIDSG 108

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R H T Y+R    TPS F  +LRK++++RR+  + Q+G DRII F F  G  A+++ LE
Sbjct: 109 FRTHVTQYSRTAATTPSPFVTRLRKYLKSRRITGISQIGTDRIIDFSFSDG--AYHIFLE 166

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTS 183
            +A GNI+LTD E+ +L + R          I    +Y   +C                 
Sbjct: 167 FFAGGNIILTDREYNILAVFRQVAAGVGQEEIKVGLKY--TVC----------------- 207

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
                    +K N DG     A +     +K    F    N+ K S    +     L+  
Sbjct: 208 ---------NKQNYDGVPDITADRVLQTLEKAQALFAQEGNAPKKSK---KKGTDVLRKA 255

Query: 244 LGEALG-YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           L +    Y P L +H+      DT    +  L   +KL+  A++ ++    +  +     
Sbjct: 256 LSQGFPEYPPLLLDHVFAIKEFDTTTPLDQVLGSQDKLQ--AVKEVLEESRRISNTFD-- 311

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFD 355
            SGD  P GYI+ +          T S +   +Y++F P    QF ++   + ++FE F+
Sbjct: 312 -SGDSHP-GYIVAKEDTRPVPEGETASKAPALLYEDFHPFKPRQFENKPGTKILEFERFN 369

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           A +DE++S +ESQR E +   +E+AA  KL  +  + + R+  LK   +  ++ A+ I+ 
Sbjct: 370 ATVDEYFSSLESQRLESRLTEREEAAKKKLESVRSEHKKRIDELKNVQEIHIRKADAIQD 429

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN 474
           N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N ++L+L  
Sbjct: 430 NVYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQGRGNPVAQTIKLPLKLYENTVTLVLGE 489

Query: 475 -----------------NLDEMDDEEKTLPVEK------VEVDLALSAHANARRWYELKK 511
                            +  E + E++T   E+      +++DL LS  ANA ++Y+ KK
Sbjct: 490 AGDDEDEDEEFSSSDEESDSENEAEQETARAERESKLLTIDIDLGLSPWANASQYYDQKK 549

Query: 512 KQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           +   K+++T  + +KA K+ EKK     +  + +EK V  +   R   WFEKF +FISSE
Sbjct: 550 QASEKEQRTTQSSAKALKSHEKKVTTDLKRGLKKEKQV--LRQARTPFWFEKFIFFISSE 607

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAG 625
            YLVI  RDA Q+E++ +RY+SKGD++VHADL GA+  V+KN     + P+ P TL+QAG
Sbjct: 608 GYLVIGARDAMQSELLYRRYLSKGDIFVHADLEGATPIVVKNRAGSADAPISPSTLSQAG 667

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFMIRGKKNFLPPHPLIMGF 684
              V  S AWDSK V SAWW + HQVSK A  G   +  G F I+G+KNFL P  L++GF
Sbjct: 668 NLCVATSSAWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKGEKNFLAPSQLVLGF 727

Query: 685 GLLFRLDESSLGSH 698
           G++F++ + S+ +H
Sbjct: 728 GIMFQISQESVRNH 741


>gi|167395586|ref|XP_001741648.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165893772|gb|EDR21907.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 960

 Score =  355 bits (910), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 234/692 (33%), Positives = 355/692 (51%), Gaps = 100/692 (14%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L+    + VYD++ + Y+ KL  +          K  +++ESGVR+H T Y R+K + P
Sbjct: 27  KLLNFNINTVYDINRRLYVIKLSKTDC--------KEFIVIESGVRVHLTEYNREKSDFP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT KLRK++  ++L  + Q+G DR+I   FG     + ++++LY+ GNI L D E+ +
Sbjct: 79  NNFTSKLRKYLNKKKLIKINQIGNDRVIELVFGNVTERYSLVVDLYSNGNICLCDQEYKI 138

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPD---KVN 196
           L  LRS   D  G  +    +YP              LH         DAN  D   K+ 
Sbjct: 139 LLTLRSFTFDKTGDKVAVGEKYP--------------LHLL------SDANGIDELKKII 178

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
           ++ N +  +  E++ G                          TLK ++     +G  LS+
Sbjct: 179 KEYNTIFTS--ESMKGW-------------------------TLKQLINYTSDFGQQLSD 211

Query: 257 HIILDTG------LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYIL 310
           H     G              E   L  N   +L  A+ ++E     + SG+   +GYI 
Sbjct: 212 HCCSQFGKESSKTKKFEEFNEEEKSLMKN---ILEEAITRYEK----IDSGNC--KGYIF 262

Query: 311 MQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRA 370
               H  K             Y+E    + NQ   R++++FE+F+ A+DEF+S IE Q  
Sbjct: 263 YHETHQKK------------YYEEVSCDIFNQDSKRKYIEFESFEKAMDEFHSHIEKQEY 310

Query: 371 EQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVA 430
           E + + KE     K+  +    + R   L  + +     AE +E N++ VD  I  + V 
Sbjct: 311 EAEVEKKEMIMKKKVQAVIDGHQKRYQGLLDKAETLKNEAEAVEENIQVVDQLIQEINVF 370

Query: 431 LANRMSWEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSL-LLSNNLDEMDDEEKTLP 487
           L  +M WE +  ++ EE K  +P  +A  I +   +   + L L   N D++ D      
Sbjct: 371 LKEKMKWEQIEGII-EELKENDPTSIAKYIKRFDFKNEVVVLELRHTNEDKIID------ 423

Query: 488 VEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE-KKTRLQILQEKTVA 546
              VE+ L  +   N R +YE++K   +K EKTI +   A K AE K+ R+   ++ T+ 
Sbjct: 424 ---VEIALNKNGFENVRNFYEMRKNILAKAEKTIESKDLAIKQAENKQERVAKEKKITLV 480

Query: 547 NISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           ++  MRK  WFEKF+WF+SSEN+++ISG+DA QN++I +RYM   D+YVHAD+HGA+S +
Sbjct: 481 DVKKMRKRFWFEKFHWFLSSENFIIISGKDALQNDIIYRRYMKNTDIYVHADIHGAASCI 540

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSF 666
           IK   P + +   TL QAG   VC S AW SK+VTSAWWVY  QVSKTAP+GEYLT GSF
Sbjct: 541 IKG-IPGKTIGAPTLEQAGKIAVCRSSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSF 599

Query: 667 MIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           MIRGKKN+LPP PL+ G G++F +++    +H
Sbjct: 600 MIRGKKNYLPPVPLVFGIGIMFVVEKEDKENH 631


>gi|407406699|gb|EKF30889.1| hypothetical protein MOQ_005283 [Trypanosoma cruzi marinkellei]
          Length = 1098

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 250/778 (32%), Positives = 393/778 (50%), Gaps = 104/778 (13%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESGVR+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G +A Y
Sbjct: 54  -ESGVRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEDASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S    +  E ++  +         + N   Q+    F               A   
Sbjct: 169 TH--SEGGKEEEEKEQEEQQQQQQRQVRRTNALRQEWHTVF------------ARHADYE 214

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +    +L+  +   + W    
Sbjct: 215 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGELTSDAETLFTLLLPGM--LQAW---E 268

Query: 299 ISGDIVPEGYILMQNKH---------------LGKDHPPTESGSSTQI------------ 331
           I+   +P G  L+ N                 +G+D P TE   S  +            
Sbjct: 269 IAFSPLPGGGYLISNHRQRKEFRKGGKDVSSKIGEDKPQTEEEKSVNVNVADRSQQQMQT 328

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 329 VQYDDFSPVLLAQYSSEGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 388

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D + R++TL+ E   + +  E I  +   +D AI  +  ALA  + W+ L  ++K  
Sbjct: 389 FERDHQRRLNTLEMEEQENQRKGECIIQHAVKIDEAIGLINGALAAGIQWDALRSLLKRR 448

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRW 506
              G+PVA ++ +L+LERN +S+L+ +N  E + EE   +    +EV+L+ +A+ANA  +
Sbjct: 449 HAEGHPVAYMVHELFLERNSISVLVESNEQEDEGEEDCDVTPMVIEVELSKTAYANATTY 508

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           +   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +S
Sbjct: 509 FSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRTS 568

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV---------------IKNHR 611
               V+ G+D Q  E++V+R M  GDV++H D+ GA   V               +K HR
Sbjct: 569 CGDFVLQGKDLQTTEILVRRVMQLGDVFLHCDVDGALPCVLRPIGSAWTTAFVEDVKGHR 628

Query: 612 PEQP------VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
            E        +   +L++AG + V  S AW+ K   +AWWV+  Q++    +G YL    
Sbjct: 629 QEGSQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFTVAAWWVHASQITGGTASGCYL---- 684

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRL--------DESSLGSHLNE---RRVRGEEEGMD 712
               G+K++L P P+    GLLFR+        D   L + ++E   R    EEEG D
Sbjct: 685 --FDGEKHYLRPQPITFACGLLFRVPTRRIDPNDRDELPNFISEGERRPQHAEEEGED 740


>gi|121698891|ref|XP_001267840.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119395982|gb|EAW06414.1| DUF814 domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 1111

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 267/840 (31%), Positives = 414/840 (49%), Gaps = 114/840 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   L+ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVICQELASELVSLRVSNIYDLSSRIFLFKLAKPD--------HRKQLVV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R     PS F  ++RK +++RR+  V Q+G DR+I F F  G+  +++
Sbjct: 53  DSGFRCHVTQYSRATATAPSPFVTRMRKFLKSRRVTSVEQIGTDRVIDFSFSDGL--YHM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRV---FERTTASKL 177
            LE +A GNI++TD E+    +L   R    G           E  RV   +  T     
Sbjct: 111 FLEFFAGGNIIITDREYN---ILALFRQVPAGAG--------EEEMRVGLKYTVTNKQNY 159

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
           H        P+    D++ E       ++      Q+G       K S K + D      
Sbjct: 160 HGV------PEITL-DRIKETLEKARESA-----AQEGAAP----KKSKKKNVD------ 197

Query: 238 PTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
             L+  L +    Y P L +H        P M L  V  L D+ +   V  V K    + 
Sbjct: 198 -VLRKALSQGFPEYPPLLLDHAFAVEEFDPAMPLETV--LGDDTLLEKVEGVLKEAQDVS 254

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTES---GSSTQIYDEFCPLLLNQFRSREFVK--- 350
             +S      GYI+ +        P  E+     S  +Y +F P    QF  +  +K   
Sbjct: 255 RRLSTKENHPGYIVAKEDTRPSAGPTAENQEGKKSGLLYQDFHPFKPRQFEGKPEIKILE 314

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F +F+A +DE++S +E+Q+ E +   +E+AA  KL+ +  + E R+  LKQ  +  ++ A
Sbjct: 315 FGSFNATVDEYFSSLETQKLESRLTEREEAAKRKLDAVRQEHEKRLGALKQAQELHIRKA 374

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N ++
Sbjct: 375 GAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQDRGNPVARIIKLPLKLYENTIT 434

Query: 470 LLLSNNLDEMDDEE---------------------KTLPVEKVEVDLALSAHANARRWYE 508
           L+L    +E D+ +                     K   +  +++DL LS  ANA ++YE
Sbjct: 435 LVLGEASEEQDEADELFSDDESEEDSESEEQEAIKKAPEMLTIDIDLGLSPWANATQYYE 494

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            KK    K++KT  + +KA K+ EKK     +  + QEK V  +   RK  WFEKF +FI
Sbjct: 495 QKKMAAVKEQKTAQSSTKALKSHEKKVTEDLKRSLKQEKQV--LRPARKPFWFEKFIFFI 552

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLN 622
           SSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN    P+ P+PP TL+
Sbjct: 553 SSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNRPGTPDAPIPPSTLS 612

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           QAG  +V  S AWDSK V +AWWV  +QV+KT  TG  L  G F  +G+K+FL P  L++
Sbjct: 613 QAGNLSVATSSAWDSKAVMAAWWVNANQVTKTT-TGGLLPTGEFETKGEKSFLAPSQLVL 671

Query: 683 GFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESL 742
           GF ++F++ + SL +H            +  FED+   +   D+  +KD+T  K     L
Sbjct: 672 GFAVMFQISKESLKNH-----------KIQLFEDT--PRPEPDV--QKDETTGKDGTSEL 716

Query: 743 SVPNSAHPA-PSHTNASNVDSHEFPAEDKTISNGIDSKIFDIA-------RNVAAPVTPQ 794
            V     PA P+ T A++ +     AED++  +  D +  D A       R V+ PV  Q
Sbjct: 717 PVTQDHGPAEPAETAATDTNDR---AEDQSSGSDDDEENPDSAPQRNPLQRGVSEPVPAQ 773


>gi|67468480|ref|XP_650274.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56466879|gb|EAL44894.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449704977|gb|EMD45123.1| zinc knuckle domain containing protein [Entamoeba histolytica KU27]
          Length = 959

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 231/685 (33%), Positives = 358/685 (52%), Gaps = 86/685 (12%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L     + VYD++ + Y+ KL  +          K  +++ESGVR+H T Y R+K + P
Sbjct: 27  KLQNFNINTVYDVNRRLYVIKLSKTDC--------KEFIVIESGVRVHLTEYNREKSDFP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT +LRK++  ++L  + Q+G DR+I   FG     + +I++LY+ GNI L D E+ +
Sbjct: 79  NNFTSRLRKYLNKKKLIKINQIGNDRVIELVFGNATERYSLIVDLYSNGNICLCDQEYKI 138

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L +LRS   D  G  +    +YP              LH         DAN  D++    
Sbjct: 139 LLILRSFTFDKTGDKVAVGEKYP--------------LHLL------SDANGIDEL---- 174

Query: 200 NNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII 259
                        +K  K +D    S          K  TLK ++     +G  LS+H  
Sbjct: 175 -------------KKIIKEYDTIFTSE-------SMKGWTLKQLINYTSDFGQQLSDHCC 214

Query: 260 LDTGL--VPNMKLSEVNKLEDNAIQ-VLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
              G       +L E N+ E + ++ +L  A+ ++E     + SG    +GYI       
Sbjct: 215 SQFGKESSKTKRLEEFNEEEKSLMKKILEEAITRYEK----IDSGKC--KGYIFYH---- 264

Query: 317 GKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
                     +  + Y+E    +  Q   R++++FE+F+ A+DEF+S IE Q  E + + 
Sbjct: 265 --------ETNKKKYYEEVSCDIFYQDSKRKYIEFESFEKAMDEFHSHIEKQEYEAEVEK 316

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE     K+  +    + R   L  + +     AE +E N++ VD  I  + V L  +M 
Sbjct: 317 KEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNEAEAVEENIQVVDQLIQEINVFLKEKMK 376

Query: 437 WEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVD 494
           WE +  ++ E  K  +P  +A  I +   +   + L L +      +E+K +   +VE+ 
Sbjct: 377 WEQIEGII-ESLKENDPTSIAKYIKRFDFKNEVVVLELKHT-----NEDKII---EVEIA 427

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE-KKTRLQILQEKTVANISHMRK 553
           L  +   N R +YE++K   +K EKT+ +   A K AE K+ R+   ++ T+ ++  MRK
Sbjct: 428 LNKNGFENIRNFYEMRKNILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRK 487

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPE 613
             WFEKF+WF+SSEN+++ISG+DA QN++I +RYM   DVYVHAD+HGA+S +IK   P 
Sbjct: 488 RFWFEKFHWFLSSENFIIISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKGI-PG 546

Query: 614 QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
           + +   TL QAG   VC S AW SK+VTSAWWVY  QVSKTAP+GEYLT GSFMIRGKKN
Sbjct: 547 KTIGAPTLEQAGKIAVCRSSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKN 606

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSH 698
           +LPP PL+ G G++F +++    +H
Sbjct: 607 YLPPVPLVFGIGIMFAVEKEDKENH 631


>gi|296813237|ref|XP_002846956.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
           113480]
 gi|238842212|gb|EEQ31874.1| serologically defined colon cancer antigen 1 [Arthroderma otae CBS
           113480]
          Length = 1103

 Score =  353 bits (905), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 240/753 (31%), Positives = 377/753 (50%), Gaps = 140/753 (18%)

Query: 14  EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VK + R     ++G+R +N+YD+SP+T++FKL        +    K  L++ +G   H 
Sbjct: 9   DVKVISRELSANILGLRIANIYDISPRTFLFKL--------ALPDIKKQLIINAGFHCHL 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T  +R   + PS F  +LRK ++TRR+  VRQ+G DRI+ F+   G+   Y  LE +A G
Sbjct: 61  TESSRTTADAPSHFVSRLRKLLKTRRITGVRQIGTDRILEFEISDGLFRLY--LEFFAAG 118

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
           N++LTD+++              G+  + RH  P                          
Sbjct: 119 NLILTDAKY--------------GIVALLRHVAP-------------------------- 138

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN-----SNDGARA--KQPTLK 241
                     G++V           K G S+ L    N N     + D  +A  ++ T  
Sbjct: 139 ----------GSDVEEV--------KVGMSYKLESKMNYNGIPPLTIDRLKATLEKDTGS 180

Query: 242 TVLGEALGYG-----PALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
            VL  +L +G     P L +H     G   + KL     L DN +   ++ V +  D + 
Sbjct: 181 KVLKRSLYFGFPEYPPTLLDHAFHIIGF--DSKLQPAQILTDNNLIHGLMGVLQEADRVN 238

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI-YDEFCPLLLNQFR---SREFVKFE 352
           + +S D    GYIL +N   G       + S+  I + +F P   +Q +   +   ++F+
Sbjct: 239 NALSSDRQTPGYILAKNIVPGTADGAEGTQSAPTIEFRDFHPFEPSQSKDLPNTTMLRFD 298

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           TF++A+D+++S IE+++ E +   +EDAA  KL     D E RV+ LK++ +  V+ A  
Sbjct: 299 TFNSAVDKYFSSIEARKLESRLTEREDAARKKLEATKRDHEKRVNALKEKQEFHVRKAHA 358

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLL 471
           IE NL  V+ AI AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++L
Sbjct: 359 IEANLPQVEDAINAVNGLVAQGMDWVEIARLIEMEQAKGNPVALCIKLPLKLYENTITIL 418

Query: 472 LSNN-----------------------------LDEMDDEEKTLPVEK-----------V 491
           L+                                +    ++ T   +K           +
Sbjct: 419 LTEETAETEDEDEESDESEGDDEDEDNDYGDDEYERPKHKKMTAKTQKEKKERKDNRLSI 478

Query: 492 EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVAN 547
           ++DL +S  ANAR++Y+ KK    K+EKT+ A +KA K+ EKK +    L + QEK V  
Sbjct: 479 DIDLGISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTEKKVKADLKLALKQEKPV-- 536

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           +   R   WFEKF +FISS+ YLVI GRD QQ+E++ +RY+ KGD+YVH DL G    ++
Sbjct: 537 LRRARNPAWFEKFFFFISSDGYLVIGGRDQQQDEILFQRYLKKGDIYVHTDLEGGVPLIV 596

Query: 608 KNHR--PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           KN    P+ P+PP T++QA  ++V  S+AWD+K     WWV+  QVSK   TG+ L  G 
Sbjct: 597 KNKPEFPDDPIPPNTISQASAYSVASSKAWDTKAAMGGWWVHASQVSKVTSTGDILKAGH 656

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           FMI+G+KN LPP  +++GF +LF+L   S+ +H
Sbjct: 657 FMIKGEKNHLPPGQIVLGFAVLFQLSPQSVQNH 689



 Score = 40.4 bits (93), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 27/41 (65%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAVSTLTCTIGVT 934
           RG++GK KK+  KY DQDEE+R + + LL  +  + T+  T
Sbjct: 845 RGKRGKAKKLATKYKDQDEEDRKLALRLLGSAPGSTTVNKT 885


>gi|154281559|ref|XP_001541592.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150411771|gb|EDN07159.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 1177

 Score =  352 bits (902), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 255/752 (33%), Positives = 390/752 (51%), Gaps = 106/752 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R ++ DV A       L+G+R SN+YDLS + ++FKL             +  L+++
Sbjct: 16  MKQRFSSLDVKA-------LVGLRISNIYDLSSRIFLFKLAKPD--------TRRQLIVD 60

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRI+  +   G N H V+
Sbjct: 61  AGFRCHLTEYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIVDIELSDG-NFH-VL 118

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LE YA GNI+LTD ++ +L L   HR   +G           E  RV        L   L
Sbjct: 119 LEFYAAGNIILTDKDYKILAL---HRIVPEGSD--------QEEVRV-------GLQYVL 160

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP-TL 240
           T+ +  +   P  +    + +  A       +  GK            N  A+ KQ   L
Sbjct: 161 TNKQNYNGVPPLSIERLRDALKKAKGVTGPAEAAGK------------NKRAKKKQAEAL 208

Query: 241 KTVLGEALG---YGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQ 296
           +  +  +LG   Y P L EH    TG   ++K  ++  LED  + + L++A+   E+   
Sbjct: 209 RRAV--SLGFPEYPPLLLEHAFHITGFDTSLKPEQL--LEDPKLAEKLMVALVVAENVNS 264

Query: 297 DVISGDIVPEGYILMQNK-HLGKD---HPPTESGSSTQIYDEFCPLLLNQFRSR---EFV 349
            + + +  P GYI+ + +   G+D        S SS   Y +F P    QF S      +
Sbjct: 265 SLSTAEETP-GYIVSKTEGKAGEDASVDSTVPSKSSNVAYIDFHPFEPKQFESEPGTSIL 323

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +F+TF+ A+DE++S  ESQ+ E +   +E+ A  KL     DQ+ RV  LK+  +  ++ 
Sbjct: 324 RFDTFNKAVDEYFSSAESQKLESRLTEREEIAKRKLEAAQKDQDKRVGVLKEAQELHIRK 383

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L L  N +
Sbjct: 384 AQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQGRQNPVANVIKLPLKLYENAV 443

Query: 469 SLLL---SNNLDEMD----------------------------DEEKTLPVEKVEVDLAL 497
           +LLL   + N + MD                             ++   P+  +++DL +
Sbjct: 444 TLLLGEPTENEEPMDESEDEAEVEEEEEQESSEDEDSGKKPGVSKKPRQPLLSIDIDLGI 503

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRK 553
           S  ANAR++YE KK    K++KT+ +  +A K+ +KK     +  + QEK V  +   R 
Sbjct: 504 SPWANARQYYEQKKVAAVKEKKTLNSTKEAIKSTKKKVAADLKQALKQEKPV--LRPTRT 561

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP- 612
             WFEKF +F+SS+ YLV+ GRD QQ E++ +R++ +GDV+VHAD+ GA   ++KN +P 
Sbjct: 562 PFWFEKFIFFLSSDGYLVLGGRDVQQTEILYRRHLKRGDVFVHADVQGAIPVIVKN-KPG 620

Query: 613 --EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             + P+PP TL+QAG   V  S AWDSK V  AWW   +QVSKT P GEYL  G F+I G
Sbjct: 621 TLDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWANANQVSKTTPLGEYLVTGGFVICG 680

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSHLNER 702
           +KN LPP  L++GF ++F++   S+ +H   R
Sbjct: 681 EKNQLPPAQLLLGFAVMFQISGESIKNHTKHR 712


>gi|407039370|gb|EKE39608.1| zinc knuckle domain containing protein [Entamoeba nuttalli P19]
          Length = 959

 Score =  351 bits (901), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 245/738 (33%), Positives = 382/738 (51%), Gaps = 93/738 (12%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +L     + VYD++ + Y+ KL  +          K  +++ESGVR+H T Y R+K + P
Sbjct: 27  KLQNFNINTVYDVNRRLYVIKLSKTDC--------KEFIVIESGVRVHLTEYNREKSDFP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT +LRK++  ++L  + Q+G DR+I   FG     + +I++LY+ GNI L D E+ +
Sbjct: 79  NNFTSRLRKYLNKKKLIKINQIGNDRVIELVFGNATERYSLIVDLYSNGNICLCDQEYKI 138

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDG 199
           L  LR+   D  G  +    +YP              LH         DAN    +NE  
Sbjct: 139 LLTLRNFTFDKTGDKVAVGEKYP--------------LHLL------SDAN---GINELK 175

Query: 200 NNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII 259
           N +              K +D    S          K  TLK ++     +G  LS+H  
Sbjct: 176 NII--------------KEYDTIFTSE-------SMKGWTLKQLINYTSDFGQQLSDHCC 214

Query: 260 LDTGL--VPNMKLSEVNKLEDNAIQ-VLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
              G       +L E N+ E + ++ +L  A+ ++E     + SG    +GYI       
Sbjct: 215 SQFGKESSKTKRLEEFNEEEKSLMKKILEEAITRYEK----IDSGKC--KGYIFYH---- 264

Query: 317 GKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKA 376
                     +  + Y+E    +  Q   R++++FE+F+ A+DEF+S IE Q  E + + 
Sbjct: 265 --------ETNKKKYYEEVSCDIFYQDSKRKYIEFESFEKAMDEFHSHIEKQEYEAEVEK 316

Query: 377 KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMS 436
           KE     K+  +    + R   L  + +     AE +E N++ VD  I  + V L  +M 
Sbjct: 317 KEMIMKKKIQAVIDGHQKRYQGLLDKAETLKNEAEAVEENIQVVDQLIQEINVFLKEKMK 376

Query: 437 WEDLARMVKEERKAGNP--VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVD 494
           WE +  ++ E  K  +P  +A  I +   +   + L L +      +E+K +   +VEV 
Sbjct: 377 WEQIEGII-ESLKENDPTSIAKYIKRFDFKNEVVVLELKHT-----NEDKII---EVEVA 427

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE-KKTRLQILQEKTVANISHMRK 553
           L  +   N R +YE++K   +K EKT+ +   A K AE K+ R+   ++ T+ ++  MRK
Sbjct: 428 LNKNGFENIRNFYEMRKNILAKAEKTMESKDLAIKQAENKQERVAKEKKITLVDVKKMRK 487

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPE 613
             WFEKF+WF+SSEN+++ISG+DA QN++I +RYM   DVYVHAD+HGA+S +IK   P 
Sbjct: 488 RFWFEKFHWFLSSENFIIISGKDALQNDVIYRRYMKSTDVYVHADIHGAASCIIKG-IPG 546

Query: 614 QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
           + +   TL QAG   VC S AW SK+VTSAWWVY  QVSKTAP+GEYLT GSFMIRGKKN
Sbjct: 547 KTIGAPTLEQAGKIAVCRSSAWTSKIVTSAWWVYSDQVSKTAPSGEYLTTGSFMIRGKKN 606

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
           +LPP PL+ G G++F +++    +H  E  ++ E + +   E+     + S+ E +K+  
Sbjct: 607 YLPPVPLVFGIGIMFAVEKEDKENH--EEVIQQETKEVQQKENVESVIKISEQERDKEQK 664

Query: 734 DEK----PV-AESLSVPN 746
           +EK    PV  E ++V N
Sbjct: 665 EEKQEVVPVQVEKVNVKN 682


>gi|226292279|gb|EEH47699.1| DUF814 domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 1261

 Score =  351 bits (901), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 284/942 (30%), Positives = 447/942 (47%), Gaps = 144/942 (15%)

Query: 58   LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L+++ G R H T Y+R     PS F  +LRK ++TRR+  V QLG DRII     L    
Sbjct: 149  LIVDIGFRCHLTEYSRTTAAAPSPFISRLRKFLKTRRVTAVSQLGTDRII--DIALSDGN 206

Query: 118  HYVILELYAQGNILLTDSEFTVLTLLR-SHRDDDKGVAIMSRHRYPTEICRVFERTTASK 176
             +++LE Y  GNI+LTD ++ ++ L R  H   ++            E  RV        
Sbjct: 207  FHLLLEFYVGGNIILTDKDYKIVALHRIVHGGGER------------EEVRV-------G 247

Query: 177  LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
            L   +T+ +  +   P  +      +  A  E   G+ G         SNK    G + +
Sbjct: 248  LQYGITNKQNYNGVPPLSIERLRETLQRA--EEAEGESGAVE---GPGSNKR---GKKRQ 299

Query: 237  QPTLKTVLGEALGYGPALS-EHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDW 294
               LK  +       PAL  +H     G   N++  +   LED+ + + L+L + + E+ 
Sbjct: 300  TEALKRAISRGFPEYPALLLDHSFHAAGFDANLEPKQA--LEDSELMKRLMLVLTEAENV 357

Query: 295  LQDVISGDIVPEGYILMQNK-HLGKDHPPTESGSS---TQIYDEFCPLLLNQFRS---RE 347
            +  + + +  P GYI+++ +   G+     ++ S      +Y +F P    QF +     
Sbjct: 358  IARLSTLEDTP-GYIILKGESKTGEAITEADTDSPKPKNMLYHDFHPFKPKQFENVPGMT 416

Query: 348  FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             + F TF+ A+DE++S +ESQ+ E +   +E+ A  KL     DQENRV  LK+  +  V
Sbjct: 417  ILTFNTFNKAVDEYFSSVESQKLEYRLTEREEIARRKLEAAQKDQENRVGALKEVQELHV 476

Query: 408  KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERN 466
            + A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L L  N
Sbjct: 477  RKAQAIEANLLRVEEAINAVNGLIAQGMDWVEIARLIEMEKSRQNPVAKVIKLPLKLYEN 536

Query: 467  CMSLLLS---------------------------NNLDEMDDEEKTLPVEKVEVDLALSA 499
             ++LLL                            N +     ++    +  +++DL +S 
Sbjct: 537  TVTLLLGEPTEDEEPADESDEEEEDSESGDEDGGNKVKLERSKKAQQQLLSIDIDLGISP 596

Query: 500  HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVH 555
             ANAR++YE +K    K+EKT+ +  KA K+ EKK     +  + QEK +  +   R   
Sbjct: 597  WANARQYYEQRKAAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPI--LRPTRTPF 654

Query: 556  WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPE 613
            WFEKF +F+SS+ YLV+ GRD QQ E++ +RY+ KGDVYVHAD+ GA+   +KN    P+
Sbjct: 655  WFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPGTPD 714

Query: 614  QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
             P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKTAP+GE++  G F+IRG+K+
Sbjct: 715  APIPPGTLSQAGNLCVATSSAWDSKAVMGAWWVNADQVSKTAPSGEFVGTGGFVIRGEKH 774

Query: 674  FLPPHPLIMGFGLLFRLDESSLGSHLNER----------------------RVRGEEEGM 711
             LPP  L++G+ ++F++ E S+ +H   R                      +   E  G 
Sbjct: 775  QLPPAQLLLGYAVMFQISEDSIKNHTKFRVQDEPSIVEIAKEVQANEVLHSKQDSEAPGA 834

Query: 712  DDFEDSGHHKENSDIESEKDDTDEKPVAESL-SVPNSAHPAPSHTNASNVDSHEFPAEDK 770
            D  ++     E  D   E+D+  + P+   + S P+ +    +     +    + P++D 
Sbjct: 835  DGNKEISLASEEHDSSDEQDEETDNPLLTGMESEPDDS--GGNENKGGDNGEEKLPSDDT 892

Query: 771  TISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKH 830
                  D K ++   +V    T  LE              S     I   + D+SE+   
Sbjct: 893  D-----DEKEYN---SVVTKETVVLE--------------SGGDEPITQPEADVSEQQPG 930

Query: 831  VERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQ---------PESI 881
            + +   ++   ++S  ERR+LKKG    V+   +E+   R  DA SQ         P   
Sbjct: 931  ITKRQAIK---HLSARERRQLKKG----VL---IEQTSVRVADAESQSSSPTPSVAPSVT 980

Query: 882  VRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
                        RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 981  TTTNTNTLNSNIRGKRGKSKKLATKYQHQDEEDRELALRLLG 1022


>gi|295673284|ref|XP_002797188.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226282560|gb|EEH38126.1| DUF814 domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1258

 Score =  350 bits (897), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 298/945 (31%), Positives = 448/945 (47%), Gaps = 150/945 (15%)

Query: 58   LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L+++ G R H T Y+R     PS F  +LRK ++TRR+  V QLG DRII   F  G N 
Sbjct: 243  LIVDIGFRCHLTEYSRTTAAAPSPFISRLRKFLKTRRVTAVSQLGTDRIIDIAFSDG-NF 301

Query: 118  HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY-----PTEICRVFERT 172
            H ++LE YA GNI+LT                DK   I++ HR        E  RV    
Sbjct: 302  H-LLLEFYAGGNIILT----------------DKDYKIVALHRIVHGGGEKEEVRV---- 340

Query: 173  TASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
                L   +T+ +  +   P  +      +  A  E   G+ G         +NK    G
Sbjct: 341  ---GLQYDITNKQNYNGVPPLSIERLRETLQRA--EEAEGECGAVE---GPGTNKR---G 389

Query: 233  ARAKQPTLKTVLGEALGYGPALS-EHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAK 290
             + +   LK  +       PAL  +H     G   N++  +   LED+ + + L+L + +
Sbjct: 390  KKKQAEALKRAISMGFPEYPALLLDHSFHAAGFDANLEPKQA--LEDSELMKRLMLVLTE 447

Query: 291  FEDWLQDVISGDIVPEGYILMQ-NKHLGKDHPPTESGSS---TQIYDEFCPLLLNQFRS- 345
             E     + + +  P GYI+ +     G+     ++ S      +Y +F P    QF + 
Sbjct: 448  AESVNARLSTLEDTP-GYIISKAESKTGEAITEADTDSPKPKNMLYHDFHPFEPKQFENV 506

Query: 346  --REFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
                 +KF+TF+ A+DE++S +ESQ+ E +   +E+ A  KL     DQENR+  LK+  
Sbjct: 507  PGMTILKFKTFNKAVDEYFSSVESQKLEYRLTEREEIARRKLEAAQKDQENRIGALKEVQ 566

Query: 404  DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLY 462
            +  V+ A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L 
Sbjct: 567  ELHVRKAQAIEANLLRVEEAIKAVNGLIAQGMDWVEIARLIEMEKSRQNPVANVIKLPLK 626

Query: 463  LERNCMSLLLS--------------------------NNLDEMDDEEKTLPVEKVEVDLA 496
            L  N ++LLL                           N +     ++    +  +++DL 
Sbjct: 627  LYENTVTLLLGEPTEDEEPADESEEEEDSESDDEDGGNKVKLEGSKKAQQQLLSIDIDLG 686

Query: 497  LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMR 552
            +S  ANAR++YE K+    K+EKT+ +  KA K+ EKK     +  + QEK +  +   R
Sbjct: 687  ISPWANARQYYEQKRVAAVKEEKTLKSTKKAIKSTEKKVTTDLKHALKQEKPI--LRPTR 744

Query: 553  KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH-- 610
               WFEKF +F+SS+ YLV+ GRD QQ E++ +RY+ KGDVYVHAD+ GA+   +KN   
Sbjct: 745  TPFWFEKFMFFVSSDGYLVLGGRDLQQTEILYRRYLKKGDVYVHADVQGATPIFVKNKPG 804

Query: 611  RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             P+ P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKTAP+GE++  G F+IRG
Sbjct: 805  TPDAPIPPGTLSQAGNLCVASSSAWDSKAVMGAWWVNADQVSKTAPSGEFVGTGGFVIRG 864

Query: 671  KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED-------------- 716
            +K+ LPP  L++GF ++F++ E S+ +H  + RV+ E   +D  +D              
Sbjct: 865  EKHQLPPAQLLLGFAVMFQISEDSIKNH-TKYRVQDEPSIVDIAKDIQWANEVLNSKQDS 923

Query: 717  ----SGHHKENSDIESEKDDTDEK--PVAESLSVPNSAHPAPSHTN---ASNVDSHEFPA 767
                +  +KE S    E D +DE+   +   L     + P  S  N     +    + P 
Sbjct: 924  EAPRADGNKEISPASEEHDSSDEQDEEIENPLLTGMESEPDDSGGNEDKGGDNGEEKLPN 983

Query: 768  EDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEE 827
            +D       D K ++   +V    T  LE  +D  +    A +S    GI   Q      
Sbjct: 984  DDTD-----DEKEYN---SVVTKETVVLESGVDEPITQSEADVSKQPTGITKRQ------ 1029

Query: 828  DKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESI------ 881
                       D  +++  ERR+LKKG         +E+   R  DA SQ  S       
Sbjct: 1030 -----------DIKHLTARERRQLKKGV-------LIEQTSGRVGDAESQSSSPTPSVAP 1071

Query: 882  -VRKTKIEGGKIS--RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
             V  T      IS  RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 1072 SVTTTTNTNTVISNIRGKRGKSKKLATKYQHQDEEDRELALRLLG 1116


>gi|209875685|ref|XP_002139285.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209554891|gb|EEA04936.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 1427

 Score =  346 bits (888), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 259/846 (30%), Positives = 421/846 (49%), Gaps = 128/846 (15%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   D+ A V  + + L G +  N+YD++ +TY+ K          G   K+ LL
Sbjct: 1   MVKSRMTAIDICAMVHSIAKDLKGQKLVNIYDINHRTYLLKF---------GGEGKLFLL 51

Query: 60  MESGVRLHTTAYARDKKNTP--------SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
           +E+G+R HTT + R  + T         S F  KLR+++R R+L D+ Q+  DRI+   F
Sbjct: 52  IEAGIRFHTTHWKRGSQQTMNSSSVVSISYFNNKLRRYLRGRKLVDMAQMDLDRIVKLTF 111

Query: 112 GLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER 171
           G G N  ++ILE +  GNI+LTD+ + +L +LR    D+  ++I  R+ +   I    + 
Sbjct: 112 GFGENIFHLILEFFVAGNIILTDNNYNILVILR----DNGNLSIGKRYNWENSI----DI 163

Query: 172 TTASKLHAALTSSKEPDAN---EPDKVN-------EDGNNVSNASKENLGGQKGGKSFDL 221
             +  +  ++  S  PD +    P  +        ED  N+    KE   G +  K   +
Sbjct: 164 DCSHAVFPSILRSPAPDIDVDQAPWMIQWLDESYLEDQLNI--MIKEAEAGSEE-KQLQI 220

Query: 222 SKNS----NKNSNDGARAKQP---TLKTVLGEALGYG-PALSEHIILDTGL-----VPNM 268
           S+ S    +K  ND   + QP   T + +LG+ L +  P + + ++   GL     V + 
Sbjct: 221 SRGSTNKRSKQGNDTIPSNQPSGITSQVLLGKILRFCHPIMLQQLLEKYGLDKDQLVTSS 280

Query: 269 KLSEVNK-----LEDNAIQVLVLAVAKFEDWLQDVI-SGDIVPEGYILMQNKHLGKDHPP 322
            + +++K     ++D    + +L  ++    +   + S D + EG +++++    + H  
Sbjct: 281 SIRDISKKFIKCIKDAKYLLGILCNSEVLGIMTLCLTSRDQMKEGDLILRDLQQVETHVS 340

Query: 323 TESGSSTQ------IYDEFCP------------------LLLNQFRSREFVKFETFDAAL 358
           +E  +  +      +Y  F P                  L++N+F S+       F   +
Sbjct: 341 SECKAKAEQDKTEPLYISFSPYVKDHEWIYSVQALPKDGLIVNRFTSK-------FSDCV 393

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEFYS I+  +  ++ + +E A   K++K+ +DQE R+  L +E +  +K A  +E    
Sbjct: 394 DEFYSSIDINKETKEIQQEEKAINSKIDKLRIDQERRLKELVEEKEACIKRANFMECCEL 453

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            ++  +L  R  +A    W+D+   V+++RK G+P+A  I  L LE + + +      DE
Sbjct: 454 LLEKILLLTRHLIATGAQWKDICNEVRQQRKIGHPIAKYIKSLDLEHDRVVVYFG--ADE 511

Query: 479 MDDE------------EKTLPVEKVEVDLALS--AHANARRWYELKKKQESKQEKTITAH 524
             ++             K    E +E+ L +S    AN R  YE  K   +K E+T +A+
Sbjct: 512 FPEDFDYSRYGYGESNSKLKSQEGIEIYLNISKSMQANIRSEYEESKHISAKLERTKSAY 571

Query: 525 SKAFKAAEKKT-----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
            +A     K       +L       V  I  +R+ +WFEKF+WFISS+ +LVI G D+ Q
Sbjct: 572 KRALNKVTKTVNRNTEKLTGPLNTGVNRIHKIRQSYWFEKFHWFISSDGFLVIGGNDSSQ 631

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           NE++ +RY+ K D Y+HAD HGA++ ++KN +    +P  TL +AG  ++C+S++W +K 
Sbjct: 632 NELLYRRYLEKNDRYIHADTHGATTCIVKNPKNLADIPMNTLCEAGQMSICYSRSWANKT 691

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
           V SAWWVYP QVSKTAP+GEYLT GSF+IRGKKNFLPP  L MG  L+F           
Sbjct: 692 VISAWWVYPDQVSKTAPSGEYLTTGSFVIRGKKNFLPPLKLEMGIALVFV---------- 741

Query: 700 NERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASN 759
            + + + E+E + D ED     E+S   SE  DT+ K         NS     SH N+ N
Sbjct: 742 -KTKKQAEKEELSDLEDISSKFEDSTY-SETVDTEIKVNL------NSNISDKSHVNSDN 793

Query: 760 VDSHEF 765
             S +F
Sbjct: 794 DLSSKF 799



 Score = 44.3 bits (103), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 27/58 (46%), Positives = 32/58 (55%), Gaps = 7/58 (12%)

Query: 871  GKDASSQPESIVRKTKIEGG-------KISRGQKGKLKKMKEKYGDQDEEERNIRMAL 921
            GK     P    R   IE G       K+SR +K KLKKM  KYG+QDE+ER +RM L
Sbjct: 1191 GKIELKMPNISSRGRSIESGNNQSTNQKLSRRKKFKLKKMALKYGEQDEQERKLRMVL 1248


>gi|213403135|ref|XP_002172340.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
 gi|212000387|gb|EEB06047.1| DUF814 family protein [Schizosaccharomyces japonicus yFS275]
          Length = 1013

 Score =  345 bits (884), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 249/739 (33%), Positives = 387/739 (52%), Gaps = 85/739 (11%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +  DV+A    L+ RL+G R +N+YDL+ +T++ K     G  +  ES    +++
Sbjct: 1   MKQRFSALDVSAITAELKDRLLGCRLNNIYDLNARTFLLKF----GKQDVKES----VII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN---- 116
           ESG R+H T + R+     SGF  KLRKH+++RRL ++ QL  DR+++F FG G N    
Sbjct: 53  ESGARVHATKFQRNPAPL-SGFVTKLRKHLKSRRLTNLYQLRSDRVVVFTFGGGENDSDP 111

Query: 117 --AHYVILELYAQGNILLTDSEFTVLTLLRS-HRDDDKGVAIMSRHRYPTEICRVFERTT 173
              +Y++ E +A GNILL D  F +L+LLR    D ++  A+  R+              
Sbjct: 112 AWTYYLVCEFFAAGNILLLDGSFKILSLLRVVTFDKNQFYAVGQRY-------------- 157

Query: 174 ASKLHAALTSSKEPDANEP-----DKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKN 228
              L+ ALT ++   + E      D++ E    V++ S  N                 K 
Sbjct: 158 --DLNDALTEAQRTISMESLSLLLDQITEQEKAVADVSPTNE-----------EVKDTKK 204

Query: 229 SNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
           SN   + K  TL+  L   LG YG AL EH I  + L P M  S+    E+   ++L  A
Sbjct: 205 SNKSKKPKVTTLRKALTIRLGRYGNALIEHCIRLSQLDPLMLASDFKNDEEKKKELLE-A 263

Query: 288 VAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI-----YDEFCPLLLNQ 342
             + +  + D     I  +GYI    + + K    T +  + Q+     +  F PL L Q
Sbjct: 264 FHEADKIMNDATKPPI--KGYIFGLQQDIIKSGEETGAQKTEQVLMYEDFHPFKPLQLLQ 321

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
             +R  ++F +++  +DEF+S +ESQ+ E+Q+  +      ++     D EN++  L++ 
Sbjct: 322 NNNRTCIEFPSYNECVDEFFSSLESQKIEKQNHDRLKTFAKRIENAKRDVENKLKELQKA 381

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KL 461
            + S K A+ IE N + V+ AI  V   +   M W D+ +++  +++  +  A +I   L
Sbjct: 382 QELSEKKAQAIELNPQLVEGAIEYVNSLVGQAMDWLDIEKLITVQQRRQHAFASVIRLPL 441

Query: 462 YLERNCMSLLLSN-NLDEMDDEEKTL--------------PVEK---------VEVDLAL 497
            L++N ++L+L + N   +D+E +                PV++         VEVDLAL
Sbjct: 442 QLKKNLITLVLPDPNPLAVDEESEQSESESDSEPESTIITPVQRRLIQPKGLAVEVDLAL 501

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISHMRKVH 555
            A ANAR  Y  ++    K+EKTI + SKA K  +K+    L+    +    ++  R+  
Sbjct: 502 GAFANARVHYNNRRLAALKEEKTIESSSKAIKNTQKRAEADLKTAAAEAKQALTASRRTF 561

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           +FEKF+WFISS+ YLV+ GRD QQ E++ ++Y +KGDVYV ADL  +SS +IKN     P
Sbjct: 562 FFEKFHWFISSDGYLVLGGRDNQQRELLYEKYCNKGDVYVSADLPNSSSVIIKNRNENDP 621

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +PP TL QAG   +  S+AWD+K V SAWWV  H VSK   T + L  G F I  +KN+L
Sbjct: 622 IPPNTLQQAGALALATSKAWDTKTVISAWWVPIHAVSKVDQTKQILPTGHFWINEEKNYL 681

Query: 676 PPHPLIMGFGLLFRLDESS 694
           PP  L+MG+G+L+ LDE S
Sbjct: 682 PPTNLVMGYGILWFLDEVS 700


>gi|315050252|ref|XP_003174500.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
 gi|311339815|gb|EFQ99017.1| hypothetical protein MGYG_02028 [Arthroderma gypseum CBS 118893]
          Length = 1093

 Score =  344 bits (883), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 285/977 (29%), Positives = 465/977 (47%), Gaps = 172/977 (17%)

Query: 14  EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VK + R     ++G+R +N+YD+S +T++FKL        +    K  L++ +G   H 
Sbjct: 9   DVKVISRELSTNILGLRIANIYDISGRTFLFKL--------ALPDIKKQLIINAGFHCHI 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   G+   Y  LE +A G
Sbjct: 61  TESSRTTADAPSHFVSRLRKLLKTRRITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAG 118

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
           N++LTD+++ ++ LL              RH  P       +     KL +         
Sbjct: 119 NLILTDAKYGIVALL--------------RHVAPGSDIEEVKVGMTYKLES--------- 155

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
                K+N +G  +   + E L              S  + ++G++  + +L     E  
Sbjct: 156 -----KMNYNG--IPPLTVERL-------------KSALSKDNGSKVLKRSLYFGFPE-- 193

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
            Y P L +H     G   + KL     L DN +   ++ V +  D + + +S D    GY
Sbjct: 194 -YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQGLMGVLQEADRINNTLSSDCQHPGY 250

Query: 309 ILMQNKHLGKDHPPTESGSSTQI-----YDEFCPLLLNQFR---SREFVKFETFDAALDE 360
           I+ +N         ++ G STQ      + +F P   +Q +   +   ++FE+F++A+D+
Sbjct: 251 IIAKNIAPSA----SDGGDSTQQAPVTEFRDFHPFEPSQTKDLPNTTTLRFESFNSAVDK 306

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE NL  V
Sbjct: 307 YFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIETNLLQV 366

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL--- 476
           + A+ AV   +A  M W ++AR+++ E+   NPVA  I   L L  N +++LL+  +   
Sbjct: 367 EEAMTAVNGLVAQGMDWVEIARLIEMEQGKRNPVALSIKLPLKLYENTITVLLNEEVAEE 426

Query: 477 -------------------------------------DEMDDEEKTLPVEKVEVDLALSA 499
                                                 + + +EK      +++DL +S 
Sbjct: 427 EEEEESDESDEEEDEDDDDGYGDDEYERPKQKKRLVNPQREKKEKKDTRLSIDIDLGISP 486

Query: 500 HANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHMRKVH 555
            ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QEK V  +   R   
Sbjct: 487 WANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV--LRRTRNPT 544

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G    +IKN      
Sbjct: 545 WFEKFFFFISSDGYLVIGGRDQQQDEILFQRYMKKGDIYVHTDLEGGVPLIIKNKPDTPD 604

Query: 616 VPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
            P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G FMI+G+KN
Sbjct: 605 DPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKN 664

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
            +PP  +++GF +LF++   S+ +H   + +    EG    + + +   +S  ++ + D 
Sbjct: 665 HIPPGQIVLGFAVLFQISSQSIQNHA--KSLPATSEG----DVNNYQPISSAADTAQSDR 718

Query: 734 DEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTP 793
           DE       +VP+    A  H   S+ +  E   +DK +S  ++ K+  I          
Sbjct: 719 DE-------NVPSEQEDA--HEPGSDGEKEEL-NDDKAVS--LEEKVEFI---------- 756

Query: 794 QLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKK 853
             ED +D      SA +  T+      Q  L  E++    ++T+ ++P  S     +   
Sbjct: 757 YFEDDLDP----DSAQVHETEK-----QEALQPEEQSAHGSSTIAEEPEDSNESEDE--- 804

Query: 854 GQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEE 913
              S +  P   +E        S P  I      +     RG++GK KK+  KY DQDEE
Sbjct: 805 ---SQLTTPSAVQESR-----PSTPLVISSAGTQKFRPPVRGKRGKAKKLAMKYKDQDEE 856

Query: 914 ERNIRMALLAVSTLTCT 930
           +R + + LL  +  T T
Sbjct: 857 DRKLALRLLGSAAGTST 873


>gi|261195108|ref|XP_002623958.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239587830|gb|EEQ70473.1| DUF814 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 1150

 Score =  343 bits (881), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 303/991 (30%), Positives = 462/991 (46%), Gaps = 177/991 (17%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L + L+G+R SN+YDLS + Y+FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVISRELSQALVGLRISNIYDLSSRIYLFKLAKPDTRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T Y+R     PS F ++LRK ++TRR+  V Q+G DRII  +   G N H V
Sbjct: 53  DTGFRCHLTEYSRTTAAAPSPFIVRLRKFLKTRRVTAVTQVGTDRIIDIELSDG-NFH-V 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE YA GNI+LTD E+ ++ L   HR   +G           E  RV        L   
Sbjct: 111 LLEFYAGGNIILTDKEYKIVAL---HRIVPEG--------NDQEEVRV-------GLQYV 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT+ +  +   P  +      +  A      G+  G       N+ +     A A +  +
Sbjct: 153 LTNKQNYNGVPPLSIERLRETLEQAKDVAGSGEGAG-------NTKRAEKKQAEALRRAV 205

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVI 299
                E   Y P L EH+   TG+ P++K  +V  L DN  ++ L+LA+ + E     + 
Sbjct: 206 SLGFPE---YPPLLLEHVFHITGVDPSLKPEQV--LGDNELVEKLMLALVEAESVNSSLS 260

Query: 300 SGDIVPEGYIL-------MQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FV 349
           + D  P GYI+       +++  +    P     S    Y +F P    QF ++     +
Sbjct: 261 TADDTP-GYIVSKTEIKSVEDSEVTATDP---FKSKNLQYVDFHPFEPKQFENQADMAIL 316

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           KF+TF+ A+DE++S +E Q+ E +   +E+ A  KL     DQE RV  LK+  +  V+ 
Sbjct: 317 KFDTFNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRK 376

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A+ IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L  N +
Sbjct: 377 AQAIEANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTV 436

Query: 469 SLLLSNNL------------------------------DEMDDEEKTLPVEKVEVDLALS 498
           +LLL                                   +  +++    +  +++DL +S
Sbjct: 437 TLLLGEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGIS 496

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANISHMRKVHW 556
             ANAR++YE KK    K+EKT+ +  KA K+ EKK    + Q  ++    +  +R   W
Sbjct: 497 PWANARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFW 556

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
           FEKF +FISS+ YL + GRDAQQ E++ +R++ KGDVYVHAD+ GA    +KN    P+ 
Sbjct: 557 FEKFIYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDA 616

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG   V  S AW SK V                 GEYL  G F+IRG+KN 
Sbjct: 617 PIPPGTLSQAGNLCVATSSAWHSKAV----------------MGEYLETGGFVIRGEKNQ 660

Query: 675 LPPHPLIMGFGLLFRLDESS-------------LGSHLNERRVRGEEEGMDDF----EDS 717
           LPP  L++GF      D+SS             L S L+++  R E E  + +    ++ 
Sbjct: 661 LPPAQLLLGFA-----DDSSTTTGVKETQGMEELPSRLDQQTPR-ESENKETYHQPEQND 714

Query: 718 GHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS-NGI 776
              +EN +IE   DD    P           H     +++ + D      ED+    +  
Sbjct: 715 SSDEENGEIEENTDDKRTNPF---------LHEKAESSDSDSEDGESKIGEDRPQDVDAK 765

Query: 777 DSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT 836
           D + +D A + A         + + ALG    S    + G E           H + +A 
Sbjct: 766 DEREYDHAESKA---------VEEAALGGKETSSQEEQAGSEP----------HTD-SAA 805

Query: 837 VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKIS--- 893
            R    +S  E  +LKK  G S+         E+     + PES  R T  E  + S   
Sbjct: 806 ARPAKRLSATENGQLKK--GVSI---------EQASTPPTDPES--RLTPNEPSRSSTPN 852

Query: 894 -RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
            RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 853 IRGKRGKNKKIATKYQHQDEEDRELALRLLG 883


>gi|239610682|gb|EEQ87669.1| DUF814 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 1131

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 303/991 (30%), Positives = 462/991 (46%), Gaps = 177/991 (17%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L + L+G+R SN+YDLS + Y+FKL       +        L++
Sbjct: 1   MKQRFSSLDVKVISRELSQALVGLRISNIYDLSSRIYLFKLAKPDTRKQ--------LIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T Y+R     PS F ++LRK ++TRR+  V Q+G DRII  +   G N H V
Sbjct: 53  DTGFRCHLTEYSRTTAAAPSPFIVRLRKFLKTRRVTAVTQVGTDRIIDIELSDG-NFH-V 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE YA GNI+LTD E+ ++ L   HR   +G           E  RV        L   
Sbjct: 111 LLEFYAGGNIILTDKEYKIVAL---HRIVPEG--------NDQEEVRV-------GLQYV 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT+ +  +   P  +      +  A      G+  G       N+ +     A A +  +
Sbjct: 153 LTNKQNYNGVPPLSIERLRETLEQAKDVAGSGEGAG-------NTKRAKKKQAEALRRAV 205

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVI 299
                E   Y P L EH+   TG+ P++K  +V  L DN  ++ L+LA+ + E     + 
Sbjct: 206 SLGFPE---YPPLLLEHVFHITGVDPSLKPEQV--LGDNELVEKLMLALVEAESVNSSLS 260

Query: 300 SGDIVPEGYIL-------MQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSRE---FV 349
           + D  P GYI+       +++  +    P     S    Y +F P    QF ++     +
Sbjct: 261 TADDTP-GYIVSKTEIKSVEDSEVTATDP---FKSKNLQYVDFHPFEPKQFENQADMAIL 316

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           KF+TF+ A+DE++S +E Q+ E +   +E+ A  KL     DQE RV  LK+  +  V+ 
Sbjct: 317 KFDTFNKAVDEYFSSVECQKLESRLTEREEMAKRKLEAAQKDQEKRVGVLKEARELHVRK 376

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A+ IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L  N +
Sbjct: 377 AQAIEANLLRVEEAMNAVNGLIAQGMDWVEIARLIEMEQTRQNPVAKVIKLPLKLYENTV 436

Query: 469 SLLLSNNL------------------------------DEMDDEEKTLPVEKVEVDLALS 498
           +LLL                                   +  +++    +  +++DL +S
Sbjct: 437 TLLLGEPTEDEEPMDESDEEDEDEESSEDEESERKLGGSKKPEQQLQQQLLSIDIDLGIS 496

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANISHMRKVHW 556
             ANAR++YE KK    K+EKT+ +  KA K+ EKK    + Q  ++    +  +R   W
Sbjct: 497 PWANARQYYEQKKAAAVKEEKTLMSAKKAIKSTEKKVTADLKQALKQNKPVLRPVRTPFW 556

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQ 614
           FEKF +FISS+ YL + GRDAQQ E++ +R++ KGDVYVHAD+ GA    +KN    P+ 
Sbjct: 557 FEKFIYFISSDGYLALGGRDAQQTEILYRRHLKKGDVYVHADVQGAIPFFVKNKPDTPDA 616

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           P+PP TL+QAG   V  S AW SK V                 GEYL  G F+IRG+KN 
Sbjct: 617 PIPPGTLSQAGNLCVATSSAWHSKAV----------------MGEYLETGGFVIRGEKNQ 660

Query: 675 LPPHPLIMGFGLLFRLDESS-------------LGSHLNERRVRGEEEGMDDF----EDS 717
           LPP  L++GF      D+SS             L S L+++  R E E  + +    ++ 
Sbjct: 661 LPPAQLLLGFA-----DDSSTTTGVKETQGMEELPSRLDQQTPR-ESENKETYHQPEQND 714

Query: 718 GHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTIS-NGI 776
              +EN +IE   DD    P           H     +++ + D      ED+    +  
Sbjct: 715 SSDEENGEIEENTDDKRTNPF---------LHEKAESSDSDSEDGESKIGEDRPQDVDAK 765

Query: 777 DSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT 836
           D + +D A + A         + + ALG    S    + G E           H + +A 
Sbjct: 766 DEREYDHAESKA---------VEEAALGGKETSSQEEQAGSEP----------HTD-SAA 805

Query: 837 VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKIS--- 893
            R    +S  E  +LKK  G S+         E+     + PES  R T  E  + S   
Sbjct: 806 ARPAKRLSATENGQLKK--GVSI---------EQASTPPTDPES--RLTPNEPSRSSTPN 852

Query: 894 -RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
            RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 853 IRGKRGKNKKIATKYQHQDEEDRELALRLLG 883


>gi|391869409|gb|EIT78607.1| putative RNA-binding protein [Aspergillus oryzae 3.042]
          Length = 1103

 Score =  342 bits (877), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 230/750 (30%), Positives = 373/750 (49%), Gaps = 115/750 (15%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV    + L   ++ +R SN+YDLS + ++FKL             +  L++
Sbjct: 1   MKQRFSSLDVKVISQELASEIVNLRVSNIYDLSSRIFLFKLAKPD--------HRKQLIV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T Y+R   + PS F  ++RK +R+RR+  V+Q+G DRII   F  GM   ++
Sbjct: 53  DSGFRCHVTQYSRATASMPSPFVTRMRKFLRSRRITSVKQIGTDRIIDISFSDGMYHMFL 112

Query: 121 ----------------ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
                           IL LY Q ++   +     +    +++ +  G+  ++  R    
Sbjct: 113 EFFAGGNIIITDREHNILALYRQVSVSEGEEARVGIQYTVTNKQNYYGIPEITLDRI--- 169

Query: 165 ICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKN 224
                 R T  K  A                 EDG                       K 
Sbjct: 170 ------RETLEKAKALF-------------AREDG---------------------APKK 189

Query: 225 SNKNSNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQV 283
           S K + D        L+  L +    Y P L +H  +   + P   L +V  L+D ++  
Sbjct: 190 SKKKNAD-------VLRKALSQGFPEYPPLLLDHAFVTKEVDPTTPLDKV--LQDESLLQ 240

Query: 284 LVLAVAKFEDWLQDVISGDIVPEGYILMQN------KHLGKDHPPTESGSSTQIYDEFCP 337
            V  V +        +S      GYI+ ++      +   ++  P+E+G+   +Y++F P
Sbjct: 241 EVNGVLQEAQNENTRLSTQESHPGYIVAKDDNRSVSQSANENEKPSETGNL--LYEDFHP 298

Query: 338 LLLNQFRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
               QF  +     ++F + +A +DE++S IE+Q+ E +   +E+AA  KL  +  + E 
Sbjct: 299 FKPRQFEGKPGISILEFPSLNATVDEYFSSIETQKLESRLTEREEAAKRKLEAVRQEHEK 358

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           ++  LK++ +  ++ A  IE N+  V  A+ AV   +A  M W ++AR+++ E+  GNPV
Sbjct: 359 KIGALKEQQELHIRKASAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEMEQSRGNPV 418

Query: 455 AGLID-KLYLERNCMSLLLSNNLDEMDD--------------------EEKTLP-VEKVE 492
           A +I   L L  N ++LLL    DE D+                    E +  P V  ++
Sbjct: 419 ARIIKLPLKLHENTITLLLGEAGDEQDEGDELFSSDESEKSEDEQDNGESQQPPSVLTID 478

Query: 493 VDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQILQEKTVANISH 550
           +DL +S  ANA+++YE KK+   K+++T  + +KA K+ EKK    L+   +K    +  
Sbjct: 479 IDLGISPWANAKQYYEQKKQAAVKEQRTAQSSTKALKSHEKKVTEDLKRGMKKEKQTLRQ 538

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            R+  WFEKF +FISSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA   ++KN 
Sbjct: 539 TRQPFWFEKFLFFISSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGARPMIVKNR 598

Query: 611 R--PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
              P  P+PP TL+QAG   V  S AWDSK V SAWWV   Q++KTA  G  L +G F++
Sbjct: 599 SKDPTAPIPPSTLSQAGNLCVATSSAWDSKAVMSAWWVQASQITKTAEVGGLLPMGDFLV 658

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           +G+KNFL P  L++GFG+ F++ + SL +H
Sbjct: 659 KGEKNFLAPSQLVLGFGVTFQISKDSLKNH 688


>gi|118350963|ref|XP_001008760.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89290527|gb|EAR88515.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 1213

 Score =  342 bits (877), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 207/532 (38%), Positives = 309/532 (58%), Gaps = 60/532 (11%)

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG--DIVPEG 307
           + P + +HII   GL PN K++    + D AI      + +  D  +D+I      V +G
Sbjct: 272 HNPVI-DHIISSNGLNPNQKVT----VADVAI------IKQMADKCKDLILDFQKTVHQG 320

Query: 308 YILMQNKHLGKDHP-----------------PTESGSSTQI------YDEFCPLLLNQFR 344
           Y+++ +K   K  P                 PTE     +       Y +F PL L    
Sbjct: 321 YLIVSDKKEVKHRPNKQEQQQIEGAQNNDEIPTEKAKEEKKEEEKEKYFDFSPLYLTCHE 380

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
            ++F++  +F+A++D+++ ++ +Q+ +++    E  A+ K   I  DQ NR+  LK E +
Sbjct: 381 GKKFIENNSFNASVDKYF-QVMAQKIQEEQNDVESIAWKKYENIKNDQLNRIQKLKNEQE 439

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             V  A+LIE N++ VDA I  ++   ++  SW+ + +M+ E +K G+P+A LI  L  E
Sbjct: 440 EYVVKAQLIEMNIDYVDAIINIIKTLKSSGESWDKITKMINEGKKNGDPMAYLIHSLDFE 499

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
            N +S+LL +  D+M  EE T+    V +D+A SAH NAR +YE KKK   K++KT+ A 
Sbjct: 500 NNEISVLLGDPCDDM--EEYTV----VAIDIAYSAHQNARNYYENKKKNIVKEKKTLDAS 553

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
             A K AEK    +I   K   N+ + RK +WFEKF WFISSENYLVISGRD QQNE+IV
Sbjct: 554 KLALKQAEKTALKEIENLKLKNNVVNTRKQYWFEKFYWFISSENYLVISGRDMQQNEIIV 613

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           K+YM KGD+Y+HAD HGA+ST+IKN   + PV   T+ +A   T+C S+AW++K++ SAW
Sbjct: 614 KKYMRKGDIYMHADFHGAASTIIKNPFKDIPVSQQTIEEAAIATICRSKAWEAKIIASAW 673

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRV 704
           WVY HQVSK A TGEYL  GSFMIRGKKNF+ P  + MG  LL++LD+  +  HLN+RR 
Sbjct: 674 WVYDHQVSKRAETGEYLPSGSFMIRGKKNFIYPARMEMGCTLLYKLDDQFVEKHLNDRRR 733

Query: 705 RGEE-----------EGMDDFEDSGHH-KENSDIESEKDD-----TDEKPVA 739
           + ++           +  +DF+++    + N  +ES++ D      +E P A
Sbjct: 734 KDKDDNTTTVSGVQIDNQNDFDETNFEIRPNMQLESQQSDQGVSIVNEDPFA 785


>gi|71411706|ref|XP_808091.1| hypothetical protein Tc00.1047053507483.60 [Trypanosoma cruzi
           strain CL Brener]
 gi|70872222|gb|EAN86240.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1081

 Score =  342 bits (876), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 240/745 (32%), Positives = 380/745 (51%), Gaps = 105/745 (14%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESGVR+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ESGVRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A   S KE +  +           +NA ++                   ++     A   
Sbjct: 169 AQSESGKEKEEEQ---------RRTNALRQEW-----------------HTVFARHADYE 202

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +   ++L+  +   + W    
Sbjct: 203 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGEITSDAETMFKLLLPGM--LQAW---E 256

Query: 299 ISGDIVPEGYILMQNKH---------------LGKDHPPTES----------GSSTQI-- 331
           I+   +P G  L+ N                 +G+D    E           GS  Q+  
Sbjct: 257 ITFSPLPGGGYLISNHRQRKDSRKGGQEASSKIGEDKSQAEEEKSVNANVADGSQQQMQA 316

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 317 VQYDDFSPVLLAQYSSDGVVMSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 376

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D   R++ L+ E   + +  E I  N+  +D AI  +  ALA  + W+ L  ++K  
Sbjct: 377 FERDHLRRLNALEMEEQENQRKGECIIQNVVKIDEAIGLINGALAAGIQWDALRSLLKRR 436

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRW 506
              G+PVA ++  L+LERN +S+L+ +N  E + EE   +    +EV+L+ +A+ANA  +
Sbjct: 437 HAEGHPVAYMVHDLFLERNSISVLVESNEQEDEGEEDCDVTPMVIEVELSKTAYANATTY 496

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           +   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +S
Sbjct: 497 FAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRTS 556

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--------------NHRP 612
               V+ G+D Q  E++V+R M  GDV+VH D+ GA   +++                 P
Sbjct: 557 CGDFVLQGKDLQTTEILVRRVMQLGDVFVHCDVDGALPCLLRPIGSAWATAFVEDVEGDP 616

Query: 613 EQPVPPLT-------LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           ++     T       L++AG + V  S AW+ K   +AWWV+  Q++    +G YL    
Sbjct: 617 QEGCQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFSVAAWWVHASQINGGTASGCYL---- 672

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRL 690
               G+K++L P P+    GLLFR+
Sbjct: 673 --FDGEKHYLRPQPITFACGLLFRV 695


>gi|344257308|gb|EGW13412.1| Serologically defined colon cancer antigen 1-like [Cricetulus
           griseus]
          Length = 554

 Score =  341 bits (875), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 168/327 (51%), Positives = 225/327 (68%), Gaps = 31/327 (9%)

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           NL+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N 
Sbjct: 2   NLQIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNP 61

Query: 476 --LDEMDDEEKTLPVEK----------------------------VEVDLALSAHANARR 505
             L E +D++    VE                             V+VDL+LSA+ANA++
Sbjct: 62  YLLSEEEDDDGDASVEVSDAEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKK 121

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           +Y+ K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I   RKV+WFEKF WFIS
Sbjct: 122 YYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQKARKVYWFEKFLWFIS 181

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG
Sbjct: 182 SENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAG 240

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF 
Sbjct: 241 TMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFS 300

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMD 712
            LF++DES +  H  ER+VR ++E ++
Sbjct: 301 FLFKVDESCIWRHRGERKVRAQDEDIE 327


>gi|346325475|gb|EGX95072.1| serologically defined colon cancer antigen 1 [Cordyceps militaris
           CM01]
          Length = 1048

 Score =  340 bits (873), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 254/798 (31%), Positives = 392/798 (49%), Gaps = 99/798 (12%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L + L  +R +N+YDLS K  +FK    +         K  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELNQSLTSLRVANIYDLSTKILLFKFAKPN--------TKKQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G R HTT YAR     PS F  +LRK ++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DIGFRCHTTEYARATAGIPSVFVARLRKVLKTRRLTSVSQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE +A GN++LTD+   +L + R+  + D         + P ++           L  +
Sbjct: 111 FLEFFASGNVILTDANLKILAIFRNVLEGD--------GQEPQKVG----------LQYS 152

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L  S++     P+   E       A+ E +   KG  S    K  N        A +  L
Sbjct: 153 L-ESRQNFLGIPELSQERVRTALTAAVETVSATKGHHSKPAPKQGN--------ALRKCL 203

Query: 241 KTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
              + E     P + +H++     DT L P   L + + L       LV  + K  + L 
Sbjct: 204 AVSITE---LPPIIVDHVLQANDFDTSLKPETILEDASLLSS-----LVENLRKARE-LV 254

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ----IYDEFCPLLLNQFRSR---EFV 349
             I+      G+I  + K   + + PTE  SS      +YD+F P +  +F++    E +
Sbjct: 255 GAITSSPSCTGFIFAK-KPAQEQNLPTEDTSSEAKAGLLYDDFHPFVPQKFQNNSKIEIL 313

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +FE F+  +D+F+S +E Q+ + +   +E AA  KL+    DQENR+  L+     + + 
Sbjct: 314 RFEGFNRTVDDFFSSLEGQKLQSRVVEREAAAQRKLDAAKQDQENRLKGLQTSQSDNFRK 373

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCM 468
           A  IE N+E V  A+ ++   LA  M W D+ ++V  E+K  N VA LI   L L  N +
Sbjct: 374 AAAIEANIERVQEAMDSINGLLAQGMDWVDIGKLVAREQKKNNAVANLICLPLSLADNVI 433

Query: 469 SLLLSNNLD---------EMDDE----EKTLPVEK-----------VEVDLALSAHANAR 504
           S+ LS   D         E DD     E  L   K           VE+ L LS  +NAR
Sbjct: 434 SIRLSEEDDAGSEVEDPFETDDSDADSETDLNAAKSVQNYSDKTIIVELTLTLSPWSNAR 493

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHMRKVHWFEKF 560
            +Y+ +K    K+EKT     +A K+ E+K +      + QEK +  +  +R + WFEKF
Sbjct: 494 EYYDQRKTAVVKEEKTQLQADRAIKSTEQKIKHDLKRALKQEKAL--LQPIRNLMWFEKF 551

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP--VPP 618
            WFISS+ YLV+  +D  Q E++ +R++  GD++ HAD + A+  ++KN+   +   + P
Sbjct: 552 YWFISSDGYLVVGAKDKSQAEILYRRHLGSGDIFCHADANNAAIVIVKNNSNTEDAHIAP 611

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL QAG  ++C S+AWDSK    AWWV   QVSK+ PTG+ L  G+F I G+KNFLPP 
Sbjct: 612 ATLAQAGQLSICSSEAWDSKAGIGAWWVNSSQVSKSTPTGDILQPGNFNISGEKNFLPPG 671

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRGEEE--GMDDFEDSGHHKENSDI----ESEKDD 732
            LI+G  ++F++ E S   H N+ R++  +E  G    E   + K+++ I    +   D+
Sbjct: 672 QLILGLSIMFKISEES-EIHHNKHRIQDGDETAGAPGRETETNSKQDTSIMDMNQESSDE 730

Query: 733 TDEKPVAESLSVPNSAHP 750
            DE    +    P  A+P
Sbjct: 731 EDEGDYKDGDKQPTRANP 748


>gi|32565397|ref|NP_497411.2| Protein Y82E9BR.18 [Caenorhabditis elegans]
 gi|373220360|emb|CCD73050.1| Protein Y82E9BR.18 [Caenorhabditis elegans]
          Length = 921

 Score =  340 bits (871), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 177/415 (42%), Positives = 256/415 (61%), Gaps = 12/415 (2%)

Query: 326 GSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKL 385
            +  QIY +F P+ + +F ++   +  +F  A+DEFYS+IE+Q+ EQ+    E  A  KL
Sbjct: 267 STPIQIYQDFNPISM-EFTAKLSKELSSFCEAVDEFYSRIETQKQEQKAVNMEKQALKKL 325

Query: 386 NKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVK 445
             +  DQ++R+  L+    +  +MA  I  N E V+ A+L +R ALAN+ SW+ +  M K
Sbjct: 326 ENVEKDQKDRIEALQLTQSQREQMANRIILNTELVEKALLLIRSALANQFSWQTIEEMRK 385

Query: 446 EERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
                G+PVA  ID    E N   + L+   D  DDE + L   KV +D++L+A  NA+R
Sbjct: 386 TAAGNGDPVAKSIDSFKFENNEFMMSLA---DPYDDEAEVL---KVPIDISLNASKNAQR 439

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
            +  KK    K +KT+ +  KA K A++K +  + Q K V  +   RK  WFEKF WFIS
Sbjct: 440 HFVDKKSAAEKVKKTVASSEKAIKNAQEKAKSTLEQVKIVVEVKKSRKSMWFEKFRWFIS 499

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           SE ++V++GRDAQQNE++VK+Y+   D+Y+HAD+ GASS VI+N   +  +PP TL +A 
Sbjct: 500 SEGFIVVAGRDAQQNELLVKKYLRPNDIYMHADVRGASSVVIRNKSFDAEIPPKTLTEAA 559

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              VC+S AW++ +  SAWWV+P QVS+TAPTGEYL  GSFMIRGKKNF+PP  L+MG G
Sbjct: 560 QMAVCYSNAWEATVTASAWWVHPDQVSRTAPTGEYLPSGSFMIRGKKNFMPPSQLVMGLG 619

Query: 686 LLFRLDESSLGSHLNERRVRGEEEGMDD---FEDSGHHKENSDIESEKDDTDEKP 737
           +LFR+DE S+  H+   + + EE+  +D    EDS   K+ + I     + DE P
Sbjct: 620 ILFRMDEESIERHVALEKSKAEEKSEEDGEKMEDSP--KKTAKIPENPAENDEFP 672



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 67/160 (41%), Positives = 94/160 (58%), Gaps = 8/160 (5%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R    DV A    L++L GMR +NVYD+  KTY+ KL        S   EK ++L E
Sbjct: 1   MKNRFTLVDVIAATTELKKLEGMRVNNVYDIDNKTYLIKL--------SRTDEKAVILFE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SGVRLH T +   K  TPS F++KLRKHI  +RL  +R +G+DR++   FG     + + 
Sbjct: 53  SGVRLHQTFHDWPKSQTPSSFSMKLRKHINQKRLTSIRVVGFDRLVELTFGTEDRENRLY 112

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           +ELY +GN++LTD E T+L +LR   D D  V    R ++
Sbjct: 113 VELYDRGNVVLTDQELTILNILRVRTDKDTSVRWAVREKF 152


>gi|71413048|ref|XP_808681.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70872935|gb|EAN86830.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1082

 Score =  339 bits (869), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 236/745 (31%), Positives = 376/745 (50%), Gaps = 105/745 (14%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESG+R+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ESGIRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A   S KE +                        Q+  K+         ++     A   
Sbjct: 169 AQSESGKEKEEE----------------------QRRTKAL----QQEWHTVFARHADYE 202

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +   ++L+  +   + W    
Sbjct: 203 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGEITSDAETMFKLLLPGM--LQAW---E 256

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--------------------------- 331
           I+   +P G  L+ N    K+       +S++I                           
Sbjct: 257 ITFSPLPGGGYLISNHRQRKESRKGGQEASSKIEEDKSQAEEEKSMNVNVADESQQQMQA 316

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 317 VKYDDFSPVLLAQYSSDGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 376

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D + R++ L+ E   + +  E I  N   +D AI  +  ALA  + W+ L  ++K  
Sbjct: 377 FERDHQRRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLLKRR 436

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRW 506
              G+PVA ++  L+LERN +S+L+ +N  E + EE   +    +EV+L+ +A+ANA  +
Sbjct: 437 HAEGHPVAYMVHDLFLERNSISVLVESNEQEDEGEEDCDVTPMVIEVELSKTAYANATTY 496

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           +   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +S
Sbjct: 497 FAKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRTS 556

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--------------NHRP 612
               V+ G+D Q  E++++R M  GDV+VH D+ GA   V++                 P
Sbjct: 557 CGDFVLQGKDLQTTEILIRRVMQLGDVFVHCDVDGALPCVLRPIGSAWTTAFVEDVEGDP 616

Query: 613 EQPVPPLT-------LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           ++     T       L++AG + V  S AW+ K   +AWWV+  Q++    +G YL    
Sbjct: 617 QEGCQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFSVAAWWVHASQINGGTASGCYL---- 672

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRL 690
               G+K++L P P+    GLLFR+
Sbjct: 673 --FDGEKHYLRPQPITFACGLLFRV 695


>gi|407846065|gb|EKG02413.1| hypothetical protein TCSYLVIO_006562 [Trypanosoma cruzi]
          Length = 1080

 Score =  338 bits (868), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 237/745 (31%), Positives = 376/745 (50%), Gaps = 105/745 (14%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L+G+R  NVYD++PK ++FK  +       GE+++ LLL
Sbjct: 1   MVKQRMTALDVRASVEEMRSELLGLRLLNVYDINPKMFLFKFGH-------GENKRTLLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ESG+R+H T   R+K   PS FTLKLRKH+R  RL+ V QL +DR + F+FG+G  A Y
Sbjct: 54  -ESGIRMHLTQLVREKPKVPSQFTLKLRKHVRAWRLDSVTQLQHDRTVDFRFGVGEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GN++LTD E+ +L LLR+H+DDD  + +  R  YP  + R FE     ++ 
Sbjct: 113 HIIIELFSKGNVVLTDHEYRILLLLRTHKDDD--IKMFVRELYP--VTRPFEEQQEKEVM 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A   S KE +                        Q+  K+         ++     A   
Sbjct: 169 AQSESGKEKEEE----------------------QRRTKAL----RQEWHTVFARHADYE 202

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T+++ L     +GPAL++HI+  TG V N+K  E+    +   ++L+  +   + W    
Sbjct: 203 TIRSTLSAVHHFGPALADHILTVTG-VKNVKKGEITSDAETMFKLLLPGM--LQAW---E 256

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--------------------------- 331
           I+   +P G  L+ N    K+       +S++I                           
Sbjct: 257 ITFSPLPGGGYLISNHRQRKESRKGGQEASSKIEEDKSQAEVEKSVNVNVAEESQQQMQA 316

Query: 332 --YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
             YD+F P+LL Q+ S   V    ++F +  D F+   E+++ EQ ++ K  +   K NK
Sbjct: 317 VQYDDFTPVLLAQYSSDGVVTSFLKSFGSVCDAFFLYTETEKIEQHNEKKTTSVISKRNK 376

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              D + R++ L+ E   + +  E I  N   +D AI  +  ALA  + W+ L  ++K  
Sbjct: 377 FERDHQRRLNALEMEEQENQRKGECIIQNAVKIDEAIGLINGALAAGIQWDALRSLLKRR 436

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRW 506
              G+PVA ++  L+LERN +S+L+ +N  E + EE   +    +EV+L+ +A+ANA  +
Sbjct: 437 HAEGHPVAYMVHDLFLERNSISVLVESNEQEDEGEEDCDVTPMVIEVELSKTAYANATTY 496

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           +   K    K EKT+ A +KA   AEKK      ++KT   I   R+  W+EKF+WF +S
Sbjct: 497 FSKMKSNRIKYEKTVAATAKALAGAEKKGERLAAKQKTKKAIVKERRRFWWEKFSWFRTS 556

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--------------NHRP 612
               V+ G+D Q  E++V+R M  GDV+VH D+ GA   V++                 P
Sbjct: 557 CGDFVLQGKDLQTTEILVRRVMQLGDVFVHCDVDGALPCVLRPIGSAWTTAFVEDVEGDP 616

Query: 613 EQPVPPLT-------LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           ++     T       L++AG + V  S AW+ K   +AWWV+  Q++    +G YL    
Sbjct: 617 QEGCQAKTCRIHMTSLDEAGAWCVSRSSAWEGKFSVAAWWVHASQINGGTASGCYL---- 672

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRL 690
               G+K++L P P+    GLLFR+
Sbjct: 673 --FDGEKHYLRPQPVTFACGLLFRV 695


>gi|116193227|ref|XP_001222426.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
 gi|88182244|gb|EAQ89712.1| hypothetical protein CHGG_06331 [Chaetomium globosum CBS 148.51]
          Length = 1115

 Score =  338 bits (867), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 236/703 (33%), Positives = 352/703 (50%), Gaps = 101/703 (14%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R +N+YDL+ K  + K        +        +L+ESG R H T +AR     PS
Sbjct: 26  LVSLRLANIYDLNSKILLLKFAKPDNRQQ--------VLIESGFRCHLTDFARAAAPAPS 77

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  V Q+G DRII F+F  G  A+ + LE +A GN++LTD++  +L
Sbjct: 78  AFVARLRKFLKTRRVTGVSQIGTDRIIEFRFSDG--AYRLYLEFFAGGNVILTDADLKIL 135

Query: 141 TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGN 200
            LLR        +    + + P  +   +       L      +KE       ++ +   
Sbjct: 136 ALLR--------IVPEGKGQEPQRVGLTYSLENRQNLGGVPPLTKE-------RLRDALT 180

Query: 201 NVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIIL 260
            V+  +      +K G             +DG R    T  T L       P L +H+  
Sbjct: 181 TVTAQAATEKAKKKKG-------------SDGLRRGIVTTITELP------PVLIDHVFR 221

Query: 261 DTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH 320
             G  P    +EV  L D ++   +    +    + D ++     +GYI+       K +
Sbjct: 222 LRGFNPTTTPTEV--LNDESLFNALFGSLEEARSISDEVTSSPTAKGYII------AKPN 273

Query: 321 PPT-------------ESGSSTQIYDEFCPLLLNQF---RSREFVKFETFDAALDEFYSK 364
           P T             +  +   +Y++F P L  QF   R  E + F+ ++  +D F+S 
Sbjct: 274 PRTAELLKEGEEEEGQKEKARNLLYEDFQPFLPKQFEDIRDCEILSFDGYNKTVDNFFSS 333

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           +E Q+ E + + +E  A  KL     DQ  R+  L+     +++ A  +E N+E V  A+
Sbjct: 334 LEGQKLESRLQEREITAKRKLEAARRDQAQRIEGLQDVQMLNLRKAAAVEANIERVQEAM 393

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE-------------------R 465
            AV   +   M W D+ ++V+ E+K  NPVA +I KL ++                    
Sbjct: 394 DAVNGLIQQGMDWVDINKLVEREQKQHNPVAEMI-KLPMKLHESVITLLLGEEEEEGKVE 452

Query: 466 NCMSLLLSNNLDEMDD--EEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQEKT 520
             M      + +  DD  EEK+   +K   ++++L LS   NAR +Y+ K+    KQEKT
Sbjct: 453 EEMDFDYDTDEETADDAAEEKSKGPDKRLAIDINLKLSPRNNARYYYDQKRTAADKQEKT 512

Query: 521 ITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           +     A K AE+K     +  + QEK +  +  +RK  WFEKF WF+SS+ YLV+ GRD
Sbjct: 513 VQRSEIALKNAEQKIAEDLKKGLKQEKPI--LQPIRKQMWFEKFTWFVSSDGYLVLGGRD 570

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQA 634
           AQQNE++ KRY+ KGDVYVHAD+HGASS VIKN+   P+ P+PP TL QAG  +VC S A
Sbjct: 571 AQQNEILYKRYLRKGDVYVHADMHGASSVVIKNNPKTPDAPIPPSTLAQAGNLSVCCSSA 630

Query: 635 WDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           WDSK    AWWV   QVSK+AP+GEYL VGSFM+RGK+N LPP
Sbjct: 631 WDSKAAMGAWWVNADQVSKSAPSGEYLPVGSFMVRGKRNLLPP 673


>gi|340975808|gb|EGS22923.1| hypothetical protein CTHT_0014010 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 1116

 Score =  337 bits (864), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 237/715 (33%), Positives = 352/715 (49%), Gaps = 128/715 (17%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+ +R SN+YDL+ K  + K        +        LL+ESG R H T +AR     PS
Sbjct: 21  LVSLRLSNIYDLNSKILLLKFAKPDCRRQ--------LLIESGFRCHLTDFARTAAPAPS 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  +LRK ++TRR+  + Q+G DRII FQF  G  A+ + LE +A GN++LTD++  +L
Sbjct: 73  AFVARLRKFLKTRRVTRISQIGTDRIIEFQFSDG--AYRLYLEFFASGNVILTDADLKIL 130

Query: 141 TLLR---------------SHRDDDK----GVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            LLR               ++R D++    GV  ++R R                L  AL
Sbjct: 131 ALLRNVPEGEGQEPQRVGLTYRLDNRQNYGGVPALTRER----------------LRTAL 174

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
            ++ E    +P                                S K + D  R    T  
Sbjct: 175 QTAVEQAVKKP--------------------------------SKKKAADELRRGLATTI 202

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
           T L       P L +H+         +K  EV K ED   + L  A+ +    L ++ S 
Sbjct: 203 TELP------PVLVDHVFQLNKFDSTVKPLEVLKNED-LFESLFKALEQGRAILDEITSS 255

Query: 302 DIVPEGYILMQ-NKHL------GKDHPPTESGSSTQIYDEFCPLLLNQFR---SREFVKF 351
            ++ +GYI+ + N H       G + P     +S+ +Y++F P L  QF    + E + F
Sbjct: 256 PVL-KGYIIAKPNPHAQEQASEGGEAP--NGKASSLLYEDFQPFLPKQFEEDPNLEVLTF 312

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           + F+  +DEF+S +E Q+ + + + +E  A  KL     DQ  R+  L++    +++ A 
Sbjct: 313 DGFNKTVDEFFSSLEGQKLQSRLQEREATAKKKLEAARQDQAKRIEGLQEAQVLNLRKAA 372

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
            IE N+E V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I   + L  N ++L
Sbjct: 373 AIEANIERVQEAMDAVNGLLQQGMDWVDINKLVEREQKLHNPVAEIIKLPMRLHENIITL 432

Query: 471 LLSNNLD------------EMDDEEKTLPVEK----------VEVDLALSAHANARRWYE 508
           LL    +            + D+E    P  +          V+++L LS   NAR +YE
Sbjct: 433 LLGEEEEEGPEDEEMDFEYDTDEEAANDPQPEKAKGPDKRLAVDINLKLSPWNNAREYYE 492

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFI 564
            K+    K +KTI     A K AE K     +  + QEK +  +  +R+  WFEKF WFI
Sbjct: 493 QKRSAADKAQKTIQQAEIALKNAEMKIAKDLKKDLKQEKPI--LQPIRQQLWFEKFIWFI 550

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLN 622
           SS+ YLV+ GRDAQQNE++ KRY  KGDV+VH+D+ GA++ +IKN    P+ P+PP TL 
Sbjct: 551 SSDGYLVLGGRDAQQNEILYKRYFKKGDVFVHSDVKGAATVIIKNDPKTPDAPIPPATLT 610

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
           QAGC +VC S AWDSK    AWWV   +VSK  PTG+ +  G+FMI G++N L P
Sbjct: 611 QAGCLSVCCSSAWDSKAAMGAWWVTADKVSKLGPTGDPMPEGTFMINGERNPLEP 665


>gi|327303108|ref|XP_003236246.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
 gi|326461588|gb|EGD87041.1| hypothetical protein TERG_03295 [Trichophyton rubrum CBS 118892]
          Length = 1098

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 278/976 (28%), Positives = 458/976 (46%), Gaps = 178/976 (18%)

Query: 14  EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VK + R     ++G+R +N+YD+S +T++FKL        +    K  L++ +G   H 
Sbjct: 9   DVKVISRELSANILGLRIANIYDISGRTFLFKL--------ALPDIKKQLIINAGFHCHL 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   GM   Y  LE +A G
Sbjct: 61  TESSRTTADAPSHFVSRLRKLLKTRRITGVRQIGTDRIIEFEISDGMFRLY--LEFFAAG 118

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
           N++LTD+++              G+  + R   P       +     +L + L       
Sbjct: 119 NLILTDAKY--------------GIVALLRQVAPGSDIEEVKIGMTYRLESKL------- 157

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
                    + N +   + E L              S    ++G++  + +L     E  
Sbjct: 158 ---------NYNGIPPLTIERL-------------KSALEQDNGSKVLKRSLYFGFPE-- 193

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
            Y P L +H     G   + KL     L DN +   ++ V +  D +   +S D    GY
Sbjct: 194 -YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQKLMEVLQEADRVNTALSSDTQQAGY 250

Query: 309 ILMQNKHLGKDHPPTESGSSTQI-----YDEFCPLLLNQFR---SREFVKFETFDAALDE 360
           I+ +N         ++ G  TQ      + +F P   +Q +   +   ++F  F++A+D 
Sbjct: 251 IIAKNVAPAA----SDVGGGTQTAPMAEFRDFHPFEPSQSKEAPNTTILRFGNFNSAVDR 306

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           ++S IE+Q+ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE NL  V
Sbjct: 307 YFSSIEAQKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIETNLPRV 366

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEM 479
           + A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++LL+    E 
Sbjct: 367 EEAMNAVNGLVAQSMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTITVLLNEGGTED 426

Query: 480 DDEEKTL----------------------PVEK-------------------VEVDLALS 498
           D+EE+                        P +K                   +++DL +S
Sbjct: 427 DEEEEEEEEPEEEEEEDDDDGYGDDEYERPSQKKHSAKPLKEKKEKKDTRLSIDIDLGIS 486

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHMRKV 554
             ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QEK V  +   R  
Sbjct: 487 PWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV--LRRTRNP 544

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ 614
            WFEKF +FISS+ YLVI GRD QQ+E++ +RY+ KGD+YVH DL G    ++KN     
Sbjct: 545 TWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYLKKGDIYVHTDLDGGVPLIVKNKPDAP 604

Query: 615 PVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
             P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G FMI+G+K
Sbjct: 605 DDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEK 664

Query: 673 NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDD 732
           N +PP  +++GF +LF++   SL                       +H ++     E D 
Sbjct: 665 NHIPPGQIVLGFAVLFQISNRSL----------------------QNHTKSLPSAPEDDV 702

Query: 733 TDEKPVAESLSVPNSAHPAPSHTNASNVDSHE---FPAEDKTISNGIDSKIFDIARNVAA 789
           T+E+P++ +  +  S         A+  D  E      ED+      D+K  DI+    A
Sbjct: 703 TNEEPISSTADMDQS--------EANQSDQEEDVPLEQEDEHQVESEDAKK-DISDERVA 753

Query: 790 PVTPQLEDL-IDRALGLGSASISSTKHGIETTQFDLSE-EDKHVERTATVRDKPYISKAE 847
           P+  QL+ + ++ +L   +A ++      E  +++ S+ E++ VE  +   ++   S   
Sbjct: 754 PLGEQLQSIHVEGSLDSNAAQVT------EADKYEASQAENQPVEGPSKNAEETEDSGES 807

Query: 848 RRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKY 907
             + +    S++ + +           + + +  V           RG++GK KK+  KY
Sbjct: 808 NDESRLATSSAIRESRSSTPSVISSSGTQKSKPPV-----------RGKRGKAKKLATKY 856

Query: 908 GDQDEEERNIRMALLA 923
            DQDEE+RN+ + LL 
Sbjct: 857 KDQDEEDRNLALRLLG 872


>gi|425773025|gb|EKV11400.1| hypothetical protein PDIG_50370 [Penicillium digitatum PHI26]
 gi|425782195|gb|EKV20118.1| hypothetical protein PDIP_19610 [Penicillium digitatum Pd1]
          Length = 1107

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 241/748 (32%), Positives = 374/748 (50%), Gaps = 105/748 (14%)

Query: 4   VRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG 63
           V++ T ++A+E  C    + +R SN+YDLS + ++FKL             +  L+++SG
Sbjct: 10  VKVITQELASE--C----VNLRVSNIYDLSSRIFLFKLAKPD--------HRRQLIIDSG 55

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R H T Y R    TPS F  +LRK++++RR+  + Q+G DRII   F  G  A+++ LE
Sbjct: 56  FRTHVTQYTRTTATTPSPFVTRLRKYLKSRRITGISQIGTDRIIEISFSDG--AYHIFLE 113

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL-- 181
            +A GNI+LTD E+ +L   R                      +V       ++ A L  
Sbjct: 114 FFAGGNIILTDREYNILAFFR----------------------QVAAGVGQEEIKAGLKY 151

Query: 182 -TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
             S+K+     PD +  D    +    + L  Q+G    +  K   K   D        L
Sbjct: 152 TVSNKQNYDGVPD-ITADRVLQTLEKAQGLSAQEG----NAPKKFKKKGTD-------VL 199

Query: 241 KTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           +  L +    Y P L +H+           L +V   +D  +Q +   + +         
Sbjct: 200 RKALSQGFPEYPPLLLDHVFAIKEFDTTTPLDQVIGSQD-LLQAVKEVLEESRRVSNTFD 258

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVK---FETFDA 356
           SG   P GYI+ +          T S ++  +Y++F P    QF ++  +K   FE F+A
Sbjct: 259 SGASHP-GYIVAKEDTRPIPEGETSSKAAGLLYEDFHPFKPRQFENKPGIKILEFERFNA 317

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
            +DE++S +ESQR E +   +E+AA  KL  +  + + R+  LK   +  ++ A  I+ N
Sbjct: 318 TVDEYFSSLESQRLESRLTEREEAAKKKLESVRFEHKKRIDELKNVQELHIRKANAIQDN 377

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSN- 474
           +  V  A+ AV   +A  M W ++AR+++ E+  GNPVA +I   L L  N +SLLL   
Sbjct: 378 VYRVQEAMDAVNGLVAQGMDWGEIARLIEMEQDRGNPVAQIIKLPLKLYENTVSLLLGEA 437

Query: 475 -------------------NLDEMDDE----EKTLPVEKVEVDLALSAHANARRWYELKK 511
                              + +E D E    E+   +  +++DL LS  ANA ++Y+ KK
Sbjct: 438 GDDEDEEEEFSSSDESDSDSENEADQETSSAERESKLLTIDIDLGLSPWANASQYYDQKK 497

Query: 512 KQESKQEKTITAHSKAFKAAEKK--TRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           +   K+++T  + +KA K+ EKK  T L+   +K    +   R   WFEKF +FISSE Y
Sbjct: 498 QASEKEQRTTQSSTKALKSHEKKVTTELKRGLKKEKQVLRQARTPFWFEKFVFFISSEGY 557

Query: 570 LVI----------------SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--R 611
           LVI                S RDA Q+E++ +RY+SKGD++VHADL GA+  V+KN    
Sbjct: 558 LVIGYVIPLNTVLRHTNPSSARDAMQSELLYRRYLSKGDIFVHADLEGATPIVVKNRAGS 617

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFMIRG 670
            + P+ P TL+QAG   V  S AWDSK V SAWW + HQVSK A  G   +  G F I+G
Sbjct: 618 ADAPISPSTLSQAGNLCVATSTAWDSKAVMSAWWAHAHQVSKIAENGSGIMPTGVFQIKG 677

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           +KNFL P  L++GFG++F++ + S+ +H
Sbjct: 678 EKNFLAPSQLVLGFGIMFQVSQESVRNH 705


>gi|429858117|gb|ELA32948.1| duf814 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 1040

 Score =  335 bits (859), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 192/488 (39%), Positives = 286/488 (58%), Gaps = 38/488 (7%)

Query: 252 PALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM 311
           P L +H    TG   + K +E+   E   +  L++A+ +    ++D  S     +GYI  
Sbjct: 212 PILVDHSFKTTGFDGSKKPAEILDNE-TLLDDLLVALTEARSIVKDATSS-ATAKGYIFA 269

Query: 312 QNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVK---FETFDAALDEFYSKI 365
           + ++   D  P E G + +   +Y++F P L N+F +   +K   F+ F+  +DEF+S +
Sbjct: 270 KYRN-QPDETPAEEGQTKRSDLLYEDFHPFLPNKFANDPTIKVLEFDGFNKTVDEFFSSL 328

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           E Q+ E +   +E AA  KL     DQ  R+  L++    +V+ A  IE N+E V  A+ 
Sbjct: 329 EGQKLESKLSEREAAAKRKLEAARNDQAKRIEGLQEVQSLNVQKATAIEANVERVQEAMD 388

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL------------ 472
           AV   L   M W D++++++ E+K GNPVA +I   L L  N ++LLL            
Sbjct: 389 AVNGLLQQGMDWIDISKLIEREQKRGNPVAEIIKLPLNLADNTITLLLGEEEDIEDEDSN 448

Query: 473 -------SNNLDEM-DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                  S++ DE   +++KT    +V+V++ L+ +ANAR +YE K+    K+EKT+   
Sbjct: 449 YETDSDASDSEDEAASNKQKTAKHLEVDVNIGLTPYANAREYYEQKRSAAKKEEKTVQQT 508

Query: 525 SKAFKAAEKKTRLQIL----QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
             A K AE+K + ++     QEK V  ++ +RK  WFEKF WFIS++ YLV+ G+DAQQN
Sbjct: 509 EIALKNAEQKIQAELRKGLKQEKAV--LAPIRKQIWFEKFIWFISTDGYLVLGGKDAQQN 566

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKN--HRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           EM+ KRY+ KGDVY+HAD+HGA++ +IKN    P+ P+PP TL QAG   VC S AWDSK
Sbjct: 567 EMLYKRYLRKGDVYIHADIHGAATVIIKNTPSDPDAPIPPSTLAQAGTLAVCSSSAWDSK 626

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
               AWWV   QVSK+APTGEYL  GSFM+RG+KNFLPP  L++GFG+++++ E S   H
Sbjct: 627 AGMGAWWVKADQVSKSAPTGEYLPTGSFMVRGQKNFLPPAQLLLGFGIMWKISEESKARH 686

Query: 699 LNERRVRG 706
           +  R   G
Sbjct: 687 VKHRLYDG 694



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/145 (38%), Positives = 82/145 (56%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+  L+ +R +NVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSIDVKVIAHELQENLVSLRLANVYDLSSKILLLKFAKPDN--------KKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T + R     PS F  +LRK ++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DSGFRCHLTDFTRTTAAAPSAFVTRLRKFLKTRRLTKVSQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE +A GN++LTD++  +LTLLR+
Sbjct: 111 FLEFFASGNVILTDADLKILTLLRN 135


>gi|452840445|gb|EME42383.1| hypothetical protein DOTSEDRAFT_73267 [Dothistroma septosporum
           NZE10]
          Length = 1122

 Score =  334 bits (856), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 191/488 (39%), Positives = 290/488 (59%), Gaps = 39/488 (7%)

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYI 309
           + P L +H +  TG+   ++L  V   ++   +VL  A+ +    + D+ S  +   GYI
Sbjct: 211 FPPVLIDHALHVTGVDRQIELEAVIGRDEELDKVLK-ALQEANRVIDDITSLPVA-RGYI 268

Query: 310 LMQNKHLGKDHPPTESGSSTQI-YDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKI 365
           L + K    D   T +  +  + Y++F P    Q     +  F++ E F+ A+D+F+S I
Sbjct: 269 LAKRKVPKADANTTATEDNQNVMYEDFHPFKPAQLEGDPANVFIEHEGFNKAVDDFFSSI 328

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           E Q+ E + + +E+ A  ++ +   +QE R+  L+Q  + +++ A+ IE N+E V+ A+ 
Sbjct: 329 EGQKLESRLQEREENAKRRIEQARQEQEKRITGLQQVQELNIRKAQAIEANVERVEEAVA 388

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLS----------- 473
           AV   +A  M W D+ R+++ E+K  NPVA +I   L L  N  +LLLS           
Sbjct: 389 AVNGLIAQGMDWVDIGRLIENEQKRHNPVAEMIKLPLKLHENTATLLLSELADADDEDMD 448

Query: 474 ---NNLDEMDDEEKTLPVEK----------VEVDLALSAHANARRWYELKKKQESKQEKT 520
              +   + +DE+    ++K          V++DLA S  +NAR++Y+ ++   +KQEKT
Sbjct: 449 ETDSEPSDSEDEDHQANIKKSFVPEDERLTVDIDLAASGWSNARQYYDQRRTAATKQEKT 508

Query: 521 ITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
             A  KA K+ E+K     +  + QEK V  +  +RK  WFEKF +FISS+ YLV++G+D
Sbjct: 509 AQAAQKALKSTEQKVMADLKKGLKQEKEV--LRPVRKQFWFEKFIYFISSDGYLVLAGKD 566

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQA 634
           AQQNEM+ +R++ KGDVYVHAD+HGA+S +IKN+   P+ P+PP +L+QAG  +VC S A
Sbjct: 567 AQQNEMLYRRHLRKGDVYVHADMHGAASVIIKNNPATPQAPIPPSSLSQAGNLSVCTSSA 626

Query: 635 WDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESS 694
           WDSK V SAWWV   QVSKTAPTGEYLT G FM+RGKKNFLPP  L++GF L+F++ E S
Sbjct: 627 WDSKAVMSAWWVNADQVSKTAPTGEYLTTGGFMVRGKKNFLPPAQLLLGFALVFQISEDS 686

Query: 695 LGSHLNER 702
              H   R
Sbjct: 687 KAKHAKHR 694


>gi|325093107|gb|EGC46417.1| DUF814 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 1136

 Score =  331 bits (849), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 261/816 (31%), Positives = 395/816 (48%), Gaps = 136/816 (16%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++++G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRII  +   G N 
Sbjct: 66  LIVDTGFRCHLTRYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIIDIELSDG-NF 124

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           H V+LE YA GNI+LTD E+ +L L   HR   +G           E  RV        L
Sbjct: 125 H-VLLEFYAAGNIILTDKEYKILAL---HRIVPEG--------SDQEEVRV-------GL 165

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLG-GQKGGKSFDLSKNSNKNSNDGARAK 236
              LT+ +  +   P  + E   +    SK+  G  +  GK+    K + K   +  R  
Sbjct: 166 QYVLTNKQNYNGVPPLSI-ERLRDALEKSKDVTGPAEAAGKN----KRAKKKQAEALRR- 219

Query: 237 QPTLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
                     +LG   Y P L EH       DT L P  +L E  KL +  +  LV+A  
Sbjct: 220 --------AVSLGFPEYPPLLLEHAFHITGFDTSLKPE-QLVEDPKLAEKLMVALVVA-- 268

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI----YDEFCPLLLNQFRS 345
             E+    + + +  P GYI+ + +    +    +S   +++    Y +F P    QF S
Sbjct: 269 --ENVNSSLSTAEETP-GYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPFEPKQFES 325

Query: 346 R---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
                 ++F+TF  A+DE++S +ESQ+ E +   +E+ A  KL     DQ+ RV  LK+ 
Sbjct: 326 EPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRVGVLKEA 385

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KL 461
            +  ++ A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   L
Sbjct: 386 QELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAKVIKLPL 445

Query: 462 YLERNCMSLLLSNNLDEMD-------------------------------DEEKTLPVEK 490
            L  N ++LLL    +  +                                ++   P+  
Sbjct: 446 KLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKTRQPLLS 505

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVA 546
           +++DL +S  ANAR++YE KK    K+EKT+ +   A K+ EKK     +  + QEK V 
Sbjct: 506 IDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALKQEKPV- 564

Query: 547 NISHMRKVHWFEKFNWFISSENYLVI---------------------SGRDAQQNEMIVK 585
            +   R   WFEKF +F+SS+ YLV+                     SGRD QQ E++ +
Sbjct: 565 -LRPTRTPFWFEKFIFFLSSDGYLVLGLVTVLMSCGFLLCFIANCVSSGRDVQQTEILYR 623

Query: 586 RYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
           R++ +GDV+VHAD+ GA   ++KN    P+ P+PP TL+QAG   V  S AWDSK V  A
Sbjct: 624 RHLKRGDVFVHADVQGAIPIIVKNKPGTPDAPIPPGTLSQAGNLCVATSTAWDSKAVMGA 683

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER- 702
           WWV   QVSKT P GEYL  G F+I G+KN L P  L++GF ++F++   S+ +H   R 
Sbjct: 684 WWVNADQVSKTTPLGEYLVTGGFVICGEKNHLSPAQLLLGFAVMFQISGESIKNHTKHRV 743

Query: 703 -----------RVRGEEE---GMDDFEDSGHHKEN-SDIESEKDDTDEKP-VAESLSVPN 746
                         G EE   G+ D E   + K N +D + ++ D  E P + +  ++P 
Sbjct: 744 QDETPISESAKDTLGTEELPSGL-DLETPKYSKINETDHQHQESDAVEVPKLGQMENLPK 802

Query: 747 SAHPAPSHTNASNVDS--HEFPAEDKTISNGIDSKI 780
               +   T++  V    H F  E + + NGI  ++
Sbjct: 803 EEASSEPQTDSITVQPAKHPFVRERRLLKNGIIEQV 838


>gi|258574555|ref|XP_002541459.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237901725|gb|EEP76126.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 1070

 Score =  329 bits (843), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 291/942 (30%), Positives = 447/942 (47%), Gaps = 164/942 (17%)

Query: 35  KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRR 94
           + Y+FKL       +        ++++SG R H T Y R     PS F  +LR+ +++RR
Sbjct: 12  RIYLFKLQKPDVRKQ--------IVIDSGFRCHLTEYTRATAPAPSHFVSRLRQFLKSRR 63

Query: 95  LEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR----SHRDDD 150
           +  V Q+G DRII  +F  G    +++LE +A GNI+LTD+EF +++LLR        D+
Sbjct: 64  VTAVSQVGTDRIIHIEFSDGQ--FHLLLEFFASGNIILTDNEFKIVSLLRIVPEGEEQDE 121

Query: 151 KGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENL 210
             + ++ R         V    +  +L  AL   KE DA++P+                 
Sbjct: 122 IRIGLIYRLDNKQNYGGV-PPLSVDRLRTALERGKERDASQPEAT--------------- 165

Query: 211 GGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII----LDTGLVP 266
                      +K + K  ++  R     L     E   Y P L EH +     D+ L P
Sbjct: 166 -----------TKRAKKKQDEALRR---ALSLGFPE---YPPLLLEHALHVTGFDSTLRP 208

Query: 267 NMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP----EGYILMQNKHLGKDHPP 322
           N  L E + + D  + VL  A           +SG++       GYI+ +N++   + P 
Sbjct: 209 NQIL-EASDMIDELMHVLEEA---------QRVSGELSTAEQTRGYIITRNENKPSEPPT 258

Query: 323 --TESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAK 377
             TE+      Y ++ P    QF        +  E+F+ A+DE+YS +E+Q+ E +   +
Sbjct: 259 QGTETKPDKSSYIDYHPFEPKQFADNPDTRILPLESFNKAVDEYYSSVEAQKLESRLTDR 318

Query: 378 EDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSW 437
           E+    KL     D E RV  LK+    +V+ A+ IE NL  V+ AI A    +A  M W
Sbjct: 319 EETMKRKLEATKRDHEKRVGALKEVQQLNVRKAQAIEANLSKVEEAINAANSLIAQGMDW 378

Query: 438 EDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD------------------- 477
            ++AR+++ E+   NP+A +I   L L  N +++LL + +                    
Sbjct: 379 VEIARLIEMEQSRRNPIAKMIKLPLKLYENTITILLPDGMPVDDESESESEDEDEEDESG 438

Query: 478 -EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT- 535
            E + + +   V  +++DLAL+  ANA ++Y+ KK    K++KTI A  KA K+AEKK  
Sbjct: 439 DEPEKKSREPEVLSIDIDLALTPWANASQYYDQKKTAAMKEDKTIKASKKALKSAEKKVT 498

Query: 536 ---RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
              +  + QEK V  +   R   WFEKF +FISS+ YLV+ G+DA+Q+E++  R++ KGD
Sbjct: 499 ADLKQGLKQEKPV--LRPARTPFWFEKFFFFISSDGYLVLGGQDARQDEILYHRHLQKGD 556

Query: 593 VYVHADLHGASSTVIKNHRP---EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
           VYVH D  GA   +IKN +P   + P+PP TL QAG FTV  S+AWD+K +  AWWV   
Sbjct: 557 VYVHTDTEGAMPMIIKN-KPGAFDDPIPPGTLAQAGTFTVATSRAWDTKALLGAWWVKAE 615

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           QVS+T  TGEYL   S +I G+KN L P  LI+GF +LF++   S+ +H   RR R EE 
Sbjct: 616 QVSRTTATGEYLPT-SVVISGEKNHLAPGQLILGFAVLFQISPESVANH---RRHRLEES 671

Query: 710 GMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAED 769
           G               +ESE D  D +P +E                   V  H+   ED
Sbjct: 672 GSPQIA----------VESE-DGKDPQPPSE-----------------REVLEHD---ED 700

Query: 770 KTISNGIDSKIFDIARNVAAPVTPQLE---DLIDRALGLGSASISSTKHGIETTQFDLSE 826
           K    G + +        A+ + PQ +   DL D      S  + +   G    + D S 
Sbjct: 701 K----GGELEEKGEPSEAASSLHPQNDEHGDLND------STPLMNEPQG----EVDQSS 746

Query: 827 EDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPE-----SI 881
           ED++       + +P  S    +     +  S+      RE+     ++SQP      SI
Sbjct: 747 EDEYDSADPAYQQQPEASDTATKDFSHARSPSI------REEGESVPSTSQPSRTSTPSI 800

Query: 882 VRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
              +  +  +  RG++GK KK+  KY DQDEE+R + + LL 
Sbjct: 801 QSSSTPKSQQQVRGKRGKAKKLASKYKDQDEEDRELALRLLG 842


>gi|340505619|gb|EGR31934.1| hypothetical protein IMG5_099620 [Ichthyophthirius multifiliis]
          Length = 1423

 Score =  328 bits (842), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 165/380 (43%), Positives = 249/380 (65%), Gaps = 8/380 (2%)

Query: 332  YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
            Y EF PL+LN ++ ++  + E+F+  +++++ K+  +  E+Q +  E  A+ K   I  D
Sbjct: 727  YFEFSPLILNSYQGKQIEQMESFNDCINKYFQKMSKKIEEEQKEDVESIAWKKYLNIKTD 786

Query: 392  QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
            QENR+  LK E +  +  A+LIE N +DV+A    ++   ++ ++W+ + +M+ E +K G
Sbjct: 787  QENRIKKLKDEQEEFITKAQLIEENYQDVEAITNILKTMKSSGLAWDKIIKMINEGKKQG 846

Query: 452  NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
            +P+A LI ++  E N +S+ L    D+M +    +PV    VD+  SAH NAR +YE K+
Sbjct: 847  DPLANLIHQIDFENNEVSIYLGFIDDQMSE---LIPVS---VDIYQSAHQNARNYYENKR 900

Query: 512  KQESKQEKTITAHSKAFKAAEKKTRLQI-LQEKTVANISHMRKVHWFEKFNWFISSENYL 570
            K   K++KT+ A   A K AEK    +I  Q+     + ++RK +WFEKF WFI+SENYL
Sbjct: 901  KNVLKEKKTLDATKTALKQAEKTALKEIETQKHKTMQLVNVRKQYWFEKFYWFITSENYL 960

Query: 571  VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN-HRPEQPVPPLTLNQAGCFTV 629
            VISGRD+QQNE++VK+YM KGD+Y+HAD HGA+ST+IKN H+    +   T+ +A   T+
Sbjct: 961  VISGRDSQQNEILVKKYMKKGDIYMHADYHGAASTLIKNPHKDSSFISQQTIEEAAVATI 1020

Query: 630  CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
            C S+AW++K++ SAWWV  HQVSK A TGEYL  GSFMIRGKKNF+ P  + M   +LF+
Sbjct: 1021 CRSKAWEAKIIASAWWVDSHQVSKRAETGEYLPSGSFMIRGKKNFVYPSRMEMACTILFK 1080

Query: 690  LDESSLGSHLNERRVRGEEE 709
            L++ SL  HLN+R+ +  EE
Sbjct: 1081 LNDDSLERHLNDRKRKVNEE 1100


>gi|326479424|gb|EGE03434.1| DUF814 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 979

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 282/980 (28%), Positives = 466/980 (47%), Gaps = 185/980 (18%)

Query: 14  EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VK + R     ++G+R +N+YD+S +T++FKL        +    K  L++ +G   H 
Sbjct: 9   DVKVISRELSANILGLRIANIYDISGRTFLFKL--------ALPDIKKQLIINAGFHCHL 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   G+   Y  LE +A G
Sbjct: 61  TESSRTTADAPSHFVSRLRKLVKTRRITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAG 118

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPD 188
           N++LTD+++ +             VA++ RH        V   +   ++   +T   E  
Sbjct: 119 NLILTDAKYEI-------------VALL-RH--------VAAGSDIEEVKIGMTYRLE-- 154

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
                K+N +G  +   + E L              S  + ++G++  + +L     E  
Sbjct: 155 ----SKLNYNG--IPPLTIERL-------------KSALDQDNGSKVLKRSLYFGFPE-- 193

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
            Y P L +H     G   + KL     L DN +   ++ V +  D +   +S D    GY
Sbjct: 194 -YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQKLMEVLQEADRVNTALSSDSQQAGY 250

Query: 309 ILMQNK-----HLGKD---HPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAA 357
           I+ +N       +G D    P TE       + +F P   +Q +   +   ++FE F++A
Sbjct: 251 IIAKNVAPTALDVGGDIQKAPVTE-------FRDFHPFEPSQSKEAPNTTILRFENFNSA 303

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +D ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE NL
Sbjct: 304 VDRYFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIEINL 363

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNL 476
             V+ A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++LL+   
Sbjct: 364 PRVEEAMNAVNGLVAQGMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTITVLLNEEG 423

Query: 477 DEM--------------------------------DDEEKTLPVEK----------VEVD 494
            E                                   ++ T P+++          +++D
Sbjct: 424 TEDDEEEEEEESEEEEEEEEEDDDGYGDDEYERPSQKKQLTKPLKEKKEMKDTRLSIDID 483

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISH 550
           L +S  ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QEK V  +  
Sbjct: 484 LGISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV--LRR 541

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            R   WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G    ++KN 
Sbjct: 542 TRNPTWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYMKKGDIYVHTDLDGGVPLIVKNK 601

Query: 611 RPEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
                 P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G FMI
Sbjct: 602 PDTPDDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMI 661

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIES 728
           +G+KN +PP  +++GF +LF+         ++ R V+             +HK+      
Sbjct: 662 KGEKNHIPPGQIVLGFAVLFQ---------ISNRSVQ-------------NHKKCLPPAP 699

Query: 729 EKDDTDEKPVAES--LSVPNSAHPAPSH-TNASNVDSHEFPAEDKTISNGIDSKIFDIAR 785
           E   T+++P++ +  +  P +    P         D H+   ED +  + ID ++     
Sbjct: 700 EDGVTNDEPISSTGDMDQPEANQSDPEEDVPLEQEDEHQEEPED-SKKDIIDERV----- 753

Query: 786 NVAAPVTPQLEDL-IDRALGLGSASISSTKHGIETTQFDLSE-EDKHVERTATVRDKPYI 843
              AP+  QL+ + ++ +L    A +       E  + + S+ E++ VE  +   + P  
Sbjct: 754 ---APLGEQLKSMHVEDSLDSNPAQVH------EADKEEASKGENQPVEGPSKNAEGPED 804

Query: 844 SKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKM 903
           S+      +    S +  P   +E       +S P +I      +     RG++GK KK+
Sbjct: 805 SE------QSDDESILATPSATQESR-----ASTPSAISSSGTQKSKPPVRGKRGKAKKL 853

Query: 904 KEKYGDQDEEERNIRMALLA 923
             KY DQDEE+R + + LL 
Sbjct: 854 ATKYKDQDEEDRKLALRLLG 873


>gi|390356696|ref|XP_001200483.2| PREDICTED: nuclear export mediator factor Nemf-like
           [Strongylocentrotus purpuratus]
          Length = 334

 Score =  327 bits (837), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 161/331 (48%), Positives = 230/331 (69%), Gaps = 7/331 (2%)

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           +ESQ+ + +   +E  A  KL+ +  D E R+ +L+Q  + + K   LIE NL  V+ A+
Sbjct: 1   MESQKLDMKVIQQERGALKKLDNVKKDHEKRISSLQQNQELNEKKGALIEINLPLVEQAL 60

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD--- 481
             VR A+AN++ W+++  ++KE +  G+PVA  I  L L+ N   +LL +   + DD   
Sbjct: 61  RVVRSAVANQIDWKEIDSIIKEAQTQGDPVALAIRSLRLDTNHFQMLLRDPYKQYDDADE 120

Query: 482 --EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
             E+   P+  V++D+A SA+ANAR+++  KK  + K++KT+ + SKA K+AEKKT   +
Sbjct: 121 GEEDGARPM-LVDIDIAQSAYANARKYFVQKKTSQKKEQKTMESSSKAIKSAEKKTMQAL 179

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
               TVA+I+  RK +WFEK+ W ISSENY++I+GRD QQNE++VK+Y+S GD+YVHAD+
Sbjct: 180 KDVATVASINKSRKTYWFEKYYWCISSENYIIIAGRDQQQNEIVVKKYLSPGDIYVHADI 239

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
           HGASS +IKN +   PVPP TL +AG   VC+S AWD+K++TSAWWV   QVSKTAPTGE
Sbjct: 240 HGASSVIIKNPKGG-PVPPKTLQEAGTMAVCYSVAWDAKVITSAWWVRHDQVSKTAPTGE 298

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           +LT GSFM+RGKKNFLPP  L+MGFG L ++
Sbjct: 299 FLTTGSFMVRGKKNFLPPTQLVMGFGFLMKV 329


>gi|302665563|ref|XP_003024391.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
 gi|291188443|gb|EFE43780.1| DUF814 domain protein, putative [Trichophyton verrucosum HKI 0517]
          Length = 1074

 Score =  325 bits (833), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 271/946 (28%), Positives = 441/946 (46%), Gaps = 170/946 (17%)

Query: 35  KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRR 94
           +T++FKL        +    K  L++ +G   H T  +R   + PS    +LRK ++TRR
Sbjct: 12  RTFLFKL--------ALPDIKKQLIINAGFHCHLTESSRTTADAPSHLVSRLRKLLKTRR 63

Query: 95  LEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR--SHRDDDKG 152
           +  VRQ+G DRII F+   G+   Y  LE +A GN++LTD+++ ++ LLR  +   D + 
Sbjct: 64  ITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAGNLILTDAKYGIVALLRQVAPGSDIEE 121

Query: 153 VAIMSRHRYPTEI-CRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLG 211
           V I   +R  +++        T  +L +AL                + +NVS A K +L 
Sbjct: 122 VKIGMTYRLESKLNYNGIPPLTIERLKSAL----------------EQDNVSKALKRSL- 164

Query: 212 GQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLS 271
                  F   +                          Y P L +H     G   + KL 
Sbjct: 165 ------YFGFPE--------------------------YPPTLLDHAFNVVGF--DSKLQ 190

Query: 272 EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
               L DN +   ++ V +  D +   +S D    GYI+ +N         ++ G  TQ 
Sbjct: 191 PAQILTDNNLVQKLMEVLQEADRVNTALSSDTQQAGYIIAKNVAPAA----SDVGGGTQT 246

Query: 332 -----YDEFCPLLLNQFR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
                + +F P   +Q +   +   ++FE F++A+D ++S IE+++ E +   KEDAA  
Sbjct: 247 APVTEFRDFHPFEPSQSKEAPNTTILRFENFNSAVDRYFSSIEARKLESRLTEKEDAARK 306

Query: 384 KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
           KL     + E RV+ LK++ +  V+ A  IE NL  V+ A+ AV   +A  M W ++AR+
Sbjct: 307 KLESTKREHEKRVNALKEKQEFHVRKARAIETNLPQVEEAMNAVNGLVAQGMDWVEIARL 366

Query: 444 VKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDEEKTL---------------- 486
           ++ E+  GNPVA  I   L L  N +++LL+    E D+EE+                  
Sbjct: 367 IEMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGTEDDEEEEEDESEEEEEDDDDDGYGD 426

Query: 487 -----PVEK-------------------VEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                P +K                   +++DL +S  ANAR++Y+ KK    K+EKT+ 
Sbjct: 427 DEYERPSQKKHSAKPLKEKKGKKDTRLSIDIDLGISPWANARQYYDEKKIAAVKEEKTLK 486

Query: 523 AHSKAFKAAEKKTR----LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           A +KA K+ E+K +    + + QEK V  +   R   WFEKF +FISS+ YLVI GRD Q
Sbjct: 487 ASTKAIKSTERKVKADLKMALKQEKPV--LRRTRNPTWFEKFFFFISSDGYLVIGGRDHQ 544

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL--TLNQAGCFTVCHSQAWD 636
           Q+E++ +RYM KGD+YVH DL G    ++KN       P    T++QA  +TV  S+AWD
Sbjct: 545 QDEILFQRYMKKGDIYVHTDLDGGVPLIVKNKPDAPDDPIPPNTISQASAYTVASSKAWD 604

Query: 637 SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
           +K     WWV+  QVSK   TG+ L  G FMI+G+KN +PP  +++GF +LF++   S+ 
Sbjct: 605 TKAAMGGWWVHASQVSKMTSTGDILKAGHFMIKGEKNHIPPGQIVLGFAVLFQISNRSVQ 664

Query: 697 SHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTN 756
           +H  + ++   E G+ + E      +    E+ + D +E        VP           
Sbjct: 665 NH-TKSQLSAPEGGVTNEEPISSTADMDQPEANQSDQEE-------DVP----------- 705

Query: 757 ASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDL-IDRALGLGSASISSTKH 815
               D H+  +ED            DI+    AP+  Q++ + +D +L   +A +     
Sbjct: 706 LEQEDEHQVESEDAKK---------DISDERVAPLGEQMQSIHVDDSLDSSAAQV----- 751

Query: 816 GIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDAS 875
                    +E DK  +  +   ++P    ++  +  +  G S  + ++       +  +
Sbjct: 752 ---------TEADK--DEASQAENQPVEGPSKNAEETEDSGESDDESRLATPSATQESRA 800

Query: 876 SQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMAL 921
           S P  I      +     RG++GK KK+  KY DQDEE+R + + L
Sbjct: 801 STPLVISSSGTQKSKPPVRGKRGKAKKLATKYKDQDEEDRKLALRL 846


>gi|345565416|gb|EGX48366.1| hypothetical protein AOL_s00080g336 [Arthrobotrys oligospora ATCC
           24927]
          Length = 1207

 Score =  325 bits (832), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 244/816 (29%), Positives = 381/816 (46%), Gaps = 145/816 (17%)

Query: 28  NVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLR 87
           N++DLS +T+ FK  +S+  T      K +L+++SG R H T +AR+   +PSGF  KLR
Sbjct: 28  NIHDLSSRTFQFKFTSSATQT------KHILIVDSGFRCHLTNFARNVAASPSGFVEKLR 81

Query: 88  KHIRTRRLEDVRQLGYDRIILFQFGL---------------------------------- 113
           K ++TRR+  +RQ+G DRI+  QFG+                                  
Sbjct: 82  KCLKTRRVTGIRQVGSDRIVELQFGIVGDNAAATTSATTATGGGVGGGEGGAEGGVEIKG 141

Query: 114 --GMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY-----PTEIC 166
              +  + +  E +A GNI+LTD+ F ++TLLR   +      I     Y      T   
Sbjct: 142 IPHVGGYRLFFEFFAGGNIILTDASFKIITLLRIVPEGPNQPKIARGETYTISSASTTFG 201

Query: 167 RVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSN 226
            ++  T+ +++  AL S  E   NE  K  +D                  K +   K   
Sbjct: 202 SLYTNTSNAQIKKALKSHLEKRENEEKKGIDDL-----------------KDWQKKKLKK 244

Query: 227 KNSNDGARAKQPTLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLV 285
              +DG       L  VLG  +  +   L EH +L  G+ P++K  EV  + D+AI   V
Sbjct: 245 TRKDDG-------LNRVLGAVMTEFSSTLIEHCLLTVGVDPDLKAGEV--VGDDAIIDKV 295

Query: 286 LAVAKF-EDWLQDVISGDIVPEGYILMQNK------------------------------ 314
               K  E  ++D++    V  G+I+ +                                
Sbjct: 296 AEGFKLAETMVKDIVENKEVI-GWIIAKKPSPKTEKADTEDNGTKSKKNKKKKVAFGDAG 354

Query: 315 ------------HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR---EFVKFETFDAALD 359
                        L +D  P ++ +S  IYD+F P L  QF+ +     +   T++  +D
Sbjct: 355 IKEAEDELEAMLELDEDITP-QTDASGYIYDDFHPFLPTQFKDKPNVHTIPITTYNKTVD 413

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            F+S IESQ+ EQ+   K+  A  +L     + +N++ +LK   +  V+ A+ IE N+E 
Sbjct: 414 SFFSSIESQKLEQKTAEKKSLAAKRLANARNEHKNKIESLKSAQEVHVRKAQAIEANVER 473

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL------- 472
           V+  I AV   +A  M W ++  +V+ E+ AGN VA +I  +    N + + L       
Sbjct: 474 VEEVIDAVNGLIAQGMDWTEIRSLVEREKSAGNGVAEMIRDVKFMENTVVVRLYEEEEED 533

Query: 473 -------SNNLDEMDDEEKTLPVE-KVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
                   +  ++ + EEK       +E+DLAL+ +ANAR +YE K+    K+ KT+ + 
Sbjct: 534 DSDDDDDESGSEDGNGEEKEGRSHLDIEIDLALTGYANARIYYEQKRSAAVKETKTLQSS 593

Query: 525 SKAFKAAEKKTRLQILQEKTV--ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
           +KA K+ EKK +  + Q        +  +R+  W+EKF WF SSE YLV+  +D  Q +M
Sbjct: 594 AKALKSTEKKIQKDLKQAYKAEKMELRTLRRQGWWEKFYWFRSSEGYLVLGAKDPTQADM 653

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPE--QPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           + K+Y  KGDV+VHA++ G+   V+KN   +   P+PP TL+QAG   V  S AW+ KMV
Sbjct: 654 LYKKYFKKGDVWVHAEVPGSCHVVVKNKVEDVNSPIPPGTLSQAGSLAVASSDAWEKKMV 713

Query: 641 TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH-- 698
            SAWW    QV K    G  L  G F+++G+K +LPP  L+MGF + + L +   G    
Sbjct: 714 ISAWWAGYEQVGKIGAGGIVLGTGEFVVKGEKKWLPPAMLVMGFAVGWLLADGEGGEDED 773

Query: 699 -LNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDT 733
            L E R    E    + E+    KE+SD + E  DT
Sbjct: 774 ILEEERTNLPEVSNSE-EEKVEQKEDSDDDEEFPDT 808


>gi|326471330|gb|EGD95339.1| hypothetical protein TESG_02825 [Trichophyton tonsurans CBS 112818]
          Length = 1099

 Score =  324 bits (831), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 237/753 (31%), Positives = 376/753 (49%), Gaps = 138/753 (18%)

Query: 14  EVKCLRR-----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHT 68
           +VK + R     ++G+R +N+YD+S +T++FKL        +    K  L++ +G   H 
Sbjct: 9   DVKVISRELSANILGLRIANIYDISGRTFLFKL--------ALPDIKKQLIINAGFHCHL 60

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T  +R   + PS F  +LRK ++TRR+  VRQ+G DRII F+   G+   Y  LE +A G
Sbjct: 61  TESSRTTADAPSHFVSRLRKLVKTRRITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAG 118

Query: 129 NILLTDSEFTVLTLLR--SHRDDDKGVAIMSRHRYPTEI-CRVFERTTASKLHAALTSSK 185
           N++LTD+++ ++ LLR  +   D + V I   +R  +++        T  +L +AL    
Sbjct: 119 NLILTDAKYEIVALLRHVAAGSDIEEVKIGMTYRLESKLNYNGIPPLTIERLKSAL---- 174

Query: 186 EPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLG 245
                       D +N S   K +L        F            G     PTL     
Sbjct: 175 ------------DQDNGSKVLKRSL-------YF------------GFPEYPPTLLDHAF 203

Query: 246 EALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVP 305
             +G+          D+ L P   L+     ++N +Q L + V +  D +   +S D   
Sbjct: 204 NVVGF----------DSKLQPAQILT-----DNNLVQKL-MEVLQEADRVNTALSSDSQQ 247

Query: 306 EGYILMQNK-----HLGKD---HPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETF 354
            GYI+ +N       +G D    P TE       + +F P   +Q +   +   ++FE F
Sbjct: 248 AGYIIAKNVAPTALDVGGDIQKAPVTE-------FRDFHPFEPSQSKEAPNTTILRFENF 300

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           ++A+D ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE
Sbjct: 301 NSAVDRYFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIE 360

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLS 473
            NL  V+ A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++LL+
Sbjct: 361 INLPRVEEAMNAVNGLVAQGMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTITVLLN 420

Query: 474 NNLDEM--------------------------------DDEEKTLPVEK----------V 491
               E                                   ++ T P+++          +
Sbjct: 421 EEGTEDDEEEEEEESEEEEEEEEEDDDGYGDDEYERPSQKKQLTKPLKEKKEMKDTRLSI 480

Query: 492 EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVAN 547
           ++DL +S  ANAR++Y+ KK    K+EKT+ A +KA K+ E+K     ++ + QEK V  
Sbjct: 481 DIDLGISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV-- 538

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           +   R   WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G    ++
Sbjct: 539 LRRTRNPTWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYMKKGDIYVHTDLDGGVPLIV 598

Query: 608 KNHRPEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           KN       P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G 
Sbjct: 599 KNKPDTPDDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGH 658

Query: 666 FMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           FMI+G+KN +PP  +++GF +LF++   S+ +H
Sbjct: 659 FMIKGEKNHIPPGQIVLGFAVLFQISNRSVQNH 691



 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 218/739 (29%), Positives = 355/739 (48%), Gaps = 124/739 (16%)

Query: 250 YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYI 309
           Y P L +H     G   + KL     L DN +   ++ V +  D +   +S D    GYI
Sbjct: 194 YPPTLLDHAFNVVGF--DSKLQPAQILTDNNLVQKLMEVLQEADRVNTALSSDSQQAGYI 251

Query: 310 LMQNK-----HLGKD---HPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAAL 358
           + +N       +G D    P TE       + +F P   +Q +   +   ++FE F++A+
Sbjct: 252 IAKNVAPTALDVGGDIQKAPVTE-------FRDFHPFEPSQSKEAPNTTILRFENFNSAV 304

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           D ++S IE+++ E +   KEDAA  KL     + E RV+ LK++ +  V+ A  IE NL 
Sbjct: 305 DRYFSSIEARKLESRLTEKEDAARKKLESTKREHEKRVNALKEKQEFHVRKARAIEINLP 364

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD 477
            V+ A+ AV   +A  M W ++AR+++ E+  GNPVA  I   L L  N +++LL+    
Sbjct: 365 RVEEAMNAVNGLVAQGMDWVEIARLIEMEQGKGNPVAQSIKLPLKLYENTITVLLNEEGT 424

Query: 478 EM--------------------------------DDEEKTLPVEK----------VEVDL 495
           E                                   ++ T P+++          +++DL
Sbjct: 425 EDDEEEEEEESEEEEEEEEEDDDGYGDDEYERPSQKKQLTKPLKEKKEMKDTRLSIDIDL 484

Query: 496 ALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR----LQILQEKTVANISHM 551
            +S  ANAR++Y+ KK    K+EKT+ A +KA K+ E+K +    + + QEK V  +   
Sbjct: 485 GISPWANARQYYDEKKIAAVKEEKTLKASTKAIKSTERKVKADLKMALKQEKPV--LRRT 542

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R   WFEKF +FISS+ YLVI GRD QQ+E++ +RYM KGD+YVH DL G    ++KN  
Sbjct: 543 RNPTWFEKFFFFISSDGYLVIGGRDHQQDEILFQRYMKKGDIYVHTDLDGGVPLIVKNKP 602

Query: 612 PEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
                P    T++QA  +TV  S+AWD+K     WWV+  QVSK   TG+ L  G FMI+
Sbjct: 603 DTPDDPIPPNTISQASAYTVASSKAWDTKAAMGGWWVHASQVSKMTSTGDILKAGHFMIK 662

Query: 670 GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESE 729
           G+KN +PP  +++GF +LF++         + R V+  E+ +    + G           
Sbjct: 663 GEKNHIPPGQIVLGFAVLFQI---------SNRSVQNHEKCLPSAPEDGV---------- 703

Query: 730 KDDTDEKPVAES--LSVPNSAHPAPSH-TNASNVDSHEFPAEDKTISNGIDSKIFDIARN 786
              T+++P++ +  +  P +    P         D H+   ED +  + ID ++      
Sbjct: 704 ---TNDEPISSTGDMDQPEANQSDPEEDVPLEQEDEHQEEPED-SKKDIIDERV------ 753

Query: 787 VAAPVTPQLEDL-IDRALGLGSASISSTKHGIETTQFDLSE-EDKHVERTATVRDKPYIS 844
             AP+  QL+ + ++ +L    A +       E  + + S+ E++ VE  +   + P  S
Sbjct: 754 --APLGEQLKSMHVEDSLDSNPAQVH------EADKEEASKGENQPVEGPSKNAEGPEDS 805

Query: 845 KAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMK 904
           +      +    S +  P   +E       +S P +I      +     RG++GK KK+ 
Sbjct: 806 E------QSDDESILATPSATQESR-----ASTPSAISSSGTQKSKPPVRGKRGKAKKLA 854

Query: 905 EKYGDQDEEERNIRMALLA 923
            KY DQDEE+R + + LL 
Sbjct: 855 TKYKDQDEEDRKLALRLLG 873


>gi|74025594|ref|XP_829363.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834749|gb|EAN80251.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1100

 Score =  322 bits (824), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 239/784 (30%), Positives = 377/784 (48%), Gaps = 146/784 (18%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L G+R +NVYD+ P+T++FK  NS         +K  LL
Sbjct: 1   MVKQRMTALDVRASVEEMRTELQGLRLTNVYDIPPRTFLFKFGNSE--------KKRTLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+GVRLH T   R+K   P+ FTL+LRKH+R  RL+ V QL +DR + F+FG+   A Y
Sbjct: 53  LENGVRLHLTQLVREKPKVPTQFTLRLRKHVRAWRLDSVTQLQHDRTVDFRFGVAEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+++GNI+LTD E+ ++ LLR+H+DD  GV +  R  YP  + + FE+    +  
Sbjct: 113 HIIVELFSKGNIVLTDHEYRIMLLLRAHKDD--GVNMFVRELYP--VTKSFEQQQEEECQ 168

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                ++  +A                       ++ G  F               A+  
Sbjct: 169 QLTEGAQRVEALR---------------------REWGAVFT------------RHAEYE 195

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           T ++ L     +GP+L++HI+  TG V ++K + +    D   + L+  +   E W    
Sbjct: 196 TTRSTLSATHHFGPSLADHILTVTG-VKSVKKANMTCSGDEMFEKLLPGM--LEAWR--- 249

Query: 299 ISGDIVPEGYILMQ---------NKHLGKDHPPTESGSSTQI------------------ 331
            +   +P G  L+           +  GK  P  ++G  T                    
Sbjct: 250 FAFSPLPTGGYLISKTAATKGRGTQERGKAPPHVDAGVGTTADGGEAGSGVEKQPRPHLQ 309

Query: 332 ---YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLN 386
              Y++F P+LL Q+R          +F +  D F+   E ++ EQ +         K  
Sbjct: 310 GVQYEDFSPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVLSKKE 369

Query: 387 KIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
           K   D   R+  L++  + + +  ELI  N E +D AI  +  ALA  + WE L R++K+
Sbjct: 370 KFERDHNRRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRRLLKQ 429

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM---DDEEKTLPV--------------E 489
               G+PVA ++ +L+L+RN +S+L+  N +++   +DEE  + V              E
Sbjct: 430 RHAEGHPVAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGGNSGE 489

Query: 490 K-------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR 536
           K             +EVDL+ +A+ANA  ++  KK   +K EKTI A +KA   AEKK  
Sbjct: 490 KKDRVEGCSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAEKKGE 549

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
               +++T   I+  R   W+EKFNWF +S   LV+ G D Q  E++V+R M  GDV+VH
Sbjct: 550 RLAAKKQTKKAIATERHRCWWEKFNWFRTSCGDLVLQGHDTQSTELLVRRIMRLGDVFVH 609

Query: 597 ADLHGA-------------SSTVIKNHRPEQP------------VPPLTLNQAGCFTVCH 631
           +D+ G              +ST       E+             +  ++L++A  + VC 
Sbjct: 610 SDVEGGLPCILRAAGSAWDASTAFGEGESEENSIQVGESTKGWLIHMISLDEAAAWCVCR 669

Query: 632 SQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           S AW+SK    AWWV+  Q+      G YL      + G+KN+L P PL++G GLLFR+ 
Sbjct: 670 SSAWESKFSVGAWWVHASQIVGGTAAGCYL------LSGEKNYLRPRPLMLGCGLLFRIS 723

Query: 692 ESSL 695
             ++
Sbjct: 724 SRAI 727


>gi|225563152|gb|EEH11431.1| DUF814 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 1158

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 286/949 (30%), Positives = 444/949 (46%), Gaps = 172/949 (18%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++++G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRII  +   G N 
Sbjct: 33  LIVDTGFRCHLTGYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIIDIELSDG-NF 91

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           H V+LE YA GNI+LTD E+ +L L   HR   +G           E  RV        L
Sbjct: 92  H-VLLEFYAAGNIILTDKEYKILAL---HRIVPEG--------SDQEEVRV-------GL 132

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
              LT+ +  +   P  +    + +  A       +  GK            N  A+ KQ
Sbjct: 133 QYVLTNKQNYNGVPPLSIERLRDALEKAKDLTGPAEAAGK------------NKRAKKKQ 180

Query: 238 P-TLKTVLGEALG---YGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFE 292
              L+  +  +LG   Y P L EH    TG   ++K  ++  LED  + + L++A+   E
Sbjct: 181 AEALRRAV--SLGFPEYPPLLLEHAFHITGFDTSLKPEQL--LEDPKLAEKLMVALVVAE 236

Query: 293 DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESG----SSTQIYDEFCPLLLNQFRSR-- 346
           +    + + +  P GYI+ + +    +    +S     SS   Y +F P    QF S   
Sbjct: 237 NVNSSLSTAEETP-GYIVSKTEGKAGEDASVDSTDPSKSSNVAYIDFHPFEPKQFESEPG 295

Query: 347 -EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
              ++F+TF+ A+DE++S +ESQ+ E +   +E+ A  KL     DQ+ RV  LK+  + 
Sbjct: 296 TSILRFDTFNKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAKTDQDKRVGVLKEAQEL 355

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
            ++ A+ IE NL  V+ A+ AV   +A  M W ++AR+++ E+   NPVA +I   L L 
Sbjct: 356 HIRKAQAIEANLLRVEEAVNAVNGLIAQGMDWGEIARLIEMEQSRQNPVAKVIKLPLKLY 415

Query: 465 RNCMSLLLSNNLDEMD-------------------------------DEEKTLPVEKVEV 493
            N ++LLL    +  +                                ++   P+  +++
Sbjct: 416 ENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSQKTRQPLLSIDI 475

Query: 494 DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANIS 549
           DL +S  ANAR++YE KK    K+EKT+ +  KA K+ EKK     +  + QEK V   +
Sbjct: 476 DLGISPWANARQYYEQKKAAAVKEEKTLNSTKKAIKSTEKKVAADLKQALKQEKPVLRPT 535

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
                                   GRD QQ E++ +R++ +GDV+VHAD+ GA   ++KN
Sbjct: 536 RT-------------------PFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPIIVKN 576

Query: 610 H--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
               P+ P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKT P GEYL  G F+
Sbjct: 577 KPGTPDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVTGGFV 636

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER------------RVRGEEE---GMD 712
           I G+KN LPP  L++GF ++F++   S+ +H   R             + G EE   G+ 
Sbjct: 637 ICGEKNQLPPAQLLLGFAVMFQISGESIKNHTKHRVPDEAPTSESAKDILGTEELPSGL- 695

Query: 713 DFEDSGHHKEN-SDIESEKDDTDEKPVAESLSVPNSAHPAP-----SHTNASNVDSHEFP 766
           D E   + K N +D + ++ D+ ++   E   + ++    P     + +N S  +S E P
Sbjct: 696 DLETPKNSKRNETDHQHQESDSTDQENGEIEQIADNKRTNPLLNDGAESNRSGSESEE-P 754

Query: 767 AEDKTISNGIDS---KIFDIAR--NVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQ 821
              +  S  +D+   K +D +R   V  P   Q+E+L                     ++
Sbjct: 755 NIGENGSQDVDARYDKGYDNSRFEAVEVPKLGQMENL---------------------SK 793

Query: 822 FDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESI 881
            + S E +    TA     P++   ERR LK G         +E+   R  D +S   + 
Sbjct: 794 EEASSEPQTDSITAQPAKHPFVR--ERRLLKNG--------FIEQVPARLTDPASHSATN 843

Query: 882 V--RKTKIEGGKIS-----RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
           V  R +    G  +     RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 844 VPSRSSTPSIGASTATPNIRGKRGKNKKIATKYQHQDEEDRELALRLLG 892


>gi|396473834|ref|XP_003839430.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312215999|emb|CBX95951.1| similar to DUF814 domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 1115

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 247/747 (33%), Positives = 363/747 (48%), Gaps = 120/747 (16%)

Query: 252 PALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEG 307
           P L +H +     D+ L P   L++ + LE      LV+ +       +++   + + +G
Sbjct: 213 PLLVDHALHNADFDSCLKPEQVLADESLLEK-----LVVVLKDARKIAEEITQPEQI-KG 266

Query: 308 YILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRS--REFVKFETFDAALDEFYSKI 365
           YIL +            SG +  +Y++F P    QF +   +F++F+ F+ A+DEF+S I
Sbjct: 267 YILAKPNPAVASTEDASSGKAKFLYEDFHPFKSQQFENLDYQFLEFDGFNKAVDEFFSSI 326

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
           E Q+ E +   +E  A  KL K   + E R+  L+Q  + + + AE I  N+  V  A  
Sbjct: 327 EGQKLESKLTEREQQAKKKLEKARKEHEERIGGLQQVQEMNFRKAEAILANVHRVTEATE 386

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDD--- 481
           AV   +   M W D++R+++ E+  GN VA  I   L L +N ++LLL+    + ++   
Sbjct: 387 AVNGLIRQGMDWVDISRLIEREQAQGNAVAQSIRLPLKLHQNTITLLLNETDWDHEEEEE 446

Query: 482 --------------------EEKTLPVE-------KVEVDLALSAHANARRWYELKKKQE 514
                               ++K  P +        +++DL LSA AN+  +Y+ KK   
Sbjct: 447 DEGNETSSVSEDSEEEEEGSKKKAAPTKVTQQPQLAIDIDLGLSAWANSTEYYDQKKTAA 506

Query: 515 SKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           SK+++T  A SKA K+ EKK     +  + QEK V  +  +RK  WFEK+ +FISS+ YL
Sbjct: 507 SKEDRTAAASSKALKSHEKKVTEDLKKGLKQEKEV--LRPVRKQQWFEKYIYFISSDGYL 564

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFT 628
           V+ G+DAQQNE+I KR++ KGDVYVHADL GA   +IKN    P+ P+PP TL+QAG  +
Sbjct: 565 VLGGKDAQQNEIIYKRFLRKGDVYVHADLKGAVPMIIKNKPDTPDAPIPPSTLSQAGHLS 624

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           VC S+AW+SK V SAWWV   QVSKT  TGE+L  G F I GKK FLPP  L++G  ++F
Sbjct: 625 VCTSEAWESKAVMSAWWVRSTQVSKTGQTGEFLPAGMFNITGKKEFLPPAQLVVGLAVMF 684

Query: 689 RLDESSLGSHLNER---------------------RVRGEEEGMDDFEDSGHHKENSDIE 727
            + ESS+ +H   R                     R   + E  D+F D+     + D E
Sbjct: 685 EISESSISNHQKHRIQATAVSAAEMTEDSTNAEEERNEADSEHDDEFPDAKLDSGSDDDE 744

Query: 728 --SEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISN----GIDSKIF 781
               K D  E   AES +     +P  SH     VD H+   ED T  N    G +S   
Sbjct: 745 FPDAKIDDAEDSDAESEAGALRTNPLQSH---KMVDKHDSETEDDTSPNNKPAGTES--H 799

Query: 782 DIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTATVRDKP 841
           DI     AP      D  D A  +G    +S +H                          
Sbjct: 800 DIRE---APAKESTVD--DGAESVGKTDPTSRRH-------------------------- 828

Query: 842 YISKAERRKLKKGQ---GSSV-VDPKVEREKERG-KDASSQPESIVRKTKIEGGKISRGQ 896
            +S  ERR L+KGQ   G+ +   P    E   G   A ++P + V     +   + RG+
Sbjct: 829 -LSARERRLLRKGQQLDGADIATGPGSADESVHGDPSAFTKPPATVTSQSSKASALPRGK 887

Query: 897 KGKLKKMKEKYGDQDEEERNIRMALLA 923
           +GK KK+  KY  QDEE+R + M LL 
Sbjct: 888 RGKAKKLATKYAAQDEEDRALAMRLLG 914



 Score =  106 bits (265), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 84/145 (57%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L  +L  +R +NVYDLS + ++ K              +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T YAR     PS F  KLRK+++TRR+  V Q+G DRI+ FQF  G+  + +
Sbjct: 53  DSGFRCHLTEYARTTAAAPSAFVAKLRKYLKTRRVTSVAQIGTDRILEFQFSDGL--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE YA GNI+LTD+   +L+LLR+
Sbjct: 111 YLEFYAGGNIVLTDANLHILSLLRN 135


>gi|89130574|gb|AAI14230.1| Zgc:153813 protein [Danio rerio]
          Length = 556

 Score =  320 bits (819), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 155/281 (55%), Positives = 202/281 (71%), Gaps = 11/281 (3%)

Query: 435 MSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN-------NLDEMDDEEKTLP 487
           + W ++ RMV E + AG+PVA  I +L L+ N ++LLL N          E+   +K+  
Sbjct: 99  VDWVEIGRMVTEAQAAGDPVACAIKELKLQSNHITLLLRNPEACPEGGAAELQSGKKSRS 158

Query: 488 VEK---VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT 544
            EK   V++D+ LSAHANA+R+Y+ K+    K++KT+ A  KAFK+AEKKT+  +   +T
Sbjct: 159 REKAVLVDIDINLSAHANAKRYYDSKRSAAKKEQKTVEAAQKAFKSAEKKTKQTLKDVQT 218

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
           V +I   RKV+WFEKF WF+SSENYL+I+GRD QQNEMIVKRY+  GD+YVHADLHGA+S
Sbjct: 219 VTSIQKARKVYWFEKFLWFLSSENYLIIAGRDQQQNEMIVKRYLRAGDLYVHADLHGATS 278

Query: 605 TVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            VIKN   E  VPP TL +A    VC+S AWD+K++TSAWWV   QVSKTAP+GEYLT G
Sbjct: 279 CVIKNPSGE-AVPPRTLTEAATMAVCYSAAWDAKVITSAWWVQHDQVSKTAPSGEYLTTG 337

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           SFMIRGKKNFLPP  LIMGFG LF++D+ S+  H  ER+++
Sbjct: 338 SFMIRGKKNFLPPSYLIMGFGFLFKVDDQSVFRHRGERKMK 378



 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 65/107 (60%), Gaps = 9/107 (8%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  +    +GMR +N+YD+  KTY+ +L             K +LL+
Sbjct: 1   MKGRFNTVDIRAAIAEINASCVGMRVNNIYDIDNKTYLIRLQKPEC--------KAVLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
           ESG+R+H T +   K   PSGF +K RKH+++RRL  VRQLG DRI+
Sbjct: 53  ESGIRIHCTEFDWPKNMMPSGFAMKCRKHLKSRRLVHVRQLGVDRIV 99


>gi|156083749|ref|XP_001609358.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154796609|gb|EDO05790.1| conserved hypothetical protein [Babesia bovis]
          Length = 1006

 Score =  317 bits (813), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 220/723 (30%), Positives = 366/723 (50%), Gaps = 82/723 (11%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MV+ R+N  DVAA V  LR +++     N+YD++ + Y+ K         S   +K  +L
Sbjct: 1   MVRERLNAVDVAAVVGNLRSQILDYNLVNIYDVTSRVYVLKF--------SRNEDKRFVL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            E G R+HTT + R     PS F +KLRKH+RTR+L  + Q+  DR++ F F  G  A++
Sbjct: 53  FEIGHRIHTTQFLRTTDKLPSNFNVKLRKHLRTRKLRGIYQIAQDRVVDFTFSSGEYAYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
           +I++L+  GN+ LTD  + VLT+LR     D    +   +  P          + + +  
Sbjct: 113 LIVQLFLPGNVYLTDYSYKVLTVLRPQNAGDSFFRVGETYGIPEASVPWNIPVSPAVIDG 172

Query: 180 ALTSSKEPDANEPDKVNEDGN-NVSNASKENLGGQKGGKSFDLSKNSNKNSND------- 231
            L+                GN + SN+ K+    +   ++ D SK S  N +D       
Sbjct: 173 ILSGMGH------------GNVDASNSQKKVTNSRGKPETGDSSKQSIVNGSDQGDYLDI 220

Query: 232 GARAKQPTLKTVLGEALGYGPALSEHII---LDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
           G+  K  ++  +L       P+++  ++   L   +  ++  S+V+ +E + I   V A+
Sbjct: 221 GSEFKDRSVSMLLKLIF---PSVTLRMMRYALVKAIGADICDSDVSAVESSTIYTAVEAL 277

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
               D L + ++ ++   GY+  +          TE       Y++F          R  
Sbjct: 278 RSTLDSLSNPVNLNL---GYLYKKG---------TE-------YEDFGCFDYGDGWER-- 316

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
             F+ F+ ALD +++K E ++ E++ + K+     KL KI  DQ  R    ++EV R   
Sbjct: 317 --FDDFNMALDAYFTKSELRKIERKEQPKKPI---KLQKIKDDQNRRELEREREVHRLGV 371

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
              L+E + +  D  +  +R  +A+  SW+++   +  +R +G+ +A  I  + +    +
Sbjct: 372 SIALVEGHRDTFDTVLDLMRSLVASGASWQEITDQLSRQRDSGHLLARHIRSVNIPDRRV 431

Query: 469 SLLLSN-------NLDEMDDEEKTLPVEK-----------VEVDLALSAHANARRWYELK 510
            + L N       N+  M D+      +K           V +D  L+   N    Y  K
Sbjct: 432 DVCLPNDDPGYYTNVTSMGDKRNKRGSKKSQSSDQFDDTSVTLDYGLTCFQNLEIMYSQK 491

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANIS--HMRKVHWFEKFNWFISSEN 568
           K+   K E+T   H  A K  +++   Q+ + +   N+S   +RK  WFEKF+WFI+S+ 
Sbjct: 492 KRMAEKLERTRAGHQFALKRVDREKEKQV-KSRGDRNVSLVKVRKRMWFEKFHWFITSDG 550

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LV+ GRD+ QNE++VKRY++KGD+Y HAD+HGA+S ++KN        P T+++A CF+
Sbjct: 551 FLVLGGRDSTQNELLVKRYLTKGDLYFHADVHGAASCILKNPSGNAESFPNTIDEAACFS 610

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +C S AW  KMV  AWWV+ HQVS++AP+GEYL  GSFMIRGKKN++ P  L M  G++F
Sbjct: 611 LCLSSAWSQKMVVPAWWVHHHQVSRSAPSGEYLPHGSFMIRGKKNYVQPQRLEMAIGVVF 670

Query: 689 RLD 691
            ++
Sbjct: 671 HIE 673


>gi|401416565|ref|XP_003872777.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322489002|emb|CBZ24251.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 1189

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 237/753 (31%), Positives = 374/753 (49%), Gaps = 117/753 (15%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+YD+  K ++FK  +       GE++K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRATLIGLRLLNIYDIGSKMFLFKFGH-------GENKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM--NA 117
            ESG RLH T  AR+K   PS FTLKLRKH+R  RL+ V QL +DR I   FG+      
Sbjct: 54  -ESGTRLHLTELAREKPKVPSQFTLKLRKHVRAWRLDSVAQLQHDRTIDLCFGVPSTEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            ++I+EL+++GN++LT+  +T++ LLR+HRDD+ G+ +M    YP          TA  +
Sbjct: 113 FHIIVELFSKGNVILTNYAYTIMMLLRTHRDDE-GLKLMVNQVYPV---------TAPFV 162

Query: 178 HAALTSSKE-PDANEPDKVNEDGN-NVSNASKENLG-GQKGGKS-------FDLSKNSNK 227
            A    S+E P    P  V+  G+ ++   +  +L   Q+  K         D     ++
Sbjct: 163 AAVAAESEESPMFLYPPHVDASGHLHLQRTADADLTLAQRQLKEERTRLMKVDWEVGLSR 222

Query: 228 NSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
            SND     +  ++T++     +GP L++H++  TG VPN       +  DN    L+  
Sbjct: 223 -SND-----RTVVQTLVAGIQHFGPDLAQHVLTVTG-VPNAPRKSWTQSTDNVFVTLLPG 275

Query: 288 VAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI---------------- 331
           +   E +  D+   D+   G  L++        P  + GS+                   
Sbjct: 276 L--LEAF--DLAKVDLTSAGGYLIK--------PKAKPGSTVHAPAPPAPGAPAGAADLV 323

Query: 332 -----YDEFCPLLLNQFRSR--EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHK 384
                Y+ F P+LL Q+ +   E +   +F    DEF+   E++R +  +  +++ A  K
Sbjct: 324 AVAEQYESFTPILLAQYTNDGVEALYRSSFGRVCDEFFLITETERIDASNAKRKNTAKSK 383

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
            +K   D   R++ L+ ++  +    E +  N + VD AI  +  ALA  +SW+ L  ++
Sbjct: 384 EDKFATDHARRINALEADIAANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLL 443

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANA 503
           K     G+PVA +I  L+LERN +S+LL   LDE   EE   +P   VEV L+ +AHANA
Sbjct: 444 KRRHAEGHPVAYMIHDLFLERNSISVLLEAVLDEEKGEEDCDVPPLVVEVALSKTAHANA 503

Query: 504 RRWYELKKKQESKQEKTITAHSKAF---------KAAEKKTRLQILQEKTVANISHMRKV 554
             ++  +K   SK E+T+ A +KA          KAA +K R  I++E         R+ 
Sbjct: 504 ADYFSKQKHHRSKLERTVAATAKAAAGAALKGARKAAAQKERKVIVKE---------RQR 554

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--- 611
            W+EKF WF ++   LV+ G+D Q  E++V+R M  GD+++H ++ GA   +++      
Sbjct: 555 QWWEKFLWFRTTAGDLVLRGKDVQSTELLVRRVMRLGDLFIHCEVDGALPCLLRPMNDVW 614

Query: 612 ----------------PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
                             QPV   ++ +AG + V  S AW+ K  T +WWVY  QV+   
Sbjct: 615 QELGGNNAGGDFTAAPATQPVALHSVCEAGAWCVAFSGAWERKQTTGSWWVYASQVTGGT 674

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            TG YL        G+++ LPP  + +G  LLF
Sbjct: 675 ATGAYLYA------GERHHLPPQSMSLGCALLF 701


>gi|430813962|emb|CCJ28739.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 631

 Score =  315 bits (807), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 220/655 (33%), Positives = 336/655 (51%), Gaps = 70/655 (10%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN 77
           L++LIG+R  N+YD+S +T+ FK         SG  E   LL+ESG R+H T Y R+   
Sbjct: 8   LQKLIGLRLQNIYDISERTFQFKF------ATSGHKEH--LLVESGSRIHLTCYVRETAA 59

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA-------HYVILELYAQGNI 130
            PS F  KLRKH++++RL  ++Q+  DR++   FG G          +Y+I E YA GNI
Sbjct: 60  LPSQFCAKLRKHLKSKRLVSLKQINSDRVVYLGFGCGSETVESFKPQYYLIFEFYAAGNI 119

Query: 131 LLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY---PTEICRVFERTTASKLHAALTSSKEP 187
           LLTDS+  +L+LLR  R             Y   PT   +  E+ T   L + + + K+ 
Sbjct: 120 LLTDSDMKILSLLRLVRPGGMHQQFSVGQLYQITPTPQNKQVEKMTEDVLRSLIKTLKDK 179

Query: 188 DANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEA 247
             +  ++      N+S + K+    +K  +                    P  K V  E 
Sbjct: 180 YLSPKEEPLPKQMNLSTSFKKTSKKEKKPREL------------------PLKKLVSWEL 221

Query: 248 LGYGPALSEHIILDTGLVPNMKLSEV-NKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPE 306
             YG AL EHII D  + P+MK+ E  + +E   +Q L+L+  + +D ++    G +   
Sbjct: 222 SNYGNALIEHIIRDANIDPDMKIDEFYHNIESINLQHLLLSFQRADDLIKKCEEGSVT-- 279

Query: 307 GYILMQNKHLGK----DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           GYI+ + +   +    D     +    +IY +F P +  Q+ +       TFD      Y
Sbjct: 280 GYIVEKIESKTRINLNDITLESTPDPVKIYVDFNPFIPKQYSNNPNYSVITFDDG----Y 335

Query: 363 SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDA 422
           +K  SQ+ + + K ++D A+ +L     + + ++  L++  +  +K A+ IE N E VD 
Sbjct: 336 NK--SQKFDMKLKNQKDIAYRRLQITKEEHQKKIDDLQKFQNICIKKAKAIEENQEIVDE 393

Query: 423 AILAVRVALANRMSWEDLARMVKEERKAGN-------PVAGLIDKLYLERNCMSLLLSNN 475
            I AV   +   M WED+A++VK E++  +       P   L D +Y   +   L    N
Sbjct: 394 TIKAVNTCVLRSMDWEDIAKLVKTEKEYESNTITIQLPCPHLDDNIYENDSTTGLFNGQN 453

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
                D+ +TL    +++ L+L+A  NAR +YE KK    K+EKTI A SKA K AE+K 
Sbjct: 454 -----DKTETL---NIDIKLSLNAWTNARDYYEKKKAASVKEEKTIAASSKALKNAERKI 505

Query: 536 ----RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
               +    QEK    +  MR + WFEKF WFISS+ YLV++G D  QN+++++ + SK 
Sbjct: 506 NSDLKRNTAQEK--KKLVPMRNLQWFEKFLWFISSDGYLVLAGHDLLQNKILIQNHFSKN 563

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           D+YVHADL  A+  +IKN      VPP TLNQAG F++  S AW SK+VTSAW +
Sbjct: 564 DIYVHADLKDAAVVIIKNMIDSSFVPPNTLNQAGAFSIAKSNAWTSKIVTSAWCI 618


>gi|154332902|ref|XP_001562713.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059716|emb|CAM41838.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 1198

 Score =  312 bits (799), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 231/754 (30%), Positives = 371/754 (49%), Gaps = 93/754 (12%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+Y++  K ++FK  +       GE +K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRANLIGLRLLNIYNMDSKMFLFKFGH-------GEHKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN--A 117
            ESGVR H T   R+K   PS FTLKLRKH+R  RL+ + QL +DR I   FG+  +   
Sbjct: 54  -ESGVRFHLTELEREKPKVPSQFTLKLRKHVRAWRLDSISQLQHDRTIDLCFGVSSSEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER------ 171
            ++I+EL+++GN++LTD  + ++ LLR+HRDD+ G  +M    YP  +   F        
Sbjct: 113 FHIIVELFSKGNVILTDYTYKMMMLLRTHRDDE-GHNLMVNQVYP--VTAPFVAAVAVES 169

Query: 172 ----------TTASKLHAALTSSKEPDAN-EPDKVNEDGN----NVSNASKENLGGQKGG 216
                     T +S    A+++++ P     P  V+  G+     +++A       Q   
Sbjct: 170 ASAQEADTATTVSSVTRTAVSAAEVPHIFLYPPHVDASGHLHVQRIADADLTLAQQQVKE 229

Query: 217 KSFDLSKNSNK----NSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSE 272
           +   L K   +     SND     +  ++T++     +GP L++H++  TG+    + S 
Sbjct: 230 ERTRLMKAEWEVGLTRSND-----RTVVQTLVAGIQHFGPDLAQHVLAITGVSNAPRKSW 284

Query: 273 VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK-----HLGKDHPPTESGS 327
               +D    +L   +  F     D+   D+   G  L+++K           PP    S
Sbjct: 285 KQSTDDIFATLLPGLLEAF-----DLAKVDLASAGGYLIKSKAGPGSRANAAEPPAPDAS 339

Query: 328 ST-----------QIYDEFCPLLLNQFRSREFVKF--ETFDAALDEFYSKIESQRAEQQH 374
           +            + Y+ F P+LL Q+     V F   +F    DEF+   E+ R +  +
Sbjct: 340 TAAAGVADLVAVAEKYESFTPILLAQYTEDGVVSFYRASFGRVCDEFFLITETARIDASN 399

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANR 434
           + +++ + +K +K   D   R++ L+ ++  +    + +  N + VD AI  +  ALA  
Sbjct: 400 EKRKNTSKNKEDKFAADHARRINALETDIAANQLKGQQLILNADRVDEAIQLINGALATG 459

Query: 435 MSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT-LPVEKVEV 493
           +SWE L  ++K     G+PVA +I  L+LERN +S+LL   LDE   EE   +P   VEV
Sbjct: 460 ISWEALRILLKRRHAEGHPVAYMIHDLFLERNSISVLLETVLDEEAGEEDCDVPPMVVEV 519

Query: 494 DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRK 553
            L+ +AHANA  ++  +K+  SK E+TI A  +A   A +K   +  ++K    I   R+
Sbjct: 520 ALSKTAHANAADYFGRQKQHRSKLERTIAATDRAAAGAARKGERKAAEQKERKVIVKERQ 579

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK----- 608
             W+EKF WF +S   LV+ G+D Q  E++V+R M  GD+++H D+ GA   +++     
Sbjct: 580 RSWWEKFFWFRTSAGDLVLRGKDVQSTELLVRRVMRLGDLFIHCDVDGALPCLLRPMNDV 639

Query: 609 -----NHRP---------EQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
                 H            QPV   +  +AG + V  S AW+ K  T +WWVY  QV+  
Sbjct: 640 WQELGGHNAGGNAVVSPRTQPVAMHSACEAGAWCVAFSGAWERKQTTGSWWVYASQVTGG 699

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
             TG YL        G+++ LPP  + +G  LLF
Sbjct: 700 TATGTYLYT------GERHHLPPQSMSLGCALLF 727


>gi|240275734|gb|EER39247.1| DUF814 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 1183

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 286/957 (29%), Positives = 430/957 (44%), Gaps = 188/957 (19%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++++G R H T Y+R     PS FT +LRK ++TRR+  V Q+G DRII  +   G N 
Sbjct: 58  LIVDTGFRCHLTRYSRTTAAAPSSFTSRLRKFLKTRRVTAVSQVGTDRIIDIELSDG-NF 116

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           H V+LE YA GNI+LTD E+ +L L   HR   +G           E  RV        L
Sbjct: 117 H-VLLEFYAAGNIILTDKEYKILAL---HRIVPEG--------SDQEEVRV-------GL 157

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLG-GQKGGKSFDLSKNSNKNSNDGARAK 236
              LT+ +  +   P  + E   +    SK+  G  +  GK            N  A+ K
Sbjct: 158 QYVLTNKQNYNGVPPLSI-ERLRDALEKSKDVTGPAEAAGK------------NKRAKKK 204

Query: 237 QP-TLKTVLGEALG---YGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
           Q   L+  +  +LG   Y P L EH       DT L P  +L E  KL +  +  LV+A 
Sbjct: 205 QAEALRRAV--SLGFPEYPPLLLEHAFHITGFDTSLKPE-QLVEDPKLAEKLMVALVVA- 260

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI----YDEFCPLLLNQFR 344
              E+    + + +  P GYI+ + +    +    +S   +++    Y +F P    QF 
Sbjct: 261 ---ENVNSSLSTAEETP-GYIVSKTEGKAGEDASVDSTDPSKLRNVAYIDFHPFEPKQFE 316

Query: 345 SR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
           S      ++F+TF  A+DE++S +ESQ+ E +   +E+ A  KL     DQ+ RV  LK+
Sbjct: 317 SEPGTSILRFDTFSKAVDEYFSSVESQKLESRLTEREEIAKRKLEAAQKDQDKRVGVLKE 376

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
             +  ++ A+ IE NL  V+ AI AV   +A  M W ++AR+++ E+   NPVA +I   
Sbjct: 377 AQELHIRKAQAIEANLLRVEEAINAVNGLIAQGMDWGEIARLIEMEQSRQNPVAKVIKLP 436

Query: 461 LYLERNCMSLLLSNNLDEMD-------------------------------DEEKTLPVE 489
           L L  N ++LLL    +  +                                ++   P+ 
Sbjct: 437 LKLYENAVTLLLGEPTENEEPMDESEEEAEVEEEEEQESSEDEDSGKKPGVSKKTRQPLL 496

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTV 545
            +++DL +S  ANAR++YE KK    K+EKT+ +   A K+ EKK     +  + QEK V
Sbjct: 497 SIDIDLGISPWANARQYYEQKKAAAVKEEKTLNSTKTAIKSTEKKVAADLKQALKQEKPV 556

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
              +                        GRD QQ E++ +R++ +GDV+VHAD+ GA   
Sbjct: 557 LRPTRT-------------------PFCGRDVQQTEILYRRHLKRGDVFVHADVQGAIPI 597

Query: 606 VIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTV 663
           ++KN    P+ P+PP TL+QAG   V  S AWDSK V  AWWV   QVSKT P GEYL  
Sbjct: 598 IVKNKPGTPDAPIPPGTLSQAGNLCVATSTAWDSKAVMGAWWVNADQVSKTTPLGEYLVT 657

Query: 664 GSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER------------RVRGEEE-- 709
           G F+I G+KN L P  L++GF ++F++   S+ +H   R               G EE  
Sbjct: 658 GGFVICGEKNHLSPAQLLLGFAVMFQISGESIKNHTKHRVPDETPISESAKDTLGTEELP 717

Query: 710 -GMD---------------DFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPS 753
            G+D                 E  G  +EN +IE   D+    P+    +  N +    S
Sbjct: 718 SGLDLETPKYSKINETDHQHQESDGTDQENGEIEQIADNKRTNPLLNDGAESNRSG---S 774

Query: 754 HTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISST 813
            +   N+  +     D     G D+  F+    V  P   Q+E+L               
Sbjct: 775 ESEEPNIGGNGSQDVDARYDKGYDNSRFEA---VEVPKLGQMENL--------------- 816

Query: 814 KHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKD 873
                  + + S E +    T      P++   ERR LK G         +E+   R  D
Sbjct: 817 ------PKEEASSEPQTDSITVQPAKHPFVR--ERRLLKNG--------IIEQVPARLTD 860

Query: 874 ASSQPESIV--RKTKIEGGKIS-----RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
            +S   + V  R +    G  +     RG++GK KK+  KY  QDEE+R + + LL 
Sbjct: 861 PASHSATNVPSRSSTPSIGASTATPNIRGKRGKNKKIATKYQHQDEEDRELALRLLG 917


>gi|402074990|gb|EJT70461.1| serologically defined colon cancer antigen 1 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 1086

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 196/522 (37%), Positives = 285/522 (54%), Gaps = 51/522 (9%)

Query: 252 PALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVLVLAVAKFEDWLQDVISGDIVPEGYIL 310
           P L +H   +    P  K +++  LED  +   L  A+ +    + D+ S D V +GYI+
Sbjct: 212 PILVDHAFKENNFDPKAKPADI--LEDEGVFDALFTALERARGIIDDITSSDTV-KGYIV 268

Query: 311 MQN---------KHLGKDHPPTESGSSTQIYDEFCPLLLNQFR---SREFVKFETFDAAL 358
            +N                P     S   +Y++F P L  QF    S   + FE F+  +
Sbjct: 269 ARNPDVADAGAAAEGAVVKPFAPELSKGLLYEDFSPFLPQQFAGDPSNVVLTFEGFNKTV 328

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEF+S +E Q+ E +   +E  A  KL+    + E R+  L++    +++ A  IE N+E
Sbjct: 329 DEFFSSLEGQKLESRLTEREAGAKRKLDAAKREHEKRIEGLQEYQLLNLRKAAAIEANVE 388

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD 477
            V  A+ AV   L   M W D+ ++V+ E+K  NPVA +I+  + L  N ++L+++   D
Sbjct: 389 RVQEAMDAVIGLLEQGMDWVDVGKLVEREQKRHNPVAEIIELPMDLANNTITLVIAEQDD 448

Query: 478 EMDDEEKTLPVE---------------------KVEVDLALSAHANARRWYELKKKQESK 516
             DD E     E                     +V++ L+L+   NA  +Y+ K+    K
Sbjct: 449 VDDDSEDGYETESSASDDDDDAAAVQTGKAKTLEVDIKLSLTPWGNAGEYYDQKRSAAVK 508

Query: 517 QEKTITAHSKAFKAAEKKTR--LQ--ILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
           QEKT+   S A K+A++K    LQ  + +EK V  ++  R+  WFEKF+WFISS+ YLV+
Sbjct: 509 QEKTVQQSSIALKSAQEKIAKDLQKGLKKEKPVMQLA--RRQMWFEKFHWFISSDGYLVL 566

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVC 630
            GRDAQQNE++ +RY+ +GDVYVHADLHGA S +IKN+   P+ PVPP TL+QAG   VC
Sbjct: 567 GGRDAQQNEILYRRYLKRGDVYVHADLHGAPSVIIKNNPRTPDAPVPPSTLSQAGQLAVC 626

Query: 631 HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
            S AW+SK    A+WV   QVSK+APTGE+L  GSFM+RGK+N LPP PLI+GFG++FR+
Sbjct: 627 ASSAWESKAGMGAYWVGADQVSKSAPTGEFLPTGSFMVRGKRNELPPAPLIVGFGVMFRI 686

Query: 691 DESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDD 732
            + S   H    RV    EG    E S   K +   E+  DD
Sbjct: 687 SDESKAKH-TRHRVYESAEG----EPSTAPKPSPGTEAAADD 723



 Score = 97.4 bits (241), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 54/148 (36%), Positives = 84/148 (56%), Gaps = 11/148 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R+++ DV A    L++ L+ +R SN+YDLS K ++ +              K  L++
Sbjct: 1   MKQRLSSLDVRAIAHELQQSLVTLRLSNIYDLSSKIFLLRFAKPD--------LKKQLII 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T ++R     PS F  ++RK +RTRR   V Q+G DRII  QF  G  +  +
Sbjct: 53  DSGFRCHLTDFSRPTAPAPSQFVARVRKFLRTRRCTAVSQVGTDRIIELQFSDG--SLRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRD 148
             E +A GNI+LTD+   +L LLR+ ++
Sbjct: 111 FFEFFASGNIILTDANLNILALLRNVKE 138


>gi|224014996|ref|XP_002297159.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
 gi|220968134|gb|EED86484.1| signal peptidase [Thalassiosira pseudonana CCMP1335]
          Length = 968

 Score =  309 bits (792), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 215/605 (35%), Positives = 319/605 (52%), Gaps = 53/605 (8%)

Query: 250 YGPALSEHIILDTGLVPNMKLSEVN---KLEDNAIQVLVLAVA-KFEDWLQDVISGDIVP 305
           YGP+L EH I   G+ P +KL+  N    L + +   LV ++  +    ++++ SG+   
Sbjct: 192 YGPSLIEHCITTAGVDPMVKLTHDNIEYTLPEASWNDLVSSLCGEGAKVIENLSSGE--S 249

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
            GYIL + K         +     +   EF P LL+Q +++  + + TF  A DEF+S +
Sbjct: 250 GGYILYKPKQ------TDDKNDYNKTLLEFQPHLLHQHKNQHALSYTTFATATDEFFSHL 303

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAIL 425
            SQR  Q+  A E AA  +L+KI +DQ+ RV  L  E ++S   A L+E + EDVD  + 
Sbjct: 304 SSQRIAQRADAAEAAARERLSKIQLDQQRRVDGLVAEQEKSRDCARLVEMHAEDVDRVLG 363

Query: 426 AVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL-LLSNNLDEMDDEEK 484
            +  AL + M+W+ L ++V  E+   NP+A LI KL L ++ + L L   +  +  D ++
Sbjct: 364 VINSALESGMNWDALEQLVLVEQGNENPIALLIFKLELCKDQVVLALPDIDDWDDSDPDR 423

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH-------SKAFKAAEKKTRL 537
              +  V V +  SAH NAR  +   K+ ++ +  T            +  +A +KK R+
Sbjct: 424 PPKLHYVTVSIKESAHGNARNMFATIKQSKTLEASTTALKAAEAKAKQQLAEAQKKKQRI 483

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
           Q++           RK +WFEKF WFI+S+NYLV++G+DAQQNE +VK+Y+  GD Y+HA
Sbjct: 484 QVMPN---------RKTYWFEKFAWFITSDNYLVVAGQDAQQNEQLVKKYLRPGDAYLHA 534

Query: 598 DLHGASSTVIKNHRPEQ--------PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
           ++HGA++ +++  R  +        P+    L +AG FT C S AW SKMV SA+WV  H
Sbjct: 535 EVHGAATCILRAKRRRRSDGKTQVIPLSDQALREAGTFTTCRSSAWSSKMVCSAYWVESH 594

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNERRVRGEE 708
           QVSKTAPTGEYLTVGSFMIRG+KNFLPP  L MG G+LFRL D++S+  H NERR     
Sbjct: 595 QVSKTAPTGEYLTVGSFMIRGRKNFLPPSSLEMGMGVLFRLGDDASVARHANERRDFALM 654

Query: 709 EGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAE 768
           E  + F      +E + +  E +D  E    +S    +       HTNA +         
Sbjct: 655 EHEEIFARQDALREKNKVSVEVEDESEPIPLDSYEKEHDDVCPTGHTNAID--------- 705

Query: 769 DKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEED 828
                N  D  I D   NV   VTP  E+  ++      +S      G E    D  ++ 
Sbjct: 706 ----GNAGDEAIEDTENNV--EVTPDAEESTEQPNSDNESSDGKQSDGDEVPTADTKKKQ 759

Query: 829 KHVER 833
           K + R
Sbjct: 760 KELSR 764



 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/133 (39%), Positives = 87/133 (65%), Gaps = 12/133 (9%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAY----ARD 74
           R ++G + +NVYD S       +M ++   ++ ++++ +LL+ESGVR H T +    +  
Sbjct: 7   RSMLGFKLANVYDGSA----LGIMPAA---DAEQAKRAMLLIESGVRFHPTTHYSQSSSS 59

Query: 75  KKNTPSGFTLKLRKHIRTRRLEDVRQLG-YDRIILFQFGLGMNAHYVILELYAQGNILLT 133
             + PS F +KLRKH+R  RLE+V QLG  DR++ F+FG G   H+++LELY+ GN++L 
Sbjct: 60  SSSMPSAFAMKLRKHLRNLRLENVTQLGNLDRVVDFRFGSGSLTHHLLLELYSLGNLILC 119

Query: 134 DSEFTVLTLLRSH 146
           D ++ +L LLR+H
Sbjct: 120 DGQYRILGLLRTH 132


>gi|355718192|gb|AES06188.1| serologically defined colon cancer antigen 1 [Mustela putorius
           furo]
          Length = 547

 Score =  309 bits (792), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 139/222 (62%), Positives = 177/222 (79%), Gaps = 1/222 (0%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V+VDL+LSA+ANA+++Y+ K+    K +KT+ A  KAFK+AEKKT+  + + +TV +I  
Sbjct: 43  VDVDLSLSAYANAKKYYDHKRYAAKKTQKTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQK 102

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            RKV+WFEKF WFISSENYL+I GRD QQNEMIVKRY++ GD+YVHADLHGA+S VIKN 
Sbjct: 103 ARKVYWFEKFLWFISSENYLIIGGRDQQQNEMIVKRYLTTGDIYVHADLHGATSCVIKNP 162

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRG
Sbjct: 163 TGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 221

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 222 KKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 263


>gi|302917991|ref|XP_003052561.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
           77-13-4]
 gi|256733501|gb|EEU46848.1| hypothetical protein NECHADRAFT_77690 [Nectria haematococca mpVI
           77-13-4]
          Length = 1072

 Score =  309 bits (791), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 185/457 (40%), Positives = 261/457 (57%), Gaps = 52/457 (11%)

Query: 331 IYDEFCPLL---LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
           +Y++F P +   L++  + E ++F+ ++  +DEF+S +E Q+ E +   +E AA  KL+ 
Sbjct: 290 LYEDFHPFVPQKLSKDPTIEVLEFKGYNETVDEFFSSLEGQKLESRLTEREAAAKRKLDA 349

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              +Q  R+  L++  + + + A  IE N+E V  A+ AV   L+  M W D+ ++V+ E
Sbjct: 350 AKQEQAKRIEGLQEAQNLNFRKAAAIEANVERVQEAMDAVNGLLSQGMDWVDVGKLVERE 409

Query: 448 RKAGNPVAGLID-KLYLERNCMSLL-------------LSNNLDEMDDEEKTLPVE---- 489
           +K  NPVA +I   L L  N ++L                 + DE  DEE + P +    
Sbjct: 410 KKRHNPVAEIIKLPLNLAENLITLELAEEEFEPEEDDPYETDDDESADEEDSTPTKGKHA 469

Query: 490 ----KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQ 541
                VE++L LS  +NAR +++ +K    K+EKT    S+A K AE+K     +  + Q
Sbjct: 470 SKALSVEINLGLSPWSNAREYFDQRKSAAVKKEKTEQQASRALKNAEQKITQDLKKGLKQ 529

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK +  +  +RK  WFEKF WFISS+ YLVI G+DAQQNEMI KRY+ KGD+Y HADLHG
Sbjct: 530 EKAL--LQPIRKQLWFEKFIWFISSDGYLVIGGKDAQQNEMIYKRYLRKGDIYCHADLHG 587

Query: 602 ASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
           ASS +IKN+   P+ P+PP TL+QAG   VC S AWDSK   SAWWV   QVSK+APTGE
Sbjct: 588 ASSVIIKNNPKTPDAPIPPATLSQAGSIAVCSSDAWDSKAGMSAWWVNADQVSKSAPTGE 647

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR-------------- 705
           +L  GSFM+RGKKNFLPP  L++G GL+FR+ E S   H+  R                 
Sbjct: 648 FLPTGSFMVRGKKNFLPPAQLLLGLGLVFRISEESKAKHVKHRLYDVDSAIGDSVSGITT 707

Query: 706 -----GEEEGMDDFEDSGHHKENSDIESEKDDTDEKP 737
                G+     +  ++ H    SD ESE D  DEKP
Sbjct: 708 PQVEVGQGSAEAEQSEAAHSDHVSDDESEDDQPDEKP 744



 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/145 (38%), Positives = 83/145 (57%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKVIAHELQQRLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  V Q+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTEFARTTAAAPSAFVARLRKFLKTRRLTSVSQVGTDRVLEFEFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE +A GNI+LTD++  +LTL R+
Sbjct: 111 FLEFFASGNIILTDADLKILTLART 135


>gi|302419577|ref|XP_003007619.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261353270|gb|EEY15698.1| DUF814 domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 1107

 Score =  308 bits (789), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 232/741 (31%), Positives = 339/741 (45%), Gaps = 158/741 (21%)

Query: 1    MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
            ++K R ++ DV      L   L+ +R +NVYDLS K  + K              K  +L
Sbjct: 418  IMKQRFSSLDVKVIAHELHESLVTLRLANVYDLSSKILLLKFAKPD--------NKKQIL 469

Query: 60   MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            ++SG R H T +AR     PS F  +LRK ++TRRL  V Q+G DRII F F  G   + 
Sbjct: 470  IDSGFRCHLTDFARTTAAAPSAFVARLRKFLKTRRLTAVSQVGTDRIIEFTFSDGQ--YR 527

Query: 120  VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHA 179
            + LE +A GN++LTD+E  +LTLLR+                                  
Sbjct: 528  LFLEFFASGNVILTDAELRILTLLRN---------------------------------- 553

Query: 180  ALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQK--------------GGKSFDLSKNS 225
                  E +  EP +V   G + S  +++N GG                  K+ +     
Sbjct: 554  ----VPEGEGQEPQRV---GLSYSLDNRQNFGGVPPLTRERLQNALRVMAAKAANAPTTG 606

Query: 226  NKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEV---NKLEDNAIQ 282
             K    G + ++  L T + E     P L +H    TG  P    +E+   + L D+ + 
Sbjct: 607  KKKIKPGDQLRK-GLATTITE---LPPMLVDHAFQVTGFDPTKTPAELLDSDALLDSLLH 662

Query: 283  VLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKD-HPPTESGSSTQ----IYDEFCP 337
             L +A    ED      +      GY++ + +   ++     + G+ T+    +YD+F P
Sbjct: 663  ALTVARKVVED-----ATSSATTTGYVIAKYRQKSEETEEKPDDGAETKREDLLYDDFHP 717

Query: 338  LLLNQFRSREFVKFETFDA---ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
             L  +F     VK  TFD     +DEF+         +     E                
Sbjct: 718  FLPQKFADDPSVKVLTFDGFNKTVDEFFFLARGPETREAQSLNE---------------- 761

Query: 395  RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
                         + A  IE N+E V  A+ AV   +   M W ++ ++++ E+K  NP 
Sbjct: 762  -------------QKAAAIEANVERVQEAMDAVNGLVQQGMDWVNIGKLIEREQKRHNP- 807

Query: 455  AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE--------------------KVEVD 494
                       N M+LLL     E +DE      +                    ++E++
Sbjct: 808  -----------NLMTLLLGTEAVEDEDEAYETGSDASDSEDDEDGAKAKGADRRLQIEIN 856

Query: 495  LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISH 550
            L LS  ANAR +Y+ ++    K+ KT+   + A K AEKK     +  + QEK V  +  
Sbjct: 857  LGLSPWANAREYYDQRRTAAVKELKTVQHSTMALKNAEKKITEDLKKGLKQEKAV--LQP 914

Query: 551  MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            +RK  WFEKF WF+SS+ YLV+ G+DAQQNE + KRY+ KGDVY HAD+HGA++ ++KN 
Sbjct: 915  IRKQMWFEKFIWFLSSDGYLVLGGKDAQQNETLYKRYLRKGDVYCHADMHGAATVIVKNK 974

Query: 611  R--PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
            +  P+ P+PP TL QAG  +VC S AWDSK    AWWV   QVSK+APTGEYL   +FM 
Sbjct: 975  QDTPDAPIPPSTLAQAGMLSVCSSSAWDSKAGMGAWWVRADQVSKSAPTGEYLPAAAFMG 1034

Query: 669  RGK-KNFLPP-HPLIMG-FGL 686
             G  +NFLPP  PL  G FG+
Sbjct: 1035 AGPGRNFLPPGRPLGAGAFGI 1055


>gi|148704666|gb|EDL36613.1| mCG3169, isoform CRA_b [Mus musculus]
          Length = 658

 Score =  305 bits (780), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 137/222 (61%), Positives = 177/222 (79%), Gaps = 1/222 (0%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V+VDL+LSA+ANA+++Y+ K+    K ++T+ A  KAFK+AEKKT+  + + +TV +I  
Sbjct: 53  VDVDLSLSAYANAKKYYDHKRYAAKKTQRTVEAAEKAFKSAEKKTKQTLKEVQTVTSIQK 112

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            RKV+WFEKF WFISSENYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN 
Sbjct: 113 ARKVYWFEKFLWFISSENYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNP 172

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
             E P+PP TL +AG   +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRG
Sbjct: 173 TGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRG 231

Query: 671 KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 232 KKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 273


>gi|452000540|gb|EMD93001.1| hypothetical protein COCHEDRAFT_1172752 [Cochliobolus
           heterostrophus C5]
          Length = 1128

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 250/714 (35%), Positives = 364/714 (50%), Gaps = 97/714 (13%)

Query: 288 VAKFEDWLQDV--ISGDIVP----EGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLLL 340
           V K  D LQD   I+ +I      +GYIL +  +     P  ES    + +YD+F P   
Sbjct: 241 VEKLVDVLQDARKITDEITKTDRIKGYILAK-PNPSASKPDDESSDKPRFLYDDFHPFRP 299

Query: 341 NQFRSRE--FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHT 398
            QF + +  F++F+ F+ A+DEF+S IE Q+ E +   +E  A  KL K   + E+R+  
Sbjct: 300 QQFENTDYTFLEFDGFNKAVDEFFSSIEGQKLESKLTEREQQAKKKLEKARKEHEDRIGG 359

Query: 399 LKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI 458
           L+Q  + + + AE I  N+  V  A  AV   +   M W D+ R+++ E+ +GN VA LI
Sbjct: 360 LQQVQELNFRKAEAILANVHRVTEATEAVNGLIRQGMDWVDIERLIEREQNSGNAVAQLI 419

Query: 459 D-KLYLERNCMSLLL---------------------SNNLDEMDDE-EKTLPVEKV---- 491
              L L  N ++LLL                     S + D+ DD   KT P + V    
Sbjct: 420 RLPLKLHENTITLLLNETNWEEGGEEEDEGNETSSVSEDTDDEDDRPRKTSPPKPVARPQ 479

Query: 492 ---EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKT 544
              ++DL LSA AN+  +++ KK    K+ +T+ A +KA K+ EKK     +  + QEK 
Sbjct: 480 LAIDIDLGLSAWANSTEYFDQKKTAADKEGRTLQASTKALKSHEKKVAEDLKKGLKQEKE 539

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
           V  +  +RK HWFEKF +FISS+ YLV+ G+DAQQNE+I +R++ KGDVYVHADL GA  
Sbjct: 540 V--LRPVRKQHWFEKFIYFISSDGYLVLGGKDAQQNEIIYRRFLRKGDVYVHADLKGAMP 597

Query: 605 TVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
            +IKN    P+ P+PP TL+QAG  ++C S AWDSK V SAWWV   QVSKT  TGE+L 
Sbjct: 598 MIIKNKPDTPDAPIPPSTLSQAGNLSICTSDAWDSKAVMSAWWVRSDQVSKTGQTGEFLP 657

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH----LNERRVRGEEEGMDDFEDSG 718
            G F I+GKK FLPP  L++G  ++F + +SS  +H    + E  V   E  M D +   
Sbjct: 658 AGMFNIKGKKEFLPPAQLVVGLAVMFEISDSSKANHHKHRVQETAVSAAE--MTD-QPGN 714

Query: 719 HHKENSDIESEKDDTDEKPVAESLSVPNSAHPAP--SHTNASNVDSHEFPAEDKTISNGI 776
             KE +  ++++ + DE P A+  S      P     HT  S+ +S          SN +
Sbjct: 715 ESKEAAATKTDESNDDEFPDAKFDSDSEDDFPDAKMEHTEESDAESEAAAPR----SNPL 770

Query: 777 DSKIFDIARNVAAPVTPQLEDLIDRALGLGSA--SISSTKHGI----ETTQFDLSEEDKH 830
            S      RN A   + + ++L+   +G G A  +    K+G+    E  + + S  D  
Sbjct: 771 QSST----RN-AKEDSGEEDELV---VGKGDAEHAKPGEKNGVVAKKEPPEDEGSIADTE 822

Query: 831 VERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPE-------SIVR 883
               +T R K  +S  ERR  +KGQ   +  P+V  +     D + Q E       S   
Sbjct: 823 PISKSTGRGK--LSARERRLARKGQLPEL--PQVPSDTVPAVDGADQDEGDSAEGGSAKA 878

Query: 884 KTKIEG-----------GKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAVST 926
            TK++G             + RG++ K KK   KY  QDEE+R + M LL   T
Sbjct: 879 PTKVDGTVTSQMNKQKNAPLPRGKRAKAKKQAAKYAAQDEEDRELAMRLLGSKT 932



 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 59/145 (40%), Positives = 86/145 (59%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L  +L  +R +NVYDLS + ++ K              +  LL+
Sbjct: 1   MKQRFSSLDVKVIAHELSAKLTSLRVTNVYDLSSRIFLIKFHKPD--------HREQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T YAR     PSGF  KLRK+++TRR+  + Q+G DRI+ FQF  G+  + +
Sbjct: 53  DSGFRCHLTEYARTTAAAPSGFVAKLRKYLKTRRVTSISQIGTDRILEFQFSDGL--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE YA GNI+LTD++  VL+LLR+
Sbjct: 111 YLEFYAGGNIILTDADLNVLSLLRN 135


>gi|340516439|gb|EGR46688.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1078

 Score =  302 bits (774), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 189/550 (34%), Positives = 292/550 (53%), Gaps = 76/550 (13%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCP 337
           +  LV  +++  D ++++I+     +GYI  + +      P     +      +Y++F P
Sbjct: 243 LDALVNHLSEARDVVENIIASSTC-KGYIFAKRRTTPSSAPDDAEQAQKHEGLLYEDFHP 301

Query: 338 LLLNQFR---SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQEN 394
            +  +F+   S + ++F+ ++  +DEF+S +E Q+ E +   +E+AA  KL     +Q  
Sbjct: 302 FVPQKFKNDPSIQVLEFDGYNRTVDEFFSSLEGQKLESRLTGREEAARKKLEAARQEQAK 361

Query: 395 RVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
           R+  L+     + + A  IE N+E V  A+ AV   LA  M W D+ ++++ E+K  NPV
Sbjct: 362 RIQGLQDAQAMNYRKAAAIEANVERVQEAMDAVNGLLAQGMDWVDIGKLIEREKKRQNPV 421

Query: 455 AGLID-KLYLERNCMSLLLSN----------------NLDEMDDEE-----------KTL 486
           A +I   L L  N ++LLL+                   D+ D EE           KT 
Sbjct: 422 AEIISLPLKLADNTITLLLAEEAFDEDEAEEEEDNPFETDDSDSEEDQGGKATSKDKKTD 481

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQE 542
            +  V++ L +S  +NAR +YE ++    KQEKT    +KA K+ E+K     +  + QE
Sbjct: 482 KLLTVDIVLNMSPWSNAREYYEERRSAAMKQEKTQQQATKALKSTEQKIAEDLKKGLKQE 541

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K +  +  +RK  WFEKF WFISS+ YLV+ G+D QQ+E++ +RY+ KGDVY HAD+ GA
Sbjct: 542 KAL--LQPIRKQMWFEKFLWFISSDGYLVLGGKDPQQSEILYRRYLRKGDVYCHADIRGA 599

Query: 603 SSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           ++ VIKN+   P+ P+PP TL+QAG  +VC S+AWDSK    AWWV   QVSKT P+G+ 
Sbjct: 600 ANIVIKNNPNMPDAPIPPATLSQAGSLSVCTSEAWDSKAGMGAWWVNADQVSKTTPSGDI 659

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNER-----------RVRGEE- 708
           L  G+F I+GKKN+LPP  L++G G  F++ E S G+HL  R              G+E 
Sbjct: 660 LPAGTFTIQGKKNYLPPTQLLLGLGFAFKISEQSKGNHLKHRVHDGRSSTATEAATGDEG 719

Query: 709 -----EGMDDFEDS---------GHHKENSDIESE------KDDTDEKPVAESLS-VPNS 747
                EG+DD EDS         GH +  + ++S        DD  +K  A  +S  P +
Sbjct: 720 EAQNTEGIDDQEDSDSEPEDNQPGHEERANPLQSSGIGEETADDAADKLSAVKISDQPGN 779

Query: 748 AHPAPSHTNA 757
             P P   +A
Sbjct: 780 DEPTPPSEDA 789



 Score = 39.7 bits (91), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAVS 925
           RGQKGK KK+ +KY DQDEE+R    AL+  +
Sbjct: 848 RGQKGKAKKIAQKYKDQDEEDRATAEALIGAT 879


>gi|326434920|gb|EGD80490.1| hypothetical protein PTSG_13144 [Salpingoeca sp. ATCC 50818]
          Length = 947

 Score =  298 bits (764), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 174/478 (36%), Positives = 272/478 (56%), Gaps = 16/478 (3%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           L+  L   +  GPA  EH +L+ G  PN +++E   +  +  +VL  A+ + E  L   +
Sbjct: 17  LRKHLTRIMDCGPAFIEHCLLEAGFPPNARVNEGCNVATDLPRVLA-ALQQAEHLLFTKL 75

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
               V +GYIL++  H   D     +     ++++  P  L QF  R F +F++FD A+D
Sbjct: 76  EQGQV-KGYILLK-AHAKADARKDAAKEEVVVFEDVMPFPLKQFEGRTFKEFDSFDVAVD 133

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            ++S+IES + E +   +E AA  KL +      +RV   K+        A LIE N E 
Sbjct: 134 TYFSEIESHKLEMRALQQERAARQKLEQARRSHHDRVKGYKEARLEDEYKATLIELNHEL 193

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           V+ AI  +   + N + W ++  +V+E R  G+PVA  I KL L++N + + L+      
Sbjct: 194 VNEAIDVINKMVGNHLDWREIEELVQESRVRGDPVANAISKLKLKKNAIVMHLTEPSMGG 253

Query: 475 -------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  + DE +DE+       VE+DLA +AH NAR+ +E KK   SK+EK + +  +A
Sbjct: 254 ADDDSWSDEDEDEDEDDNTKGALVEIDLAETAHGNARKLHERKKTIRSKEEKALASTEQA 313

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
            ++ EK+   ++ + +  A IS  R   WFEKFNWFISSENYLV++GRD  QNE +V+++
Sbjct: 314 LRSVEKRAMDRLKKTQITATISKSRAPLWFEKFNWFISSENYLVLAGRDRLQNEALVRKH 373

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +++ D+YVHAD++GASS V+KN    + +PP TL++A  F V HS AW++     AWWV+
Sbjct: 374 LTQHDLYVHADMNGASSVVVKNSNTGE-IPPKTLSEAATFAVAHSPAWENNQQADAWWVH 432

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
            +QV KT+  G+ L  GSF I G K+F+P   L + + +LF++D+ S   H  ERR +
Sbjct: 433 ANQVEKTSSEGKPLGAGSFRITGAKHFIPIRQLALAYAILFKVDDESAKRHEGERRCK 490


>gi|260803888|ref|XP_002596821.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
 gi|229282081|gb|EEN52833.1| hypothetical protein BRAFLDRAFT_130588 [Branchiostoma floridae]
          Length = 834

 Score =  292 bits (748), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 169/428 (39%), Positives = 248/428 (57%), Gaps = 45/428 (10%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            LK +L   L YGPA+ +H +L+ G     K+     +  +  Q L+ A+ + E +L+  
Sbjct: 24  ALKRILNSKLVYGPAVLDHCLLNAGFPEGAKVGRDFDVSQDLPQ-LMAALVEAEKFLE-- 80

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI--YDEFCPLLLNQFRSREFVKFETFDA 356
            SG    +GYI+ + +      P  + G + ++  Y EF P    Q      V+F +F+ 
Sbjct: 81  ASGSQPCQGYIVQKREK----KPKQDGGPAEELLTYAEFHPFQFKQHEKSPCVEFPSFNK 136

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEF+S++ESQR + +   +E  A  KL  +  D E R+ TL++  D     A+LIE N
Sbjct: 137 AVDEFFSQLESQRLDLKALQQEKVAIKKLENVKKDHERRLETLQKVQDEDKHKAQLIELN 196

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNL 476
           L+ VD AIL VR A+AN++ W ++  +VKE +  G+PVA  I  L L+ N ++L+L N  
Sbjct: 197 LDLVDKAILVVRSAIANQIDWTEIWDIVKEAQAQGDPVASTIKSLKLDSNHITLVLRNPF 256

Query: 477 ------DEMDDEEKTLPVE-------KVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
                  E DD++  +  E       K+++DLALSA+ANA+++Y+ K+    K++KTI A
Sbjct: 257 SGYESDSEGDDDKAGVGREASSDRPMKIDIDLALSAYANAKKYYDQKRHAAKKEQKTIDA 316

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             K                       H   V  FEKF WFI+SENYLVI+GRD+QQNE+I
Sbjct: 317 SEKC----------------------HEFVVERFEKFLWFITSENYLVIAGRDSQQNELI 354

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
           VKR++  GD+YVHADLHGA+S VI+NH     VPP +LN+AG F +CHS AWD+K+VTSA
Sbjct: 355 VKRHLKPGDLYVHADLHGATSCVIQNHS-SNSVPPKSLNEAGTFAICHSAAWDAKVVTSA 413

Query: 644 WWVYPHQV 651
           W+V+  Q 
Sbjct: 414 WYVHHDQT 421


>gi|221059774|ref|XP_002260532.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|193810606|emb|CAQ42504.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 2040

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 175/454 (38%), Positives = 262/454 (57%), Gaps = 40/454 (8%)

Query: 317 GKDHPPTESGSSTQI-YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQ 368
           GK     E  S  +I + EF P++LN  +++      E + F+ F+  +D ++S++E S+
Sbjct: 439 GKGVVKEEEKSGEEITFTEFSPIILNNHKNKVEENKLEIIHFDDFNKCVDSYFSRMELSK 498

Query: 369 RAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVR 428
             +QQ   K   +  K++KI +D E R+  L++EV    K   LI+ N E V+ AI  +R
Sbjct: 499 YDKQQEVIKIKKSLTKMDKIKLDHERRIEQLEKEVSSLRKKISLIQMNDELVEQAIQLMR 558

Query: 429 VALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN---NLDEMDDEEKT 485
            A+A   +WE +   +K  +K  +P+A  I  +      M LLL +   N +  DD  + 
Sbjct: 559 AAVATNANWEKIWEHIKLFKKQNHPIALRISSVNFNNCEMELLLDDGEENEEGSDDSSRE 618

Query: 486 LPVEK------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
              E             V ++L  S + N   + +++KK E K  KT  + + A K  EK
Sbjct: 619 ADEESPKRATGRESKLAVTINLNNSVYGNVEDYQKMRKKAEEKIRKTKISTNFAVKKVEK 678

Query: 534 KTRLQIL-----QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
           K + +         KTV  I  +RKV+WFEKF+WFISSENYLVI+GRDA QNE++ +RY 
Sbjct: 679 KKKEKENKQKGKHNKTVGQIQKLRKVYWFEKFHWFISSENYLVIAGRDALQNEILFRRYF 738

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
            K DVYVHAD+HGAS+ +IKN   + P+P  TL++AG   +C S AW++K++TSAWWV+ 
Sbjct: 739 QKNDVYVHADIHGASTCIIKNPYKDIPIPEKTLSEAGQLAICRSSAWNNKIITSAWWVHY 798

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
           HQVSK+APTGEYL  GSF+IRGKKN+LP   L MG  ++F++D +++ +         EE
Sbjct: 799 HQVSKSAPTGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAAVEND--------EE 850

Query: 709 EGMDDFEDSGHHKENSDIESEKDDTDEKPVAESL 742
             +DD + S    EN D E +  D D++ V ++L
Sbjct: 851 NNLDDTQKSF---ENDD-EKKNSDGDQEVVEDAL 880



 Score =  115 bits (289), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 91/147 (61%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVK-CLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   ++G   +N+Y++S K Y+ K         S + +K   L
Sbjct: 1   MAKQRLTALDIRAIITLCKNIIVGCVVTNIYNISNKIYVLKC--------SKKEQKYFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ ++ QLG DR++  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSAFTMKLRKHLRSRKITNISQLGGDRVVDIQFGFDDKACH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD+   +L++L+S+
Sbjct: 113 LIVELYIAGNIILTDNNHKILSILKSN 139


>gi|320581674|gb|EFW95893.1| hypothetical protein HPODL_2176 [Ogataea parapolymorpha DL-1]
          Length = 940

 Score =  289 bits (740), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 222/745 (29%), Positives = 386/745 (51%), Gaps = 98/745 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+   VK +   I G R  NVY+L  +P++++ K         S    K  L
Sbjct: 1   MKQRVSAFDIRVLVKEIEHAIKGHRLQNVYNLVANPRSFLLKF--------SVPDSKANL 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++ESG +++ T + R     PS F +KLRKH+++RRL +++Q+G DR+++ +FG GM  +
Sbjct: 53  VIESGFKVYLTEFQRPTAPEPSNFVVKLRKHLKSRRLSNIKQVGNDRVVVLEFGDGM--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR---SHRDDDKGVAIMSRHRYPTEICR-VFERTTA 174
           Y++LE ++ GNI+L DS+  +L+L R    H ++D         RY   +   +F+R+  
Sbjct: 111 YLVLEFFSAGNIILLDSDRKILSLFRLVEEHENND---------RYAVGVTYGMFDRSLF 161

Query: 175 SKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGAR 234
            + H  L         EP                        + +D +K++++N+     
Sbjct: 162 EE-HGQL---------EPRHYT------------------SAEIYDWAKSASENT----- 188

Query: 235 AKQPTL-KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
           +K P++ K V   A      L +  +   G+ P    S V  ++D  +   V A     +
Sbjct: 189 SKVPSIAKLVFLNAAYLSSDLIQIQLSKNGIDPAS--SGVKIVQDEELLAKVTAAVNSCE 246

Query: 294 W----LQDVISGDIVPEGYILMQNKHLGKDHP---PTESGSSTQ---IYDEFCPL--LLN 341
                L ++ +G++   GYI+      GK +P   P E  S      +YDEF P   +  
Sbjct: 247 QEFYRLTNLPAGEL--SGYII------GKHNPFFKPEEDASYDNLEYVYDEFHPFEPVHK 298

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
           +  +    + + ++  LD+F+S +ES +A  + + ++  A  +L  +  +   ++  L++
Sbjct: 299 KKENTRVEEVKGYNRTLDKFFSTLESSKAVLKIQQQQANAAKRLQTVKDEHMTKLQRLEE 358

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
           +   + +  ELI ++ E ++    +V+  L  +M W ++ ++V  E+K  NP+A +I   
Sbjct: 359 QQAINYRKGELITFHSEQIEQCKQSVQALLDQQMDWTNIEKLVAMEQKRRNPIANMIKLP 418

Query: 461 LYLERNCMSLLLSN--------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
           L L +N +++LL +        +  + +++ K+ PV  V +DL+LSA+ANA R+++  + 
Sbjct: 419 LNLAKNEITVLLPDIEEQSDSDSDSDSEEKRKSGPVA-VAIDLSLSAYANATRYFDAMRA 477

Query: 513 QESKQEKTITAHSKAFKAAEKKTR--LQILQEKT--VANISHMRKVHWFEKFNWFISSEN 568
              KQ KT  + S A K  E+  +  L+ +Q+K+   + +  +R   WFEKF WFI+S+N
Sbjct: 478 ALDKQNKTKNSASIAIKNTERTIQQDLKRMQKKSQEPSGLKQIRAKFWFEKFWWFITSDN 537

Query: 569 YLVISGRDAQQNEMIVKRYMSK-GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           +L I+GRD  Q ++I  RY  K  DV V  DL G    ++KN    + +PP TL QAG F
Sbjct: 538 HLCIAGRDDTQVDLIYYRYFDKNNDVLVSNDLDGL-KVIVKNPFKNKDIPPSTLLQAGIF 596

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           ++  S+AWD+KMVTS W V   QVSK    G  +  G   I+G+K FLPP  L+MGFGLL
Sbjct: 597 SLSASKAWDNKMVTSPWMVKGTQVSKKDFDGSIVPAGMLNIQGEKTFLPPCQLVMGFGLL 656

Query: 688 FRLDESSLGSHLNERRVRGEEEGMD 712
           +  DE +   + +  + R +E G++
Sbjct: 657 WLGDEETTRKYRDSAKSRIQEVGLE 681


>gi|156101618|ref|XP_001616502.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805376|gb|EDL46775.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 2067

 Score =  284 bits (727), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 179/484 (36%), Positives = 262/484 (54%), Gaps = 63/484 (13%)

Query: 332 YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHK 384
           + EF P++LN  +++      E V F+ F+  +D ++S++E S+  +QQ   K   +  K
Sbjct: 441 FTEFSPIILNNHKNKVEENKLEVVHFDDFNKCVDTYFSRMELSKYDKQQEVIKIKKSLTK 500

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           ++KI +D E R+  L++EV    K   LI+ N E V+ AI  +R A+A   +WE +   +
Sbjct: 501 MDKIKLDHERRIDQLEKEVSTLRKKISLIQMNDELVEQAIQLMRAAVATNANWEKIWEHI 560

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLLS----NNL------------DEMDDEEKTLPV 488
           K  +K  +P+A  I  +      M LLL     N L            D+   E    P 
Sbjct: 561 KLFKKQNHPIALRISSVNFNNCEMELLLDDGEENGLGSDDSSEANGRSDDPSSEANEQPS 620

Query: 489 EK---------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF----- 528
           +                V ++L  S + N   + +L+KK E K  KT  + + A      
Sbjct: 621 KGKKSSNKKAATNNRFAVTINLNNSVYGNVEDYQKLRKKAEEKIRKTKISTNFAVKKVEK 680

Query: 529 KAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
           K  EK+ + +    KTV  I  +RKV+WFEKF+WFISSENYLVI+GRDA QNE++ +RY 
Sbjct: 681 KKKEKENKQKGKHNKTVGQIQKIRKVYWFEKFHWFISSENYLVIAGRDALQNEILFRRYF 740

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
            K DVYVHAD+HGAS+ +IKN   + P+P  TL++AG   +C S AW++K++TSAWWV+ 
Sbjct: 741 QKNDVYVHADIHGASTCIIKNPHKDIPIPEKTLSEAGQLAICRSSAWNNKIITSAWWVHY 800

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
           HQVSK+APTGEYL  GSF+IRGKKN+LP   L MG  ++F++D ++L ++        EE
Sbjct: 801 HQVSKSAPTGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAALDNN--------EE 852

Query: 709 EGMDDFEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASN-----VDSH 763
             +DD + S  +    D E    D D+  V     V   A  A  H  A N     ++  
Sbjct: 853 NNLDDTQKSFEN----DGERRSSDGDQAVVG---GVTIDACTAEGHIQAGNPYTGPMEGT 905

Query: 764 EFPA 767
            FPA
Sbjct: 906 SFPA 909



 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 91/147 (61%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +   R +I G   +N+Y++S K Y+ K         S + +K   L
Sbjct: 1   MAKQRLTALDIRAIITLCRNIIVGCVVTNIYNISNKIYVLKC--------SKKEQKYFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ ++ QLG DR++  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSAFTMKLRKHLRSRKITNISQLGGDRVVDIQFGFDDKACH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD+   +L++L+++
Sbjct: 113 LIVELYIAGNIILTDNNHKILSILKTN 139


>gi|237842889|ref|XP_002370742.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
 gi|211968406|gb|EEB03602.1| hypothetical protein TGME49_014090 [Toxoplasma gondii ME49]
          Length = 1859

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 158/385 (41%), Positives = 235/385 (61%), Gaps = 23/385 (5%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q++E+        A  ++ KI  DQE R+  L++E  
Sbjct: 448 TRVLLHFRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAA 507

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K + K G+P+A  + +L LE
Sbjct: 508 NLLQQAQAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLE 567

Query: 465 RNCMSLLLSNNLDEM-----DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           +    LLL     E      +  E  L    V VD+ALSAH NA+  +   K+ ++K +K
Sbjct: 568 KQRAMLLLEAPRREEAEEPGEASETIL----VPVDVALSAHGNAQLLHSQVKQLKAKTQK 623

Query: 520 TITAHSKAFKAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           T  A + A  AA++K +  + Q+     +    +  +RK  WFEKF+WFISS++YLV++G
Sbjct: 624 TSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAG 683

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-------VPPLTLNQAGCF 627
           RDAQQNE++ +RY+   DVYVHAD+HGA++ +IKN R  +P       VP  TL Q G F
Sbjct: 684 RDAQQNEILFRRYLRSNDVYVHADVHGAATCIIKNSRETEPGKCDDPPVPLTTLQQCGEF 743

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW +K  ++AWWVY  QVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLL
Sbjct: 744 AVCRSSAWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLL 803

Query: 688 FRL-DESSLGSHLNER-RVRGEEEG 710
           FRL DE+S+  H+  R R+  EE G
Sbjct: 804 FRLADEASVARHVAARTRLALEEAG 828



 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  +  +G+  KV L +
Sbjct: 4   TKQRVGALDVRALVASVRPSIVGLRVTNVYDFSAGGSRGGTSSSYILKFAGKESKVFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR+++  FG   NA ++
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCVRLRKGLRGKKLEDIHQHGADRVVILTFGKSENALHL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           ++ELY  GNI+LTD    +  +LR H
Sbjct: 124 VVELYVSGNIILTDHTNLIQAVLRRH 149



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 20/97 (20%)

Query: 835  ATVRDKPYISKAERRKLKKGQGSSVVDPKVERE-------KERGKDASSQPESIVRKTKI 887
            A +  +  +S AERR+ KKG   +  DP    E       KE+ K    QP         
Sbjct: 1606 AEIPSRKRMSAAERRRQKKGNREAKDDPAGTAEEKEDMGGKEKAKGPRLQP--------- 1656

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
                + RG++GKL KMK+KYGDQDEEE+  +M+L+  
Sbjct: 1657 ----VPRGKRGKLAKMKKKYGDQDEEEKQFKMSLIGA 1689


>gi|221482059|gb|EEE20420.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 1859

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 158/385 (41%), Positives = 235/385 (61%), Gaps = 23/385 (5%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q++E+        A  ++ KI  DQE R+  L++E  
Sbjct: 448 TRVLLHFRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAA 507

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K + K G+P+A  + +L LE
Sbjct: 508 NLLQQAQAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLE 567

Query: 465 RNCMSLLLSNNLDEM-----DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           +    LLL     E      +  E  L    V VD+ALSAH NA+  +   K+ ++K +K
Sbjct: 568 KQRAMLLLEAPRREEAEEPGEASETIL----VPVDVALSAHGNAQLLHSQVKQLKAKTQK 623

Query: 520 TITAHSKAFKAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           T  A + A  AA++K +  + Q+     +    +  +RK  WFEKF+WFISS++YLV++G
Sbjct: 624 TSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAG 683

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-------VPPLTLNQAGCF 627
           RDAQQNE++ +RY+   DVYVHAD+HGA++ +IKN R  +P       VP  TL Q G F
Sbjct: 684 RDAQQNEILFRRYLRSNDVYVHADVHGAATCIIKNSRETEPGKCDDPPVPLTTLQQCGEF 743

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW +K  ++AWWVY  QVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLL
Sbjct: 744 AVCRSSAWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLL 803

Query: 688 FRL-DESSLGSHLNER-RVRGEEEG 710
           FRL DE+S+  H+  R R+  EE G
Sbjct: 804 FRLADEASVARHVAARTRLALEEAG 828



 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  +  +G+  KV L +
Sbjct: 4   TKQRVGALDVRALVASVRPSIVGLRVTNVYDFSAGGSRGGTSSSYILKFAGKESKVFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR+++  FG   NA ++
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCVRLRKGLRGKKLEDIHQHGADRVVILTFGKSENALHL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           ++ELY  GNI+LTD    +  +LR H
Sbjct: 124 VVELYVSGNIILTDHTNLIQAVLRRH 149



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 20/97 (20%)

Query: 835  ATVRDKPYISKAERRKLKKGQGSSVVDPKVERE-------KERGKDASSQPESIVRKTKI 887
            A +  +  +S AERR+ KKG   +  DP    E       KE+ K    QP         
Sbjct: 1606 AEIPSRKRMSAAERRRQKKGNREAKDDPAGTAEEKEDMGGKEKAKGPRLQP--------- 1656

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
                + RG++GKL KMK+KYGDQDEEE+  +M+L+  
Sbjct: 1657 ----VPRGKRGKLAKMKKKYGDQDEEEKQFKMSLIGA 1689


>gi|308808798|ref|XP_003081709.1| zinc knuckle (CCHC-type) family protein (ISS) [Ostreococcus tauri]
 gi|116060174|emb|CAL56233.1| zinc knuckle (CCHC-type) family protein (ISS), partial
           [Ostreococcus tauri]
          Length = 1090

 Score =  283 bits (723), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 193/588 (32%), Positives = 296/588 (50%), Gaps = 63/588 (10%)

Query: 1   MVKVRMNTADVAAEVKCLRRL-IGMRCSNVYDLSPKTYIFKLMNSSGVTESGES----EK 55
           M K +    DVAA    +RRL +G   +N  D+  +     +M  +  +  G+      +
Sbjct: 120 MPKRKYTAFDVAASTAAIRRLALGCALANARDVDGEGGDAVMMTFNRPSRDGDGVESRAR 179

Query: 56  VLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
           V ++++   R H T+YAR +  TPS F + +R+  R ++L D RQLG DR +   FG G 
Sbjct: 180 VRVVIDPSSRAHVTSYARARDGTPSAFVMAVRRAARGKKLRDARQLGRDRAMDLTFGAGD 239

Query: 116 NAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTAS 175
            A +VI+EL+ +GN+++TD+ +TV   LR+ RDDD    + +   Y       +     +
Sbjct: 240 GACHVIVELFGRGNVIVTDANYTVARALRTRRDDDVKTRVEANQPYSLARFHAWRPYGKA 299

Query: 176 KLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA 235
            + +AL +++         V  DG          LG +      D++            A
Sbjct: 300 DVVSALATAR--------VVAGDGE---------LGVE------DVT---------AVDA 327

Query: 236 KQP-TLKTVLGEALGYGPALSEHIILDTGLV--PNMKLSEVNKLEDNAIQVLVLAVAKFE 292
           ++P TL+  L  A GY P ++EH+    G++   N  L   + + +  +  L  A+   E
Sbjct: 328 RRPATLREALCRAFGYSPPIAEHVARAAGVLDGSNAALPFADDVRERYVDGLTRAIEDIE 387

Query: 293 DWLQDVISGDIV---PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFV 349
            W + V +G  V   P  Y  M  K  G D          ++ D+F P  L Q   R   
Sbjct: 388 SWFEGVTTGKRVADAPRVYTKMDAKADGTDE--------IEVVDDFAPFELKQNEGRRTK 439

Query: 350 KFE---------TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
            +E          FD  +DE++++++SQ    Q +  E  A  +L K   DQ+NRV  L+
Sbjct: 440 TYELPKGLDPALAFDHYVDEYFNELDSQSVILQRRKAEAQAIARLEKTLRDQKNRVEQLE 499

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDK 460
           +E +   + A LIEYN E VD AI AV  ALA+ MSW++L  M+KEER+ GNPVAG+I  
Sbjct: 500 RERELEEQRAVLIEYNHEAVDVAIEAVNSALASGMSWDELEAMIKEERRLGNPVAGMIKS 559

Query: 461 LYLERNCMSLLLSNNLDEMDDEEKTLPVEK---VEVDLALSAHANARRWYELKKKQESKQ 517
           + L  N +++ L N+LDE+ ++E  L  +K   V VDL LSAHANA   +  KKK   K 
Sbjct: 560 MDLANNEITITLENHLDELGEDEDALGKKKRVAVSVDLGLSAHANASVRFAAKKKNADKF 619

Query: 518 EKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           EKT+ A +KA  AAE K +  + +   V   +  R+  WFEKF+WFI+
Sbjct: 620 EKTLNAQNKAVAAAESKMKSAMERAANVVVATRARQPLWFEKFHWFIT 667


>gi|221502557|gb|EEE28284.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1859

 Score =  282 bits (722), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 158/385 (41%), Positives = 235/385 (61%), Gaps = 23/385 (5%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q++E+        A  ++ KI  DQE R+  L++E  
Sbjct: 448 TRVLLHFRDINMCVDEYFSSVDVQKSERAEAQARQEALSRVEKIKSDQEQRMQLLEEEAA 507

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K + K G+P+A  + +L LE
Sbjct: 508 NLLQQAQAVEANVVLVEQIIQLLRAALATGVDWDELGRQMKLQAKEGHPLAVHVHELKLE 567

Query: 465 RNCMSLLLSNNLDEM-----DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           +    LLL     E      +  E  L    V VD+ALSAH NA+  +   K+ ++K +K
Sbjct: 568 KQRAMLLLEAPRREEAEEPGEASETIL----VPVDVALSAHGNAQLLHSQVKQLKAKTQK 623

Query: 520 TITAHSKAFKAAEKKTRLQILQE-----KTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           T  A + A  AA++K +  + Q+     +    +  +RK  WFEKF+WFISS++YLV++G
Sbjct: 624 TSAATAAALAAADRKAQRTLKQKDQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAG 683

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-------VPPLTLNQAGCF 627
           RDAQQNE++ +RY+   DVYVHAD+HGA++ +IKN R  +P       VP  TL Q G F
Sbjct: 684 RDAQQNEILFRRYLRSNDVYVHADVHGAATCIIKNSRETEPGKCDDPPVPLTTLQQCGEF 743

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW +K  ++AWWVY  QVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLL
Sbjct: 744 AVCRSSAWTTKSPSAAWWVYGRQVSKSAPSGLYLSTGSFMIRGRRNFIQVHRLEMGFGLL 803

Query: 688 FRL-DESSLGSHLNER-RVRGEEEG 710
           FRL DE+S+  H+  R R+  EE G
Sbjct: 804 FRLADEASVARHVAARTRLALEEAG 828



 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 56/146 (38%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  +  +G+  KV L +
Sbjct: 4   TKQRVGALDVRALVASVRPSVVGLRVTNVYDFSAGGSRGGTSSSYILKFAGKESKVFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR+++  FG   NA ++
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCVRLRKGLRGKKLEDIHQHGADRVVILTFGKSENALHL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           ++ELY  GNI+LTD    +  +LR H
Sbjct: 124 VVELYVSGNIILTDHTNLIQAVLRRH 149



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 20/97 (20%)

Query: 835  ATVRDKPYISKAERRKLKKGQGSSVVDPKVERE-------KERGKDASSQPESIVRKTKI 887
            A +  +  +S AERR+ KKG   +  DP    E       KE+ K    QP         
Sbjct: 1606 AEIPSRKRMSAAERRRQKKGNREAKDDPAGTAEEKEDMGGKEKAKGPRLQP--------- 1656

Query: 888  EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
                + RG++GKL KMK+KYGDQDEEE+  +M+L+  
Sbjct: 1657 ----VPRGKRGKLAKMKKKYGDQDEEEKQFKMSLIGA 1689


>gi|328774280|gb|EGF84317.1| hypothetical protein BATDEDRAFT_8510 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 695

 Score =  282 bits (722), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 163/461 (35%), Positives = 254/461 (55%), Gaps = 49/461 (10%)

Query: 252 PALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM 311
           PA+++  I D     ++ L  ++  + ++   L+ A+ + +D L   I  D   +GYI+ 
Sbjct: 145 PAVNDANIADVQDSTSLDLYRIST-DSSSFLALLNALKQGDDILSSSI--DTPQQGYIVT 201

Query: 312 QNKHLGKDHPPTESGSST-----QIYDEFCPLLLNQF-------------RSREFVKFET 353
            +  + +    +++  S+       Y EF P    QF             +   F++F +
Sbjct: 202 SDSMVSQQLASSDTAQSSPTTTFTTYQEFHPYRFEQFNQDRSTSLSAELPKQTRFMEFVS 261

Query: 354 FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
           FD A+DE++SK+ESQR E +    E AA  KL  +    + ++   +  V+ + + A+ I
Sbjct: 262 FDKAVDEYFSKMESQRLEIRAHQAELAAVKKLENVKKSHQAQIQNFQSNVESNEQYAQAI 321

Query: 414 EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           E  LED+D+ +  V+  LA+ M W+DL  +VKEE   GN +A +I    L          
Sbjct: 322 ESRLEDIDSVLRTVQSFLASGMDWKDLEDLVKEETNNGNALAKMIIGFKLN--------- 372

Query: 474 NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
                       +   K+++D+  +A+ANARR+Y  KK   +KQ KT+   +K  K AE 
Sbjct: 373 ------------MEFFKIDLDIYSTAYANARRYYGAKKVAITKQSKTMEQSAKVVKMAEM 420

Query: 534 KTRLQILQ-EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
           K    +   +KT  +I+ +RK +WFEKF WF+SSEN+LV+ G+DA Q+ M+V RY+ KGD
Sbjct: 421 KIFQHLASVQKTAVSITKIRKPYWFEKFLWFVSSENFLVVGGKDATQSNMLVTRYLKKGD 480

Query: 593 VYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
            YVH+DL GA+S ++K       +       AG  +VC S+AWD+K++TSA+W   HQVS
Sbjct: 481 AYVHSDLPGAASVIVK------CMQSCVGTDAGTMSVCQSRAWDAKIITSAYWAEAHQVS 534

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           KT  TG+ L +G+FMIRGKKN+LPP  LI G  +LF+ D S
Sbjct: 535 KTTSTGDTLPLGTFMIRGKKNWLPPVQLIYGMAMLFQTDHS 575



 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 73/145 (50%), Positives = 102/145 (70%), Gaps = 9/145 (6%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +  DV+A V  L+ RL+G+R  NVYD++ KTY+FK         S    K LLL+
Sbjct: 1   MKQRFSALDVSASVVELKTRLVGLRLQNVYDINSKTYLFKF--------SRNETKELLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT ++RDK   PSGF +KLRKH+RTRRL ++RQLG DRI+  QF  G  A ++
Sbjct: 53  ESGIRMHTTQFSRDKSQMPSGFCMKLRKHLRTRRLVNLRQLGADRIMDMQFSEGEYAFHI 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
           I+E Y+ GNI+LTD E+ +L++LR+
Sbjct: 113 IVEFYSSGNIILTDHEYRILSVLRT 137


>gi|115443352|ref|XP_001218483.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114188352|gb|EAU30052.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 858

 Score =  281 bits (720), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 206/630 (32%), Positives = 326/630 (51%), Gaps = 59/630 (9%)

Query: 331 IYDEFCPLLLNQFRSREFV---KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
           +Y++F P    QF  +  V   +F + +A +DE++S IESQR E +   +E+AA  KL+ 
Sbjct: 42  LYEDFHPFKPRQFEGKPGVTILEFPSMNATVDEYFSSIESQRLESRLTEREEAAKKKLDA 101

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
           +  + E ++  LK   +  ++ A  IE N+  V  A+ AV   +A  M W ++AR+++ E
Sbjct: 102 VRQEHEKKIGALKHAQELHIRKAGAIEDNVYRVQEAMDAVNGLIAQGMDWVEIARLIEME 161

Query: 448 RKAGNPVAGLID-KLYLERNCMSLLLSNNLDE--------------------MDDEEKTL 486
           +  GNPVA +I   L L  N ++LLL    +E                        +   
Sbjct: 162 QGRGNPVAKIIKLPLKLYENTITLLLGEAGEEQEEAEELFSESEESEDEQETTQTAQNKP 221

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQE 542
            V  +++DL LS  ANA ++YE KK    K+++T  + +KA K+ EKK     +  + QE
Sbjct: 222 EVLTIDIDLGLSPWANATQYYEQKKMAAVKEQRTAQSSTKALKSHEKKVTEDLKKSLKQE 281

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K V  +   RK  WFEKF +F+SSE YLV+ GRDA Q+E++ +R++ KGD++VHADL GA
Sbjct: 282 KQV--LRPARKPFWFEKFLFFVSSEGYLVLGGRDAMQSELLYRRHLKKGDIFVHADLEGA 339

Query: 603 SSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
           +  V+KN    P  P+PP TL+QAG   V  S AWDSK V  AWWV+ HQVSKTA  G  
Sbjct: 340 TPIVVKNRPGTPNAPIPPSTLSQAGNLCVATSSAWDSKAVMPAWWVHAHQVSKTAEAGGL 399

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHH 720
           L  G F+++G+KNFL P  L++GF ++F++ + SL +H   R         D  E +   
Sbjct: 400 LPAGDFLVKGEKNFLAPSQLVLGFAVMFQISKESLKNHKLHR--------FDVTETTEST 451

Query: 721 KENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHT-NASNVDSHEFPAEDKTISNGIDSK 779
            +   + +  ++  + P   + + P +    PS T N          ++D+         
Sbjct: 452 AQGDGVAASAEEVKQTPEQANETAPAN---EPSETLNKQEEGEQSSDSDDEQEDPDAVPA 508

Query: 780 IFDIARNVAAPVTPQLEDLI---DRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT 836
              + R  +  +  +  D +   DRA    +A     +  +   +   +EED     T  
Sbjct: 509 RNPLQRGDSESMPEEPVDEVKGADRAADKPTADSQEAEEELSEEETAPAEEDSP---TVQ 565

Query: 837 VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIE---GGKIS 893
            ++K ++S  E+R  K+G+    +D + +   E+G   +S P +  +++K +       S
Sbjct: 566 TQEKEHLSAKEKRLAKQGKS---LDSETD---EKGSKQTSSPATNGKRSKGDTKSAPASS 619

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
           RG+KGK KK   KY DQDEEER + + LL 
Sbjct: 620 RGKKGKAKKAAAKYADQDEEERELALRLLG 649


>gi|347828082|emb|CCD43779.1| similar to serologically defined colon cancer antigen 1
           [Botryotinia fuckeliana]
          Length = 674

 Score =  278 bits (712), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 211/647 (32%), Positives = 335/647 (51%), Gaps = 95/647 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R SNVYDLS K ++ K              K  +L+
Sbjct: 78  MKQRFSSIDVKVIAHELSNALVTLRVSNVYDLSSKIFLIKFAKPDN--------KQQILI 129

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T ++R     PS F  +LRK ++TRR+  V Q+G DRII FQF  G    Y 
Sbjct: 130 DSGFRCHLTDFSRATAAAPSVFVQRLRKFLKTRRVTQVSQVGTDRIIEFQFSDGQYRLY- 188

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
            LE YA GNI+LTD E  +LTLLR     D G A                     +L   
Sbjct: 189 -LEFYAGGNIILTDKELNILTLLRVV---DPGEA-------------------QEELRVG 225

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSND-GARAKQP- 238
           L  S +      ++ N  G  + + +KE L          L K  +K  +D G + K+P 
Sbjct: 226 LKYSLD------NRQNYGG--IPDLTKERLQEA-------LQKGVDKGEDDSGKKKKKPG 270

Query: 239 -TLKTVLGEALG-YGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
             L+  L  ++  + P L +H +  T    ++K +EV + ED  ++ L+ ++ + +  +Q
Sbjct: 271 DALRKALAVSITEFPPMLVDHAMRITNFNSSLKPAEVLQSED-LLEHLMKSLQEAQRVVQ 329

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFR---SREFVK 350
           ++ S +   +GYI+ + K      P  E+ +  +   +YD+F P    QF+   S  F++
Sbjct: 330 EITSSE-TAKGYIVAKKKD--PQTPSDENETDIRKGLLYDDFHPFKPKQFQDDPSLVFLE 386

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F+  +DEF+S IE Q+ E + + +E  A  K+     +Q  R+  L++    + + A
Sbjct: 387 FEGFNKTVDEFFSSIEGQKLESKLEEREKQAQKKIQAARNEQAKRLGGLQEIQALNERKA 446

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             ++ N+E V  A  AV   +A  M W ++ R+++ E+K  NPVA +I   L LE N ++
Sbjct: 447 SALQANVERVQEATDAVNGLIAQGMDWFEIGRLIEREQKFNNPVASMIKLPLKLEENTVT 506

Query: 470 LLLS---------------NNLDEMDDEEKTLPVEK-----------VEVDLALSAHANA 503
           +LL                +++ E +DE+ T    K           +++DLALS  ANA
Sbjct: 507 ILLDEEAFDEEEDSTYETDSDVSESEDEDDTAKTNKKKEKVADTRIPIDIDLALSPWANA 566

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKT----RLQILQEKTVANISHMRKVHWFEK 559
           R +++ K+   SK++KT+ + SKA K+ E K     +  + QEK +  +  +RK  WFEK
Sbjct: 567 RNYFDQKRSAASKEDKTLQSSSKALKSTEAKIAQDLKKGLKQEKAI--LRPVRKQMWFEK 624

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           F WFISS+ YLV++G+DAQQ+E++ KRY+ KGDVY+HAD+ GA+S +
Sbjct: 625 FIWFISSDGYLVLAGKDAQQSEILYKRYLKKGDVYLHADIRGAASVI 671


>gi|401410580|ref|XP_003884738.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
 gi|325119156|emb|CBZ54708.1| hypothetical protein NCLIV_051350 [Neospora caninum Liverpool]
          Length = 1853

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 152/372 (40%), Positives = 229/372 (61%), Gaps = 18/372 (4%)

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +R  + F   +  +DE++S ++ Q+ E+        A  ++ KI  DQ  R+  L++E  
Sbjct: 435 TRVLLHFRDINVCVDEYFSSVDVQKGERAEALARHEALSRVEKIRSDQAQRMQQLEEEAA 494

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
             ++ A+ +E N+  V+  I  +R ALA  + W++L R +K++ K G+P+A  + +L LE
Sbjct: 495 SLLEEAQAVEANVGLVEQIIQLLRAALATGVDWDELGRQMKQQAKEGHPLAVHVQELRLE 554

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
           +    LLL     E   EE +     V +D+ LSAH NA+  +   K+ ++K  KT +A 
Sbjct: 555 KQRALLLL-----EAPGEEASGATTVVSIDITLSAHGNAQLLHSQVKQLKAKTLKTSSAT 609

Query: 525 SKAFKAAEKKTRLQILQEKTVANIS-----HMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
           + A  AA++K +  + Q++     +      +RK  WFEKF+WFISS++YLV++GRDAQQ
Sbjct: 610 AAALAAADRKAQRTLKQKEQQVLQAQQQLQKVRKAFWFEKFHWFISSDHYLVLAGRDAQQ 669

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKN-------HRPEQPVPPLTLNQAGCFTVCHS 632
           NE++ +RY+   DVYVHAD+HGA++ +IKN          E PVP  TL Q G F VC S
Sbjct: 670 NEILFRRYLRANDVYVHADVHGAATCIIKNTGETDPGKTEEPPVPLATLQQCGEFAVCRS 729

Query: 633 QAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-D 691
            AW++K   +AWWVY HQVSK+AP+G YL+ GSFMIRG++NF+  H L MGFGLLFRL D
Sbjct: 730 SAWNTKTPAAAWWVYGHQVSKSAPSGLYLSTGSFMIRGRRNFIQIHRLEMGFGLLFRLAD 789

Query: 692 ESSLGSHLNERR 703
           E+S+  H+  R+
Sbjct: 790 EASVARHVAARK 801



 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/146 (40%), Positives = 87/146 (59%), Gaps = 1/146 (0%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
            K R+   DV A V  +R  ++G+R +NVYD S         +S  V  +G+  K+ L +
Sbjct: 4   TKQRVGALDVRALVASIRPAVLGLRVTNVYDFSSGGGRGAGSSSYIVKLAGKDSKIFLFI 63

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
            +G RL+TT + +DK   PS F ++LRK +R ++LED+ Q G DR++L  FG G N   +
Sbjct: 64  HAGFRLYTTEWKKDKGALPSPFCMRLRKSLRGKKLEDIHQHGADRVVLLTFGKGENTLRL 123

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSH 146
           I+ELY  GNI+LTD    +L +LR H
Sbjct: 124 IVELYVSGNIVLTDHTNLILAVLRRH 149


>gi|297736765|emb|CBI25966.3| unnamed protein product [Vitis vinifera]
          Length = 369

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 128/160 (80%), Positives = 142/160 (88%), Gaps = 2/160 (1%)

Query: 593 VYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           +Y+HADLHGASSTVI+NH+PE PVPPLTL+QAGCFTVCHSQAWDSK+VTSAWWVYPHQVS
Sbjct: 104 LYIHADLHGASSTVIENHKPEHPVPPLTLSQAGCFTVCHSQAWDSKIVTSAWWVYPHQVS 163

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           KTAPT EYLTVGSFMIRGKKNFLPPHPL+MGFGLL  LDESSLGSHLNERRVRGEEEG  
Sbjct: 164 KTAPTVEYLTVGSFMIRGKKNFLPPHPLMMGFGLLLCLDESSLGSHLNERRVRGEEEGAQ 223

Query: 713 DFEDSGHHKENSDIESEKDDTDEKPVAESLSV--PNSAHP 750
           DFE++   K NSD ESEK++TDEK  AES S+  P++  P
Sbjct: 224 DFEENESLKGNSDSESEKEETDEKRTAESKSIMDPSTHQP 263


>gi|428179079|gb|EKX47951.1| hypothetical protein GUITHDRAFT_106038 [Guillardia theta CCMP2712]
          Length = 841

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 229/766 (29%), Positives = 378/766 (49%), Gaps = 116/766 (15%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTES------------ 50
           K+RM++ D++ E + LR LIG R +N+YD++ +T   +L  S  + ES            
Sbjct: 91  KMRMSSLDLSVETRILRNLIGTRVANIYDINARTLEIRLGASCALKESQTLPMSADALHV 150

Query: 51  -GESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
            G S+++ +++ESG RLHT+ + R   + PS F  K+RKHIR + L DVRQ+G DR++  
Sbjct: 151 NGSSQRISVVIESGSRLHTSRFHRATASRPSNFATKIRKHIRGQFLNDVRQVGKDRVLQM 210

Query: 110 QFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVF 169
            FGLG  ++++ILE YA GNI+L D E T+L+LLRS+   D G  +  + +Y  +    F
Sbjct: 211 TFGLGNRSNHLILEFYAAGNIILCDHEMTILSLLRSYETPD-GRHVEVKSKYLIDDGGGF 269

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNS 229
           +  +  +L  A+  S+               ++    +E+ G     K            
Sbjct: 270 QPMSCDRLVKAIERSR---------------SICRGLRESTGSSLTRKD----------- 303

Query: 230 NDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
                 K+  L  +L     Y   L EH++L  G+ P++   EV    D  +Q L+ A  
Sbjct: 304 -----KKKTALMKLLATECQYPGQLIEHVLLCAGIQPDIPADEVRN--DIDLQRLLQAFK 356

Query: 290 KFEDWLQ-------DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQ 342
           + +              S     +GY+++           ++S + T +  EF P+LL Q
Sbjct: 357 EIDHLFMLGHSQQLATPSSSAALKGYVILDR--------ISDSSNQTLVISEFSPILLKQ 408

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
              +  +++ + D A+DEF+S I+  R ++      +    K+NK   D ++    LKQE
Sbjct: 409 QEDKMVLEYPSIDVAMDEFFSTIDFNRDQKDANEAVETVSKKVNKAKKDIKSHTEGLKQE 468

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K A L+E N   +D A+  +R              +VK  R A N    ++ +++
Sbjct: 469 ELLNHKKATLLELNSFHIDEALDKIR-------------GLVKIHRNAAN----VLHEIH 511

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVE--VDLALSAHANARRWYELKKKQESKQEKT 520
              +  S  L        +  K+     +   +D ++S+ ANAR +++ KKK  +KQ++ 
Sbjct: 512 EMNSTASFRLPQEGIVESEAVKSRGATDITLVLDYSISSLANARNFFQKKKKVAAKQQRA 571

Query: 521 -----ITAHSKAFKAAEKK----TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
                I+  +   KA+++K    ++     + +   IS +R+  WFEKF WFISS+  LV
Sbjct: 572 EEMADISLKNTQIKASQRKNTKASKNDFQSKSSSIGISSVRRKFWFEKFFWFISSDQILV 631

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-PPLTLNQAGCFTVC 630
           I+G+DAQQNE++VKR                      N   E+ V P  T+ QA  F VC
Sbjct: 632 IAGKDAQQNELLVKR----------------------NELKERKVLPENTILQAAEFAVC 669

Query: 631 HSQAWDSKMV--TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            S AW SK    T+A+WVYP QVSK   +GEYL+ G F+IRGKKNF+    L MGFG+ F
Sbjct: 670 RSSAWKSKTASGTAAYWVYPDQVSKAPQSGEYLSKGGFVIRGKKNFVSISTLCMGFGIFF 729

Query: 689 RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTD 734
               ++  ++ +E  +R E+E ++   ++      ++I+++K +TD
Sbjct: 730 YSPRANDLTY-DENLMRKEQEDVEIVTETMSQTSFTEIDADKRNTD 774


>gi|68475252|ref|XP_718344.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
 gi|68475451|ref|XP_718248.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
 gi|46440007|gb|EAK99318.1| hypothetical protein CaO19.2582 [Candida albicans SC5314]
 gi|46440107|gb|EAK99417.1| hypothetical protein CaO19.10114 [Candida albicans SC5314]
          Length = 1018

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 215/751 (28%), Positives = 359/751 (47%), Gaps = 108/751 (14%)

Query: 19  RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKK 76
           + L   R  N+Y+++   + Y+FK         S    K ++++E G R+H T + R   
Sbjct: 19  KELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVVVLEYGNRIHLTDFERPTT 70

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
             P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +Y++LE ++ GNILL D  
Sbjct: 71  QQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KYYLVLEFFSAGNILLLDES 128

Query: 137 FTVLTLLR--SHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHAALTSSKEPDANEPD 193
             +L L R  S + ++   A+        E  ++F+++   +  H              +
Sbjct: 129 QRILALQRLVSAKQENDRYAV-------NEEYKMFDKSLFQQDFHY-------------E 168

Query: 194 KVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL-KTVLGEALGYGP 252
           K + D + V +  + +           LS+NS     D  +AK  ++ K     A     
Sbjct: 169 KRSYDLDEVESWIQTH--------KLKLSQNS-----DNKKAKVFSIHKLAFINASHLSG 215

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +    ++G+ P+       K ++ A+Q +V A+   ED   D+I+G+I  EGYI+ +
Sbjct: 216 ELIQKWFFESGIDPSQSCLSFEKNQE-ALQRVVNALGVCEDKYIDLINGEIATEGYIVAK 274

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPL---LLNQFRSREFVKFETFDAALDEFYSKIESQR 369
                K++  +E      IYDEF P      NQ    +F+    ++  LD+F+S IES +
Sbjct: 275 -----KNNKVSEKSDLEYIYDEFHPFEPYKPNQ-EGIKFISVSGYNKTLDKFFSNIESTK 328

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
              + + +++ A  +L K   +++ ++ +L  +   + K  ELI+Y+ E V+     V+ 
Sbjct: 329 FSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAKKGELIQYHSELVEECRSYVQS 388

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------- 477
            +  +M W ++  ++  E+K  N +A  I   L L+ N + +LL +  D           
Sbjct: 389 FIDQQMDWTNIETVISLEQKKKNELAQHIQLPLNLKENKIKVLLEDFDDYEEITESASAT 448

Query: 478 ------------------EMDDEEKTLPVEKVE-----------------VDLALSAHAN 502
                             E D++E  +PV++ +                 +DL+ SA AN
Sbjct: 449 ETGSETETESESESSSESESDNDEDKIPVKRTQRKTNTKEKPKRKTIPTWIDLSQSAFAN 508

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKF 560
           AR +++ KK  E+KQ K   + S A K AE+K    + +     N  +  +R  +WFEKF
Sbjct: 509 ARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLTRSLKQENDTLKEIRPKYWFEKF 568

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SSE YL ++G+DA Q +MI  R+ S  D  V AD+ G+    IKN    + +PP T
Sbjct: 569 FWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSADMEGSLKVFIKNPLKGEALPPST 628

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L QAG F +  S AW+ K+ TSAW ++  ++SK    G  +  G F    +K +LPP  L
Sbjct: 629 LMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDGSIVPEGEFNYLVQKEYLPPAQL 688

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           IMGFG    LD+ S   +   R  R  E G 
Sbjct: 689 IMGFGFYCLLDDESTKRYGEIRTKRELEHGF 719


>gi|389585510|dbj|GAB68240.1| hypothetical protein PCYB_131140 [Plasmodium cynomolgi strain B]
          Length = 1898

 Score =  273 bits (698), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 168/444 (37%), Positives = 249/444 (56%), Gaps = 70/444 (15%)

Query: 332 YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHK 384
           + EF P++LN  +++      E + F+ F+  +D ++S++E S+  +QQ   K   +  K
Sbjct: 464 FTEFSPIILNNHKNKVEENKLEVINFDDFNKCVDTYFSRMELSKYDKQQEVIKIKKSLTK 523

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           ++KI +D E R+  L++EV    K   LI+ N E V+ AI  +R A+A   +WE +   +
Sbjct: 524 MDKIKLDHERRIEQLEKEVSSLKKKISLIQMNDELVEQAIQLMRAAVATNANWEKIWEHI 583

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKT------LPVE--------- 489
           K  +K  +P+A  I  +      M LLL       DDEE T      L  E         
Sbjct: 584 KLFKKQNHPIALRISSVNFNNCEMELLL-------DDEEATEQGSDDLSSEANEQGSDDP 636

Query: 490 ------------------------KVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
                                    V ++L  S + N   + +L+KK E K  KT  + +
Sbjct: 637 SSEANEQQSKGKASNREVATRSRFAVTINLNNSVYGNVEDYQKLRKKAEEKIRKTKISTN 696

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
            A K  EKK + +I + +   NI+  R V+WFEKF+WFISSENYLVI+GRDA QNE++ +
Sbjct: 697 FAVKKVEKKKKKKINRRE---NITRQR-VYWFEKFHWFISSENYLVIAGRDALQNEILFR 752

Query: 586 RYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWW 645
           RY  K D+YVHAD+HGAS+ +IKN   + P+P  TL++AG   +C S AW++K++TSAWW
Sbjct: 753 RYFQKNDIYVHADIHGASTCIIKNPHKDIPIPEKTLSEAGQLAICRSSAWNNKIITSAWW 812

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           V+ HQVSK+AP GEYL  GSF+IRGKKN+LP   L MG  ++F++D ++L ++       
Sbjct: 813 VHYHQVSKSAPAGEYLKTGSFVIRGKKNYLPHVKLEMGLCIIFQVDNAALDNN------- 865

Query: 706 GEEEGMDD----FEDSGHHKENSD 725
            EE  +DD    FE+ G  K +SD
Sbjct: 866 -EENNLDDTQRSFENDG-EKRSSD 887



 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 56/147 (38%), Positives = 91/147 (61%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVK-CLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   L+G   +N+Y++S K Y+ K         S + +K   L
Sbjct: 1   MAKQRLTALDIRAIITLCKNILVGCVVTNIYNISNKIYVLKC--------SKKEQKYFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ ++ QLG DR++  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSAFTMKLRKHLRSRKITNISQLGGDRVVDIQFGFDDKACH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD+   +L++L+S+
Sbjct: 113 LIVELYIAGNIILTDNNHKILSILKSN 139


>gi|3859683|emb|CAA22020.1| conserved hypothetical protein [Candida albicans]
          Length = 1018

 Score =  272 bits (696), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 214/751 (28%), Positives = 358/751 (47%), Gaps = 108/751 (14%)

Query: 19  RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKK 76
           + L   R  N+Y+++   + Y+FK         S    K ++++E G R+H T + R   
Sbjct: 19  KELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVVVLEYGNRIHLTDFERPTT 70

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
             P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +Y++LE ++ GNILL D  
Sbjct: 71  QQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KYYLVLEFFSAGNILLLDES 128

Query: 137 FTVLTLLR--SHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHAALTSSKEPDANEPD 193
             +L L R  S + ++   A+        E  ++F+++   +  H              +
Sbjct: 129 QRILALQRLVSAKQENDRYAV-------NEEYKMFDKSLFQQDFHY-------------E 168

Query: 194 KVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL-KTVLGEALGYGP 252
           K + D + V +  + +           LS+NS     D  +AK  ++ K     A     
Sbjct: 169 KRSYDLDEVESWIQTH--------KLKLSQNS-----DNKKAKVFSIHKLAFINASHLSG 215

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +    ++G+ P+       K ++ A+Q +V A+   ED   D+I+G I  EGYI+ +
Sbjct: 216 ELIQKWFFESGIDPSQSCLSFEKNQE-ALQRVVNALGVCEDKYIDLINGAIATEGYIVAK 274

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPL---LLNQFRSREFVKFETFDAALDEFYSKIESQR 369
                K++  +E      IYDEF P      NQ    +F+    ++  LD+F+S IES +
Sbjct: 275 -----KNNKVSEKSDLEYIYDEFHPFEPYKPNQ-EGIKFISVSGYNKTLDKFFSNIESTK 328

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
              + + +++ A  +L K   +++ ++ +L  +   + K  ELI+Y+ E V+     V+ 
Sbjct: 329 LSMKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAKKGELIQYHSELVEECRSYVQS 388

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------- 477
            +  +M W ++  ++  E+K  N +A  I   L L+ N + +LL +  D           
Sbjct: 389 FIDQQMDWTNIETVISLEQKKKNELAQHIQLPLNLKENKIKVLLEDFDDYEESTESASAT 448

Query: 478 ------------------EMDDEEKTLPVEKVE-----------------VDLALSAHAN 502
                             E D++E  +PV++ +                 +DL+ SA AN
Sbjct: 449 ETGSETETESESESSSESESDNDEDKIPVKRTQRKTNTKEKPKRKTIPTWIDLSQSAFAN 508

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKF 560
           AR +++ KK  E+KQ K   + S A K AE+K    + +     N  +  +R  +WFEKF
Sbjct: 509 ARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLTRSLKQENDTLKEIRPKYWFEKF 568

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SSE YL ++G+DA Q +MI  R+ S  D  V AD+ G+    IKN    + +PP T
Sbjct: 569 FWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSADMEGSLKVFIKNPLKGEALPPST 628

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L QAG F +  S AW+ K+ TSAW ++  ++SK    G  +  G F    +K +LPP  L
Sbjct: 629 LMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDGSIVPEGEFNYLVQKEYLPPAQL 688

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           +MGFG    LD+ S   +   R  R  E G 
Sbjct: 689 VMGFGFYCLLDDESTKRYGEIRTKRELEHGF 719



 Score = 40.4 bits (93), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 891 KISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           ++ RG++ KLKK+  KY DQDEEER +RM  L  
Sbjct: 791 QLPRGKRSKLKKIAAKYRDQDEEERKLRMDALGT 824


>gi|83033024|ref|XP_729296.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23486663|gb|EAA20861.1| strong similarity to unknown protein-related [Plasmodium yoelii
           yoelii]
          Length = 1768

 Score =  272 bits (695), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 163/416 (39%), Positives = 241/416 (57%), Gaps = 29/416 (6%)

Query: 330 QIYDEFCPLLL----NQFRSR--EFVKFETFDAALDEFYSKIESQRAEQ-QHKAKEDAAF 382
           +++ EF P+LL    N+   +  E +KF+ F+  +D ++SKIE  + ++ Q   K   A 
Sbjct: 437 RLFVEFSPILLKNHINKINEKKIEIIKFDNFNMCVDTYFSKIELTKYDKHQEMNKNKNAL 496

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K++KI +D E R+  L++EV    K   LIE N + V  AI  +R A++   +WE +  
Sbjct: 497 TKMDKIKLDHEKRIEGLEKEVSMLKKKILLIELNYQFVGEAIKLMRSAISTSANWEKIWD 556

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLLS-------------NNLDEMDDEEKTLPVE 489
            +K  +K  +P+A  I  +      M LLL              NNL     +EK +  +
Sbjct: 557 HIKLFKKRNHPIALKIMSVNFNNCEMELLLDDNDDDDVEESGDDNNLKNDKWKEKVIEEK 616

Query: 490 K----VEVDLALSAHANARRWYELKKKQESKQEK----TITAHSKAFKAAEKKTRLQILQ 541
                V ++L  S   N   + +L+KK E K  K    T  A  K  K  + K   Q  +
Sbjct: 617 NKTCAVTINLNNSVFGNIEDYEKLRKKAEEKIRKIKMSTNIAVKKVEKKKKDKDIKQKGK 676

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
            K+V  I  +RK+ WFEKFNWFISSENYLVISGRD+ QNE++ +RY    D+YVHAD+HG
Sbjct: 677 NKSVFQIKKIRKIFWFEKFNWFISSENYLVISGRDSLQNEILFRRYFQNNDIYVHADIHG 736

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           A+S +IKN   + P+P  TL++AG   +C S AW++KM+TSAWWVY HQVSKTAPTGEY+
Sbjct: 737 AASCIIKNPYKDIPIPEKTLSEAGQLAMCRSSAWNNKMITSAWWVYYHQVSKTAPTGEYI 796

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDS 717
             GSF+IRGKKN+LP   L MG  ++F++++  +  +  E ++  +E   D+ E++
Sbjct: 797 KTGSFVIRGKKNYLPYAKLEMGLCIIFQINK-KVNDNNEENKLTDDEPNCDNNEEN 851



 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 62/156 (39%), Positives = 100/156 (64%), Gaps = 9/156 (5%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   +IG   +N+Y++S K Y+ K         S + +K  LL
Sbjct: 1   MGKQRLTALDIRAIITSCKNTIIGSVVTNIYNISNKIYVLKC--------SKKEQKYFLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PSGFT+KLRKH+R+R++ ++ QLG DR+I  QFG   N ++
Sbjct: 53  LEAEKRVHITEWVREKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVIDIQFGYDDNMYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAI 155
           +I+ELY  GNI+LTDS++ ++ +L+S+ D+ K + I
Sbjct: 113 LIVELYIAGNIILTDSDYKIIFILKSNDDNKKNLKI 148


>gi|170576547|ref|XP_001893673.1| Serologically defined colon cancer antigen 1 [Brugia malayi]
 gi|158600188|gb|EDP37492.1| Serologically defined colon cancer antigen 1, putative [Brugia
           malayi]
          Length = 307

 Score =  272 bits (695), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 138/277 (49%), Positives = 185/277 (66%), Gaps = 10/277 (3%)

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           +AG+P+A  I  L L  N M+LLL       D     +  +KV +D+ALS++ NAR+ + 
Sbjct: 7   EAGSPIAASIVGLNLNSNQMTLLLG------DPYRPEIDPKKVTIDIALSSYQNARKLHT 60

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
            KK  + K++KTI A SKA K+ + K +  +    T A +   R V WFEKF WF+SSEN
Sbjct: 61  EKKAAQQKEQKTICASSKALKSTKMKMKETLKVVHTKAEVMKKRHVMWFEKFFWFVSSEN 120

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           YLVI GRDAQQNE++VKRY+  GD+Y+HAD+ GASS +I+N      VPP TLN+A    
Sbjct: 121 YLVIGGRDAQQNELLVKRYLRPGDIYMHADVRGASSIIIRNKLGGGDVPPRTLNEAATMA 180

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           + +S AW++K+ +SAWWV+ HQVS+TAPTGEYLT GSFMIRGKKN+LP   L MGFG++F
Sbjct: 181 ISYSSAWEAKITSSAWWVHQHQVSRTAPTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMF 240

Query: 689 RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSD 725
           +LDE SL  H  ER+V      M   ED+  H+++ D
Sbjct: 241 QLDEESLERHREERKV----APMVTAEDNAMHQDDGD 273


>gi|238879662|gb|EEQ43300.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1018

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 214/751 (28%), Positives = 356/751 (47%), Gaps = 108/751 (14%)

Query: 19  RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKK 76
           + L   R  N+Y+++   + Y+FK         S    K ++++E G R+H T + R   
Sbjct: 19  KELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVVVLEYGNRIHLTDFERPTT 70

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
             P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +Y++LE ++ GNILL D  
Sbjct: 71  QQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KYYLVLEFFSAGNILLLDES 128

Query: 137 FTVLTLLR--SHRDDDKGVAIMSRHRYPTEICRVFERTTASK-LHAALTSSKEPDANEPD 193
             +L L R  S + ++   A+        E  ++F+++   +  H              +
Sbjct: 129 QRILALQRLVSAKQENDRYAV-------NEEYKMFDKSLFQQDFHY-------------E 168

Query: 194 KVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL-KTVLGEALGYGP 252
           K + D + V +  + +           LS+NS     D  +AK  ++ K     A     
Sbjct: 169 KRSYDLDEVESWIQTH--------KLKLSQNS-----DNKKAKVFSIHKLAFINASHLSG 215

Query: 253 ALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQ 312
            L +    ++G+ P+       K    A+Q +V A+   ED   D+I+G I  EGYI+ +
Sbjct: 216 ELIQKCFFESGIDPSQSCLSFEK-NQGALQRVVNALGVCEDKYIDLINGAIATEGYIVAK 274

Query: 313 NKHLGKDHPPTESGSSTQIYDEFCPL---LLNQFRSREFVKFETFDAALDEFYSKIESQR 369
                K++  +E      IYDEF P      NQ    +F+    ++  LD+F+S IES +
Sbjct: 275 -----KNNKVSEKSDLEYIYDEFHPFEPYKPNQ-EGIKFISVSGYNKTLDKFFSNIESTK 328

Query: 370 AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV 429
              + + +++ A  +L K   +++ ++ +L  +   + K  ELI+Y+ E V+     V+ 
Sbjct: 329 FSIKIEQQKENAAKRLEKARSERDKQIDSLVAQQRLNAKKGELIQYHSELVEECRSYVQS 388

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLD----------- 477
            +  +M W ++  ++  E+K  N +A  I   L L+ N + +LL +  D           
Sbjct: 389 FIDQQMDWTNIETVISLEQKKKNELAQHIQLPLNLKENKIKVLLEDFDDYEESTESASAT 448

Query: 478 ------------------EMDDEEKTLPVEKVE-----------------VDLALSAHAN 502
                             E D++E  +PV++ +                 +DL+ SA AN
Sbjct: 449 ETGSETETESESESSSESESDNDEDKIPVKRTQRKTNTKEKPKRKTIPTWIDLSQSAFAN 508

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKF 560
           AR +++ KK  E+KQ K   + S A K AE+K    + +     N  +  +R  +WFEKF
Sbjct: 509 ARSYFDSKKTAETKQVKVENSTSMALKNAERKITQDLTRSLKQENDTLKEIRPKYWFEKF 568

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SSE YL ++G+DA Q +MI  R+ S  D  V AD+ G+    IKN    + +PP T
Sbjct: 569 FWFVSSEGYLCLAGKDASQTDMIYYRHFSDNDSIVSADMEGSLKVFIKNPLKGEALPPST 628

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
           L QAG F +  S AW+ K+ TSAW ++  ++SK    G  +  G F    +K +LPP  L
Sbjct: 629 LMQAGIFAMSASSAWNGKVTTSAWVLHGTEISKRDFDGSIVPEGEFNYLVQKEYLPPAQL 688

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           +MGFG    LD+ S   +   R  R  E G 
Sbjct: 689 VMGFGFYCLLDDESTKRYGEIRTKRELEHGF 719



 Score = 40.0 bits (92), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 891 KISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           ++ RG++ KLKK+  KY DQDEEER +RM  L  
Sbjct: 791 QLPRGKRSKLKKIAAKYRDQDEEERKLRMDALGT 824


>gi|297297786|ref|XP_002805097.1| PREDICTED: serologically defined colon cancer antigen 1-like
           [Macaca mulatta]
          Length = 856

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 163/447 (36%), Positives = 237/447 (53%), Gaps = 89/447 (19%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  KLE   I+ +++++ K ED+++   
Sbjct: 183 LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--T 238

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
           + +   +GYI+ Q + +       +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 239 TSNFSGKGYII-QKREIKPSLEADKPVEDILTYEEFHPFLFSQHSQCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
           EFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL+ 
Sbjct: 298 EFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQI 357

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN----- 474
           VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N     
Sbjct: 358 VDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRNPYLLS 417

Query: 475 ---------------NLDEMDDEEKTLPVEK------------VEVDLALSAHANARRWY 507
                          N  E    +K     K            V+VDL+LSA+ANA+++ 
Sbjct: 418 EEEDDDVDGDVNVEKNETEPPKGKKKKQKNKQLQKPQKNKPLLVDVDLSLSAYANAKKF- 476

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
                                                        K  WF      ISSE
Sbjct: 477 --------------------------------------------EKFLWF------ISSE 486

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
           NYL+I GRD QQNE+IVKRY++ GD+YVHADLHGA+S VIKN   E P+PP TL +AG  
Sbjct: 487 NYLIIGGRDQQQNEIIVKRYLTPGDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTM 545

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKT 654
            +C+S AWD++++TSAWWVY HQ+ ++
Sbjct: 546 ALCYSAAWDARVITSAWWVYHHQIIRS 572



 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 102/170 (60%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E
Sbjct: 113 IIELYDRGNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE 162


>gi|70949333|ref|XP_744087.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56523889|emb|CAH79538.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 1345

 Score =  261 bits (668), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 155/375 (41%), Positives = 222/375 (59%), Gaps = 20/375 (5%)

Query: 330 QIYDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIESQRAEQ-QHKAKEDAAF 382
           +++ EF P+LL    ++      E +KF  F+  +D ++SK+E  + ++ Q   K   A 
Sbjct: 403 RLFVEFSPILLKNHINKIDEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKNAL 462

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K++KI +D E R+  L++EV+   K   LI+ N E V  AI  +R A++   +WE +  
Sbjct: 463 TKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKIWD 522

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHAN 502
            VK  +K  +PVA  I  +    NC   LL   L+E D EE +      E  +     A 
Sbjct: 523 HVKLFKKRNHPVALKIMSVNFN-NCEIELL---LNEGDTEESSSEDSSKEKGMEEKNKAC 578

Query: 503 ARRWYELKKKQESKQEK----TITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
                 L+KK E K  K    T  A  K  K  + K   Q  + K+V  I  +RK+ WFE
Sbjct: 579 T-----LRKKAEEKIRKIKMSTNVAIKKVEKKKKDKDTKQKGKHKSVFQIQKLRKIFWFE 633

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KFNWF+SSENYLVISGRD+ QNE++ +RY    D+YVHAD+HGA+S +IKN   + P+P 
Sbjct: 634 KFNWFLSSENYLVISGRDSLQNEILFRRYFQNNDIYVHADIHGAASCIIKNPYKDIPIPE 693

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
            TL +AG   +C S AW++K++TSAWWVY HQVSKTAPTGEY+  GSF+IRGKKN+LP  
Sbjct: 694 KTLAEAGQLAMCRSSAWNNKVITSAWWVYYHQVSKTAPTGEYIKTGSFVIRGKKNYLPYA 753

Query: 679 PLIMGFGLLFRLDES 693
            L MG  ++F+++++
Sbjct: 754 KLEMGLSIIFQVNKN 768



 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 101/156 (64%), Gaps = 9/156 (5%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C + +IG   +N+Y++S K Y+ K         S + +K  LL
Sbjct: 1   MGKQRLTALDIRAIITSCKKTIIGSVVTNIYNISNKIYVLKC--------SKKEQKYFLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PSGFT+KLRKH+R+R++ ++ QLG DR++  QFG   N ++
Sbjct: 53  LEAEKRMHITEWMREKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVVDIQFGYDDNVYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAI 155
           +I+ELY  GNI+LT++E+ ++ +L+S+ D+ K + I
Sbjct: 113 LIVELYIAGNIVLTNNEYKIIFILKSNDDNKKKLKI 148


>gi|320589532|gb|EFX01993.1| duf814 domain containing protein [Grosmannia clavigera kw1407]
          Length = 1969

 Score =  261 bits (668), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 160/429 (37%), Positives = 245/429 (57%), Gaps = 22/429 (5%)

Query: 327  SSTQI-YDEFCPLLLNQFRSRE---FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
            +ST++ Y +F P    QF +      + F++F+ A DEFYS ++  +A++Q   +E  AF
Sbjct: 1215 ASTKLDYVDFHPFKPRQFEADPKCVLLPFDSFNKAADEFYSHLQGLKADRQLHQQESVAF 1274

Query: 383  HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
             KL     DQ  R+ +L++    + + A  IE N E V AAI AV   L     WEDLA 
Sbjct: 1275 KKLEATRRDQAMRIESLQETQQLNTRKAAAIEANQEWVQAAIDAVNDQLHVGTDWEDLAH 1334

Query: 443  MVKEERKAGNPVAGLID-KLYLERNCMSLLLSNN--------LDEMDDEEKTLPVEKVEV 493
            ++ E     NPVA LI   + L    ++L LS+          DE ++E +   +  V V
Sbjct: 1335 LI-ENSADSNPVAALIKLPMRLADGIITLQLSDEPAADFDEDFDEDEEEAEEEELLDVNV 1393

Query: 494  DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT--RLQILQEKTVANISHM 551
             LALSA  NAR +Y+ K+   SK++KT    S A + AEKK    L+ +Q+        +
Sbjct: 1394 KLALSAWGNAREYYDQKRVAASKEQKTKEVTSMALRNAEKKVAEELKRVQKGGKPAPQLI 1453

Query: 552  RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
            R+  WFEKF WF+SS+ +LVI+ +++QQ E++ +R++ +GD+YVHAD+ G+   ++  +R
Sbjct: 1454 RRQLWFEKFLWFVSSDGHLVIAAKESQQCELMYRRHLRRGDIYVHADIRGSPGIIVVKNR 1513

Query: 612  PE----QPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
            P+     P+PP TL QAGC  VC S+AWD+K    A+WV+ +QV KT  +G+ L +GSF 
Sbjct: 1514 PDVGADAPIPPGTLAQAGCLAVCASEAWDNKAGFGAYWVHANQVFKTTASGDVLPLGSFD 1573

Query: 668  IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIE 727
            IRG+KN LPP   ++GFGLLF++  +    +  E  V GE+   DD E  G   ++  +E
Sbjct: 1574 IRGEKNHLPPPQRVLGFGLLFQISNARTADYA-EVEVAGEDVA-DDVESDGPEIDSCPVE 1631

Query: 728  SEKDDTDEK 736
                +++ K
Sbjct: 1632 GNAQESEVK 1640



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 79/162 (48%), Gaps = 20/162 (12%)

Query: 2    VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTE-----SGESEK 55
            +K R ++ DV A    L   L G R +NVYDL P +       +S         S   +K
Sbjct: 878  MKQRFSSLDVRAISHELHHSLAGTRVTNVYDLVPPSSSASSTAASTSRALLLRFSRGQDK 937

Query: 56   VLLLMESGVRLHTTAY-ARDK-----------KNTPSGFTLKLRKHIRTRRLEDVRQLGY 103
              L+++SG R H TAY AR              + PS F  +LR  +  R +  V+Q+G 
Sbjct: 938  FQLVVDSGFRCHLTAYDARASAASKGSSAGSAPHAPSAFVARLRTFLNGRHVTAVQQVGT 997

Query: 104  DRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRS 145
            DRI+  +F  G    Y  LE +A GN++LT++E  VL L R+
Sbjct: 998  DRIVELRFSDGQLRLY--LEFFAAGNVVLTNAEAKVLALQRT 1037


>gi|73853411|gb|AAZ86776.1| IP12823p [Drosophila melanogaster]
          Length = 489

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 194/553 (35%), Positives = 285/553 (51%), Gaps = 100/553 (18%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R NT D+   V  L++L+G R + +YD+  KTY+F++  +  V      EKV LL+E
Sbjct: 1   MKTRFNTFDIICGVAELQKLVGWRVNQIYDVDNKTYLFRMQGTGAV------EKVTLLIE 54

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           SG R HTT +   K   PSGF++KLRKH++ +RLE V+Q+G DRI+ FQFG G  A++VI
Sbjct: 55  SGTRFHTTRFEWPKNMAPSGFSMKLRKHLKNKRLEKVQQMGSDRIVDFQFGTGDAAYHVI 114

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           LELY +GN++LTD E T L +LR H + +  +    R +YP E  R  + T   +L A +
Sbjct: 115 LELYDRGNVILTDYELTTLYILRPHTEGE-NLRFAMREKYPVE--RAKQPTKELELEALV 171

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                       K+ E                              N+ +G   +Q    
Sbjct: 172 ------------KLLE------------------------------NARNGDYLRQ---- 185

Query: 242 TVLGEALGYGPALSEHIILDTGL------------------------------VPNMKLS 271
            +L   L  GPA+ EH++L  GL                                N KL 
Sbjct: 186 -ILTPNLDCGPAVIEHVLLSHGLDNHVIKKETTEETPEAEDKPEKGGKKQRKKQQNTKLE 244

Query: 272 EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI 331
           +      N + +L  AV   ++ + +  SG    +GYI+       K+  PTE+G+    
Sbjct: 245 QKPFDMVNDLPILQQAVKDAQELIAEGNSGK--SKGYIIQ-----VKEEKPTENGTVEFF 297

Query: 332 YD--EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +   EF P L  QF++ E   FE+F  A+DEFYS  ESQ+ + +   +E  A  KL+ + 
Sbjct: 298 FRNIEFHPYLFIQFKNFEKATFESFMEAVDEFYSTQESQKIDMKTLQQEREALKKLSNVK 357

Query: 390 MDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
            D   R+  L   Q+VDR  K AELI  N   VD AI AV+ A+A+++SW D+  +VKE 
Sbjct: 358 NDHAKRLEELTKVQDVDR--KKAELITSNQSLVDNAIRAVQSAIASQLSWPDIHELVKEA 415

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP-VEKVEVDLALSAHANARRW 506
           +  G+ VA  I +L LE N +SL+LS+  D  +D++   P V  V+VDLALSA ANARR+
Sbjct: 416 QANGDAVASSIKQLKLETNHISLMLSDPYDNDEDDDLKDPEVTVVDVDLALSAWANARRY 475

Query: 507 YELKKKQESKQEK 519
           Y++K+    K++K
Sbjct: 476 YDMKRSAAQKKKK 488


>gi|26334499|dbj|BAC30950.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  259 bits (662), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 169/477 (35%), Positives = 247/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|51593729|gb|AAH80716.1| Sdccag1 protein, partial [Mus musculus]
          Length = 443

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 169/477 (35%), Positives = 247/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|74152610|dbj|BAE42589.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 169/477 (35%), Positives = 246/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH++ RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKGRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|12837616|dbj|BAB23886.1| unnamed protein product [Mus musculus]
          Length = 438

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 168/477 (35%), Positives = 246/477 (51%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KAXLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD  
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKP 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLLRN 412


>gi|12857277|dbj|BAB30959.1| unnamed protein product [Mus musculus]
          Length = 415

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 168/475 (35%), Positives = 246/475 (51%), Gaps = 69/475 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNQVTMLL 410


>gi|54887337|gb|AAH37106.2| Sdccag1 protein [Mus musculus]
          Length = 415

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 168/475 (35%), Positives = 246/475 (51%), Gaps = 69/475 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    V  A+                             K   L
Sbjct: 160 --------AAEPLLTLERLTEVIAAA----------------------------PKGEVL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH ++++G   N K+ E  KLE   I+ +++ V + ED+L+   +
Sbjct: 184 KRVLNPLLPYGPALIEHCLIESGFSGNAKVDE--KLESKDIEKILVCVQRAEDYLRK--T 239

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P          Y+EF P L +Q     +++FE+FD A
Sbjct: 240 SNFNGKGYIIQKREAKPSLDADKP----AEDILTYEEFHPFLFSQHLQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGVIVKEAQAQGDPVACAIKELKLQTNHVTMLL 410


>gi|113911846|gb|AAI22665.1| SDCCAG1 protein [Bos taurus]
          Length = 443

 Score =  255 bits (652), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 167/477 (35%), Positives = 249/477 (52%), Gaps = 69/477 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R             
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDESDDVKFAVRERYPIDHAR------------- 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP    E    +       L G   G+                      L
Sbjct: 160 --------AAEPLLTLERLTEI-------LAGAPKGE---------------------LL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +++ G   N+K+ E  K E   ++ +++ + K E++++   S
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFPANVKVDE--KFESKDVEKVLVCLQKAEEYMKTTSS 241

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ Q + +    P  E    T+    Y+EF P L +Q     +++FE+FD A
Sbjct: 242 FN--GKGYII-QKREI---KPSLEVDKPTEDILTYEEFHPFLFSQHSQCPYIEFESFDKA 295

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 296 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRN 412


>gi|254566655|ref|XP_002490438.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238030234|emb|CAY68157.1| hypothetical protein PAS_chr1-4_0316 [Komagataella pastoris GS115]
 gi|328350832|emb|CCA37232.1| Uncharacterized protein YPL009C [Komagataella pastoris CBS 7435]
          Length = 1007

 Score =  254 bits (649), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 220/787 (27%), Positives = 381/787 (48%), Gaps = 122/787 (15%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+   V  L   I G R  N+Y +  + K+Y+FK      + +S +S    L
Sbjct: 1   MKQRISALDLKLIVSELSHSIKGYRLQNIYSMINNNKSYLFKF----AIPDSKKS----L 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++ESGV+LH T + R     PS F +KLRKH++ +RL +++Q+G DR+++F+F  GM  +
Sbjct: 53  VVESGVKLHLTDFQRPTTQQPSNFVVKLRKHLKAKRLTNLKQVGDDRLVVFEFSDGM--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR--SHRDDDKGVAIMSRHRYPT-EICRVFERTTAS 175
           Y++LE ++ GN++L D +  ++TL R  S ++++         +Y T E   +F+   A 
Sbjct: 111 YLVLEFFSGGNVILLDQDQKIMTLQRLVSEKENN--------EKYATGEFYNMFD---AK 159

Query: 176 KLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA 235
           KL +      E  A+           + + SKE++      + F + +         A+ 
Sbjct: 160 KLFS------EAPADHA---------IKSYSKEDIIQWLDTQDFKIEQ---------AKK 195

Query: 236 KQPTLKTVLGEALGY--GPALSE---HIIL-DTGLVPNMKLSEVNKLEDNAIQVLVLAVA 289
              T+K    + L +   P LS    HI+L + G+ P    S + + ED ++  L+ ++A
Sbjct: 196 TGKTMKPYTIQKLLFVNAPHLSSDLIHIVLREKGIDPTSD-STLYRSED-SLAKLLESLA 253

Query: 290 KFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSR--E 347
           + E  L ++++     +GYI+ +   +   H P+  GS   IYDEF P      RS   +
Sbjct: 254 EAEIRLSELLTRKEDVDGYIVSKRNPI---HDPSTEGSLEYIYDEFHPYEPTHKRSSDTQ 310

Query: 348 FVKFETFDAALDEFYSKIESQR---AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
               + ++  +D+F++ IE  +    EQQ K     A  +L  +  +   ++  L +   
Sbjct: 311 IKTIKGYNKTIDDFFTTIEVSKHSLKEQQQKVN---AERRLQSVKSENLEKIAKLTEAQL 367

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYL 463
            +++  E+I    + V+    AV+  L  +M W  + +++  E+K GN +A LI+  L L
Sbjct: 368 LNIQKGEVIMVYSDVVEQCKAAVQSLLDQQMDWNHIEKLIGVEKKRGNEIAKLINLPLNL 427

Query: 464 ERNCMSLLL--------------------------------------SNNLDEMDDEEKT 485
             N +SL L                                       ++      ++KT
Sbjct: 428 LENKISLALPLVNFDESSEEEDESDSEDESDSEDSSSSDEQETKNKKQSSTKHSRKKDKT 487

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV 545
           +    V +DL+LSA+ANA  +++ KK  + K  KT      A K+AE K    + ++K  
Sbjct: 488 I---NVNIDLSLSAYANASTYFDAKKIAQDKLVKTEKNSELAIKSAESKINRDLKKQKKT 544

Query: 546 ---------ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM-SKGDVYV 595
                    A +  +R   WFEK+ WFISS+ +L ++GRD QQ + I   Y  +  D  V
Sbjct: 545 ESSQVNNSNAALRQIRDKFWFEKYFWFISSDGFLCVAGRDDQQFDHIYFEYFDNDNDFLV 604

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
             +L GA   ++KN    + V P T  QAG F++  ++AW++KMV+S W V    VSK  
Sbjct: 605 SNELEGALKVIVKNPFLNKDVAPNTFIQAGAFSLSTTKAWENKMVSSPWIVTGSSVSKRD 664

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFE 715
             G  L  G   I  +K FLPP  ++MGFG+L+  D+ +   +L++ + R EE G++  +
Sbjct: 665 VDGSALAPGLVNITTEKQFLPPCQMVMGFGMLWLGDKRTNDDYLSKSQSRTEELGLESVD 724

Query: 716 DSGHHKE 722
            +   K+
Sbjct: 725 VNAFKKK 731



 Score = 43.9 bits (102), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 50/82 (60%), Gaps = 5/82 (6%)

Query: 843 ISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESI-VRKTKIEGGKISRGQKGKLK 901
           +SK E+ +L+K Q  +  +P V+  +E  K   S  E + + + + +   + RG++ KLK
Sbjct: 743 LSKYEK-ELEKKQIQNDKEPSVDNAEEDSKSIVSSLEGLDINENQTQ---VKRGRRAKLK 798

Query: 902 KMKEKYGDQDEEERNIRMALLA 923
           K+K+KY DQDEE++  RM LL 
Sbjct: 799 KIKQKYADQDEEDKLKRMELLG 820


>gi|400593352|gb|EJP61303.1| DUF814 domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 1062

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 158/461 (34%), Positives = 237/461 (51%), Gaps = 50/461 (10%)

Query: 331 IYDEFCPLLLNQFRSR---EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK 387
           +YD+F P +  +F      E ++FE ++  +DEF+S +E QR E +   +E AA  K++ 
Sbjct: 292 LYDDFHPFVPTKFEKNDDIEILRFEGYNRTVDEFFSSLEGQRLESRLMEREAAAQRKIDA 351

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              DQENR+  L+     + + A  IE N+E V  A+ ++   L   M W D+ ++V  E
Sbjct: 352 ARQDQENRIRGLQTAQLDNFRKAAAIEANIERVQEAMDSINGLLNQGMDWVDIGKLVARE 411

Query: 448 RKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDEEKTLPVEK---------------- 490
           +K  NPVA LI   L L  N +S+ LS   D   ++E+    +                 
Sbjct: 412 QKKNNPVATLICLPLNLVDNVISVRLSEEDDVASEDEEPYETDDSDVRFEDDLDTTESGL 471

Query: 491 --------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--LQIL 540
                   VE+ L LS  +NAR +Y+ +K    K+EKT     KA K+ E K +  L+ +
Sbjct: 472 KNSDKTIVVELTLNLSPWSNARGYYDQRKNAVVKEEKTQLQADKAIKSTEHKVKQDLKKV 531

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
            ++  A +  +R   WFEKF WFISS+ YLV+  +D  Q E++ ++++  GD + HAD  
Sbjct: 532 LKQEKALLQPIRNPMWFEKFYWFISSDGYLVLGAKDKSQAELLYRQHLRSGDAFCHADAS 591

Query: 601 GASSTVIKNHR--PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTG 658
            A+  V+KN+    + P+ P TL QAG  ++C S+AWDSK    AWWV  +QVSK+  TG
Sbjct: 592 NAAIVVVKNNSKTADVPIAPATLAQAGQLSICSSEAWDSKAGIGAWWVNSNQVSKSTSTG 651

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSG 718
           + L  G+F I G+KNFLPP  L++G  +LF++ E S   H N+ R+  E    D      
Sbjct: 652 DILQPGNFNISGEKNFLPPGQLVLGLSVLFKISEES-KIHHNKHRIPDEPAVSD-----A 705

Query: 719 HHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASN 759
             KE              P +E  +  N   PA S  N SN
Sbjct: 706 PRKETY------------PNSEQEATTNDIQPAASTANGSN 734



 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 64/207 (30%), Positives = 101/207 (48%), Gaps = 30/207 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L + L  +R +N+YDLS + ++FK              K  LL+
Sbjct: 1   MKQRFSSLDVKVVAHELSQSLTSLRVANIYDLSTRIFLFKFAKPG--------TKKQLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G R HTT + R    TPS F  +LRK ++TRRL  V Q+G DRI+ FQF  G   + +
Sbjct: 53  DIGFRCHTTEFVRTTAGTPSAFVCRLRKALKTRRLTSVSQIGTDRILEFQFSDGQ--YRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS-------------------HRDDDKGVAIMSRHRY 161
            LE +A GN +LTD +  +L L R+                    R +  G+  +S+ R 
Sbjct: 111 FLEFFASGNAILTDVDLRILALYRNVSEGEGQESQKVGLLYSLKSRQNFFGIPDLSQDRV 170

Query: 162 PTEICRVFERTTASKLHAALTSSKEPD 188
            T +    E+ + +K  ++  + K+ D
Sbjct: 171 RTALAAAIEKVSTTKAASSNRTPKQGD 197


>gi|124805420|ref|XP_001350435.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
 gi|23496557|gb|AAN36115.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
          Length = 2158

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 147/405 (36%), Positives = 227/405 (56%), Gaps = 36/405 (8%)

Query: 332 YDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHK 384
           + EF P++L     +      +++ F+ ++  +D ++SK+E S+  +QQ   K   A  K
Sbjct: 436 FTEFSPIILKNHEMKLNEGKIKYISFDDYNLCVDTYFSKLELSKYDKQQEITKSKNAITK 495

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           ++KI +D E R+  L++EV    K   LI+ N   ++  I  +R AL+   +WE +   +
Sbjct: 496 VDKIKLDHERRIEQLEKEVLLLKKKITLIQLNDVLIEEGIKLMRSALSTSANWEKIWEHI 555

Query: 445 KEERKAGNPVAGLIDKLYLERNC-MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANA 503
           K  +K  +P+A  I  +   +NC M  LLS    + DD +      K+  D   +   + 
Sbjct: 556 KIFKKQEHPIAVRIKSVNF-KNCEMDYLLS----DCDDRKGN----KMGDDGDDNDDDDD 606

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKA----------------AEKKTRLQILQEKTVAN 547
                    +   + KT  A  K  K                  +   + +   + +V  
Sbjct: 607 GDDDNNNNNKSCVKPKTFAAEEKIRKTKMATDFAVKKVEKKKKNKDNNKQKGKAKSSVGQ 666

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           I  +RKV+WFEKF+WFISSENYLVI+GRDA QNE++ +RY  K D+YVHAD+HGA+S +I
Sbjct: 667 IQKLRKVYWFEKFHWFISSENYLVIAGRDALQNEILFRRYFQKNDIYVHADIHGAASCII 726

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           KN   + P+P  TL++AG   +C S AW++K++TSAWWVY +QVSK+AP+GEYL  GSF+
Sbjct: 727 KNPYKDTPIPDKTLSEAGQLAICRSSAWNNKIITSAWWVYYNQVSKSAPSGEYLKTGSFV 786

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           IRGKKN+LP   L MGF +LF+++++     LN   +  EE  +D
Sbjct: 787 IRGKKNYLPHVKLEMGFCVLFQIEKN---EDLNVENLPLEENTID 828



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 94/147 (63%), Gaps = 9/147 (6%)

Query: 1   MVKVRMNTADVAAEVK-CLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A V  C + ++G   +N+Y++S K Y+ K         S + +K+  L
Sbjct: 1   MAKQRLTALDIRAIVTLCKKNIVGCIVTNIYNISNKIYVIKC--------SRKEQKLFFL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PS FT+KLRKH+R+R++ +++QLG DR+I  QFG    A +
Sbjct: 53  VEAEKRIHITEWKREKDVMPSSFTMKLRKHLRSRKISNIKQLGADRVIDIQFGYDEKASH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSH 146
           +I+ELY  GNI+LTD  + +L++L+S+
Sbjct: 113 LIVELYIAGNIILTDENYKILSILKSN 139


>gi|224108804|ref|XP_002314973.1| predicted protein [Populus trichocarpa]
 gi|222864013|gb|EEF01144.1| predicted protein [Populus trichocarpa]
          Length = 235

 Score =  252 bits (643), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 150/221 (67%), Positives = 171/221 (77%), Gaps = 9/221 (4%)

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
           MAE IE+NL+ VD+AILAV VALA  + WEDLARMVK+E+KAGNP+AGLIDKL+ E+NCM
Sbjct: 1   MAEFIEHNLQGVDSAILAVPVALAKGIGWEDLARMVKDEKKAGNPIAGLIDKLHFEKNCM 60

Query: 469 SLLLSN-NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +LL++  ++  M        + K       S+HANA+RWYELKKKQE KQEKT TAH KA
Sbjct: 61  ALLIAIISMKWM--------MMKRHFQCISSSHANAQRWYELKKKQECKQEKTFTAHKKA 112

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           FKAAEKK  LQ+ QEK+VA ISHM KVHW EKFNWFI + NYLVIS RDAQQNEM VKRY
Sbjct: 113 FKAAEKKIHLQLSQEKSVATISHMHKVHWLEKFNWFIGTWNYLVISRRDAQQNEMTVKRY 172

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           MSKGD+ V     GASSTVIKNHRPEQPVPPLTLNQ    T
Sbjct: 173 MSKGDLEVCPCRSGASSTVIKNHRPEQPVPPLTLNQGEYLT 213


>gi|66357888|ref|XP_626122.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
           II]
 gi|46227289|gb|EAK88239.1| MJ1625/yease Yp1009cp-like HhH domain [Cryptosporidium parvum Iowa
           II]
          Length = 1378

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 140/357 (39%), Positives = 207/357 (57%), Gaps = 30/357 (8%)

Query: 351 FETFDAALDEFYSKI----ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
            + F   +DEFYS I    ES+ A Q+HK      + K++K+ +DQE R+  L  E +  
Sbjct: 370 LDNFCKCVDEFYSSIDIVKESKFATQEHKT----IYSKVDKVKIDQERRLEGLSSEKEAC 425

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
           +  A+ +E + E ++  +  +R  +A    W+D+   +++++K  +P+A  I  L L+ +
Sbjct: 426 IVRAKFMESHQEILEKILQLIRHLIATGAQWQDIWNEIQQQKKNNHPLARHIKSLNLKDD 485

Query: 467 CMSLLLSNNLDEMDDEEKTLPV-----EKVEVDLALSA--HANARRWYELKKKQESKQEK 519
            + +L S    + D   +T PV     + +E DL +S    +N R  Y   K    K EK
Sbjct: 486 KVKILFS----QRDLGSETTPVVDQIGKSIEFDLIISKSIQSNIRFQYMESKALAEKFEK 541

Query: 520 TITAHSKAFKA--------AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           T  A+  A K         AEK ++  +     V  I  +R  +WFEKF WFISS+ YL+
Sbjct: 542 TQLAYKIALKKVTNIAKKDAEKASKGLV---SNVPRIKKLRAQYWFEKFYWFISSDGYLI 598

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           I G DA QNE++ +RY+ K D Y+HAD+HGA++ ++KN    Q +P  TL +AG  ++C+
Sbjct: 599 IGGHDASQNELLFRRYLEKNDRYIHADIHGATTCIVKNTNNVQDIPLNTLCEAGQMSICY 658

Query: 632 SQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           S+AW +K V SAWWVYP QVSK AP+GEYL+ GSF+IRGKKNFLPP  L MG  L F
Sbjct: 659 SKAWVNKTVISAWWVYPDQVSKNAPSGEYLSTGSFVIRGKKNFLPPLKLEMGCALYF 715



 Score =  125 bits (313), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 92/154 (59%), Gaps = 15/154 (9%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM + D+ A V  + + L G +  N+YD++ +TY+FK          G  EK  LL
Sbjct: 4   MVKSRMTSVDICAMVHGISKDLKGQKLINIYDINSRTYLFKF---------GGEEKKFLL 54

Query: 60  MESGVRLHTTAYARDKK-----NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +ESG+R HTT + R+ +     ++ S F  KLR++IR ++L+D+ Q+G DRI+   FG G
Sbjct: 55  VESGIRFHTTQWKRENEHKTSVSSISFFNSKLRRYIRNKKLDDISQMGMDRIVKLTFGFG 114

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRD 148
            N  Y+I E +  GNI+LTD  + +L +LR   D
Sbjct: 115 DNTFYLIFEFFVAGNIILTDCNYKILVILRDTND 148



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 864  VEREKERGKDASSQPESI-VRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
            +ER  +  ++A+S   +I     K +   + RG+K KLKK+ +KYG+QD+EER I+M L 
Sbjct: 1142 LERLPKTSEEATSTKNNINSTNNKQKNSALPRGKKSKLKKVADKYGEQDDEERKIKMMLF 1201

Query: 923  A 923
             
Sbjct: 1202 G 1202


>gi|349605644|gb|AEQ00813.1| Serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 388

 Score =  249 bits (636), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 157/454 (34%), Positives = 234/454 (51%), Gaps = 70/454 (15%)

Query: 23  GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGF 82
           GMR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS F
Sbjct: 1   GMRVNNVYDVDNKTYLIRLQKPDF--------KATLLLESGIRIHTTEFEWPKNMMPSSF 52

Query: 83  TLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTL 142
            +K RKH+++RRL   +QLG DRI+ FQFG    A+++I+ELY +GNI+LTD E+ +L +
Sbjct: 53  AMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEYLILNI 112

Query: 143 LRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLHAALTSSKEPDANEPDKVNEDGNN 201
           LR   D+   V    R RYP +  R  E   T  +L   + S+                 
Sbjct: 113 LRFRTDESDDVKFAVRERYPVDHARAAEPLLTLERLTEIIASA----------------- 155

Query: 202 VSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILD 261
                                             K   LK VL   L YGPAL EH +++
Sbjct: 156 ---------------------------------PKGELLKRVLNPLLPYGPALIEHCLIE 182

Query: 262 TGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHP 321
            G   N+K+ E  K E   I+ +++ + K ED+++   + +   +GYI+ + +      P
Sbjct: 183 NGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--TTSNFSGKGYIIQKREM----KP 234

Query: 322 PTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
             E    TQ    Y+EF P L +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E
Sbjct: 235 SLEVDKPTQDILTYEEFHPFLFSQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQE 294

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
             A  KL+ +  D E+R+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ W 
Sbjct: 295 KQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWT 354

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           ++  +VKE +  G+PVA  I +L L+ N +++LL
Sbjct: 355 EIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLL 388


>gi|34784822|gb|AAH56687.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 426

 Score =  249 bits (636), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 161/454 (35%), Positives = 240/454 (52%), Gaps = 68/454 (14%)

Query: 24  MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFT 83
           MR +NVYD+  KTY+ +L             K  LL+ESG+R+HTT +   K   PS F 
Sbjct: 1   MRVNNVYDVDNKTYLIRLQKPDF--------KATLLLESGIRIHTTEFEWPKNMMPSSFA 52

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
           +K RKH+++RRL   +QLG DRI+ FQFG    A+++I+ELY +GNI+LTD E+ +L +L
Sbjct: 53  MKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHLIIELYDRGNIVLTDYEYVILNIL 112

Query: 144 RSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVS 203
           R   D+   V    R RYP +  R  E          LT  +  +             V+
Sbjct: 113 RFRTDEADDVKFAVRERYPLDHARAAE--------PLLTLERLTEI------------VA 152

Query: 204 NASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTG 263
           +A K  L                             LK VL   L YGPAL EH +L+ G
Sbjct: 153 SAPKGEL-----------------------------LKRVLNPLLPYGPALIEHCLLENG 183

Query: 264 LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNK---HLGKDH 320
              N+K+ E  KLE   I+ +++++ K ED+++   + +   +GYI+ + +    L  D 
Sbjct: 184 FSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TTSNFSGKGYIIQKREIKPCLEADK 239

Query: 321 PPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDA 380
           P  +  +    Y+EF P L +Q     +++FE+FD A+DEFYSKIE Q+ + +   +E  
Sbjct: 240 PVEDILT----YEEFHPFLFSQHSQCPYIEFESFDKAVDEFYSKIEGQKIDLKALQQEKQ 295

Query: 381 AFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDL 440
           A  KL+ +  D ENR+  L+Q  +      ELIE NL+ VD AI  VR ALAN++ W ++
Sbjct: 296 ALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNLQIVDRAIQVVRSALANQIDWTEI 355

Query: 441 ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
             +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 356 GLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRN 389


>gi|440301763|gb|ELP94149.1| zinc knuckle domain containing protein, partial [Entamoeba invadens
           IP1]
          Length = 703

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 137/350 (39%), Positives = 215/350 (61%), Gaps = 18/350 (5%)

Query: 344 RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
           + R F  F++F  A+DEF+S IE Q  E++ + K+     K+  +    E R   L ++ 
Sbjct: 1   KGRLFDTFDSFCDAMDEFHSHIEKQEYEEELEKKDATMKKKIQAVIDGHEKRYKGLLEKA 60

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP--VAGLIDKL 461
           +  V  A+++E ++  VD  I  + V L+ +M WE +  ++ +  K  +P  VA  I K 
Sbjct: 61  EEMVVKAKVVESHIIIVDQLIKEINVFLSEKMQWERVEEII-QSAKENDPTSVAQYIKKF 119

Query: 462 YLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA--NARRWYELKKKQESKQEK 519
               + + L L N  ++           K++VD+ L+ +   N R +YE+++   +K +K
Sbjct: 120 DFANDVVVLSLENANNQ-----------KIDVDVLLTKNGFENVRNFYEMRRVVLAKADK 168

Query: 520 TITAHSKAFK-AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           T+ +   A + A +K+ R+   ++  +A++  MR+  WFEKF+WFISSEN+++ISG+DA 
Sbjct: 169 TLESRETAIQQATQKQERVAKTKQIDLADLKKMRRRFWFEKFHWFISSENFVIISGKDAL 228

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           QN+++ +RYM   D+YVHAD+HGA+S +IK  +  + +   TL QAG   VC S AW +K
Sbjct: 229 QNDVMYRRYMKNTDIYVHADIHGAASCLIKGVKG-KVIGAATLEQAGKVAVCRSSAWTNK 287

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +VTSA+WVY  QVSKTAP+GEYL  GSFMIRGKKN+LPP PL+ G G++F
Sbjct: 288 IVTSAYWVYSDQVSKTAPSGEYLVTGSFMIRGKKNYLPPAPLVFGLGIVF 337


>gi|147771938|emb|CAN75699.1| hypothetical protein VITISV_035986 [Vitis vinifera]
          Length = 327

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 143/288 (49%), Positives = 171/288 (59%), Gaps = 59/288 (20%)

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
           KG++      HGASSTVIKNH+PE PVPPLTLNQAGCFTVCHSQ WDSK+VTSAWWVYPH
Sbjct: 9   KGNMISMKYPHGASSTVIKNHKPEHPVPPLTLNQAGCFTVCHSQVWDSKIVTSAWWVYPH 68

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           Q                                          SSLGSHL ERRVRGEEE
Sbjct: 69  Q------------------------------------------SSLGSHLYERRVRGEEE 86

Query: 710 GMDDFEDSGHHKENSDIESEKDDTDEK---------------PVAESLSVPNSAHPAPSH 754
           G  DFE++   K NSD ESEK++TDEK               P+ E  S  +SAH   + 
Sbjct: 87  GAQDFEENESLKGNSDSESEKEETDEKRTAESKSIXDPPTHQPILEGFSEISSAHNELTT 146

Query: 755 TNASNVDSHEFPAEDKTISNGIDSK-IFDIARNVAAPVTPQLEDLIDRALGLGSASISST 813
           +N  +++  E P E++ + NG DS+ I DI+    + V PQLEDLID AL LGS + S  
Sbjct: 147 SNVGSINLPEVPLEERNMLNGNDSEHIDDISGRHVSSVNPQLEDLIDWALELGSNTASGK 206

Query: 814 KHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVD 861
           K+ +ET+Q DL E+  H  R A VR+KPYISKAERRKLKKGQ +S  D
Sbjct: 207 KYALETSQVDL-EDHNHEXRKAKVREKPYISKAERRKLKKGQKTSTSD 253


>gi|68533893|gb|AAH99277.1| LOC733300 protein [Xenopus laevis]
          Length = 453

 Score =  240 bits (613), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 161/474 (33%), Positives = 246/474 (51%), Gaps = 63/474 (13%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R NT D+ A +  L   L+GMR  NVYD+  KTY+ +L             K +LL+
Sbjct: 1   MKSRFNTIDIRAVIAELTDSLLGMRVHNVYDIDNKTYLIRLQKPDS--------KAVLLV 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R  YP +             HA 
Sbjct: 113 IVELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVREHYPID-------------HAK 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                   A EP                            LS    K   D A+ K   L
Sbjct: 160 --------APEP---------------------------LLSVERLKEVLDNAK-KGDQL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YG  L EH +LDTGL  N+K+ +++  ED  ++ +  A+ K E ++   ++
Sbjct: 184 KKVLNPHLPYGATLIEHCLLDTGLSSNVKVDQISGPED--LEKVHTALRKAEGYMD--LT 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
            +   +G+I+ Q +       P ++       +EF P L  Q  +  +++ ++F+  +DE
Sbjct: 240 QNFNGKGFII-QKREKKPSLEPDKASEDIFTNEEFHPFLFAQHANSTYIELDSFNKTVDE 298

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           F+SK+E Q+ + +   +E  A  KL  +  D E+R+ +L+   D      ELIE NL+ V
Sbjct: 299 FFSKLEGQKIDIKALQQEKQALKKLGNVRKDHEHRLESLQYAQDADKAKGELIEMNLDIV 358

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           D AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N ++++L N
Sbjct: 359 DRAIQVVRSALANQIDWTEIGLIVKEAQIQGDPVALAIKELKLQTNHITMMLKN 412


>gi|344230527|gb|EGV62412.1| hypothetical protein CANTEDRAFT_126343 [Candida tenuis ATCC 10573]
          Length = 969

 Score =  238 bits (606), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 194/715 (27%), Positives = 334/715 (46%), Gaps = 97/715 (13%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           +++E G R+H T Y R+ + TPS F  KLRKH++TRRL  ++Q+G DRI++ +F  G+  
Sbjct: 1   MIVEFGNRIHFTDYERNIEPTPSNFVTKLRKHLKTRRLSSIKQIGDDRILVMEFSDGL-- 58

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            Y++LE ++ GNI+L D +  +L L R    +D   A+        E   +F+R+     
Sbjct: 59  FYLVLEFFSAGNIVLLDHDRKILMLQRVVDSNDDKFAV-------NETYNMFDRSL---- 107

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
                   EP      +V +  + +     +    Q   K          NS+   + K+
Sbjct: 108 ---FEQEPEPYVKRQYEVEQINSWIEKEKTKVEDNQNRLKEL-------ANSHTPTKLKK 157

Query: 238 PTLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
             + ++          LS  +IL T    G+  +    E +  E   +  +V  + + ED
Sbjct: 158 SKIFSIHKLLFVNASHLSSDLILKTLNENGIRSSSSCFEFHDSE--MLSTIVATMNQCED 215

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQFRSREFVKF- 351
               ++ G  + EG I+ +       +   E+  + Q ++DEF P     F+     KF 
Sbjct: 216 EYVKILQGGEI-EGIIVSKKNT----NATEETAENLQYLFDEFHPF--RPFKDGSLYKFT 268

Query: 352 --ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
             + ++  LD+F+S +ES + E + + ++  A  +L+K   ++  ++ +L  E + ++K 
Sbjct: 269 SIQGYNKTLDQFFSTLESLKNEIKIENQKQLAMKRLDKAKNERVKQIESLINEKNANIKK 328

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
            +LI  N   V   I  +   L  +M W D+ + ++ ++ +G+ +   I    L  N + 
Sbjct: 329 GDLIILNANLVSGCIDFINGMLEKQMDWHDIEKYIELQKSSGDDITNAIQ---LPLNLLE 385

Query: 470 LLLSNNLDEMDDEEKT-------------------------------------------- 485
             +  NL + D +E                                              
Sbjct: 386 NKIKLNLPDTDVDENVESSETSSSDTESESDSSSSDSDSDSDSDSDSDSDFRGTKKSKSK 445

Query: 486 ------LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTR--- 536
                 +P   V +DL+LS +ANA  +++ KK  E KQ K       A + AE+K     
Sbjct: 446 SKKTKSVPTISVWIDLSLSPYANASTFFDSKKSAEVKQLKVEKNTGIALQNAERKITHDL 505

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
            + LQ ++ A ++ +R+  WFEKF WF++S+ YL +SG+D  QN+MI  RY +  D +V+
Sbjct: 506 TKALQNESEA-LNKVREKFWFEKFYWFVTSDGYLCLSGKDDLQNDMIYYRYFNDDDFFVY 564

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           +D+ GA    IKN    + VPP T+ QAG F++ +S++W +K  +SAW++    VSK   
Sbjct: 565 SDIEGALKVFIKNPYKGETVPPSTIWQAGMFSLSNSESWSNKSSSSAWYLPGPGVSKKDI 624

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
            G  L  G F  +GKK  +PP  L+MGFG+ F  D+ +      +R VR EE G+
Sbjct: 625 DGSLLRPGKFNFKGKKEHMPPVQLVMGFGIYFVGDDETTKRAREKRLVRQEEMGL 679


>gi|68071251|ref|XP_677539.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497695|emb|CAH96713.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 1012

 Score =  237 bits (605), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 143/370 (38%), Positives = 218/370 (58%), Gaps = 13/370 (3%)

Query: 361 FYSKIESQRAEQ-QHKAKEDAAFHKLNKIHMDQENRVH-TLKQEVDRSVKMAELIEYNLE 418
           + +K+ES + ++ Q   K   A  K++KI +D E R+  + K++V    K   LI+ N E
Sbjct: 7   YLTKMESTKYDKHQEMNKRKNALTKIDKIKLDHERRIEGSTKKQVSILKKKISLIQLNDE 66

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC-MSLLLSNNLD 477
            V  AI  +R A++   +WE +   +K  +K  +P+A  I  +    NC M LLL+++  
Sbjct: 67  SVGEAIKLMRSAISTSANWEQIWDHIKLFKKRDHPIALKIMSVNFN-NCEMELLLNDDDI 125

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK----TITAHSKAFKAAEK 533
           E + ++  L     +  +A     N++    L+KK E K  K    T  A  K  K  + 
Sbjct: 126 EENGDDNNLKNNSWKEKIA---DKNSKTC-TLRKKAEEKIRKIKMSTNMAVKKVEKKKKD 181

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
           K   Q  + K+V  I  +RKV WFEKFNWFISSENYLVISG+D+ QNE++ +RY    D+
Sbjct: 182 KDTKQKGKNKSVFQIKKLRKVFWFEKFNWFISSENYLVISGKDSLQNEILFRRYFQNNDI 241

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK 653
           YVHAD+HGA++ +IKN   +  +P  TL +AG   +C S +W++K++TSAWWVY HQVSK
Sbjct: 242 YVHADVHGAATCIIKNPYKDISIPEKTLFEAGQLAMCRSSSWNNKIITSAWWVYYHQVSK 301

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
           TAPTGEY+  GSF+IRGKKN+LP   L MG  ++F++++  +  +  E  + G+++  + 
Sbjct: 302 TAPTGEYIKTGSFVIRGKKNYLPYAKLEMGLCIIFQVNK-QMDDNNKENALNGDKQNYES 360

Query: 714 FEDSGHHKEN 723
                 + EN
Sbjct: 361 INSGDENGEN 370


>gi|403222989|dbj|BAM41120.1| uncharacterized protein TOT_030000383 [Theileria orientalis strain
           Shintoku]
          Length = 1119

 Score =  235 bits (599), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 146/422 (34%), Positives = 223/422 (52%), Gaps = 57/422 (13%)

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
           DIVP GYI    K                + D+F P    + ++ E+   E ++ ALD F
Sbjct: 243 DIVP-GYIYRNAKG---------------VMDDFGPF---ELQNAEY--HEDYNYALDAF 281

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           ++K E  + E++ ++K+     KL KI  DQ+ R   L +E+    K   ++E N++ VD
Sbjct: 282 FTKNELVKQEKKTESKKPT---KLTKIKADQDKRESKLMEEIMGYDKQIRVLEENIDIVD 338

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
             +   +  +A+  SW D+   ++ +RK  +P+   I ++ +    +  + +   +E DD
Sbjct: 339 NCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVCYIKEINIPNQTLVFVSNPEGNERDD 398

Query: 482 E--EKTLPVEKVEV-DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           E   K L  E+V V D  L+ + N +++Y  +KK E+K E+T      A K   K    Q
Sbjct: 399 EPERKELVEEQVVVLDYRLTGYQNLKKFYINRKKAENKLERTKIGKEYALKKVAKSLSKQ 458

Query: 539 ILQEK-----TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
              +K         IS +RK  WFEKF WFI+S+ YLV++GRD+ QNE++VK+Y++KGD+
Sbjct: 459 PEVKKGDRRTREVKISSLRKRFWFEKFYWFITSQGYLVLAGRDSLQNELLVKKYLTKGDL 518

Query: 594 YVHADLHGASSTVIKNHRPEQPVPP-------------------------LTLNQAGCFT 628
           Y HAD+HGASS ++K +  E                              +++ +A  F 
Sbjct: 519 YFHADIHGASSVILKTNSQELIKSSESAEVSEVEKAGGRGNEEEFIAKIRVSIEEAANFA 578

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           VCHS AW+ K    +WWVY HQVSKT PTGEY+  GSF+IRGKKN+L P  L MG   LF
Sbjct: 579 VCHSNAWNDKFSVQSWWVYWHQVSKTPPTGEYVPQGSFVIRGKKNYLQPQKLEMGITYLF 638

Query: 689 RL 690
           ++
Sbjct: 639 QV 640



 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 63/171 (36%), Positives = 94/171 (54%), Gaps = 9/171 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MV+ R+N  DVA  V  L++ L  +   N+YD++ + +I K         S    K+ +L
Sbjct: 1   MVRERLNAIDVAISVANLKKTLDNITLVNIYDITNRLFILKF--------SRNENKIYVL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+HTT + R   + PS F  KLRKH+R RRL DV+Q+  DRII F F    +A +
Sbjct: 53  IEIGCRIHTTQFLRSVDHLPSNFNAKLRKHLRNRRLRDVKQMSQDRIIDFTFSSEEHAMH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           +I++L+  GNI LTD E+ VL +L+     D    + + +    E    FE
Sbjct: 113 LIVQLFLPGNIYLTDHEYKVLAVLKPKNTGDNFFKVGTNYVCDMEYNSWFE 163


>gi|399216143|emb|CCF72831.1| unnamed protein product [Babesia microti strain RI]
          Length = 933

 Score =  232 bits (592), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 149/453 (32%), Positives = 246/453 (54%), Gaps = 53/453 (11%)

Query: 264 LVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPT 323
           +V N ++SE N   D   + LV A+ K  + L+ +  G+    GY+ +  K++   +   
Sbjct: 209 IVHNEQISEDNI--DQCAERLVCAILKISELLETLKKGN--NGGYVTLDPKYV---NSSL 261

Query: 324 ESGSSTQIYDEFCPLLLNQFRSREFVKFETFD------------------------AALD 359
           +   +T + D + P++  +  +R  V F +++                          LD
Sbjct: 262 DCIPATALID-YSPIIA-EIDTRNCVSFNSYNEVSYFFVRIGYYNLIIEQSKIKISKCLD 319

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
            ++ K E+     +  +K +       KI +DQE R+  +K +V  + K A LI+ +   
Sbjct: 320 FYFGKFETFEKPTKKPSKAE-------KIKIDQEKRISNMKTQVQIAEKNAYLIDKHSAL 372

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VD  I  +R  +A    W+D+   ++ +++ G+ +A L D++  +   + L L  N D+ 
Sbjct: 373 VDECISLMRTLIATGSRWDDIWDEIELQKQMGHEIAILFDRVDFKTGEIFLSLKENSDDE 432

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
           D       V  V V +  S  +N R  + ++K   +K ++T  + + A K  +K  +   
Sbjct: 433 D-------VCIVPVSVNQSVFSNLRGIHNMRKNILAKIDRTGLSMAMAIKNVQKNDKTPN 485

Query: 540 LQEKT----VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
             +K+    V  I  ++K +WFEKF WFISS++YLV++GRD+ QNE++VKR+M   D+Y+
Sbjct: 486 KSDKSSTKQVERI-KVKKRYWFEKFKWFISSDDYLVLAGRDSIQNEILVKRHMESNDIYI 544

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTA 655
           HAD+HGA+S ++KN+  + P+P  TL +AG F+VC+S AW +K +TSAWWV   QVSKT 
Sbjct: 545 HADIHGAASCIVKNNSSD-PIPQRTLIEAGQFSVCNSSAWKAKFMTSAWWVESSQVSKTP 603

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            TGEYL  GSF+IRGKKNFLPP  L MG  +++
Sbjct: 604 ETGEYLPSGSFVIRGKKNFLPPSKLEMGLAVIY 636



 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 87/139 (62%), Gaps = 9/139 (6%)

Query: 6   MNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K ++  ++G    N+YD+S K YI K+ N           K  LL+E+G 
Sbjct: 4   MTSLDICAVLKEIKEAIVGGSVINLYDVSKKVYILKVSN--------RDSKFFLLLEAGS 55

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T + R K + PSGFT+KLRKH++ +R+  VRQLG DR++   FG G   H++I++ 
Sbjct: 56  RIHLTQFMRSKDSMPSGFTMKLRKHLKGKRVSKVRQLGLDRVVDIVFGTGDYEHHLIIQF 115

Query: 125 YAQGNILLTDSEFTVLTLL 143
           Y  GNI LTD+E+ +LT L
Sbjct: 116 YVSGNIFLTDNEYKILTSL 134


>gi|294875379|ref|XP_002767293.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239868856|gb|EER00011.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 1087

 Score =  232 bits (592), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 142/357 (39%), Positives = 217/357 (60%), Gaps = 11/357 (3%)

Query: 348 FVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
            V++ +F   +D++Y+++   + E Q   K+     K+  I  DQ  R+  L++E     
Sbjct: 365 VVEYPSFTECVDDYYTRLMRAQLEGQLVQKQSQMISKVENIKSDQRRRMGELEKEQQSLW 424

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + A  +E N    DAAI  V   LA ++ W++L   VK++++AG+P+A  I +L L++N 
Sbjct: 425 EQAVALEANTTLADAAIQMVNALLAAKLRWDELTIAVKQQQRAGHPLAMHIRQLALDKNR 484

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +S++L       DD++    VE V +DL  +A AN    +E +K  + K  KT    ++A
Sbjct: 485 ISIVLEKAASTDDDDDGATTVE-VWLDLGRTAQANVALLHEKRKGMQEKMGKTEEQMARA 543

Query: 528 FKAAEKKTRLQILQEKTVAN---------ISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
            K AEK+ + +       A          ++  RK  WF+KF WFISS+  LV++GRDAQ
Sbjct: 544 VKMAEKRLKGKGAGGNQAAAALGGAEKQLLAKRRKKFWFQKFFWFISSDRLLVLAGRDAQ 603

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           QNE++ +RY++  D+YVHADL GA++ VIK  +     P  TL +AG +++C S+AWD+K
Sbjct: 604 QNELLWRRYLAPTDIYVHADLAGAATVVIKMPKGGVEPPQRTLAEAGQYSLCRSRAWDNK 663

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP-LIMGFGLLFRLDESS 694
           +VTSAWWV+  QVSKTAPTGE+L+ GSFMIRGKKNFLPP   L MG G+++ + + S
Sbjct: 664 IVTSAWWVWAKQVSKTAPTGEFLSTGSFMIRGKKNFLPPTGRLEMGLGVMWTVTDDS 720



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 23/38 (60%), Positives = 29/38 (76%), Gaps = 3/38 (7%)

Query: 889 GGK---ISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
           GGK   ++R Q+ KL K++EKYGDQDEEER IRM L+ 
Sbjct: 806 GGKTKPLTRHQRKKLAKIREKYGDQDEEERLIRMKLMG 843


>gi|159477991|ref|XP_001697091.1| hypothetical protein CHLREDRAFT_181058 [Chlamydomonas reinhardtii]
 gi|158269999|gb|EDO96040.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 246

 Score =  229 bits (584), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 116/202 (57%), Positives = 142/202 (70%), Gaps = 16/202 (7%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V VDL+LSAHANA  +++ ++K  +K  +      +       +      Q         
Sbjct: 1   VAVDLSLSAHANASAYFDTRRKHLAKLGEQDAGCQRGGAGGGGEEGGGGTQAA------- 53

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
                      WFISSENYLV+SGRDAQQNE++VKRY  KGDVYVHA+LHGASST++KN 
Sbjct: 54  ---------LPWFISSENYLVVSGRDAQQNELLVKRYFRKGDVYVHAELHGASSTIVKNP 104

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
           +P+QP+PP+TL QAGC  VC S+AWDSK+VTSAWWV+ HQVSKTAP+GEYL  GSFMIRG
Sbjct: 105 QPDQPIPPITLQQAGCACVCRSRAWDSKIVTSAWWVHHHQVSKTAPSGEYLVTGSFMIRG 164

Query: 671 KKNFLPPHPLIMGFGLLFRLDE 692
           KKNFLPP PL+MGFG LF+ DE
Sbjct: 165 KKNFLPPQPLVMGFGFLFKWDE 186


>gi|154418675|ref|XP_001582355.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121916590|gb|EAY21369.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 875

 Score =  228 bits (582), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 143/436 (32%), Positives = 231/436 (52%), Gaps = 27/436 (6%)

Query: 321 PPTESGS-STQIYDEFC-PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
           PP   G   T+  D+F  P  L Q+   +   F+TFD A DEF+S  E +RA+++HK  E
Sbjct: 266 PPKPKGYVYTKGKDKFLSPFPLAQYDPSQSQVFDTFDKACDEFWSVRELERAQKEHKENE 325

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
            A   K+  +  + + +    + E+D   +   LI+ N   ++     +   +ANR+ W+
Sbjct: 326 AAPDKKVQSVKKNFDKKRKQFQDELDLLNRTGHLIQANATQIEQCRNVINSFIANRVRWD 385

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALS 498
           ++   ++  ++ GN +A +IDK+  E++    L++      D+E KT   E++ ++L  +
Sbjct: 386 EIRMSIRAYQECGNELASMIDKVDFEKSGFYCLVN------DEEGKT---ERIFIELKKT 436

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
           A+ANA  +++ +     K E       +  K  EK       ++K  + I   RK  WFE
Sbjct: 437 AYANASAYFDKRAVLVKKLEGANAKEEEVLKKVEKDAIA--AKKKVTSTIQERRKTWWFE 494

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           +F+WFI++ENYLVISGRD  QNE++V  Y+ K D+Y+HA++HGA+S +IKN    +PV P
Sbjct: 495 RFHWFITTENYLVISGRDKVQNEVLVAHYLKKDDIYLHAEIHGAASVIIKNPT-SKPVSP 553

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
           ++L QA  F V  S AW S    + +WV+  QV K  P       G+F I G+KN +   
Sbjct: 554 ISLEQAAEFAVARSSAWKSNEPCNCFWVHADQVKKNLPGQPTAPKGTFYIVGEKNMMTMT 613

Query: 679 PLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEK-- 736
              MG G+LF + E  +  H NER++R +E+           K  S+I  E+ +T  K  
Sbjct: 614 MPQMGLGILFHVTEQHVADHANERKIRVDED----------EKPESEIPKEEGETKPKLP 663

Query: 737 PVAESLSVPNSAHPAP 752
           P  +S  +  +A P P
Sbjct: 664 PRVDSAEI-EAALPFP 678



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 75/143 (52%), Gaps = 13/143 (9%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           + ++ +V  E+  L+ LIGMR  N++ +   T   K     GV+        +L++++GV
Sbjct: 4   QFSSYEVKVEIDSLQELIGMRIGNIHQVDKDTLTMKFW-KLGVSR-------ILIVQNGV 55

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H T + R+K   P  F  +LRK +R RRL D+ Q   DR + F FG       +  EL
Sbjct: 56  RFHITDFPREKPKVPPDFCCRLRKLLRFRRLNDIIQPLNDRAVYFCFG----DLRLCFEL 111

Query: 125 YAQGNILL-TDSEFTVLTLLRSH 146
           +  GNI+L  +++  +  +L+ H
Sbjct: 112 FQGGNIILFQETDKIIQAVLKYH 134


>gi|71027701|ref|XP_763494.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350447|gb|EAN31211.1| hypothetical protein, conserved [Theileria parva]
          Length = 1249

 Score =  228 bits (581), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 132/386 (34%), Positives = 204/386 (52%), Gaps = 51/386 (13%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F+ A+D F++K E     +Q K  +D    KLNKI +DQ+ R   L +++ +     
Sbjct: 275 FEDFNDAVDAFFTKHE---LAKQEKKTQDKKPTKLNKIKIDQDKREQKLVEDIRKLDLEI 331

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           +L+E N++  +  +   +  +A+  SW D+   ++ +RK  +P+   I ++ +     +L
Sbjct: 332 KLLEENVDIAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKEINIP--TQTL 389

Query: 471 LLSNNLDEMDDEEKTLPVEK-----------------VEVDLALSAHANARRWYELKKKQ 513
           +  N +   D   +     K                 V +D  L++H N ++ Y  +K+ 
Sbjct: 390 IFHNPISGSDQLSQGGQSGKPGKSGTQSKLSKDLTASVSLDYRLNSHQNLKKLYNERKRL 449

Query: 514 ESKQEKTITAHSKAFKAAEKKTRLQILQE----KTVANISHMRKVHWFEKFNWFISSENY 569
           E+K E+T      A K   K  + Q  ++    K    IS +RK  WFEKF WFI+S+ Y
Sbjct: 450 ENKLERTKIGKEYALKKVTKSLKKQETKKTDKNKRDVRISSVRKRFWFEKFYWFITSQGY 509

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK-----------------NHRP 612
           LV++GRDA QNE++VK+Y++ GD+Y HAD+HGA+S ++K                 N   
Sbjct: 510 LVLAGRDALQNELLVKKYLTNGDLYFHADIHGAASVILKTNSNSSSFNLTTGTTSDNTET 569

Query: 613 EQPVPPL--------TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
               PP         ++++AG F VC S AW+ K    +WWVY HQVSKT PTGEY+  G
Sbjct: 570 TNTSPPYDMIKSVKESIDEAGNFAVCLSTAWNEKFSVQSWWVYWHQVSKTPPTGEYVPQG 629

Query: 665 SFMIRGKKNFLPPHPLIMGFGLLFRL 690
           SF+IRGKKN+LPP  L MG   LF++
Sbjct: 630 SFVIRGKKNYLPPQKLEMGITYLFQV 655



 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 96/172 (55%), Gaps = 9/172 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIG-MRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+N  DVA  V  L++LI  +   N+YD++ + +I K         S    K+ +L
Sbjct: 1   MAKERLNAVDVAVVVSNLKKLISNLTLVNIYDITNRIFILKF--------SKNENKIYIL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+H T + R   + PS F  KLRKH+R RRL D+ Q+  DR+I F F     AH+
Sbjct: 53  IEIGCRIHATQFLRSVDHLPSNFNAKLRKHLRNRRLRDISQISQDRVIDFTFSSEEYAHH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER 171
           +I++L+  GNI LTD E+ VLT+LR     DK   + S + Y  E    FE+
Sbjct: 113 LIVQLFLPGNIYLTDHEYKVLTVLRPQNTGDKFFKVGSNYVYDMEYNSWFEK 164


>gi|304314240|ref|YP_003849387.1| hypothetical protein MTBMA_c04780 [Methanothermobacter marburgensis
           str. Marburg]
 gi|302587699|gb|ADL58074.1| conserved hypothetical protein [Methanothermobacter marburgensis
           str. Marburg]
          Length = 653

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 192/686 (27%), Positives = 314/686 (45%), Gaps = 109/686 (15%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DV A  + L  ++ G R    Y     T I +          GE  ++ ++M++GV
Sbjct: 4   MSNVDVFAVTRELNDILSGARVDKAYQPLRDTVIIRFHVP------GEG-RMDVVMQAGV 56

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T Y  +    P  F + LRKH+R   + +VRQ  +DRI+  +       + +++EL
Sbjct: 57  RIHRTDYPPENPKIPPSFPMLLRKHLRGGIVREVRQHSFDRIVEIEIE-KEQKYTLVVEL 115

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
           +++GNI+L + E  ++  L+     D+ +A  SR R        +E   +  +H      
Sbjct: 116 FSKGNIILLNQEGEIILPLKRKTWSDRRIA--SRER--------YEYPPSRGIH------ 159

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
             P   E  ++ E   N                  DL +   +N                
Sbjct: 160 --PLRYEIGELEEMLKNSDT---------------DLIRTLARN---------------- 186

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
               G+G   +E IIL +GL    K    + L  + I+ +  A+ +    L+D       
Sbjct: 187 ----GFGGLYAEEIILRSGL---DKKRAASTLSRDEIEKIESAINELFKPLRD------- 232

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK 364
                L  N H+ K+      G       +  P+ L  +R RE   FETF+ A DEF+S 
Sbjct: 233 -----LKFNPHIIKN------GEG-----DVLPIELMVYRDREREYFETFNEAADEFFSS 276

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           I  +   + H+A+ +    K  K    Q   +   +  +D S +  +L+  +   V+  +
Sbjct: 277 IFREELRKVHEAEWEKEVEKFRKRLRIQRETLQKFQDTIDTSTRKGDLLYAHYAAVEDVL 336

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
             +R A   + SW+++ +++ + R  G   A +I ++    N M+LL+            
Sbjct: 337 RTIRDA-REKYSWKEIRKIIADARSKGMVEAQMIQEIDGMGN-MTLLIDG---------- 384

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--KTRLQILQE 542
               E++ +D  L    NA  +YE  KK + K +  + A  K  +  EK  K R   L+ 
Sbjct: 385 ----ERIRIDPTLGVPENAEVYYEKAKKAKRKIKGVLQAIEKTEREIEKVEKRRDDALRN 440

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
             V      RK+ WFEKF WFISS+++LVI GRDA  NEM+VKR+M   D+Y+H+D+HGA
Sbjct: 441 IMVPQKRVKRKLRWFEKFRWFISSDDFLVIGGRDAGTNEMVVKRHMEPRDIYLHSDIHGA 500

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYL 661
            S VIK+   E  VP  T+ +A  F    S AW     +   +WV+P QVSKT  +GE++
Sbjct: 501 PSVVIKSEGRE--VPETTIQEAAVFAASFSSAWTRGFTSLDVYWVHPEQVSKTPRSGEFV 558

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLL 687
             G+F+IRG +N++   PL +  G++
Sbjct: 559 ARGAFIIRGTRNYIRGVPLKVAVGVV 584


>gi|349581807|dbj|GAA26964.1| K7_Ypl009cp [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1027

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 203/730 (27%), Positives = 347/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+ +        L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDES--------LFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N +  KD    E      IYD F P    +N   S      E    ++  LD+F+S IE
Sbjct: 291 ENYNSEKDTADLEF-----IYDTFHPFKPYINGGDSDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PERTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 40.0 bits (92), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG++GKLKK+++KY DQDE ER +R+  L  
Sbjct: 827 RGKRGKLKKIQKKYADQDETERLLRLEALGT 857


>gi|151942783|gb|EDN61129.1| conserved protein [Saccharomyces cerevisiae YJM789]
          Length = 1040

 Score =  222 bits (565), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 202/730 (27%), Positives = 346/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+         +L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDE--------SLFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PEKTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 40.0 bits (92), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG++GKLKK+++KY DQDE ER +R+  L  
Sbjct: 840 RGKRGKLKKIQKKYADQDETERLLRLEALGT 870


>gi|6325248|ref|NP_015316.1| Tae2p [Saccharomyces cerevisiae S288c]
 gi|74676621|sp|Q12532.1|TAE2_YEAST RecName: Full=Translation-associated element 2
 gi|683781|emb|CAA88377.1| unknown [Saccharomyces cerevisiae]
 gi|965084|gb|AAB68096.1| Ypl009cp [Saccharomyces cerevisiae]
 gi|1314067|emb|CAA95032.1| unknown [Saccharomyces cerevisiae]
 gi|285815527|tpg|DAA11419.1| TPA: Tae2p [Saccharomyces cerevisiae S288c]
          Length = 1038

 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 202/730 (27%), Positives = 346/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+         +L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDE--------SLFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PEKTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 40.0 bits (92), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG++GKLKK+++KY DQDE ER +R+  L  
Sbjct: 838 RGKRGKLKKIQKKYADQDETERLLRLEALGT 868


>gi|392296002|gb|EIW07105.1| Tae2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1036

 Score =  221 bits (563), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 202/730 (27%), Positives = 346/730 (47%), Gaps = 97/730 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+         +L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDE--------SLFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILA 426
           S +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++   LA
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELNERKGHLIIENAPLIEEVKLA 405

Query: 427 VRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL---LLSNNLDEMDDE 482
           V+  +  +M W  + +++K E+K GN +A L++  L L++N +S+   L S  L+   DE
Sbjct: 406 VQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQNKISVKLDLSSKELNTSSDE 465

Query: 483 E------------------------------KTLPVEKVEV--DLALSAHANARRWYELK 510
           +                              K    EK+ V  DL LSA+ANA  ++ +K
Sbjct: 466 DNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKINVTIDLGLSAYANATEYFNIK 525

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFNWFISSE 567
           K    KQ+K      KA K  E K   Q L++K   + S ++K+   ++FEK++WFISSE
Sbjct: 526 KTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSVLKKIRTPYFFEKYSWFISSE 584

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGC 626
            +LV+ G+   + + I  +Y+   D+Y+    +  S   IKN  PE+  VPP TL QAG 
Sbjct: 585 GFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWIKN--PEKTEVPPNTLMQAGI 640

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGK--KNFLPPHPLIMG 683
             +  S+AW  K+ +S WW +   VSK        L  G+F ++ +  +N LPP  L+MG
Sbjct: 641 LCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGAFRLKNENDQNHLPPAQLVMG 700

Query: 684 FGLLFRLDES 693
           FG L+++  S
Sbjct: 701 FGFLWKVKTS 710



 Score = 40.0 bits (92), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG++GKLKK+++KY DQDE ER +R+  L  
Sbjct: 836 RGKRGKLKKIQKKYADQDETERLLRLEALGT 866


>gi|367000852|ref|XP_003685161.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
 gi|357523459|emb|CCE62727.1| hypothetical protein TPHA_0D00840 [Tetrapisispora phaffii CBS 4417]
          Length = 1016

 Score =  217 bits (553), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 217/834 (26%), Positives = 389/834 (46%), Gaps = 124/834 (14%)

Query: 25  RCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTL 84
           R +N+Y++S     F L      TES    K  +L++ G+R+H+T + R     PSGF +
Sbjct: 25  RLTNIYNISDSNRQFLL--KFNRTES----KCSVLVDCGLRIHSTTFNRPIPPAPSGFVV 78

Query: 85  KLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR 144
           KLRKH++++RL  +RQ+  DRI++ QF  G+  +Y++LE ++ GN++L D E  +L+L R
Sbjct: 79  KLRKHLKSKRLTALRQVKNDRILVLQFADGL--YYLVLEFFSSGNVILLDEEKKILSLQR 136

Query: 145 SHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSN 204
                     ++  H       RV E  T       +  +++P A++ +   +   +  N
Sbjct: 137 ----------VVQEHE-----NRVGEVYTMFDDSLFIGGNEKPIADKREYTEDLIESWIN 181

Query: 205 ASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHII----- 259
             KE +  +           +N  S  G + K+  + ++    L   P LS  +I     
Sbjct: 182 EVKEKIAAE-----------ANVISEPGHQKKKLRVPSIHKLLLSKVPHLSSDLISKNLK 230

Query: 260 ---LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHL 316
              +D  L     + +++KL     Q+LV    ++ D L++  S     +GYIL   K  
Sbjct: 231 KNEIDPSLSSLDFVDKISKLN----QLLVETEDEYTDLLKNRYS-----KGYILA--KRN 279

Query: 317 GKDHPPTESGSSTQIYDEFCPLL----LNQFRSREFVKFE-TFDAALDEFYSKIESQRAE 371
            K     +S  +  IY+ F P       N+    + ++ E  ++  LD F+S IES +  
Sbjct: 280 PKFIEEKDSKDTEYIYETFHPFAPYVDPNEIDISKVIEVEGPYNNTLDLFFSTIESSKYA 339

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
            + + +E  A  KL+    +   +++ L+     + +   LI    + ++    AV+  +
Sbjct: 340 LRIQNQEFLAKKKLDDAVNENLTKINALRDIQSINEEKGVLIIEKADLIEEVKGAVQSLI 399

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL------------------ 472
             +M W  +  +++ E+K  N +A LI   L L+ N ++++L                  
Sbjct: 400 DQQMDWNAIENIIRNEQKKRNNIARLIMLPLNLKENKINIILPAEDNNSDDSDNSSSSSD 459

Query: 473 -------------------SNNLDEMDDEE-KTLPVEKVEV--DLALSAHANARRWYELK 510
                               N +   + +  K + ++  ++  DLALSA ANA  ++  K
Sbjct: 460 SDSEYSDNSDSDSSDDDIEKNRIKRKNRKNSKNVKIKGTQITIDLALSAFANASEYFNKK 519

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSEN 568
           K    KQ+K      KA K  E++ ++Q+ ++   ++  +  +R  ++FE+FNWF SSE 
Sbjct: 520 KTSAEKQKKVEKNAEKALKNIEERIKVQLNKKLKDSHDILKKIRAPYFFERFNWFFSSEG 579

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLTLNQAGCF 627
           +L++ G+     + I  +Y+   D+Y+       +   IKN  PE+  +PP TL QAG  
Sbjct: 580 FLILMGKSPLDTDQIYSKYIEDDDIYMSNSF--GTQVWIKN--PEKTEIPPNTLMQAGVL 635

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFMIRGKK--NFLPPHPLIMGF 684
            +  S+AW  K+ +S WW +   VSK +  G+  L  G F ++  K  NFLPP  L+MGF
Sbjct: 636 CMSASEAWSKKIASSPWWCFAKNVSKFSSDGKSVLEPGLFRMKNDKQQNFLPPAQLVMGF 695

Query: 685 GLLFRL---DESSLGSHLNERRVRGEEEGMDDFEDSGHHK---ENSDIESEKDDTDEKPV 738
           G L+++   DE     +LNE R    EE +   ED+   K   E++D+  + +   E   
Sbjct: 696 GFLWKVKIEDEGDADDNLNEVR----EEVLTGDEDNVVEKIVNESADVTDQNELLKEDEE 751

Query: 739 AESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVT 792
            ES +  +S     ++ + +N D+    +  +T +N I+    D ++ VA  +T
Sbjct: 752 IESFNGMSSITQEINNLDITNADN---ISNQQTTTNNINE--MDASKTVATVLT 800


>gi|358254228|dbj|GAA54239.1| nuclear export mediator factor Nemf, partial [Clonorchis sinensis]
          Length = 527

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 112/222 (50%), Positives = 146/222 (65%), Gaps = 24/222 (10%)

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           S AFKA + +  L     +TVA I+ +RK  WFEKF WFISSENYLV++GRD+QQNE++V
Sbjct: 4   SAAFKAQQTRKDL-----RTVAQITKIRKPMWFEKFFWFISSENYLVVAGRDSQQNEVLV 58

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRP---------------EQPVPPL-TLNQAGCFT 628
           KR++   D+YVHAD+HGASS ++K  RP                 P+PP  TL +AG   
Sbjct: 59  KRHLGSDDIYVHADVHGASSVIVK-ARPLTTEESSSDSVSSTSRLPLPPPKTLIEAGTLA 117

Query: 629 VCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
           +  S AW++++VTSAWWV   QVSKTAP+GEYLT G+FMIRG+KN+LPP   + GFG+LF
Sbjct: 118 IVLSSAWNARVVTSAWWVRQDQVSKTAPSGEYLTTGAFMIRGRKNYLPPCHFMYGFGVLF 177

Query: 689 RLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEK 730
           +LDE S+  H  ERRV    +  DDF  S       D+ +EK
Sbjct: 178 KLDEESVEHHRGERRVT-RIDPSDDFT-SAPKTNAEDVPAEK 217



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 23/37 (62%), Positives = 27/37 (72%)

Query: 886 KIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALL 922
           K   G I RGQK KL+K+K+KYG QDE+ER  RM LL
Sbjct: 297 KSNSGPIKRGQKSKLRKIKQKYGTQDEDERMARMKLL 333


>gi|242399100|ref|YP_002994524.1| fibronectin-binding protein [Thermococcus sibiricus MM 739]
 gi|242265493|gb|ACS90175.1| Predicted fibronectin-binding protein [Thermococcus sibiricus MM
           739]
          Length = 650

 Score =  216 bits (549), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 188/693 (27%), Positives = 323/693 (46%), Gaps = 122/693 (17%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L G R   +Y    +  I   ++++G    G ++   L++E
Sbjct: 1   MKQEMSSVDIKYIVEELKTLEGARVDKIYQDKNRVRI--KLHTTG---EGRND---LIIE 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T Y ++    PS FT+ LRK++   R+E + Q  +DRI+  + G     + +I
Sbjct: 53  AGKRIHLTTYIKEAPQHPSSFTMLLRKYLSGSRVEKIEQHDFDRIVKLKIG----NYTLI 108

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            EL+ +GNI+L                 D+   I+S  RY       F+  T    H  L
Sbjct: 109 AELFQKGNIILV----------------DENNVIISAMRYEE-----FKDRTIKPQHVYL 147

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                P A E         N  +   EN       +  ++ +                  
Sbjct: 148 L----PPARE---------NPVDILWENFRELISSQDVEIVR------------------ 176

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
             L   L  G   +E I+L  G+    K    N L++N ++V+      FE  +++V + 
Sbjct: 177 -ALARKLNMGGLYAEEILLRAGI---EKTKRANALDENELKVI------FEK-IKEVFNA 225

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
               +  I+ +N     D+P            +  P+ L  + S +   F TF  ALDE+
Sbjct: 226 P--KKANIIYKN-----DNPI-----------DVVPIELKWYESYKKKFFTTFSEALDEY 267

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           + KI  + A+ +   K      +L      QE  ++  K ++  + ++ +LI  N   ++
Sbjct: 268 FGKILLESAKIERTKKLQNKKRQLEATLRKQEEMINGFKNQIQENQEIGDLIYTNFAFIE 327

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
             +  +  A+  ++ W++    V+  +K+GN +A +I  +                  D 
Sbjct: 328 NLLKELSKAV-EKLGWKEFKERVENGKKSGNKIAQIIKNI------------------DA 368

Query: 482 EEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           +EK + +E    KV++ L  S   NA  +YE  KK + K E    AH +  K  ++  +L
Sbjct: 369 KEKAVTIELDGKKVKLYLNKSVGENAEIYYEKAKKAKHKLEGAQKAHKETLKKIKEIEKL 428

Query: 538 QILQEKTVANISHM--RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
              +EK   ++  +  RK  WFEKF WF+SSE +L+I+G+DA  NE++VKRYMS+ D+Y 
Sbjct: 429 IEEEEKKELSVRKLEKRKKKWFEKFRWFLSSEGFLIIAGKDATTNEIVVKRYMSENDLYC 488

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKT 654
           HAD++GA   VIK+ +        TL +A  F V  S+AW   + +  A+W  P+QV+K 
Sbjct: 489 HADIYGAPHVVIKDGK---KAGEKTLFEACQFAVSMSRAWKEGLYSGDAYWTDPNQVTKK 545

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           AP+GEYL  G+FM+ GK+N++   P+ +  G++
Sbjct: 546 APSGEYLGKGAFMVYGKRNWMHGLPVKLAVGIV 578


>gi|85000891|ref|XP_955164.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65303310|emb|CAI75688.1| hypothetical protein, conserved [Theileria annulata]
          Length = 1185

 Score =  215 bits (548), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 129/400 (32%), Positives = 209/400 (52%), Gaps = 63/400 (15%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F+ A+D F++K E     +Q K   D    K+NKI +DQ  R   L +++ +     
Sbjct: 274 FEDFNDAVDTFFTKHE---LAKQEKKSVDKRPTKINKIKIDQNKRELNLMEDIQKIDSKI 330

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           +L+E +++  +  +   +  +A+  SW D+   ++ +RK  +P+   I ++++    +  
Sbjct: 331 KLLEEHVDVAENCLNLTKALIASGASWNDIYEQLQIQRKQNHPLVHYIKEIHIPTQTLIF 390

Query: 471 LLSNNLDEMDDEEKTLPVEK--------------------VEVDLALSAHANARRWYELK 510
             + N D+ +++ K    ++                    VE+D  L++H N ++ Y  +
Sbjct: 391 YSNQNQDQHNEQNKQNQFQQNIQQKNENKQNKKNTRDEVVVELDYRLNSHQNLKKLYNER 450

Query: 511 KKQESKQEKTIT----AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           K+ E+K E+T      A  K  K+ +K+   +  ++     IS +R+  WFEKF WFI+S
Sbjct: 451 KRLENKLERTRIGKEYALKKVTKSLKKEENKKTDKKGRDVKISSVRRRFWFEKFYWFITS 510

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK------------------ 608
           + YLV++GRDA QNE++VK+Y++ GD+Y HAD+HGASS ++K                  
Sbjct: 511 QGYLVLAGRDALQNELLVKKYLTNGDLYFHADIHGASSVILKTNSTSNNNTFNLSNSTNT 570

Query: 609 ----------------NHRPEQPVPPL--TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
                           N   E     L  ++++AG F VC S AW+ K    +WWVY HQ
Sbjct: 571 ATTSTTGTTTTSLDNENSNVEDVSKRLKESIDEAGNFAVCLSTAWNEKFSVQSWWVYWHQ 630

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           VSKT PTGEY+  GSF+IRGKKN+LPP  L MG   LF++
Sbjct: 631 VSKTPPTGEYVPQGSFVIRGKKNYLPPQKLEMGITYLFQV 670



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/171 (38%), Positives = 97/171 (56%), Gaps = 9/171 (5%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+N  DVA  V  L++LI  +   N+YD++ + +I K         S    K+ +L
Sbjct: 1   MAKERLNAVDVAVTVSNLKKLITNLTLVNIYDITNRVFILKF--------SKNENKIYIL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+H+T + R   + PS F  KLRKH+R RRL D+ Q+  DR+I F F     AH+
Sbjct: 53  IEIGCRIHSTQFLRSVDHLPSNFNAKLRKHLRNRRLRDISQMSQDRVIDFTFSSEEYAHH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           +I++L+  GNI LTDSE+ VLT+LR     DK   + + + Y  +    FE
Sbjct: 113 LIVQLFLPGNIYLTDSEYKVLTVLRPQNTGDKFFKVGTNYVYDMDYNSWFE 163


>gi|406604691|emb|CCH43887.1| putative RNA-binding protein [Wickerhamomyces ciferrii]
          Length = 983

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 135/397 (34%), Positives = 215/397 (54%), Gaps = 40/397 (10%)

Query: 331 IYDEFCPL-LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +Y++F P   +N     E V  + ++  LD+F+S IES +   + + +E+ A  +L +  
Sbjct: 282 LYEQFHPFEPINLKEDEELVPIQGYNKTLDKFFSTIESSKYALRIQNQENQAKKRLQQAR 341

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
            D++ +V  L      +    E I +N E V+ A  AV+  L  +M W+ + +++  E+ 
Sbjct: 342 DDKQQQVQRLLDVQAVNTLKGETIIFNAEIVEEAKAAVQALLDQQMDWKTMEKLINVEKA 401

Query: 450 AGNPVAGLID-KLYLERNCMSLLLSNN-------------------LDEMDDEEKTLPVE 489
            GN VA +I+  L L+ N +SL LS                      +   DE++  PV+
Sbjct: 402 KGNRVAKVINLPLNLKENKISLSLSTEDPYANDEDEDESSSESEPESESDSDEDEPKPVK 461

Query: 490 ------------KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT-- 535
                        V +DL LS++ANA  ++ +KK    KQ+K   + +KA K  E+K   
Sbjct: 462 SQAKKDNVKNTINVTIDLTLSSYANASEYFNVKKSTVEKQKKVEQSATKALKNIEQKIEK 521

Query: 536 --RLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
             +  + QE  +  +  +R  ++FEKFNWFIS+ENYL++SG+D  Q ++I  RY++  D+
Sbjct: 522 DLKKNLKQENDI--LRKLRNPYFFEKFNWFISNENYLILSGKDDSQCDLIYHRYINDDDI 579

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK 653
           YVHAD+ G+S   IKN    + V P TL QAG  ++  S+AW++KMVTS+WW+Y   V+K
Sbjct: 580 YVHADIDGSSHVFIKNPNKGE-VSPSTLMQAGILSLSTSKAWENKMVTSSWWLYASDVTK 638

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
               G  L  GSF    +KNFLPP  L+MGF  L+++
Sbjct: 639 KDIDGTILNAGSFRYLKEKNFLPPSQLVMGFAFLWKV 675



 Score = 92.8 bits (229), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 46/126 (36%), Positives = 78/126 (61%), Gaps = 12/126 (9%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           +   R  N+Y++  S K Y+ K     G+ +S ++    L+++SG + H T ++R    T
Sbjct: 13  ITNYRLQNIYNIATSNKQYLLKF----GLPDSKKN----LVLDSGFKTHITEFSRPTPQT 64

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PS F +KLRKH+++RRL  ++Q+G DR+I+  F  G  A++++LE ++ GNI+L D E  
Sbjct: 65  PSSFVVKLRKHLKSRRLSSIKQVGIDRVIVLTFSDG--AYHLVLEFFSAGNIVLLDHERR 122

Query: 139 VLTLLR 144
           +L L R
Sbjct: 123 ILALQR 128



 Score = 46.2 bits (108), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 20/31 (64%), Positives = 24/31 (77%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG+KGK+KK+  KYGDQDEEER +RM  L  
Sbjct: 783 RGKKGKMKKIANKYGDQDEEERRLRMEALGT 813


>gi|401842736|gb|EJT44818.1| TAE2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1032

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 189/733 (25%), Positives = 342/733 (46%), Gaps = 112/733 (15%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ +         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLRF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  +RQ+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTSLRQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           +++L R          ++       +I  +F+ T  +  +  +  S       P+ + E 
Sbjct: 131 IMSLQR---------VVLEHENQVGQIYEMFDETLFAAGNDFVNES-------PEIIKEK 174

Query: 199 GNNVSNASKENLGGQKGGKSFDLS-----KNSNKNSNDGARAKQPTLKTVLGEALGYGPA 253
               SN   E +   +     D++        NKN +   + K P++  +L   L   P 
Sbjct: 175 Y--TSNLVNEWIEATQSKYDSDIAVIKQLNIQNKNDSKEKKVKVPSIHKLL---LSKVPH 229

Query: 254 LSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYI 309
           LS  ++        + P+M    +    +   ++L    +++ + L    + D   +GYI
Sbjct: 230 LSSDLLSKNLKVFNIDPSMSCLALLDRTNTLAEMLNRTQSEYNELL---TTSD--RKGYI 284

Query: 310 LM-QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYS 363
           L  +N++      P +      IYD F P    +N+  S  F   +    ++  LD+F+S
Sbjct: 285 LAKKNENFNSIKDPADLEF---IYDTFHPFRPYINEKNSGSFRIADVEGPYNKTLDKFFS 341

Query: 364 KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAA 423
            IES +   + + +E  A  K++    + + ++  L    + + +   LI  N   ++  
Sbjct: 342 TIESSKYALRIQNQESQAQKKIDDARAENDRKIQALLNVQELNERKGHLIIENASLIEEV 401

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDE 482
            LAV+  +  +M W  + +++K E+K GN +A L++  L L++N +S+ L     ++  E
Sbjct: 402 KLAVQGLVDQQMDWSTIEKLIKSEQKKGNKIAQLLNLPLNLKQNKISVKL-----DISRE 456

Query: 483 EKTLPVE--------------------------------------KVEVDLALSAHANAR 504
           E+++                                          V +DL LSA+ANA 
Sbjct: 457 EESITSSDEDDESEDSSSEGSSDSGDMSTFKEENSKKKGQSNNALNVTIDLGLSAYANAS 516

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV---HWFEKFN 561
            ++ +KK    KQ+K      KA K  E K   Q L+ K   + S ++KV   ++FEK+N
Sbjct: 517 EYFNIKKTSAEKQKKVEKNVGKAMKNIEVKIDQQ-LKRKLKESHSVLKKVRTPYFFEKYN 575

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-VPPLT 620
           WFISSE +LV+ G+   + + I  +Y+   D+Y+    +  +   IKN  P++  VPP T
Sbjct: 576 WFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--THVWIKN--PDKTEVPPNT 631

Query: 621 LNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFMIRGKK--NFLPP 677
           L QAG   +  S+AW  K+ +S WW +   V K  +     L  G+  ++ +K  N LPP
Sbjct: 632 LMQAGILCMSSSEAWSKKIASSPWWCFAKNVCKFDSSDNSILPEGALRLKNEKDLNLLPP 691

Query: 678 HPLIMGFGLLFRL 690
             L+MGF  L+++
Sbjct: 692 AQLVMGFAFLWKV 704



 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 24/30 (80%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
           RG++GKLKK++ KY DQDE+ER +R+  L 
Sbjct: 824 RGKRGKLKKIQRKYADQDEQERFLRLEALG 853


>gi|312136934|ref|YP_004004271.1| fibronectin-binding a domain-containing protein [Methanothermus
           fervidus DSM 2088]
 gi|311224653|gb|ADP77509.1| Fibronectin-binding A domain protein [Methanothermus fervidus DSM
           2088]
          Length = 645

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 182/691 (26%), Positives = 323/691 (46%), Gaps = 122/691 (17%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DV A V  L +L+ G +    Y   P+  I  L     V   G   +V +++++GV
Sbjct: 1   MSNVDVYAVVYELNKLLKGSKFVKAY--QPRKDIIVL--RFHVKNKG---RVDVIIQTGV 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-LGMNAHYVILE 123
           R+H T Y+ +    P  F + LRK+++   +E V+Q  +DRI+ F    LG   + +I+E
Sbjct: 54  RIHATRYSLENPKFPPSFPMLLRKYLKGGIVESVKQHKFDRIVEFNVKVLGKKNYKLIVE 113

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTS 183
           L+ +GNI+LT+    ++  LR+ +  D+ ++   +++YP        + T SKL   L  
Sbjct: 114 LFGKGNIILTEENGKIIQPLRTEKWSDREISAGKKYKYPESRGLNPLKITKSKLKELL-- 171

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
                      +N D + V   +    GG                               
Sbjct: 172 -----------LNSDKDVVRTLALNGFGG------------------------------- 189

Query: 244 LGEALGYGPALSEHIILDTGL---VPNMKLS--EVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                      +E I+  +G+    P+  LS  E+NK+ D +I+ +  ++ ++    Q +
Sbjct: 190 ---------TYAEEIVYRSGIDKNTPSKSLSDNEINKIYD-SIEEIYGSLKEYNFKPQII 239

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           +  D+VP                                + L  +++ E   F+ F+ AL
Sbjct: 240 VDKDVVP--------------------------------IELKIYKNYEKRYFDNFNKAL 267

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLE 418
           DEF++    +  +++ +        KL +I   Q+N + + K++  +  ++ +LI    E
Sbjct: 268 DEFFTPKLREELKKEKEKVWKNKIEKLERILNSQKNAIKSFKKKAKKYREIGDLIYLKYE 327

Query: 419 DVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE 478
            +   I  ++ A   + +W+++     E+ K       +     + ++ +  L   N+D 
Sbjct: 328 LISKVINTLKNA-KEKYTWKEII----EKVKKAKKENKIKIINSITKDGIVTL---NIDG 379

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
                     + V +D+  S   NA  +YE  KK   K +  I A  +  K      + +
Sbjct: 380 ----------KSVNIDINKSLEKNAEIYYEKAKKIRKKIKGAIKAMEETEKKLNNLKKKR 429

Query: 539 ILQEKTV-ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
            ++ K +   I   RK+ WFEKF WFISS+ +LVI GRDAQ NE+IVK+YM + D+Y+HA
Sbjct: 430 DIEIKNILIPIKKRRKLKWFEKFRWFISSDGFLVIGGRDAQTNEIIVKKYMEENDIYLHA 489

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAP 656
           D+HGA S VIKN    + +P  T+N+A  F    S+AW   + ++  +WVYP QV+K+ P
Sbjct: 490 DIHGAPSVVIKNK--NKKIPENTINEAAIFAASFSKAWTYGLGSADVYWVYPQQVTKSPP 547

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +GEY++ G+F+IRGK+N++   P+ +  G++
Sbjct: 548 SGEYISKGAFVIRGKRNYIRNVPIELAVGIV 578


>gi|255710571|ref|XP_002551569.1| KLTH0A02530p [Lachancea thermotolerans]
 gi|238932946|emb|CAR21127.1| KLTH0A02530p [Lachancea thermotolerans CBS 6340]
          Length = 1058

 Score =  214 bits (544), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 196/761 (25%), Positives = 347/761 (45%), Gaps = 128/761 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+    + L+ +L G R SN+Y++  S + ++ K         +    K+  
Sbjct: 1   MKQRISSLDLELLYRELKSQLEGYRLSNIYNIAESSRQFLLKF--------NKPDSKLNA 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R    TPSGF +KLRKH++++RL  V+++  DRI++  F  G    
Sbjct: 53  IIDCGLRVHLTDFTRPVPATPSGFVVKLRKHLKSKRLTTVKRVANDRILVLSFNDGQ--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           +++LE ++ GN++L DS+  ++ L R          I+  H +  ++  ++     S L 
Sbjct: 111 FLVLEFFSAGNVILLDSDRKIIVLQR----------IV--HEHENKVGHIYNMFDGSFLE 158

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                  +   +  D+VN              G  K  K F    +S+  +  G  AK  
Sbjct: 159 NTRIEPPKSKVHSADEVN--------------GWIKEAKDF---ADSSVKAKTGKGAKVL 201

Query: 239 TLKTVLGEALGYGPALSEHIIL----DTGLVPNMK-LSEVNKLEDNAIQVLVLAVAKFED 293
           ++  +L       P LS  +I       G+ PN   L+ ++K+ D  + +L    ++  +
Sbjct: 202 SIHKLL---FLREPQLSSDLISRNLKSRGIAPNSPCLNFLDKI-DEIVDLLDATESEVNE 257

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ--IYDEFCPLLLNQFRSRE-FVK 350
            L+D         GYI+ +       H  +E G +    +Y++F P   +     + + K
Sbjct: 258 LLRDGCKL-----GYIIAKKNP----HYDSEKGDANLEFVYEQFHPFPPHLSEDEKGYTK 308

Query: 351 F----ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
                  ++  +D+F+S IES +   + + +E  A ++L    +D E R+  L     ++
Sbjct: 309 IIEVPGQYNKTVDDFFSTIESSKYALRIQNQEFQAKNRLESAKLDNEKRIQALIDVQTQN 368

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
                 I    + V+ A  A++  +  +M W+ +  ++  E+K  N +A LI   L L+ 
Sbjct: 369 EVRGHAIIAAADLVEEAQNAIKALVEQQMDWKTIEVLISNEQKKNNRIARLIKLPLDLKN 428

Query: 466 NCMSLLLSNN----LDEMDDEEKTL----------------------------------- 486
           N  +L L  N     D  D+EE  L                                   
Sbjct: 429 NKFTLSLPRNDEIESDNSDEEEDNLTSSEDETSSSDSSDSSLSDFEADDNDEDELTSVSN 488

Query: 487 -------------PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
                        P     +DL LSA+ANA  ++ +KK    KQ+K      KA K  E+
Sbjct: 489 IKKDRNDNKKKEKPSIDATIDLTLSAYANASNYFNIKKSNVEKQKKVEKNAQKALKNIEQ 548

Query: 534 KTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           +    + ++   ++  ++  RK ++FEKF+WF+SSE +LV+ G+   +++ I  +Y+   
Sbjct: 549 RIEKDLKKKLKESHDVLNKTRKPYFFEKFHWFVSSEGFLVLMGKSGMESDQIYGKYIHDN 608

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           DV+V       +   IKN   E  VPP TL QAG   +  S AW  K+ +SAWW +  ++
Sbjct: 609 DVFVSNSFD--THVWIKNP-DETEVPPNTLMQAGIMCMSASPAWSKKIQSSAWWCFAKEL 665

Query: 652 SKTAPT-GEYLTVGSFMIRG--KKNFLPPHPLIMGFGLLFR 689
           SK     GE L  G+F ++   KK+FLPP  L+MGF LL++
Sbjct: 666 SKFDNYGGEVLPAGTFRLKDEKKKSFLPPSQLVMGFALLWK 706


>gi|15679889|ref|NP_277007.1| hypothetical protein MTH1907 [Methanothermobacter
           thermautotrophicus str. Delta H]
 gi|2623041|gb|AAB86367.1| conserved protein [Methanothermobacter thermautotrophicus str.
           Delta H]
          Length = 655

 Score =  211 bits (537), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 192/690 (27%), Positives = 302/690 (43%), Gaps = 118/690 (17%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DV A    L  ++ G R    Y     T I +          GE  +V ++M++GV
Sbjct: 7   MSNVDVFAVTSELNEMLRGARVDKAYQPLRDTVIIRFHVP------GEG-RVDVVMQAGV 59

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T Y       P  F + LRKH++   + +VRQ G+DRI               +E+
Sbjct: 60  RIHRTNYPPQNPKVPPSFPMLLRKHLKGGVVREVRQHGFDRI---------------VEI 104

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKG-VAIMSRHRYPTEICRVFERTTASKLHAALTS 183
             +      D E+T++  L +     KG + ++++ R   EI    +R T S    A   
Sbjct: 105 TVE-----KDQEYTLMVELFA-----KGNIILLNQQR---EIILPLKRKTWSDRRIA--- 148

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
           S+E     P +    G N  +     L         DL +   +N               
Sbjct: 149 SREIYEYPPSR----GINPLDHDPSELEDILMNSGADLIRTLARN--------------- 189

Query: 244 LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
                G+G   +E I+L  GL  N   S  N   D+  ++       F+   +  +   I
Sbjct: 190 -----GFGGLYAEEIVLRAGLDKNTPCS--NLTPDDIRKIDAAIYETFKPLRELDLKPHI 242

Query: 304 VPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYS 363
           + +G                         ++  P+ L  +  RE   FE+F+ A DEF+S
Sbjct: 243 IGDG-------------------------EDVLPIELRVYSGRERRYFESFNDAADEFFS 277

Query: 364 KI---ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
            I   E +RA ++   +E   F K  +I   Q   +   K+ ++ S +  +L+  N   V
Sbjct: 278 SIFREEIRRAHEEEWEREVDRFRKRLRI---QRETLEKFKKTIEVSTRRGDLLYANYSLV 334

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           +  +  +R A   + SW+++  ++ + RK G P A  I                 +D M 
Sbjct: 335 EEVLATIRRA-REKYSWDEIKNIIADARKRGLPEASNI---------------TEIDRMG 378

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--KTRLQ 538
           +    L  E V +D  L    NA  +YE  KK + K +  +TA  K  K  E+  K R  
Sbjct: 379 NITIFLDGEPVRIDSKLGVPENAEVYYEKAKKAKRKIKGVMTAIEKTEKEIERIEKKRDD 438

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            L+   V      RK+ WFEKF WF+SS+ +LVI GRDA  NEM+VK++M   D+Y+H+D
Sbjct: 439 ALRNIMVPRRRVKRKLRWFEKFRWFVSSDGFLVIGGRDAGTNEMVVKKHMEPRDIYLHSD 498

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPT 657
           +HGA S VIK     + VP  T+ +A  F    S AW     +   +WV+P QVSKT  +
Sbjct: 499 IHGAPSVVIKTE--GRDVPETTIQEAAVFAASFSSAWTRGFTSLDVYWVHPEQVSKTPRS 556

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           GE++  G+F+IRG +N+L   PL +  G++
Sbjct: 557 GEFVARGAFIIRGSRNYLRGVPLKIAIGVV 586


>gi|241958102|ref|XP_002421770.1| conserved hypothetical protein [Candida dubliniensis CD36]
 gi|223645115|emb|CAX39711.1| conserved hypothetical protein [Candida dubliniensis CD36]
          Length = 1012

 Score =  209 bits (532), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 152/494 (30%), Positives = 242/494 (48%), Gaps = 68/494 (13%)

Query: 276 LEDN--AIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD 333
            EDN  A+Q +V A+   ED   D+I+G I  EGYI+ +     K++  +E+     IYD
Sbjct: 236 FEDNQEALQQVVNALGVCEDKYIDLINGAIDNEGYIVAK-----KNNKASENSELEYIYD 290

Query: 334 EFCPL---LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHM 390
           EF P      NQ    +F+    ++  LD+F+S IES +   + + +++ A  +L K   
Sbjct: 291 EFDPFEPYKPNQ-EGLKFIPVSGYNKTLDKFFSNIESTKFSMKIEQQKENAAKRLEKARS 349

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
           +++ ++ +L  +   + K  ELI+Y+ E V+     V+  +  +M W ++  ++  E+K 
Sbjct: 350 ERDKQIDSLVAQQKLNAKKGELIQYHSELVEECRNYVQSFIDQQMDWTNIETVISLEQKK 409

Query: 451 GNPVAGLID-KLYLERNCMSLLLSNNLDEMDD---------------------------- 481
            N +A  I   L L+ N + +LL    ++ DD                            
Sbjct: 410 KNDLAKHIQLPLNLKENKIKVLL----EDFDDYEESTESASATETESETESETDSDSSSE 465

Query: 482 -----EEKTLPVEKVE-----------------VDLALSAHANARRWYELKKKQESKQEK 519
                +E  +PV++ +                 +DL+ SA ANAR +++ KK  E+KQ K
Sbjct: 466 SESDNDEDKIPVKRTQRKKNAKEKPKRKTVPTWIDLSQSAFANARSYFDSKKTAETKQVK 525

Query: 520 TITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDA 577
             ++ S A K AE+K    + +     N  +  +R  +WFEKF WF+SSE YL ++G+DA
Sbjct: 526 VESSTSMALKNAERKINQDLTRSLKQENETLKEIRPKYWFEKFFWFVSSEGYLCLAGKDA 585

Query: 578 QQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
            Q +MI  R+ S  D  V AD+ G+    IKN    + +PP TL QAG F +  S AW+ 
Sbjct: 586 SQTDMIYYRHFSDNDSIVSADMEGSLKVFIKNPLKGEALPPSTLMQAGIFAMSTSSAWNG 645

Query: 638 KMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           K+ TSAW ++  ++SK    G  +  G F    +K +LPP  L+MGFG    LDE S   
Sbjct: 646 KVTTSAWVLHGTEISKRDYDGSIVPEGEFNYLVQKEYLPPAQLVMGFGFYCLLDEESTKH 705

Query: 698 HLNERRVRGEEEGM 711
           +   R  R  E G 
Sbjct: 706 YAEIRTKRELEHGF 719



 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 80/146 (54%), Gaps = 13/146 (8%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLL 58
           +K R+ + D+      L + L   R  N+Y+++   + Y+FK         S    K ++
Sbjct: 1   MKQRITSLDLQILTSELSKELSNYRLQNIYNVASNSRQYLFKF--------SIPDSKKVV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++E G R+H T + R     P+ F  KLRKH++TRRL  ++Q+  DRI++ +F  G   +
Sbjct: 53  VLEYGNRIHLTDFERPATQQPTNFVTKLRKHLKTRRLSGIKQISNDRILVLEFSDG--KY 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR 144
           Y++LE ++ GN+LL D    +L L R
Sbjct: 111 YLVLEFFSAGNVLLLDESQKILALQR 136


>gi|156338807|ref|XP_001620041.1| hypothetical protein NEMVEDRAFT_v1g149359 [Nematostella vectensis]
 gi|156204309|gb|EDO27941.1| predicted protein [Nematostella vectensis]
          Length = 287

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 186/360 (51%), Gaps = 81/360 (22%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y EF P L+ Q++   +++F +FD  +D+F+S I SQ+ + +   +E +A  KL  +  D
Sbjct: 8   YQEFYPFLMTQYKDHPYLEFPSFDKTVDDFFSSIGSQKLDVKALNQEKSALKKLENVKKD 67

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
            E R+  L+   +  V+ A+LIE NL+ VD AIL V  A+AN++ W ++  +VKE +  G
Sbjct: 68  HEKRIQQLQSAQEADVRKAQLIEINLDLVDRAILVVNSAIANQIDWSEILNLVKEAQIQG 127

Query: 452 NPVAGLIDKLYLERNCMSLLLSN-NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
           +PVA  I +L L+ N +++LL   +L  ++          V VD+ L AH NARR     
Sbjct: 128 DPVASAIRELKLQTNHITMLLRYVSLASING-------RPVRVDIDLLAHLNARR----- 175

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
                                                         FEKF WFISSENY+
Sbjct: 176 ----------------------------------------------FEKFLWFISSENYV 189

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI GRD QQNE++VKR++  G+   +     A  T+I ++            Q+   T  
Sbjct: 190 VIGGRDQQQNELVVKRHLQPGNATCNTIFSQA--TLICSY------------QSQLSTTA 235

Query: 631 HSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
            + ++ S++  +        VSKTAPTGEYLT GSFMIRGKKNFLPP  LIMGF  LFR+
Sbjct: 236 INHSYQSQLSIT--------VSKTAPTGEYLTTGSFMIRGKKNFLPPCHLIMGFSFLFRV 287


>gi|298675852|ref|YP_003727602.1| fibronectin-binding A domain-containing protein [Methanohalobium
           evestigatum Z-7303]
 gi|298288840|gb|ADI74806.1| Fibronectin-binding A domain protein [Methanohalobium evestigatum
           Z-7303]
          Length = 670

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 179/717 (24%), Positives = 312/717 (43%), Gaps = 121/717 (16%)

Query: 3   KVRMNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           K  M++AD++A +  L      ++  + + +Y  +P      +     +   G      L
Sbjct: 4   KQEMSSADISALISELSDGSNSIVDAKINKIYQPTPDEVRINIY----IPRVGRDN---L 56

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           ++E+G R+H + + R     P  F + LRKHI   R+  +RQ  +DRI+      G    
Sbjct: 57  VIEAGKRIHLSKHLRSNPKMPGPFPMLLRKHIMGGRITFIRQYDFDRIVEIGISKGDVDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I E ++QGN++L ++E  ++  ++      + +     + YP       E      L 
Sbjct: 117 ILIAEFFSQGNVILLNNERKIILPMKPRTFRGRKIQGGEMYEYPESQISPLE-AEKDDLE 175

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
            A +SS            ED    + A+  NLGG                          
Sbjct: 176 QAFSSS------------EDDVVRTIATSFNLGG-------------------------- 197

Query: 239 TLKTVLGEALGYGPALSEHIILDTGL-----VPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
                          L+E +    G+     V ++ L E +KL D             +D
Sbjct: 198 --------------LLAEEVCARAGVDKNKPVDDVTLDEKSKLTDT-----------LKD 232

Query: 294 WLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFET 353
               +++G++ P    +++ K        T + S    Y +  P  L Q++  E   F++
Sbjct: 233 VFTPIVTGELNP---CIIKQK--------TNNQSE---YVDVLPFELEQYKEYEKQYFDS 278

Query: 354 FDAALDEFYSK--IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           F+ ALDEF+ K  +E++R  Q+   KE    ++  +    Q+  +   ++E ++   +AE
Sbjct: 279 FNKALDEFFGKEVVEAERKIQESAKKEKVDIYQ--RRLQQQQGAIEKFEKEANKYNSIAE 336

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
            I  +   V+  I  +  A  +  SW+D+   +KE      P A LI  +  +   + + 
Sbjct: 337 AIYSHYPFVEEVITVLTNARKSGYSWDDIKSKLKEANDI--PSAKLIQSIDPKSGTIVM- 393

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
                 ++D  + TL       D+  S   NA+ +YE  K+   K+E  + A  +  +  
Sbjct: 394 ------DLDGTKATL-------DIRYSVPQNAQTYYEKAKRVMKKREGALRAIEETKRII 440

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           E + + Q    K       +RK HW+ +F WFISS+ +LV+ GRDA  NE I K+YM K 
Sbjct: 441 ENRDKPQQQTRKRKV----IRKKHWYSRFRWFISSDGFLVVGGRDADTNEEIFKKYMEKQ 496

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS-KMVTSAWWVYPHQ 650
           D+ +H  + GA   ++K+ R    VP  T+ +A  F V +S  W S +     +WVYP+Q
Sbjct: 497 DIILHTQVPGAPLAIVKSKRYN--VPEQTMYEAAQFVVSYSSIWKSGQFGGDCYWVYPNQ 554

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           VSKT  +GE+L  GSF+IRG +N+    P+ +  GL    +   +G  L+  +  G+
Sbjct: 555 VSKTPESGEFLKKGSFIIRGDRNYFKNVPVSVAIGLELENETRVIGGPLDAVKKNGK 611


>gi|385304258|gb|EIF48283.1| tae2-like protein [Dekkera bruxellensis AWRI1499]
          Length = 979

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 191/733 (26%), Positives = 332/733 (45%), Gaps = 110/733 (15%)

Query: 23  GMRCSNVYDLSP--KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           G R SNVY LS   ++++FK              KV + +ESG +L+ T Y +     P+
Sbjct: 23  GHRLSNVYSLSSNNRSFLFKFAQPDS--------KVNVAVESGFKLYITDYQKPVLPQPT 74

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
            F  KLRKH++++RL  V Q+G DR+++ +F  GM  +Y++LE ++ GNI+L DS   ++
Sbjct: 75  SFCTKLRKHLKSKRLTHVEQVGDDRVVVLEFSDGM--YYLVLEFFSAGNIILLDSNRQII 132

Query: 141 TLLRSHRD-----DDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKV 195
           +L R   +     D           YP+    +FE     K    +T  K       +++
Sbjct: 133 SLFRVVENKMKASDPDAFNYSIGQIYPSFDSTLFEDENM-KTREFVTYDKGLVVGWINEM 191

Query: 196 NEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALS 255
            +      N        +K G+ F ++K                          + P LS
Sbjct: 192 QQREEQNKNRETSGKKKKKKGRIFSVNK----------------------LCFMHAPYLS 229

Query: 256 EHII----LDTGLVPNMKLSEVNKLEDNA-IQVLVLAVAKFEDWLQDVIS---GDIVPEG 307
             +I    LD G+ P+   S +N LEDN+ ++ +V ++ + E+  + ++    G +  +G
Sbjct: 230 SDLIQRSLLDNGVTPSQ--SCLNMLEDNSLVEKVVTSLQESENTFKSLLQTPPGKV--QG 285

Query: 308 YILMQNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFETFDAALDEFYSKI 365
           +IL +   L  +     S +    Y+EF P   +  +    +    + ++  +D F++ I
Sbjct: 286 WILRKINPLFDNTKEESSENLKYTYEEFHPFEPVHKENEDSKVDVVDGYNKTVDTFFTMI 345

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIEYNLEDVDAA 423
           E  +A    + ++ AA  +L  +  + E ++  L   QE++R  K   LI  +  +++  
Sbjct: 346 ELSKASLSRQQQKAAAAKRLQLVKEENEKKLAKLDAVQELNR--KKGYLITLHSSEIEDC 403

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD------ 477
             +++  L  +M W+++ ++++ ER+ GNP A +I  L L ++  ++LL +  +      
Sbjct: 404 RSSIQALLDQQMDWQNIDKLIEVERRRGNPTAKMIKSLNLLKHEFTVLLPDEQEVVDDEN 463

Query: 478 -----------------EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKT 520
                            + + E+K   +  V +D+  SA AN+ R+++ KK  + KQEKT
Sbjct: 464 EDESDSDSDSDSDDDDDDDETEDKKSNIISVSIDIRESAFANSTRYFDAKKNAQEKQEKT 523

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
               + A K +E K    +   K + N                  S+N + I        
Sbjct: 524 KENAAIAIKNSEMKIHRDM---KRLEN-----------------ESKNTVDIHS------ 557

Query: 581 EMIVKRYMSKG-DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
             I  RY+    D  V +D+  +   VIKN    + +PP T  QAG + +  S+AWDSKM
Sbjct: 558 --IYYRYLDNNTDYLVSSDVDKSLKVVIKNPYKNKEIPPSTFVQAGIYCLTTSKAWDSKM 615

Query: 640 VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHL 699
             S W+V    VSK    G  L  G   I+G KNFLPP  L+MG GLL+  DE +   H+
Sbjct: 616 SPSPWFVKGDAVSKKDFDGSLLPPGLLNIKGDKNFLPPSQLVMGIGLLWLPDEKTKARHI 675

Query: 700 NERRVRGEEEGMD 712
                R ++ G +
Sbjct: 676 EYMLNRNKDIGFE 688


>gi|342186351|emb|CCC95837.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1015

 Score =  207 bits (526), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 186/739 (25%), Positives = 337/739 (45%), Gaps = 127/739 (17%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKV-LL 58
           MVK RM + DV A  + +   L  +R  N+Y + P+T++F+          G++EK   +
Sbjct: 1   MVKSRMTSLDVKASSQEMHAELKNLRLLNIYSIPPRTFLFRF---------GQAEKKKTV 51

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM-NA 117
           +++ G+RLH T   R+K   PS F  K+RK +   ++  VRQL +DR++ F  G+   N+
Sbjct: 52  VLDVGIRLHLTQVVREKPQIPSAFAQKMRKLLCNWKVRSVRQLDHDRVVDFHLGMSEENS 111

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            ++++EL+++GN+                        +++ H Y  ++  +F     +K+
Sbjct: 112 LHIVVELFSKGNL------------------------VVTDHEYRVKL--LFRTEAVNKV 145

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             A+           D++      +  A  E  GGQ+      L +  N+     A+   
Sbjct: 146 TPAV-----------DEIFL--KTIPRAPLEE-GGQEQISEEMLQQEWNEKF---AQWDG 188

Query: 238 PT-LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
           P  + ++L     +G +L+ HI+   G VPN+   ++N   +   + L+  +   + W  
Sbjct: 189 PVEICSILSSMYSFGNSLAGHIMSRAG-VPNVTKDKMNCSGEEMFRKLLPGM--LDAW-- 243

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPL-------LLNQFRSREFV 349
            + S  +   GY+L  +K  G++         T I   FC +       L+N F+    V
Sbjct: 244 RLFSSPLPEGGYLLKSSKRGGQE----AMIPGTMISALFCSISTRRMLWLINIFQIS--V 297

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
            F     A++ F+ + + +R E  +   +     K  +   +   R+  LK+  + S++ 
Sbjct: 298 AF-----AMNFFHIR-KKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRK 351

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
             LI  N E +D  I  +  AL  ++ W+D   ++K+ R  G+P+A +I ++  ER  + 
Sbjct: 352 GHLIFQNTETIDKIIGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVV 411

Query: 470 LLLSNNLDEMDD----------EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           +L++ + D+ D+          E++     ++E+DL  +AH NA  ++   K   +K ++
Sbjct: 412 VLMNEDADDDDEQTEDEEGEKREDRDRATYEIEIDLTKTAHTNAEEYFARAKSTAAKLKR 471

Query: 520 TITAHSKAFKAAEKKTRLQI--LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDA 577
           TI A  KA   AE+K R      QEK +      R   W+EKFNWF +S   LV+ GRD 
Sbjct: 472 TIAATEKAMAGAERKGRTVTGKTQEKKIIT---ERCRFWWEKFNWFRTSCGDLVLQGRDE 528

Query: 578 QQNEMIVKRYMSKGDVYVHADLHGASSTVIK-------------NHRPEQ---------- 614
           +  +++++R M  GD+++   + G    +++                P+           
Sbjct: 529 RSTQLLLRRVMRLGDIFLCCHVVGGLPCILRPAGSVWSAVNASSKSGPDGGNGGDVCATP 588

Query: 615 ---PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
              PV   ++ +A  + V  S AW+SK    AWWV+  QVS     G YL        G+
Sbjct: 589 KMCPVRKKSVEEAASWCVSRSPAWESKFTVGAWWVHASQVSGGTSAGCYL------YEGE 642

Query: 672 KNFLPPHPLIMGFGLLFRL 690
           ++ L P    +G GLLFR+
Sbjct: 643 QHDLEPPSSRLGCGLLFRV 661


>gi|190407936|gb|EDV11201.1| hypothetical protein SCRG_02481 [Saccharomyces cerevisiae RM11-1a]
          Length = 1030

 Score =  206 bits (524), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 207/750 (27%), Positives = 342/750 (45%), Gaps = 137/750 (18%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           L G R SN+Y++  S K ++ K         +    K+ ++++ G+R++ T ++R    T
Sbjct: 21  LEGYRLSNIYNIADSSKQFLLKF--------NKPDSKLNVVVDCGLRIYLTEFSRPIPPT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF +KLRKH++ +RL  ++Q+  DRI++ QF  G    Y++LE ++ GN++L D    
Sbjct: 73  PSGFVVKLRKHLKAKRLTALKQVDQDRILVLQFADG--HFYLVLEFFSAGNVILLDENRR 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           ++ L R          ++       +I  +F+ +        L ++    A+E  + N  
Sbjct: 131 IMALQR---------VVLEHENKVGQIYEMFDES--------LFTTNNESADESIEKNRK 173

Query: 199 GNNVSNASKENLGGQKGGKSFDLS--KNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
               S    E +   +     D++  K  N    +GA+ K+  + ++    L   P LS 
Sbjct: 174 AEYTSELVNEWIKAVQAKYESDITVIKQLNIQGKEGAKKKKVKVPSIHKLLLSKVPHLSS 233

Query: 257 HIILDTGLVPNMKLSE--VNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM--- 311
            ++     V N+  SE  +N LE+      +L   + E + Q + + D   +GYIL    
Sbjct: 234 DLLSKNLKVFNIDPSESCLNLLEETDSLAELLNSTQLE-YNQLLTTTD--RKGYILAKRN 290

Query: 312 QNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET---FDAALDEFYSKIE 366
           +N    KD    E      IYD F P    +N   +      E    ++  LD+F+S IE
Sbjct: 291 ENYISEKDTADLEF-----IYDTFHPFKPYINGGDTDSSCIIEVEGPYNRTLDKFFSTIE 345

Query: 367 SQ--------------------RAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           S                     RAE   K +      +LN      E + H +       
Sbjct: 346 SSKYALRIQNQESQAQKKIDDARAENDRKIQALLDVQELN------ERKGHLI------- 392

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
           ++ A LIE          LAV+  +  +M W  + +++K E+K GN +A L++  L L++
Sbjct: 393 IENAPLIE-------EVKLAVQGLIDQQMDWNTIEKLIKSEQKKGNRIAQLLNLPLNLKQ 445

Query: 466 NCMSL---LLSNNLDEMDDEE------------------------------KTLPVEKVE 492
           N +S+   L S  L+   DE+                              K    EK+ 
Sbjct: 446 NKISVKLDLSSKELNTSSDEDNESEGNTTDSSSDSDSEDMESSKERSTKSMKRKSNEKIN 505

Query: 493 V--DLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           V  DL LSA+ANA  ++ +KK    KQ+K      KA K  E K   Q L++K   + S 
Sbjct: 506 VTIDLGLSAYANATEYFNIKKTSAQKQKKVEKNVGKAMKNIEVKIDQQ-LKKKLKDSHSV 564

Query: 551 MRKV---HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           ++K+   ++FEK++WFISSE +LV+ G+   + + I  +Y+   D+Y+    +  S   I
Sbjct: 565 LKKIRTPYFFEKYSWFISSEGFLVMMGKSPAETDQIYSKYIEDDDIYMSNSFN--SHVWI 622

Query: 608 KNHRPEQP-VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGS 665
           KN  PE+  VPP TL QAG   +  S+AW  K+ +S WW +   VSK        L  G+
Sbjct: 623 KN--PEKTEVPPNTLMQAGILCMSSSEAWSKKISSSPWWCFAKNVSKFDGSDNSILPEGA 680

Query: 666 FMIRGK--KNFLPPHPLIMGFGLLFRLDES 693
           F ++ +  +N LPP  L+MGFG L+++  S
Sbjct: 681 FRLKNENDQNHLPPAQLVMGFGFLWKVKTS 710



 Score = 40.0 bits (92), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG++GKLKK+++KY DQDE ER +R+  L  
Sbjct: 830 RGKRGKLKKIQKKYADQDETERLLRLEALGT 860


>gi|435850617|ref|YP_007312203.1| putative RNA-binding protein, snRNP like protein
           [Methanomethylovorans hollandica DSM 15978]
 gi|433661247|gb|AGB48673.1| putative RNA-binding protein, snRNP like protein
           [Methanomethylovorans hollandica DSM 15978]
          Length = 664

 Score =  205 bits (522), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 181/690 (26%), Positives = 301/690 (43%), Gaps = 107/690 (15%)

Query: 2   VKVRMNTADVAAEVKCLRR----LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVL 57
           +K  M +ADVAA V  L      LI  +   +Y        F L     V   G   +V 
Sbjct: 1   MKEEMASADVAALVAELSSGELSLIDAKVGKIYQPLEDEIRFNLF----VFGKG---RVD 53

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            ++++G R H + Y       P  F + LRKH+ + R+  ++Q  +DRII   F  G   
Sbjct: 54  FIIQAGKRAHLSQYVSPSPKLPQSFPMLLRKHVMSSRITSIKQYDFDRIIEIGFVRGGVE 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
             +I EL+A+GNI+L D+E  +  +L  +    KG  + S   Y     ++     + + 
Sbjct: 114 TVLIAELFARGNIVLIDNERRI--ILPMNPTTFKGRRVRSGEIYSYPEAQISPLDASEEQ 171

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             A+  S + D              + A++ NLGG                         
Sbjct: 172 MLAVFRSSDSDVVR-----------TIATRFNLGG------------------------- 195

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQD 297
                           LSE +    G+  N+ +SEV   E      + L +   +D    
Sbjct: 196 ---------------LLSEEVCSRAGIKKNLPVSEVGSEE------ITLLLRAMKDMFSP 234

Query: 298 VISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
           + +G++ P   I+M+ +           G + Q  D   P  L  +R     ++ +F+ A
Sbjct: 235 LQTGELDP--CIIMKGE-----------GDTAQSID-VVPFELEVYRELTKERYPSFNKA 280

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           LDE++ K E+    +Q  + +      L +    QE  V    +E ++   +AE I  N 
Sbjct: 281 LDEYFGKREAASITEQAFSVKKEKVDLLERRLRQQEEAVEKYGKESEKHTSIAETIYANY 340

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
           + V+  +  + +A     SW+ +   +K  +          D +   ++ +S+  +  + 
Sbjct: 341 QAVEDVLKVLAIARDKGYSWDQIKSTIKAAK----------DSVPAAKSILSIDSATGIV 390

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
            +D     L   K  +D+  +   NA+ +YE  KK   KQE  I +  +   A +KK + 
Sbjct: 391 VLD-----LMGMKTNIDVTKTVPQNAQVYYERSKKLAKKQEGAIRSIEQTKLAMQKKEKT 445

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
              +  TV     ++K  W+++F WF+SS+ +LVI GRDA  NE I  +YM K D+ +H 
Sbjct: 446 ATRKRGTV----RIKK-QWYDRFRWFVSSDGFLVIGGRDADTNEEIFVKYMEKRDIVLHT 500

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAP 656
            + GA  TVIK    E  VP  T+ +A  F V +S  W S   ++  +WV P QVSKT  
Sbjct: 501 QMPGAPLTVIKTGGKE--VPSQTIEEAARFVVSYSSVWKSGQFSADCYWVNPTQVSKTPE 558

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +GEY+  GSF+IRG++N+L   P+ +  G+
Sbjct: 559 SGEYVKKGSFIIRGERNYLKDVPVGVAVGI 588


>gi|367011407|ref|XP_003680204.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
 gi|359747863|emb|CCE90993.1| hypothetical protein TDEL_0C01040 [Torulaspora delbrueckii]
          Length = 1016

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 200/761 (26%), Positives = 345/761 (45%), Gaps = 118/761 (15%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+    + LR  L G R SN+Y++  S + ++ K         +    K  +
Sbjct: 1   MKQRISALDIQILAEELRAHLEGHRLSNIYNIADSSRQFLLKF--------NKPDSKFSV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T Y R     PS F +KLRKH++++RL  +RQ+  DRI++ QF  G+   
Sbjct: 53  VVDCGLRIHLTDYDRPIPPGPSSFVVKLRKHLKSKRLSALRQVKNDRILVLQFADGL--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN++L D    +L+L R   + +  V       Y      +F    +S+  
Sbjct: 111 YLVLEFFSAGNVILLDENKKILSLQRIVHEHENKVG----ETYTMFDDSLFNVNNSSQSA 166

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                SK  D        E+       SK +L       +  + ++S K      +A +P
Sbjct: 167 DQTIKSKSYDVELVRVWLEEAQ-----SKFSLQSSMQADAMKVKQSSKK------KALKP 215

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN---------KLEDNAIQVLVLAVA 289
              T+    L   P LS  +     L  N+K+ ++N           ED  + +L     
Sbjct: 216 L--TIHKLLLSKEPHLSSDL-----LSKNLKMRKINPSSPCIEFLAKEDVLVDLLNYTEI 268

Query: 290 KFEDWLQDVISGDIVPEGYILMQ---NKHLGKDHPPTESGSSTQIYDEFCPLL-----LN 341
           ++ D L +  S      G+IL +   N  LGKD    E      I++ F P        +
Sbjct: 269 EYHDVLSNKDS-----RGFILAKKNVNYTLGKDSEDLEF-----IFENFHPFKPFIEEQD 318

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQ 401
           Q RSR       ++  LD F+S IES +   + + +E  A  K+    ++ + R+  L  
Sbjct: 319 QGRSRITEVPGEYNKTLDTFFSTIESSKYALRIQQQEQLAKKKIEDARLENQKRIQALLD 378

Query: 402 EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-K 460
               + +    I  N + V+ A +AV+  +  +M W+ + ++++ E+   N +A +ID  
Sbjct: 379 VQSSNEQKGHAIIANADLVEEAKIAVQGLIDQQMDWQTIEKLIRNEQLKKNKIAMVIDLP 438

Query: 461 LYLERNCMSLLLS-------NNLDEMD-------------------DEEKTLPVE----- 489
           L L+ N +++L+        NN  E D                   D+ +    E     
Sbjct: 439 LNLKENAVNILVPVSHDDEHNNESESDESFVESSSDESDSDEGTDSDDSEVSDFETEESR 498

Query: 490 --------------KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
                         K+ +DL LSA+ANA +++ +KK    KQ+K      KA K  E++ 
Sbjct: 499 NESRTSKRKVENKLKIRIDLGLSAYANASKYFTVKKTSADKQKKVEKNVEKAMKNIEQRI 558

Query: 536 RLQILQ--EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
             Q+ Q  +++ + +   R  ++FEK  WF SSE +LV+ GR   + + I  +Y+   D+
Sbjct: 559 DKQLKQKLKESHSVLKRARSPYFFEKHFWFYSSEGFLVLMGRSPLETDQIYSKYIEDDDI 618

Query: 594 YVHADLHGASSTVIKN-HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVS 652
           Y+ +     +   IKN +R E  VPP TL QAG F +  S+AW  K+ +S  W +   ++
Sbjct: 619 YMCSSFD--TQVWIKNPNRTE--VPPNTLMQAGVFCMAASEAWSKKVSSSPQWCFAKNIT 674

Query: 653 KTAPTGE-YLTVGSFMIRGKKNF--LPPHPLIMGFGLLFRL 690
           K   T +  L  G + I+ +     LPP  L+MGFG L+++
Sbjct: 675 KFDHTNKGVLDPGLYRIKKESEMSHLPPAQLVMGFGFLWKV 715


>gi|343472755|emb|CCD15168.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 559

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 163/626 (26%), Positives = 302/626 (48%), Gaps = 86/626 (13%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKV-LL 58
           MVK RM + DV A  + +   L  +R  N+Y + P+T++F+          G++EK   +
Sbjct: 1   MVKSRMTSLDVKASSQEMHAELKNLRLLNIYSIPPRTFLFRF---------GQAEKKKTV 51

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM-NA 117
           +++ G+RLH T   R+K   PS F  K+RK +   ++  VRQL +DR++ F  G+   N+
Sbjct: 52  VLDVGIRLHLTQVVREKPQIPSAFAQKMRKLLCNWKVRSVRQLDHDRVVDFHLGMSEENS 111

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
            ++++EL+++GN+++TD                        H Y  ++  +F     +K+
Sbjct: 112 LHIVVELFSKGNLVVTD------------------------HEYRVKL--LFRTEAVNKV 145

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
             A+           D++      +  A  E  GGQ+      L +  N+     A+   
Sbjct: 146 TPAV-----------DEIFL--KTIPRAPLEE-GGQEQISEEMLQQEWNEKF---AQWDG 188

Query: 238 PT-LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
           P  + ++L     +G +L+ HI+   G VPN+   ++N   +   + L+  +   + W  
Sbjct: 189 PVEICSILSSMYSFGNSLAGHIMSRAG-VPNVTKDKMNCSGEEMFRKLLPGM--LDAW-- 243

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR--SREFVKFETF 354
            + S  +   GY+L  +K  G++       ++   YD+F P+LL+Q++  +  +  F  F
Sbjct: 244 RLFSSPLPEGGYLLKSSKRGGQE-------ANDSRYDDFSPVLLDQYKKDAVAYQHFPNF 296

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
            +  DEF+S  E +R E  +   +     K  +   +   R+  LK+  + S++   LI 
Sbjct: 297 SSVCDEFFSYSEKKRIEHHNDKVKTVVVSKREECERNHNRRIDKLKRSEEESIRKGHLIF 356

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N E +D  I  +  AL  ++ W+D   ++K+ R  G+P+A +I ++  ER  + +L++ 
Sbjct: 357 QNTETIDKIIGLINEALDMKIRWDDFRSVLKQRRDEGHPLASMIKEVLFERRKVVVLMNE 416

Query: 475 NLDEMDD-----------EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           + D+ DD           E++     ++E+DL  +AH NA  ++   K   +K ++TI A
Sbjct: 417 DADDDDDEQTEDEEGEKREDRDRATYEIEIDLTKTAHTNAEEYFARAKSTAAKLKRTIAA 476

Query: 524 HSKAFKAAEKKTRLQI--LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
             KA   AE+K R      QEK +      R   W+EKFNWF +S   LV+ GRD +  +
Sbjct: 477 TEKAMAGAERKGRTVTGKTQEKKIIT---ERCRFWWEKFNWFRTSCGDLVLQGRDERSTQ 533

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVI 607
           ++++R M  GD+++   + G    ++
Sbjct: 534 LLLRRVMRLGDIFLCCHVVGGLPCIL 559


>gi|302309325|ref|NP_986649.2| AGL017Wp [Ashbya gossypii ATCC 10895]
 gi|299788305|gb|AAS54473.2| AGL017Wp [Ashbya gossypii ATCC 10895]
          Length = 1006

 Score =  204 bits (518), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 214/827 (25%), Positives = 377/827 (45%), Gaps = 140/827 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R+++ D+    + L+ +L G R +N+Y+++  +  F L  + G        K+ +L+
Sbjct: 1   MKQRISSLDLQLLARELKAQLEGCRLANLYNVADASKQFLLKFTKG------ESKISILI 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G+++  T ++R    +P  F  KLRKH++ +RL  V+Q+G DRI++  F  G+   ++
Sbjct: 55  DCGLKIFATEFSRPIPPSPGPFVAKLRKHLKAKRLTTVKQVGADRILVLSFADGL--FFL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GN++L D++  +L L R  RD ++ V          EI  +F+      +   
Sbjct: 113 VLEFFAAGNVILLDADRRILALQRVVRDHEQKVG---------EIYNMFDDHFLEDVSLP 163

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA---KQ 237
           +    + D +    V E       A++E             SK     +  G R    K 
Sbjct: 164 VP---KLDTHTLPVVQELLIKTKTAAEE-------------SKAVMPAAPVGGRKQSLKV 207

Query: 238 PTLKTVLGEALGYGPA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVL-VLAVAKFEDWL 295
           P++  +L  +  Y  + L   I+ + G+ P+    E   L D+A Q++ +L +A+ E ++
Sbjct: 208 PSIHKLLFSSYPYLSSDLLNKILKEHGIDPSQSFLE---LFDSADQLVDILNIAEKEAYM 264

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET 353
             +++ +    GYIL +   L  +    E    T  Y++F P    L     ++F   E 
Sbjct: 265 --LLTSE-KKNGYILARENPLYDEKKDAEGIRLT--YEQFHPFRPYLPDGSQKKFEIVEV 319

Query: 354 ---FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
              ++  +D+F+S I+S +   + + +E  A  KL K   + + ++  L +    + +  
Sbjct: 320 DGDYNRTVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQALVEVQHTNEQRG 379

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             I  N+E V+ A  A++  L  +M W  + +++K E+   N +A +I   L L+ N +S
Sbjct: 380 NAIINNIELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKIS 439

Query: 470 --LLLSN-----------------------------NLDEMDDE---------------- 482
             L LSN                              L + D E                
Sbjct: 440 VELPLSNEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKV 499

Query: 483 -EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
             K++    V +DL++SA+ANA  ++E+KK    KQ        KA K  E+K    + +
Sbjct: 500 SNKSI---TVSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKK 556

Query: 542 EKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +    +  +  +R  ++FEK+ WFIS+E +LV+ G+   + + I  +Y+   DVYV    
Sbjct: 557 KLKEQHDVLQVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVS--- 613

Query: 600 HGASSTV-IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT- 657
           +G  S V IKN    + +PP TL QAG F    S+AW  K+ TS WW     +SK     
Sbjct: 614 NGFGSQVWIKNFERTE-IPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVG 672

Query: 658 GEYLTVGSFMIRG--KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFE 715
           G  L  G+F ++    KNFLPP  L+MGF  ++++                     DD +
Sbjct: 673 GGLLPSGAFRLKSDEAKNFLPPAQLVMGFAFMWKIK-------------------TDDDQ 713

Query: 716 DSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPA----PSHTNAS 758
           ++G+ +   D+ +E D+  E        V  S  PA    PS++N S
Sbjct: 714 EAGYEE---DMPAEIDEMGEVSHPSEEMVEESIGPADNLLPSNSNQS 757


>gi|190345457|gb|EDK37344.2| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 873

 Score =  202 bits (515), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 142/498 (28%), Positives = 237/498 (47%), Gaps = 53/498 (10%)

Query: 262 TGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHP 321
            G+V N    E    ED  ++ +V A+   ++  ++++S        I++  K+   + P
Sbjct: 94  VGVVGNQSCLE---FEDKDLESVVEALKNSDNEYRNLVSSLGTEVTGIIVSKKNPAFE-P 149

Query: 322 PTESGSSTQIYDEFCPL--LLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKED 379
             ++     +YDEF P         S +F +   ++  +D F+S ++S++ E + + ++ 
Sbjct: 150 SDDNKDLEYLYDEFHPFKPYKENLESFKFTEIRGYNKTVDTFFSTLDSKKHELRMEQQKH 209

Query: 380 AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWED 439
            A  +L     +++ ++  L+ + + + K  + I Y+ + V   I +V+  L  +M W +
Sbjct: 210 NAKKRLLNAREERDKQIDNLRIQQEMNSKKGDAIIYHADLVSECIASVQTLLDQQMDWAN 269

Query: 440 LARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDD----------------- 481
           +  ++K E+  GN VA  I   L L  N + L L +  D M D                 
Sbjct: 270 IESLIKLEQSRGNSVAKTIKLPLNLTENKIGLKLPDT-DSMYDPADIDSESDSETSSESE 328

Query: 482 --------------------------EEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
                                     + K +P   V +DL+LS  ANAR ++E KK+ ES
Sbjct: 329 TESESESESGSESEDETPPKRMSKKAKSKEIPALSVWIDLSLSPFANARTYFESKKQAES 388

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVIS 573
           KQEK       A + A+KK    + +     N  +  +R  +WFEKF WF+SSE YL I+
Sbjct: 389 KQEKVEKNTDMALRNAQKKIEQDLAKNLKNENETLRQVRPKYWFEKFFWFVSSEGYLCIA 448

Query: 574 GRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQ 633
           GRD  Q +MI  R+ S  D +V +D+ G+   V+KN    + +PP TL QAG F +  S 
Sbjct: 449 GRDDAQVDMIYYRHFSDNDFFVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAMSASA 508

Query: 634 AWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           AW+ K+ TS W++  + V+K    G  +  G+F  +GKK FLPP  L+MG G  F  D+ 
Sbjct: 509 AWNGKITTSPWFLAGNDVTKLDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYFLGDDD 568

Query: 694 SLGSHLNERRVRGEEEGM 711
           +   +   R  R  E G+
Sbjct: 569 TTKKYGETRITRQNESGL 586



 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 27/37 (72%)

Query: 888 EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           E  K++RG++ K+K+  +KY DQDE+ER +RM +L  
Sbjct: 656 EPHKLTRGKRSKMKRAAKKYADQDEDERKLRMEMLGT 692


>gi|303290793|ref|XP_003064683.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453709|gb|EEH51017.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 807

 Score =  202 bits (515), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 156/497 (31%), Positives = 228/497 (45%), Gaps = 105/497 (21%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH-------------PPTES-- 325
           ++ L+  ++  +DW + V  G  VP G +  + K                   PP ++  
Sbjct: 239 VERLLRQLSVLDDWFEGVGDGSAVPTGVVTRRRKPGATGDDDDAFVVDDFSPLPPIDAID 298

Query: 326 --GSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
              +ST   D+                +E+FD ALD +++  E+Q A +Q +  E A   
Sbjct: 299 SNANSTATDDD----------DARVQAYESFDDALDAYFASFETQAATRQRERAEKAVVD 348

Query: 384 KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
           +L K+  DQ  R   L++E +     A LIEYNLE VD A+ AV  ALA  M W DL  M
Sbjct: 349 RLEKVRKDQSQRAAALEREREADELRATLIEYNLERVDVALAAVNNALAGGMGWGDLEIM 408

Query: 444 VKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK------------- 490
           ++EE +AGNPVAG I  L L  N +++ L+N+LD+ +D+E     +              
Sbjct: 409 IREETRAGNPVAGTIKSLDLANNKITVTLANHLDDDEDDEDEEEEDGEDEDKDGDEDDAG 468

Query: 491 --------------------------VEVDL--ALSAHANARRWYELKKKQESKQEKTIT 522
                                     V V+L  +LSA+ANAR  +E KKK  +K +KT+ 
Sbjct: 469 EGDDEKSSERKRKQQQKKLRRKRRKAVAVELDLSLSAYANARTHFEKKKKHATKHDKTLA 528

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEM 582
              +A                               KF WF+++EN LV+S RDA Q + 
Sbjct: 529 QTERA-------------------------------KFWWFVTTENCLVVSARDAAQTDA 557

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS 642
           ++K+Y   G   V     G       N      VPP +L QAG   +C S AWDS+ V S
Sbjct: 558 MLKKYAPPGSSVVVGGGGGGGGAGWCNG-----VPPASLAQAGAACLCRSNAWDSRQVIS 612

Query: 643 AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL-DESSLGSHLNE 701
           AW+V P Q+ K  P GE L  G     GKK FLPP PL+MGF  +F L D++S+ +H  +
Sbjct: 613 AWYVKPEQIRKETPEGEPLLNGVVWTVGKKTFLPPAPLVMGFAYMFVLGDDASVEAHAGD 672

Query: 702 RRVRGEEEGMDDFEDSG 718
           R V+ +   + + +  G
Sbjct: 673 RVVKQQMAALGNADGEG 689



 Score =  177 bits (448), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 92/199 (46%), Positives = 120/199 (60%), Gaps = 12/199 (6%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K + N  D+ AEV CLR RL+G   +NVYD    +++FK   S G TESGE EK+ ++
Sbjct: 1   MPKQKFNNYDIRAEVACLRARLVGTWLTNVYDRDKTSFVFKFTRSGGATESGEGEKINVV 60

Query: 60  MESGVRLHTTAYARDKK-----------NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +ESG R H T++AR              + PS F  KLR H+R +RL  + Q+G DR + 
Sbjct: 61  IESGTRFHCTSHARASASGGGGGKASSTDQPSKFNAKLRMHLRGKRLNAIDQIGSDRAVD 120

Query: 109 FQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRV 168
           F F  G   H++I+ELYAQGN+LL D +  VLTLLR+HRDDDKGV I+  HRYP E  R 
Sbjct: 121 FTFSSGDTEHHLIVELYAQGNVLLLDKDDVVLTLLRTHRDDDKGVKILGNHRYPRERFRT 180

Query: 169 FERTTASKLHAALTSSKEP 187
            +R T   L  AL   + P
Sbjct: 181 HKRVTLHDLEGALGLGQNP 199


>gi|374109900|gb|AEY98805.1| FAGL017Wp [Ashbya gossypii FDAG1]
          Length = 1006

 Score =  202 bits (515), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 213/827 (25%), Positives = 376/827 (45%), Gaps = 140/827 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R+++ D+    + L+ +L G R +N+Y+++  +  F L  + G        K+ +L+
Sbjct: 1   MKQRISSLDLQLLARELKAQLEGCRLANLYNVADASKQFLLKFTKG------ESKISILI 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G+++  T ++R    +P  F  KLRKH++ +RL  V+Q+G DRI++  F  G+   ++
Sbjct: 55  DCGLKIFATEFSRPIPPSPGPFVAKLRKHLKAKRLTTVKQVGADRILVLSFADGL--FFL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           +LE +A GN++L D++  +L L R  RD ++ V          EI  +F+      +   
Sbjct: 113 VLEFFAAGNVILLDADRRILALQRVVRDHEQKVG---------EIYNMFDDHFLEDVSLP 163

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARA---KQ 237
           +      D +    V E       A++E             SK     +  G R    K 
Sbjct: 164 VPKL---DTHTLPVVQELLIKTKTAAEE-------------SKAVMPAAPVGGRKQSLKV 207

Query: 238 PTLKTVLGEALGYGPA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVL-VLAVAKFEDWL 295
           P++  +L  +  Y  + L   I+ + G+ P+    E   L D+A Q++ +L +A+ E ++
Sbjct: 208 PSIHKLLFSSYPYLSSDLLNKILKEHGIDPSQSFLE---LFDSADQLVDILNIAEKEAYM 264

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPL--LLNQFRSREFVKFET 353
             +++ +    GYI+ +   L  +    E    T  Y++F P    L     ++F   E 
Sbjct: 265 --LLTSE-KKNGYIVARENPLYDEKKDAEGIRLT--YEQFHPFRPYLPDGSQKKFEIVEV 319

Query: 354 ---FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
              ++  +D+F+S I+S +   + + +E  A  KL K   + + ++  L +    + +  
Sbjct: 320 DGDYNRTVDKFFSTIDSTKYALRIQTQEQNARKKLEKAKAENQKKIQELVEVQHTNEQRG 379

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             I  N+E V+ A  A++  L  +M W  + +++K E+   N +A +I   L L+ N +S
Sbjct: 380 NAIINNIELVEEAKSAIQGLLDQQMDWTSIEKLIKTEQAKSNRIARVIKLPLNLKANKIS 439

Query: 470 --LLLSN-----------------------------NLDEMDDE---------------- 482
             L LSN                              L + D E                
Sbjct: 440 VELPLSNEDDESSDGSWGDSESDSGFSSSDDELSDSGLSDFDAEVVRGSGSKNKKGKSKV 499

Query: 483 -EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
             K++    V +DL++SA+ANA  ++E+KK    KQ        KA K  E+K    + +
Sbjct: 500 SNKSI---TVSIDLSMSAYANASSYFEMKKTGAKKQLGVEQNVQKAMKNIEQKIEKDLKK 556

Query: 542 EKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           +    +  +  +R  ++FEK+ WFIS+E +LV+ G+   + + I  +Y+   DVYV    
Sbjct: 557 KLKEQHDVLQVIRSPYFFEKYFWFISTEGFLVLMGKSGIETDQIYSKYIEDDDVYVS--- 613

Query: 600 HGASSTV-IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT- 657
           +G  S V IKN    + +PP TL QAG F    S+AW  K+ TS WW     +SK     
Sbjct: 614 NGFGSQVWIKNFERTE-IPPNTLMQAGIFANSASEAWSKKVATSPWWCAAKNLSKFDDVG 672

Query: 658 GEYLTVGSFMIRG--KKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFE 715
           G  L  G+F ++    KNFLPP  L+MGF  ++++                     DD +
Sbjct: 673 GGLLPSGAFRLKSDEAKNFLPPAQLVMGFAFMWKIK-------------------TDDDQ 713

Query: 716 DSGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPA----PSHTNAS 758
           ++G+ +   D+ +E D+  E        V  S  PA    PS++N S
Sbjct: 714 EAGYEE---DMPAEIDEMGEVSHPSEEMVEESIGPADNLLPSNSNQS 757


>gi|410077749|ref|XP_003956456.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
 gi|372463040|emb|CCF57321.1| hypothetical protein KAFR_0C03290 [Kazachstania africana CBS 2517]
          Length = 1038

 Score =  202 bits (514), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 202/770 (26%), Positives = 351/770 (45%), Gaps = 144/770 (18%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+    + L++ I G R SN+Y++  S + ++ K         +    K+ +
Sbjct: 28  MKQRISSLDLKLLAQELQKAIEGYRLSNIYNVADSKRQFLLKF--------NKPDSKINV 79

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+++H T Y R     PSGF  KLRKH++++RL  +RQ+  DRI++ +F  G+  +
Sbjct: 80  IVDCGLKVHVTEYTRPTPQLPSGFVAKLRKHLKSKRLTALRQVDNDRILVLEFSDGL--Y 137

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN+LL D+   ++ L R   + +  V          E+ ++F+ T   +  
Sbjct: 138 YLVLEFFSAGNVLLLDNNRCIMALQRIVEEHENKVG---------ELYKIFDSTLFKE-- 186

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSN-KNSNDGARAKQ 237
                        PD   E         +E +   K     D + NSN K   D  + K 
Sbjct: 187 ------------NPDNPLERQFYTEELVREWISSAK-----DTTSNSNTKGPTDKKKIKV 229

Query: 238 PTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVN---------KLEDNAIQVLVLAV 288
            ++  +L   L   P LS  +     L  N+K + +N           E   + +L    
Sbjct: 230 FSIHKLL---LSKQPHLSSDL-----LQKNLKEAGINCASSCLDFVNREQTIVSLLNTTA 281

Query: 289 AKFEDWLQDVISGDIVPEGYILMQ---NKHLGKDHPPTESGSSTQIYDEFCP----LLLN 341
            +++  LQ         +G+IL +   N    KD P  E      +Y+ F P    +   
Sbjct: 282 KEYKQLLQTEFK-----KGFILAKKNVNYDSLKDKPELE-----YLYENFHPFKPYISGA 331

Query: 342 QFRSREFVKFE-TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
           + +S   ++ E +++  LD F+S IES +   + + +E  A  KL     D + R+ +L 
Sbjct: 332 EEKSVRILEIEGSYNRTLDVFFSTIESLKYSLRIQNQELQAKKKLEDARSDNQKRIQSLS 391

Query: 401 QEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID- 459
                +   A  I  N + VD+A  AV+  L  +  W  + +++  E+K  N +A +I+ 
Sbjct: 392 DVQILNETKANAILNNTDLVDSAKQAVQDLLEQQTDWNMIEKLIMNEKKRRNKIAEIIEL 451

Query: 460 KLYLERNCMSL-------------LLSNN----------------LDEMDD--------- 481
            L L+ N +++               S+N                  E+ D         
Sbjct: 452 PLNLKNNKINIKIPLQSPSQFEEETFSDNESVKSSLSDSDFSDESDSELSDFSMEEVVGR 511

Query: 482 EEKTLPV------EKVEVDLALS--AHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
            E T  +      + V V + LS  ++ANA +++  KK    KQ+K     +KA    E 
Sbjct: 512 HENTRKIRAKDDKQHVTVTIDLSLSSYANASQYFNSKKDSAEKQKKMEKHMAKAMTNIEN 571

Query: 534 KTRLQI---LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           +   Q+   L+E     +  +RK ++FEK+NWFISSE YLV++G+ A +N+ I  +Y+  
Sbjct: 572 RIDQQLKKKLRESHTV-LKKIRKPYFFEKYNWFISSEGYLVMTGKSALENDQIYMKYIED 630

Query: 591 GDVYVHADLHGASSTVIKNHRPEQ-PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            D+++       S   IKN  P++  +PP TL QAG F    S+AW +K+V S  W Y  
Sbjct: 631 DDIFMSTSF--GSKAWIKN--PDRGEIPPNTLMQAGIFCASSSKAWSNKVVCSPKWCYAR 686

Query: 650 QVSK-------TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            ++K        A TGE++ +       K++ LPP  LIMG G L++L +
Sbjct: 687 NITKFTQDGSIVAETGEFVLID----EQKQSTLPPAQLIMGIGFLWKLKQ 732


>gi|255722283|ref|XP_002546076.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136565|gb|EER36118.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 857

 Score =  202 bits (513), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 149/505 (29%), Positives = 247/505 (48%), Gaps = 62/505 (12%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLL 340
           +Q +V A+   ED   D+ISG    +GYI+ +     K+   +E      I DEF P   
Sbjct: 89  LQKVVDALHVCEDKYMDLISGKTETQGYIVSR-----KNKNASEDSEFDYICDEFHPF-- 141

Query: 341 NQFRSR----EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
             ++S     +F +   ++  +D+F+S +ES +   + + +++ A  +L K   +++ ++
Sbjct: 142 KPYKSNVTDLKFTEVSGYNKTVDQFFSTLESSKFSLKIEQQKENASKRLEKAKSERDKQI 201

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
            +L  +   + K  ELI+Y+ E V+     V+  L  +M W ++  ++  E+K  NP + 
Sbjct: 202 ESLVAQQQLNSKKGELIQYHSELVEECRRYVQQYLDQQMDWTNIETVIALEQKKNNPTSK 261

Query: 457 LID-KLYLERNCMSLLLSNNLDEMDDEEKT-------------------------LPVEK 490
            I   L L+ N + +LL +  D  D E  +                         +PV++
Sbjct: 262 SIQLPLNLKDNKIKVLLPDFEDYSDSESASATETESESETESESESDSDSDSDDDIPVKR 321

Query: 491 VE------------------VDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE 532
           V+                  +DL+LS+ ANAR +++ KK  E+KQ K   + + A K AE
Sbjct: 322 VQKPAKTKAPKKKQNIIPTWIDLSLSSFANARTYFDSKKTAETKQVKVENSTNLALKNAE 381

Query: 533 KKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
           +K    + +     N  +  +R  +WFEKF WF+SSE YL ++G+D  Q +MI  R+ S 
Sbjct: 382 RKINQDLAKALKQENETLKEIRPKYWFEKFYWFVSSEGYLCLAGKDNSQIDMIYYRHFSD 441

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
            D  V AD+ G+    IKN    + +PP TL QAG F++  S AW+ K+ TSAW ++  +
Sbjct: 442 NDSIVSADMEGSLKVFIKNPFQGEAIPPSTLMQAGIFSMSASTAWNGKVTTSAWVLHGTE 501

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           +SK    G  +  G F    KK +LPP  L+MG G    +DE S   +   R  R +E G
Sbjct: 502 ISKRDFDGSIVPDGEFKYLAKKEYLPPAQLVMGLGFYCLVDEESTKKYAEIRSNREKEHG 561

Query: 711 M-----DDFEDSGHHKENSDIESEK 730
           +     +  +D  + K N  +ESEK
Sbjct: 562 LTIVVDNKKKDLENIKLNMPVESEK 586



 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 5/51 (9%)

Query: 874 ASSQPESIVRKTKIEGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           A+++P+SI   T +      RG+K KLKK   KY DQDEEER +RM  L  
Sbjct: 614 AATEPDSIKSNTPV-----PRGKKSKLKKTAAKYRDQDEEERRLRMDALGT 659


>gi|408381973|ref|ZP_11179520.1| fibronectin-binding A domain-containing protein [Methanobacterium
           formicicum DSM 3637]
 gi|407815421|gb|EKF86006.1| fibronectin-binding A domain-containing protein [Methanobacterium
           formicicum DSM 3637]
          Length = 711

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 178/672 (26%), Positives = 307/672 (45%), Gaps = 66/672 (9%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V +  ++G+R+HTT Y  +    P  F + LRKH++   ++ VRQ  +DRI+  +  + 
Sbjct: 47  RVDVAFQAGLRVHTTQYPPENPKVPPSFPMLLRKHLKNATVKGVRQHNFDRIL--EIDIQ 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTT 173
               + +++EL++QGNI+L D E  ++  L+      + +     ++YP       E   
Sbjct: 105 KEHRFTLVVELFSQGNIILLDEENQIILPLKHRHAQGRKITSKEEYQYP-------EERG 157

Query: 174 ASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGA 233
              L+  L   KE  AN       D + +   ++  LGG    + F  S         G 
Sbjct: 158 IHILNVELEDLKELFANS------DSDLIRTLARSGLGGMYSEEIFLRS---------GV 202

Query: 234 RAKQPTLKTVLGEALGYGPALSEHI--ILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKF 291
             KQP  +T   E      +++E    +      P +    V   E    +       K 
Sbjct: 203 DKKQPANETSESEIESIYQSMTELFKPLKTFKFQPQIVKEVVEGEEKENEEKTGKEEGK- 261

Query: 292 EDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
              ++D+       E     + K   K     E   + +  ++  PL +  +++    +F
Sbjct: 262 ---VKDISKTKKGKEDSKTKKGKEDSKTKKGKEDSKTKKGKEDVLPLDILTYQNFHKERF 318

Query: 352 ETFDAALDEFYSK---IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ETF+ A DEFYS     + ++ ++   AKE   + K  +I   QE  +   ++ +  + +
Sbjct: 319 ETFNQAADEFYSGKVGADIKKVQEDIWAKEVGKYEKRLRI---QEETLEKFQKTIVETKR 375

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
              LI  +  ++   +  +  A   + SW ++A  +K+ RK G   A +I  +    + M
Sbjct: 376 KGNLIYSHYSEIQNLLDIIHQA-REKFSWMEIASKLKKARKEGMVQAQIIQSM----DKM 430

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            +L  N           L  E V VD  L    NA ++Y   KK + K +    A  +  
Sbjct: 431 GVLTLN-----------LEGETVTVDANLEIPENAEKYYNKGKKAKRKIKGVNMAIERTK 479

Query: 529 KAAEKK-TRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           K  E+K  + ++  E+       +RK + WFEK  WF+SS+ +LVI GRDA  NEM+VKR
Sbjct: 480 KDVERKRNKRELALERVRVPQKRVRKELKWFEKLRWFLSSDGFLVIGGRDAGTNEMVVKR 539

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWW 645
           ++   D+Y+H+D+HGA S VIK    E+ +P  T+++AG      S AW     +   +W
Sbjct: 540 HLDNPDIYLHSDIHGAPSVVIKKGEAEE-IPESTIHEAGNLAASFSSAWSKGYGSQDVYW 598

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVR 705
           V+P QVSKT  +GE++  G+F+IRG +N+L   PL +  G++          +  ER + 
Sbjct: 599 VHPDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIAVGIV---------DYEGERIMA 649

Query: 706 GEEEGMDDFEDS 717
           G  E +  + D+
Sbjct: 650 GPTEAVSKYTDN 661


>gi|261335340|emb|CBH18334.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1100

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 157/548 (28%), Positives = 254/548 (46%), Gaps = 99/548 (18%)

Query: 235 AKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
           A+  T ++ L     +GP+L++HI+  TG V ++K + +    D   + L+  +   E W
Sbjct: 192 AEYETTRSTLSATHHFGPSLADHILTVTG-VKSVKKANMTCSGDEMFEKLLPGM--LEAW 248

Query: 295 LQDVISGDIVPEGYILMQ---------NKHLGKDHPPTESGSSTQI-------------- 331
                +   +P G  L+           +  GK  P  ++G  T                
Sbjct: 249 R---FAFSPLPTGGYLISKTAATKGRGTQERGKAPPHVDAGVGTTADGGEAGSGVEKQPR 305

Query: 332 -------YDEFCPLLLNQFRSREFVK--FETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
                  Y++F P+LL Q+R          +F +  D F+   E ++ EQ +        
Sbjct: 306 PHLQGVQYEDFSPVLLAQYRGDAVSASYLPSFGSVCDAFFLYTEKEKIEQHNDRATTCVL 365

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K  K   D   R+  L++  + + +  ELI  N E +D AI  +  ALA  + WE L R
Sbjct: 366 SKKEKFERDHNRRIAALERSEEENTRKGELIIQNAEKIDEAIGLINGALAAGIQWEALRR 425

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM---DDEEKTLPV----------- 488
           ++K+    G+PVA ++ +L+L+RN +S+L+  N +++   +DEE  + V           
Sbjct: 426 LLKQRHAEGHPVAYMVHELFLDRNSISVLVEENDEDVECYEDEESKVKVGGKGENHRYGG 485

Query: 489 ---EK-------------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAE 532
              EK             +EVDL+ +A+ANA  ++  KK   +K EKTI A +KA   AE
Sbjct: 486 NSGEKKDRVEGCSRTPSVIEVDLSKTAYANAASYFTQKKANRAKLEKTIAATAKAAAGAE 545

Query: 533 KKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGD 592
           KK      +++T   I+  R   W+EKFNWF +S   LV+ G D Q  E++V+R M  GD
Sbjct: 546 KKGERLAAKKQTKKAIATERHRCWWEKFNWFRTSCGDLVLQGHDTQSTELLVRRIMRLGD 605

Query: 593 VYVHADLHGA-------------SSTVIKNHRPEQP------------VPPLTLNQAGCF 627
           V+VH+D+ G              +ST       E+             +  ++L++A  +
Sbjct: 606 VFVHSDVEGGLPCILRAAGSAWDASTAFGEGESEENSIQVGESTKGWLIHMISLDEAAAW 665

Query: 628 TVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            VC S AW+SK    AWWV+  Q+      G YL      + G+KN+L P PL++G GLL
Sbjct: 666 CVCRSSAWESKFSVGAWWVHASQIVGGTAAGCYL------LSGEKNYLRPRPLMLGCGLL 719

Query: 688 FRLDESSL 695
           FR+   ++
Sbjct: 720 FRISSRAI 727



 Score =  149 bits (376), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 77/164 (46%), Positives = 109/164 (66%), Gaps = 12/164 (7%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  L G+R +NVYD+ P+T++FK  NS         +K  LL
Sbjct: 1   MVKQRMTALDVRASVEEMRTELQGLRLTNVYDIPPRTFLFKFGNSE--------KKRTLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+GVRLH T   R+K   P+ FTL+LRKH+R  RL+ V QL +DR + F+FG+   A Y
Sbjct: 53  LENGVRLHLTQLVREKPKVPTQFTLRLRKHVRAWRLDSVTQLQHDRTVDFRFGVAEGASY 112

Query: 120 -VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+++GNI+LTD E+ ++ LLR+H+DD  GV +  R  YP
Sbjct: 113 HIIVELFSKGNIVLTDHEYRIMLLLRAHKDD--GVNMFVRELYP 154


>gi|294496348|ref|YP_003542841.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
           5219]
 gi|292667347|gb|ADE37196.1| Fibronectin-binding A domain protein [Methanohalophilus mahii DSM
           5219]
          Length = 662

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 185/720 (25%), Positives = 310/720 (43%), Gaps = 127/720 (17%)

Query: 2   VKVRMNTADVAAEVKCL----RRLIGMRCSNVYD-----LSPKTYIFKLMNSSGVTESGE 52
           +K  M +ADVAA    L      L+  +   +Y      L    YIFK            
Sbjct: 1   MKEEMTSADVAALATELGTGENSLVDSKIGKIYQPGESLLRIHLYIFK------------ 48

Query: 53  SEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             K  LL+E+G RLH + Y       P  F + LRKHI   R+   RQ  +DRII     
Sbjct: 49  KGKANLLIEAGSRLHLSEYIPPSPKNPQSFPMLLRKHIMGGRITYFRQYDFDRIIEIGIK 108

Query: 113 LGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERT 172
            G +   +++E++ QGNI+L DS+  ++  +       + +     ++YP          
Sbjct: 109 RGDDETVLVVEIFGQGNIILLDSDRKIILPMNPVTFKGRRIRSGEIYQYP---------- 158

Query: 173 TASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
                 A LT         P  VNED                      L +  + + +D 
Sbjct: 159 -----EAQLT---------PLDVNED---------------------QLCEVFSNSDSDV 183

Query: 233 ARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFE 292
            R         L      G  LSE + L +G+  N+  SEV+                  
Sbjct: 184 VRT--------LATRFNLGGILSEEVCLRSGVDKNLPASEVDPQ---------------- 219

Query: 293 DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE 352
                 I+  ++    +L      G+  P T S   ++   +  P  L ++   E   ++
Sbjct: 220 ------IASKLIEAIGVLFSPLEKGQLKPCTVSKPGSKETFDVVPFDLEKYADFEKNYYD 273

Query: 353 TFDAALDEFYSKIESQRAEQQHKA----KEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           +F+ ALD+F+ K  +   EQ+ +A    K +  F +  K    QE  +   +++++++  
Sbjct: 274 SFNKALDDFFGKRAAISLEQKKEASVKEKTEDVFQRRLK---QQEGAIKKFEKDIEKNTS 330

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
           +AE I  + +D++  +  +  A     SW+++  ++ + +          D+L   +  +
Sbjct: 331 IAEKIYEHYQDIELLLQTLLDAREKDYSWKEIQSIISDAK----------DELPAAKKII 380

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
           ++  S  L  +D + K     K  +D+ L+   NA R+YE  KK E K++  + A     
Sbjct: 381 NIDGSQGLVLLDLDGK-----KANIDVRLTVPQNAMRYYEKAKKLEKKRKGALAA----- 430

Query: 529 KAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
              + K  ++  +     +   + K HW+E+F WF SS+ +LV+ GRDA  NE IVK+YM
Sbjct: 431 -IEDTKNAMKKKKAAPKKHFKVVHKKHWYERFRWFFSSDGFLVVGGRDATTNEEIVKKYM 489

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVY 647
            K D+  H    GA  TV+K    E  +P  TL +A  F V  S  W     +   +W+Y
Sbjct: 490 EKRDLVFHTQAPGAPITVVKTGGKE--IPDTTLQEAAEFVVSFSSIWKGGQFSGDCYWIY 547

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           P QV+KT  +GEYL  GSF+IRG++N+    P+    GL  + +  ++G  ++  + RGE
Sbjct: 548 PEQVTKTPESGEYLKKGSFIIRGERNYYRDVPVRAAVGLELKPETRAIGGPVSAVKARGE 607


>gi|146419620|ref|XP_001485771.1| hypothetical protein PGUG_01442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 873

 Score =  199 bits (507), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 132/442 (29%), Positives = 211/442 (47%), Gaps = 55/442 (12%)

Query: 321 PPTESGSSTQIYDEFCPLL-----LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHK 375
           P  ++     +YDEF P       L  F+   F +   ++  +D F+S ++S++ E + +
Sbjct: 149 PSDDNKDLEYLYDEFHPFKPYKENLELFK---FTEIRGYNKTVDTFFSTLDSKKHELRME 205

Query: 376 AKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRM 435
            ++  A  +L     +++ ++  L+ + + + K  + I Y+ + V   I +V+  L  +M
Sbjct: 206 QQKHNAKKRLLNAREERDKQIDNLRIQQEMNSKKGDAIIYHADLVSECIASVQTLLDQQM 265

Query: 436 SWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDD------------- 481
            W ++  ++K E+  GN VA  I   L L  N + L L +  D M D             
Sbjct: 266 DWANIESLIKLEQSRGNSVAKTIKLPLNLTENKIGLKLPDT-DSMYDPADIDSELDSETS 324

Query: 482 ------------------------------EEKTLPVEKVEVDLALSAHANARRWYELKK 511
                                         + K +P   V +DL LS  ANAR ++E KK
Sbjct: 325 SESETESESESESGSESEDETPPKRMSKKAKSKEIPALSVWIDLLLSPFANARTYFESKK 384

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--ISHMRKVHWFEKFNWFISSENY 569
           + ESKQEK       A + A+KK    + +     N  +  +R  +WFEKF WF+SSE Y
Sbjct: 385 QAESKQEKVEKNTDMALRNAQKKIEQDLAKNLKNENETLRQVRPKYWFEKFFWFVSSEGY 444

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           L I+GRD  Q +MI  R+ S  D +V +D+ G+   V+KN    + +PP TL QAG F +
Sbjct: 445 LCIAGRDDAQVDMIYYRHFSDNDFFVSSDIEGSLKVVVKNPYRGEALPPYTLMQAGMFAM 504

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
             S AW+ K+ TS W++  + V+K    G  +  G+F  +GKK FLPP  L+MG G  F 
Sbjct: 505 SASAAWNGKITTSPWFLAGNDVTKLDFDGSLVPSGTFNYKGKKEFLPPTQLVMGLGFYFL 564

Query: 690 LDESSLGSHLNERRVRGEEEGM 711
            D+ +   +   R  R  E G+
Sbjct: 565 GDDDTTKKYGETRITRQNESGL 586



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 27/37 (72%)

Query: 888 EGGKISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           E  K++RG++ K+K+  +KY DQDE+ER +RM +L  
Sbjct: 656 EPHKLTRGKRSKMKRAAKKYADQDEDERKLRMEMLGT 692


>gi|157865120|ref|XP_001681268.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68124563|emb|CAJ02783.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1224

 Score =  199 bits (507), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 146/524 (27%), Positives = 250/524 (47%), Gaps = 48/524 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ-DV 298
           ++T++     +GP L++H++  TG VPN       +  ++    L   + +  D  + D+
Sbjct: 255 VQTLVAGIQHFGPDLAQHVLTVTG-VPNAPRKSWTQSTESIFATLCPGLLEAFDLAKVDL 313

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESG------------SSTQIYDEFCPLLLNQFRSR 346
            S      GY++      G     +               +  + Y+ F P+LL Q+ + 
Sbjct: 314 TSAG----GYLIKPKARPGSAAHASAPPAPGASAGAADLVAVAERYESFTPILLAQYAND 369

Query: 347 --EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
             E +   +F    DEF+   E++R +  +  +++ A  K +K   D   R++ L+ ++ 
Sbjct: 370 GVEALYRTSFGRVCDEFFLLTETERIDASNAKRKNTAKSKEDKFAADHARRINALETDIA 429

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            +    E +  N + VD AI  +  ALA  +SW+ L  ++K     G+PVA +I  L+LE
Sbjct: 430 ANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHPVAYMIHDLFLE 489

Query: 465 RNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           RN +S+LL   LDE + EE   +P   VEV L+ +AHANA  ++  +K+  SK E+T+ A
Sbjct: 490 RNSISVLLETALDEENGEEDCDVPPLVVEVALSKTAHANAADYFSKQKQYRSKLERTVAA 549

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             KA   A +K   +   +K    I   R+ +W+EKF WF ++   LV+ G+D Q  E++
Sbjct: 550 TEKAAAGAARKGARKAAGQKEKKVIVKERQRNWWEKFFWFRTTAGDLVLRGKDVQSTELL 609

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHR-------------------PEQPVPPLTLNQA 624
           V+R M  GD+++H ++ GA   +++                        QPV   ++ +A
Sbjct: 610 VRRVMHLGDLFIHCEVDGALPCLLRPMNDVWQELGGNNAGGDLTASPATQPVALRSVCEA 669

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G + V  S AW+ K  T +WWVY  QV+    TG YL        G+++ LPP  + +G 
Sbjct: 670 GAWCVAFSGAWERKQTTGSWWVYASQVTGGTATGAYLYA------GERHHLPPQSMSLGC 723

Query: 685 GLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIES 728
            LLF +  +        R       G DD  +   H E++ ++S
Sbjct: 724 ALLFYVARTVCEPAAVARAASAACAGDDD--EGAEHVEDNAVDS 765


>gi|396081612|gb|AFN83228.1| putative RNA-binding protein [Encephalitozoon romaleae SJ-2008]
          Length = 648

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 114/341 (33%), Positives = 187/341 (54%), Gaps = 40/341 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F TF+ A + F+        + + K   +    K++K+   QEN +  ++QE     K A
Sbjct: 245 FSTFNDAAEFFF--------QNRKKFGRNDRESKVDKVRKRQENYMKEMEQERQSYRKKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   ++   N++ W D  +  ++E K GN ++  I K  ++   C  
Sbjct: 297 ELLEENADFVNKILDIFKIVKKNKVRWTDFEKFREQENKKGNEISKAIVKTDFISHTCTI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
                           L  E++++D   S  +N  R+Y+  KK E K +KT  +  +  K
Sbjct: 357 ---------------ALEGEEIQIDFETSLFSNISRFYQKNKKLEEKIKKTRDSLEEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K     ++ K V      R ++WFEKF++F SS+  LVI G++AQQNE++VK+++ 
Sbjct: 402 KVAPK-----VETKKVT-----RALYWFEKFHFFFSSDGILVIGGKNAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D+Y H D+HG+SS ++K  +P Q     T+ +A    +C S+ W++ +V+  W+VY  
Sbjct: 452 PTDLYFHGDMHGSSSIIVK--KPTQK----TIEEAASMALCMSKCWEANVVSPVWYVYGE 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           QVSKTAP+GEYLT GSFMI+GKKN++  H +  G GLLF++
Sbjct: 506 QVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 64/134 (47%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  LR RL+G    N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELRPRLVGKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEHDMDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116


>gi|18977764|ref|NP_579121.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
 gi|397651884|ref|YP_006492465.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
 gi|18893505|gb|AAL81516.1| hypothetical protein PF1392 [Pyrococcus furiosus DSM 3638]
 gi|393189475|gb|AFN04173.1| hypothetical protein PFC_06185 [Pyrococcus furiosus COM1]
          Length = 649

 Score =  199 bits (505), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 192/695 (27%), Positives = 321/695 (46%), Gaps = 127/695 (18%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+    + L+ +I G R   +Y    +   FKL + +GV       +V LL+
Sbjct: 1   MKESMSSVDIKYITEELKDMIVGSRVEKIYHEGNEIR-FKL-HKTGVG------RVDLLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T Y ++    P+ F + LRK++  + LED+RQ  +DR+++  FG     +++
Sbjct: 53  EAGKRIHITTYVKENLQ-PTSFAMLLRKYLSGKFLEDIRQYEFDRVVILSFG----EYFL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I EL+ +GNI+    ++ ++  LR     D+  AI  + +Y      VF  + A+ L  +
Sbjct: 108 IAELFGRGNIIFVTKDWEIIGALRYEEFKDR--AIKPKIKY------VFPPSRANPLKVS 159

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
               KE        +N  G  +  A               L+KN                
Sbjct: 160 FEEFKEII------LNSQGTEIVRA---------------LAKN---------------- 182

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED-----WL 295
                     G   SE  +L   +  + K+ E+++ E   +   +L V   E      + 
Sbjct: 183 -------FSIGGLYSEETLLRAKIDKDRKVDELSEEELRLVYDTLLTVLNDEKKPNIVYN 235

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDE-FCPLLLNQFRSREFVKFETF 354
           ++ +  D+VP   I +Q     +++      S ++  DE F  L + + R  +  + E  
Sbjct: 236 KEGVMVDVVP---IDLQ---WYREYTKRYYESFSEALDEYFGKLTIEKARLEKTKQLEER 289

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
             AL+     I  +R E+Q K  E  A    N+   D     +++  E+ R +  A L +
Sbjct: 290 RKALE-----ISLRRIEEQIKGFEKEAM--TNQEKGDALYAHYSIVNEILRVISSA-LKQ 341

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           Y +E+V                     + ++E +KAG P A +I  + +  N ++L    
Sbjct: 342 YGVEEVK--------------------KRIEEGKKAGYPWAKMI--IDVTDNKVTL---- 375

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA-FKAAEK 533
           NLD +          KV +D+  S   NA  +YE  KK + K E    A+ +   K  E 
Sbjct: 376 NLDGI----------KVSLDVEKSLEENAELYYERAKKAKKKLEGAKIAYEETKRKLIEL 425

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
           +  ++   ++        +K  WFEKF WFISSE +LVI G+DA  NE++VK++M + D+
Sbjct: 426 EKEIERESKEINIKKITRKKKKWFEKFRWFISSEGFLVIGGKDATTNEIVVKKHMDENDI 485

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVS 652
           Y HAD+ GA   +IKN R        T+ +A  F V  S+AW   + ++ A+WVYP QVS
Sbjct: 486 YCHADIWGAPHVIIKNGR---NASEKTIREACQFAVAMSRAWSEGLASADAYWVYPEQVS 542

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           K AP GEYL  G+FM+ GK+N++   PL +  G++
Sbjct: 543 KQAPAGEYLPKGAFMVYGKRNWIHGIPLKLAVGIV 577


>gi|257215816|emb|CAX83060.1| Serologically defined colon cancer antigen 1 [Schistosoma
           japonicum]
          Length = 521

 Score =  198 bits (504), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 96/181 (53%), Positives = 122/181 (67%), Gaps = 19/181 (10%)

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           KT+A I+ +RK  WFEKF WFISSENYLV++G D+QQNE++VKRY+  GD++VHAD+HGA
Sbjct: 5   KTIAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQQNEVLVKRYLKSGDIFVHADIHGA 64

Query: 603 SSTVIKN-------------------HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSA 643
           S+ +IK                    HR     PP TL +A    V  S AW S ++T A
Sbjct: 65  STVIIKARHLTSEESDFSKHESLLHLHRSLPLPPPKTLLEAANMAVVLSSAWQSHVLTRA 124

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERR 703
           WWV+  QVSKTAP+GEYLT GSF+IRGKKN+LPP P   GFG++F+L E S+  H  ERR
Sbjct: 125 WWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGIMFKLHEDSVFKHKGERR 184

Query: 704 V 704
           +
Sbjct: 185 I 185


>gi|398011164|ref|XP_003858778.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322496988|emb|CBZ32058.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 1228

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 140/484 (28%), Positives = 235/484 (48%), Gaps = 46/484 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           ++T++     +GP L++H++  TG++ N       +  DN  + L   +   E +  D+ 
Sbjct: 255 VQTLVAGIQHFGPDLAQHVLTVTGVL-NTPRKSWTQSADNVFEALRPGL--LEAF--DLA 309

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSST-------------QIYDEFCPLLLNQFRSR 346
             D+   G  L++ K        T +  +              + Y+ F P+LL Q+ + 
Sbjct: 310 KVDLTSAGGYLIKPKAKPASTAHTPAPPAPGASAAAADLVAVAEQYESFTPILLAQYTND 369

Query: 347 --EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
             E +   +F    DEF+   E++R +  +  +   A  K +K   D   R++ L+ ++ 
Sbjct: 370 GVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHARRINALETDIA 429

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            +    E +  N + VD AI  +  ALA  +SW+ L  ++K     G+PVA +I  L+LE
Sbjct: 430 ANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHPVAYMIHDLFLE 489

Query: 465 RNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           RN +S+LL   LDE   EE   +P   VEV L+ +AHANA  ++  +K+  SK E+T+ A
Sbjct: 490 RNSISVLLETVLDEEKGEEDCDVPPLVVEVTLSKTAHANAADYFSKQKQHRSKLERTVAA 549

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             KA   A +K   +   +K    I   R+ +W+EKF WF ++   LV+ G+D Q  E++
Sbjct: 550 TEKAAAGAARKGARKAAAQKEKKVIVKERQRNWWEKFFWFRTTAGDLVLRGKDVQSTELL 609

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHR-------------------PEQPVPPLTLNQA 624
           V+R M  GD+++H D+ G+   +++                        QPV   ++ +A
Sbjct: 610 VRRVMRLGDLFIHCDVDGSLPCLLRPMNDVWQELGGNNAGGDLTASPATQPVALHSVCEA 669

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G + V  S AW+ K  T +WWVY  QV+    TG YL        G+++ LPP  + +G 
Sbjct: 670 GAWCVAFSGAWERKQTTGSWWVYASQVTGGTATGAYLYA------GERHHLPPQSMSLGC 723

Query: 685 GLLF 688
            LLF
Sbjct: 724 ALLF 727



 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 74/165 (44%), Positives = 106/165 (64%), Gaps = 12/165 (7%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+Y++  K ++FK  +       GE++K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRATLIGLRLLNIYNIGNKMFLFKFGH-------GENKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM--NA 117
            ESG R H T  AR+K   PS FTLKLRKHIR  RL+ + QL +DR I   FG+      
Sbjct: 54  -ESGTRFHLTELAREKPKVPSQFTLKLRKHIRAWRLDSIAQLQHDRTIDLCFGVPSTEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            ++I+EL+++GN++LTD  +T++ LLR+HRDD+ G+ +M    YP
Sbjct: 113 FHIIVELFSKGNVILTDYAYTIMMLLRTHRDDE-GLKLMVNQVYP 156


>gi|146078492|ref|XP_001463556.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134067642|emb|CAM65921.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 1228

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 140/484 (28%), Positives = 235/484 (48%), Gaps = 46/484 (9%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           ++T++     +GP L++H++  TG++ N       +  DN  + L   +   E +  D+ 
Sbjct: 255 VQTLVAGIQHFGPDLAQHVLTVTGVL-NTPRKSWTQSADNVFEALRPGL--LEAF--DLA 309

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSST-------------QIYDEFCPLLLNQFRSR 346
             D+   G  L++ K        T +  +              + Y+ F P+LL Q+ + 
Sbjct: 310 KVDLTSAGGYLIKPKAKPASTAHTPAPPAPGASAAAADLVAVAEQYESFTPILLAQYTND 369

Query: 347 --EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
             E +   +F    DEF+   E++R +  +  +   A  K +K   D   R++ L+ ++ 
Sbjct: 370 GVEALYRTSFGRVCDEFFLITETERIDASNAKRTKTAKSKEDKFAADHARRINALETDIA 429

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            +    E +  N + VD AI  +  ALA  +SW+ L  ++K     G+PVA +I  L+LE
Sbjct: 430 ANQMKGEQLILNADRVDEAIQLINGALATGISWDALRMLLKRRHAEGHPVAYMIHDLFLE 489

Query: 465 RNCMSLLLSNNLDEMDDEEKT-LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
           RN +S+LL   LDE   EE   +P   VEV L+ +AHANA  ++  +K+  SK E+T+ A
Sbjct: 490 RNSISVLLETVLDEEKGEEDCDVPPLVVEVTLSKTAHANAADYFSKQKQHRSKLERTVAA 549

Query: 524 HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMI 583
             KA   A +K   +   +K    I   R+ +W+EKF WF ++   LV+ G+D Q  E++
Sbjct: 550 TEKAAAGAARKGARKAAAQKEKKVIVKERQRNWWEKFFWFRTTAGDLVLRGKDVQSTELL 609

Query: 584 VKRYMSKGDVYVHADLHGASSTVIKNHR-------------------PEQPVPPLTLNQA 624
           V+R M  GD+++H D+ G+   +++                        QPV   ++ +A
Sbjct: 610 VRRVMRLGDLFIHCDVDGSLPCLLRPMNDVWQELGGNNAGGDLTASPATQPVALHSVCEA 669

Query: 625 GCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
           G + V  S AW+ K  T +WWVY  QV+    TG YL        G+++ LPP  + +G 
Sbjct: 670 GAWCVAFSGAWERKQTTGSWWVYASQVTGGTATGAYLYA------GERHHLPPQSMSLGC 723

Query: 685 GLLF 688
            LLF
Sbjct: 724 ALLF 727



 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 74/165 (44%), Positives = 106/165 (64%), Gaps = 12/165 (7%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM   DV A V+ +R  LIG+R  N+Y++  K ++FK  +       GE++K +LL
Sbjct: 1   MVKQRMTALDVRATVEEMRATLIGLRLLNIYNIGNKMFLFKFGH-------GENKKNVLL 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM--NA 117
            ESG R H T  AR+K   PS FTLKLRKHIR  RL+ + QL +DR I   FG+      
Sbjct: 54  -ESGTRFHLTELAREKPKVPSQFTLKLRKHIRAWRLDSIAQLQHDRTIDLCFGVPSTEGC 112

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            ++I+EL+++GN++LTD  +T++ LLR+HRDD+ G+ +M    YP
Sbjct: 113 FHIIVELFSKGNVILTDYAYTIMMLLRTHRDDE-GLKLMVNQVYP 156


>gi|336476370|ref|YP_004615511.1| fibronectin-binding A domain-containing protein [Methanosalsum
           zhilinae DSM 4017]
 gi|335929751|gb|AEH60292.1| Fibronectin-binding A domain protein [Methanosalsum zhilinae DSM
           4017]
          Length = 660

 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 183/697 (26%), Positives = 307/697 (44%), Gaps = 123/697 (17%)

Query: 2   VKVRMNTADVAAEVKCL----RRLIGMRCSNVYD-LSPKTYIFKLMNSSGVTESGESEKV 56
           +K  M++ADV+A V  L      LI  +   +Y   S +  I   ++  G          
Sbjct: 1   MKDEMSSADVSALVYELVHGPYNLIDAKIGKIYQPFSDEIRINLFIHGKGRDN------- 53

Query: 57  LLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
            L++E+G R H +         P  F + LRKH+   R+ D+ Q  +DRII  +   G  
Sbjct: 54  -LILEAGKRAHISKNLPPNPKLPPSFPMLLRKHLSGGRILDISQYDFDRIIEIRIVRGGV 112

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASK 176
              ++ EL+A+GNI+L DSE  ++  ++      + +     + YP       E  T  K
Sbjct: 113 ETVLVAELFARGNIVLLDSERKIILPMKPVTFRGRKIRSGETYEYPESKVNPLE-ITEEK 171

Query: 177 LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
           +   L +S        D V       + A+K NLGG                        
Sbjct: 172 MKDLLYTSTS------DLVR------TIATKMNLGGN----------------------- 196

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQ 296
                            LSE I L +G+  N    E++   D  I +L  +V    D L 
Sbjct: 197 -----------------LSEEICLVSGIDKNRSAKEID---DQEISILCESV---NDVLS 233

Query: 297 DVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE---- 352
            ++SGD+ P    +++ K+                 D+  P+ +N F  + F K+E    
Sbjct: 234 PLVSGDLKPN---IVKKKN-----------------DDLEPINVNPFDLKIFEKYEKEYY 273

Query: 353 -TFDAALDEFYSKIESQRAEQQ-HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
            +F+ ALDE++ K   ++ +++    K+D A     +    Q+  +   +++ ++ V+ A
Sbjct: 274 ESFNEALDEYFGKASLEKVDEKVETVKKDKA-GVFERRLQQQKTAISKFEKQAEKYVQAA 332

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E I    +D++    A+  A +   SW ++  ++K  + +      +I+   ++     +
Sbjct: 333 EKIYSYYQDIEHITDALNNARSKGYSWSEIKSIIKSSKDSTQAAKSIIN---IDPGKGII 389

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           +L  +LD  +          VE+++  S   NA  +YE  KK   K++  + A       
Sbjct: 390 VL--DLDGTN----------VEININKSIPQNAEMYYEKAKKVTRKRDGALKA------L 431

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
            E K  +Q  ++K  +    +RK  W+E+F WFISS+ +LV+ GRDA  NE IVK+YM K
Sbjct: 432 EETKASMQKKEKKEPSKRKIIRKPSWYERFRWFISSDGFLVVGGRDADTNEEIVKKYMEK 491

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD-SKMVTSAWWVYPH 649
            D++ H    GA  T+IK    E  VP  T+ +A  F V +S  W         + V P 
Sbjct: 492 RDLFFHTQAPGAPVTIIKTEGKE--VPSTTIEEASRFVVSYSSLWKLGHFAGDCYMVKPE 549

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           QVSKT  +GEYL  GSF+IRG++N+    P+ +  G+
Sbjct: 550 QVSKTPESGEYLKKGSFVIRGERNYFKNVPMRVAVGI 586


>gi|84489327|ref|YP_447559.1| RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
 gi|84372646|gb|ABC56916.1| predicted RNA-binding protein [Methanosphaera stadtmanae DSM 3091]
          Length = 666

 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 187/692 (27%), Positives = 301/692 (43%), Gaps = 116/692 (16%)

Query: 6   MNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  D+   V  L + LI  R    Y     T   KL       ++GE  K L++ ++GV
Sbjct: 1   MSNVDIHRMVNELNKELINTRIDKAYQPDVDTIRIKL------RKAGEGRKDLVI-QAGV 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-E 123
           R+H T Y +     P  F + LRKH+    +  + Q  +DRII  +  +     Y IL E
Sbjct: 54  RIHLTNYPQPNPTIPPNFPMLLRKHLSGGSITSIEQHNFDRII--KIKVQKKEEYTILVE 111

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTS 183
           L+++GNI+L                 DK   I+S  ++ T              H    +
Sbjct: 112 LFSKGNIILL----------------DKDNNIISPLKHKT-------------WHDRKIT 142

Query: 184 SKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTV 243
           + E     P+K    G N++N   E+L         D+++    N   G  A+       
Sbjct: 143 AHEEYKYPPEK----GININNCRFEDLKTVINTSDRDITRTLATNGLGGLYAE------- 191

Query: 244 LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDI 303
             E + Y     E       L   +   E+ +L +NAI  L   +             + 
Sbjct: 192 --EVISYTSINKEK------LAKELTDDEITQL-NNAINELFNKI-------------ET 229

Query: 304 VPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYS 363
            P+  I++      KD                 P+ LN++   +   FETF+ A DEFYS
Sbjct: 230 NPQPQIILDENDKNKD---------------LVPITLNKYAQFKSKSFETFNMAADEFYS 274

Query: 364 K---IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           K    + +  E++  AK    F K  K+   QE  +    + ++      + I  +  ++
Sbjct: 275 KKIVSDIKNKEEKLWAKRIGKFEKRLKM---QEETLEGFYKTIEDKQHKGDTIYAHYNEI 331

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGN-PVAGLIDKLYLERNCMSLLLSNNLDEM 479
              I  +  A  N  SW+++  ++K+ +K G  P   +I+ +    + M ++   NL ++
Sbjct: 332 QQIINVIHQAREN-YSWKEIGSIIKKSKKEGKIPELEMIESI----DKMGVI---NL-KL 382

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFKAAEKKTR 536
           DD         V++D  +    +  ++Y   KK + K +   K I       K  E K  
Sbjct: 383 DDTH-------VQIDSNIGIPESTEKYYNKGKKAKRKIDGVNKAIENTKSEIKKLEDKKE 435

Query: 537 LQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
           + I   +        R++ W+EK  WFIS + YLVI GRDA  NE +VK+Y    D+Y+H
Sbjct: 436 VAIELLRQKQEKREKRELKWYEKLRWFISRDGYLVIGGRDANSNEQVVKKYSKNNDIYLH 495

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTA 655
            D+HGA ST+I+N + E  +P  TL  A CF    S AW     +  A+WV   QVSKT 
Sbjct: 496 CDIHGAPSTIIQN-KNEDEIPESTLYDAACFASSFSSAWTEGFSSYDAYWVTLDQVSKTP 554

Query: 656 PTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            +GE+L  G+F+IRGKKNF+   P+++  G++
Sbjct: 555 QSGEFLKKGAFVIRGKKNFIRNVPVLIAIGVV 586


>gi|332157694|ref|YP_004422973.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
 gi|331033157|gb|AEC50969.1| hypothetical protein PNA2_0051 [Pyrococcus sp. NA2]
          Length = 650

 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 118/358 (32%), Positives = 198/358 (55%), Gaps = 28/358 (7%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E V F+TF  ALDE++ K+  ++A ++   K +    +L      QEN 
Sbjct: 243 VPIDLRWYDGYEKVYFDTFSKALDEYFGKLTIEKAREEKTKKLEEKKKQLIATLKRQENM 302

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   K+E+ R+ ++A+LI  N + VD  +  +  A+  R+ WE+L R V+E +K GN +A
Sbjct: 303 IKGFKEEMRRNQEIADLIYANYQLVDNLLKELSKAV-ERLGWEELIRRVEEGKKKGNRIA 361

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            +I  +  + N +++       E++D++  L +++         + NA  +YE  KK + 
Sbjct: 362 MMIKSINPQENSVTI-------EIEDKKVRLYIDR-------DINENAEIYYEKAKKAKH 407

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH-----WFEKFNWFISSENYL 570
           K E       KA++  +KK      + +       ++K+      WFEKF WFISSE +L
Sbjct: 408 KLE----GAKKAYEELKKKLEQVEKEIEEEEKKVQVKKIERRKKKWFEKFRWFISSEGFL 463

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI G+DA  NE++V++YM + D+Y HAD+ GA   +IK+ R        T+ +A  F V 
Sbjct: 464 VIGGKDATTNEIVVRKYMGENDIYCHADIWGAPHVIIKDGR---RASEKTIFEACQFAVS 520

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            S+AW   + ++ A+WVYP QV K AP+GE+L  G+FM+ GK+N++   PL +  G++
Sbjct: 521 MSRAWSEGLYSADAYWVYPEQVKKQAPSGEFLPKGAFMVYGKRNWMHGIPLKLAVGII 578



 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 83/162 (51%), Gaps = 13/162 (8%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+   V+ L+  L G R   VY    +  I        + ++GE  + L++ 
Sbjct: 1   MKEEMSSVDIRYIVQELKEELKGARIDKVYHEGDEVRI-------KLHKTGEGRRDLII- 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G RLH T Y ++  ++PS F + LRK++    ++++ Q  +DRI+  + G       +
Sbjct: 53  EAGKRLHLTTYIKESSSSPSSFAMLLRKYLSGAFVDEIEQHDFDRIVKIRVG----KFTI 108

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I EL+ +GN++L D    +L  +R     D+ +     +++P
Sbjct: 109 IAELFRRGNVILVDENNVILGAIRYEEFKDRSIKPKHEYKFP 150


>gi|401826788|ref|XP_003887487.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
           50504]
 gi|395460005|gb|AFM98506.1| hypothetical protein EHEL_061370 [Encephalitozoon hellem ATCC
           50504]
          Length = 648

 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 112/341 (32%), Positives = 187/341 (54%), Gaps = 40/341 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F TF+ A + F+        + + K  ++    K++K+   QEN +  ++QE +   K A
Sbjct: 245 FPTFNDAAEFFF--------QSRKKFGKNDRESKVDKVRKRQENYMKEMEQEGESYRKKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   +V   N++ W D  +  ++E + GN ++  I K  ++   C  
Sbjct: 297 ELLEANADFVNKILDIFKVVKKNKVKWTDFEKFREQENRKGNEISKAIVKTDFISHTCTI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           +L                 E++++D  ++   N  R+Y+  KK E K  KT  +  +  K
Sbjct: 357 VLEG---------------EEIQIDFEVTLFNNVSRFYQKSKKLEEKIMKTRDSLEEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K   + +           R ++WFEKF++F SS+  LVI GR+AQQNE++VK+++ 
Sbjct: 402 KIAPKVETKKIT----------RALYWFEKFHFFFSSDGVLVIGGRNAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D+Y H D+HG+SS ++K     +P P  T+ +A    +C S+ W++ +V+  W+VY  
Sbjct: 452 PNDLYFHGDMHGSSSIIVK-----KPTPK-TIEEAASMALCMSKCWEANVVSPVWYVYGE 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRL 690
           QVSKTAP+GEYLT GSFMI+GKKN++  H +  G GLLF++
Sbjct: 506 QVSKTAPSGEYLTKGSFMIKGKKNYVECHKIEYGLGLLFKV 546



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 64/134 (47%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  LR RL+G    N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELRPRLVGKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEHDMDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116


>gi|117938818|gb|AAH06001.1| SDCCAG1 protein [Homo sapiens]
          Length = 398

 Score =  192 bits (488), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 148/477 (31%), Positives = 222/477 (46%), Gaps = 111/477 (23%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K                                   
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMK----------------------------------- 77

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
                  GNI+LTD E+ +L +LR   D+   V    R RYP +  R  E          
Sbjct: 78  -------GNIVLTDYEYVILNILRFRTDEADDVKFAVRERYPLDHARAAE--------PL 122

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT  +  +             V++A K  L                             L
Sbjct: 123 LTLERLTEI------------VASAPKGEL-----------------------------L 141

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           K VL   L YGPAL EH +L+ G   N+K+ E  KLE   I+ +++++ K ED+++   +
Sbjct: 142 KRVLNPLLPYGPALIEHCLLENGFSGNVKVDE--KLETKDIEKVLVSLQKAEDYMK--TT 197

Query: 301 GDIVPEGYILMQNK---HLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            +   +GYI+ + +    L  D P  +  +    Y+EF P L +Q     +++FE+FD A
Sbjct: 198 SNFSGKGYIIQKREIKPCLEADKPVEDILT----YEEFHPFLFSQHSQCPYIEFESFDKA 253

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
           +DEFYSKIE Q+ + +   +E  A  KL+ +  D ENR+  L+Q  +      ELIE NL
Sbjct: 254 VDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHENRLEALQQAQEIDKLKGELIEMNL 313

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 314 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVASAIKELKLQTNHVTMLLRN 370


>gi|347524253|ref|YP_004781823.1| hypothetical protein Pyrfu_1716 [Pyrolobus fumarii 1A]
 gi|343461135|gb|AEM39571.1| protein of unknown function DUF814 [Pyrolobus fumarii 1A]
          Length = 668

 Score =  192 bits (488), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 186/687 (27%), Positives = 298/687 (43%), Gaps = 125/687 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  M   DVA+ V+ L  L G R  N+Y++    Y+ +L  +             ++ E 
Sbjct: 5   KTSMTAFDVASVVRELEELKGARLVNIYEVFENVYLLRLRGTRDAR---------VIAEP 55

Query: 63  GVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
           G R+H T+Y    K  P    + LRKHIR  RL  V+QLG+DRIILF+F    N + +++
Sbjct: 56  GRRVHETSYDVTGKEQPPPLIMALRKHIRGERLSTVKQLGFDRIILFEFA---NGYKLVV 112

Query: 123 ELYAQGNILLTDSEFTVL--TLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           EL  +G + L D + ++L  +  R  RD                  RV +R    K    
Sbjct: 113 ELLPRGVLALLDEKGSILHASEWREMRD------------------RVIKRGVEYK---- 150

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
                 P A  P+ + ED        +E L G  G                        +
Sbjct: 151 ---QPPPAAVHPENLTED------VVRERLAGASG-----------------------EV 178

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
             VL   LGY   + E  +   G+    K + V KL  + I  +V A+       + +  
Sbjct: 179 VRVLVRKLGYPGEVVEEALFRAGI---EKTTPVEKLGASDIGAIVEAI-------RGIYR 228

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
             +   GYI+   K L     P            F P   + +  R +   E+   ALDE
Sbjct: 229 ESLEARGYIVYDEKGLVLTVVP------------FKP---SMYEGR-YRAVESISKALDE 272

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           ++ ++E  RA ++   K +    KL       E  +   +++  +  K+A L+  N   V
Sbjct: 273 YFVELEKARAVEEAVEKLEEEKGKLRAAISKTEELIREYEEKKVKLEKLALLVAENAALV 332

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
           D A+   R  +     W+ +          GN   G++D +   R  + L +  ++ E+D
Sbjct: 333 DQALECAR-RMREGSGWDYIP---------GN-CPGVVD-VEPSRGVVKLNIGGSIVEVD 380

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQIL 540
                     +  D A   +   R+  EL+KK+ S+  +T+    K  ++ E    L+I 
Sbjct: 381 ----------IRSDSARLINELYRKIGELEKKR-SRALRTLEELKKKLESLE----LEIR 425

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           +E   A     RK  W+EK++W  +S   LVI GRDA QNE +VKRY+ + ++++HAD+ 
Sbjct: 426 EEARRARARIRRK-EWYEKYHWMFTSHWLLVIGGRDASQNESVVKRYLGENNIFMHADIR 484

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGE 659
           GA + V+     E P     + +A     C+S+AW   +     +WV+  QVSK AP GE
Sbjct: 485 GAPAVVVFAGGKEPPEE--DIREAAVIAACYSRAWKEGLGAIDVYWVWGRQVSKAAPPGE 542

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           YLT G+FM+ G++N++    L +  GL
Sbjct: 543 YLTKGAFMVYGERNYIRGVELKLAIGL 569


>gi|19074389|ref|NP_585895.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi GB-M1]
 gi|19069031|emb|CAD25499.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329389|gb|AGE95661.1| hypothetical protein ECU06_1390 [Encephalitozoon cuniculi]
          Length = 648

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/343 (33%), Positives = 183/343 (53%), Gaps = 40/343 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FETF+ A  EFY +   +  +   ++K D       K+   QE  V  ++Q+ +   + A
Sbjct: 245 FETFNEAA-EFYFQSRKKFGKNDRESKVD-------KVRKRQEEYVKEMEQQGELLRRKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   +V   NR+ W D  +   +E K GN V+  I K  ++   C  
Sbjct: 297 ELLERNSKLVNRILDIFKVVKKNRIKWTDFEKFWGQENKKGNEVSKAIVKTDFMAHKCWI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           +L                 E++E+D   S  +N    Y+  KK E K  +T  +  +  K
Sbjct: 357 VLEG---------------EEIEIDFDSSLFSNISGLYQKSKKLEEKIRRTRDSLEEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K     ++ K +      R  +WFEKF++F SS+  LVI G++AQQNE++VK+++ 
Sbjct: 402 RIAPK-----IESKKIT-----RAPYWFEKFHFFFSSDGVLVIGGKNAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
            GD+Y H+D+HG+SS ++K    +      T+ +A    +C S+ W++ +V+  W+VY  
Sbjct: 452 PGDLYFHSDMHGSSSIIVKKATQK------TIEEAASMALCMSKCWEANVVSPVWYVYGD 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
           QVSKTAP+GEYL  GSFMI GKKN++  H +  G GLLFR+ E
Sbjct: 506 QVSKTAPSGEYLKKGSFMITGKKNYVECHRIEYGLGLLFRVSE 548



 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 62/134 (46%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  LR RL      N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELRPRLKEKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEYDTDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116


>gi|410670434|ref|YP_006922805.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
 gi|409169562|gb|AFV23437.1| hypothetical protein Mpsy_1229 [Methanolobus psychrophilus R15]
          Length = 664

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 171/657 (26%), Positives = 280/657 (42%), Gaps = 108/657 (16%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++E+G R H + + R     P  F + LRKHI   R+  VRQ  +DRII F    G   
Sbjct: 54  LVIEAGKRAHLSEHIRQSPKIPHSFPMLLRKHIFAGRITYVRQYDFDRIIEFGMVRGGVE 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
             ++ EL++ GNI+L DSE  ++  ++      + +     ++YP               
Sbjct: 114 TVLVAELFSPGNIVLLDSERKIILPMKPVTFKGRKIRSGEVYQYP--------------- 158

Query: 178 HAALTSSKEPDANEPDKVNEDGNNV--SNASKENLGGQKGGKSFDLSKNSNKNSNDGARA 235
            A L+  +  + +  D  +    +V  + A++ NLGG                       
Sbjct: 159 EAQLSPVEAGEKDLSDVFSSSDADVVRTIANRFNLGG----------------------- 195

Query: 236 KQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWL 295
                             L+E +    G+       +    +D + + +   V+   +  
Sbjct: 196 -----------------VLAEEVCFRAGI------DKKTAAKDMSQEGIASIVSSLRELF 232

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
             +I G++ P  YI+ +           E     Q +D   P  L      E   F +F+
Sbjct: 233 SPLIKGELSP--YIVKK-----------EIKGEVQPFD-VAPFELKTHAGLEKEVFPSFN 278

Query: 356 AALDEFYSK----IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            ALD F+ K      ++  E   K K D    +L K    QE  +    +E +R V +AE
Sbjct: 279 KALDGFFGKRSAEEVTEVVEAVKKEKVDVFERRLRK----QEEAIENFGREAERHVDVAE 334

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
            I  + + ++  I  +  A  N  SW+++  ++K  ++   P A  I  +      + L 
Sbjct: 335 KIYAHYQVIEDVIGVLEKARQNGYSWDEIKSILKGAKET-VPAAKSISSIDSATGRIVLD 393

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           L                 K  +D+ L+   NA+ +YE  KK   K+E  I A      A 
Sbjct: 394 LEGT--------------KATIDIKLTIPQNAQSYYEKAKKLTRKKEGAIRAIEDTRVAM 439

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
           +KK +     ++ V    HM+K HW+++F WF SSE +LV+ GRDA+ NE +VK+YM K 
Sbjct: 440 QKKEKKVSGNKRKV----HMKK-HWYDRFRWFYSSEGFLVVGGRDAETNEELVKKYMDKS 494

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQ 650
           DV  H    GA  T++K     +PV   TL +A  F V +S  W S   +   +WV P Q
Sbjct: 495 DVVFHTQDPGAPMTIVKAQ--GKPVTEQTLMEAAQFVVSYSSVWKSGQFSGDCYWVLPEQ 552

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           VSKT  +GEY+  G+F+IRG++N+     + M   L    +   +G  ++  R  G+
Sbjct: 553 VSKTPESGEYVKKGAFIIRGERNYFRDVQVGMAVALELGAETRVIGGPVSAVRQHGQ 609


>gi|303389736|ref|XP_003073100.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
           50506]
 gi|303302244|gb|ADM11740.1| putative RNA-binding protein [Encephalitozoon intestinalis ATCC
           50506]
          Length = 648

 Score =  189 bits (479), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 112/343 (32%), Positives = 184/343 (53%), Gaps = 40/343 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F TF+ A + F+        + + K  ++    K++K+   QEN +  ++Q+ +   K A
Sbjct: 245 FNTFNDAAEYFF--------QGRKKFGKNDRETKVDKVRKRQENYMKEMEQQGECYRKKA 296

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           EL+E N + V+  +   +V   N++ W D  +  ++E K G+ V+  I K  ++   C  
Sbjct: 297 ELLEKNADLVNRILEIFKVVRKNKVKWTDFEKFREQENKKGSEVSKAIVKTDFVSHTCWI 356

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
                          TL  E++ +D  +S   N   +Y+  KK E K  KT  +  +  K
Sbjct: 357 ---------------TLEGEEIPIDFNISLFNNVSEFYQKSKKLEEKIRKTRDSLGEVLK 401

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
               K   + +           R ++WFEKF++F SS+  LVI G+ AQQNE++VK+++ 
Sbjct: 402 KIAPKVETKKI----------TRTLYWFEKFHFFFSSDGVLVIGGKTAQQNEILVKKHLE 451

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D+Y H+D+HGASS ++K  +P +     T+ +     +C S+ W++ +V+  W+VY  
Sbjct: 452 PTDLYFHSDVHGASSIIVK--KPTEK----TIVETASMALCMSRCWETNVVSPVWYVYGE 505

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
           QVSKTAP+GEYL  GSFMI+GKKN++  H +  G GLLFR+ E
Sbjct: 506 QVSKTAPSGEYLGKGSFMIKGKKNYVDCHKIEYGLGLLFRVFE 548



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 19/134 (14%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L+ RL+G    N Y  S +    K  N           K +LL+
Sbjct: 1   MKQRYTFLDIRATVNELKPRLVGKFIQNFYTTSQRIIYIKFSN-----------KDILLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVR+H T   ++     S F   LR+  R  ++ D+ Q G+DR+++ + G       +
Sbjct: 50  EPGVRIHLT---QEHDMDISHFCKILRRKARRDKVVDIYQCGFDRVVVLELG----RQKI 102

Query: 121 ILELYAQGNILLTD 134
           + E ++ GNIL+ +
Sbjct: 103 VFEFFSGGNILIVE 116


>gi|159111661|ref|XP_001706061.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
           50803]
 gi|157434154|gb|EDO78387.1| Serologically defined colon cancer antigen 1 [Giardia lamblia ATCC
           50803]
          Length = 1063

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 95/215 (44%), Positives = 137/215 (63%), Gaps = 17/215 (7%)

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI--LQEKTVANI---SHMR 552
           +AH  A+  +E  K  E K ++T+   S  F   EKK    I  + ++T A +    H R
Sbjct: 537 TAHIIAKTLFEAAKAAEEKCKRTLGHSSAYFDKVEKKATADIDSVMKETDAELIALQHQR 596

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR- 611
              WFEKF+WF S++ YLV+SGRDAQ NE++VK++MS  D++VH++ HGA+ T++K  R 
Sbjct: 597 SPLWFEKFHWFFSTDGYLVLSGRDAQSNELLVKKFMSSNDIFVHSEAHGAACTIVKAPRL 656

Query: 612 -----PEQP-----VPPL-TLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEY 660
                P+Q      VPP+ T+ +AG FTV HS+ W  K+ T ++WVY  QVSKTAP G Y
Sbjct: 657 TTTDIPQQNTVLRWVPPVQTMLEAGAFTVIHSKMWAQKVGTQSYWVYADQVSKTAPAGMY 716

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           +  GSF+IRGK+NF+P  PL +G  LL+R D +++
Sbjct: 717 IGTGSFVIRGKRNFIPQQPLELGVALLWRYDTANV 751



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 77/150 (51%), Gaps = 12/150 (8%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE------- 54
           K+  ++ DVA   K L   L+  R ++V +LS  TY+ +   S+ V +  +++       
Sbjct: 6   KLTPSSFDVAVLAKELSAILVNTRLNSVTNLSKTTYLLRFHASTTVIDQCQTKNQTLIDT 65

Query: 55  --KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             K  +++E G  +H T +   K   P+ F+ +LR  I       V Q  +DR+I+ +F 
Sbjct: 66  YSKPSIIIEPGFYMHATRFDWSKAIPPTAFSNRLRTEICNMICTGVSQFYFDRVIILEFS 125

Query: 113 LGMN--AHYVILELYAQGNILLTDSEFTVL 140
              +    Y+I+ELY +GN++LTD  + VL
Sbjct: 126 RYNSDLKRYLIVELYGRGNLILTDEAYKVL 155


>gi|50312521|ref|XP_456296.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49645432|emb|CAG99004.1| KLLA0F27335p [Kluyveromyces lactis]
          Length = 1027

 Score =  187 bits (476), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 200/800 (25%), Positives = 365/800 (45%), Gaps = 131/800 (16%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+    K L  +++G R  N+Y++  S + ++ K     G  +S    K+ +
Sbjct: 1   MKQRLSSLDLQLISKELENQIVGFRLRNIYNIADSNRQFLLKF----GKPDS----KLNV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+HTT + R    TPS F  KLR +++ +RL  V+Q+  DRII+F F  G   +
Sbjct: 53  VIDCGLRVHTTDFTRPIPPTPSWFVSKLRSYLKEKRLTAVKQIPNDRIIVFTFADG--KY 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN+LL D++  +L L R   D            Y  ++   ++    ++++
Sbjct: 111 YLVLEFFSAGNVLLLDADQKILLLQRVVDD------------YSMKVGEFYDMANFAEIN 158

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
              TS+  PD  E  +     N +++  KE     K      +     K      +A  P
Sbjct: 159 Q--TSTTVPDPKEYFE-----NEIADWLKEADVKAKST----IVPGEAKKGKLKGKASVP 207

Query: 239 TLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
           +++ +L     + P LS  +I ++    G+ P+    E      + + VLV  ++  E  
Sbjct: 208 SIQKLL---FVHAPHLSSDLIQNSLKAIGIDPSSSCLEFK----HNVSVLVDLMSSLEVQ 260

Query: 295 LQDVISGDIVPEGYILMQNKHLG---KDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
              +IS      GYI+     L    +D P  E   S   +  F P + +    +     
Sbjct: 261 ANKLISTTSTRIGYIVAHKNKLYDPLRDKPELEYTFSN--FHPFKPFVGDSTDVKIIEIG 318

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
             ++  +D F+S IES +   + + ++  A  KL++   + E  + +L      + +   
Sbjct: 319 GMYNNTVDTFFSTIESNKYASRIQNQDFQAQKKLDEAKNNNETIIKSLLHAQQTNEEKGN 378

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSL 470
           ++  N   V+ A  AV+  L  +M W+ +  ++  E++ GN +A +I   + L  N +++
Sbjct: 379 ILIANANLVEEAKNAVKSLLDQQMDWQSMETLIANEQRKGNKIARIIKLPMDLPNNKITI 438

Query: 471 LLSNNLDEMDD------EEKTLPVEKVEVD---LALSAHANARRWYEL---KKKQESKQE 518
            L  +    DD       E      + +V+    ++S+  +   + EL   K KQ+S+++
Sbjct: 439 ELPKDGYSEDDSTEHHQSEADYSSNESDVNQSDSSVSSDYSDSDFEELTSSKSKQQSRRK 498

Query: 519 KTITAHSK------------AFKAA-----------EKKTRLQILQEKTVANI------- 548
             IT+  +            AF  A           EK+ +++   EK + +I       
Sbjct: 499 SKITSEKRETVLLTVDLSLSAFANASSYFNAKKATSEKQKKVEKNAEKALKSIQQKIEKD 558

Query: 549 -------SH-----MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
                  SH     +R  ++FEK+ WFISSE++LV+ G+   + + +  +Y++  D+ V 
Sbjct: 559 LQKKSKESHDILKAIRTPYFFEKYYWFISSESFLVLMGKSPVETDQLYAKYVNDDDIMV- 617

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
            +     + ++   + E  VPP TL QAG F    S AW  K+ +S WW +   V+K   
Sbjct: 618 TNAFDVKAWILNPQKTE--VPPNTLMQAGTFANSASDAWSKKIASSPWWCFAKNVTKFDD 675

Query: 657 T-GEYLTVGSFMIR--GKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDD 713
             G  L VGSF ++    KN LPP  L+MG GL++              +V+ E    D 
Sbjct: 676 IDGSVLPVGSFRMKQPKAKNMLPPAQLVMGLGLVW--------------KVKTE----DS 717

Query: 714 FEDSGHHKENSDIESEKDDT 733
            E  G +++NSD+E+  DDT
Sbjct: 718 EEKEGEYEQNSDLEASDDDT 737



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 25/31 (80%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG++GKLKK+++KY DQDEEER +R+  L  
Sbjct: 820 RGKRGKLKKIQKKYFDQDEEERLLRLEALGT 850


>gi|337284225|ref|YP_004623699.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
 gi|334900159|gb|AEH24427.1| hypothetical protein PYCH_07400 [Pyrococcus yayanosii CH1]
          Length = 648

 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 120/357 (33%), Positives = 195/357 (54%), Gaps = 26/357 (7%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSK--IESQRAEQQHKAKEDAAFHKLNKIHMDQ- 392
            P+ L  +   E   FETF  ALDE++ K  +E  +AE+  K +E     K  +I +++ 
Sbjct: 241 VPIELKWYDGYERKYFETFSEALDEYFGKLTVEKAKAEKTRKLEEK---RKALEISLERI 297

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
             ++   ++E  ++ ++ +LI  N   V+  +  +R A+  ++ WE+L R V+E +K GN
Sbjct: 298 REQMMAFEEEAKKNQELGDLIYANYSLVERLLEELRAAV-KKLGWEELERRVEEGKKTGN 356

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
             A +I  ++   N +++       E+D +        +++ L  S   NA  +YE  K+
Sbjct: 357 KAAEVIKGIHPSENAVTV-------EIDGK-------AIKLYLNRSLGENAELYYERAKR 402

Query: 513 QESKQEKTITAHSKA-FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
            ++K E    A+ +   K  E +  ++   +K        RK  WFEKF WFISSE +LV
Sbjct: 403 AKAKLEGARKAYEETKIKIEELERLIEEEGKKVGVKKLERRKKKWFEKFRWFISSEGFLV 462

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           I G+DA  NEM+VKR+M + D+Y HAD++GA   VIK+ R        T+ +A  F V  
Sbjct: 463 IGGKDATTNEMVVKRHMEENDIYCHADVYGAPHVVIKDGR---KAGERTIFEACQFAVSM 519

Query: 632 SQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           S+AW   + ++ A+WVYP QVSK +P GEYL  G+FM+ GK+N+    PL +  G++
Sbjct: 520 SRAWGQGLYSADAYWVYPEQVSKKSPAGEYLPKGAFMVYGKRNWFHGIPLKLAVGVV 576



 Score = 69.3 bits (168), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 78/157 (49%), Gaps = 13/157 (8%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M + D+   VK LR L+G R   VY    +  I          ++GE  K L++ E+G R
Sbjct: 5   MTSVDIRYIVKELRELVGARVDKVYHEGNEIRI-------KFHKAGEGRKDLII-EAGKR 56

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T Y ++   TP+ F + LRKH+    L  + Q  +DRI+   F      + +++EL+
Sbjct: 57  IHLTTYIKEI-PTPTSFAMLLRKHLGGAFLSGIEQHDFDRIVKLSF----RDYTLVVELF 111

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +GN++L   +  ++  LR     D+ +     +++P
Sbjct: 112 GKGNLVLVGPDGLIIAALRYEEFRDRAIKPKVEYKFP 148


>gi|308160802|gb|EFO63274.1| Serologically defined colon cancer antigen 1 [Giardia lamblia P15]
          Length = 1063

 Score =  186 bits (471), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 134/422 (31%), Positives = 203/422 (48%), Gaps = 74/422 (17%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
           +R+ +  ++E+++  LDE+ S + + RA Q        A   L       ENRV +L   
Sbjct: 327 YRAEDIREYESYNKTLDEYNSLLVTARAYQNRAQLVQKAKLTLAHAQDTTENRVASLLNS 386

Query: 403 VDRSVKMAELI-------EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV- 454
             R   +AE I       +Y  + ++      RV   + + W +   M     +A + V 
Sbjct: 387 ATRKRLLAECILWKAAEIDYLTKQMEFLFKTERVTWNDVIVWMNYGSMDVPLLEAISSVD 446

Query: 455 -------------AGLIDKLYLERNCMSLLLSNNL-------------DEMDDEEK---- 484
                        A  I  ++ E     L LS +              DE +D ++    
Sbjct: 447 VVRKVVSFNISIFASDIHDMHYEDCTPFLALSKSRATAKQEIPDLEASDETEDNDEQQGY 506

Query: 485 ------------TLPVEKVEVDLAL------SAHANARRWYELKKKQESKQEKTITAHSK 526
                       T P+  + VD+        +AH  A+  +E  K  E K ++T+   S 
Sbjct: 507 GSCENTRIMPDPTEPI-IISVDVPFKGTAGTNAHTIAKTLFEAAKAAEEKCKRTLGHSSA 565

Query: 527 AFKAAEKKTRLQI--LQEKTVANI---SHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
            F   EKK    I  + ++T A +    H R   WFEKF+WF S+  YLV+SGRDAQ NE
Sbjct: 566 YFDKVEKKATADIDSVMKETDAELIALQHQRSPLWFEKFHWFFSTNGYLVLSGRDAQSNE 625

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNHR------PEQP-----VPP-LTLNQAGCFTV 629
           ++VK++MS  D++VH++ HGA+ T++K  R      P++      VPP  T+ +AG FTV
Sbjct: 626 LLVKKFMSPNDIFVHSEAHGAACTIVKAPRLTTTDAPQENTVLRWVPPEQTMLEAGAFTV 685

Query: 630 CHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
            HS+ W  K+ T ++WVY  QVSKTAP G Y+  GSF+IRGK+NF+P  PL +G  LL+R
Sbjct: 686 IHSKMWTQKVGTQSYWVYADQVSKTAPAGMYIGTGSFVIRGKRNFIPQQPLELGVALLWR 745

Query: 690 LD 691
            D
Sbjct: 746 YD 747



 Score = 66.6 bits (161), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 77/150 (51%), Gaps = 12/150 (8%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE------- 54
           K+  ++ DVA   K L   L+  R ++V +LS  TY+ +   S+ V +  +++       
Sbjct: 6   KLTPSSFDVAVLAKELSAILVNTRLNSVTNLSKTTYLLRFHASTTVIDQCQTKNQTLIDT 65

Query: 55  --KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             K  +++E G  +H T +   K   P+ F+ +LR  I       V Q  +DR+I+ +F 
Sbjct: 66  YSKPSVIIEPGFYMHATRFDWSKAIPPTVFSNRLRTEICNMICTGVSQFYFDRVIILEFS 125

Query: 113 LGMN--AHYVILELYAQGNILLTDSEFTVL 140
              +    Y+I+ELY +GN++LTD  + VL
Sbjct: 126 RYNSELKRYLIVELYGRGNLILTDETYKVL 155


>gi|448583074|ref|ZP_21646543.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
 gi|445730031|gb|ELZ81623.1| hypothetical protein C454_08194 [Haloferax gibbonsii ATCC 33959]
          Length = 702

 Score =  185 bits (470), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 177/707 (25%), Positives = 292/707 (41%), Gaps = 130/707 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP           AS+L 
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                       +P  V+ D                      L +N  ++  D  R    
Sbjct: 165 ------------DPLTVSRDA---------------------LGRNMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+   + +++    + +A+   ++      D  Q V
Sbjct: 188 ----TLATQLNLGGLYAEELCTRAGVEKTLDIADATAEDYDAVYDAIV------DLRQQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD-EFCPLLLNQFRSREFVKFETFDAA 357
            SG+  P  Y+                G   ++ D    PL  +Q    +   ++TF+ A
Sbjct: 238 RSGEFDPRLYL----------------GDDGEVVDVTPFPLREHQNAGLDEEAYDTFNDA 281

Query: 358 LDEFYSKIESQRAEQQ---HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           LDE++ +++    EQ+   ++   +    K  +I   QE  +   +Q+ +   + AEL+ 
Sbjct: 282 LDEYFFRLDLTADEQEATSNRPDFEEEIAKQQRIIDQQEGAIEGFEQQAEDERERAELLY 341

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N + VD  +  VR A    + W+D+A  ++E  + G P A  +  +      +++    
Sbjct: 342 ANYDLVDDVLSTVRGAREEGVPWDDIAARLEEGAEQGIPEAEAVTNVDGANGTVTI---- 397

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAE 532
              E+DD   TL V       ++    NA R Y   K+ E K+E  + A   ++   AA 
Sbjct: 398 ---ELDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAV 447

Query: 533 KKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSENYLVISG 574
           KK R +   +                    + ++      HWFE+F WF +S  YLV+ G
Sbjct: 448 KKRRDEWEADDDEDDEEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYLVVGG 507

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTV 629
           R+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V
Sbjct: 508 RNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSDETLREAAQFAV 567

Query: 630 CHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 568 SYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYF 614


>gi|282165250|ref|YP_003357635.1| hypothetical protein MCP_2580 [Methanocella paludicola SANAE]
 gi|282157564|dbj|BAI62652.1| conserved hypothetical protein [Methanocella paludicola SANAE]
          Length = 666

 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/381 (30%), Positives = 198/381 (51%), Gaps = 27/381 (7%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSK--IESQRAEQQHKAKEDAAFHKLNKIHMD 391
           +  P+ L+++   + V FE+F+ ALDE+YSK  +   +AE   K  E      L +    
Sbjct: 249 DVLPIELSRYAGYQKVYFESFNKALDEYYSKHIVAEAKAEVVEKKAEKLGV--LERRLKQ 306

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           QE+ +   ++E    V+  ELI      VD  I  ++ A +  +SW+D+ +++K+ +KAG
Sbjct: 307 QEDAIAKFEKEEKEYVRKGELIYAEYGAVDDIIKVIKGARSRGISWDDIRKILKDAKKAG 366

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           NP A +I  +    N +++                P   + +++ L+   N++ +Y+  K
Sbjct: 367 NPAASMIQSVDPAANTVAV--------------KFPEATININVDLTVPQNSQTYYDKAK 412

Query: 512 KQESKQEKTITAHSKAFKA-AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           K +SK++  + A     +A A++  R +  + K  A     RK  W+EK+ WF +S+ +L
Sbjct: 413 KVQSKKDGALKAIEDTKRAMAKEMPREKPAEPKKPAVKMKPRKPKWYEKYRWFFTSDGFL 472

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI+GRDA QNE IVK+Y+ K D++ HA   GA  TV+K    E  + P  + +   F V 
Sbjct: 473 VIAGRDADQNEEIVKKYLDKKDIFFHAQAFGAPITVVKTEGRE--ITPEAIAEVAQFAVA 530

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           +S  W S   +   +WV P QVSKT  +GEY+  G+F+IRG +N++         G+  R
Sbjct: 531 YSSVWKSGQSSGDCFWVRPEQVSKTPESGEYVAKGAFIIRGDRNYVKNVEARAAVGI--R 588

Query: 690 LDESS---LGSHLNERRVRGE 707
            DE+    +G  +   + RG+
Sbjct: 589 FDETGCYVVGGPVAAVKARGK 609



 Score = 69.3 bits (168), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 44/161 (27%), Positives = 80/161 (49%), Gaps = 7/161 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ DV A V  L+ LI  +    Y  +      KL       +  ++ K  L++E
Sbjct: 1   MKEEMSSVDVYAVVMELQFLIDSKLEKAYQHTADEIRLKL-------QEFKTGKYDLILE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T + R+    P  F + LRK++   R+  + Q  +DRI+          + ++
Sbjct: 54  AGKRLHLTEHPRESPKLPPSFPMMLRKYMMGGRITRIAQHNFDRIVEIDVVRAGVMNTLV 113

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL++QGN++L D +  ++  LRS +  D+ V    ++ +P
Sbjct: 114 AELFSQGNVILLDQDRRIMMPLRSLKMKDRDVLRGEQYEFP 154


>gi|358339725|dbj|GAA47729.1| nuclear export mediator factor NEMF [Clonorchis sinensis]
          Length = 449

 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 152/486 (31%), Positives = 232/486 (47%), Gaps = 76/486 (15%)

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PSGF++KLRKHI+ ++L +V+QLG DRI+ FQFG   +  ++I+ELY +GN+ LTD  +T
Sbjct: 2   PSGFSMKLRKHIKNKKLSNVKQLGMDRIVDFQFGFDEHLFHLIIELYDRGNMCLTDHSYT 61

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           +L LLR   D ++ V   +  +YP ++     RT    L              PD +N D
Sbjct: 62  ILHLLRPRTDANQDVRYAAHEKYPLDLV----RTVPECLQGL-----------PDDINID 106

Query: 199 GNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHI 258
           G       K  LG     K     + SN+       A +P  K +L     YG    EH 
Sbjct: 107 G-----VCKRVLGLLDEAKGPWCPRGSNE-------ALKPVQK-LLSSEFSYGQPCVEHC 153

Query: 259 I----------LDTGLVPNMKLSEVNKL----EDNAIQVLV----LAVAKFEDWLQDVIS 300
                      L T    N+ + E ++L    ED A   ++    L +A +     +V  
Sbjct: 154 CRLANMAVQSTLKTSATENVPVDEEDRLRQIKEDYAKHFVMALRNLLLAAYLVGTDNVEM 213

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G  +  GYI       GK   P +   S Q  ++F P L +QFR+R  V F TF  A+D 
Sbjct: 214 G--MSRGYI------FGKKLQPEDEELSRQ--EDFQPFLFDQFRNRPHVAFPTFSKAVDT 263

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           ++SKIE  +  +     E+ A  K   I  D E R+  LK + ++ V  A+L+E N + V
Sbjct: 264 YFSKIERDKTTELLVQNENKANKKFENIKKDHELRLAALKADQEQDVHKAQLLEKNRQLV 323

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL---------L 471
           D  IL +  AL+N++ W  L  M++E R  G+ +A  I +L L++N +++         L
Sbjct: 324 DNIILMINHALSNQLDWGTLDTMIQEARARGDLLASHIVQLNLQQNQITVSLKYGFSLYL 383

Query: 472 LSNNLDEMD----------DEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTI 521
           L    D  +          D+  + P E V + L L+A  NAR++Y+ K+    K+EKT+
Sbjct: 384 LIMPRDPFESESEGENCERDQTISAPTEVV-ISLDLNALNNARKYYDRKRAALKKEEKTL 442

Query: 522 TAHSKA 527
            A  K 
Sbjct: 443 IASRKV 448


>gi|409095360|ref|ZP_11215384.1| Fibronectin-binding protein A (FbpA) [Thermococcus zilligii AN1]
          Length = 650

 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 119/361 (32%), Positives = 191/361 (52%), Gaps = 33/361 (9%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F TF  ALDE++ +I  ++A  +   K +A   +L    M QE  
Sbjct: 242 VPIELKVYGGLEKKYFSTFSEALDEYFGRITVEKARIEQTQKLEAKKKQLLTTLMMQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++ +  + ++ +LI  N   V+  +   + A   ++ WE+  + ++E +KAGN VA
Sbjct: 302 LRGFEKAMKENQELGDLIYANYPVVERLLEEFKRA-TEKLGWEEFKKRIEEGKKAGNRVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE------KVEVDLALSAHANARRWYEL 509
            ++                   E+D +EK + VE      K+ VD +L    NA  +YE 
Sbjct: 361 LMVK------------------EIDPKEKAVTVELEGKEVKLHVDRSLGE--NAELYYEN 400

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM--RKVHWFEKFNWFISSE 567
            KK   K E  + A+    +  E+  +L   + K   N+  +  RK  WFEKF WF+SSE
Sbjct: 401 AKKFRHKYEGALKAYEDTRRKIEEIEKLIEEEMKKELNVRRIEGRKKRWFEKFRWFVSSE 460

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LV++G+DA  NE +VK++M K D+Y HAD++GA   VIK+    Q     T+ +A  F
Sbjct: 461 GFLVLAGKDANTNETLVKKHMDKNDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQF 517

Query: 628 TVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            V  S+AW   + ++ A+W YP QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G+
Sbjct: 518 AVSMSRAWSQGLYSADAYWAYPEQVTKQAPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGV 577

Query: 687 L 687
           +
Sbjct: 578 V 578



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYIVRELQWLVGSRVDKVYHEGDEIRI-KLHTKEGRAD--------LVLQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PSGFT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYVKEAPKEPSGFTMLLRKHLSGGFIDAIEQHQFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+  GNI+L DSE  +++ LR     D+ +   + + +P
Sbjct: 108 GELFRSGNIVLVDSENRIISALRYEEYRDRAIKPNAEYIFP 148


>gi|349602918|gb|AEP98908.1| Serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 517

 Score =  183 bits (464), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 80/122 (65%), Positives = 98/122 (80%), Gaps = 1/122 (0%)

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQ 650
           GD+YVHADLHGA+S VIKN   E P+PP TL +AG   +C+S AWD++++TSAWWVY HQ
Sbjct: 1   GDIYVHADLHGATSCVIKNPTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQ 59

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           VSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E 
Sbjct: 60  VSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHRGERKVRVQDED 119

Query: 711 MD 712
           M+
Sbjct: 120 ME 121


>gi|240103770|ref|YP_002960079.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
           EJ3]
 gi|239911324|gb|ACS34215.1| Fibronectin-binding protein A (FbpA) [Thermococcus gammatolerans
           EJ3]
          Length = 650

 Score =  182 bits (463), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 117/360 (32%), Positives = 190/360 (52%), Gaps = 31/360 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F+TF  ALDE++ K+  ++A+ +   K ++   +L      QE  
Sbjct: 242 VPIELKIYEGLEKRYFKTFSEALDEYFGKLTIEKAKIEKTRKLESKKKQLLATLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++ ++ + ++ +LI  N   V+  +   R A   ++ WE+  R ++  +K GN VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYAMVERLLDEFRKA-TEKLGWEEFKRRIEAGKKEGNKVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EKT+ +E    KV++ L  S   NA  +YE  K
Sbjct: 361 LMVKAI------------------DPKEKTVTIELEGRKVKLYLNKSIGENAELYYEKAK 402

Query: 512 KQESKQEKTITAHS---KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K   K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WFISSE 
Sbjct: 403 KFRHKYEGALKAYEDTRRKLDEVEKLIEEEMKKELNVKRIER-RKKKWFEKFRWFISSEG 461

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LV++G+DA  NE ++K++MS  D+Y HAD++GA   VIK+    Q     T+ +A  F 
Sbjct: 462 FLVLAGKDASTNETLIKKHMSDNDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQFA 518

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V  S+AW   +  + A+W YP+QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 519 VSMSRAWSQGLYGADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGVI 578



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQWLVGSRVDKVYHDGDEIRI-KLRTKEGRAD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYVKEAPKQPSSFTMLLRKHLSGGFIDAIEQHQFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L DSE  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIVLVDSENRIVAALRYEEYKDRAIKPKAEYKFP 148


>gi|212223298|ref|YP_002306534.1| fibronectin-binding protein [Thermococcus onnurineus NA1]
 gi|212008255|gb|ACJ15637.1| predicted fibronectin-binding protein [Thermococcus onnurineus NA1]
          Length = 649

 Score =  182 bits (463), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 189/360 (52%), Gaps = 31/360 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  + + E   F TF  ALDE++ K+  ++A+ +   K +A   +L      QE  
Sbjct: 241 VPVELKVYENFEKRYFSTFSEALDEYFGKVTLEKAKIEQTKKLEAKKRQLLMTLKKQEEL 300

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   +++   + ++ +LI  N   V+  +   R A   R+ WE+  + + E +KAGN  A
Sbjct: 301 LKGFEEQAKANQEIGDLIYANFTMVERLLDEFRKA-TERLGWEEFKKRIDEGKKAGNKAA 359

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EK + +E    KV + L  S   NA  +YE  K
Sbjct: 360 LMVKSI------------------DPKEKAVTIELEGKKVRLYLNKSIGENAELYYEKAK 401

Query: 512 KQESKQEKTITAH---SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K + K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WF+SSE 
Sbjct: 402 KAKHKLEGALKAYEDTKRKLDEIEKLIEEEMKKELAVKRIER-RKKKWFEKFRWFVSSEG 460

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LV++G+DA  NE ++K++M + D+Y HAD++GA   VIK+    Q     T+ +A  F 
Sbjct: 461 FLVLAGKDASTNENLIKKHMDENDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQFA 517

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V  S+AW   + ++ A+W YP+QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 518 VSMSKAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGVI 577



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 82/161 (50%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   +Y    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQSLVGSRVDKIYHDGDEIRI-KLRTKEGRQD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHVTTYVKEAPKMPSSFTMLLRKHLSGGFIDAIEQHDFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L D E  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIILVDGENRIVAALRYEEFKDRAIKPKAEYKFP 148


>gi|253745574|gb|EET01418.1| Serologically defined colon cancer antigen 1 [Giardia intestinalis
           ATCC 50581]
          Length = 1065

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 92/216 (42%), Positives = 131/216 (60%), Gaps = 19/216 (8%)

Query: 498 SAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI------LQEKTVANISHM 551
           +AH  A   +E  K+ E K E+T+   S  F   EKK   +I         K +A + H 
Sbjct: 539 NAHTIANTLFEAAKEAEQKCERTLGHSSAYFNKVEKKATAEIDSAIKETDAKLIA-LQHQ 597

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R   WFEKF+WF S++ YLV+SGRDAQ NE++VK++MS  D++VH++ HGA+ T++K  R
Sbjct: 598 RPPLWFEKFHWFFSTDGYLVLSGRDAQSNELLVKKFMSPHDIFVHSEAHGAACTIVKAPR 657

Query: 612 PEQP-----------VPP-LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE 659
                          +PP  T+ +AG FTV HS+ W  K+   ++WVY  QVSKTAP G 
Sbjct: 658 LTTADTIQQNKILRWIPPEQTMLEAGAFTVIHSKMWAQKIGAQSYWVYADQVSKTAPPGM 717

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           Y+  GSF+IRGK+NF+P  PL +G  LL+R D +++
Sbjct: 718 YIGTGSFVIRGKRNFIPQQPLELGVALLWRYDAANV 753



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 77/150 (51%), Gaps = 12/150 (8%)

Query: 3   KVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL-- 59
           K+  ++ DVA   K L   L+  R +++ +LS  TY+ +   S+   +  +++  +L+  
Sbjct: 6   KLTPSSFDVAVLAKELSAILVNTRLNSITNLSKTTYLLRFHASTTAIDQCQTKDQMLIDT 65

Query: 60  -------MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
                  +E G  +HTT +   K   P+ F+ +LR  I       V Q  +DR+I+ +F 
Sbjct: 66  YSKPSVIIEPGFYMHTTRFDWSKAIPPTAFSNRLRTEICNLICTGVSQFYFDRVIIMEFS 125

Query: 113 LGMN--AHYVILELYAQGNILLTDSEFTVL 140
              +    Y+I+ELY +GN+LLTD  + VL
Sbjct: 126 RYNSEFKRYLIVELYGRGNLLLTDENYKVL 155


>gi|315231919|ref|YP_004072355.1| RNA-binding protein [Thermococcus barophilus MP]
 gi|315184947|gb|ADT85132.1| RNA-binding protein [Thermococcus barophilus MP]
          Length = 650

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 192/359 (53%), Gaps = 29/359 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  + + E   FETF  ALDE++ KI  ++A+ +   + +    ++      QE +
Sbjct: 242 VPIELKWYENYEKKYFETFSEALDEYFGKITVEKAKIERTKRLEEKKRQILATLRRQEEQ 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   + E+ ++ ++ +LI  N   +D  +     A+  ++ WE+  + ++E +KAGN +A
Sbjct: 302 MKGFEAEMKKNQELGDLIYANFTFIDNLLREFSKAV-EKLGWEEFKKRIEEGKKAGNKIA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EK + +E    K+++ L  S   NA  +YE  K
Sbjct: 361 LMVKSI------------------DPKEKAVTIEIEGRKIKLYLNKSIGENAEIYYEKAK 402

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRL--QILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K + K E    A+    K  ++  +L  + ++++        RK  WFEKF WFISSE +
Sbjct: 403 KAKHKLEGAKRAYEDTKKKLQEIEKLIEEEMKKELKVKKLEKRKKKWFEKFRWFISSEGF 462

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LVI G+DA  NEM+VKR+M   D+Y HAD+HGA   VIK+    Q     T+ +A  F V
Sbjct: 463 LVIGGKDATTNEMVVKRHMGDNDLYCHADVHGAPHVVIKDG---QKAGEKTIFEACQFAV 519

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
             S+AW   + ++ A+W YP+QV+K AP+GEYL  G+FM+ GK+N+    PL +  G++
Sbjct: 520 SMSKAWSEGVYSADAYWAYPNQVTKKAPSGEYLGKGAFMVYGKRNWYHGIPLKLAVGII 578



 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 83/160 (51%), Gaps = 14/160 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L G R   +Y    +  I        + ++GE  K L++ E
Sbjct: 1   MKEEMSSVDIKYIVEELKSLKGARIDKIYHDGSEIRI-------KLHKAGEGRKDLII-E 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T+Y R+    PS FT+ LRKH+     +++ Q  +DRI+  + G     + +I
Sbjct: 53  AGKRIHLTSYIREAPKMPSSFTMLLRKHLSGGFFDNIEQHDFDRIVKIRIG----NYTLI 108

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
            EL+ +GNI+L D    ++  LR     D+  AI  +H Y
Sbjct: 109 AELFRKGNIILVDENNIIIGALRYEEFKDR--AIKPKHEY 146


>gi|313215449|emb|CBY16187.1| unnamed protein product [Oikopleura dioica]
          Length = 404

 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 117/327 (35%), Positives = 171/327 (52%), Gaps = 77/327 (23%)

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           AD+HGASS ++KN  P +PV P+TL++ G   VCHS AW++K++TSAWWV+ +QVSKTAP
Sbjct: 1   ADIHGASSCIVKNIDPSKPVSPVTLHEVGHAAVCHSAAWNAKVLTSAWWVHANQVSKTAP 60

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFED 716
           +GEYL+ GSFMIRGKKN+LPP  L++GFG LF+LD++ +  H  ER+++G    ++D E+
Sbjct: 61  SGEYLSTGSFMIRGKKNYLPPSQLVLGFGFLFKLDDACVARHAGERKIKGL---VNDVEE 117

Query: 717 SGHHKENSDIESEKDDTDEKPVAESLSVPNSAHPAPSHTNASNVDSHEFPAEDKTISNGI 776
               KE S++   K++ + +P  E        +   S  + S  D  EFP          
Sbjct: 118 ----KEQSELGEIKEENENEPQLE------GENDDDSEDSDSKSDDLEFP---------- 157

Query: 777 DSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEEDKHVERTAT 836
           D+KI     N+   V  ++E++++   G G  +I                          
Sbjct: 158 DTKI-----NIKYNVDTEVEEIVNVGKGAGKKNIE------------------------- 187

Query: 837 VRDKPYISKAERRKLKKGQGSSVVDPKVEREKERGKDASSQPESIVRKTKIEGGKISRGQ 896
                     ERRK              E EK+     + Q E   +K + +  +  RG+
Sbjct: 188 ----------ERRK--------------EAEKKSRAKPAWQLEHEEQKAEKDKFRKKRGK 223

Query: 897 KGKLKKMKEKYGDQDEEERNIRMALLA 923
            GK KKMK+KYGDQDEE+R   M  L 
Sbjct: 224 AGKEKKMKQKYGDQDEEDRAAMMEFLG 250


>gi|313126151|ref|YP_004036421.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
 gi|448285991|ref|ZP_21477228.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
 gi|312292516|gb|ADQ66976.1| predicted RNA-binding protein, snRNP like protein [Halogeometricum
           borinquense DSM 11551]
 gi|445575584|gb|ELY30057.1| RNA-binding protein, snrnp like protein [Halogeometricum
           borinquense DSM 11551]
          Length = 702

 Score =  180 bits (456), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 173/712 (24%), Positives = 292/712 (41%), Gaps = 140/712 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D++A V  L R  G +    Y         ++ +        +  +V L++E 
Sbjct: 4   KRELTSVDLSALVTELNRYEGAKVDKAYLYGDNLLRLRMRDF-------DRGRVELILEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT    +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDVKRAHTAKPEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFDFERGDEDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGN+ + D    V+  L + R   + VA  +++ +P+           S+LH
Sbjct: 117 EIVVELFGQGNVAVLDETGEVVRSLETVRLKSRTVAPGAQYEFPS-----------SRLH 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                        P  V+ +G                       +    +  D  R    
Sbjct: 166 -------------PFTVSYEG---------------------FKRRMEDSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   +  G   +E      G+   M++S+     D   + +  A+  F D L+  
Sbjct: 188 ----TLATQVNLGGLYAEEFCTRAGVEKTMEISDAG---DEEYRAIYDAIQTFHDRLK-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y              TE G+         PL  ++        ++TF+ AL
Sbjct: 239 -SGDFDPRVY--------------TEDGNVVDATP--FPLKEHEAEGLNSESYDTFNEAL 281

Query: 359 DEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           DE++      ++ E +     ++   +A   K  +I   QE  +   +Q+ +R  + AEL
Sbjct: 282 DEYFFAFDRSAEDEPEEEPGSNRPDFEAEIEKKKRIIEQQEGAIEGFEQQAERERERAEL 341

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNCMSLL 471
           +  N E VD  +  VR A    + W+++ + +++  + G P A  ++D            
Sbjct: 342 LYANYELVDEVLSTVRSARDESVPWDEIRQTLEDGAERGIPAAEAVVD------------ 389

Query: 472 LSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQEKTITA-HSK 526
                  +D  E T+ +E    ++EV++ +    NA R Y+  K+ E K+E  + A    
Sbjct: 390 -------VDGAEGTVTIEIDGTRIEVEVDMGVEKNADRLYKEAKRVEGKKEGAMAAIEDT 442

Query: 527 AFKAAEKKTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSENY 569
             + AE K R    +E                  + ++I    +  W+E+F WF +S+ Y
Sbjct: 443 REELAEVKARRDAWEEDDEDDDEEPEEPEDIDWLSRSSIPLKTEEQWYEQFRWFHTSDGY 502

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-----PPLTLNQA 624
           LVI GR+A QNE IVK+Y++K D++ H   HG   TV+K   P +P      P  T  +A
Sbjct: 503 LVIGGRNADQNEEIVKKYLNKHDLFFHTQAHGGPVTVVKATGPSEPAQEVEFPDSTKREA 562

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             F V +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 563 AQFAVSYSSIWKEGRYADDAYMVTPDQVSKTPESGEYIEKGSFVIRGDRTYF 614


>gi|14520906|ref|NP_126381.1| hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
 gi|5458123|emb|CAB49612.1| Hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
 gi|380741455|tpe|CCE70089.1| TPA: hypothetical protein PAB1903 [Pyrococcus abyssi GE5]
          Length = 649

 Score =  178 bits (451), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 110/354 (31%), Positives = 187/354 (52%), Gaps = 20/354 (5%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E V FETF  ALDE++ K+  ++A+++   K +    +L      QE  
Sbjct: 242 VPIELKWYEGYERVYFETFSQALDEYFGKLTIEKAKEERTRKLEEKKKQLMATLERQERM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++E  ++ ++ +LI  N   +D  +     A+  +  W +  + ++E +K GN +A
Sbjct: 302 IKGFEEEARKNQEIGDLIYANYTIIDGILREFSKAV-EKFGWNEFKKRLEEGKKQGNKIA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            L+  +  E + +++ L                 K+++ L  S + NA  +YE  KK + 
Sbjct: 361 LLVKNVNPEEDSITIELEGR--------------KIKLYLNRSINDNAELYYEKAKKAKH 406

Query: 516 KQEKTITAHSK-AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISG 574
           K E    A+ +   K  + +  ++  ++K        RK  WFEKF WFISSE +LVI G
Sbjct: 407 KLEGAKKAYEELKRKLEQIEKEIEEEEKKIQVKKIEKRKKKWFEKFRWFISSEGFLVIGG 466

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQA 634
           +DA  NE++V++YM + D+Y HAD+ GA   +IK+    Q     T+ +A  F V  S+A
Sbjct: 467 KDATTNEIVVRKYMQENDIYCHADIWGAPHVIIKDG---QKASERTIFEACQFAVSMSRA 523

Query: 635 WDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           W   + +  A+WVYP QV K AP+GE+L  G+FM+ GK+N++   PL +  G++
Sbjct: 524 WSEGLYSGDAYWVYPEQVKKQAPSGEFLPKGAFMVYGKRNWMHGIPLKLAVGVV 577



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/162 (27%), Positives = 83/162 (51%), Gaps = 14/162 (8%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+   V+ L+  ++G R   VY    +  I        + ++GE  K L++ 
Sbjct: 1   MKEEMSSVDIRYIVQELKEEIVGARVDKVYHEGNEVRI-------KLHKAGEGRKDLII- 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T+Y ++    PS F + LRKH+    ++ + Q  +DRI+  + G       +
Sbjct: 53  EAGKRIHLTSYIKESPQ-PSSFAMLLRKHLSGSFVDGIEQHDFDRIVKIRIG----KFTI 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I EL+ +GN++L D   T++  +R     D+ +     ++YP
Sbjct: 108 IAELFRRGNVILVDENNTIIGAIRYEEFKDRAIKPKLEYKYP 149


>gi|390960715|ref|YP_006424549.1| hypothetical protein containing fibronectin-binding protein
           [Thermococcus sp. CL1]
 gi|390519023|gb|AFL94755.1| hypothetical protein containing fibronectin-binding protein
           [Thermococcus sp. CL1]
          Length = 649

 Score =  178 bits (451), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 188/356 (52%), Gaps = 23/356 (6%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F TF  ALDE++ +I  ++A+ +   K +    +L      QE  
Sbjct: 241 VPIELKIYEGLEKKYFNTFSEALDEYFGRITIEKAKIERTRKLENKKRQLLMTLRKQEEM 300

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   +  +  + ++ +LI  N   ++  +   R A   ++ WE+  + ++E +KAGN VA
Sbjct: 301 LKGFEGAMRENQEIGDLIYANYALIERLLDEFRKA-TEKLGWEEFRKRIEEGKKAGNRVA 359

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            ++  +  +   +++       E+D +       KV++ L  S   NA  +YE  KK   
Sbjct: 360 MMVKGINPKEKAVTI-------ELDGK-------KVKLYLNRSIGENAELYYEKAKKFRH 405

Query: 516 KQEKTITAH---SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
           K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WFISSE +LV+
Sbjct: 406 KHEGALKAYEDTKRKLNEVEKLIEEEMKKELNVKRIER-RKKKWFEKFRWFISSEGFLVL 464

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +G+DA  NE+++KR+M + D+Y HAD++GA   VIK+    Q     T+ +A  F V  S
Sbjct: 465 AGKDASTNEILIKRHMGENDLYCHADVYGAPHVVIKDG---QKAGERTIFEACQFAVSMS 521

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +AW   + +  A+W YP+QV+K  P+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 522 KAWSRGVYSEDAYWAYPNQVTKQTPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGVI 577



 Score = 82.8 bits (203), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 85/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQWLVGSRVDKVYHDGDEIRI-KLRTKEGRAD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q G+DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYIKEAPKQPSSFTMLLRKHLSGGFIDAIEQHGFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GN++L DSE  ++  LR     D+ +   + +RYP
Sbjct: 108 GELFRRGNVILVDSENRIVAALRYEEYKDRAIKPKAEYRYP 148


>gi|223478404|ref|YP_002582764.1| fibronectin-binding protein A domain-containing protein
           [Thermococcus sp. AM4]
 gi|214033630|gb|EEB74457.1| Fibronectin-binding protein A domain protein [Thermococcus sp. AM4]
          Length = 650

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 190/360 (52%), Gaps = 31/360 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F+TF  ALDE++ K+  ++A+ +   K +    +L      QE  
Sbjct: 242 VPIELKIYEGLEKHYFKTFSEALDEYFGKLTIEKAKIERTRKLENKKRQLLATLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++ ++ + ++ +LI  N   ++  +   R A   ++ WE+  + ++  +K GN VA
Sbjct: 302 LKGFEKAMNENQEIGDLIYANYALIERLLEEFRKA-TEKLGWEEFKKRIEAGKKEGNRVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++  +                  D +EK + +E    KV++ L  S   NA  +YE  K
Sbjct: 361 LMVKSI------------------DPKEKAVTIELEGKKVKLYLNKSIGENAELYYEKAK 402

Query: 512 KQESKQEKTITAH---SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K   K E  + A+    +     EK    ++ +E  V  I   RK  WFEKF WF+SSE 
Sbjct: 403 KFRHKYEGALKAYEDTKRKLDEVEKLIEEEMRKELNVKRIER-RKKKWFEKFRWFVSSEG 461

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LV++G+DA  NE+++K++M++ D+Y HAD++GA   VIK+    Q     T+ +A  F 
Sbjct: 462 FLVLAGKDASTNEVLIKKHMTENDLYCHADVYGAPHVVIKDG---QKAGERTIFEACQFA 518

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V  S+AW   +  + A+W YP+QV+K AP+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 519 VSMSRAWSQGLYGADAYWAYPNQVTKQAPSGEYLGKGAFMVYGKRNWLRGLPLKLAVGVI 578



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYVVRELQWLVGSRVDKVYHDGDEIRI-KLRTKEGRAD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYVKEAPKQPSSFTMLLRKHLSGGFIDAIEQHQFDRIVKIRVG----DYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L DSE  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIVLVDSENRIVAALRYEEYKDRAIKPKAEYKFP 148


>gi|300176454|emb|CBK23765.2| unnamed protein product [Blastocystis hominis]
          Length = 767

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 91/228 (39%), Positives = 149/228 (65%), Gaps = 6/228 (2%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK--AAEKKTRLQILQEKTVANI 548
           V+V+L+L+ + N    +  KK  + K +KT+ A   A    + +++T L++ +    A I
Sbjct: 151 VDVELSLNCNQNISLLFSQKKDLQDKLDKTVQAAQAAVAEASKQRQTELRVAEAAHPAEI 210

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
           +  R+  WFEKF+W ++++ ++V++G+  +QNE++V+RY+  GD+++HAD+HGA++ V++
Sbjct: 211 ARQREKRWFEKFDWCVTTDGFIVLAGKSGEQNEILVRRYLRPGDLFLHADVHGAATVVLR 270

Query: 609 NHR-PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           N+R PE P     L QA  F +CHS AWD++++   +WV   QVSKTAP+GEYL  GSFM
Sbjct: 271 NYRAPELP-GEAALLQAAAFALCHSSAWDAQLLCKVYWVPARQVSKTAPSGEYLPTGSFM 329

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFE 715
           IRGKKNFL P+ + MG  +LF +    +  H  +R+ R  E+   D+E
Sbjct: 330 IRGKKNFLAPYRMEMGLTVLFEVRPEDVQRHFYDRKPREMEDA--DWE 375


>gi|333987711|ref|YP_004520318.1| fibronectin-binding A domain-containing protein [Methanobacterium
           sp. SWAN-1]
 gi|333825855|gb|AEG18517.1| Fibronectin-binding A domain protein [Methanobacterium sp. SWAN-1]
          Length = 663

 Score =  177 bits (449), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 132/402 (32%), Positives = 193/402 (48%), Gaps = 31/402 (7%)

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIY----DEFCPLLLNQFRSREFVKF 351
           +D  S DI PE    + N       P   +    QI     D+  PL L ++   E   F
Sbjct: 204 KDKPSSDITPEELDFIHNAMSDVFSPLKTAQFHPQIISSEKDDVLPLNLTKYEKYEKKTF 263

Query: 352 ETFDAALDEFYSKIESQRAEQQHK---AKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ETF+ A DEFYS I     +Q H+   A E   F K  KI M+    +   K  + ++  
Sbjct: 264 ETFNQAADEFYSSIVGDDIKQVHEDVWAAEVGKFEKRLKIQMET---LEKFKDTIVKTKI 320

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
             E I  N +++   +  +  A   R ++  L  +   ++     V+GL           
Sbjct: 321 KGEAIYSNYQNIQNILDIIHNA---RETYSWLDIIDIIKKGKKEKVSGLD---------- 367

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
              +  +LD+M      L    V VD  +S   NA  +Y   KK + K      A  K  
Sbjct: 368 ---IIESLDKMGVLTLNLDGTIVNVDSNMSIPENAEIYYNKGKKAKRKISGVNIAIEKTM 424

Query: 529 KAAEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           K  E+ K + +I  EK +     +RK + WFEK  WF+SS+  LVI GRDA  NEMIVK+
Sbjct: 425 KEVERAKNKREIAMEKVLVPQKRVRKELKWFEKLRWFLSSDGLLVIGGRDATTNEMIVKK 484

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWW 645
           +M   D+Y H+D+HGA+S V+K    E  VP  TLN+   F    S AW +    T  +W
Sbjct: 485 HMENRDIYFHSDIHGAASVVVKAGEGE--VPESTLNETASFAGSFSSAWSAGFGSTDVYW 542

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V+P QVSKT  +GE++  G+F+IRG +NF+   PL++  G++
Sbjct: 543 VHPDQVSKTPQSGEFVGKGAFIIRGSRNFIRNAPLLVAVGIV 584



 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 65/110 (59%), Gaps = 1/110 (0%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V ++ ++G+R+HTT Y  +    P  F + LRKH++   +  V+Q  +DRI+       
Sbjct: 47  RVDVVFQAGLRVHTTQYPPENPQIPPSFPMILRKHLKGGNVTCVKQHNFDRILKINIQ-K 105

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + + +++EL+A+GNI+L D E T++  L+    +D+ ++    ++YP E
Sbjct: 106 EHKYSLVIELFAKGNIILLDEEGTIIMPLKRKLWEDRNISSKEEYKYPPE 155


>gi|57641373|ref|YP_183851.1| fibronectin-binding protein [Thermococcus kodakarensis KOD1]
 gi|57159697|dbj|BAD85627.1| predicted fibronectin-binding protein [Thermococcus kodakarensis
           KOD1]
          Length = 650

 Score =  176 bits (447), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 115/376 (30%), Positives = 191/376 (50%), Gaps = 29/376 (7%)

Query: 319 DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
           D P         +  +  P+ L  +   E   F TF  ALDE++ KI  ++A+ +   K 
Sbjct: 225 DEPKPNIVFKDGVMHDVVPIELKIYEGFEKRYFPTFSEALDEYFGKITLEKAKIEQTKKL 284

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
           +     L      QE  +   ++ +  + ++ +LI  N   ++  +   R A    + W+
Sbjct: 285 EEKKRGLMATLRKQEEMLKGFEKAMRENQEIGDLIYANYTLIERLLEEFRKA-TETLGWD 343

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVD 494
           +  R + E +K GN VA ++  +                  D +EK + +E    KV++ 
Sbjct: 344 EFRRRIDEGKKTGNKVALMVKGI------------------DPKEKAVTIELDGKKVKLY 385

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM--R 552
           L  S   NA  +YE  KK   K E  + A+    +  E+  +L   ++K   N+  +  R
Sbjct: 386 LEKSLGENAEIYYEKAKKFRHKYEGALKAYEDTKRKLEEIEKLIEEEQKKELNVKKLERR 445

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP 612
           K  WFEKF WF+SSE +LV++G+DA  NE++VK++M   D+Y HAD++GA   VIK+   
Sbjct: 446 KRKWFEKFRWFVSSEGFLVLAGKDASTNEVLVKKHMEDNDLYCHADVYGAPHVVIKDG-- 503

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
            Q     T+ +A  F V  S+AW   + ++ A+W YP+QV+K AP+GEYL  G+FM+ GK
Sbjct: 504 -QKAGEKTIFEACQFAVSMSRAWSQGLYSADAYWAYPNQVTKQAPSGEYLGKGAFMVYGK 562

Query: 672 KNFLPPHPLIMGFGLL 687
           +N++   PL +  G++
Sbjct: 563 RNWMHGLPLKLAVGVI 578



 Score = 80.1 bits (196), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   VY    +   FKL    G  +        L++E
Sbjct: 1   MKEEMSSVDIRYIVRELQWLVGSRVDKVYHDGDEVR-FKLRTKEGRAD--------LILE 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T+Y ++    PS FT+ LRKH+    ++ + Q  +DRI+  + G     + +I
Sbjct: 52  AGKRFHLTSYIKEAPKQPSSFTMLLRKHLGGGFIDAIEQHQFDRIVKIRIG----NYTLI 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GNI+L DSE  ++  LR     D+ +   + +++P
Sbjct: 108 GELFRRGNIILVDSENKIVAALRYEEYKDRAIKPKAEYKFP 148


>gi|448491980|ref|ZP_21608648.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
           19288]
 gi|445692198|gb|ELZ44379.1| Fibronectin-binding A domain protein [Halorubrum californiensis DSM
           19288]
          Length = 729

 Score =  176 bits (446), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 183/732 (25%), Positives = 303/732 (41%), Gaps = 130/732 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAADPDNVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + + +  D+ +  L  A+++ ++ L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIEEATDDQLGALHDALSRLDERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD----EFCPLLLNQFRSREFVKFETF 354
            SGD+ P  Y         ++    + G  T   D    +  P  L +      V F+TF
Sbjct: 239 -SGDVDPRVY---------EESVEGDGGDETDERDPRVVDVTPFPLAEHEGLPSVGFDTF 288

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRS 406
           +AA+DE++ ++ ++  ++     +  A          K  +I   Q   +   +++    
Sbjct: 289 NAAVDEYFYRLGNEETDEGEAPADAGASRPDFEEEIAKQERIIEQQLGAIEGFEEQAQAE 348

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
            + AEL+  + + VD  +  VR A  N + W+++A  +    + G P A  +    ++ +
Sbjct: 349 RERAELLYAHYDLVDEVLSTVREARENEVPWDEIAATLDAGAERGIPAAAAV----VDVD 404

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ---EKTITA 523
                ++  LDE  D E T+    VE+D +     NA R Y   K+ E K+   ++ I +
Sbjct: 405 GGEGTVTVELDEEGDGEGTV---TVELDASEGVEVNADRLYREAKRVEEKKAGAKEAIES 461

Query: 524 HSKAFKAA-EKKTRLQILQEK----------------------TVANISHMRKVHWFEKF 560
             +  +A  E+K   +  Q                        + ++I       WFE+F
Sbjct: 462 TREELEAVKERKAEWEEQQAADDGSGGDDGGEDDEEEYETDWLSRSSIPIRSPDDWFERF 521

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL- 619
            WF +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+ 
Sbjct: 522 RWFRTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTILKASGPSESADPVD 581

Query: 620 ----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
               TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + +
Sbjct: 582 FSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTY 641

Query: 675 LPPHPLIMGFGL 686
               P  +  G+
Sbjct: 642 FEDVPCRIAVGV 653


>gi|448565126|ref|ZP_21636097.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
 gi|445715785|gb|ELZ67538.1| hypothetical protein C457_11862 [Haloferax prahovense DSM 18310]
          Length = 702

 Score =  176 bits (446), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 175/706 (24%), Positives = 287/706 (40%), Gaps = 128/706 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP           AS+L 
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                       +P  V+ D                      L +N  ++  D  R    
Sbjct: 165 ------------DPLTVSRDA---------------------LGRNMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+   + +++    + +A+   ++      D  Q V
Sbjct: 188 ----TLATQLNLGGLYAEELCTRAGVEKTLDIADATADDYDAVYDAIV------DLRQQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG+  P  Y+              E G    +     PL  +Q    +   ++TF+ AL
Sbjct: 238 RSGEFDPRLYL-------------DEDGEVVDVTP--FPLREHQNAGLDEEAYDTFNDAL 282

Query: 359 DEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           DE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     + AEL+  
Sbjct: 283 DEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEQQAQDERERAELLYA 342

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N + VD  +  VR A    + W+D+   + E  + G P A  +  +      +++     
Sbjct: 343 NYDLVDDVLSTVRGAREEGVPWDDIGETLAEGAEQGIPEAEAVTNVDGANGTVTV----- 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
             ++DD   TL V       ++    NA R Y   K+ E K+E  + A   ++   AA K
Sbjct: 398 --DLDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDTREELAAVK 448

Query: 534 KTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSENYLVISGR 575
           K R +   +                    + ++      HWFE+F WF +S  YLV+ GR
Sbjct: 449 KRRDEWEADDDEDDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSGYLVVGGR 508

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTVC 630
           +A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V 
Sbjct: 509 NADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLREAAQFAVS 568

Query: 631 HSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 569 YSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYF 614


>gi|448528898|ref|ZP_21620278.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
           700873]
 gi|445710346|gb|ELZ62165.1| Fibronectin-binding A domain protein [Halorubrum hochstenium ATCC
           700873]
          Length = 740

 Score =  176 bits (445), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 184/736 (25%), Positives = 297/736 (40%), Gaps = 127/736 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAADPDHVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP            S+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLSTVRLKSRTVAPGSQYEYP-----------GSRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                              D  +VS          +GG      ++  ++ +D  R    
Sbjct: 165 -------------------DPLDVS----------RGG----FERHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+     + E     D+ ++ L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKETPIEEAT---DDQLRALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGY---ILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
            SGDI P  Y   I           P        ++ D   P  L +      V F++F+
Sbjct: 239 -SGDIDPRVYEESIDGDGNADDDADP--------RVVD-VTPFPLAEHEDLPSVGFDSFN 288

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSV 407
           AA+DE++ ++ S+ AE      + +A          K  +I   QE  +   +++     
Sbjct: 289 AAVDEYFYRLGSEDAEAGDAPADASASRPDFEGEIAKQQRIIEQQEGAIEGFEEQAQAER 348

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERN 466
           + AEL+  N + VD  +  VR A  + + W+++   +    + G P A  ++D    E  
Sbjct: 349 ERAELLYANYDLVDEVLSTVREARESEVPWDEIEETLDAGAERGIPAAEAVVDVDGGEGT 408

Query: 467 CMSLLLSNNLDEMDDEE-KTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
               L   + D+ DDE        ++E+D +     NA R Y+  K+ E K+E  + A  
Sbjct: 409 VTVELADESGDDADDEGGANGGTTRIELDASEGVEVNADRLYQEAKRVEEKKEGAMAAIE 468

Query: 526 KAFKAAEK-KTRLQILQEKTVAN----------------------------ISHMRKVHW 556
              +  E  K R    +E+  AN                            I      +W
Sbjct: 469 STREELEAVKERKAEWEEQQAANDGSGQGDDGDDGADDEEEYETDWLSRASIPIRSPDNW 528

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV 616
           +++F WF +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +  
Sbjct: 529 YDRFRWFHTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTILKASGPSESA 588

Query: 617 PPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG
Sbjct: 589 DPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVDPDQVSKTPESGEYIEKGSFVIRG 648

Query: 671 KKNFLPPHPLIMGFGL 686
            + +    P  +  G+
Sbjct: 649 DRTYFEDVPCRIAVGV 664


>gi|76156824|gb|AAX27946.2| SJCHGC07203 protein [Schistosoma japonicum]
          Length = 184

 Score =  176 bits (445), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 90/180 (50%), Positives = 116/180 (64%), Gaps = 19/180 (10%)

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           ++  K+A  K    +   KT+A I+ +RK  WFEKF WFISSENYLV++G D+QQNE++V
Sbjct: 5   AQILKSAIHKAEATMKTAKTIAQITEVRKPMWFEKFFWFISSENYLVVAGHDSQQNEVLV 64

Query: 585 KRYMSKGDVYVHADLHGASSTVIKN-------------------HRPEQPVPPLTLNQAG 625
           KRY+  GD++VHAD+HGAS+ +IK                    HR     PP TL +A 
Sbjct: 65  KRYLKSGDIFVHADIHGASTVIIKARHLTSEESDFSKHESLLHLHRSLPLPPPKTLLEAA 124

Query: 626 CFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
              V  S AW + ++T AWWV+  QVSKTAP+GEYLT GSF+IRGKKN+LPP P   GFG
Sbjct: 125 NMAVVLSSAWQNHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFG 184


>gi|13542268|ref|NP_111956.1| RNA-binding protein snRNP [Thermoplasma volcanium GSS1]
 gi|14325702|dbj|BAB60605.1| hypothetical protein [Thermoplasma volcanium GSS1]
          Length = 604

 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 138/200 (69%), Gaps = 12/200 (6%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           E +++D   SA  NA R+++L K    K    I    KA + AE++ R++ LQEK V ++
Sbjct: 343 EDIDIDYTKSAGENANRYFDLSKDYRKK----IEGAKKAIEEAEQE-RIK-LQEKKVKSV 396

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
           +  R++ WFE ++WFISSE YLVI+GRDA+ NE IVK+++ +GD+YVHAD++GA ST+IK
Sbjct: 397 N--RRIFWFETYHWFISSEGYLVIAGRDAKSNEKIVKKHLKEGDLYVHADMYGAPSTIIK 454

Query: 609 NHRPEQPVPPL-TLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           +    +P+P   T+ QA  F +C S+AW + + + +A+WVYP QVSKT  +GEY++ GS+
Sbjct: 455 SE--GKPMPGEDTIRQAAAFAICFSRAWPAGIASGTAYWVYPSQVSKTPESGEYVSTGSW 512

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
           +IRGK+N++    L +  GL
Sbjct: 513 IIRGKRNYVTNLKLELCIGL 532



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 58/123 (47%), Gaps = 12/123 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           RL+G     VY   P  ++ +L  S       +   +L+ ++ G+   +     +  +T 
Sbjct: 20  RLVGSFVKKVYQTGPDDFLIQLYRSDL-----KRFDMLVSLKKGIFFKS----EETPDTA 70

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           S   + LRK I  RR+  V Q+ +DR++ F F  G     +ILEL+  GN++ TD +   
Sbjct: 71  SQTAMVLRKTISDRRIVSVEQVNFDRVVKFVFHTG---QALILELFRDGNLIATDGDKIT 127

Query: 140 LTL 142
             L
Sbjct: 128 FVL 130


>gi|389852774|ref|YP_006355008.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
 gi|388250080|gb|AFK22933.1| hypothetical protein Py04_1359 [Pyrococcus sp. ST04]
          Length = 642

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 117/356 (32%), Positives = 194/356 (54%), Gaps = 26/356 (7%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQH-KAKEDAAFHKLNKIHMDQ-E 393
            P+ L  + + E V +++F  ALDE++ K+  ++A+++  KA E+    K  +I + + E
Sbjct: 237 LPVDLVWYSNYEKVFYDSFSKALDEYFGKLTIEKAKRERTKALEEK--RKALEISLKRIE 294

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
            ++   ++E   + +  +L+  N   V   +  +R  +   +  E++ + ++E +K G P
Sbjct: 295 EQIRGFEKEAQENQERGDLLYANYTLVKEILETIRRGIKT-LGVEEVVKRIEEAKKKGYP 353

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
            A +I K+  +    SL++             L  +K+++D+  +   NA  +YE  KK 
Sbjct: 354 WANIISKVSKD----SLVIE------------LEGKKIKLDINKTLEENAEIFYEKAKKA 397

Query: 514 ESKQEKTITAHSKAFKAAEKKTRLQILQEKTVA-NISHMRKVHWFEKFNWFISSENYLVI 572
             K E    A+ +  K  E   +  + +EK +A      R+  WFEKF WFISSE +LVI
Sbjct: 398 RQKLEGARKAYEETKKKIENIEQEIMEEEKKIAVKKLEKRRKKWFEKFRWFISSEGFLVI 457

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
            G+DA  NE++VKR+MS+ D+Y HAD+ GA   VIK  R        T+ +A  F V  S
Sbjct: 458 GGKDATTNEIVVKRHMSENDLYCHADIWGAPHVVIKEGR---KASEKTIFEACQFAVSMS 514

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +AW   + ++ A+WVYP QVSK AP GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 515 RAWSEGLASADAYWVYPEQVSKQAPAGEYLPKGAFMVYGKRNWLHGIPLKLAVGII 570



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 86/157 (54%), Gaps = 13/157 (8%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M++ D+   V+ L+ +IG R   VY    +  I        + ++GE  +V LL+E+G R
Sbjct: 1   MSSVDIKYVVEELQNIIGSRVDKVYHQDNELRI-------KLHKAGEG-RVDLLIEAGKR 52

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T+Y ++    P+ F + LRK++  + L  + Q  +DRI++ +FG     + +I EL+
Sbjct: 53  IHVTSYIKENLQ-PTAFAMLLRKNLSGKFLTKIEQREFDRIVILEFG----EYKLIAELF 107

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +GNI+L D ++ ++  LR     D+ +     +++P
Sbjct: 108 GKGNIILVDKDWKIIGALRYEEFRDRAIKPKIHYQFP 144


>gi|383318475|ref|YP_005379316.1| RNA-binding protein, eukaryotic snRNP-like protein [Methanocella
           conradii HZ254]
 gi|379319845|gb|AFC98797.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Methanocella conradii HZ254]
          Length = 662

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 185/359 (51%), Gaps = 22/359 (6%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L+++ S + V FE+F+ ALDE++S+  +  A+ +   ++        +    QE  
Sbjct: 251 LPIELSRYSSHQKVYFESFNQALDEYFSRHVAAEAKAEVVERKAEKLGVYERRLRQQEEA 310

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++E   +V+  E I      +   I  +R A A   SW+D+ +++++ RKAGN  A
Sbjct: 311 IAKFEREEAENVRKGEAIYAEYNTISEVIGVIRGARAKGYSWDDIRKILRDARKAGNKAA 370

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            LI  +    N +++ LS+                V V++ L+   NA+ +Y+  KK   
Sbjct: 371 SLIQSVDPAANTVNVKLSSV--------------SVNVNIDLTVPQNAQAYYDKAKKARL 416

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGR 575
           K+E  + A  +  KA  K+T     +    A   H RK  W+EK+ WF +S+ +LVI GR
Sbjct: 417 KKEGALKAIEETKKAMAKETPAPPREPSAKA---HPRKPRWYEKYRWFYTSDGFLVIGGR 473

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW 635
           DA QNE +VK+YM K DV+ HA   GA  T++K     + V P  L +A  F V +S  W
Sbjct: 474 DADQNEELVKKYMEKSDVFFHAQAFGAPITIVKAG--GRDVTPAALAEAAQFAVSYSSVW 531

Query: 636 DSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
            S   +   +WV P QVSKT   GEY+  G+F+IRG +N++    +    G+  R DE+
Sbjct: 532 KSGQYSGDCFWVRPEQVSKTPEHGEYVAKGAFIIRGDRNYVKNVEVRAAVGI--RFDET 588



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 81/161 (50%), Gaps = 7/161 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ DV A V+ L+ L+  +    Y  +      +L       +  ++ K  L++E
Sbjct: 1   MKEEMSSVDVYAAVRELQFLVDAKVEKAYQHTADEIRIRL-------QEFKTGKYDLVIE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T + R+    P  F + LRKH+   R+  + Q  +DRI+  +         ++
Sbjct: 54  AGKRLHLTRHPRESPKLPPSFPMMLRKHMMGGRITRIAQHNFDRIVEIEVARAGVKSTLV 113

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+AQGN++L D E  ++  LRS +  D+ V    ++ YP
Sbjct: 114 AELFAQGNVILLDGERRIMMPLRSMKMKDRDVVRGEQYEYP 154


>gi|386003039|ref|YP_005921338.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
 gi|357211095|gb|AET65715.1| hypothetical protein Mhar_2365 [Methanosaeta harundinacea 6Ac]
          Length = 668

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 180/706 (25%), Positives = 296/706 (41%), Gaps = 132/706 (18%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M+  DVAA V+ L+ +L+G      Y LSP   +          +S  S K+ LL+
Sbjct: 1   MKKAMSNVDVAAVVEELQEKLVGGFVGKSYQLSPDRVVISF-------QSPASGKLDLLL 53

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T   R+    P  F   LR  +   R+  VRQ G+DR+   +   G + + +
Sbjct: 54  EAGRRIHLTEKPREAPKMPPQFPTMLRSRLSGGRVAAVRQHGFDRVAEIEIERGDDRYTL 113

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I E++ +GN+LL DS   ++  LR     D+                        KL A 
Sbjct: 114 IAEIFPKGNVLLLDSGGRIVLPLRPLAFRDR------------------------KLLAG 149

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
            T     D  +P  V          S+ +L       +F L+ + ++            L
Sbjct: 150 ETYQYREDQVDPRTV----------SRNDL-------AFILASSDSE------------L 180

Query: 241 KTVLGEALGYGPALSEHIILDTGL---VPNMKLS--EVNKLEDNAIQVLVLAVAKFEDWL 295
              L   L  G   +E I L  G+   VP   L+  E+++L     +V  LA    E + 
Sbjct: 181 VRTLVRGLNMGGTYAEEICLRAGINKTVPAFALAGEEIDRLHWALGEVFGLA----EAYP 236

Query: 296 QDVISG----DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKF 351
             V  G    D+VP                     +   +YD             E  +F
Sbjct: 237 HLVAEGERIVDVVP---------------------APLAVYDGL-----------ERREF 264

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
            +F  ALDEF+S  E++  E    AK   A  +  ++   QE  +   ++      ++ E
Sbjct: 265 GSFSEALDEFFSSKEAEAEE----AKPKTALERRREM---QERSIQEFRERERELARLGE 317

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
            +     +V+A + A+        ++ ++   +K    +G P+A  I  L  +      L
Sbjct: 318 KVYERYGEVEAVLAAISKGFERGFTYSEILAKIK---TSGLPIAEKILALDYQGELRLRL 374

Query: 472 LSNNLDEMDDEEKTLPVEK----------VEVDLALSAHANARRWYELKKKQESKQEKTI 521
                 +  + +     +           +E++  L+   NA+R+Y+L K+Q  K+E   
Sbjct: 375 DDPGDGDGGEGKGGTVGDTGGKGEARGAVLELNSNLTVPQNAQRYYDLAKEQAKKREGAE 434

Query: 522 TAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNE 581
            A  +  +   +K   +  + KT A +   RK  W+E+F WF SS+ +LVI GRDA  NE
Sbjct: 435 KALEETIRLIARKAGPE--KAKTRA-VYRRRKPKWYERFRWFTSSDGFLVIGGRDATSNE 491

Query: 582 MIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT 641
            I  +Y+ K D+ +H D  GA  TVIK     + VP  TL +A  F V +S  W + +  
Sbjct: 492 EIYAKYLEKRDLALHTDAPGAPLTVIKTL--GEAVPESTLEEAASFAVSYSSLWKAGLFE 549

Query: 642 S-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
              + V   QV+KT   GE+L  G+F++RG++ +    PL +  G+
Sbjct: 550 GDCYLVAADQVTKTPEPGEFLKKGAFVVRGERRYYRDVPLGLALGI 595


>gi|410722235|ref|ZP_11361543.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
           sp. Maddingley MBC34]
 gi|410597380|gb|EKQ52002.1| putative RNA-binding protein, snRNP like protein [Methanobacterium
           sp. Maddingley MBC34]
          Length = 742

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 113/364 (31%), Positives = 186/364 (51%), Gaps = 25/364 (6%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSK---IESQRAEQQHKAKEDAAFHKLN 386
           ++ ++  PL +  +++    +F+TF+ A DEFYS     + ++ ++   AKE   + K  
Sbjct: 328 KVKEDVLPLDILTYQNFHKERFDTFNQAADEFYSGKVGADIKKVQEDIWAKEVGKYEKRL 387

Query: 387 KIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKE 446
           +I   QE  +   ++ +  + K   L+  +  ++   +  +  A   + SW ++A   K+
Sbjct: 388 RI---QEETLEKFQKTIVETKKKGNLLYSHYSEIQDLLDIIHQA-REKFSWMEIASKFKK 443

Query: 447 ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRW 506
            RK G   A +I+ +    + M +L  N           L  E+V VD  L    NA ++
Sbjct: 444 ARKEGMKEAQIIESM----DKMGVLTLN-----------LEGERVTVDANLEIPENAEKY 488

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKK--TRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           Y   KK + K      A  +  K  E+K   R   L+   V      +++ WFEK  WF+
Sbjct: 489 YNKGKKAKRKIRGVNIAIERTKKDVERKRNKREMALERVRVPQKRVRKELKWFEKLRWFL 548

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           SS+ YLVI GRDA  NEM+VKR++   D+Y+H+D+HGA S VIK    E  +P  T+ +A
Sbjct: 549 SSDGYLVIGGRDAGTNEMVVKRHLDNQDIYLHSDIHGAPSVVIKKGEVEGEIPESTVQEA 608

Query: 625 GCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           G      S AW     +   +WV+P QVSKT  +GE++  G+F+IRG +N+L   PL + 
Sbjct: 609 GTLAASFSSAWSKGYGSQDVYWVHPDQVSKTPQSGEFVARGAFIIRGSRNYLRGIPLKIA 668

Query: 684 FGLL 687
            G++
Sbjct: 669 VGIV 672



 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 62/111 (55%), Gaps = 3/111 (2%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V +  ++G+R+HTT Y  +    P  F + LRKH++   ++ VRQ  +DRI+  +  + 
Sbjct: 47  RVDVAFQAGLRVHTTQYPPENPKVPPSFPMLLRKHLKNATVKGVRQHNFDRIL--EIDIQ 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
               + +++EL++QGNI+L D +  ++  L+      + +     ++YP E
Sbjct: 105 KEHRFTLVVELFSQGNIILLDEDNQIILPLKHRHAQGRKITSKEEYQYPEE 155


>gi|448474105|ref|ZP_21602073.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
           13560]
 gi|445818385|gb|EMA68244.1| Fibronectin-binding A domain protein [Halorubrum aidingense JCM
           13560]
          Length = 731

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 180/734 (24%), Positives = 303/734 (41%), Gaps = 132/734 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYAGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHVADPEHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  +++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLSTVRLKSRTVAPGAQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+   + + E     D+ ++ L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKTLPVDEAT---DDQLRALHEALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSST----QIYDEFCPLLLNQFRSREFVKFETF 354
            SGDI P  Y    +   G++    ++GS T    ++ D   P  L++      V F++F
Sbjct: 239 -SGDIDPRVYEEALDGD-GEEDGNGDAGSDTDRDPRVVD-VTPFPLSEHEGLPSVGFDSF 295

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRS 406
           +AA+DE++ ++E +  +      + +A          K  +I   Q   +    ++  + 
Sbjct: 296 NAAVDEYFYRLEHEDTDAGEAPADASASRPDFEEEIAKQERIIEQQRGAIEGFDEQAAQE 355

Query: 407 VKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            + AEL+  EY+L  VD  +  VR A AN + W+++A  +    + G P A  +  +   
Sbjct: 356 RERAELLYAEYDL--VDEVLSTVRDARANDVPWDEIADTLAAGAERGIPAAEAVVDVDGS 413

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ---EKTI 521
              +++ L ++              +VE+D       NA R Y+  K+ E K+   E+ I
Sbjct: 414 DGTVTVELGDD------------GTRVEIDTGAGVEVNADRLYQEAKRIEDKKAGAEQAI 461

Query: 522 TAHSKAFKAA-EKKTRLQILQEK----------------------TVANISHMRKVHWFE 558
            +     +A  E+K      Q                        + ++I   R   W+E
Sbjct: 462 ESTRAELEAVKERKAEWAAQQAAADDDQSDSEEDDDEEEHEIDWLSRSSIPIRRPEDWYE 521

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           +F WF ++  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   P +   P
Sbjct: 522 RFRWFHTASGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAGPSESADP 581

Query: 619 L-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
           +     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG +
Sbjct: 582 VDFSEQTLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDR 641

Query: 673 NFLPPHPLIMGFGL 686
            +    P  +  G+
Sbjct: 642 TYFEDVPCRVAVGV 655


>gi|375084281|ref|ZP_09731287.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
 gi|374741041|gb|EHR77473.1| fibronectin-binding protein [Thermococcus litoralis DSM 5473]
          Length = 650

 Score =  172 bits (436), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 188/359 (52%), Gaps = 29/359 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   FETF  ALDE++ KI  + A+ +   K       L      QE  
Sbjct: 242 LPIELKWYEGYEKKFFETFSEALDEYFGKILIESAKIERTKKLQDKKRGLEVTLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++++  + ++ +LI  N   V+  +  +  A+  ++ WE+  + ++E RK+GN VA
Sbjct: 302 IKGFERQMQENQEIGDLIYANFTFVENLLKELSKAV-EKLGWEEFKKRIEEGRKSGNKVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            +I  +                  D +EK + VE    KV++ L  S   NA  +YE  K
Sbjct: 361 QIIKGI------------------DPKEKAVTVELEGKKVKLYLNKSIGENAEIYYEKAK 402

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH--WFEKFNWFISSENY 569
           K + K E    A+    K  ++  +L   +EK   ++  + K    WFEKF WF+SSE +
Sbjct: 403 KAKHKLEGARKAYEDTLKKIQEIEKLIEEEEKKELSVKKLEKRKKKWFEKFRWFVSSEGF 462

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LVI G+DA  NE++VKR+MS+ D+Y HAD++GA   VIK+ +        T+ +A  F V
Sbjct: 463 LVIGGKDATTNEIVVKRHMSENDLYCHADIYGAPHVVIKDGK---KAGEKTIFEACQFAV 519

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
             S+AW   + +  A+W  P QV+K AP+GEYL  G+FM+ GK+N++   P+ +  G++
Sbjct: 520 SMSRAWKDGIYSGDAYWADPSQVTKKAPSGEYLGKGAFMVYGKRNWMHGLPVKLAIGIV 578



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 85/166 (51%), Gaps = 14/166 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L G R   +Y    +  I        +  +GE  K L++ E
Sbjct: 1   MKQEMSSVDIKYIVEELKSLEGARVDKIYHDGDQIRI-------KLHIAGEGRKDLII-E 52

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T Y ++    PS FT+ LRK++   RLE + Q  +DRI+  + G     + +I
Sbjct: 53  AGRRIHLTTYIKEAPQQPSSFTMLLRKYLSGLRLEKIEQHDFDRIVKLKIG----EYTLI 108

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICR 167
            EL+ +GN++L D +  +++ +R     D+  AI  +H Y     R
Sbjct: 109 AELFKRGNVILVDKDNVIISAMRHEEFKDR--AIKPKHEYKIPPAR 152


>gi|448612034|ref|ZP_21662464.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
 gi|445742795|gb|ELZ94289.1| hypothetical protein C440_11728 [Haloferax mucosum ATCC BAA-1512]
          Length = 701

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 167/716 (23%), Positives = 285/716 (39%), Gaps = 127/716 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  + R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTEMNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L                    E  R+  RT A    
Sbjct: 117 KIVVELFGQGNIAILDETGEVVRSL--------------------ETVRLKSRTVAPGSQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+     +P  ++ D                      L ++  ++  D  R    
Sbjct: 157 YEYPSSR----LDPLTISRDA---------------------LGRHMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                +   L  G   +E +    G+   + + +  + + +AI   ++ +       Q V
Sbjct: 188 ----TIATQLNLGGLYAEELCTRAGVEKTLDIEDATEDDYDAIYDAIVNLR------QQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG+  P  Y+    + +  D  P              PL  +Q    +   +ETF+ AL
Sbjct: 238 RSGEFDPRLYLADDGEVV--DVTP-------------FPLQEHQNAGLDEEAYETFNEAL 282

Query: 359 DEFYSKIESQRAEQQ---HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           DE++ +++    EQ+   ++   +    K  +I   QE  +    Q+ D   + AEL+  
Sbjct: 283 DEYFFRLDLTADEQEATSNRPDFEEQIAKQERIIEQQEQAIEGFDQQADEERERAELLYA 342

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N +  D  +  VR A    + W+++A  + E    G P A  +  +      +++ L   
Sbjct: 343 NYDLADDVLSTVRDAREQGVPWDEIAVTLDEGADQGIPAAEAVTNVDSANGTVTVELDGT 402

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
                          V +D+++    NA R Y   K+ + K+E  + A   ++    A K
Sbjct: 403 --------------SVTLDVSMGVEKNADRLYTEAKRIQEKKEGALAAIEDTREELEAAK 448

Query: 534 KTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           + R +   +                  ++ ++      HWFE+F WF +S  YLV+ GR+
Sbjct: 449 RRRDEWEADDGGGDADEDDEPEETDWLSLESVPVKSTEHWFERFRWFYTSSGYLVVGGRN 508

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTVCH 631
           A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V +
Sbjct: 509 ADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQKVDFSEETLREAAQFAVAY 568

Query: 632 SQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + +    P  +  G+
Sbjct: 569 SSIWKEGRFADDAYMVEPSQVSKTPESGEYIDKGSFVIRGDRRYFEDVPAKVAVGI 624


>gi|16082623|ref|NP_394872.1| RNA-binding protein snRNP [Thermoplasma acidophilum DSM 1728]
          Length = 601

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/197 (45%), Positives = 126/197 (63%), Gaps = 10/197 (5%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           VE+D  +SA  NA R++   K    K E  +    KA + AEK    Q L E   A    
Sbjct: 344 VEIDYTVSAGENANRYFSQAKDYRRKIEGAM----KAIEEAEK----QRLTEMQKAEKKK 395

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            RKV WFE ++WFISSE YLVI+GRDA+ NE IVK+++ +GD+YVHAD++GA ST+IK+ 
Sbjct: 396 RRKVFWFETYHWFISSEGYLVIAGRDAKSNEKIVKKHLQEGDIYVHADMYGAPSTIIKSS 455

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
             +QP    TL +A  F V  S+AW + + + +A+WVYP QVSKT  +GEY+  GS++IR
Sbjct: 456 -GKQPPGEATLREAASFAVSFSRAWPAGIASGTAYWVYPSQVSKTPESGEYVATGSWIIR 514

Query: 670 GKKNFLPPHPLIMGFGL 686
           GK+N++    L +  G+
Sbjct: 515 GKRNYITDLKLELCIGM 531



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 86/175 (49%), Gaps = 18/175 (10%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + ++ D  A V   R R +G     VY + P  ++ ++  S       +   VL+ +
Sbjct: 1   MKDKESSIDFYAFVNIYRDRFVGSFVKKVYQVGPDDFMVQIYRSDI-----KRMDVLISL 55

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           + G+   T     +   T +   + LRK I  RR+  +RQ+ +DR++ F F  G     +
Sbjct: 56  KHGIFFKTV----ETPETATQTAMVLRKTISDRRIVGIRQINFDRVVEFTFHTGQK---L 108

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTAS 175
           ILEL+ +GN++ TD +  +  +LR  +  ++ + +   ++ P+     F+ ++AS
Sbjct: 109 ILELFREGNLIATDGD-RITFVLRPRKWKNRDLEVGGTYQPPSS----FDPSSAS 158


>gi|10640760|emb|CAC12538.1| conserved hypothetical protein [Thermoplasma acidophilum]
          Length = 588

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/197 (45%), Positives = 126/197 (63%), Gaps = 10/197 (5%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISH 550
           VE+D  +SA  NA R++   K    K E  +    KA + AEK    Q L E   A    
Sbjct: 331 VEIDYTVSAGENANRYFSQAKDYRRKIEGAM----KAIEEAEK----QRLTEMQKAEKKK 382

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            RKV WFE ++WFISSE YLVI+GRDA+ NE IVK+++ +GD+YVHAD++GA ST+IK+ 
Sbjct: 383 RRKVFWFETYHWFISSEGYLVIAGRDAKSNEKIVKKHLQEGDIYVHADMYGAPSTIIKSS 442

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
             +QP    TL +A  F V  S+AW + + + +A+WVYP QVSKT  +GEY+  GS++IR
Sbjct: 443 -GKQPPGEATLREAASFAVSFSRAWPAGIASGTAYWVYPSQVSKTPESGEYVATGSWIIR 501

Query: 670 GKKNFLPPHPLIMGFGL 686
           GK+N++    L +  G+
Sbjct: 502 GKRNYITDLKLELCIGM 518



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 77/156 (49%), Gaps = 17/156 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           R +G     VY + P  ++ ++  S       +   VL+ ++ G+   T     +   T 
Sbjct: 7   RFVGSFVKKVYQVGPDDFMVQIYRSDI-----KRMDVLISLKHGIFFKTV----ETPETA 57

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           +   + LRK I  RR+  +RQ+ +DR++ F F  G     +ILEL+ +GN++ TD +  +
Sbjct: 58  TQTAMVLRKTISDRRIVGIRQINFDRVVEFTFHTGQK---LILELFREGNLIATDGD-RI 113

Query: 140 LTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTAS 175
             +LR  +  ++ + +   ++ P+     F+ ++AS
Sbjct: 114 TFVLRPRKWKNRDLEVGGTYQPPSS----FDPSSAS 145


>gi|429961918|gb|ELA41462.1| hypothetical protein VICG_01446 [Vittaforma corneae ATCC 50505]
          Length = 351

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/265 (36%), Positives = 151/265 (56%), Gaps = 31/265 (11%)

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL-ERNCMSLLLSNNLDEMDDEEKTLPV 488
               +M W       +EE++ GNP A  I    L ER C+ L+                 
Sbjct: 25  VFETKMEWSAFEAFWEEEKRNGNPYAKAIVSYDLSERKCIVLIDHRY------------- 71

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
             +E+D+++    N  +++  +KK   K +KT        KAA +    +++ +K +   
Sbjct: 72  --IELDVSMPLSKNIEKYFSKRKKALDKSDKT--------KAALENIVDKLIPKKAIVP- 120

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
           +  R+++WFEKF++FIS+EN LVI G++AQQNE+IVK+++   D+Y H D+HGASS   K
Sbjct: 121 AQKRELYWFEKFHFFISTENELVIGGKNAQQNEIIVKKHLEPTDLYFHCDIHGASSIACK 180

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
             R E     +T+ +A    +C S+ WD  ++   ++V P QVSK+AP+GEY+T GSFMI
Sbjct: 181 G-RSE-----VTIEEASYMALCMSKCWDEGVIKPVFYVEPDQVSKSAPSGEYITKGSFMI 234

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDES 693
           +GK+N + P+ L  G GLLF+L+ S
Sbjct: 235 KGKRNIMNPYRLEYGIGLLFKLEGS 259


>gi|294658357|ref|XP_002770767.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
 gi|202953070|emb|CAR66294.1| DEHA2F07678p [Debaryomyces hansenii CBS767]
          Length = 1064

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 95/257 (36%), Positives = 144/257 (56%), Gaps = 8/257 (3%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--I 548
           V +D++LS  ANAR ++E KK  ESKQ K   +   A K A+KK    +  +    N  +
Sbjct: 514 VWIDISLSPFANARVYFESKKSAESKQIKVEKSTEFALKNAKKKIEQDLNNKLKNENDSL 573

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  +WFEKF WF+SSE YL ++GRD  Q +MI  R+ +  D ++ +D+ G+    IK
Sbjct: 574 KQIRPKYWFEKFLWFVSSEGYLCLAGRDNSQIDMIYYRHFNDNDYFISSDIEGSLKVFIK 633

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
           N    + +PP TL QAG F +  S AW+ K+ TSAW ++   +SK    G  ++ G+F  
Sbjct: 634 NPFKGESIPPSTLMQAGIFAISASSAWNGKVTTSAWLLHGADISKKDFDGTLISSGNFNY 693

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG----MDDFEDSGHH--KE 722
           + KK +LPP  LIMGFG  +  DE +   +   R  R EE G    MD+ +    H  K 
Sbjct: 694 KAKKTYLPPCQLIMGFGFYWLGDEETTKKYTETRLSREEEHGLKIVMDNKKQDLEHSSKS 753

Query: 723 NSDIESEKDDTDEKPVA 739
           ++ I+S  ++ D++ V+
Sbjct: 754 SNKIQSSLNEVDDEKVS 770



 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 119/479 (24%), Positives = 230/479 (48%), Gaps = 56/479 (11%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           + ++  R  N+Y++S  +  F L  S       +S+KV++L + G +LH T + R    T
Sbjct: 19  KEILNYRLQNIYNVSSSSRQFLLKFSIP-----DSKKVVVL-DCGNKLHLTEFDRPTTQT 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PS F  KLRKH++TRRL  ++Q+G DR+++ +F  G+   Y+ LE ++ GNILL D +  
Sbjct: 73  PSNFVTKLRKHLKTRRLSQIKQIGNDRVLVLEFSDGL--FYLALEFFSAGNILLLDQDRK 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASKLHAALTSSKEPDANEPDKVNE 197
           +L+L R     DKG       RY   EI ++F+ +             + D N   K   
Sbjct: 131 ILSLQRMV--SDKG----GNDRYAVNEIYKMFDESLF-----------KSDFNYERKT-- 171

Query: 198 DGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL---KTVLGEALGYGPAL 254
                   SKE + G    +   L    ++ S DG + K       K +   +      L
Sbjct: 172 -------YSKEQVQGWIKSQRDKL----DQRSQDGNKKKNKVFSIHKLLFVNSSHLSSDL 220

Query: 255 SEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFE-DWLQDVISGDIVPEGYILMQN 313
            +  ++  G+  +    +    +D  + +++ A+ + E D++  +   +    GYI+ + 
Sbjct: 221 VQLNLIKNGISSSASCFDFEN-DDAKMDLIIKALEEAESDYINLLEKSEDAINGYIVSK- 278

Query: 314 KHLGKDHPPTESGSSTQ-IYDEFCPLL-----LNQFRSREFVKFETFDAALDEFYSKIES 367
           K+L  +    +S +  + I DEF P       ++ +R   F + + ++  +D F+S IES
Sbjct: 279 KNLSYNPDNDDSTNDLEYIMDEFYPYKPYKSDMDNYR---FTEIQGYNRTMDSFFSTIES 335

Query: 368 QRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAV 427
            +   +   ++  A  +L+    +++ ++ +L  + + ++K  + I Y  + VD    +V
Sbjct: 336 TKYALRIDQQKQQATKRLDYAREERDKQIQSLLAQQESNIKKGDAIMYYADLVDQCKDSV 395

Query: 428 RVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLLSNNLDEMDDEEKT 485
              +  +M W ++  +++ E+  GN +A  I+  L L+ N ++L L  ++DE ++E KT
Sbjct: 396 VKLIDQQMDWTNIESLIELEQSRGNKIARFINLPLNLKENKINLHLP-DMDEENEENKT 453



 Score = 47.0 bits (110), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 57/125 (45%), Gaps = 21/125 (16%)

Query: 812 STKHGIETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKERG 871
           S K   E ++ D++ E     +    R    +S  ERR L+KG+     D KV   ++  
Sbjct: 770 SAKEDTEPSKEDITSEPASESKEGKKR----LSAKERRMLRKGK-----DIKVSENEDTD 820

Query: 872 KDASSQPESIVRKTKIEGGKIS------------RGQKGKLKKMKEKYGDQDEEERNIRM 919
           +D     E  ++  K+E  K              RG+K K+KK+  KY DQDEEER IRM
Sbjct: 821 EDVFDPIEQEMKNLKLEETKKKTAEPSSQKPPNVRGKKSKMKKIAAKYADQDEEERKIRM 880

Query: 920 ALLAV 924
             L  
Sbjct: 881 EALGT 885


>gi|341581973|ref|YP_004762465.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
 gi|340809631|gb|AEK72788.1| Fibronectin-binding protein A (FbpA) [Thermococcus sp. 4557]
          Length = 650

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 184/359 (51%), Gaps = 29/359 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E   F TF  ALDE++ KI  ++A  +   + +A   +L      QE  
Sbjct: 242 VPIELRIYEGFEKRYFTTFSEALDEYFGKITMEKARVEQTKRLEAKKRQLLMTLRKQEEM 301

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++    + ++ +LI  N   ++  +   R A    + W++  + ++E ++AGN VA
Sbjct: 302 LKGFEEGAKANQEIGDLIYANYALIERLLEEFRKA-TETLGWDEFKKRIEEGKRAGNRVA 360

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKK 511
            ++                     D +EK + +E    KV + L  S   NA  +YE  K
Sbjct: 361 LMVKG------------------TDPKEKAVTIELEGKKVRLYLNRSIGENAELYYEKAK 402

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM--RKVHWFEKFNWFISSENY 569
           K   K E  + A+    +  ++  RL   + K   ++  +  RK  WFEKF WF+SSE +
Sbjct: 403 KFRHKHEGALKAYEDTKRKLDEIERLIEEELKKELSVKRIERRKKKWFEKFRWFVSSEGF 462

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LV++G+DA  NE+++KR+M   D+Y HAD++GA   VIK+    Q     T+ +A  F V
Sbjct: 463 LVLAGKDAGTNEILIKRHMDDNDLYCHADVYGAPHVVIKDG---QKAGEKTIFEACQFAV 519

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
             S+AW   + +  A+W +P+QV+K  P+GEYL  G+FM+ GK+N+L   PL +  G++
Sbjct: 520 SMSKAWSRGVYSEDAYWAHPNQVTKQTPSGEYLGKGAFMVYGKRNWLHGLPLKLAVGVI 578



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 83/161 (51%), Gaps = 13/161 (8%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+   V+ L+ L+G R   +Y    +  I KL    G  +        L+++
Sbjct: 1   MKEEMSSVDIRYIVRELQSLVGSRVDKIYHDGDEIRI-KLRTKEGRQD--------LILQ 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R H T Y ++    PS FT+ LRKH+    ++ + Q G+DRI+  + G     + ++
Sbjct: 52  AGKRFHVTTYVKEAPKQPSSFTMLLRKHLSGGFIDAIEQHGFDRIVKIRVG----DYTLV 107

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            EL+ +GN++L D E  ++  LR     D+ +   + ++YP
Sbjct: 108 GELFRRGNVILVDGENRIVAALRYEEYKDRRIMPKAEYQYP 148


>gi|448725341|ref|ZP_21707802.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
 gi|445798677|gb|EMA49073.1| hypothetical protein C448_01989 [Halococcus morrhuae DSM 1307]
          Length = 695

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 164/703 (23%), Positives = 280/703 (39%), Gaps = 128/703 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSVDLAALVTELGTYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  GV--RLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  +  +  D    P GF   LR  +       V Q G+DR++ F+F  G    
Sbjct: 57  GETKRAHVVSPEHVPDAPGRPPGFAKMLRNRLSGADFAGVSQFGFDRVLTFEFERGDRNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            V+ EL+ +GN+ + D+   V+  L +                     R+  RT A    
Sbjct: 117 KVVAELFGEGNVAVLDATGEVVDCLNT--------------------VRLQSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S++     +P  V+ DG                      +    +++ D       
Sbjct: 157 YEFPSTR----FDPLAVDYDG---------------------FAARMEESNTD------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            L   L   L +G    E +    G+   + + E ++ E    +VL  A+    + L   
Sbjct: 185 -LVRTLATQLNFGGLYGEELCTRAGVEKELAIEEADETE---FEVLYDALTGLSEQLS-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y               + G    +     P  L++    +  +F++F AAL
Sbjct: 239 -SGDFDPRIY--------------RDDGEPVDV----TPFPLDERAEFDSEEFDSFTAAL 279

Query: 359 DEFYSKI---ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ ++   E + + ++ +   +    +  +I   QE  +   + + DR  + AE +  
Sbjct: 280 DAYFVELDTTEDEESGERERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYA 339

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E VD  +  VR A    + WE +     E  + G   AG +  +      +++     
Sbjct: 340 NYELVDEILTTVRNAREEGIGWEAIEERFAEGEERGIAAAGAVTGIEPSEGTVTI----- 394

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWY-ELKKKQESKQ--EKTITAHSKAFKAAE 532
             E+DD +       VE+D       NA R Y E K+  E K+  E+ +    +  +A E
Sbjct: 395 --EIDDRD-------VELDPQEGVEQNADRLYREAKRVVEKKEGAEEAVVETREELEAIE 445

Query: 533 KK--------------TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQ 578
           ++                 + +   +  +I   +   W+E+F WF +S+ YLVI GR+A 
Sbjct: 446 RQRDEWEAGDVDDDPDEESEDVDWLSRRSIPTRKNEQWYERFRWFHTSDGYLVIGGRNAD 505

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGCFTVCHSQ 633
           QNE +VK+Y+ +GD + H  + G   T++K   P +P     +P  +L +A  F V +S 
Sbjct: 506 QNEDLVKKYLDRGDRFFHTQVQGGPVTILKATGPSEPTREIDLPDRSLEEAAQFAVSYST 565

Query: 634 AW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            W + +    A+   P QVSKT  +GEYL  G F IRG + + 
Sbjct: 566 VWKNGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYF 608


>gi|395504204|ref|XP_003756446.1| PREDICTED: nuclear export mediator factor NEMF [Sarcophilus
           harrisii]
          Length = 996

 Score =  170 bits (430), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 117/351 (33%), Positives = 189/351 (53%), Gaps = 41/351 (11%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           L+ +L   L YG  L EH + + G     ++ E  K E   I+ +++ + K ED ++ + 
Sbjct: 183 LRRILNPYLPYGATLIEHCLRENGFSSYFRVDE--KFETGDIEKVLVCLQKAEDHMKTM- 239

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALD 359
             +   +GYI+ Q K       P +       Y+EF P L +Q     +++FE+FD A+D
Sbjct: 240 -SNFSGKGYII-QKKEKKPSLEPDKQSEDILTYEEFHPFLFSQHSKCPYIEFESFDKAVD 297

Query: 360 EFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTL--KQEVDRSVKMAELIEYNL 417
           EFYSK+E Q+ + +   +E  A  KL+ +  D E+R+  L   QE+D+ +K  ELIE NL
Sbjct: 298 EFYSKLEGQKIDLKALQQEKQALKKLDNVRKDHEHRLEALHQAQEIDK-IK-GELIEMNL 355

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN--- 474
           + VD AI  VR ALAN++ W ++  +VKE +  G+ VA  I +L L+ N +++LL N   
Sbjct: 356 QIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDIVANAIRELKLQTNHVTMLLKNPYL 415

Query: 475 -----------NLDEMDDEE-----------------KTLPVEKVEVDLALSAHANARRW 506
                      N+++ + EE                 K  P+  V+VDL+LSA+ANA+++
Sbjct: 416 ISDEEEEDDEINIEKEETEEPKGKKKKQKNKQLQKLQKNKPL-LVDVDLSLSAYANAKKY 474

Query: 507 YELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWF 557
           Y+ K+    K +KT+ A  KAF++AEKKT+  + + + V  I   RKV+  
Sbjct: 475 YDHKRHAARKTQKTVEAAEKAFRSAEKKTKQTLKEVQMVTTIQKARKVYCI 525



 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 104/170 (61%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+ A +     RL+GMR  N+YD+  KTY+ +L             KV LL+
Sbjct: 1   MKTRFSSVDICAILSEFNARLLGMRVYNIYDVDNKTYLIRLQKPDF--------KVTLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LT+ E+ +L +LR   D+   V    R +YP +  RV E
Sbjct: 113 IIELYDKGNIVLTNYEYLILNILRFRSDEADDVKFAVREKYPIDHARVME 162



 Score = 97.8 bits (242), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/63 (66%), Positives = 51/63 (80%)

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEE 709
           QVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DE  +  H  ER+VRG++E
Sbjct: 536 QVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDEPCVWRHRGERKVRGQDE 595

Query: 710 GMD 712
            ++
Sbjct: 596 DLE 598


>gi|322368861|ref|ZP_08043428.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
           DX253]
 gi|320551592|gb|EFW93239.1| Fibronectin-binding A domain protein [Haladaptatus paucihalophilus
           DX253]
          Length = 711

 Score =  169 bits (429), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 164/713 (23%), Positives = 290/713 (40%), Gaps = 128/713 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA  + L    G +    Y         K+ +        +  ++ LL+E 
Sbjct: 22  KRELSSIDLAAITRELNSFEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLVEV 74

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  +       P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 75  GEVKRAHTVAPEHVPPAPGRPPNFAMMLRNRLSGADFAGVEQFEFDRILQFHFKREDGDT 134

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+ + D    V+  L                    +  R+  RT A    
Sbjct: 135 TIVAELFGQGNVAVLDENNEVIDCL--------------------DTVRLKSRTVAPGSQ 174

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+      P +++ +                     +     N +  D  R    
Sbjct: 175 YEFPSSR----VNPLEIDYE---------------------EFEYRMNDSDTDVVR---- 205

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L +G   +E +    G+    K++++   +++  + L  A+ +  + L+  
Sbjct: 206 ----TLATQLNFGGLYAEEVCTRAGV---EKVTDIADADEDEYERLYAAIERLREPLE-- 256

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +GD  P  Y                      +  +  P  L ++   +   F++F+AA+
Sbjct: 257 -TGDFDPRVYY------------------EDDVRVDVTPFPLEEYEGLDSEAFDSFNAAV 297

Query: 359 DEFYSKI---ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D++++ +   E++ A +  K    A   K  +I   QE  +   +++ D   + AEL+  
Sbjct: 298 DDYFTNLDVSENEDAGEPQKPDFQAQIEKQQRIIEQQEGAIEGFERKADAEREKAELLYA 357

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N   VD  +  VR A A    W D+    +E  + G   A  +  +      +++     
Sbjct: 358 NYGFVDEILATVRNARAEDTPWADIEARFEEGAERGIEAAEAVQGIDPSEGTVTV----- 412

Query: 476 LDEMDDEEKTL-PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK- 533
             E+DD + TL P + VE         NA R Y+  K+ E K+E  + A     +  E+ 
Sbjct: 413 --EIDDTKITLFPDDGVE--------KNANRLYQEAKRIEEKKEGALAAIEDTREELEEV 462

Query: 534 KTRLQILQEK--------------TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
           K R +  +E+              + A+I   ++  W+E+F WF +S+ +LV+ GR+A +
Sbjct: 463 KKRAEQWEEEPEEERTEPENIDWLSRASIPVRKQEQWYERFRWFRTSDGFLVLGGRNADE 522

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGCFTVCHSQA 634
           NE +VK+YM + D++ H+  HG   T++K   P +P     VP  +  +A  F V +S  
Sbjct: 523 NEELVKKYMDRNDLFFHSQAHGGPITILKTSDPSEPSKDVDVPEQSKREAAQFAVSYSSV 582

Query: 635 W-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           W D +    A+ V P QVSKT  +GEYL  G F IRG + +    P+ +  G+
Sbjct: 583 WKDGRFAGDAYMVTPDQVSKTPESGEYLEKGGFAIRGDRTYFEDTPVGVAVGI 635


>gi|14591254|ref|NP_143331.1| hypothetical protein PH1465 [Pyrococcus horikoshii OT3]
 gi|3257889|dbj|BAA30572.1| 650aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 650

 Score =  169 bits (429), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 108/358 (30%), Positives = 190/358 (53%), Gaps = 28/358 (7%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L  +   E V FETF  ALDE++ K+  ++A+ +   K +    +L      QE  
Sbjct: 243 VPIDLKWYEGYEKVYFETFSQALDEYFGKLTIEKAKAEKTKKLEEKRKQLLATLKRQEEM 302

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++E+ ++ ++  LI  N   +D  +     A+ N + W++  + ++E +K GN +A
Sbjct: 303 IKGFEKELKKNQEIGNLIYANYTLIDGLLREFSKAVKN-LGWDEFKKRIEEGKKKGNKIA 361

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            ++  +  E N +++ +                ++V++ L    + NA  +YE  KK + 
Sbjct: 362 LMVKGIEPESNSITVEIEG--------------KRVKLYLDKDLNENAEIYYEKAKKAKH 407

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH-----WFEKFNWFISSENYL 570
           K E       KA++  ++K      + +       ++K+      WFEKF WFISSE +L
Sbjct: 408 KLE----GARKAYEDLKRKLESIEREIEEEEKKIQVKKIEKRKKKWFEKFRWFISSEGFL 463

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI G+DA  NE++V++Y+ + D+Y HAD+ GA   VIK+    Q     T+ +A  F V 
Sbjct: 464 VIGGKDATTNEIVVRKYLEENDLYCHADIWGAPHVVIKDG---QKAGEKTIFEACQFAVS 520

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            S+AW   + ++ A+WVYP+QV K AP+GE+L  G+FM+ GK+N++   PL +  G++
Sbjct: 521 MSRAWSEGLYSADAYWVYPNQVKKQAPSGEFLPKGAFMVYGKRNWMYGIPLKLAVGII 578



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/162 (27%), Positives = 83/162 (51%), Gaps = 13/162 (8%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M++ D+   V+ L+  +IG R   VY    +  I        + ++GE  K L++ 
Sbjct: 1   MKEEMSSVDIRYIVEELKSEIIGARVDKVYHEGDEVRI-------KLHKTGEGRKDLII- 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T+Y ++  + PS F + LRKHI    +ED+ Q  +DRI+  +    +    +
Sbjct: 53  EAGKRIHLTSYIKESSSQPSSFAMLLRKHISGNFVEDIEQHDFDRIVKIK----IGKFKI 108

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I EL+ +GN++    +  +L  +R     D+ +     ++YP
Sbjct: 109 IAELFKKGNVVFVTEDNIILGAIRYEEFKDRVIKPKHEYKYP 150


>gi|448578556|ref|ZP_21643976.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
 gi|445725734|gb|ELZ77354.1| hypothetical protein C455_13495 [Haloferax larsenii JCM 13917]
          Length = 702

 Score =  169 bits (427), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 169/705 (23%), Positives = 290/705 (41%), Gaps = 128/705 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLVEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYDFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP           AS+L 
Sbjct: 117 KIVVELFGQGNISVLDETGEVVRSLETVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                       +P  V+ D                      L +N +++  D  R    
Sbjct: 165 ------------DPLSVSRDA---------------------LGRNMDESDTDIVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+   + +S+  + + +A+   ++      D  + V
Sbjct: 188 ----TLATQLNLGGLYAEELCTRAGVDKTLDISDATEEDYDAVFDAIV------DLREQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +G+  P  Y L  ++++    P               PL  +Q    +   +++F+ AL
Sbjct: 238 RAGEFDPRLY-LDDDENVVDVTP--------------FPLREHQNDGLDEEAYDSFNEAL 282

Query: 359 DEFYSKIESQRAEQQ----HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE 414
           DE++ +++    EQQ    ++   +A   K  +I   QE  +    +      + AEL+ 
Sbjct: 283 DEYFFRLDLTADEQQDVGSNRPDFEAQIAKQERIIEQQEGAIEGFDERAAAERERAELLY 342

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N + VD  +  VR A    + W+D+A  +    + G P A  +  +      +++    
Sbjct: 343 ANYDLVDDVLSTVRDAREEGVPWDDIAEKLDAGAEQGIPAAEAVTNVDGAEGTVTI---- 398

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK 534
              E+DD   TL       D+++    NA R Y   K+ + K+E  + A     +  E+ 
Sbjct: 399 ---ELDDSTITL-------DVSMGVEKNADRLYTEAKRIQEKKEGALAAIEDTREELEEV 448

Query: 535 TRLQILQEK-------------------TVANISHMRKVHWFEKFNWFISSENYLVISGR 575
            R +   E                    ++ ++      +W+E+F WF +S+ YLV+ GR
Sbjct: 449 KRRRDEWEADDDEDDAEDEEEQEETDWLSLQSVPVKSTDYWYEQFRWFHTSDGYLVVGGR 508

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-----PPLTLNQAGCFTVC 630
           +A QNE +VK+YM K D + H    G   T++K   P +P      P  +L++A  F V 
Sbjct: 509 NADQNEALVKKYMDKHDRFFHTQARGGPVTLLKATGPSEPAKEVDFPESSLHEAAQFAVS 568

Query: 631 HSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + +
Sbjct: 569 YSSIWKDGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRTY 613


>gi|414878086|tpg|DAA55217.1| TPA: hypothetical protein ZEAMMB73_507954 [Zea mays]
          Length = 522

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 134/325 (41%), Positives = 178/325 (54%), Gaps = 77/325 (23%)

Query: 667 MIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDI 726
           MIRGKKNFLPPHPL+MGFG+LFRLDESSL SHLNERRVRGE+E + + E +   K+ S+ 
Sbjct: 1   MIRGKKNFLPPHPLVMGFGILFRLDESSLASHLNERRVRGEDEALHEME-AESRKKQSNP 59

Query: 727 ESEKDDTDEKPVAES-----------------LSVPNSAHPAPSHTNASNVDSHEFPAE- 768
           ES++D   E    E+                 L +P+ +      +N    +S E   E 
Sbjct: 60  ESDEDIGSEGANKETHEDESNGQTTNIQQNNDLELPDLS------SNIGTANSSELLPEI 113

Query: 769 --DKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSE 826
             ++T+ NG  S I      + A V+ QL+DL+D+ L LG A +S     + +    L+E
Sbjct: 114 QAEETLDNG--SSILK-EETIEASVSSQLDDLLDKTLCLGPAKVSGKSSLLTSIPSSLAE 170

Query: 827 EDKHVE-RTATVRDKPYISKAERRKLKKGQ-----------GSSVVDPKVEREKERGK-- 872
           +D  +E +  T+RDKPYISKAERRKLKKGQ           G +V  P   ++ E+GK  
Sbjct: 171 DDDDLEVKRPTIRDKPYISKAERRKLKKGQVNDETATDSQNGEAVETPGTSKQ-EKGKAE 229

Query: 873 -----DASSQPESIVRK-----TKIEG----------------------GKISRGQKGKL 900
                  +SQP++  ++     TK  G                       K+SRGQKGKL
Sbjct: 230 TKATDSKASQPDTSQQEKGKANTKATGSKLSQPGNSQQEKGKGSTHAGNAKVSRGQKGKL 289

Query: 901 KKMKEKYGDQDEEERNIRMALLAVS 925
           KK+KEKY +QDEEER IRMALLA S
Sbjct: 290 KKIKEKYAEQDEEEREIRMALLASS 314


>gi|268323401|emb|CBH36989.1| conserved hypothetical protein containing fibronectin-binding
           protein A N-terminal domain, DUF814 family [uncultured
           archaeon]
 gi|268324037|emb|CBH37625.1| conserved hypothetical protein containing fibronectin-binding
           protein A N-terminal (FbpA) domain and DUF814 domain
           [uncultured archaeon]
          Length = 631

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 163/681 (23%), Positives = 273/681 (40%), Gaps = 142/681 (20%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M++ D+AA V  L+ L+G R    Y    +    KL +   +          L++E
Sbjct: 1   MKESMSSVDIAAIVIELQELLGARLVKAYQPGREEIRLKLHHKGSLD---------LIIE 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G R+H T Y R     PS F + LRKH+   R+  +RQL +DRI+            +I
Sbjct: 52  AGKRIHLTKYKRASPRMPSNFAMYLRKHLSGARIAQIRQLDFDRIVEITIERWDKKLRLI 111

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
            EL  +GNI++ D + T+L  LR      + + +  ++  P                   
Sbjct: 112 AELLPRGNIVVVDEDGTILLPLRRKSFASRKIKVGEKYERP------------------- 152

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                P    P  ++E                      DL  N  K   D A        
Sbjct: 153 -----PSRANPLTMSES---------------------DLM-NLCKRDKDIA-------- 177

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISG 301
           +V    L +G   +E +    G+   M+  E+   E NAI   +  +  FE         
Sbjct: 178 SVFASELSFGGLYAEEVCAKAGIDKRMRADELTATEINAIHETIHTL--FEP-------- 227

Query: 302 DIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEF 361
                  I+  +K   K H   E         +F P  L+ + ++E   F + + A DE+
Sbjct: 228 -------IITNDKSTLKAHIVIEGEDKI----DFVPFELSSYENKEKQFFPSLNDAADEY 276

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           ++   ++  E+Q K++ D    K  +I  +Q   +H  + +   S K  E+I  +     
Sbjct: 277 FTTQIAEVVEEQAKSEHDTVIGKYERILNEQLEALHKFELKEAESTKKGEMIYAH----- 331

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
                          + +L  M++E  K              +R  ++L L        D
Sbjct: 332 ---------------YLELEEMLQEPDK--------------KRKVVTLTLP-------D 355

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYE----LKKKQESKQEKTITAHSKAFKAAEKKTRL 537
            + +L     E+D ++S H NA  +Y+     +KK+E  +        K     EK+ R+
Sbjct: 356 TDISL-----EIDTSVSLHKNAGAYYDKAKVFRKKREGVEPAIEMTKEKIRTEKEKEVRI 410

Query: 538 QILQEKTVANISHMR--KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +   E+ +     +R  K  W+EKF WF +S+ +LV+ G+DA  NE++ K++M   D++ 
Sbjct: 411 E---EELIPTKKEVRTEKEEWYEKFRWFETSDGFLVVGGKDATTNEILAKKHMEPNDLFF 467

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKT 654
           H    GA   + K    E  +    L +   F   +S  W         + V   QVSKT
Sbjct: 468 HTQAEGAPVVIAKAGGKE--ISESGLKEIAQFAASYSNLWKYGFYEGECYCVVGEQVSKT 525

Query: 655 APTGEYLTVGSFMIRGKKNFL 675
            P+GEY+  GSFM+RGK+ + 
Sbjct: 526 PPSGEYIKKGSFMVRGKRKYF 546


>gi|354544800|emb|CCE41525.1| hypothetical protein CPAR2_800770 [Candida parapsilosis]
          Length = 661

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 86/224 (38%), Positives = 128/224 (57%), Gaps = 3/224 (1%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVAN 547
           V +D  LS++ANA  ++E KK  ESKQ K       A+K AEKK    +   L+ +   +
Sbjct: 200 VSIDYTLSSYANASIYFESKKAAESKQAKIEKGAEIAYKNAEKKINQDLVKNLRRENGTS 259

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
            +  R+  WFE F WF+SSE YL ++GR   Q +++  +Y S  D  V +++ G+    +
Sbjct: 260 SNAEREKFWFESFYWFVSSEGYLCLAGRSKSQTDLLYFKYFSDDDFLVSSEIEGSLKVFV 319

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           KN    + VPP T+ QAG F +  SQAW+ K+ T+AW ++  ++SK   +G  L  G F 
Sbjct: 320 KNPLKGESVPPTTILQAGIFAMAASQAWNGKINTAAWVLHGSEISKYNSSGALLPAGEFE 379

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
              KK+FLPP  L+MGFGL F +DE S   H  +R  + +E G+
Sbjct: 380 YLAKKHFLPPAQLVMGFGLYFLVDEGSAEGHKIQRVQKEKEHGL 423


>gi|448089209|ref|XP_004196743.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|448093427|ref|XP_004197774.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|359378165|emb|CCE84424.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
 gi|359379196|emb|CCE83393.1| Piso0_003968 [Millerozyma farinosa CBS 7064]
          Length = 1056

 Score =  166 bits (420), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 132/229 (57%), Gaps = 2/229 (0%)

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EK 543
           +P+ +V +DL+LS+ AN+R +++ KK  E+KQ K       A + AEKK    +    +K
Sbjct: 508 VPLLEVSIDLSLSSFANSRIYFDNKKNAETKQAKVEKNTEIALRNAEKKINRDLSSNLKK 567

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
               +  +R   WFEKF WF+S+E YL ++G D  Q +MI  R+ +  D +V +D+ G+ 
Sbjct: 568 ESETLKQIRPKFWFEKFYWFVSNEGYLCLAGNDDTQTDMIYYRHFNDNDYFVTSDIEGSL 627

Query: 604 STVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTV 663
              +KN    + V P TL QAG F++  S+AWD+K+ TSAW++   +VSK    G  ++ 
Sbjct: 628 KVFVKNPYQGKEVSPSTLTQAGIFSMSASKAWDNKITTSAWYLKGSEVSKKDFDGSLVSF 687

Query: 664 GSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           G+F  +G+K FLPP  L+MG    F  DE +   + + R  R  E G++
Sbjct: 688 GNFNYKGEKQFLPPSQLVMGLAFYFLGDEETTQRYRSTRLERQAEFGLE 736



 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 124/491 (25%), Positives = 242/491 (49%), Gaps = 69/491 (14%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+   D+    K L+  ++  R  NVY    S K YI K      V +S    K L+
Sbjct: 1   MKQRVTGLDLQILCKELQEEIVSYRLQNVYGTAKSNKQYILKF----SVADS----KKLV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +E+G R+H T Y R  +  PS F  K+RKH+++RRL  V+Q+  DR+++ +F  G  A 
Sbjct: 53  ALETGNRIHLTEYERATEAFPSSFVTKMRKHLKSRRLTGVKQVANDRVLVLEFSDG--AF 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASKL 177
           Y+ LE ++ GNI+L D    +L+L R+ +  +KG       +Y   E   +F+++   K 
Sbjct: 111 YLALEFFSAGNIILLDENLKILSLQRTVQ--EKG----GNDKYAVNETYSMFDKSLFQKE 164

Query: 178 HAALTSSKEPD------ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSK----NSNK 227
                 S  PD      A++  ++     +V++ASK      K  K + + K    N++ 
Sbjct: 165 IQIPKISFTPDLISEWIASQKTRL----EDVTDASK------KKKKVYSIHKLLFVNASH 214

Query: 228 NSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLA 287
            S D        L++++ + +    +  +++             +   LED     +V A
Sbjct: 215 LSGD------LILRSLVKQGINPSSSCFDYV------------EDTQGLED-----IVRA 251

Query: 288 VAKFE-DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR-- 344
           + + + ++L+ V S   V    ++++NK    + P  +S     I DEF P   ++    
Sbjct: 252 LQETQAEYLEIVESPSRVKGCIVMVKNKLYNPEDP--DSKDLKYIMDEFHPYKPHKENED 309

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           S +F++ E ++  LD ++S IES R   + + +++ A  +L K   +++ ++ +L  + +
Sbjct: 310 SYQFMEVEGYNKTLDTYFSTIESSRYALRIEQQKEQARKRLEKARNERDKQIQSLLDQKN 369

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYL 463
            ++K  E I Y+ + ++    +V   +  +M WE++ ++++ E+  GN +A +I   L L
Sbjct: 370 LNIKKGEAIIYHADVIEECKESVLQLIRQQMDWENIEKLIQLEQTRGNKLAQMIKLPLNL 429

Query: 464 ERNCMSLLLSN 474
            +N +++LL++
Sbjct: 430 VQNKINVLLTD 440


>gi|448454957|ref|ZP_21594359.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
           21995]
 gi|445814337|gb|EMA64302.1| Fibronectin-binding A domain protein [Halorubrum lipolyticum DSM
           21995]
          Length = 736

 Score =  166 bits (420), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 174/731 (23%), Positives = 303/731 (41%), Gaps = 121/731 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHVADAEHVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L++ R   + VA  +++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGALQTVRLKSRTVAPGAQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+   + + E     D+ ++ L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKTLPVDEAT---DDQLRALHEALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y   +    G      +     ++ D   P  L++      V F++F+AA+
Sbjct: 239 -SGDIDPRVY---EEDLDGAGSEDADGDGDPRVVD-VTPFPLSEHEGLPSVGFDSFNAAV 293

Query: 359 DEFYSKIESQRAEQQHKAKEDAA---------FHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           DE++ ++E +  +   +A  DA+           K  +I   Q   +   +++ +   + 
Sbjct: 294 DEYFYRLEREDGDA-GEAPADASPSRPEFEEEIAKQERIIEQQRGAIEGFEEQAEAERER 352

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
           AEL+    + VD  +  VR A  N + W+++A  ++   + G P A  +  +      ++
Sbjct: 353 AELLYARYDLVDEVLSTVREARENEVPWDEIAETLEAGAERGIPAAEAVADVDGGEGTVT 412

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSK 526
           + L     E  +   ++   +VE+D +     NA R Y+  K+ E K+E   + I +  +
Sbjct: 413 VELDREGGEDGESGDSV---RVELDASTGVEVNADRLYQEAKRIEGKKEGAMEAIESTRR 469

Query: 527 AFKAAE-KKTRLQILQEK------------------------TVANISHMRKVHWFEKFN 561
             +A E +K   + ++                          + ++I       W+++F 
Sbjct: 470 ELEAVEERKAEWEAMEAADDGDGDGGDSEDEDDEEEYETDWLSRSSIPIRSPDDWYDRFR 529

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-- 619
           WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   P +   P+  
Sbjct: 530 WFHTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAGPSESADPVDF 589

Query: 620 ---TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
              TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 590 SEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 649

Query: 676 PPHPLIMGFGL 686
              P  +  G+
Sbjct: 650 EDVPCRIAVGV 660


>gi|448688255|ref|ZP_21694088.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
 gi|445779316|gb|EMA30246.1| hypothetical protein C444_10199 [Haloarcula japonica DSM 6131]
          Length = 717

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 161/663 (24%), Positives = 269/663 (40%), Gaps = 103/663 (15%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ LE++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLEESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   G++ P  Y    +   G D    + G   +  D   P+ L+++     
Sbjct: 232 DEMGTRLRE---GNVDPRVYYETLDDDDGADSGEADDGPDRRRVD-VTPIPLSEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F++ALD+++     QR E+       +   +A   K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNSALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQKRIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFDEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARR-WYEL 509
               ++L +               N DE+  E K +  +K   + AL+A  N R    E+
Sbjct: 406 SEGTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEEV 462

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K++++  +       +   ++ ++ T    +Q     ++      HW+E+F WF +S+ +
Sbjct: 463 KERRDEWEADDGDDETDEDQSEDEPTDWLSMQ-----SVPTRSTEHWYEQFRWFHTSDGF 517

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQA 624
           LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA
Sbjct: 518 LVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQASLDQA 577

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
             F V +S  W D K     + V P QVSKT  +GEYL  G F IRG + +    P+ + 
Sbjct: 578 AQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVA 637

Query: 684 FGL 686
            G+
Sbjct: 638 VGI 640


>gi|147920849|ref|YP_685344.1| hypothetical protein RCIX612 [Methanocella arvoryzae MRE50]
 gi|110620740|emb|CAJ36018.1| conserved hypothetical protein [Methanocella arvoryzae MRE50]
          Length = 670

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 189/366 (51%), Gaps = 33/366 (9%)

Query: 334 EFCPLLLNQFRSREF--VKFETFDAALDEFYS---KIESQRAEQQHKAKEDAAFHKLNKI 388
           +  P+ L ++    +  V FETF+ A+D ++    K E++ A  + KA++   F +  + 
Sbjct: 249 DVLPIELKRYEGEGYEKVYFETFNKAVDAYFGARIKTEAKAAIVEKKAEKLGVFERRLR- 307

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
              Q++ +   ++E   + +  E+I    + V+  I  ++ A     SW+D+ +++K+ +
Sbjct: 308 --QQQDAIAKFEREEQENARKGEVIYAEYQKVEEIIKVIKGARDRGYSWDDIRKILKDAK 365

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           KAGN  A  I  +      ++++L              P   V +D+ L+   NA+ +Y+
Sbjct: 366 KAGNQAAAAIQAIDSATGLITVVL--------------PEATVNIDVKLTVPQNAQAYYD 411

Query: 509 LKKKQESKQE---KTITAHSKAFKAAEKKTRL--QILQEKTVANISHMRKVHWFEKFNWF 563
             KK ++K+E   K I    KA   A+ K     + +Q+K  A     RK  W+++F WF
Sbjct: 412 KVKKVQAKKEGALKAIEETRKAMAKAQPKVAEPGKPVQKKVSAK---PRKPKWYDRFRWF 468

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
            +S+ +LV++GRDA  NE IVK+YM K DV+ HA  HGA  TV+K     +PV    L +
Sbjct: 469 FTSDGFLVVAGRDADTNEEIVKKYMEKNDVFFHAQAHGAPITVLKTA--GKPVTEQALAE 526

Query: 624 AGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
              F V +S  W +   +   +WV P QVSKT   GEY+  G+F++RG++N++    +  
Sbjct: 527 VAQFAVSYSSVWKAGQFSGDCYWVKPEQVSKTPEPGEYVAKGAFIVRGERNYVKDVQVRA 586

Query: 683 GFGLLF 688
             G+ F
Sbjct: 587 AIGIRF 592



 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 84/162 (51%), Gaps = 7/162 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  M + DV A VK L+ L+  +    Y  S      +L       +  ++ K  L+ E
Sbjct: 1   MKEEMTSVDVYAVVKELQFLVDAKLEKAYQTSADEIRLRL-------QEFKTGKYDLIAE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH TA A +    P  F + LRK+    R+  +RQ G+DRI+  +       + +I
Sbjct: 54  AGKRLHITANAPESPKLPPAFAMILRKYTMGGRITAIRQHGFDRIVEIETVRAGEGNILI 113

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           +E++A+GNI+L D+E  ++  L+S +  D+ V    ++ YP+
Sbjct: 114 VEMFARGNIILADAERKIIMPLKSLKMRDRDVVRGEKYEYPS 155


>gi|448424081|ref|ZP_21582207.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
           10247]
 gi|448478971|ref|ZP_21603977.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
 gi|445682746|gb|ELZ35159.1| Fibronectin-binding A domain protein [Halorubrum terrestre JCM
           10247]
 gi|445822801|gb|EMA72563.1| Fibronectin-binding A domain protein [Halorubrum arcis JCM 13916]
          Length = 735

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 179/718 (24%), Positives = 296/718 (41%), Gaps = 118/718 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + +++  D+ +  L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIDEATDDQLGALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y    +   G       S        +  P  L +      V F++F+AA+
Sbjct: 239 -SGDIDPRVYEESVDGEGGDGGDGDGSDGRDPRVVDVTPFPLAEHEDLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++  +  E+     + +A          K  +I   Q+  +   +++     + A
Sbjct: 298 DEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAQAERERA 357

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  + + VD  I  VR A  N + W+++   +    + G P A  +  +      +++
Sbjct: 358 ELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAGAERGIPAAEAVAGVDGGEGTVTV 417

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKA 527
                L+E  D+  T+    VE+D +     NA R Y   K+ E K+E   + I +  + 
Sbjct: 418 ----ELEEEGDDGGTV---TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREE 470

Query: 528 FKAAEKKTRLQILQEKTV------------------------ANISHMRKVHWFEKFNWF 563
            +A +++ R    Q+                           ++I       WFE+F WF
Sbjct: 471 LEAVKERKREWEEQQAADDGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWF 530

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL---- 619
            +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+    
Sbjct: 531 HTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKAAGPSESADPVDFSE 590

Query: 620 -TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 591 ETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 648


>gi|389848295|ref|YP_006350534.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
 gi|448618500|ref|ZP_21666737.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
 gi|388245601|gb|AFK20547.1| hypothetical protein HFX_2877 [Haloferax mediterranei ATCC 33500]
 gi|445746871|gb|ELZ98329.1| hypothetical protein C439_16130 [Haloferax mediterranei ATCC 33500]
          Length = 701

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 166/716 (23%), Positives = 288/716 (40%), Gaps = 127/716 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  + R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTEMNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GDIKRAHIAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ QGNI +                D+ G  + S      E  R+  RT A    
Sbjct: 117 KIVVELFGQGNIAVL---------------DETGEVVRS-----LETVRLKSRTVAPGSQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+     +P  ++ D                      L ++  ++  D  R    
Sbjct: 157 YEYPSSR----LDPLTISRDA---------------------LGRHMEQSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                +   L  G   +E +    G+   + +++      +AI   ++ +       Q V
Sbjct: 188 ----TIATQLNLGGLYAEELCTRAGVEKTLDIADATDDHYDAIYDAIVNLR------QQV 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SG+  P  Y    +  +     P +   +  + +E                ++TF+ AL
Sbjct: 238 RSGEFDPRLYTDDDDAVVDVTPFPLQEHQNAGLDEE---------------AYDTFNEAL 282

Query: 359 DEFYSKIESQRAEQQ---HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           DE++ +++    EQ+   ++   +    K  +I   Q+  +    ++ +   + AEL+  
Sbjct: 283 DEYFFRLDLTADEQEATSNRPDFEEQIAKQERIIEQQKQAIEGFDEQANEERERAELLYA 342

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N + VD  +  VR A    + W+D+A  + E  + G P A  +  +      +++     
Sbjct: 343 NYDLVDDVLSTVREAREQGVPWDDIAVTLDEGAEQGIPAAEAVTNVDGANGTVTI----- 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWY-ELKKKQESKQEK--TITAHSKAFKAAE 532
             ++DD   TL       D+++    NA R Y E K+ QE KQ     I    +  +AA+
Sbjct: 398 --KLDDATVTL-------DVSMGVEKNADRLYTEAKRIQEKKQGALAAIEDTREELEAAK 448

Query: 533 KK----------------TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           ++                   +     ++ ++      HWFE+F WF +S  YLV+ GR+
Sbjct: 449 RRRDEWEADDQEDESDEDEEPEETDWLSLDSVPVKSTEHWFERFRWFHTSSGYLVVGGRN 508

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTVCH 631
           A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +A  F V +
Sbjct: 509 ADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQKVDFSEETLQEAAQFAVSY 568

Query: 632 SQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + +    P  +  G+
Sbjct: 569 SSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDRRYFEDVPAKVAVGI 624


>gi|448502987|ref|ZP_21612851.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
           10284]
 gi|445693389|gb|ELZ45541.1| Fibronectin-binding A domain protein [Halorubrum coriense DSM
           10284]
          Length = 730

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 181/744 (24%), Positives = 296/744 (39%), Gaps = 153/744 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAADPDNVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+     + E     D+ +  L  A+++ ++ L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKETPIEEAT---DDQLGALHDALSRLDERLR-- 238

Query: 299 ISGDIVPEGY---ILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
            SGDI P  Y   +           P        ++ D   P  L +      V F++F+
Sbjct: 239 -SGDIDPRVYEESVDGDGSEDDGGDP--------RVVD-VTPFPLAEHEGLPSVGFDSFN 288

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAA---------FHKLNKIHMDQENRVHTLKQEVDRS 406
           AA+DE++ ++ ++ A    +A  DA            K  +I   Q   +   +++    
Sbjct: 289 AAVDEYFYRLGNE-ATDDGEAPADATASRPDFEAEIAKQERIVEQQRGAIEGFEEQAQAE 347

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
            + AEL+  N + VD  +  VR A  N + W+++A  +    + G P A  +  +     
Sbjct: 348 RERAELLYANYDLVDEVLSTVREARENEVPWDEIAATLDAGAERGIPAAAAVVDVDGGEG 407

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK 526
            +++ L       DDE+      ++++D +     NA R Y+  K+ E K+         
Sbjct: 408 TVTVAL-------DDEDGG--SVRIDLDASEGVEVNADRLYQEAKRVEEKKAGA------ 452

Query: 527 AFKAAEKKTR--LQILQEK------------------------------------TVANI 548
             KAA + TR  L+ + E+                                    + ++I
Sbjct: 453 --KAAIESTREELEAVNERKAEWEEQEAAADESAGADGDGEDGEDGDEAYETDWLSRSSI 510

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  WFE+F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K
Sbjct: 511 PIRSPDDWFERFRWFRTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTILK 570

Query: 609 NHRPEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              P +   P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+ 
Sbjct: 571 ASGPSESADPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIE 630

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            GSF+IRG + +    P  +  G+
Sbjct: 631 KGSFVIRGDRTYFEDVPCRIAVGV 654


>gi|448448413|ref|ZP_21591226.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
           13561]
 gi|445814829|gb|EMA64787.1| Fibronectin-binding A domain protein [Halorubrum litoreum JCM
           13561]
          Length = 736

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 179/719 (24%), Positives = 296/719 (41%), Gaps = 119/719 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + +++  D+ +  L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIDEATDDQLGALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y    +   G       S        +  P  L +      V F++F+AA+
Sbjct: 239 -SGDIDPRVYEESVDGEGGDGGDADGSDGRDPRVVDVTPFPLAEHEDLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++  +  E+     + +A          K  +I   Q+  +   +++     + A
Sbjct: 298 DEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAQAERERA 357

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  + + VD  I  VR A  N + W+++   +    + G P A  +  +      +++
Sbjct: 358 ELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAGAERGIPAAEAVVGVDGGEGTVTV 417

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKA 527
                L+E  D+  T+    VE+D +     NA R Y   K+ E K+E   + I +  + 
Sbjct: 418 ----ELEEEGDDGGTM---AVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREE 470

Query: 528 FKAAEKKTRLQILQEKTV-------------------------ANISHMRKVHWFEKFNW 562
            +A +++ R    Q+                            ++I       WFE+F W
Sbjct: 471 LEAVKERKREWEEQQAADDGSGGDGGDNEGEEDEEYETDWLARSSIPIRSPDDWFERFRW 530

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL--- 619
           F +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+   
Sbjct: 531 FHTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKAAGPSESADPVDFS 590

Query: 620 --TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 591 EETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 649


>gi|448512226|ref|ZP_21616340.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           9100]
 gi|448520849|ref|ZP_21618182.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           10118]
 gi|445694546|gb|ELZ46671.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           9100]
 gi|445702985|gb|ELZ54924.1| Fibronectin-binding A domain protein [Halorubrum distributum JCM
           10118]
          Length = 735

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 179/718 (24%), Positives = 296/718 (41%), Gaps = 118/718 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDHVADAPGRPPNFAKMLRNRLSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVIGALSTVRLKSRTVAPGSQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S                              +GG  FD  ++  ++ +D  R    
Sbjct: 166 PLTVS------------------------------RGG--FD--RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G VP  K + +++  D+ +  L  A+++  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAG-VP--KETPIDEATDDQLGALHDALSRIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGDI P  Y    +   G       S        +  P  L +      V F++F+AA+
Sbjct: 239 -SGDIDPRVYEESVDGEGGDGGDGDGSDGRDPRVVDVTPFPLAEHEDLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++  +  E+     + +A          K  +I   Q+  +   +++     + A
Sbjct: 298 DEYFHRLGGEETEEGEAPADASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAQAERERA 357

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  + + VD  I  VR A  N + W+++   +    + G P A  +  +      +++
Sbjct: 358 ELLYAHYDLVDEVISTVREARENEVPWDEIEETLAAGAERGIPAAEAVVGVDGGEGTVTV 417

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKA 527
                L+E  D+  T+    VE+D +     NA R Y   K+ E K+E   + I +  + 
Sbjct: 418 ----ELEEEGDDGGTV---TVELDASEGVEVNADRLYREAKRVEGKKEGAKEAIESTREE 470

Query: 528 FKAAEKKTRLQILQEKTV------------------------ANISHMRKVHWFEKFNWF 563
            +A +++ R    Q+                           ++I       WFE+F WF
Sbjct: 471 LEAVKERKREWEEQQAADDGSGGDGGDNEEEDEEYETDWLARSSIPIRSPDDWFERFRWF 530

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL---- 619
            +S  YLVI GR+A QNE +VK+YMSK D + H   HG   T++K   P +   P+    
Sbjct: 531 HTSTGYLVIGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKAAGPSESADPVDFSE 590

Query: 620 -TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GSF+IRG + + 
Sbjct: 591 ETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGSFVIRGDRTYF 648


>gi|448414286|ref|ZP_21577425.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
           14848]
 gi|445682579|gb|ELZ34996.1| RNA-binding protein, snrnp like protein [Halosarcina pallida JCM
           14848]
          Length = 701

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 173/711 (24%), Positives = 287/711 (40%), Gaps = 138/711 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D++A V  L R  G +    Y         ++ +        +  +V L++E 
Sbjct: 4   KRELTSVDLSALVTELNRYEGAKVDKAYLYGDDLLRLRMRDF-------DRGRVELILEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F + LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHAAKPEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFEFERDDEDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+ +GNI + D    V+  L + R   + VA  +++ +P+           S+LH
Sbjct: 117 QIVVELFGEGNIAVLDETGEVVRSLETVRLKSRTVAPGAQYEFPS-----------SRLH 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                        P  V+ +G                       +    +  D  R    
Sbjct: 166 -------------PFTVSYEG---------------------FKRRMEDSDTDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   +  G   +E      G+   M ++E     D   + +  A+  F D L+  
Sbjct: 188 ----TLATQVNLGGLYAEEFCTRAGVDKTMDITEAG---DEEFRAVYDAIQSFRDRLK-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y   +      D  P              PL  ++         +TF+ AL
Sbjct: 239 -SGDFDPRVY---EEDESVVDATP-------------FPLEEHEAEGLNSESHDTFNDAL 281

Query: 359 DEFYSKI----ESQRAEQQHKAKED--AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           DE++ ++    E +  E+    + D  A   K  +I   QE  +   +++     + AEL
Sbjct: 282 DEYFFRLDRTAEDEPDEEPGSNRPDFEAEIEKKKRIIQQQEGAIEGFEEQAQEERERAEL 341

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           +  + + VD  +  VR A    + W+D+ + ++E  + G P A                 
Sbjct: 342 LYAHYDLVDEVLTTVRDAREENVPWDDIRQRLEEGAERGIPAA----------------- 384

Query: 473 SNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQEKTITA-HSKA 527
             ++ ++D  E T+ VE    ++EV +      NA R Y   K+ E K+E  + A     
Sbjct: 385 -ESVVDVDGAEGTVTVELEDTRIEVVVDTGVEKNADRLYTEAKRVEGKKEGALAAVEDTR 443

Query: 528 FKAAEKKTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSENYL 570
            + AE K R +  +E+                 + ++I    + HWFE+F WF +S+ YL
Sbjct: 444 EELAEAKRRREEWEEEDEDDEEEDEEPEDIDWLSRSSIPLRTEEHWFERFRWFHTSDGYL 503

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAG 625
           VI GR+A QNE IVK+Y++K D++ H   HG   TV+K   P +P      P  T  +A 
Sbjct: 504 VIGGRNADQNEEIVKKYLNKHDLFFHTQAHGGPVTVVKATGPSEPSEAVEFPDATKREAA 563

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            F V +S  W + +    A+ V P QVSKT  +GEYL  GSF+IRG + + 
Sbjct: 564 QFAVSYSSIWKEGRYAGEAYMVTPDQVSKTPESGEYLEKGSFVIRGDRTYF 614


>gi|448508289|ref|XP_003865916.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
 gi|380350254|emb|CCG20475.1| hypothetical protein CORT_0A00840 [Candida orthopsilosis Co 90-125]
          Length = 654

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 128/224 (57%), Gaps = 3/224 (1%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVAN 547
           V +D  LS++ANA  ++E KK  E+KQ K       A+K AEKK    +   L+ +   +
Sbjct: 197 VSIDYTLSSYANASVYFENKKAAEAKQTKVEKGAEIAYKNAEKKINQDLVKNLRRENGTS 256

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
               R+  WFE F WF+SSE YL ++GR   Q +++  +Y S  D +V +++ G+    +
Sbjct: 257 SKSEREKFWFESFYWFVSSEGYLCLAGRTKSQIDLLYFKYFSDDDFFVSSEIEGSLKVFV 316

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           KN    + VPP T+ QAG F +  SQAW+ K+ T+AW ++  +VSK   +G  L  G F 
Sbjct: 317 KNPLKGESVPPSTILQAGIFAMSASQAWNGKINTAAWVLHGSEVSKYNQSGALLPPGEFE 376

Query: 668 IRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
              +K+FLPP  L+MGFGL F +DE S   H  +R  + +E G+
Sbjct: 377 YLARKHFLPPAQLVMGFGLYFLVDEGSAEGHKQQRVQKEKEHGL 420


>gi|345005767|ref|YP_004808620.1| fibronectin-binding A domain-containing protein [halophilic
           archaeon DL31]
 gi|344321393|gb|AEN06247.1| Fibronectin-binding A domain protein [halophilic archaeon DL31]
          Length = 717

 Score =  163 bits (413), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 171/709 (24%), Positives = 288/709 (40%), Gaps = 120/709 (16%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA    L R  G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELSSVDLAALATELSRYEGAKLDKAYLYGEDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  I +  L  V Q  +DRI++F+F       
Sbjct: 57  GDTKRAHVAAQEHVPDAPGRPPEFAMMLRGRIESADLVSVEQYEFDRILVFEFERPDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +++EL+  GN+ + D    V+  L + R   + VA  + + +P            S+L+
Sbjct: 117 TLVVELFGDGNVAVLDGNGEVVRSLETVRLKSRTVAPGTPYGFPQ-----------SRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S +  +A   D   +    V  A++ NLGG  G                       
Sbjct: 166 PLEMSYEALEARMEDSDTDVVRTV--ATQLNLGGFWG----------------------- 200

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                            E +    G+   M + +  + E  A+   ++++A        +
Sbjct: 201 -----------------EELCRRAGVEKAMDIEDAGEAEYRAVHRELMSLA------DTL 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPP-TESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            SG   P  Y+   +     D    TE G    +     P+ L +      V F++F+AA
Sbjct: 238 TSGQFDPRVYVEETDGESDDDDKSLTERGKVVDV----SPVALKERSELLSVAFDSFNAA 293

Query: 358 LDEFYSKIESQRAEQQHKAKE-----DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           LDE++ ++  Q   ++          +A   K  +I   QE  +   ++E ++  + AEL
Sbjct: 294 LDEYFYRLTHQERREEEGGGRKRPDFEADIEKEKRIIQQQEGAIEGFEEEAEQRRREAEL 353

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
                E VD  +  ++ A      W+++   + +  + G P A  +  +   +  +++  
Sbjct: 354 CYERYELVDEVLSTIQQARQQEHGWDEIQETLAQGAEQGIPAAEAVVDVNSAKGMVTI-- 411

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFK 529
                E+DD   TL       D ++    NA R Y   K+ E K+E   + I    K  +
Sbjct: 412 -----ELDDHRITL-------DASMGVEKNADRLYREAKRVEGKKEGAREAIEDTRKRLE 459

Query: 530 AAEKK-----------------TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVI 572
           AA+++                    + +   T  +I   +  HW+E+F WF +S+ YLVI
Sbjct: 460 AAKQRREEWEAEDDPEPEPDPDEEQEEVDWLTREDIPIRQPEHWYEEFRWFRTSDGYLVI 519

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCF 627
            GR+A QNE +VK+Y+ K D + H   HG   T++K   P +   P+     TL +A  F
Sbjct: 520 GGRNADQNEALVKKYLDKHDRFFHTQAHGGPVTLLKASGPSEAASPVDFPDATLQEAAQF 579

Query: 628 TVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            V +S  W D +    A+ V P QVSKT  +GEYL  G F IRG + + 
Sbjct: 580 AVSYSSVWKDGRGAGDAYMVDPDQVSKTPESGEYLEKGGFAIRGDREYF 628


>gi|448470211|ref|ZP_21600408.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
 gi|445808289|gb|EMA58361.1| Fibronectin-binding A domain protein [Halorubrum kocurii JCM 14978]
          Length = 735

 Score =  163 bits (412), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 181/743 (24%), Positives = 302/743 (40%), Gaps = 146/743 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHVADAEHVADAPGRPPNFAKMLRNRMAGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L + R   + VA  +++ YP           AS+L+
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGALSTVRLKSRTVAPGAQYEYP-----------ASRLN 165

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                        P  V+  G                       ++  ++ +D  R    
Sbjct: 166 -------------PLDVSPGG---------------------FERHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+   + + EV    D+ ++ L  A+++  D L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGVEKTLPVDEVT---DDQLRALHEALSRIGDRLR-- 238

Query: 299 ISGDIVPEGYI-LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAA 357
            SGDI P  Y   +     G+D    ES        +  P  L++      V F++F+AA
Sbjct: 239 -SGDIDPRVYEEALDGGDGGED---AESDDRDPRVVDVTPFPLSEHEGLPSVGFDSFNAA 294

Query: 358 LDEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +DE++ ++E++  +      + +A          K  +I   Q   +   +++ +   + 
Sbjct: 295 VDEYFYRLEAEDTDAGEAPADASASRPDFEEEIAKQERIIEQQRGAIEGFEEQAEAERER 354

Query: 410 AELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           AEL+  EY+L  VD  +  V+ A    + W+++A  +      G P A  +  +      
Sbjct: 355 AELLYAEYDL--VDEVLSTVQEAREAEVPWDEIAETLDAGADRGIPAAEAVVDVDGGEGT 412

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +++       E+DDE+      +VE+D +     NA R Y+  K+ E K+E        A
Sbjct: 413 VTV-------ELDDEDGD--SVRVELDASAGVEVNADRLYQEAKRIEGKKEG-------A 456

Query: 528 FKAAEKKTR-LQILQEK-------------------------------------TVANIS 549
            +A E   R L+ ++E+                                     + ++I 
Sbjct: 457 MEAIESTRRELEAVKERKAEWEAKEAAADETPGGGGDGDGDDDADDEEYETDWLSRSSIP 516

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
                 WFE+F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K 
Sbjct: 517 IRSPDDWFERFRWFRTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKA 576

Query: 610 HRPEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTV 663
             P +   P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  
Sbjct: 577 AGPSESADPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEK 636

Query: 664 GSFMIRGKKNFLPPHPLIMGFGL 686
           GSF+IRG + +    P  +  G+
Sbjct: 637 GSFVIRGDRTYFEDVPCRVAVGV 659


>gi|260942807|ref|XP_002615702.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
 gi|238850992|gb|EEQ40456.1| hypothetical protein CLUG_04584 [Clavispora lusitaniae ATCC 42720]
          Length = 605

 Score =  162 bits (410), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 154/299 (51%), Gaps = 7/299 (2%)

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQE 542
           +P   V++DLALSA ANA  ++E KK   +KQ +       A K AE+K +  +   L+ 
Sbjct: 70  MPTLTVDIDLALSAFANASVYFESKKVAVTKQTRVEKNTKIALKNAERKIQSDLNKNLKN 129

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           +T  ++   R   WFEK+ WF +S+ YL ++GRD  Q +MI  R+ S GD +V +DL GA
Sbjct: 130 ET-ESLRAFRHKFWFEKYFWFTTSDGYLCLAGRDDLQTDMIYYRHFSDGDYFVSSDLDGA 188

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
           +   I N    Q V P  L QAG F +  S AW +K+ +SAWW+    V+K    G  L 
Sbjct: 189 AKVFILNPYKAQNVSPSALFQAGIFALSTSTAWSAKISSSAWWMSGADVTKREFDGSLLG 248

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD-DFEDSGHHK 721
            G    + KKN++PP  ++MGFG  +  DE +   +   R  R EE G+   F +     
Sbjct: 249 PGILKYKAKKNYMPPAQMVMGFGFYWLCDEETTQKYKIAREKRQEEHGLKVSFSNKKSDL 308

Query: 722 ENSDIESEKDDT-DEKPVAESLSVP-NSAHPAPSHTNASNVDSHEFPAEDKTISNGIDS 778
           ++  I+S  + T +E  + E+   P NS  P+     +   DS     E+K     ++S
Sbjct: 309 DDMSIKSSMNSTKEEASLEETQKEPENSDEPSKKDAYSPIEDSEASHPEEKETETMVES 367



 Score = 46.6 bits (109), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 20/31 (64%), Positives = 24/31 (77%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG+KGKLKK+  KY DQDEEER +RM +L  
Sbjct: 394 RGKKGKLKKINAKYADQDEEERRLRMEMLGT 424


>gi|378754807|gb|EHY64836.1| hypothetical protein NERG_02239 [Nematocida sp. 1 ERTm2]
          Length = 697

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 114/360 (31%), Positives = 172/360 (47%), Gaps = 43/360 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           FE F AA+D  ++  E      Q K +         KI   QE  +H   +E+      A
Sbjct: 269 FEGFGAAMDAVFNVQEITETASQKKQR---------KIREAQERDLHKKIEEMTILKDKA 319

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  N  +V   I  +  A A  +S ++  R  +E  K  NP A +I K+      + L
Sbjct: 320 ELLSENQAEVKNVISVIEAANAASLSEKEFERF-RETEKDTNPTAQIIKKVNFGNKTVDL 378

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA----HSK 526
                         TL  + V +D   S        Y+  KK E K +KT  A      K
Sbjct: 379 --------------TLDKKAVSIDYTKSIFEQINMLYQKAKKIEEKLKKTRKALDESKHK 424

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
             + A K  +++ ++          R   WFEKF WFI+ ++ L+I+GRD++QNE++VK+
Sbjct: 425 EVEIASKVEKIEKIE----------RNPFWFEKFRWFITKDSDLIIAGRDSKQNEILVKK 474

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWV 646
           Y+   D Y HAD+ G SS ++  H  +      T   A    +  S+AW++ ++T  + V
Sbjct: 475 YLLDTDYYFHADIRGGSSVIVGEHATDH-----TKEIAASMAMHLSKAWENNLITEVYCV 529

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
              QVSKTAP GEYLT GSFMI GKK F  P  L  GF L++++++  +    + R+V G
Sbjct: 530 RGDQVSKTAPAGEYLTHGSFMITGKKEFYHPTRLEYGFSLIYKIEDEEITISDDNRKVTG 589



 Score = 75.9 bits (185), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 48/144 (33%), Positives = 73/144 (50%), Gaps = 14/144 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R++  D+ A V  L ++ G     VY  S K  + K  N           K  LL++
Sbjct: 1   MKGRLSWLDIRAGVNELEKINGCHIKTVYSTSKKAILIKFSN-----------KDQLLID 49

Query: 62  SGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
              + H T  + +K N TP    L LR+ I   R+E + QLG+DR+ + +   G     +
Sbjct: 50  PPSKFHLTHKSYEKVNLTP--LALYLRREISNYRVEKITQLGFDRVAVIKIRSGKGCRLL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+E+YA GNI+LTD E  ++ LLR
Sbjct: 108 IVEMYANGNIILTDEELNIINLLR 131


>gi|297619525|ref|YP_003707630.1| Fibronectin-binding A domain-containing protein [Methanococcus
           voltae A3]
 gi|297378502|gb|ADI36657.1| Fibronectin-binding A domain protein [Methanococcus voltae A3]
          Length = 722

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 114/344 (33%), Positives = 185/344 (53%), Gaps = 16/344 (4%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           +E +  ALDE++S+   Q+  ++ + K D    K  +I   Q       +++  ++ +  
Sbjct: 311 YENYLNALDEYFSQFILQKDIKKEETKLDKLIRKQERIVNSQIETKAKYEKQSAKNHQKG 370

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           +LI  N  ++D  I  +R A   +M W+ + ++V E +   NP+   I+ +  +   ++L
Sbjct: 371 DLIYANFTEIDEIINTIRSA-REKMEWKQIKKIVSENK--DNPILSKIESINEKNAELNL 427

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
            L   + E   E  T+    V +D+  SA  NA  +Y   KK ++K    I A   + K 
Sbjct: 428 KL---IAEYGGELGTIK-GNVAIDIRESAFENANSYYTKAKKFKNKVSGVIVALEISQKK 483

Query: 531 AEK---KTRL--QILQEKTVANISHMRKV-HWFEKFNWFISSENYLVISGRDAQQNEMIV 584
            EK   +T L  ++L++K        R+V  W+EK  W I  +NYL+I+G+DA  NE+IV
Sbjct: 484 LEKIRQQTELDAELLKQKQQNIKKKERRVLKWYEKLKWTII-DNYLIIAGKDATTNEIIV 542

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-A 643
           K+Y+ K DV  H  + GA  TVIKN   E P    TL +   F V HS+AW   + ++  
Sbjct: 543 KKYLEKNDVVFHTLMEGAPFTVIKNTSEETPSEE-TLLEVAKFAVSHSKAWKLGLGSADV 601

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +WV P Q+SKTA +GE+L  G+F+IRGK+NF+   PL +G G++
Sbjct: 602 YWVLPEQISKTAESGEFLKKGAFVIRGKRNFIRSAPLDLGVGIV 645



 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 79/169 (46%), Gaps = 16/169 (9%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLLL 59
           K  M   D+   VK L+ LI  +    + ++    +  I K+ N    TE G  E V+  
Sbjct: 15  KKEMTNIDICVAVKELQNLINAKFDKAFLVNNQDGRELILKVHN----TEMGTQEIVI-- 68

Query: 60  MESGV----RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
              GV     +  T Y R K   P  F + LRK++R  ++  + Q  +DRI+   F    
Sbjct: 69  ---GVGKYKYITKTEYDRQKPKNPHSFVMLLRKNLRNIKITKIEQHNFDRIVKITFEWNE 125

Query: 116 NAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
             + +I+EL+  GN++L D E  ++  LR+ R  D+ +     +++P +
Sbjct: 126 LKYTLIIELFKDGNVILLDKENKIVMPLRNERFSDRKLIPKEEYKFPAQ 174


>gi|110667755|ref|YP_657566.1| hypothetical protein HQ1801A [Haloquadratum walsbyi DSM 16790]
 gi|109625502|emb|CAJ51929.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
          Length = 719

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 163/724 (22%), Positives = 288/724 (39%), Gaps = 147/724 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  LRR  G +    Y        F++ +        +  ++ LL+E 
Sbjct: 4   KQELTSVDIAALVTELRRYTGAKVDKTYRYGDDLLRFRMRDF-------DRGRLELLIEV 56

Query: 63  GV--RLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R+HT    +  D    P  F + LR  +    L +V Q  +DRI++  F  G    
Sbjct: 57  GTQKRIHTADPDHVPDAPERPPNFAMMLRNRLSGADLVNVEQFEFDRIMILSFERGEEMT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+  GN+ + DS   V+  L                    E  R+  RT A    
Sbjct: 117 RIIVELFGDGNVAVVDSAGEVIQSL--------------------ETVRLKSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                S+      P +V  D                  +   L   S+ +          
Sbjct: 157 YEFPDSR----VNPLQVTYD------------------RFISLMNESDTD---------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +   L   L  G   +E +    G+    K +++    D   + +  A+      LQ  
Sbjct: 185 -IVRTLATQLNLGGLYAEEVCARAGI---DKTTQITNTSDKIYRAIYTALESLGTQLQ-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P    L  +     D  P              PL   + ++ +   +++F+ AL
Sbjct: 239 -SGDFEPR---LYTDDDAVIDATP-------------FPLEERKQQNLDVTTYDSFNGAL 281

Query: 359 DEFYSKIE-SQRAEQQHKAKED--AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ +++ +  AE+  + + D  A   K  +I   QE  +   +Q  +     AEL+  
Sbjct: 282 DVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQEGAIDDFEQRAEAERSRAELLYA 341

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E V+  I  ++ A A   SW+++        + G   A  +    +  +    +++  
Sbjct: 342 NYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGIDAAAAV----VSVDGAEAMVTIE 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
           +D+M          +V V++ +    NA + Y   K+ E K+E  +TA  +++    A K
Sbjct: 398 IDDM----------RVPVNVDVGVEKNADQRYTEAKRIEEKKEGALTAIENTREELNAVK 447

Query: 534 KTR------------------LQILQEK------------------TVANISHMRKVHWF 557
           + R                   + + +K                  ++ +I   +   W+
Sbjct: 448 QRRDAWDREDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTDDEWLSMTSIPLQKNDDWY 507

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVP 617
           E+F WF +S  YLV+ GR+A QNE +VK+Y++K D + H + HG   T++K   P +P  
Sbjct: 508 EQFRWFHTSTGYLVVGGRNADQNETLVKKYLNKHDRFFHTEAHGGPITILKASGPSEPAE 567

Query: 618 PLTLN-----QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+ L      +   F + +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG 
Sbjct: 568 PIELTAETRREVAQFAISYSSIWKEGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGD 627

Query: 672 KNFL 675
           + ++
Sbjct: 628 RTYI 631


>gi|448391228|ref|ZP_21566471.1| fibronectin-binding A domain-containing protein [Haloterrigena
           salina JCM 13891]
 gi|445666097|gb|ELZ18766.1| fibronectin-binding A domain-containing protein [Haloterrigena
           salina JCM 13891]
          Length = 723

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 179/692 (25%), Positives = 281/692 (40%), Gaps = 154/692 (22%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGETKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDS------ 162

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT        LT S+E                               +FD  +    +  
Sbjct: 163 RTNP------LTVSRE-------------------------------AFD--REMEDSDT 183

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLED------NAIQVL 284
           D  R    TL T     L +G   +E I    G+   M ++E +  ED       AI+ L
Sbjct: 184 DVVR----TLAT----QLNFGGLYAEEICTRAGVEKAMDIAEAD--EDVYDRIYGAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTES--------GSSTQIYDEFC 336
            L          D+ +G+  P  Y+  +    G D   +ES         S  ++ D   
Sbjct: 234 AL----------DLRNGNFDPRLYLADE----GDDDNESESDENGGDGDSSPDRVVDA-T 278

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQ-----HKAKEDAAFHKLNKIHMD 391
           P  L +        +++F AALD+++ ++E    E++      +   +    K  +I   
Sbjct: 279 PFPLEEHVELASEPYDSFLAALDDYFYRLELAEDEEETDPTTQRPDFEEEIAKYERIIEQ 338

Query: 392 QENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
           Q+  +   +QE D   + AEL+  EY L  VD  +  V+ A A    WE++     EER 
Sbjct: 339 QQGAIEGFEQEADALREQAELLYAEYGL--VDDILSTVQEARAQDRPWEEI-----EER- 390

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARR 505
                       + E     +  +  + ++D  E T+ VE    ++++        NA R
Sbjct: 391 ------------FAEGADRGIAAAEAVVDVDGSEGTVTVELDGERIDLVAKQGVEQNADR 438

Query: 506 WYELKKKQESKQEKTITAHSKAFK-AAEKKTR----------------------LQILQE 542
            Y   K+ E K+E  + A     +  AE K R                         L E
Sbjct: 439 LYTEAKRVEEKKEGALAAIEDTREDLAEAKARRDRWEEEDAAAEGDDDEDEDDDRDWLSE 498

Query: 543 KTVANISHMRKVH-WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
            +V     +R+   WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG
Sbjct: 499 PSVP----IRENEPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHG 554

Query: 602 ASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKT 654
              TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QV+KT
Sbjct: 555 GPVTVLKATDPSEASSSDIELPESSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKT 614

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +GEYL  G F IRG + +    P+ +  G+
Sbjct: 615 PESGEYLEKGGFAIRGDRTYYRDTPVDVAVGI 646


>gi|422295934|gb|EKU23233.1| zinc knuckle (cchc-type) family protein, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 397

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 130/413 (31%), Positives = 192/413 (46%), Gaps = 90/413 (21%)

Query: 1   MVKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK +  T DV A V+ LR +++G++  N+YD+  +TY FKL    G       EKV LL
Sbjct: 1   MVKTKFTTPDVRAMVRDLRTKVLGLKVVNIYDIDNRTYTFKLAVPGG-------EKVTLL 53

Query: 60  MESGVRLHTTAYARDKK---NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
           +ESG R HTTAYAR++      P+ F +KLRK++R + LEDVRQLG DR+++F+FG G  
Sbjct: 54  LESGARFHTTAYARERSVPGELPNVFAMKLRKYLRGKGLEDVRQLGMDRVVVFRFGQGEG 113

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSH----------------------------RD 148
           A ++ILELYA GN++LTD+ + +L LLR+H                            R 
Sbjct: 114 ALHLILELYASGNLVLTDANYLILALLRTHQYDQGPEKAVDGEVVGKDAEAGAGTVEGRV 173

Query: 149 DDKGVAIMSRHRYP----------TEICRVFERTTASK----------LHAALTSSKEPD 188
           ++ G  +   H YP          T      E+   +K              L + +E  
Sbjct: 174 EESGRVVRVGHVYPLAFASNALAATRSSAGVEKDAGAKQDPPPWLAVTAETVLAALREVV 233

Query: 189 ANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEAL 248
             E  K  ++GN  S+ ++   GG++G         ++  S    +    T KT+   AL
Sbjct: 234 VREKGKAGKEGNGTSSMAQ---GGKRGRTKRGGQAGASARSKVNLKMALMTSKTLDLSAL 290

Query: 249 GYGPALSEHIILDTGLVPNMKL-----------------SEVNKLEDNAIQVLVLAVAKF 291
             GPA+ EH +L+ GL P ++L                  +   L +     L  AV   
Sbjct: 291 --GPAIVEHAVLEAGLRPLLRLMPPASAVALGEDEEEGEGQREGLTEEEAARLAEAVQGL 348

Query: 292 EDWLQDV-ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQI-YDEFCPLLLNQ 342
           +  L+ + + G    EGYIL +      D   T  G   +I Y+EF PL L Q
Sbjct: 349 DGRLRRLDLPGQ---EGYILCRK----ADGAGTRGGEEDEIMYEEFHPLRLRQ 394


>gi|91773364|ref|YP_566056.1| hypothetical protein Mbur_1391 [Methanococcoides burtonii DSM 6242]
 gi|91712379|gb|ABE52306.1| FbpA, DUF814 containing protein [Methanococcoides burtonii DSM
           6242]
          Length = 663

 Score =  159 bits (402), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 111/379 (29%), Positives = 182/379 (48%), Gaps = 32/379 (8%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQ----HKAKEDAAFHKLNKIH 389
           +  PL L Q+   E   + +F+ ALDEF+ K  S+   +Q     K KED    +L K  
Sbjct: 257 DVLPLELTQYSDAEKEFYPSFNKALDEFFGKKASEEVIEQVVAKKKEKEDVFERRLRK-- 314

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
             Q+  +   + +  R   +AE I  N + V+  +  +  A     SW+D+   +K+ + 
Sbjct: 315 --QQEAILKFETDSTRYTLIAESIYGNYQTVEEVLSVLEAARDKGYSWKDIWDTLKKAK- 371

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
                    D L   +  +S+  +     +D     L V    +++  +   NA+ +Y  
Sbjct: 372 ---------DTLPAAKAIVSIDPAEGSVVVD-----LDVVNANINVRKTIPQNAQMYYNK 417

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
            KK   K++  + A     +A +K+      ++K       + K HW+++F WF SS+ +
Sbjct: 418 AKKISKKRDGALIAIEDTKRAMQKR------EQKVSKRRKAVFKKHWYDRFRWFFSSDGF 471

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
           LVI GRD+  NE IVK+YM K D+  H  + GA  TVIK    +  +P  TL +A  F V
Sbjct: 472 LVIGGRDSDTNEEIVKKYMEKRDIVFHTQVPGAPITVIKTEGKD--IPETTLEEAARFVV 529

Query: 630 CHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLF 688
            +S  W S   +   +W+ P QVSKT  +GEYL  GSF+IRG++N+    P+ +  GL  
Sbjct: 530 SYSSVWKSGQFSGDCYWIKPEQVSKTPESGEYLKKGSFIIRGERNYYKDVPVGVAIGLDL 589

Query: 689 RLDESSLGSHLNERRVRGE 707
             +   +G  L+  +  G+
Sbjct: 590 GAETRVIGGPLSAVQSNGK 608



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 72/147 (48%), Gaps = 11/147 (7%)

Query: 2   VKVRMNTADVAAEVKCLR----RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVL 57
           +K  M +ADVAA V  L      LI  +   +Y  +P      L     +   G      
Sbjct: 1   MKQEMTSADVAALVSELGDGEGSLIDSKIGKIYQPAPDEIRINLF----IFGKGRYN--- 53

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++E+G R H + Y R+   TP  F + LRKHI   R+  ++Q  +DRII      G   
Sbjct: 54  LVIEAGKRAHMSNYVRESPKTPQAFPMLLRKHILGGRITSIKQYDFDRIIEMGVIRGGIE 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLR 144
             ++ EL+++GNI+L +S+  ++  ++
Sbjct: 114 TILVCELFSRGNIVLLNSDRKIILPMK 140


>gi|448737510|ref|ZP_21719550.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
           13552]
 gi|445803654|gb|EMA53937.1| hypothetical protein C451_08253 [Halococcus thailandensis JCM
           13552]
          Length = 695

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 156/709 (22%), Positives = 274/709 (38%), Gaps = 140/709 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSVDLAALVTELGTYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  +  +  D    P GF   LR  +       V Q G+DR++ F+F  G    
Sbjct: 57  GETKRAHVVSPEHVPDAPGRPPGFAKMLRNRLSGADFAGVSQFGFDRVLTFEFERGDRNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            V+ EL+ +GN+ + D+   V+  L +                     R+  RT A    
Sbjct: 117 KVVAELFGEGNVAVLDATGEVIDCLNT--------------------VRLQSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               S++     +P  V+ DG                      +    +++ D       
Sbjct: 157 YEFPSAR----FDPLAVDYDG---------------------FAARMEESNTD------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            L   L   L +G    E +    G+   + + E ++ +  A+   +  ++      + +
Sbjct: 185 -LVRTLATQLNFGGLYGEELCTRAGVEKELAIEEADETDFEALYDALTGLS------EQL 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P  Y               + G    +     P  L++    +   F++F AAL
Sbjct: 238 SSGDFNPRIY--------------RDDGDPVDV----TPFPLDERAELDSEGFDSFTAAL 279

Query: 359 DEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ ++++   E+   + +   +    +  +I   QE  +   + + DR  + AE +  
Sbjct: 280 DAYFVELDTTEDEESGGRERPDFEEQIERQQRIIDQQEGAIEDFEAQADRERETAESLYA 339

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E VD  +  VR A    + WE +     E  + G   A  +  +      +++    +
Sbjct: 340 NYELVDEILTTVRNAREEGIGWEAIEERFAEGEERGIAAAEAVSGIEPSEGTVTV----D 395

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKT 535
           +D+ D          VE+D       NA R Y   K+   K+E    A       AE + 
Sbjct: 396 IDDRD----------VELDPQEGVEQNADRLYREAKRVVEKKEGAEEA------VAETRE 439

Query: 536 RLQILQEK-----------------------TVANISHMRKVHWFEKFNWFISSENYLVI 572
            L+ ++ +                       +  +I       W+E+F WF +S+ +LV+
Sbjct: 440 ELEAIERQRDEWEAGDVDDDPDEESEDVDWLSQRSIPVRTDEQWYERFRWFHTSDGFLVL 499

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGCF 627
            GR+A QNE +VK+Y+ +GD + H  + G   T++K   P +P     +P  +L +A  F
Sbjct: 500 GGRNADQNEDLVKKYLDRGDRFFHTQVQGGPVTILKATGPSEPTREIDLPDRSLEEAAKF 559

Query: 628 TVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            V +S  W + +    A+   P QVSKT  +GEYL  G F IRG + + 
Sbjct: 560 AVSYSTVWKNGRFAGDAYMAEPDQVSKTPESGEYLEKGGFAIRGDRTYF 608


>gi|385803199|ref|YP_005839599.1| hypothetical protein Hqrw_1937 [Haloquadratum walsbyi C23]
 gi|339728691|emb|CCC39852.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
          Length = 719

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 163/724 (22%), Positives = 287/724 (39%), Gaps = 147/724 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  LRR  G +    Y        F++ +        +  ++ LL+E 
Sbjct: 4   KQELTSVDIAALVTELRRYTGAKVDKTYRYGDDLLRFRMRDF-------DRGRLELLIEV 56

Query: 63  GV--RLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R+HT    +  D    P  F + LR  +    L +V Q  +DRI++  F  G    
Sbjct: 57  GTQKRIHTADPDHVPDAPERPPNFAMMLRNRLSGADLVNVEQFEFDRIMILSFERGEEMT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+  GN+ + DS   V+  L                    E  R+  RT A    
Sbjct: 117 RIIVELFGDGNVAVVDSAGEVIQSL--------------------ETVRLKSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                S+      P +V  D                           N++  D  R    
Sbjct: 157 YEFPDSR----VNPLQVTYDR---------------------FVSLMNESDTDIVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
                L   L  G   +E +    G+    K +++    D   + +  A+      LQ  
Sbjct: 188 ----TLATQLNLGGLYAEEVCARAGI---DKTTQITNTSDKIYRAIYTALESLGTQLQ-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD  P    L  +     D  P              PL   + ++ +   +++F+ AL
Sbjct: 239 -SGDFEPR---LYADDDAVIDATP-------------FPLEERKQQNLDVTAYDSFNGAL 281

Query: 359 DEFYSKIE-SQRAEQQHKAKED--AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++ +++ +  AE+  + + D  A   K  +I   QE  +   +Q  +     AEL+  
Sbjct: 282 DVYFREVDRNPAAEESGQTRPDFAAEIAKKQRIIEQQEGAIDDFEQRAEAERSRAELLYA 341

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N E V+  I  ++ A A   SW+++        + G   A  +    +  +    +++  
Sbjct: 342 NYELVNEIIETIQTARAEDTSWDEIRETFAMGAERGIDAAAAV----VSVDGAEAMVTIE 397

Query: 476 LDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKAAEK 533
           +D++          +V V++ +    NA + Y   K+ E K+E  +TA  +++    A K
Sbjct: 398 IDDV----------RVPVNVDVGVEKNADQRYTEAKRIEEKKEGALTAIENTREELNAVK 447

Query: 534 KTR------------------LQILQEK------------------TVANISHMRKVHWF 557
           + R                   + + +K                  ++ +I   +   W+
Sbjct: 448 QRRDAWDREDAKPDTEDNADNTETVTDKVNTGTEPSRMGPTNDEWLSMTSIPLQKNDDWY 507

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVP 617
           E+F WF +S  YLV+ GR+A QNE +VK+Y++K D + H + HG   T++K   P +P  
Sbjct: 508 EQFRWFHTSTGYLVVGGRNADQNETLVKKYLNKHDRFFHTEAHGGPITILKASGPSEPAE 567

Query: 618 PLTLN-----QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           P+ L      +   F + +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG 
Sbjct: 568 PIELTAETRREVAQFAISYSSIWKEGRYADDAYVVTPDQVSKTPESGEYIEKGSFVIRGD 627

Query: 672 KNFL 675
           + ++
Sbjct: 628 RTYI 631


>gi|448677723|ref|ZP_21688913.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
           12282]
 gi|445773398|gb|EMA24431.1| hypothetical protein C443_04694 [Haloarcula argentinensis DSM
           12282]
          Length = 717

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 160/662 (24%), Positives = 264/662 (39%), Gaps = 101/662 (15%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   G++ P  Y    +   G  +  +      +  D   P+ L+++     
Sbjct: 232 DEMGTRLRE---GNVDPRVYYETLDDGDGAGNGESGDDPDRRRVD-VTPIPLSEYEGLYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F++ALD+++     QR E+       +   +    K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNSALDDYFFNF--QREEEVEGGETQRPDFEVEIEKQKRIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  VR A  + +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVRAAREDDVSWDDIEAKFDEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
               ++L +               N DE+  E K +  +K   + AL+A  N R   E  
Sbjct: 406 SEGTVTLDIGGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAV 462

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           K++  + E    A     + AE +   +     ++ +I       W+E+F WF +S+ +L
Sbjct: 463 KERRDEWE----ADDGEDEVAEDEGEDEPTDWLSMQSIPTRSTERWYEQFRWFHTSDGFL 518

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAG 625
           VI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA 
Sbjct: 519 VIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKKVDFPQSSLDQAA 578

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            F V +S  W D K     + V P QVSKT  +GEYL  G F IRG + +    P+ +  
Sbjct: 579 QFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFESTPVGVAV 638

Query: 685 GL 686
           G+
Sbjct: 639 GI 640


>gi|448666601|ref|ZP_21685246.1| fibronectin-binding A domain-containing protein [Haloarcula
           amylolytica JCM 13557]
 gi|445771732|gb|EMA22788.1| fibronectin-binding A domain-containing protein [Haloarcula
           amylolytica JCM 13557]
          Length = 717

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 160/669 (23%), Positives = 263/669 (39%), Gaps = 115/669 (17%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    A+  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHAADPAHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEHGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                     +++    +++
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGF--------------------VARIKESDAD 185

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
                    L   L   L +G    E +    G+  N+    V++L+++  + L   + +
Sbjct: 186 ---------LVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDELDESDFERLYELIDQ 233

Query: 291 FEDWLQDVISGDIVPEGYI--LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
               L++   GD+ P  Y   L      G   P  E         +  P+ L ++     
Sbjct: 234 MGTRLRE---GDVDPRVYYEALDDGDGAGSADPDDEPDRRRV---DVTPIPLEEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F+ ALD+++     QR E+       +   +A   K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNPALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQKRIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  V+ A A+ +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVQAARADDVSWDDIEAKFNEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
               ++L +                 +V VD       NA   Y+  K+ E K+E  + A
Sbjct: 406 SEGTVTLDIDGT--------------RVTVDAFTGVEKNADELYKEAKRIEEKKEGALAA 451

Query: 524 --HSKAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWF 563
             +++    A K+ R +   +                   ++ +I      HW+E+F WF
Sbjct: 452 IENTREDLEAVKERRDEWEADDGDDEADEDEGEDEPTDWLSMQSIPTRSTEHWYEQFRWF 511

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPP 618
            +S+ +LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P 
Sbjct: 512 HTSDGFLVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSTEVDFPQ 571

Query: 619 LTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
            +L+QA  F V +S  W D K     + V P QVSKT  +GEYL  G F IRG + +   
Sbjct: 572 SSLDQAAQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAIRGDRTYFES 631

Query: 678 HPLIMGFGL 686
            P  +  G+
Sbjct: 632 TPAGIAVGI 640


>gi|296109018|ref|YP_003615967.1| Fibronectin-binding A domain protein [methanocaldococcus infernus
           ME]
 gi|295433832|gb|ADG13003.1| Fibronectin-binding A domain protein [Methanocaldococcus infernus
           ME]
          Length = 666

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 194/355 (54%), Gaps = 16/355 (4%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P+ L +++  E   F +F  ALDE+++K  +    ++ K+K +    K   I   Q   
Sbjct: 257 VPIELRKYKDYEKRYFNSFYEALDEYFAKFLTSVEIKKEKSKLEKEIEKQESILRRQLET 316

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
           +   ++EV ++    +LI  N + V+  + A+RVA  ++  WE++ R+++E ++  +P+ 
Sbjct: 317 LKAYEEEVRKNQIKGDLIYSNYQLVEEILNAIRVA-KDKKGWEEVKRVIRENKE--HPII 373

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
            LI+ +  ++  + + LS++LD   +E       +V +D+  S   NA  +Y   KK +S
Sbjct: 374 KLIEGVNEKKGEIIVRLSSDLDGKIEE-------RVVLDIRKSTFENAESYYNKAKKFKS 426

Query: 516 KQE--KTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVIS 573
           K E  K     SK      KK R   ++EK        ++  W+EKF W + + N+LVI+
Sbjct: 427 KIEGIKKAIEMSKKKLEELKKKRDVEIEEKKALKKKVKKERKWYEKFKWTVIN-NFLVIA 485

Query: 574 GRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQ 633
           G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK +  E  V   TL +   F+V HS+
Sbjct: 486 GKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTNGRE--VDEETLMEVAKFSVSHSK 543

Query: 634 AWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           AW         +WV P Q+SK A +GEYL  G+F+IRGK+N++   PL +G G+L
Sbjct: 544 AWKLGYGALDTYWVKPDQISKRAESGEYLKRGAFVIRGKRNYIRNVPLELGIGVL 598



 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 56/96 (58%)

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           T+Y R+K   P  F + LRK+++  +L  + Q+ +DRI+L +F +G   + +I EL+  G
Sbjct: 64  TSYEREKPKLPPSFAMLLRKYLKNAKLLRIDQVEFDRILLLEFSIGEKKYKIIAELFKDG 123

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           NI+  D E  ++  LR     ++ +A   ++++P +
Sbjct: 124 NIIFLDEEDNIIAPLRVEVFSNRKIAPKEKYQFPPQ 159


>gi|34364937|emb|CAE45889.1| hypothetical protein [Homo sapiens]
          Length = 505

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 86/111 (77%), Gaps = 1/111 (0%)

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYL 661
           A+S VIKN   E P+PP TL + G   +C+S AWD++++TSAWWVY HQVSKTAPTGEYL
Sbjct: 1   ATSCVIKNPTGE-PIPPRTLTEVGTMALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYL 59

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           T GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 60  TTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDEDME 110


>gi|452210388|ref|YP_007490502.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
 gi|452100290|gb|AGF97230.1| hypothetical protein MmTuc01_1891 [Methanosarcina mazei Tuc01]
          Length = 775

 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 113/390 (28%), Positives = 191/390 (48%), Gaps = 22/390 (5%)

Query: 320 HPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIE-SQRAEQQHKAKE 378
           H   E     + +D   P  LN++   E   F++F+ ALDEF+ K    Q AE +   K+
Sbjct: 269 HIKQEINGKMETFD-VVPFDLNRYSEYEKEYFDSFNTALDEFFGKKALEQVAEVKEAEKK 327

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
           +       +  M QE  +   ++E++++  +AE +  N + ++     +  A A   SW+
Sbjct: 328 EKTLGVFERRLMQQEESLAKFEKEIEKNNALAETVYANYQIIEELFSVLNGARAKGYSWD 387

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALS 498
           ++  ++K+ +K   P A  I  +  +   +++    NLD           + + +D+  +
Sbjct: 388 EIRSILKQAKKT-VPAAQTITNIDQKTGTVTV----NLDG----------KSINLDIRKT 432

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
              NA+ +YE  KK   K++  I A     KA EKK   +  +      +   RK HW++
Sbjct: 433 VPQNAQEYYEKVKKFTKKKDGAIRAIEDTKKAMEKKAATKSAKAGR--KLQASRKKHWYD 490

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           +F WF+SS+ +LV+ GRDA  NE I K+YM K D+  H    GA  TV+K    E  VP 
Sbjct: 491 RFRWFVSSDGFLVVGGRDADTNEEIFKKYMEKRDIVFHTQTPGAPLTVVKTGGKE--VPD 548

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
            TL +   F V +S  W +   +   +W+   QV+KT  +GEYL  G+F+IRG++N+   
Sbjct: 549 STLQEVSQFAVSYSSLWKAGQFSGDCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKD 608

Query: 678 HPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
            PL +  GL  + +   +G   +  R  G+
Sbjct: 609 VPLGIAVGLELKGETRIIGGPASAVRKHGD 638



 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 68/139 (48%), Gaps = 11/139 (7%)

Query: 6   MNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           M++ADVAA V  L    R +I  +   +Y  + +     L     V   G      L++E
Sbjct: 1   MSSADVAAVVAELSAGPRSIIDAKIGKIYQPASEEIRINLY----VFHQGRDN---LVIE 53

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T + R     P  F + LRK++   R+  V Q  +DRI+            +I
Sbjct: 54  AGKRLHMTKHIRPSPTLPQAFPMLLRKYLMGGRIVSVEQHDFDRIVKIGIERAGVRSTLI 113

Query: 122 LELYAQGNILLTDSEFTVL 140
           +EL+A+GN+L+ DSE  ++
Sbjct: 114 VELFARGNVLIVDSENKII 132


>gi|395506524|ref|XP_003757582.1| PREDICTED: uncharacterized protein LOC100920250 [Sarcophilus
           harrisii]
          Length = 231

 Score =  156 bits (395), Expect = 5e-35,   Method: Composition-based stats.
 Identities = 70/113 (61%), Positives = 86/113 (76%), Gaps = 2/113 (1%)

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
           H RK   FEKF WFISSENYL+I GRD QQNEMIVKRY++ GD+YVHADLHGA+S VIKN
Sbjct: 46  HQRKCG-FEKFLWFISSENYLIIGGRDQQQNEMIVKRYLTPGDIYVHADLHGATSCVIKN 104

Query: 610 HRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              E P+PP TL +AG   +C+S AWD++++TSAWWVY HQ+      G+ L+
Sbjct: 105 PTGE-PIPPRTLTEAGTMALCYSAAWDARVITSAWWVYHHQLRSAFRVGDSLS 156


>gi|448339346|ref|ZP_21528374.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
 gi|445620575|gb|ELY74071.1| Fibronectin-binding A domain protein [Natrinema pallidum DSM 3751]
          Length = 721

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 166/713 (23%), Positives = 280/713 (39%), Gaps = 125/713 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRDF-------DRGRMELLLEV 56

Query: 63  G--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFIFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT      
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP---- 166

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
             LT S+E   +E D  + D                                        
Sbjct: 167 --LTVSREAFDHEMDDSDTD---------------------------------------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +   L   L +G   +E +    G+   M + + ++   + +   +  +A       D+
Sbjct: 185 -VVRTLATQLNFGGLYAEEVCTRAGVEKGMDIDDADEAVYDRLYETIERLA------LDI 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +G+  P  Y+   ++    D   T  G    + D   P  L +    +   +++F +AL
Sbjct: 238 RNGNFDPRLYLETDDEDDDADGDGTPEGGDAHVVD-VTPFPLEEHEDLDGEPYDSFLSAL 296

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE +   + AEL+ 
Sbjct: 297 DDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAESLREQAELLY 356

Query: 414 -EYNL-EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
            EY L +D+ + IL  R       SW+D+    +E  + G   A  +    ++ +     
Sbjct: 357 AEYGLVDDILSTILGAR---KRDRSWDDIRDRFEEGAEQGIDAAEAV----VDVDGSDGT 409

Query: 472 LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAA 531
           ++ ++D+          E++ +D       NA R Y   K+ E K+E  + A        
Sbjct: 410 VTVDIDD----------ERISLDAQQGVEQNADRLYTEAKRVEEKKEGALAAIENTRDDL 459

Query: 532 EKKTRLQILQEK-----------------------TVANISHMRKVHWFEKFNWFISSEN 568
           E   R +   E                        +  +I       WF++F WF +S+ 
Sbjct: 460 EDAKRRRDEWEDDESGGADEAEADEDEEDSQRDWLSEPSIPIRENEPWFDRFRWFHTSDG 519

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLN 622
           YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ 
Sbjct: 520 YLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIDLPESSVA 579

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           +A  F V +S  W D +     + V   QVSKT  +GEYL  G F IRG + +
Sbjct: 580 EAAQFAVSYSSVWKDGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTY 632


>gi|448329966|ref|ZP_21519260.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
           10478]
 gi|445613154|gb|ELY66864.1| Fibronectin-binding A domain protein [Natrinema versiforme JCM
           10478]
          Length = 720

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 165/714 (23%), Positives = 292/714 (40%), Gaps = 104/714 (14%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT        
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP------ 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S+E   +E D  + D                                         +
Sbjct: 167 LTVSREAFDHEMDDSDTD-----------------------------------------V 185

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
              L   L +G   +E + +  G+   M   +++  +++  + L   + +      D+ +
Sbjct: 186 VRTLATQLNFGGLYAEELCVRAGVEKGM---DIDDADEDVYERLYETIERL---ALDIRN 239

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G+  P  Y+   ++        +E   +  +  +  P  L +    +   +++F +ALD+
Sbjct: 240 GNFDPRLYLERDDEEADDGEGESEDADANVV--DVTPFPLEEHDDLDGEAYDSFLSALDD 297

Query: 361 FYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI--E 414
           ++ ++E    E+     +   F     K  +I   Q+  +   +QE +   + AEL+  E
Sbjct: 298 YFFRLELAEEEESDPTDQRPDFESEIAKQERIIEQQQGAIEGFEQEAEELREQAELLYAE 357

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG-NPVAGLID--------KLYLER 465
           Y L  VD  +  ++ A     SW+++    +E  + G +    ++D         + ++ 
Sbjct: 358 YGL--VDDILSTIQGAREQDRSWDEIRERFEEGAEQGIDAAEAVVDVDGSDGTVTVDIDG 415

Query: 466 NCMSLL----LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTI 521
             + L+    +  N D +  E K +  +K   + AL+A  N R   E  K++  + E   
Sbjct: 416 ERIGLVAGRGVEQNADRLYTEAKRVEEKK---EGALAAIENTREDLEEAKRRRDEWEADE 472

Query: 522 TAHSKAFKAAEKKTRLQ--ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
           +  +   ++ E +   Q   L E +   I       WF++F WF +S++YLVI GR+A Q
Sbjct: 473 SGPAAETESDEDEEETQRDWLSEPS---IPIRENEPWFDRFRWFQTSDDYLVIGGRNADQ 529

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQ 633
           NE +VK+Y+  GD   H   HG   TV+K   P +       +P  ++ +A  F V ++ 
Sbjct: 530 NEELVKKYLEPGDKVFHTQAHGGPVTVLKATDPSEASSSDIELPESSIEEAAQFAVSYAS 589

Query: 634 AW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            W D +     + V   QVSKT  +GEYL  G F IRG + +    P+    G+
Sbjct: 590 VWKDGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 643


>gi|448439536|ref|ZP_21588100.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
           1137]
 gi|445691070|gb|ELZ43265.1| Fibronectin-binding A domain protein [Halorubrum saccharovorum DSM
           1137]
          Length = 733

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 181/741 (24%), Positives = 296/741 (39%), Gaps = 144/741 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+ A V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLGALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHVADPDNVSDAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L++ R   + VA  S++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGALQTVRLKSRTVAPGSQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +     +     + E     D+ ++ L  A+ +  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRASVEKETPIEEAT---DDQLRALHEALERIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD+ P  Y    ++  G      E+        +  P  L++      V F++F+AA+
Sbjct: 239 -SGDVDPRVYEEELDEGDGDGGEDDEADDRDPRVVDVTPFPLSEHEGLPSVGFDSFNAAV 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAA--------FHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           DE++ ++E + ++      + +A          K  +I   Q+  +   +++ +   + A
Sbjct: 298 DEYFYRLEHEESDAGEAPTDASASRPDFEEEIAKQERIIEQQKGAIEGFEEQAEAERERA 357

Query: 411 ELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNC 467
           EL+  EY+L  VD  +  V+ A  N + W+++A  +    + G P A  ++D        
Sbjct: 358 ELLYAEYDL--VDEVLSTVQEARENDVPWDEIAETLDAGAERGIPAAEAVVD-------- 407

Query: 468 MSLLLSNNLDEMDDEEKTLPVE------KVEVDLALSAHANARRWYELKKKQESKQEKTI 521
                      +D  E T+ VE      +VE+D +     NA R Y+  K+ E K+E  +
Sbjct: 408 -----------VDGGEGTVTVELGEDDTRVELDASAGVEVNADRLYQEAKRIEGKKEGAM 456

Query: 522 TAHSKAFKAAEK-KTRLQILQEKTVAN-----------------------------ISHM 551
            A     +  E  K R    + K  A+                             I   
Sbjct: 457 EAIESTRQDLEAVKERKAEWKAKEAADDEEGGSDAGGGEGDEGEEEYETDWLSRSSIPIR 516

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
               WFE+F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   
Sbjct: 517 SPDDWFERFRWFRTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAG 576

Query: 612 PEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           P +   P+     TL +A  F V +S  W D +    A+ V P QVSKT  +GEY+  GS
Sbjct: 577 PSESADPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGS 636

Query: 666 FMIRGKKNFLPPHPLIMGFGL 686
           F+IRG + +    P  +  G+
Sbjct: 637 FVIRGDRTYFEDVPCRVAVGV 657


>gi|300706574|ref|XP_002995542.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
 gi|239604689|gb|EEQ81871.1| hypothetical protein NCER_101531 [Nosema ceranae BRL01]
          Length = 644

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 101/340 (29%), Positives = 176/340 (51%), Gaps = 39/340 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F++F+ A++ F+     ++ E+  K         L KI   Q   +  L+  V      A
Sbjct: 239 FQSFNEAVEFFFMDRRKKKIEKVDK---------LQKIRNKQYEHIKELENMVKDMTMKA 289

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKL-YLERNCMS 469
           +LI  N + V+  +      + N+++W D  +  ++E+  GN +A +I K  +  ++C+ 
Sbjct: 290 DLILKNADIVENVLDIHNYVIKNKLNWNDFLKFKEDEKSKGNEIADIIVKSDFKNKSCI- 348

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
                 +D  D+E+       +E+    S H+NA+ ++E +KK E K  KT     KA  
Sbjct: 349 ------IDLKDNEDSHF----IEISFDKSLHSNAQNYFEKRKKFEEKILKT----EKAID 394

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
             + KT  +  +EK    I   R V WFEKFN+  +++  LVI G++AQQNE+IVK++++
Sbjct: 395 TIKIKTYTK--EEK----IKIQRSVFWFEKFNFCFTTDKKLVIGGKNAQQNEIIVKKHLT 448

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
              +Y H +  G SS + +          + +++     +C+S  W+  +V+  ++V   
Sbjct: 449 PNHLYFHTESSGGSSVISE--------ADVNIDEVALVALCNSACWEVNVVSPVFYVKSD 500

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           QVSKT PTG++L  GSF+IRG K ++  + L  G GLLF+
Sbjct: 501 QVSKTPPTGQFLPKGSFLIRGTKTYVNVYKLEYGVGLLFK 540



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 14/148 (9%)

Query: 53  SEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
           S K +LL+E G+R+H T+ A D     S F   LRK  R  ++ D+ Q+G+DR+I+F+  
Sbjct: 42  SSKDILLIEPGIRIHLTSEADD---GISHFCNILRKKARRDKVVDIYQVGFDRVIVFE-- 96

Query: 113 LGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY---PTEICRVF 169
             ++   +++E ++ GN+ + D    ++ + R  ++ D    I+   +Y   P E    +
Sbjct: 97  --LSRQKIVIEFFSGGNVFILDEFDKIVEVFRVVKELD----IIKNTQYVFNPAEFDFSW 150

Query: 170 ERTTASKLHAALTSSKEPDANEPDKVNE 197
           E     +    L   KE   N   K+N+
Sbjct: 151 ENFCNMEFKEFLPFEKELVDNLIKKINK 178


>gi|126466189|ref|YP_001041298.1| hypothetical protein Smar_1299 [Staphylothermus marinus F1]
 gi|126015012|gb|ABN70390.1| protein of unknown function DUF814 [Staphylothermus marinus F1]
          Length = 663

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 123/441 (27%), Positives = 209/441 (47%), Gaps = 64/441 (14%)

Query: 243 VLGEALGYG-PA-LSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
           V G   G+G P  ++E +I   GL    K  ++N +E   +  L+     FE  + +V+ 
Sbjct: 179 VRGIVKGWGLPGYIAEELIYRAGLYEK-KNYKINMIEKTDLYSLIYI---FEKIINEVLE 234

Query: 301 GDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDE 360
           G    +GY++  N             +   IY  + P L  +       K++  +  LD 
Sbjct: 235 G----KGYLVKLN-------------NEPHIYTSYEPKLYKELYELNVEKYDELNHVLDI 277

Query: 361 FYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDV 420
           +Y + E +   +Q   K+     K+ K   +Q+  +    +E ++  K +E +  N  +V
Sbjct: 278 YYGEYEKRIYYEQKTTKQQMLIEKIKKNIEEQQKIIKKYIEESEKYRKFSETLVTNY-NV 336

Query: 421 DAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMD 480
              IL           WE +                 I + Y ++  + +       ++D
Sbjct: 337 LEKILKCVHETRRTSGWEKIVENCPN-----------IVEFYKDKGIVIV-------KLD 378

Query: 481 DEEKTLPVEKVEVDLALSAHANARRWYELKK---KQESKQEKTITAHSKAFKAAEKKTRL 537
           D E       + +D+ L    N  R+ +L     K+  + E+ +    K+ + A  K   
Sbjct: 379 DYE-------IPIDIRLDTWNNILRYKKLSGELLKKAKRAEEALRELEKSLEEAVNKK-- 429

Query: 538 QILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
           Q++++KT   I   +   W+E+F+W I+SE +LVI+GRDA QNE+IVK+YM   D+++HA
Sbjct: 430 QLIEKKTEIGI---KPRLWYERFHWMITSEGFLVIAGRDADQNELIVKKYMEPHDIFLHA 486

Query: 598 DLHGASSTVIKNHR--PEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKT 654
           D+HGA +TVIK H   P Q     ++ +A     C+S+AW+        +WV+  QVSKT
Sbjct: 487 DIHGAPATVIKTHNRMPSQK----SIEEAAVIAACYSKAWNEGFGAIDVFWVHASQVSKT 542

Query: 655 APTGEYLTVGSFMIRGKKNFL 675
            P+GEYL+ G+FMI GKKN++
Sbjct: 543 PPSGEYLSKGAFMIYGKKNYV 563



 Score = 47.0 bits (110), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 78/163 (47%), Gaps = 10/163 (6%)

Query: 1   MVKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M+K  M+  D+ +      +++IG    N+Y  +   ++ K+         G+S    L 
Sbjct: 1   MIKKAMDILDIYSWTNNFGKQVIGCFIENIY-FTGFYWLLKIRCPG----KGKS---YLK 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E  +RLH +     +K     F+  +RK+IR  R+ DV+QLG++RII          + 
Sbjct: 53  IEPSIRLHVSNIDPLEKKIDK-FSSFMRKYIRGARIVDVKQLGWERIIELHVKSRNKKYI 111

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           +I E+  +G ++LT+  + +L   R     D+ +   S++  P
Sbjct: 112 LINEIMPRGFLVLTNETYNILYANRFQELRDRIIKRGSKYTPP 154


>gi|222479900|ref|YP_002566137.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
           49239]
 gi|222452802|gb|ACM57067.1| Fibronectin-binding A domain protein [Halorubrum lacusprofundi ATCC
           49239]
          Length = 733

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 178/741 (24%), Positives = 308/741 (41%), Gaps = 144/741 (19%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHVADPENVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ QGN+   D    V+  L++ R   + VA  +++ YP           AS+L 
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLQTVRLKSRTVAPGAQYEYP-----------ASRL- 164

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                                    N    +LGG K        ++  ++ +D  R    
Sbjct: 165 -------------------------NPLDVSLGGFK--------RHMRESDSDVVR---- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           TL T     L  G   +E +    G+    K + ++ + D+ ++ L  A+ +  + L+  
Sbjct: 188 TLAT----QLNLGGLYAEEVCTRAGV---EKETPIDDVTDDQLRALHEALERIGERLR-- 238

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            SGD+ P  Y    +    +D  P       ++ D   P  L++      V F++F+AA+
Sbjct: 239 -SGDVDPRVYEEELSDDEAEDRDP-------RVVD-VTPFPLSEHEGLPSVGFDSFNAAV 289

Query: 359 DEFYSKIESQRAEQQHKAKEDAA---------FHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           DE++ +++   +E+  +A  DA+           K  +I   Q+  +   +++ +   + 
Sbjct: 290 DEYFYRLDRDGSEE-GEAPADASPSRPDFEEEIGKQERIVEQQQGAIEGFEEQAEAERER 348

Query: 410 AELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           AEL+  EY+L  VD  +  V+ A    + W+++A  +    + G P A  +  +      
Sbjct: 349 AELLYAEYDL--VDEVLSTVQEAREAEVPWDEIAETLDAGAEQGIPAAETVVDVDGGEGT 406

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +++ L     E DD E T    ++E+D +     NA R Y+  K+ E K+E  +    +A
Sbjct: 407 VTVELRGGDGEDDDGETT----RIELDASAGVEVNADRLYQEAKRIEGKKEGAM----EA 458

Query: 528 FKAAEKKTRLQILQEK------------------------------------TVANISHM 551
            K+   +  L+ ++E+                                    + ++I   
Sbjct: 459 IKST--RAELEAVKERKAEWEAKEAAADETAGDGADDGEEEEDGEEYQTDWLSRSSIPIR 516

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
               W+++F WF +S  YLVI GR+A QNE +VK+YM K D + H   HG   T++K   
Sbjct: 517 SPDDWYDRFRWFYTSTGYLVIGGRNADQNEELVKKYMGKHDRFFHTQAHGGPVTLLKAAG 576

Query: 612 PEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           P +   P+     TL +   F V +S  W D +    A+ V P QVSKT  +GEY+  GS
Sbjct: 577 PSESADPVDFSEETLREVAQFAVSYSSDWKDGRGAGDAYMVEPDQVSKTPESGEYIEKGS 636

Query: 666 FMIRGKKNFLPPHPLIMGFGL 686
           F+IRG + +    P  +  G+
Sbjct: 637 FVIRGDRTYFEDVPCRIAVGV 657


>gi|448313587|ref|ZP_21503301.1| fibronectin-binding A domain-containing protein [Natronolimnobius
           innermongolicus JCM 12255]
 gi|445597955|gb|ELY52026.1| fibronectin-binding A domain-containing protein [Natronolimnobius
           innermongolicus JCM 12255]
          Length = 723

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 164/680 (24%), Positives = 274/680 (40%), Gaps = 130/680 (19%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDSLETVRLKSRTVVPGSRYEFPE------- 161

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               S+++  LT S+E                               +FD  +    +  
Sbjct: 162 ----SRINP-LTVSRE-------------------------------AFD--REMEDSDT 183

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLED------NAIQVL 284
           D  R    TL T     L +G   +E +    G+   M + + +  ED       AI+ L
Sbjct: 184 DVVR----TLAT----QLNFGGLYAEEVCTRAGVEKAMDIEDAD--EDVYDRLYGAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSS---TQIYDEFCPLLLN 341
            L          D+ +G+  P  Y+   +   G D    ESG+      + D   P  L 
Sbjct: 234 AL----------DLRNGNFEPRLYVDDGDDENGDDSEDDESGADEGPAPVVDA-TPFPLE 282

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE----DAAFHKLNKIHMDQENRVH 397
           +        +++F AALD+++ ++E    E+     +    D    K  +I   QE  + 
Sbjct: 283 EHVELASEPYDSFLAALDDYFHRLELAEEEEPDPTDQRPDFDEQIAKHERIIEQQEGAIE 342

Query: 398 TLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA 455
             ++E D     AEL+  EY L  VD  +  VR A      W+++    +E  + G   A
Sbjct: 343 GFEREADELRDQAELLYAEYGL--VDEILSTVRQAREQDRPWDEIEERFEEGAERGIEAA 400

Query: 456 GLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES 515
             +  +      +++ +                E++E+        NA R Y   K+ E 
Sbjct: 401 EAVVGVDGSEGIVTVSVDG--------------ERIELVAQQGVEQNADRLYTEAKRVEE 446

Query: 516 KQEKTITA----HSKAFKAAEKKTRLQILQEK------------------TVANISHMRK 553
           K+E  + A      +  +  +++ R +    +                  + +++     
Sbjct: 447 KKEGALAAIEDTREELEEIVDRRDRWEAEDAETDEADEADEEEGEDRDWLSESSVPIREN 506

Query: 554 VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPE 613
             WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P 
Sbjct: 507 EPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPS 566

Query: 614 QP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSF 666
           +       +P  ++ +A  F V ++  W D +     + V   QV+KT  +GEYL  G F
Sbjct: 567 EASSSDIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGF 626

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
            IRG + +    P+ +  G+
Sbjct: 627 AIRGDRTYYDDTPVGVAVGI 646


>gi|297527127|ref|YP_003669151.1| hypothetical protein Shell_1151 [Staphylothermus hellenicus DSM
           12710]
 gi|297256043|gb|ADI32252.1| protein of unknown function DUF814 [Staphylothermus hellenicus DSM
           12710]
          Length = 663

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 180/356 (50%), Gaps = 49/356 (13%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
            IY  + P L  +       K++  +  LD +YS+ E +   +Q   K+     K+ K +
Sbjct: 247 HIYTSYEPKLYKELYDVSVEKYDKLNHVLDIYYSEYEKRIYYEQRTIKQRILIEKIKK-N 305

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
           +D++ ++  +K+ ++ S K  E                R  + N    E +   V + RK
Sbjct: 306 IDKQQKI--IKKYIEESEKYKEF--------------SRTLVTNYNLLEKILECVNKTRK 349

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARR 505
                    DK+    NC       N+ +   ++ T+ V+    ++ +D+ L+A  N  R
Sbjct: 350 TSG-----WDKIV--ENC------PNIVKYYKDKGTVIVKFNEYEIPIDIRLNAWNNILR 396

Query: 506 WYELKK---KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNW 562
           + +L     K+  K E+ +    ++ + A  K   Q++Q +T   I   +   W+E+F+W
Sbjct: 397 YKKLSGELLKKAKKAEEALRELERSLEEAVNKK--QLIQRRTEIGI---KPRLWYERFHW 451

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--PEQPVPPLT 620
            I+SE +LVI+GRD  QNE+IVK+YM   D+++HAD+HGA +TVIK H   P Q     +
Sbjct: 452 MITSEGFLVIAGRDIDQNELIVKKYMEPHDIFLHADIHGAPATVIKTHNRMPSQK----S 507

Query: 621 LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           + +A     C+S+AW         +WVY +QVSKT P+GEYL  G+FMI GKKN++
Sbjct: 508 IKEAAVIAACYSKAWKEGFGAIDVFWVYANQVSKTPPSGEYLPKGAFMIYGKKNYV 563



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 74/141 (52%), Gaps = 10/141 (7%)

Query: 1   MVKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M+K  M+  DV +      +++IG    N+Y  +   ++ K+  S      G+S    L 
Sbjct: 1   MIKKSMDILDVYSWTNNFGKQIIGCFIENIY-FTGFYWLIKIRCSG----KGKS---YLK 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E  +RLH +     +K     F+  +RKHIR  R+ DV+QLG++RII        N + 
Sbjct: 53  IEPSIRLHISNIEPLEKKIDK-FSSFMRKHIRGARIIDVKQLGWERIIELHVKSRKNEYI 111

Query: 120 VILELYAQGNILLTDSEFTVL 140
           +I E+  +G ++LT+ ++++L
Sbjct: 112 LINEILPRGFLVLTNEKYSIL 132


>gi|269862824|ref|XP_002650989.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065304|gb|EED43067.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 506

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 168/347 (48%), Gaps = 46/347 (13%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 176 MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 226

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 227 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 286

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 287 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 324

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 325 -----KIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 379

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 380 NKYMEDRDLYFHCDVKGASSVVCKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 433

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 434 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 480


>gi|269862592|ref|XP_002650899.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065446|gb|EED43157.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 480

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 168/347 (48%), Gaps = 46/347 (13%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 97  MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 147

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 148 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 207

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 208 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 245

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 246 -----KIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 300

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 301 NKYMEDRDLYFHCDVKGASSVVCKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 354

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 355 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 401


>gi|448303302|ref|ZP_21493251.1| fibronectin-binding A domain-containing protein [Natronorubrum
           sulfidifaciens JCM 14089]
 gi|445593087|gb|ELY47265.1| fibronectin-binding A domain-containing protein [Natronorubrum
           sulfidifaciens JCM 14089]
          Length = 716

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 162/668 (24%), Positives = 279/668 (41%), Gaps = 113/668 (16%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RIELILEVGEIKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRLIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPD------- 161

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               S+L+  LT S+E                               +FDL    +    
Sbjct: 162 ----SRLNP-LTVSRE-------------------------------AFDLEMEDSDTD- 184

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDN------AIQVL 284
                    +   L   L +G   +E I    G+   M +++ +  ED+      AI+ L
Sbjct: 185 ---------IVRTLATQLNFGGLYAEEICTRAGIEKGMDIADAD--EDDYDRLYEAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR 344
            L          D+ + +  P  Y+         D     + S+  +  +  P  L +  
Sbjct: 234 AL----------DLRNANFEPRLYLEDGEDGDDDDESDDSTESARVV--DATPFPLEEHA 281

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF----HKLNKIHMDQENRVHTLK 400
                 +++F AALD+++ ++E    E+     +   F     K  +I   Q+  +   +
Sbjct: 282 ELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIIDQQQGAIEGFE 341

Query: 401 QEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKE--ER--KAGNPV 454
           Q+ D   + AEL+  EY L  VD  +  ++ A A    W+++    +E  ER  +A   V
Sbjct: 342 QQADDLREQAELLYAEYGL--VDDILSTIQDARAQDRPWDEIEARFEEGAERGIEAAEAV 399

Query: 455 AGL-----IDKLYLERNCMSLL----LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
            G+     I  + ++ + + L+    +  N D +  E K +  +K   + AL+A  + R 
Sbjct: 400 VGIDSSEGIVTVDIDGDRIDLVAHDGVEQNADRLYTEAKRVAEKK---EGALAAIEDTRE 456

Query: 506 WYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
             E  K++  + +       +A     ++T         + +I       WF++F WF +
Sbjct: 457 DLEDAKRRRDEWDADDEGDEQADDEDTEETNWL-----EMPSIPIRENEPWFDRFRWFHT 511

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPL 619
           S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  
Sbjct: 512 SDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPDS 571

Query: 620 TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
           ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG++ +    
Sbjct: 572 SIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGERTYHRDT 631

Query: 679 PLIMGFGL 686
           P+ +  G+
Sbjct: 632 PVGVAVGI 639


>gi|433431126|ref|ZP_20407596.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
 gi|448568141|ref|ZP_21637718.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
 gi|448601017|ref|ZP_21656300.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
 gi|432194170|gb|ELK50822.1| hypothetical protein D320_16320 [Haloferax sp. BAB2207]
 gi|445727091|gb|ELZ78705.1| hypothetical protein C456_00247 [Haloferax lucentense DSM 14919]
 gi|445734620|gb|ELZ86178.1| hypothetical protein C452_18184 [Haloferax alexandrinus JCM 10717]
          Length = 702

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 173/365 (47%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEQQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+A  ++E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLEEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL       D+++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTL-------DVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +   AA KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELAAVKKRRDEWEADDGDDDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL 
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLR 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|387592702|gb|EIJ87726.1| hypothetical protein NEQG_02273 [Nematocida parisii ERTm3]
          Length = 700

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 112/350 (32%), Positives = 169/350 (48%), Gaps = 30/350 (8%)

Query: 358 LDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNL 417
            D F S +++  A Q+    E A+  K  KI   QE  +H    E+      AEL+  N 
Sbjct: 269 FDGFGSAMDAAFAVQE--ITETASQKKHRKIREAQERDLHKKIDEMTILKTKAELLSENQ 326

Query: 418 EDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLD 477
            +V   I  +  A A  +S ++  R  KE  K  NP A +I K    +  + L++   L 
Sbjct: 327 AEVKNVISVIEAAHAASLSEKEFERF-KESEKDKNPTAKIIKKANFGKKTVDLIIDKQL- 384

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
                        V +D   S        Y+  KK E K +KT  A        E +T+ 
Sbjct: 385 -------------VTIDYTASIFEQINALYQKAKKIEEKLKKTRVA------LEESRTK- 424

Query: 538 QILQEKTVANISHM-RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVH 596
           +I   K +  I  + R V WFEKF W I+ ++ L+++GRD++QNE++VK+++   D Y H
Sbjct: 425 EIEVTKRIEKIEKIDRNVFWFEKFRWLITKDSDLILAGRDSKQNEILVKKHLLDTDYYFH 484

Query: 597 ADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAP 656
           AD+ G SS ++  +         T   A    +  S+AW++  +T  + V   QVSKTAP
Sbjct: 485 ADVRGGSSVIVGENATVH-----TKEVAAAMALHLSKAWENSTITEVYCVRGEQVSKTAP 539

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
            GEYLT GSFMI GKK F  P  L  GF ++++L +  +    + R+V G
Sbjct: 540 AGEYLTHGSFMITGKKEFYHPTKLEYGFSIMYKLKDKEIEISDDNRQVSG 589



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 50/144 (34%), Positives = 72/144 (50%), Gaps = 14/144 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R++  D+ A V  L ++ G     VY  S K  + K  N           K  LL++
Sbjct: 1   MKGRLSWLDIRAGVNELEKINGCHIKTVYSTSKKAILIKFSN-----------KEQLLID 49

Query: 62  SGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
              + H T    +K N TP    L LR+ I   R+E V QLG+DRI + +   G     +
Sbjct: 50  PPSKFHLTHKNYEKVNLTP--LALYLRREISNYRVEKVTQLGFDRIAVIKIRSGKGCRLL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+E+YA GNI+LTD E  ++ LLR
Sbjct: 108 IIEMYANGNIILTDEELNIINLLR 131


>gi|292656996|ref|YP_003536893.1| hypothetical protein HVO_2883 [Haloferax volcanii DS2]
 gi|448293595|ref|ZP_21483700.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
 gi|291371020|gb|ADE03247.1| conserved protein [Haloferax volcanii DS2]
 gi|445570456|gb|ELY25019.1| hypothetical protein C498_17603 [Haloferax volcanii DS2]
          Length = 702

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 173/365 (47%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEQQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+A  ++E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIAATLEEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL       D+++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTL-------DVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +   AA KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELAAVKKRRDEWEADDGDEDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL 
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLR 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|20089538|ref|NP_615613.1| hypothetical protein MA0651 [Methanosarcina acetivorans C2A]
 gi|19914450|gb|AAM04093.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 788

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 107/394 (27%), Positives = 190/394 (48%), Gaps = 30/394 (7%)

Query: 320 HPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIE-SQRAEQQHKAKE 378
           H   E     + +D   P  L ++   E   F++F+ ALDEF+ K    Q AE +   K+
Sbjct: 271 HVKKEINGKIETFD-VVPFDLIRYSEFEKEYFDSFNTALDEFFGKKALEQVAEVKAAEKK 329

Query: 379 DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWE 438
           +       +  + QE  +    +E++++  +AE++  N + ++     +  A A   SW+
Sbjct: 330 EKTLGVYERRLLQQEESLAKFGKEIEKNNTLAEIVYANYQLIEELFSVLNGARAKGYSWD 389

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVD 494
           ++  ++K+ +K                   ++  +  +  +D +  T+ V+     V +D
Sbjct: 390 EIRSILKQAKK-------------------TVPAAQKITNIDQKTGTVTVDLDGRNVNLD 430

Query: 495 LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKV 554
           +  +   NA+ +YE  KK   K++  + A  +  KA EKK   +  +      +   RK 
Sbjct: 431 IRKTVPQNAQEYYEKVKKFSKKRDGALKAIEETKKAMEKKAASKAAKAGR--KLQAFRKK 488

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQ 614
           HW+++F WF+SS+ +LV+ GRDA  NE I K+Y+ K D+  H    GA  TV+K    E 
Sbjct: 489 HWYDRFRWFVSSDGFLVVGGRDADTNEEIFKKYLEKRDIVFHTQTPGAPLTVVKTGGEE- 547

Query: 615 PVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKN 673
            +P  TL +   F V +S  W S   +   +W+   QV+KT  +GEYL  G+F+IRG++N
Sbjct: 548 -IPESTLLEVARFAVSYSSLWKSGQFSGDCYWIKAEQVTKTPESGEYLKKGAFVIRGERN 606

Query: 674 FLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           +    PL +  GL  + +   +G   +  R  G+
Sbjct: 607 YFKDIPLGVAVGLELKGETRVIGGPASAVRKHGD 640



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 68/139 (48%), Gaps = 11/139 (7%)

Query: 6   MNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           M++ADVAA V  L    R +I  +   +Y  + +     L     V   G      L++E
Sbjct: 5   MSSADVAAVVAELSAGPRSIIDAKIGKIYQPASEEIRINLY----VFHQGRDN---LVIE 57

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           +G RLH T Y R     P  F + LRK++   R+  V Q  +DRII            +I
Sbjct: 58  AGKRLHMTKYVRASPTLPQAFPMLLRKYLMGGRIISVEQHDFDRIIKIGIERAGVRSTLI 117

Query: 122 LELYAQGNILLTDSEFTVL 140
           +EL+A+GN+L+ DSE  ++
Sbjct: 118 VELFARGNVLIVDSENKII 136


>gi|284166116|ref|YP_003404395.1| fibronectin-binding A domain-containing protein [Haloterrigena
           turkmenica DSM 5511]
 gi|284015771|gb|ADB61722.1| Fibronectin-binding A domain protein [Haloterrigena turkmenica DSM
           5511]
          Length = 723

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 170/684 (24%), Positives = 274/684 (40%), Gaps = 138/684 (20%)

Query: 55  KVLLLMESG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGETKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDS------ 162

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT        LT S+E                               +FD  +    +  
Sbjct: 163 RTNP------LTVSRE-------------------------------AFD--REMEDSDT 183

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLED------NAIQVL 284
           D  R    TL T     L +G   +E I    G+   M ++E +  ED       AI+ L
Sbjct: 184 DVVR----TLAT----QLNFGGLYAEEICTRAGVEKAMDIAEAD--EDVYDRIYGAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESG---SSTQIYDEFCPLLLN 341
            L          D+ +G+  P  Y+   +    +     E+G   SS ++ D   P  L 
Sbjct: 234 AL----------DLRNGNFDPRLYVADDDGDEDESESGDENGDDSSSDRVVDA-TPFPLE 282

Query: 342 QFRSREFVKFETFDAALDEFYSKIESQRAEQQ-----HKAKEDAAFHKLNKIHMDQENRV 396
           +        +++F AALD+++ ++E    E++      +   +    K  +I   Q   +
Sbjct: 283 EHVELASEPYDSFLAALDDYFYRLELADDEEETDPTTQRPDFEEEIAKYERIIEQQRGAI 342

Query: 397 HTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
              +QE D   + AEL+  EY L  VD  +  V+ A A    W+++     EER      
Sbjct: 343 EGFEQEADALREQAELLYAEYGL--VDDILSTVQEARAQDRPWDEI-----EER------ 389

Query: 455 AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELK 510
                  + E     +  +  +  +D  E T+ VE    ++++        NA R Y   
Sbjct: 390 -------FAEGADRGIAAAEAVVNVDGSEGTVTVELDGERIDLVAKQGVEQNADRLYTEA 442

Query: 511 KKQESKQEKTITA----HSKAFKAAEKKTRLQILQEKTVA-----------------NIS 549
           K+   K+E  + A         +A  ++ R +                         ++ 
Sbjct: 443 KRVGEKKEGALAAIEDTREDLGEAKARRDRWEEADAADEGEDDEDDEGEERDWLSEPSVP 502

Query: 550 HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
                 WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K 
Sbjct: 503 IRENEPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKA 562

Query: 610 HRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
             P +       +P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL 
Sbjct: 563 TDPSEASSSDIELPDSSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLE 622

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            G F IRG + +    P+ +  G+
Sbjct: 623 KGGFAIRGDRTYYRDTPVDVAVGI 646


>gi|448340269|ref|ZP_21529242.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
 gi|445630575|gb|ELY83836.1| Fibronectin-binding A domain protein [Natrinema gari JCM 14663]
          Length = 722

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 166/712 (23%), Positives = 282/712 (39%), Gaps = 122/712 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT        
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP------ 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S+E   +E D  + D                                         +
Sbjct: 167 LTVSREAFDHEMDDSDTD-----------------------------------------V 185

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
              L   L +G   +E +    G+   M + + ++      +V        E    D+ +
Sbjct: 186 VRTLATQLNFGGLYAEEVCTRAGVEKGMDIDDADE------EVYGRLYETIERLALDIRN 239

Query: 301 GDIVPEGYI--LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           G   P  Y+                ESG++  +  +  P  L +    E   +++F +AL
Sbjct: 240 GTFDPRLYLEPDDAAGDDADGDGTAESGAARVV--DVTPFPLEEHDDLEGEPYDSFLSAL 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE     + AEL+ 
Sbjct: 298 DDYFFRLELAAEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAASLREQAELLY 357

Query: 414 -EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
            EY L  VD  +  ++ A     SW+++    +E  + G   A  I  +      +++  
Sbjct: 358 AEYGL--VDEILSTIQGARERERSWDEIRERFEEGAEQGIDAAEAIVDIDGSDGTVTV-- 413

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKA 530
                E+DDE       ++++D       NA R Y   K+ E K++  + A  +++   A
Sbjct: 414 -----EIDDE-------RIDLDAQQGVEQNADRLYTEAKRVEEKKDGALAAIENTRQDLA 461

Query: 531 AEKKTRLQILQEKT---------------------VANISHMRKVHWFEKFNWFISSENY 569
             K+ R +   +++                      ++I       WF++F WF +S+ +
Sbjct: 462 DAKRRRDEWEADESGGEDDDETDADGDDLPRDWLSESSIPIRENEPWFDRFRWFHTSDGF 521

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 522 LVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIDLPDSSVAE 581

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           A  F+V +S  W D +     + V   QVSKT  +GEYL  G F IRG + +
Sbjct: 582 AAQFSVSYSSVWKDGRYAGDVYAVDSDQVSKTPESGEYLEKGGFAIRGDRTY 633


>gi|149246271|ref|XP_001527605.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146447559|gb|EDK41947.1| hypothetical protein LELG_00125 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 701

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 129/243 (53%), Gaps = 8/243 (3%)

Query: 474 NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK 533
           +NL ++    K +PV+   +DL  S+ ANAR +++ KK  E  Q K       A++ AEK
Sbjct: 196 DNLGKLGSGRKGVPVK---IDLTQSSFANARIYFDSKKAAEQLQLKVEKGAEIAYRNAEK 252

Query: 534 KTRLQIL----QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
           K     +    +E    + S +R   WFEKF WF+SSE YL ++GRD  Q +MI  +Y+ 
Sbjct: 253 KISQDFVRNVKKELGSTDSSALRSKLWFEKFYWFVSSEGYLCLAGRDKTQVDMIYFKYVG 312

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D  V +++ G+    IKN   ++ +PP T+ QAG F +  S AW  K+ T+AW +   
Sbjct: 313 DDDYLVSSEIEGSLKVFIKNPIKDEAIPPSTILQAGIFAMSASHAWSGKVNTAAWVMQAS 372

Query: 650 QVSK-TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
            VSK  +  G  L  G F    KK+ LPP  L+MGFG    +DE S   H   R  R +E
Sbjct: 373 DVSKYDSAAGNLLPPGEFEYFAKKDLLPPAQLVMGFGFYCDVDEESAKKHAAIRVEREQE 432

Query: 709 EGM 711
            G+
Sbjct: 433 HGL 435


>gi|448385151|ref|ZP_21563730.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
           DSM 11522]
 gi|445657436|gb|ELZ10264.1| Fibronectin-binding A domain protein [Haloterrigena thermotolerans
           DSM 11522]
          Length = 719

 Score =  153 bits (386), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 171/727 (23%), Positives = 279/727 (38%), Gaps = 131/727 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +        +  ++ L++E 
Sbjct: 4   KRELTSVDLAALVGELGTYEGAKVDKAYLYGDDLVRLKMRDF-------DRGRLELILEV 56

Query: 63  G--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT      
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP---- 166

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
             LT S+E   +E D  + D                                        
Sbjct: 167 --LTVSREAFDHEMDDSDTD---------------------------------------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            +   L   L +G   +E +    G+   + + + ++       V     A  E    D+
Sbjct: 185 -VVRTLATQLNFGGLYAEEVCTRAGVEKGLDIDDADE------DVYDRIYAAIERLALDI 237

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
            +G+  P  Y    ++  G D           + D   P  L +        +++F +AL
Sbjct: 238 RNGNFDPRLYFAGDDEADGDDESEETDAGDGPVVD-VTPFPLEEHADLPAEGYDSFLSAL 296

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE ++  + AEL+ 
Sbjct: 297 DDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAEQLRERAELLY 356

Query: 414 -EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNCMSLL 471
            EY L  VD  +  V+ A     +W+++    +E    G   A  +ID            
Sbjct: 357 AEYGL--VDEILSTVQQAREQDRAWDEIRERFEEGADRGIAAAEAVID------------ 402

Query: 472 LSNNLDEMDDEEKTLPV----EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
                  +D  E T+ V    E++E+        NA R Y   K+ E K+E  + A    
Sbjct: 403 -------VDGSEGTVTVDLDGERIELVADRGVEQNADRLYTEAKRVEDKKEGALAAIENT 455

Query: 528 FKAAEKKTRLQILQEKTVA---------------------NISHMRKVHWFEKFNWFISS 566
            +  E   R +   E   A                     +I       WF++F WF +S
Sbjct: 456 REDLEDAKRRRDEWEAQDAASDDEDEADDEGPKRDWLADPSIPIRENEPWFDRFRWFHTS 515

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLT 620
           ++YLVI GR+A QNE IVK+Y+  GD  +H   HG   TV+K   P +       +P  +
Sbjct: 516 DDYLVIGGRNADQNEEIVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPESS 575

Query: 621 LNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           + +A  F V ++  W D +     + V   QVSKT  +GEYL  G F IRG + +    P
Sbjct: 576 IEEAAQFAVSYASVWKDGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGDRTYYRDTP 635

Query: 680 LIMGFGL 686
           +    G+
Sbjct: 636 VGAAVGI 642


>gi|269863550|ref|XP_002651263.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064852|gb|EED42793.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 335

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 102/347 (29%), Positives = 168/347 (48%), Gaps = 46/347 (13%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 1   MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 51

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 52  TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 111

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 112 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 149

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 150 -----KIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 204

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 205 NKYMEDRDLYFHCDVKGASSVICKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 258

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 259 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 305


>gi|387595331|gb|EIJ92956.1| hypothetical protein NEPG_02355 [Nematocida parisii ERTm1]
          Length = 700

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 169/357 (47%), Gaps = 37/357 (10%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F +A+D  ++  E      Q K +         KI   QE  +H    E+      A
Sbjct: 269 FDGFGSAMDAAFAVQEITETVSQKKHR---------KIREAQERDLHKKIDEMTILKTKA 319

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           EL+  N  +V   I  +  A A  +S ++  R  KE  K  NP A +I K    +  + L
Sbjct: 320 ELLSENQAEVKNVISVIEAAHAASLSEKEFERF-KESEKDKNPTAKIIKKANFGKKTVDL 378

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
           ++   L              V +D   S        Y+  KK E K +KT  A       
Sbjct: 379 IIDKQL--------------VTIDYTASIFEQINALYQKAKKIEEKLKKTRVA------L 418

Query: 531 AEKKTRLQILQEKTVANISHM-RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
            E +T+ +I   K +  I  + R V WFEKF W I+ ++ L+++GRD++QNE++VK+++ 
Sbjct: 419 EESRTK-EIEVTKRIEKIEKIDRNVFWFEKFRWLITKDSDLILAGRDSKQNEILVKKHLL 477

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH 649
             D Y HAD+ G SS ++  +         T   A    +  S+AW++  +T  + V   
Sbjct: 478 DTDYYFHADVRGGSSVIVGENATVH-----TKEVAAAMALHLSKAWENSTITEVYCVRGE 532

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRG 706
           QVSKTAP GEYLT GSFMI GKK F  P  L  GF ++++L +  +    + R+V G
Sbjct: 533 QVSKTAPAGEYLTHGSFMITGKKEFYHPTKLEYGFSIMYKLKDKEIEISDDNRQVSG 589



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 50/144 (34%), Positives = 72/144 (50%), Gaps = 14/144 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K R++  D+ A V  L ++ G     VY  S K  + K  N           K  LL++
Sbjct: 1   MKGRLSWLDIRAGVNELEKINGCHIKTVYSTSKKAILIKFSN-----------KEQLLID 49

Query: 62  SGVRLHTTAYARDKKN-TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
              + H T    +K N TP    L LR+ I   R+E V QLG+DRI + +   G     +
Sbjct: 50  PPSKFHLTHKNYEKVNLTP--LALYLRREISNYRVEKVTQLGFDRIAVIKIRSGKGCRLL 107

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+E+YA GNI+LTD E  ++ LLR
Sbjct: 108 IIEMYANGNIILTDEELNIINLLR 131


>gi|261350362|ref|ZP_05975779.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
 gi|288861145|gb|EFC93443.1| fibronectin-binding protein A [Methanobrevibacter smithii DSM 2374]
          Length = 668

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 168/340 (49%), Gaps = 20/340 (5%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F+ A DEFYSK  +   +   +A  +   +K  K    QE  +    + ++ S    
Sbjct: 268 FDNFNEACDEFYSKKVNTDIKNIKEAAWNKKVNKFEKRLKLQEETLDNFHKTIETSQHKG 327

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E+I  N   ++  +  V  A++   S++++ + +KE +K G   A + +           
Sbjct: 328 EVIYSNYTTIENLVKVVNNAISKDYSYKEIGKTLKEAKKNGLKEAEIFE----------- 376

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
               ++D+M      L    + ++  L+   NA  +YE  KK + K +    A     K 
Sbjct: 377 ----SIDKMGVLTLKLNETSININPKLTIPENAEIYYEKAKKAKKKTKGATIAIENTKKQ 432

Query: 531 AEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
            EK K + ++  E        ++K + W+EK  WF++S+N LVI GRDA  NE +VK+YM
Sbjct: 433 LEKIKAKKEVAMEHISVPKKRVKKNLKWYEKLRWFVTSDNVLVIGGRDAGTNETVVKKYM 492

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVY 647
              D+Y+HAD+HGA+STVIK       V    L ++G F    S AW     T   +WV 
Sbjct: 493 DNNDIYLHADIHGATSTVIK--LEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVN 550

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           P QV+KT   GE+L  GSF+IRG +N++    + +  G++
Sbjct: 551 PEQVTKTPEAGEFLPKGSFVIRGNRNYIRSAKVRIAIGIV 590



 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 63/111 (56%), Gaps = 3/111 (2%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           ++ L+ME G R+HT+ Y  +    P  F + LRK I+   +  + Q  +DRII  +  + 
Sbjct: 47  RIDLVMECGKRIHTSKYPLENPINPPVFPMLLRKRIKGANVVSITQHNFDRII--EIKVK 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + +Y +++EL+ +GNI+L D +  ++  L+  R  D+ ++    +++P E
Sbjct: 105 KDKYYTIVVELFDKGNIILLDEDNNIILPLKRKRFSDRDISSKKEYQFPEE 155


>gi|167044451|gb|ABZ09127.1| putative domain of unknown function (DUF814) [uncultured marine
           crenarchaeote HF4000_APKG6D9]
          Length = 648

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 163/681 (23%), Positives = 292/681 (42%), Gaps = 143/681 (20%)

Query: 23  GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGF 82
           G   SN+Y ++  + +FKL ++       +S+  +++  SGV L  TA   D+   P+  
Sbjct: 21  GYYISNIYGITKDSILFKLHHTE------KSDLFMMVSTSGVWL--TAVKIDQME-PNRL 71

Query: 83  TLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLT 141
             +LR  +   +L+ + Q+G +RI  F F  G    +V++ E +  GNILL   E  +L 
Sbjct: 72  LKRLRSDLLRLKLKKIEQIGAERIAYFTFE-GFGKEFVLVGEFFGDGNILLCSKEMKILA 130

Query: 142 LLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNN 201
           L  S         I  RHR               KL   L   + P+         +G +
Sbjct: 131 LQHS---------IEVRHR---------------KLSVGLEYVQPPN---------NGLD 157

Query: 202 VSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILD 261
           + N  + +         FD+ K S     D   AK       LG    Y   + E   +D
Sbjct: 158 IFNILESD---------FDVLKTS-----DLVSAKW--FGRTLGLPKKYVEGIFEIANID 201

Query: 262 TGLVPNMKLS-EVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDH 320
              + N+  + E+ K+ +   +V++           DVISG+  P   I+++N+      
Sbjct: 202 PKKIGNLLTNDEITKIFETTKKVVL-----------DVISGNHKP---IIIRNEK----- 242

Query: 321 PPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDA 380
                        E  P+ L +    E V   +F   LD  Y++    + +    +  D 
Sbjct: 243 ------------TEILPIKLGKMDG-EIVDVNSFIEGLDTVYTENIVTKGKSIQSSGSDK 289

Query: 381 AFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDL 440
              +      +QE  + T+K   DRS  +  +     E V + IL++  A A ++     
Sbjct: 290 KIKEFQTQISEQEKAIQTVK---DRSKNITNVANSLFEMVSSGILSIEDASAQKILVNHN 346

Query: 441 ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAH 500
           A++  E+                    +SL++  +             EK++++    A 
Sbjct: 347 AKLTSEK-------------------GISLIIVQD-------------EKIKIN----AK 370

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT-----VANISHMRKVH 555
           +  +    L   +  KQ + I++  +     EKK  L+  Q KT     +  ++ +RK  
Sbjct: 371 SPLQSIASLLFNEAKKQSRAISSIEEIKSKTEKK--LEKFQNKTESEQDIMLVTEIRKKS 428

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           W+E++ WF +++ YL + GRDA  N  +V++++ K D   HAD+ G+   +IK+    + 
Sbjct: 429 WYERYRWFYTTDGYLAVGGRDAASNSAVVRKHLVKNDKIFHADIFGSPFFIIKD---AEH 485

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            P  ++++    TVC S+AW   +    A+W++P QV K+AP+GE+L  GSF I G++NF
Sbjct: 486 APATSMDEVAHATVCFSRAWREGLYGVKAYWIHPEQVKKSAPSGEFLPKGSFTIEGQRNF 545

Query: 675 LPPHPLIMGFGLLFRLDESSL 695
           +    L +  G++ + D  +L
Sbjct: 546 INSKNLKLAVGIIQQEDGHAL 566


>gi|222445070|ref|ZP_03607585.1| hypothetical protein METSMIALI_00687 [Methanobrevibacter smithii
           DSM 2375]
 gi|222434635|gb|EEE41800.1| fibronectin-binding protein A domain protein [Methanobrevibacter
           smithii DSM 2375]
          Length = 668

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 103/340 (30%), Positives = 168/340 (49%), Gaps = 20/340 (5%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F+ A DEFYSK  +   +   +A  +   +K  K    QE  +    + ++ S    
Sbjct: 268 FDNFNEACDEFYSKKVNTDIKNIKEAAWNKKVNKFEKRLKLQEETLDNFHKTIETSQHKG 327

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E+I  N   ++  +  V  A++   S++++ + +KE +K G   A + +           
Sbjct: 328 EVIYSNYTTIENLVKVVNNAISKDYSYKEIGKTLKEAKKNGLKEAEIFE----------- 376

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
               ++D+M      L    + ++  L+   NA  +YE  KK + K +    A     K 
Sbjct: 377 ----SIDKMGVLTLKLNETSININPKLTIPENAEIYYEKAKKAKKKTKGATIAIENTKKQ 432

Query: 531 AEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM 588
            EK K + ++  E        ++K + W+EK  WF++S+N LVI GRDA  NE +VK+YM
Sbjct: 433 LEKIKAKKEVAMEHISVPKKRVKKNLKWYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKYM 492

Query: 589 SKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVY 647
              D+Y+HAD+HGA+STVIK       V    L ++G F    S AW     T   +WV 
Sbjct: 493 DNNDIYLHADIHGATSTVIK--LEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWVN 550

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           P QV+KT   GE+L  GSF+IRG +N++    + +  G++
Sbjct: 551 PEQVTKTPEAGEFLPKGSFVIRGNRNYIRSAKVRIAIGIV 590



 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 63/111 (56%), Gaps = 3/111 (2%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           ++ L+ME G R+HT+ Y  +    P  F + LRK I+   +  + Q  +DRII  +  + 
Sbjct: 47  RIDLVMECGKRIHTSKYPLENPINPPVFPMLLRKRIKGANVVSITQHNFDRII--EIKVK 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + +Y +++EL+ +GNI+L D +  ++  L+  R  D+ ++    +++P E
Sbjct: 105 KDKYYTIVVELFDKGNIILLDEDNNIILPLKRKRFSDRDISSKKEYQFPEE 155


>gi|397772651|ref|YP_006540197.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
 gi|397681744|gb|AFO56121.1| Fibronectin-binding A domain protein [Natrinema sp. J7-2]
          Length = 722

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 166/712 (23%), Positives = 281/712 (39%), Gaps = 122/712 (17%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        RT        
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPDT------RTNP------ 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           LT S+E   +E D  + D                                         +
Sbjct: 167 LTVSREAFDHEMDDSDTD-----------------------------------------V 185

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
              L   L +G   +E +    G+   M + + ++      +V        E    D+ +
Sbjct: 186 VRTLATQLNFGGLYAEEVCTRAGVEKGMDIDDADE------EVYGRLYETIERLALDIRN 239

Query: 301 GDIVPEGYI--LMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
           G   P  Y+                ESG++  +  +  P  L +    E   +++F +AL
Sbjct: 240 GTFDPRLYLEPDDAAGDDADGDGTAESGAARVV--DVTPFPLEEHDDLEGEPYDSFLSAL 297

Query: 359 DEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           D+++ ++E    E+     +   F     K  +I   Q+  +   +QE     + AEL+ 
Sbjct: 298 DDYFFRLELAAEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAASLREQAELLY 357

Query: 414 -EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
            EY L  VD  +  ++ A     SW+++    +E  + G   A  I  +      +++  
Sbjct: 358 AEYGL--VDEILSTIQGARERERSWDEIRERFEEGAEQGIDAAEAIVDIDGSDGTVTV-- 413

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HSKAFKA 530
                E+DDE       ++++D       NA R Y   K+ E K++  + A  +++   A
Sbjct: 414 -----EIDDE-------RIDLDAQQGVEQNADRLYTEAKRVEEKKDGALAAIENTRQDLA 461

Query: 531 AEKKTRLQILQEKT---------------------VANISHMRKVHWFEKFNWFISSENY 569
             K+ R +   +++                      ++I       WF++F WF +S+ +
Sbjct: 462 DAKRRRDEWEADESGGEDDDETDADGDDLPRDWLSESSIPIRENEPWFDRFRWFHTSDGF 521

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 522 LVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIDLPESSVAE 581

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           A  F V +S  W D +     + V   QVSKT  +GEYL  G F IRG + +
Sbjct: 582 AAQFAVSYSSVWKDGRYAGDIYAVDSDQVSKTPESGEYLEKGGFAIRGDRTY 633


>gi|444317477|ref|XP_004179396.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
 gi|387512437|emb|CCH59877.1| hypothetical protein TBLA_0C00610 [Tetrapisispora blattae CBS 6284]
          Length = 1053

 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 125/456 (27%), Positives = 215/456 (47%), Gaps = 78/456 (17%)

Query: 307 GYILMQ---NKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR--SREFVKFETFDAALDEF 361
           GYI+ +   N  +G+D    E    T  ++ F P +    R  S+  +    ++  LD+F
Sbjct: 274 GYIVAKKNPNYVIGRDADDLEYVYET--FNPFEPFIDETHRTNSKIIIVDGPYNLTLDKF 331

Query: 362 YSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
           ++ IES +   + + +E+ A  K+   H++ + R+  L      + +    I  N E ++
Sbjct: 332 FTTIESSKYALKIQTQEEQAKKKIEDAHLENKKRIDALINVQTSNEQKGYAIIANTELIE 391

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYLERNCMSLLL-------- 472
               AV+  +  +M W  + +++K E+  GN VA  +I  L L+ N ++++L        
Sbjct: 392 TTKYAVQGLVDQQMDWNTIEKLIKNEQVRGNEVAENIILPLNLKENTINMILPLKSETSS 451

Query: 473 ---------------SNNL---DEMDDEEKTLPVEK------------------------ 490
                          S+N    +   DEE  + VE+                        
Sbjct: 452 IENSSSEEQDEYCSESDNEPANENTSDEESDISVEQDVSDFVEVTTIGNSPLISKKSKHK 511

Query: 491 ----------VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--KTRLQ 538
                     V +DL+LSA+ANA R+++ KKK   KQ++      KA K  E+  +T LQ
Sbjct: 512 RLQNNENSIIVSIDLSLSAYANASRYFDTKKKTAEKQKRVEENAEKAMKNIEQGIETSLQ 571

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
              +++   +  +RK ++FEK++WFISSE  LV+ G+ + + + I  +Y+   D+Y+   
Sbjct: 572 RKLKESHEVLKKIRKPYFFEKYHWFISSEKILVLMGKSSTETDQIYSKYIEDDDIYMSNS 631

Query: 599 LHGASSTVIKNHRPEQ-PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
               +   IKN  PE+  + P TL QAG F +  S+AW  K+ +S WW     VSK    
Sbjct: 632 FD--TQVWIKN--PEKIEISPNTLMQAGVFCMSSSEAWSKKIASSPWWCKAKNVSKFDKE 687

Query: 658 GE-YLTVGSFMIR--GKKNFLPPHPLIMGFGLLFRL 690
           G   L  G F+++   +K+ LPP  L+MG GLL+++
Sbjct: 688 GNTCLEPGKFILKNENEKHSLPPAQLVMGIGLLWKV 723



 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 81/137 (59%), Gaps = 12/137 (8%)

Query: 20  RLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKN 77
           +L G R +N+Y++  + + ++ K         +    K+ ++++ G+R+H T + R    
Sbjct: 20  KLEGYRLTNIYNIADTKRQFLLKF--------NKPDSKLNVVVDCGLRIHLTDFTRHIPQ 71

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEF 137
            PS F +KLRKH++++RL  +RQ+  DRII+ QF  G+   Y++LE ++ GN++L D   
Sbjct: 72  FPSDFVIKLRKHLKSKRLTKLRQVPGDRIIVLQFAEGL--FYLVLEFFSAGNVILLDENK 129

Query: 138 TVLTLLRSHRDDDKGVA 154
           T+L+L R  ++ +  V 
Sbjct: 130 TILSLQRVVKEHENKVG 146


>gi|333910763|ref|YP_004484496.1| fibronectin-binding A domain-containing protein [Methanotorris
           igneus Kol 5]
 gi|333751352|gb|AEF96431.1| Fibronectin-binding A domain protein [Methanotorris igneus Kol 5]
          Length = 675

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 189/368 (51%), Gaps = 30/368 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++ + E  ++  F  ALD+++++  ++   ++ ++K      K  +I   
Sbjct: 257 YVDVVPINLKKYENFEKKEYGEFLEALDDYFAQFMAKVETKKEESKLQKLIKKQERILKT 316

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   ++++  + +  +LI  N   VD  +  +R A   +M W  + ++V E +   
Sbjct: 317 QLETLEKYEKQMQENQEKGDLIYANYTLVDEILNTLRNA-REKMEWYKIKKIVNEHK--D 373

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI  +  +   + + LS +  +   E+       V +D+  +A  NA  +Y   K
Sbjct: 374 HPILGLIQNINEKNGEIVIKLSADYGDKKIEKN------VSLDIRKNAFENAETYYTKSK 427

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEK-----------TVANISHMRKVHWFEKF 560
           K +SK    I    +A K +EKK  L  L+EK                   ++  W+EKF
Sbjct: 428 KLKSK----IEGIKEAIKLSEKK--LAELKEKGEIELKELKEKEKIKKKERKERKWYEKF 481

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            W + +  +LVI+G+DA  NE+++K+Y    D+  HA + GA  TVIK ++  + V   T
Sbjct: 482 KWTVIN-GFLVIAGKDAVTNELLIKKYTEDDDIVFHAQIEGAPFTVIKTNK--RIVDEET 538

Query: 621 LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           LN+   F+V HS+AW         +WV P QVSKTA +GEYL  G+F+IRGK+NF+   P
Sbjct: 539 LNEVAKFSVAHSRAWKLGWGALDTYWVKPEQVSKTAESGEYLKKGAFVIRGKRNFIRNVP 598

Query: 680 LIMGFGLL 687
           L +G G++
Sbjct: 599 LELGIGII 606



 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 54/99 (54%)

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +  T Y R+K   P  F + LRKH++  ++  + Q  +DRI++  F      + +++EL+
Sbjct: 63  ITMTNYEREKPKIPPTFAMLLRKHLKNIKITKIEQHDFDRIVIITFEWNETVYKLVIELF 122

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +GN++L D E  ++  L+  R   + +A    +++P +
Sbjct: 123 GEGNVILLDKEDRIIMPLKIERWSTRTIAPKEIYKFPPQ 161


>gi|148642838|ref|YP_001273351.1| RNA-binding protein snRNP-like protein [Methanobrevibacter smithii
           ATCC 35061]
 gi|148551855|gb|ABQ86983.1| predicted RNA-binding protein, eukaryotic snRNP-like protein
           [Methanobrevibacter smithii ATCC 35061]
          Length = 668

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/341 (30%), Positives = 168/341 (49%), Gaps = 22/341 (6%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+ F+ A DEFYSK  +   +   +A  +   +K  K    QE  +    + ++ S    
Sbjct: 268 FDNFNEACDEFYSKKVNTDIKNIKEAAWNKKVNKFEKRLKLQEETLDNFHKTIETSQHKG 327

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E+I  N   ++  +  V  A++   S++++ + +KE +K                NC+  
Sbjct: 328 EVIYSNYTTIENLVKVVNNAISKDYSYKEIGKTLKEAKK----------------NCLKE 371

Query: 471 L-LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
             +  ++D+M      L    + ++  L+   NA  +YE  KK + K +    A     K
Sbjct: 372 AEIFESIDKMGVLTLKLNETSININPKLTIPENAEIYYEKAKKAKKKTKGATIAIENTKK 431

Query: 530 AAEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
             EK K + ++  E        ++K + W+EK  WF++S+N LVI GRDA  NE +VK+Y
Sbjct: 432 QLEKIKAKKEVAMEHISVPKKRVKKNLKWYEKLRWFVTSDNVLVIGGRDAGTNEAVVKKY 491

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWV 646
           M   D+Y+HAD+HGA+STVIK       V    L ++G F    S AW     T   +WV
Sbjct: 492 MDNNDIYLHADIHGATSTVIK--LEGNKVNDSILKESGEFAASFSTAWSKGFTTQDVFWV 549

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
            P QV+KT   GE+L  GSF+IRG +N++    + +  G++
Sbjct: 550 NPEQVTKTPEAGEFLPKGSFVIRGNRNYIRSAKVRIAIGIV 590



 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 63/111 (56%), Gaps = 3/111 (2%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           ++ L+ME G R+HT+ Y  +    P  F + LRK I+   +  + Q  +DRII  +  + 
Sbjct: 47  RIDLVMECGKRIHTSKYPLENPINPPVFPMLLRKRIKGANVVSITQHNFDRII--EIKVK 104

Query: 115 MNAHY-VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            + +Y +++EL+ +GNI+L D +  ++  L+  R  D+ ++    +++P E
Sbjct: 105 KDKYYTIVVELFDKGNIILLDEDNNIILPLKRKRFSDRDISSKKEYQFPEE 155


>gi|170582502|ref|XP_001896158.1| hypothetical protein [Brugia malayi]
 gi|158596691|gb|EDP34993.1| conserved hypothetical protein [Brugia malayi]
          Length = 643

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 70/131 (53%), Positives = 92/131 (70%), Gaps = 4/131 (3%)

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKT 654
           +HAD+ GASS +I+N      VPP TLN+A    + +S AW++K+ +SAWWV+ HQVS+T
Sbjct: 1   MHADVRGASSIIIRNKLGGGDVPPRTLNEAATMAISYSSAWEAKITSSAWWVHQHQVSRT 60

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDF 714
           APTGEYLT GSFMIRGKKN+LP   L MGFG++F+LDE SL  H  ER+V      M   
Sbjct: 61  APTGEYLTPGSFMIRGKKNYLPTCQLQMGFGVMFQLDEESLERHREERKV----APMVTA 116

Query: 715 EDSGHHKENSD 725
           ED+  H+++ D
Sbjct: 117 EDNAMHQDDGD 127


>gi|328909421|gb|AEB61378.1| serologically defined colon cancer antigen 1-like protein, partial
           [Equus caballus]
          Length = 302

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 89/238 (37%), Positives = 138/238 (57%), Gaps = 11/238 (4%)

Query: 240 LKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI 299
           LK VL   L YGPAL EH +++ G   N+K+ E  K E   I+ +++ + K ED+++   
Sbjct: 45  LKRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KFESKDIEKVLVCLQKAEDYMK--T 100

Query: 300 SGDIVPEGYILMQNKHLGKDHPPTESGSSTQ---IYDEFCPLLLNQFRSREFVKFETFDA 356
           + +   +GYI+ + +      P  E    TQ    Y+EF P L +Q     +++FE+FD 
Sbjct: 101 TSNFSGKGYIIQKREM----KPSLEVDKPTQDILTYEEFHPFLFSQHSQCPYIEFESFDK 156

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYN 416
           A+DEFYSKIE Q+ + +   +E  A  KL+ +  D E+R+  L+Q  +      ELIE N
Sbjct: 157 AVDEFYSKIEGQKIDLKALQQEKQALKKLDNVRKDHEDRLEALQQAQEIDKLKGELIEMN 216

Query: 417 LEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
           L+ VD AI  VR ALAN++ W ++  +VKE +  G+PVA  I +L L+ N +++LL N
Sbjct: 217 LQIVDRAIQVVRSALANQIDWTEIGLIVKEAQAQGDPVANAIKELKLQTNHVTMLLRN 274


>gi|150865765|ref|XP_001385110.2| highly conserved hypothetical protein Predicted RNA-binding
           [Scheffersomyces stipitis CBS 6054]
 gi|149387021|gb|ABN67081.2| conserved hypothetical protein Predicted RNA-binding protein
           [Scheffersomyces stipitis CBS 6054]
          Length = 1038

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 89/223 (39%), Positives = 121/223 (54%), Gaps = 2/223 (0%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ--EKTVANI 548
           V +DL+LS +ANAR ++E KK  ESK+EK       A K AE+K +  +    +     +
Sbjct: 521 VWIDLSLSPYANARLYFESKKSAESKKEKVEKNTEMALKNAERKIKQDLAHNLKNEHDTL 580

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  +WFEKF WF+SSE YL ++GRD  Q +MI  R+ +  D +V A++ G+    +K
Sbjct: 581 KQLRPKYWFEKFYWFVSSEGYLCLAGRDPSQTDMIYYRFFNDNDFFVSAEMEGSLKVFVK 640

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
           N    + VPP TL QAG F    S AW  K+ TSAW ++   VSK    G  L  G F  
Sbjct: 641 NPFKGESVPPYTLMQAGNFAKSTSTAWSGKVSTSAWVLHGSDVSKKDFDGSLLAGGEFNY 700

Query: 669 RGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           + KK FLPP  L MGFGL    DE +   +   R  +  E G 
Sbjct: 701 KSKKEFLPPTQLTMGFGLYLLGDEETAQKYTKLRVNKEVEHGF 743



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 120/469 (25%), Positives = 224/469 (47%), Gaps = 63/469 (13%)

Query: 21  LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           +   R  N+Y+L  S + Y+ K         S    K +++++ G R+H T + R     
Sbjct: 21  IANYRLQNIYNLAGSNRQYVLKF--------SVPDSKKIVVLDCGNRVHLTDFDRPTTPA 72

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           PS F  KLRKH++TRRL  ++Q+G DR+++ +F  G+   Y++LE ++ GN+LL D    
Sbjct: 73  PSNFVSKLRKHLKTRRLSGIKQVGNDRVLVLEFSDGL--FYLVLEFFSAGNVLLLDDNLK 130

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASKLHAALTSSKEPDANEPDK-VN 196
           +L+L R+ +  +KG       +Y   EI ++F+++  S+        ++ + +E    + 
Sbjct: 131 ILSLQRNVK--EKG----ENDKYAVNEIYKMFDKSLFSEDFK--YEKRDYNVDEIKAWIK 182

Query: 197 EDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSE 256
           E    V N S+E   G+K  K F + K                   +L   + +   LS 
Sbjct: 183 EQRIKVENQSQEPSSGKK-SKVFSIHK-------------------LLFVNVSH---LSS 219

Query: 257 HIIL----DTGLVPNMKLSEVNKLEDN-AIQVLVLAVAKFE-DWLQDVISGDI-VPEGYI 309
            +IL    + G+  +    E    EDN  +  +V A+ K E +++  + +GD     G+I
Sbjct: 220 DLILKNLQNAGISGSSSCFEF--AEDNEKLSTIVGALDKSEQEYISFISAGDNEQTNGFI 277

Query: 310 LMQNKHLGKDHPPTESGSSTQ---IYDEFCPL--LLNQFRSREFVKFETFDAALDEFYSK 364
           + +   L   + P+E  S      +YDEF P           +F + E ++  LD F+S 
Sbjct: 278 VSKKNPL---YNPSEEHSDNDLEYVYDEFHPFKPFKKNLEGYKFTEIEGYNKTLDTFFSA 334

Query: 365 IESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAI 424
           +ES +   + + ++  A  +L     ++  ++ +L Q+ + + K  + I Y+ + V + I
Sbjct: 335 LESTKFALKIEQQKQNANKRLENARSERNKQIQSLIQQQETNSKKGDTIIYHADLVASCI 394

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMSLLL 472
            A++  L  +M W ++  +VK E+ +GN +   I   L L  N ++L+L
Sbjct: 395 SAIQKMLDKQMDWGNIEAIVKHEQSSGNEIMSTIKLPLNLNENKINLVL 443



 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 20/31 (64%), Positives = 23/31 (74%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG+K KLKKM +KY DQDEEER +RM  L  
Sbjct: 823 RGKKAKLKKMAQKYADQDEEERRLRMTALGT 853


>gi|448306550|ref|ZP_21496454.1| fibronectin-binding A domain-containing protein [Natronorubrum
           bangense JCM 10635]
 gi|445597848|gb|ELY51920.1| fibronectin-binding A domain-containing protein [Natronorubrum
           bangense JCM 10635]
          Length = 710

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 164/669 (24%), Positives = 278/669 (41%), Gaps = 121/669 (18%)

Query: 55  KVLLLMESG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RIELILEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P        
Sbjct: 109 FEREDGTTRLIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFPD------- 161

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
               S+L+  LT S+E                               +FDL    +    
Sbjct: 162 ----SRLNP-LTVSRE-------------------------------AFDLEMEDSDTD- 184

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDN------AIQVL 284
                    +   L   L +G   +E I    G+   M +++ +  ED+      AI+ L
Sbjct: 185 ---------VVRTLATQLNFGGLYAEEICTRAGIEKGMDIADAD--EDDYDRLYEAIERL 233

Query: 285 VLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFR 344
            L          D+ + +  P  Y+              +   S ++ D   P  L +  
Sbjct: 234 AL----------DLRNANFEPRLYLEDGEDG-------DDDDESARVVDA-TPFPLEEHA 275

Query: 345 SREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF----HKLNKIHMDQENRVHTLK 400
                 +++F AALD+++ ++E    E+     +   F     K  +I   Q+  +   +
Sbjct: 276 ELAAEPYDSFLAALDDYFFRLELDDEEEPDPTTQKPDFGEEIAKYERIIDQQQGAIEGFE 335

Query: 401 QEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKE--ER--KAGNPV 454
           Q+ D   + AEL+  EY L  VD  +  ++ A A    W+++    +E  ER  +A   V
Sbjct: 336 QQADELREQAELLYAEYGL--VDDILSTIQDARAQDRPWDEIEARFEEGAERGIEAAEAV 393

Query: 455 AGL-----IDKLYLERNCMSLL----LSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
            G+     I  + ++ + + L+    +  N D +  E K +  +K     AL+A  + R 
Sbjct: 394 VGIDSSEGIVTVDIDGDRIDLVAHDGVEQNADRLYTEAKRVAEKKAG---ALAAIEDTRE 450

Query: 506 WYE-LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
             E  K++++              + AE+K  L++       +I       WF++F WF 
Sbjct: 451 DLEDAKRRRDEWDADDEGDEEADDEEAEEKNWLEM------PSIPIRENEPWFDRFRWFH 504

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPP 618
           +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P 
Sbjct: 505 TSDGYLVIGGRNADQNEDLVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPD 564

Query: 619 LTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
            ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + +   
Sbjct: 565 SSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYHRD 624

Query: 678 HPLIMGFGL 686
            P+ +  G+
Sbjct: 625 TPVGVAVGI 633


>gi|73669087|ref|YP_305102.1| hypothetical protein Mbar_A1575 [Methanosarcina barkeri str.
           Fusaro]
 gi|72396249|gb|AAZ70522.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 797

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 112/407 (27%), Positives = 197/407 (48%), Gaps = 22/407 (5%)

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           I PE  +  +  +L   H   E     + +D   P  L ++   E   F++F+ ALDEF+
Sbjct: 278 IKPEVGVEGEAPNLRPQHVKKEIKGKLETFD-VLPFDLTRYSGFEKEYFDSFNTALDEFF 336

Query: 363 SKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
            K    Q  E +   K++       +  + QE  +   ++E++++  +AE +  N + ++
Sbjct: 337 GKKALEQIEEVKAAKKKEKTLGVYERRLLQQEGSLKKFEKEIEKNNTLAETVYANYQGIE 396

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDD 481
             +  +  A +   SW+++  ++K+ +K   P A  I  +      +++    N D    
Sbjct: 397 ELLSVLNGARSTGYSWDEIRSILKQAKKT-VPAAQKITNIDPRTGTVTV----NFDG--- 448

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
                  + + +D+  +   NA+ +YE  KK   K++  + A     KA EKK   ++ +
Sbjct: 449 -------KSISLDIRKTVPQNAQEYYEKVKKFNKKKDGALKAIEDTRKAMEKKAVAKVAK 501

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
                  S  RK HW+++F WF+SS+ + ++ GRDA  NE I K+Y+ K D+  H    G
Sbjct: 502 AGRKLRAS--RKKHWYDRFRWFVSSDGFFIVGGRDADTNEEIFKKYLEKRDLVFHTQTPG 559

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEY 660
           A  TVIK    E  VP  TL +A  F V +S  W +   +   +WV   QVSKT  +GEY
Sbjct: 560 APLTVIKTGGEE--VPESTLQEAAQFAVSYSSLWKAGHFSGDCYWVKAEQVSKTPESGEY 617

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
           +  G+F+IRG++N+    PL +  GL  + +   +G  ++  R  G+
Sbjct: 618 VKKGAFIIRGERNYFKDIPLGVAVGLELKGETRVIGGPVSAVRKHGD 664



 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 78/151 (51%), Gaps = 27/151 (17%)

Query: 2   VKVRMNTADVAAEVKCL----RRLIGMRCSNVY-----DLSPKTYIFKLMNSSGVTESGE 52
           +K  M++ADVAA V  L    + +I  +   +Y     ++    Y+F     +       
Sbjct: 1   MKQDMSSADVAAVVAELSAGPKSIIDAKIGKIYQPANEEIRINLYVFHQGRDN------- 53

Query: 53  SEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
                L++E+G R+H + Y R     P  F + LRK++   R+  V Q  +DRI+  + G
Sbjct: 54  -----LVIEAGKRIHLSKYLRASPTLPQAFPMLLRKYLMGGRIVSVEQHDFDRIV--KIG 106

Query: 113 L---GMNAHYVILELYAQGNILLTDSEFTVL 140
           +   G++++ +I+EL+A GNIL+ DSE  ++
Sbjct: 107 IERAGVHSN-LIVELFAPGNILIVDSENRII 136


>gi|256811227|ref|YP_003128596.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
           fervens AG86]
 gi|256794427|gb|ACV25096.1| Fibronectin-binding A domain protein [Methanocaldococcus fervens
           AG86]
          Length = 671

 Score =  149 bits (377), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 189/361 (52%), Gaps = 17/361 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++   E   + +F  A+D++++K  +    ++ K+K +    K   I   
Sbjct: 255 YFDVVPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLTNIVVKKEKSKIEREIEKQENILKR 314

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q + +   K++ +++    +LI  N + V+  + A+R A   +M W  + ++V+E ++  
Sbjct: 315 QMDTLKKYKEDAEKNQIKGDLIYANYQIVEELLSAIRQA-REKMDWARIKKIVRENKE-- 371

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI+ +      + + L + +D+   EE+      V +D+  +A  NA  +YE  K
Sbjct: 372 HPILGLIENINENVGEIVIRLKSEVDDKVIEER------VSLDIRKNAFENAENYYEKAK 425

Query: 512 KQESKQEKTITA----HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           K ++K E    A      K  +  +K       +EK        ++  W+EKF W + + 
Sbjct: 426 KLKNKIEGIENAIELTKKKIEELKKKGEEELKEKEKLKMKKKVRKERKWYEKFKWTVIN- 484

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI+G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK    E  V   TL +   F
Sbjct: 485 GFLVIAGKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTEGRE--VDEETLEEVAKF 542

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+
Sbjct: 543 SVSHSRAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGV 602

Query: 687 L 687
           +
Sbjct: 603 I 603



 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/166 (24%), Positives = 80/166 (48%), Gaps = 8/166 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDL---SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   DV   V  L+ LI  R    + +   + +  I K+     V E G  E V+ 
Sbjct: 1   MKTEMTNVDVCCVVDELQSLINGRLDKAFLIDNENNRELILKIH----VPEGGSRELVIS 56

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           + +    +  T Y R+K   P  F + LRK+++  +L  + Q+ +DRI++F F      +
Sbjct: 57  IGKYKY-ITLTNYEREKPKLPPSFAMLLRKYLKNAKLVKIEQVNFDRIVIFHFETKEGIY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +++EL+ +GN +  ++E  ++  LR  R   + +     +++P +
Sbjct: 116 KLVVELFGEGNAIFLNNENVIIAPLRVERWSTRKIVPKEEYKFPPQ 161


>gi|448546430|ref|ZP_21626594.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
 gi|448548417|ref|ZP_21627684.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
 gi|448557611|ref|ZP_21632800.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
 gi|445702883|gb|ELZ54823.1| hypothetical protein C460_17818 [Haloferax sp. ATCC BAA-646]
 gi|445714168|gb|ELZ65935.1| hypothetical protein C458_13126 [Haloferax sp. ATCC BAA-644]
 gi|445714512|gb|ELZ66274.1| hypothetical protein C459_05213 [Haloferax sp. ATCC BAA-645]
          Length = 702

 Score =  149 bits (377), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 102/365 (27%), Positives = 172/365 (47%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +Q+     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQERIIDQQEGAIEGFEQQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+   ++E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRDAREEGVPWDDIGATLEEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL       D+++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTL-------DVSMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +    A KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELEAVKKRRDEWEADDDEDDEEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL+
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLH 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLSGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|448622787|ref|ZP_21669436.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
           35960]
 gi|445753295|gb|EMA04712.1| hypothetical protein C438_10403 [Haloferax denitrificans ATCC
           35960]
          Length = 701

 Score =  149 bits (375), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 170/364 (46%), Gaps = 42/364 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +++     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPDFEEQIAKQQRIIDQQEGAIEGFEKQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+   + E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLAEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL V       ++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK-----------------TVANISHMRKVHWFEKFNWFISSEN 568
           +   AA KK R +   +                   + ++      HWFE+F WF +S  
Sbjct: 441 REELAAVKKRRDEWEADDDDDEEDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTSSG 500

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLNQ 623
           YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL +
Sbjct: 501 YLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLRE 560

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF++RG + +    P  +
Sbjct: 561 AAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVVRGDREYFEDVPAKV 620

Query: 683 GFGL 686
             G+
Sbjct: 621 AVGI 624



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|448602394|ref|ZP_21656450.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445747909|gb|ELZ99363.1| hypothetical protein C441_00535 [Haloferax sulfurifontis ATCC
           BAA-897]
          Length = 702

 Score =  149 bits (375), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 103/365 (28%), Positives = 170/365 (46%), Gaps = 43/365 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE---DAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           ++TF+ ALDE++ +++    EQ+  +     +    K  +I   QE  +   +++     
Sbjct: 275 YDTFNDALDEYFFRLDLTADEQEATSDRPNFEEQIAKQQRIIDQQEGAIEGFEKQAQDER 334

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           + AEL+  N + VD  +  VR A    + W+D+   + E  + G P A  +  +      
Sbjct: 335 ERAELLYANYDLVDDVLSTVRGAREEGVPWDDIGETLAEGAEQGIPEAEAVTNVDGANGT 394

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--HS 525
           +++       ++DD   TL V       ++    NA R Y   K+ E K+E  + A   +
Sbjct: 395 VTV-------DLDDATVTLEV-------SMGVEKNADRLYTEAKRIEEKKEGALAAIEDT 440

Query: 526 KAFKAAEKKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFISSE 567
           +   AA KK R +   +                    + ++      HWFE+F WF +S 
Sbjct: 441 REELAAVKKRRDEWEADDDDEDEDDEDEEPEETDWLALDSVPVKSTEHWFERFRWFHTST 500

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-----TLN 622
            YLV+ GR+A QNE +VK+YMSK D + H   HG   T++K   P +P   +     TL 
Sbjct: 501 GYLVVGGRNADQNEELVKKYMSKHDRFFHTQAHGGPVTLLKATGPSEPAQAVDFSEETLR 560

Query: 623 QAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A  F V +S  W + +    A+ V P QVSKT  +GEY+  GSF+IRG + +    P  
Sbjct: 561 EAAQFAVSYSSIWKEGRFADDAYMVEPSQVSKTPESGEYIEKGSFVIRGDREYFEDVPAK 620

Query: 682 MGFGL 686
           +  G+
Sbjct: 621 VAVGI 625



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVTELNRYEGAKVDKAYLYGDDLLRLKMRDF-------DRGRLELLLEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F F  G    
Sbjct: 57  GEIKRAHLAAQEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQYEFDRILTFTFERGDENT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +++EL+ QGNI + D    V+  L + R   + VA  S++ YP
Sbjct: 117 KIVVELFGQGNIAVLDETGEVVRSLETVRLKSRTVAPGSQYEYP 160


>gi|448409564|ref|ZP_21574778.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
 gi|445672910|gb|ELZ25479.1| hypothetical protein C475_10624 [Halosimplex carlsbadense 2-9-1]
          Length = 729

 Score =  148 bits (373), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 146/656 (22%), Positives = 252/656 (38%), Gaps = 129/656 (19%)

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFT 138
           P  F + LR  ++   L DV Q  +DRI+   F        ++ EL+  GN+ + D    
Sbjct: 78  PPNFAMMLRNRMQGAELVDVSQFQFDRILELTFERDDETTTIVAELFGDGNVAILDGTGE 137

Query: 139 VLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNED 198
           V+  L                    E  R+  RT A        S++             
Sbjct: 138 VIDCL--------------------ETVRLKSRTVAPGAQYEFPSAR------------- 164

Query: 199 GNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHI 258
            N +             G  +D  +   ++S+         L   L   L +G    E +
Sbjct: 165 FNPL-------------GVDYDAFEARMRDSD-------SDLVRTLATQLNFGGLYGEEL 204

Query: 259 ILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGK 318
               G+  N+ + E     D+ ++ L  A+ +  D L D    D+ P  Y  + +     
Sbjct: 205 CTLAGVDYNVPIEEAT---DDQLRALYDALRRLADRLAD---SDLDPRVYYDLDDPD--A 256

Query: 319 DHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE 378
           + P  + G+      +  P+ L ++  R    F++F+ ALD++++    +  E    A  
Sbjct: 257 EDPTDDDGAIEGQRVDVTPIPLAEYDDRYGEPFDSFNEALDDYFTFASDEDDEGGGDAAG 316

Query: 379 --------DAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVA 430
                   ++   K  +I   Q+  +   + + +R    AE +  N + VD  +  V+ A
Sbjct: 317 GDRGRPDFESEIAKHERIIEQQQGAIEDFEAQAERERANAEALYANYDLVDDILSTVQEA 376

Query: 431 LANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK 490
            A   SW+D+     E  + G P A  +  L      +++       ++D E  TL   +
Sbjct: 377 RAEDRSWDDIEERFAEGARQGIPAAEAVVSLDGSEGTVTI-------DIDGERVTLAASE 429

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV----- 545
                      NA R Y   K+ E K+E    A       A+ ++ L+ ++E+       
Sbjct: 430 -------GVEKNADRLYREAKRIEGKKEGAEEA------IAQTRSELEAVEERKAEWEAA 476

Query: 546 ----------------------------ANISHMRKVHWFEKFNWFISSENYLVISGRDA 577
                                        +I   +  HWFE + WF +S+ +LVI GRDA
Sbjct: 477 DAGEAGSGGDESEGSDEDDDEPVDWLAEPSIPVRQSDHWFEDYRWFHTSDGFLVIGGRDA 536

Query: 578 QQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCH 631
             NE +VK+Y+ +GD + HA  HG  +T++K   P +       +P  +  +A  F V +
Sbjct: 537 DDNEDLVKKYLDRGDRFFHAQAHGGPATILKATGPSESYDDDVEIPESSKCEAAQFAVSY 596

Query: 632 SQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           S  W D K     + V   QVSKT  +GE+L  G F IRG + +     + +  G+
Sbjct: 597 SSIWKDGKFAGDVYEVGSDQVSKTPESGEFLEKGGFAIRGDRTYYESTEVGVAVGI 652


>gi|448435995|ref|ZP_21587011.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
           14210]
 gi|445683155|gb|ELZ35558.1| Fibronectin-binding A domain protein [Halorubrum tebenquichense DSM
           14210]
          Length = 743

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 183/398 (45%), Gaps = 45/398 (11%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAA--------FHKL 385
           +  P  L +  +   V F++F+ A+DE++ ++ S+  E      + +A          K 
Sbjct: 270 DVTPFPLAEHENLPSVGFDSFNDAVDEYFYRLGSEDTEAGDAPADASASRPDFEGEIAKQ 329

Query: 386 NKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVK 445
            +I   QE  +   +++     + AEL+  N + VD  I  VR A  + + W+++   + 
Sbjct: 330 ERIIEQQEGAIEGFEEQAQAERERAELLYANYDLVDEVISTVREARESEVPWDEIEETLD 389

Query: 446 EERKAGNPVAGLIDKLYLERNCMSLLLSNNLDE--MDDEE-KTLPVEKVEVDLALSAHAN 502
              + G P A  +  +      +++ L+   D+  +D E+  +    ++E+D +     N
Sbjct: 390 AGAERGIPAAEAVVDVDGGEGTVTVELAEEPDDDAVDGEDGASGGTTRIELDASEGVEVN 449

Query: 503 ARRWYELKKKQESKQEKTITA--HSKAFKAAEKKTRLQILQEKTV--------------- 545
           A R Y+  K+ E K+E  + A   ++A   A K+ + +  +++                 
Sbjct: 450 ADRLYQEAKRVEEKKEGAVAAIESTRAELEAVKERKAEWEEQQAADDGSAQGGDGDDEDD 509

Query: 546 -----------ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
                      A+I       W+++F WF +S  YLVI GR+A QNE +VK+YM K D +
Sbjct: 510 DEEYETDWLSRASIPIRSPDDWYDRFRWFHTSTGYLVIGGRNADQNEELVKKYMDKHDRF 569

Query: 595 VHADLHGASSTVIKNHRPEQPVPPL-----TLNQAGCFTVCHSQAW-DSKMVTSAWWVYP 648
            H   HG   T++K   P +   P+     TL +A  F V +S  W D +    A+ V P
Sbjct: 570 FHTQAHGGPVTILKAAGPSESAEPVDFSEETLREAAQFAVSYSSDWKDGRGAGDAYMVDP 629

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            QVSKT  +GEY+  GSF+IRG + +    P  +  G+
Sbjct: 630 DQVSKTPESGEYIEKGSFVIRGDRTYFEDVPCRIAVGV 667



 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 67/164 (40%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELSSIDLAALVTELNRYEGAKVDKAYLYDDDLLRLKLRDF-------DRGRVELMIEV 56

Query: 63  G--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H     R  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDIKRAHAADPDRVADAPGRPPNFAKMLRNRMSGADFAGVEQYEFDRILTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            ++ EL+ QGN+   D    V+  L + R   + VA  S++ YP
Sbjct: 117 TLVAELFGQGNVAALDETGEVVGSLSTVRLKSRTVAPGSQYEYP 160


>gi|289192132|ref|YP_003458073.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
           FS406-22]
 gi|288938582|gb|ADC69337.1| Fibronectin-binding A domain protein [Methanocaldococcus sp.
           FS406-22]
          Length = 671

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 190/361 (52%), Gaps = 17/361 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++   E   + +F  A+D++++K   +   ++ K+K +    +   I   
Sbjct: 255 YFDVVPIDLKKYDGLEKKYYNSFLEAVDDYFAKFLVKVEVKKEKSKFEREIERQENILKR 314

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   K++ +++    +LI  N + V+  + A+R A   +M W  + ++++E ++  
Sbjct: 315 QLGTLKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQA-REKMDWARIKKIIRENKE-- 371

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI+ +      + + L + +D+   EE+      V +D+  +A  NA  +YE  K
Sbjct: 372 HPILGLIENINENVGEIVVRLKSEVDDNVIEER------VSLDIRKNAFENAESYYEKAK 425

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH----WFEKFNWFISSE 567
           K  +K E    A     K  ++  +    + K   +I   +KV     W+EKF W + + 
Sbjct: 426 KLRNKIEGIENAIELTKKKIDELKKKGEEELKEKESIQMKKKVRKERKWYEKFKWTVIN- 484

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI+G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK +  E  V   TL +   F
Sbjct: 485 GFLVIAGKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTYGRE--VDEETLEEVAKF 542

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+
Sbjct: 543 SVSHSRAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGV 602

Query: 687 L 687
           +
Sbjct: 603 I 603



 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 77/163 (47%), Gaps = 2/163 (1%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  +   DV   V  L+ LI  R    + L       +L+    V E G  E V+ +  
Sbjct: 1   MKSEITNVDVCCVVDELQNLINGRLDKAF-LIDNEQNRELILKIHVPEGGSRELVISIGR 59

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
               +  T Y R+K   P  F + LRK+++  +L  + Q+ +DRI++F F      + ++
Sbjct: 60  YKY-ITLTNYEREKPKLPPSFAMLLRKYLKNAKLIKIEQVNFDRIVIFHFETRDGIYKLV 118

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            EL+  GNI+  ++E  ++  LR  R   + +    ++++P +
Sbjct: 119 AELFGDGNIIFLNNEDIIIAPLRVERWSSRNIIPREKYKFPPQ 161


>gi|410695646|gb|AFV74963.1| serologically defined colon cancer antigen1-like protein, partial
           [Apis florea]
          Length = 273

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 94/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    D +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKYGFTLGCKIGKDFNIEED-MSKLILALEYANDMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  KQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|15790499|ref|NP_280323.1| hypothetical protein VNG1508C [Halobacterium sp. NRC-1]
 gi|169236235|ref|YP_001689435.1| hypothetical protein OE3153R [Halobacterium salinarum R1]
 gi|10580999|gb|AAG19803.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
 gi|167727301|emb|CAP14087.1| conserved hypothetical protein [Halobacterium salinarum R1]
          Length = 703

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 160/652 (24%), Positives = 259/652 (39%), Gaps = 116/652 (17%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R H     +  D    P  F   LR  +       VRQ G+DRI+ F+
Sbjct: 49  RVELLVEVGETKRAHVADPTHVPDAPGRPPNFAKMLRNRLSGADFHAVRQHGFDRILEFE 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F        ++ EL+  GNI + D +  V+  L                    +  R+  
Sbjct: 109 FRREDADTTIVAELFGDGNIAVLDPQREVVDSL--------------------DTVRLQS 148

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A            PDA    +VN                       DLS  +     
Sbjct: 149 RTVAPGRDYGF-----PDA----RVN---------------------PLDLSYEAFAEQ- 177

Query: 231 DGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAK 290
              R     L   L   L +G   +E +    G+    K + V    ++ ++ L  A   
Sbjct: 178 --MRDSDTDLVRTLATQLNFGGLYAEELCSRAGV---EKTTPVADAPESTLEALFDAS-- 230

Query: 291 FEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVK 350
            E  L ++ +GD+ P+ Y           + PT+         +  P+ L++        
Sbjct: 231 -ETLLGNISAGDLDPQVY-----------YEPTDDEDEQGARVDVTPIALDERADLPSDA 278

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRS 406
           FE+F+ ALD++++ +++   E   +  +   F     K  +I   QE  +   + + +  
Sbjct: 279 FESFNDALDDYFTNLDTSEDEDSGETVDRPDFENEIEKQQRIIEQQEQAIEDFEAQAEAE 338

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERN 466
            + AE +  + + VD  + AVR A      W+ +A    +   A   V G   + ++  N
Sbjct: 339 REKAESLYGHYDLVDGLLSAVRQAREAGHGWQQIADTFDD---AAGDVPGA--EAFVGVN 393

Query: 467 CMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA--H 524
             + ++   +D+            V +D +     NA R Y   K+ E K+     A  +
Sbjct: 394 ESAGMIRARIDD----------HTVTLDPSAGVEKNADRLYTEAKRIEEKKAGARAAIEN 443

Query: 525 SKAFKAAEKKTRLQILQEK---------------TVANISHMRKVHWFEKFNWFISSENY 569
           ++A   A K+ R +   E                + ++I    +  W+E+F WF +SE +
Sbjct: 444 TRADLDAVKQRRDEWEAEPESEHEDDADDEVAWLSRSSIPIRHQEQWYERFRWFRTSEGF 503

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQA 624
           LVI GRDA QNE +VK+YM + D + H+  HG   TV+K   P +P     VP     QA
Sbjct: 504 LVIGGRDAGQNEELVKKYMDRYDRFFHSQAHGGPITVLKTSAPSEPSNDIEVPERDARQA 563

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             F V  S  W D +    A+ V P QVSKT  +GEYL  G F +RG + + 
Sbjct: 564 ARFAVACSSVWKDGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAVRGDRTYF 615


>gi|387175434|gb|AFJ66834.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175436|gb|AFJ66835.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175438|gb|AFJ66836.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175440|gb|AFJ66837.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175442|gb|AFJ66838.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175448|gb|AFJ66841.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175450|gb|AFJ66842.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175488|gb|AFJ66861.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175496|gb|AFJ66865.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  147 bits (370), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 94/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + KF +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKKFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|288560094|ref|YP_003423580.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
 gi|288542804|gb|ADC46688.1| RNA-binding protein [Methanobrevibacter ruminantium M1]
          Length = 669

 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 186/370 (50%), Gaps = 38/370 (10%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           ++ ++   + L+Q+ + E   F++F+ A DEFYS           +A  +    K +K  
Sbjct: 259 KVKEDVVAIRLHQYENFEEESFDSFNEACDEFYSSKVKHEITDIQEAVWNKKVGKFSKRL 318

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
             QE  +   ++ ++ S K  EL+  N   V+  +  ++ A      W+++ + +K+ +K
Sbjct: 319 EKQEETLRGFEKTIEDSQKKGELLFTNYVQVENILNVIKDAREKDYGWKEIGKTLKDAKK 378

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMD---DEEKTLPVEKVEVDLALSAHANARRW 506
           +G   A + + +    N     ++ N+D +    D +K++P              NA  +
Sbjct: 379 SGMAEAQIFESMDPLGN-----ITLNIDGISIALDSKKSIP-------------DNAEVY 420

Query: 507 YELKKKQESKQEKTITA--HSKA-FKAAEKKTRLQILQEKTVANISHMRK-----VHWFE 558
           YE  KK + K +    A  ++KA  K  E+K      +EK +ANI   +K     + W+E
Sbjct: 421 YEKAKKAKRKIKGAKIAIENTKAQLKDMEEK------KEKAMANIMVPQKRVKKNLKWYE 474

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           K  WF+SS+  LV+ GRDA  NE +VK+Y+ + DVY+HAD+HGA S V K    +  +  
Sbjct: 475 KLRWFVSSDGTLVVCGRDAGSNEAVVKKYLEQNDVYLHADIHGAPSVVAKISSDK--LNN 532

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
             L + G F    S AW     T   +WV P QVSKT  +GE++  G+F+IRGK+N++  
Sbjct: 533 NLLKELGIFAASFSSAWSRNYGTQDVYWVEPEQVSKTPVSGEFVPKGAFIIRGKRNYIRG 592

Query: 678 HPLIMGFGLL 687
             L +  G++
Sbjct: 593 AKLEIAIGIV 602



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 60/107 (56%), Gaps = 1/107 (0%)

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++++G R+H + Y      +P  F + LRK ++   +  ++Q  +DR++  +    +  
Sbjct: 50  LVIQAGKRIHISQYPLANPQSPPSFPMLLRKRVKGANVVSIQQHNFDRVVEIKMKKDI-T 108

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           + +I+EL+A+GNI+L + E  +L  L+  +  D+ ++    + +P E
Sbjct: 109 YTLIVELFAKGNIILLNEENEILLPLKRKQWSDRDISSKKEYVFPIE 155


>gi|354610742|ref|ZP_09028698.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
 gi|353195562|gb|EHB61064.1| Fibronectin-binding A domain protein [Halobacterium sp. DL1]
          Length = 745

 Score =  146 bits (368), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 175/380 (46%), Gaps = 46/380 (12%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKE----DAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           F+ F+ ALD++++ +++   E+  +A      +A   K  +I   Q+  +   +Q+ +  
Sbjct: 320 FDRFNDALDDYFTNLDTTEEEESGEAVSRPDFEAEIEKQKRIIEQQQQAIDDFEQQAEAE 379

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN-PVAGLIDKLYLER 465
            + AEL+  N + VD  I  V  A      W+D+A   +E   AG+ P A +   +    
Sbjct: 380 REKAELLYGNYDLVDELIGVVADARGAGHGWQDIAERFEE--AAGDVPGADVFVGVNESE 437

Query: 466 NCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA-- 523
             + + + ++  E+D E     VEK           NA R Y   K+ E KQE    A  
Sbjct: 438 GTVRVRIDDHTIELDPESG---VEK-----------NADRIYTEAKRIEEKQEGARAAIE 483

Query: 524 HSKAFKAAEKKTRLQILQE---------KTVANISHMRKV--------HWFEKFNWFISS 566
           +++    + K+ R +   E           +A++  + +          W+E+F WF +S
Sbjct: 484 NTRGDLESAKQRREEWEAEPDEQESEADDELADVDWLSRSSIPIRNQEQWYERFRWFRTS 543

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTL 621
           E +LV+ GRDA QNE +VK+YM + D + H+  HG   TV+K   P +P     VP    
Sbjct: 544 EGFLVLGGRDADQNEELVKKYMDRYDRFFHSQAHGGPITVLKTSAPSEPSNEIEVPETDK 603

Query: 622 NQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
            QA  F VC S  W D +    A+ V P QVSKT  +GEYL  G F IRG + +    P 
Sbjct: 604 RQAAQFAVCCSSVWKDGRGAGDAYMVSPDQVSKTPESGEYLEKGGFAIRGDRTYFRDLPA 663

Query: 681 IMGFGLLFRLDESSLGSHLN 700
               G+    +   LG  ++
Sbjct: 664 EWAVGIACEPNTRVLGGPID 683



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 4/113 (3%)

Query: 55  KVLLLMESG--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R H  A  +  D    P  F   LR  +      +VRQ G+DRI+ F+
Sbjct: 49  RVELLLEVGETKRAHVAAPEHVPDAPGRPPNFAKMLRNRLSGADFHEVRQHGFDRILEFE 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           F        +++EL+  GN+ + D    V+  L + R   + VA  +++ +P+
Sbjct: 109 FRREDQDTTIVVELFGDGNVAVLDQNGEVVDCLETVRLKSRTVAAGAQYGFPS 161


>gi|448659123|ref|ZP_21683091.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
 gi|445760625|gb|EMA11882.1| hypothetical protein C435_18454 [Haloarcula californiae ATCC 33799]
          Length = 717

 Score =  146 bits (368), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 160/663 (24%), Positives = 263/663 (39%), Gaps = 103/663 (15%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P  L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRVD-VTPTPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHMDQENRVHTLKQEV 403
             F  F+ ALD+++     QR E+       +   +A   K  +I   QE  +   + + 
Sbjct: 288 ESFTEFNPALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQERIIQQQEQAIEDFEADA 345

Query: 404 DRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYL 463
           +   + AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L  
Sbjct: 346 EVEREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDG 405

Query: 464 ERNCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-L 509
               ++L +               N DE+  E K +  +K   + AL+A  N R   E +
Sbjct: 406 SEGTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAV 462

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K+++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +
Sbjct: 463 KERREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGF 517

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQA 624
           LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA
Sbjct: 518 LVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQA 577

Query: 625 GCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
             F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  + 
Sbjct: 578 AQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVA 637

Query: 684 FGL 686
            G+
Sbjct: 638 VGI 640


>gi|257388236|ref|YP_003178009.1| fibronectin-binding A domain-containing protein [Halomicrobium
           mukohataei DSM 12286]
 gi|257170543|gb|ACV48302.1| Fibronectin-binding A domain protein [Halomicrobium mukohataei DSM
           12286]
          Length = 708

 Score =  145 bits (367), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 174/371 (46%), Gaps = 45/371 (12%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQ-RAEQQHKAKED-----AAFHKLNKIH 389
            P+ L ++   E   FETF  ALDE++ ++E +  AE+   A  D     +   K  +I 
Sbjct: 265 TPIPLEEYDDVESRAFETFTEALDEYFYEVEREDTAEEIADAGVDRPDFESEIEKYERII 324

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
             Q++ +   + + +   + AEL+    + VD  +  ++ A      W+++    +E ++
Sbjct: 325 QQQQSAIEDFESDAEAEREKAELLYARYDLVDEILSTIQGARTQDTPWDEIEATFEEGKE 384

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
            G   A  ++ L      ++L    ++D++          +V +D  +    NA + Y+ 
Sbjct: 385 QGIAAAEAVEGLDGSEGTVTL----SIDDV----------RVTIDATMGVEKNADQLYQA 430

Query: 510 KKKQESKQE---KTITAHSKAFKAAEKK-----------TRLQILQEKTV-----ANISH 550
            K+ E K+E     I    +  +A E++           T+ Q  +   V     A+I  
Sbjct: 431 AKRIEEKKEGAQAAIEDTREDLEAVERRRENWEAEDTTETQEQTAEADDVDWLSRASIPV 490

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
            R+  W+++F WF +S  +LVI GR+A QNE +VK+Y+ +GD + HA  HG   TV+K  
Sbjct: 491 RRQEPWYDRFRWFRTSNGFLVIGGRNADQNEELVKKYLDRGDKFFHAQAHGGPVTVLKAT 550

Query: 611 RPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            P +      +P     +A  F V +S  W D K    A+ V P QVSKT  +GEYL  G
Sbjct: 551 GPSESSRDVDIPDQDKREAATFAVAYSSVWKDGKYAGDAYMVDPDQVSKTPESGEYLEKG 610

Query: 665 SFMIRGKKNFL 675
            F IRG + + 
Sbjct: 611 GFAIRGDRTYF 621



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 4/113 (3%)

Query: 55  KVLLLMESG--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R H     +  D    P  F + LR  I    L DVRQ  +DRI+ F+
Sbjct: 49  RVELLIEVGENKRAHVVDADHVPDAPGRPPNFAMMLRNRISGGELADVRQFEFDRIMEFE 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           F     +  V+ EL+  GN+ + D    V+  L + R   + VA  S++ +P+
Sbjct: 109 FDRPDASTTVVAELFGDGNVAVLDEHGEVVDCLETVRLKSRTVAPGSQYEFPS 161


>gi|410695644|gb|AFV74962.1| serologically defined colon cancer antigen1-like protein, partial
           [Apis cerana]
          Length = 273

 Score =  145 bits (366), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 157/277 (56%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+     +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGRDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKALLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|387175512|gb|AFJ66873.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175524|gb|AFJ66879.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175526|gb|AFJ66880.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  145 bits (366), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|387175444|gb|AFJ66839.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175452|gb|AFJ66843.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175454|gb|AFJ66844.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175462|gb|AFJ66848.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175464|gb|AFJ66849.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175466|gb|AFJ66850.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175470|gb|AFJ66852.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175474|gb|AFJ66854.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175476|gb|AFJ66855.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175478|gb|AFJ66856.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175480|gb|AFJ66857.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175482|gb|AFJ66858.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175484|gb|AFJ66859.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175486|gb|AFJ66860.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175494|gb|AFJ66864.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175498|gb|AFJ66866.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175506|gb|AFJ66870.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175508|gb|AFJ66871.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175510|gb|AFJ66872.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175514|gb|AFJ66874.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175516|gb|AFJ66875.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175518|gb|AFJ66876.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175520|gb|AFJ66877.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175522|gb|AFJ66878.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175528|gb|AFJ66881.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|395645660|ref|ZP_10433520.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
           4140]
 gi|395442400|gb|EJG07157.1| protein of unknown function DUF814 [Methanofollis liminatans DSM
           4140]
          Length = 635

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 170/348 (48%), Gaps = 39/348 (11%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F+T++AAL+ FY ++ +   +++ K  +     +   I + QE  +   + ++ R+ K  
Sbjct: 255 FDTYNAALESFYPEVPASVTKEEEKRPK---LTREEVIRLQQETAIKKFESKIARAEKAV 311

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
           E I  N   V   I  ++ A +  MSW+++ +++K               L   +  +S+
Sbjct: 312 EAIYTNYPLVQEVITTLQRA-SRSMSWQEIEKILKS------------SDLPAAKAVVSV 358

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
             ++   ++D   +      V + +  S  AN  R+Y+  KK   K+E  + A  +    
Sbjct: 359 HPADAAVDVDVGMQ------VTIHVHESVEANVERYYDQIKKFRKKKEGALAAMERGVPK 412

Query: 531 AEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
            ++K +  +          H+ K  WF +F WF +++  LV+ GRDA QNE +VKRYM  
Sbjct: 413 QKEKPKETL----------HLLKKKWFHRFRWFYTTDGTLVLGGRDASQNEELVKRYMEG 462

Query: 591 GDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS-KMVTSAWWVYPH 649
            D +VHAD+HG S  ++K      P   L  ++  CF   +S AW +       +   P 
Sbjct: 463 KDTFVHADVHGGSVVIVKG-----PTEHLE-DEVACFAASYSNAWKAGHFAADVYIARPD 516

Query: 650 QVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           QVSKT  +GEY++ G+F++RG++ ++   PL +  G+  + D + +G 
Sbjct: 517 QVSKTPESGEYVSRGAFIVRGERQYVRDVPLGVAIGVQLKPDVTVIGG 564



 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 64/148 (43%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+  DV A V  L   + +    +Y    KT   +L    GV       K   L+E+G R
Sbjct: 7   MSGIDVRAMVTELCGHLPLWIGKIYQYDTKTLGIRLNGEGGV-------KHQFLIETGRR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H      +   TP G+ + LRKH+   R+  + Q G  RI     G       +++EL+
Sbjct: 60  AHLVRSLPESPKTPLGYAMFLRKHLEGGRVRAIGQYGLQRIFYIDIGKKTGVLRLVIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
            +GN +L D    +L  L  HR  D+ V
Sbjct: 120 DEGNAVLLDEGGVILKPLWHHRFKDRAV 147


>gi|387175446|gb|AFJ66840.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175458|gb|AFJ66846.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175460|gb|AFJ66847.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175468|gb|AFJ66851.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175472|gb|AFJ66853.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175490|gb|AFJ66862.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175492|gb|AFJ66863.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175500|gb|AFJ66867.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
 gi|387175504|gb|AFJ66869.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 157/277 (56%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   +  F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKXFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|387175502|gb|AFJ66868.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 157/277 (56%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+    AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREALKKLENVKKDHDQRLITLEKTQELDKX--KAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|227828200|ref|YP_002829980.1| hypothetical protein M1425_1938 [Sulfolobus islandicus M.14.25]
 gi|229585429|ref|YP_002843931.1| hypothetical protein M1627_2016 [Sulfolobus islandicus M.16.27]
 gi|227459996|gb|ACP38682.1| protein of unknown function DUF814 [Sulfolobus islandicus M.14.25]
 gi|228020479|gb|ACP55886.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.27]
          Length = 609

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 171/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D +LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTSLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|238620391|ref|YP_002915217.1| hypothetical protein M164_1946 [Sulfolobus islandicus M.16.4]
 gi|238381461|gb|ACR42549.1| protein of unknown function DUF814 [Sulfolobus islandicus M.16.4]
          Length = 609

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 171/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D +LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTSLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|387175456|gb|AFJ66845.1| serologically-defined colon cancer antigen 1-like protein, partial
           [Apis mellifera]
          Length = 273

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/277 (33%), Positives = 158/277 (57%), Gaps = 15/277 (5%)

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
           +LK +L   L +G A+ +H++L  G     K+ +   +E++ +  L+LA+    + +   
Sbjct: 5   SLKKILNPLLEFGSAVIDHVLLKHGFTLGCKIGKDFNIEED-MSKLILALEYANNMMNSA 63

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYD--EFCPLLLNQFRSREFVKFETFDA 356
                + +GYI+ +     K+  PT  G    IY   EF P L  Q++   + +F +FD 
Sbjct: 64  RQN--ISKGYIIQK-----KEIKPTTDGQKDFIYTNIEFHPFLFEQYKDHPYKEFASFDV 116

Query: 357 ALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK--QEVDRSVKMAELIE 414
           A+DE++S +E Q+ + +   +E  A  KL  +  D + R+ TL+  QE+D+  + AELI 
Sbjct: 117 AVDEYFSTMEGQKLDLKALQQEREAXKKLENVKKDHDQRLITLEKTQELDK--QKAELIS 174

Query: 415 YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSN 474
            N   VD AILA++ ALAN+M+W D+  ++KE    G+PVA  I +L LE N +SLLL +
Sbjct: 175 RNQSLVDNAILAIQSALANQMAWPDIKVLLKEAESKGDPVASAIKQLKLETNHISLLLHD 234

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             ++ D+E +  P+  +++DLA +A  NAR++Y  K+
Sbjct: 235 PYEDSDEESELKPM-LIDIDLAHTAFGNARKYYNQKR 270


>gi|218883339|ref|YP_002427721.1| hypothetical protein DKAM_0025 [Desulfurococcus kamchatkensis
           1221n]
 gi|218764955|gb|ACL10354.1| protein of unknown function DUF814 [Desulfurococcus kamchatkensis
           1221n]
          Length = 659

 Score =  144 bits (363), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 175/711 (24%), Positives = 315/711 (44%), Gaps = 154/711 (21%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           ++K  M+  D+ + V     ++ G    N Y      +I KL    GV         ++ 
Sbjct: 5   LLKKAMDILDIYSWVNKYSSVVTGCLIDNAYHYK-SYWILKLRCREGVY--------IVK 55

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGL--GMNA 117
           +E GVR+H +    ++K+   GFT  LR  IR  R+  ++Q  ++RIILF+  +   +  
Sbjct: 56  IEPGVRMHLSQSHPEEKDI-DGFTRFLRSRIRDSRITSIKQPWWERIILFETSIHDKILR 114

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           HYV  EL  +G  ++TD    ++   R  +  D+ +        P+E+            
Sbjct: 115 HYV--ELLPRGQWIITDQSDKIVYASRFMKYRDRSIK-------PSEVY----------- 154

Query: 178 HAALTSSKEPDANEPDKVNEDGNNVSNASKENL-GGQKGGKSFDLSKNSNKNSNDGARAK 236
                      +  P K      N+S + K+ L    KGG+                   
Sbjct: 155 -----------SPPPLK------NLSPSDKDALLNVVKGGRDL----------------- 180

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGL--VPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
              ++T++  A G    ++E  I   GL  V N  +SE+        Q L   V ++   
Sbjct: 181 ---VRTIIS-AWGIPGHIAEEAIHRAGLYGVKNKGVSEI------PYQDLEKLVDEYRRI 230

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETF 354
           +++V++G    +GY++  ++            +  +IY  + P L ++   +     +  
Sbjct: 231 VEEVLNG----KGYLVYGDE------------NKLEIYTSYEPRLFSEVYDKTVKPLDDI 274

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI- 413
           + A+D ++++ E   A   ++A+ +    KL +I    E R+   +QE        E+I 
Sbjct: 275 NTAIDVYFTEYE---AYLDYQARMEEVTEKLREI----EARIK--RQE--------EIIA 317

Query: 414 EYN--LEDVDAAILAVRVALANRMSWEDLARMVKE--ERKAGNPVAGLIDKLYLERNCMS 469
           EYN  +E++++ +  +    +N    E++    +E  E+K    +A           C  
Sbjct: 318 EYNNEIENIESILQTI---YSNYHVAEEILECARETREKKGWEHIA---------EEC-- 363

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHAN-ARRWYELKKKQESKQEKTITAHSKAF 528
               N++ E+  ++  + V+  E  L LS   + +R+  EL++K      KT +A     
Sbjct: 364 ----NSVIEIRKDKGMIVVKLGEKTLELSIREDLSRQVIELERKHGELVRKTESAKKVLE 419

Query: 529 KAAEKKTRLQI---LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           +  ++   + I    +EKT+   S      W+E+F+W  +   +L I GRD  QNE++V+
Sbjct: 420 EMHQQLNTISISMNTEEKTIRKPS---PTFWYERFHWLFTRNGFLAIGGRDQSQNELVVR 476

Query: 586 RYMSKGDVYVHADLHGASSTVIKN---HRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VT 641
           +Y+ + DV++HAD+HG S+ V+K+   H  E  V       A     C+S+AW +     
Sbjct: 477 KYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV------DASYLAACYSKAWKAGFSYI 530

Query: 642 SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
             +WV   QVSKT P GEYL  G+FM+ G KN+L   PL +G G+   +D+
Sbjct: 531 EVYWVPGRQVSKTPPPGEYLPRGAFMVYGSKNYLQV-PLRLGIGIREYVDD 580


>gi|448639710|ref|ZP_21676858.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
 gi|445762237|gb|EMA13458.1| hypothetical protein C436_08831 [Haloarcula sinaiiensis ATCC 33800]
          Length = 717

 Score =  144 bits (362), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 158/661 (23%), Positives = 260/661 (39%), Gaps = 99/661 (14%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P+ L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRID-VTPIPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             F  F+ ALD+++   +  +  E     + D       +  + Q+        E D  V
Sbjct: 288 ESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEAEIEKQQRIIQQQEQAIEDFEADAEV 347

Query: 408 KM--AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
           +   AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L    
Sbjct: 348 EREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDGSE 407

Query: 466 NCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-LKK 511
             ++L +               N DE+  E K +  +K   + AL+A  N R   E +K+
Sbjct: 408 GTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAVKE 464

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           ++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +LV
Sbjct: 465 RREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGFLV 519

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGC 626
           I GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA  
Sbjct: 520 IGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQAAQ 579

Query: 627 FTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  +  G
Sbjct: 580 FAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVG 639

Query: 686 L 686
           +
Sbjct: 640 I 640


>gi|154304164|ref|XP_001552487.1| hypothetical protein BC1G_08352 [Botryotinia fuckeliana B05.10]
          Length = 484

 Score =  144 bits (362), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 68/131 (51%), Positives = 89/131 (67%), Gaps = 9/131 (6%)

Query: 590 KGDVYVHADLHGASSTVIKNH--RPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           KGDVY+HAD+ GA+S +++N+   P+ P+PP TL+QAG   V  S AWDSK   SAWWV 
Sbjct: 2   KGDVYLHADIRGAASVIVRNNPKTPDAPIPPQTLSQAGTLVVVTSSAWDSKAGMSAWWVT 61

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGE 707
             QVSK+APTGE+L  GSF   GKKNFLPP  L++GFG+LF++ + S   H N+ R++  
Sbjct: 62  ADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDESKARH-NKHRLQ-- 118

Query: 708 EEGMDDFEDSG 718
               DD   SG
Sbjct: 119 ----DDSPSSG 125


>gi|53136750|emb|CAG32704.1| hypothetical protein RCJMB04_33f3 [Gallus gallus]
          Length = 198

 Score =  144 bits (362), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 76/164 (46%), Positives = 103/164 (62%), Gaps = 9/164 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A V  LR  L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDIRALVAELRLSLLGMRVNNVYDVDSKTYLIRLQKPDC--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PSGF +K RKH++TRRL  VRQLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSGFAMKCRKHLKTRRLVSVRQLGIDRIVDFQFGSNEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +
Sbjct: 113 IIELYDRGNIVLTDHEYLILNILRFRTDEADDVRFAVRERYPVD 156


>gi|55377795|ref|YP_135645.1| hypothetical protein rrnAC0969 [Haloarcula marismortui ATCC 43049]
 gi|55230520|gb|AAV45939.1| unknown [Haloarcula marismortui ATCC 43049]
          Length = 717

 Score =  144 bits (362), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 158/661 (23%), Positives = 260/661 (39%), Gaps = 99/661 (14%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P+ L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRID-VTPIPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             F  F+ ALD+++   +  +  E     + D       +  + Q+        E D  V
Sbjct: 288 ESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEAEIEKQQRIIQQQEQAIEDFEADAEV 347

Query: 408 KM--AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
           +   AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L    
Sbjct: 348 EREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDGSE 407

Query: 466 NCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-LKK 511
             ++L +               N DE+  E K +  +K   + AL+A  N R   E +K+
Sbjct: 408 GTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAVKE 464

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           ++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +LV
Sbjct: 465 RREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGFLV 519

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGC 626
           I GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA  
Sbjct: 520 IGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQAAQ 579

Query: 627 FTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  +  G
Sbjct: 580 FAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVG 639

Query: 686 L 686
           +
Sbjct: 640 I 640


>gi|344211873|ref|YP_004796193.1| fibronectin-binding A domain-containing protein [Haloarcula
           hispanica ATCC 33960]
 gi|343783228|gb|AEM57205.1| fibronectin-binding A domain protein [Haloarcula hispanica ATCC
           33960]
          Length = 717

 Score =  144 bits (362), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 158/661 (23%), Positives = 260/661 (39%), Gaps = 99/661 (14%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHAADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           F     +  ++ EL+  GN+ + D    V+  L                    E  R+  
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEYGEVIDCL--------------------ETVRLKS 149

Query: 171 RTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSN 230
           RT A        S++      P  V+ DG                               
Sbjct: 150 RTVAPGTPYEFPSAR----FNPMTVDYDGFV----------------------------- 176

Query: 231 DGARAKQPTLKTV--LGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAV 288
             AR K+     V  L   L +G    E +    G+  N+    V+ L+++  + L   +
Sbjct: 177 --ARIKESDADLVRTLATQLNFGGLYGEELCTRAGIDYNVA---VDDLDESDFERLYELI 231

Query: 289 AKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF 348
            +    L++   GD+ P  Y    +   G  +  +      +  D   P+ L ++     
Sbjct: 232 DEMGTRLRE---GDVDPRVYYETLDDGDGAGNGESGDDPDRRRID-VTPIPLAEYEELYS 287

Query: 349 VKFETFDAALDEFYSKIE-SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
             F  F+ ALD+++   +  +  E     + D       +  + Q+        E D  V
Sbjct: 288 ESFTEFNPALDDYFFNFQREEEVEGGETQRPDFEAEIEKQQRIIQQQEQAIEDFEADAEV 347

Query: 408 KM--AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
           +   AEL+  N + VD  +  V+ A  + +SW+D+     E    G   A  +  L    
Sbjct: 348 EREKAELLYANYDLVDDVLSTVQAARQDDVSWDDIEAKFDEGADRGIAAAEAVVSLDGSE 407

Query: 466 NCMSLLLSN-------------NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE-LKK 511
             ++L +               N DE+  E K +  +K   + AL+A  N R   E +K+
Sbjct: 408 GTVTLDIDGTRVTVDAFTGVEKNADELYKEAKRIEEKK---EGALAAIENTREDLEAVKE 464

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
           ++E  +       +   +A ++ T    +Q     +I       W+E+F WF +S+ +LV
Sbjct: 465 RREEWEADDGEDEADNDEAEDEPTDWLSMQ-----SIPTRSTERWYEQFRWFHTSDGFLV 519

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAGC 626
           I GRDA  NE +V++Y+  GD + HA  HG   TV+K   P +P      P  +L+QA  
Sbjct: 520 IGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKATGPSEPSKEVEFPQSSLDQAAQ 579

Query: 627 FTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F V +S  W D K     + V P QVSKT  +GEYL  G F +RG + +    P  +  G
Sbjct: 580 FAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKGGFAVRGDRTYFEGTPAGVAVG 639

Query: 686 L 686
           +
Sbjct: 640 I 640


>gi|325958497|ref|YP_004289963.1| fibronectin-binding A domain-containing protein [Methanobacterium
           sp. AL-21]
 gi|325329929|gb|ADZ08991.1| Fibronectin-binding A domain protein [Methanobacterium sp. AL-21]
          Length = 661

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 185/361 (51%), Gaps = 27/361 (7%)

Query: 333 DEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQ---RAEQQHKAKEDAAFHKLNKIH 389
           ++  PL L  ++  E   FE+F+ A DEFYS I  +      ++  + E   F K   I 
Sbjct: 245 EDVLPLDLLMYKDFEKESFESFNDAADEFYSSIVGEDIVNVNEEVWSGEVGKFEKRLNIQ 304

Query: 390 MDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERK 449
           ++    +   ++ V  S    E I  + + ++  IL +  +     SW ++   VK+ +K
Sbjct: 305 LET---LEKFEKTVKDSKIKGEAIYSDYQAIEN-ILNIIHSARETNSWLEIIATVKKAKK 360

Query: 450 AGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
              P   +I+ +    + M +L + NLD +          +V +D ++    NA  +Y  
Sbjct: 361 DKVPGLEIIESI----DKMGVL-TLNLDGV----------RVNIDSSMGIPENAEIYYNK 405

Query: 510 KKKQESKQEKTITAHSKAFKAAEK-KTRLQILQEKTVANISHMRK-VHWFEKFNWFISSE 567
            KK + K +    A  K  K  +K K + +I  EK +     ++K + W+EK  WF++S+
Sbjct: 406 GKKAKRKIKGVHIAIEKTRKEIDKAKNKREIEMEKVLVPQKRVKKDLKWYEKLRWFVTSD 465

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
             L I GRDA  NEM+VK++M   D+Y H+D+HGASS ++K    E  +P  ++N+   F
Sbjct: 466 GLLAIGGRDATTNEMVVKKHMENRDIYFHSDIHGASSVILKAGEGE--IPERSINETAAF 523

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             C S AW   +  T  +WV+P QVSKT  +GE++  G+F+IRG +N++   PL +  G+
Sbjct: 524 AACFSSAWSKGLGSTDVYWVHPEQVSKTPQSGEFVAKGAFIIRGSRNYMRGLPLTLSLGI 583

Query: 687 L 687
           +
Sbjct: 584 V 584



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 59/110 (53%), Gaps = 1/110 (0%)

Query: 55  KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +V ++ ++G R+HTT Y       P  F + LRK+I+   +  V+Q  +DRI+       
Sbjct: 47  RVDVVFQAGFRVHTTQYPPQNPKIPPNFPMLLRKYIKGGTVTAVKQHNFDRIMRIDIQ-K 105

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
                +++EL+A+GNI+L D E  ++  L+     D+ ++    ++YP E
Sbjct: 106 EEKFSLVVELFAKGNIILLDHEDKIILPLKRKVWQDRKISSKEEYKYPPE 155


>gi|15669822|ref|NP_248636.1| hypothetical protein MJ_1625 [Methanocaldococcus jannaschii DSM
           2661]
 gi|42559938|sp|Q59020.1|Y1625_METJA RecName: Full=Uncharacterized protein MJ1625
 gi|1592339|gb|AAB99643.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
           2661]
          Length = 671

 Score =  143 bits (361), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 191/361 (52%), Gaps = 17/361 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L +++  E   + +F  A+D++++K  ++   ++ K+K +    +   I   
Sbjct: 255 YFDVVPIDLKKYKGLEKKYYNSFLEAVDDYFAKFLTKVVVKKEKSKIEKEIERQENILRR 314

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   K++ +++    +LI  N + V+  + A+R A   +M W  + ++++E ++  
Sbjct: 315 QLETLKKYKEDAEKNQIKGDLIYANYQIVEELLNAIRQA-REKMDWARIKKIIRENKE-- 371

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+ GLI+ +      + + L + +D+   EE+      V +D+  +A  NA  +YE  K
Sbjct: 372 HPILGLIENINENIGEIIIRLKSEVDDKVIEER------VSLDIRKNAFENAESYYEKAK 425

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH----WFEKFNWFISSE 567
           K  +K E    A     K  E+  +    + K   ++   +K+     W+EKF W + + 
Sbjct: 426 KLRNKIEGIENAIELTKKKIEELKKKGEEELKEKESMQMKKKIRKERKWYEKFKWTVIN- 484

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI+G+DA  NE+I+K+Y  K D+  HAD+ GA  TVIK    E  V   TL +   F
Sbjct: 485 GFLVIAGKDAITNEIIIKKYTDKDDIVFHADIQGAPFTVIKTQGKE--VDEETLEEVAKF 542

Query: 628 TVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+
Sbjct: 543 SVSHSRAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGVGV 602

Query: 687 L 687
           +
Sbjct: 603 I 603



 Score = 66.6 bits (161), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 79/163 (48%), Gaps = 2/163 (1%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  +   DV   V  L+ LI  R    + L       +L+    V E G  E V+ + +
Sbjct: 1   MKSEITNVDVCCVVDELQNLINGRLDKAF-LIDNEQNRELILKIHVPEGGSRELVISIGK 59

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
               +  T Y R+K   P  F + LRK+++  +L  + Q+ +DR+++F F      + ++
Sbjct: 60  YKY-ITLTNYEREKPKLPPSFAMLLRKYLKNAKLIKIEQVNFDRVVIFHFETRDGIYKLV 118

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            EL+  GNI+  ++E T++  LR  R   + +    ++++P +
Sbjct: 119 AELFGDGNIIFLNNEDTIIAPLRVERWSTRNIVPKEKYKFPPQ 161


>gi|269864556|ref|XP_002651614.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064197|gb|EED42442.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 320

 Score =  143 bits (361), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/260 (32%), Positives = 133/260 (51%), Gaps = 37/260 (14%)

Query: 436 SWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDL 495
            W   A   K E++ GNP A  I+   L+     + L +              E +++DL
Sbjct: 1   GWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEAIIKLGD--------------ENIKLDL 46

Query: 496 ALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM---- 551
             +   N    Y+ +++   K EKT             K  ++ +Q K      H+    
Sbjct: 47  RKTIDRNIEDIYKTRRRMREKAEKT-------------KIAMRDIQAKLKPRKEHIKVQD 93

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R  +WFEKF++FIS  N ++I G++AQQN+ IV +YM   D+Y H D+ GASS + K   
Sbjct: 94  RVSYWFEKFHFFISENNCVIIGGKNAQQNDQIVNKYMEDRDLYFHCDVKGASSVICKGSA 153

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
                    +  A  F + +S+AWD +++   ++V   QVSKTAP+GE+L  GSFMI+GK
Sbjct: 154 DR------NIEDATYFALVYSKAWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGK 207

Query: 672 KNFLPPHPLIMGFGLLFRLD 691
           KN + P+ L  G G++FR++
Sbjct: 208 KNMVYPYRLEYGVGVVFRIN 227


>gi|227830959|ref|YP_002832739.1| hypothetical protein LS215_2101 [Sulfolobus islandicus L.S.2.15]
 gi|227457407|gb|ACP36094.1| protein of unknown function DUF814 [Sulfolobus islandicus L.S.2.15]
          Length = 609

 Score =  143 bits (361), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|229579837|ref|YP_002838236.1| hypothetical protein YG5714_2060 [Sulfolobus islandicus Y.G.57.14]
 gi|228010552|gb|ACP46314.1| protein of unknown function DUF814 [Sulfolobus islandicus
           Y.G.57.14]
          Length = 609

 Score =  143 bits (360), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|424813826|ref|ZP_18239004.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalina sp. J07AB43]
 gi|339757442|gb|EGQ42699.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalina sp. J07AB43]
          Length = 632

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 96/345 (27%), Positives = 170/345 (49%), Gaps = 30/345 (8%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P  L ++   E + F+TF  A+DE+Y + ++ + +++ +         + +    QE +
Sbjct: 234 SPFPLERYADDESIDFDTFSEAIDEYYYRKKALKEKKEKEEAYQEKKQGIERQKQQQERK 293

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRM---SWEDL-ARMVKEERKAG 451
           +  L++  +++ + AE I  N +     +  ++  + N +    WE    ++ K E +  
Sbjct: 294 IQGLEKSAEQNREKAERIYENYQ----LLQRIKRQIENSLDEDGWEQTRQKLEKSESEDA 349

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           + VA L        N     +S +  E          E ++V L     A A ++Y+  K
Sbjct: 350 DKVASL--------NKQEDFISVDTGE----------ENLKVYLFQDLEATASQYYDKAK 391

Query: 512 KQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLV 571
             E K E    A  +  K  E   + +I  ++ + + +  RK  WFEK+ WF SSE+YLV
Sbjct: 392 NSEEKIESAKEALKETKKELEDLKKEEINTDEVLEDKTQKRKKKWFEKYRWFYSSEDYLV 451

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCH 631
           + GRDAQ N+M+VK++M   D+Y HAD  GA S VIK+    Q     T  +A    +  
Sbjct: 452 LCGRDAQTNDMLVKKHMESNDLYFHADFDGAPSVVIKDG---QEAGEQTRKEAAKAAITF 508

Query: 632 SQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           S+ W + +   +A++V P QV++   +GEYL  G+F+IRG + ++
Sbjct: 509 SKTWKAGIGADTAYYVEPGQVTQNPESGEYLQKGAFVIRGDREYM 553



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 61/112 (54%), Gaps = 9/112 (8%)

Query: 51  GESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           GE ++ LL+     R   T Y RD    P GF ++LRKH+    +E+++Q G+DRI+  +
Sbjct: 41  GEDKERLLIGTD--RAFITKYKRDNPTRPPGFCMELRKHL--GHVEEIKQRGFDRILEIK 96

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            G       +I EL+ +GN +LT  +  ++  LR  +  D+ + +   ++YP
Sbjct: 97  SG----DTKLICELFGKGNFILT-KKGKIIGALREEKWADREIRVGLEYQYP 143


>gi|284998447|ref|YP_003420215.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284446343|gb|ADB87845.1| protein of unknown function DUF814 [Sulfolobus islandicus L.D.8.5]
          Length = 609

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|254583608|ref|XP_002497372.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
 gi|238940265|emb|CAR28439.1| ZYRO0F04004p [Zygosaccharomyces rouxii]
          Length = 1024

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 125/216 (57%), Gaps = 15/216 (6%)

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
           EEK L   KV +DL LSA+ANA  ++ +KK    KQ+K      KAFK  E+K   Q+ Q
Sbjct: 513 EEKGL---KVSIDLGLSAYANASYYFNIKKNNAEKQKKVEKNVEKAFKNIEEKVGRQLKQ 569

Query: 542 E-KTVANISHMRKV---HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA 597
           + K   N+  +RKV   ++FEK +WFISSE +LV+ G+   + ++I  +Y+   DVY+  
Sbjct: 570 KLKETHNV--LRKVRTPYFFEKHHWFISSEGFLVLMGKSDSETDLIYSKYIEDDDVYLFN 627

Query: 598 DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT 657
                +   IKN    + VPP TL QAG   +  S+AW  K+ +S WW +   +SK  P+
Sbjct: 628 TF--GTQVWIKNPDSTE-VPPNTLMQAGILCMSASEAWSKKISSSPWWCFAKNISKFEPS 684

Query: 658 -GEYLTVGSFMIRGK--KNFLPPHPLIMGFGLLFRL 690
               L  G F+++ +  KNF+PP  L+MGFG L+++
Sbjct: 685 DNSVLPPGRFLLKNENNKNFMPPAQLVMGFGFLWKV 720



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 116/485 (23%), Positives = 219/485 (45%), Gaps = 47/485 (9%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+    + LR  L   R +N+Y++  S + ++ K         +    K  +
Sbjct: 1   MKQRISALDLQLLAEELRENLESYRLNNIYNIADSNRQFLLKF--------NKPDSKFSV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T Y R     PSGF +KLRKH++++RL  +RQ+  DRI++ QF  G+  +
Sbjct: 53  VVDCGLRIHLTDYDRPTPPGPSGFVIKLRKHLKSKRLTALRQVHDDRILVLQFADGL--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN++L D    +L+L R          I+  H       +V E+ T     
Sbjct: 111 YLVLEFFSAGNVILLDENKKILSLQR----------IVQEHE-----NKVGEQYTMFD-D 154

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKE-NLGGQKGGKSFDLSKNSNKNSNDGARAKQ 237
           +  +++++ +A EP+  NE+   V    +E     +   K  +    S K   DG R K 
Sbjct: 155 SIFSNNEKTNAREPETYNEE--TVKQWLREAQTKFETESKILNEVVPSGK-KKDGQRKKI 211

Query: 238 PTLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
             +  +    L   P LS  ++       G  P+    +    E   + +L     +++ 
Sbjct: 212 KVM-AIHRLLLSREPHLSSDLLSKNLQMQGFSPSASCLDFVGQESAIVDLLNNTEKEYQS 270

Query: 294 WLQDVISGDIVPEGYILMQ-NKHLGKDHPPTESGSSTQIYDEFCPLLLNQ-FRSREFVKF 351
            L D         GYIL + N +   +    +     + +  F P +  Q       +K 
Sbjct: 271 LLSDSERS-----GYILAKRNVNFNSERDEKDLEFVYETFHPFEPFVAPQNVGDTRTIKI 325

Query: 352 E-TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           E  ++  LD F+S IES +   + + +E  A  +L    +D + ++  L      + +  
Sbjct: 326 EGGYNKVLDSFFSTIESSKYALRIQQQEQQATKRLEAARLDNQKKIQALVDAQSFNEEKG 385

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERNCMS 469
             I  N + V+    AV+  +  +M W  + ++++ E+K GN +A LI   L L+ N ++
Sbjct: 386 HSIIANADLVEQTKSAVQGYVDQQMDWSTIEKLIQVEQKRGNKIAQLIQLPLNLQENKIA 445

Query: 470 LLLSN 474
           + L +
Sbjct: 446 IRLPD 450



 Score = 49.7 bits (117), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 21/31 (67%), Positives = 26/31 (83%)

Query: 894 RGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           RG+KGKLKKM+ KYGDQDEEER +R+ +L  
Sbjct: 820 RGKKGKLKKMQRKYGDQDEEERQMRLNMLGT 850


>gi|448633897|ref|ZP_21674396.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
           29715]
 gi|445750588|gb|EMA02026.1| hypothetical protein C437_16451 [Haloarcula vallismortis ATCC
           29715]
          Length = 717

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 166/382 (43%), Gaps = 47/382 (12%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQ-----QHKAKEDAAFHKLNKIHM 390
            P+ L ++       F  F+ ALD+++     QR E+       +   +A   K  +I  
Sbjct: 275 TPIPLAEYEELYSESFTEFNTALDDYFFNF--QREEEVEGGETQRPDFEAEIEKQKRIIQ 332

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
            QE  +   + + +   + AEL+  N + VD  +  V+ A  + +SW+D+     E    
Sbjct: 333 QQEQAIEDFEADAEAEREKAELLYANYDLVDDVLSTVQAAREDDVSWDDIEAKFDEGADR 392

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
           G   A  +  L      ++L +                 +V VD       NA   Y+  
Sbjct: 393 GIEAAEAVVSLDGSEGTVTLDIEGT--------------RVTVDAFTGVEKNADELYKEA 438

Query: 511 KKQESKQEKTITA--HSKAFKAAEKKTRLQILQEK------------------TVANISH 550
           K+ E K+E  + A  +++    A K+ R +   +                   ++ +I  
Sbjct: 439 KRIEEKKEGALAAIENTREDLEAVKERRDEWEADDGEDEVDEDGSEDEPTDWLSIQSIPT 498

Query: 551 MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNH 610
                W+E+F WF +S+ +LVI GRDA  NE +V++Y+  GD + HA  HG   TV+K  
Sbjct: 499 RSTERWYEQFRWFHTSDGFLVIGGRDADDNEELVQKYLEGGDKFFHAQAHGGPVTVLKAT 558

Query: 611 RPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVG 664
            P +P      P  +L+QA  F V +S  W D K     + V P QVSKT  +GEYL  G
Sbjct: 559 GPSEPSKEVDFPQSSLDQAAQFAVSYSSVWKDGKFAGDVYMVDPDQVSKTPESGEYLEKG 618

Query: 665 SFMIRGKKNFLPPHPLIMGFGL 686
            F +RG + +    P+ +  G+
Sbjct: 619 GFAVRGDRTYFEGTPVGVAVGI 640



 Score = 46.6 bits (109), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 4/113 (3%)

Query: 55  KVLLLMESG--VRLHTT--AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V  L+E G   R H    ++  D    P  F + LR  +    L  V Q  +DRII  +
Sbjct: 50  RVEFLIEVGDVKRAHVADQSHVPDAPGRPPDFAMMLRNRLSGADLVRVEQFEFDRIIELE 109

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           F     +  ++ EL+  GN+ + D    V+  L + R   + VA  + + +PT
Sbjct: 110 FDREDASTTIVAELFGDGNVAVLDEHGEVIDCLETVRLKSRTVAPGTPYEFPT 162


>gi|399576519|ref|ZP_10770274.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
 gi|399237963|gb|EJN58892.1| RNA-binding protein, snrnp like protein [Halogranum salarium B-1]
          Length = 706

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 97/395 (24%), Positives = 176/395 (44%), Gaps = 47/395 (11%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIE-SQRAEQQHKAKE------DAAFHKLNKI 388
            P  L ++   +   F++F+AALD+++ +++ S  AE+     E           K  +I
Sbjct: 257 TPFPLEEYEGLDSAAFDSFNAALDDYFFRLDLSDEAEKGGGGAEANRPDFQEEIEKQKRI 316

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
              QE  +   +++     + AEL+  N E  D  +  VR A    + W D+A  + E  
Sbjct: 317 IQQQEGAIEGFEEQAQEEREKAELLYANYELADEVLSTVRGAREENIPWADIADTLAEGA 376

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           + G P A  ++ +      +++              T+  +++++D+++    NA R Y 
Sbjct: 377 EQGIPAAEAVEDVDGSTGTVTI--------------TIDGQRIDLDVSMGVEKNADRIYT 422

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVA--------------------NI 548
             K+ E K+   + A     +  E   + +   E +                      +I
Sbjct: 423 EAKRVEEKKAGALEAIENTREKLEAVEKRRDEWEASDDEPDEDEDDEEKPDIDWLSRNSI 482

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
               +  W+++F WF +S+ +LVI GR+A QNE IVK+Y++K D++ H   HG   T++K
Sbjct: 483 PIRNQDKWYDRFRWFETSDGFLVIGGRNADQNEEIVKKYLNKHDLFFHTQAHGGPVTILK 542

Query: 609 NHRPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              P +P     +P  +  +A  F V +S  W + +    A+ V   QVSKT  +GEY+ 
Sbjct: 543 ATGPSEPARDVDIPEQSREEAAQFAVAYSSIWKEGRFADDAYMVSADQVSKTPESGEYVE 602

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
            GSF++RG + +       +  GL    D   +G 
Sbjct: 603 KGSFVVRGDRTYYEDVAAEVAVGLRCEPDTRVVGG 637



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L R  G +    Y         KL +        +  +V L +E 
Sbjct: 4   KQELSSIDLAALVTELGRYEGAKVDKAYLYGDDLLRLKLRDF-------DRGRVDLFIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F + LR  +       V Q  +DRI+ F+F  G    
Sbjct: 57  GDIKRAHVVAPEHVPDAPGRPPNFAMMLRNRLNGADFAGVEQFEFDRILTFKFERGDEDT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            ++ EL+ QGN+ + D    V++ L + R   + VA  S++ +P
Sbjct: 117 EIVAELFGQGNLAVLDENREVVSSLETVRLKSRTVAPGSQYEFP 160


>gi|385773877|ref|YP_005646444.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323477992|gb|ADX83230.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 609

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D +LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTSLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     N    D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNVLKTDIEDKIPGKSKIVKTSI 605


>gi|432328279|ref|YP_007246423.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
           sp. MAR08-339]
 gi|432134988|gb|AGB04257.1| putative RNA-binding protein, snRNP like protein [Aciduliprofundum
           sp. MAR08-339]
          Length = 596

 Score =  142 bits (358), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 112/356 (31%), Positives = 176/356 (49%), Gaps = 61/356 (17%)

Query: 333 DEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ 392
           D F P+ L  + S    +F+TF+ AL  +   ++S+RA       E     ++ +   + 
Sbjct: 223 DFFSPIPLKMYPS-SIARFDTFNEALVNY---LKSERA------VESPEVLRIKRRIREI 272

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
           E  +    +E +RS K+ ELI  +  DV+ A+   + A    +S+          R  G 
Sbjct: 273 EETIEKFTREEERSRKIGELIYAHFGDVERALSEAKGA---EISY----------RARGK 319

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLAL--SAHANARRWYELK 510
            +                               L +E V V+L +  S   NA  +YE  
Sbjct: 320 TM------------------------------VLDIEGVPVELRVDKSVGENASLYYEKA 349

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
           KK    +EK   A     KA E+   ++ ++EK    I   R+  WFEK+ WFISSE+ L
Sbjct: 350 KKM---REKIKGAQQALEKAKEELKSVKKMEEKKKREIRKSRRRFWFEKYRWFISSEDIL 406

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI+GRDA+ NE +VK+++   D+Y+HAD+HGA S VIK+   E  +   TL +A  F V 
Sbjct: 407 VIAGRDAKTNEEVVKKHLGDKDLYMHADIHGAPSVVIKSEGKE--IGEKTLYEAAQFAVS 464

Query: 631 HSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
            S+AW++     SA+WVYP QVSK   +GEY+  G++++ G++N++   PL +  G
Sbjct: 465 MSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAWVVHGRRNYIHKVPLRLAVG 520



 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 30/129 (23%), Positives = 61/129 (47%), Gaps = 15/129 (11%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A ++  R  I G     +Y +  + ++FK+         GE+  + + +   +
Sbjct: 1   MLSLDIHAWIEENREKIEGGFFKKIYQVGEREFLFKIYK-------GETRPLYVNLRGWI 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
                   R+    PS F + LRK    R++    QL +DRI++F+     + + ++LEL
Sbjct: 54  FFQ----GRETPMEPSMFVMFLRKRFSGRKILRFYQLNFDRIVVFE---TQDGYQLVLEL 106

Query: 125 YAQGNILLT 133
           +  GN+++ 
Sbjct: 107 FGDGNVVVV 115


>gi|334310399|ref|XP_001370312.2| PREDICTED: nuclear export mediator factor NEMF isoform 1
           [Monodelphis domestica]
          Length = 1094

 Score =  142 bits (358), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 104/170 (61%), Gaps = 9/170 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ D+ A +      L+GMR  N+YD+  KTY+ +L             KV LL+
Sbjct: 1   MKTRFSSVDICAILSEFNASLLGMRVHNIYDVDNKTYLIRLQKPDF--------KVTLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL  V+QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSVKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE 170
           I+ELY +GNI+LT+ E+ +L +LR   D+   V    R +YP +  RVFE
Sbjct: 113 IIELYDKGNIVLTNYEYLILNILRFRSDEADDVKFAVREKYPVDHARVFE 162


>gi|448361523|ref|ZP_21550140.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
           DSM 12278]
 gi|445650542|gb|ELZ03465.1| fibronectin-binding A domain-containing protein [Natrialba asiatica
           DSM 12278]
          Length = 720

 Score =  142 bits (358), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 164/730 (22%), Positives = 286/730 (39%), Gaps = 136/730 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         KL +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGAYEGAKLDKAYLYGDDLVRLKLRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVSQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +                 
Sbjct: 117 RIIVELFGQGNVAVTDGEYKVIDCLETVRLKSRTVVPGSRYEF----------------- 159

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
                   PD            N    S+E  G +      D+ +               
Sbjct: 160 --------PDTR---------TNPLTISREAFGHEMEDSDTDVVR--------------- 187

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDN----AIQVLVLAVAKFEDW 294
           TL T     L +G   +E +    G+   M +++ ++   +    AI+ L L        
Sbjct: 188 TLAT----QLNFGGLYAEELCTRAGVEKAMDIADADEETYDGLYEAIERLAL-------- 235

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREF--VKFE 352
             D  +G+     Y+   ++   +D    + GS+ ++ D   P  L +    +     ++
Sbjct: 236 --DTRNGNFDSRLYLDTGDEDRTEDGD-GDDGSAARVVD-VTPFPLEEHEQDDLDGEPYD 291

Query: 353 TFDAALDEFYSKIESQR------AEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           TF  ALD+++ ++E +        +Q+   +E+ A H+  +I   Q+  +   +Q+    
Sbjct: 292 TFLEALDDYFFRLELEDEEEPDPTDQRPDFEEEIAKHE--RIIEQQQGAIEGFEQDAQNL 349

Query: 407 VKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            + AE +  EY L  VD  +  ++ A      W+++     E  + G   A  +    ++
Sbjct: 350 RENAESLYAEYGL--VDEILSTIQEAREQDRPWDEIEERFAEGAEQGIDAAEAV----VD 403

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
            +    L++ ++D           E +E++       NA R Y   K+   K+E  + A 
Sbjct: 404 VDGSEGLVTVDVD----------GEYIELEAHDGVEQNADRLYTEAKRVAEKKEGALAAI 453

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVH---------------------WFEKFNWF 563
               +  E+  R +   E     ++                           WF++F WF
Sbjct: 454 EDTREDLEEAKRRRDEWEAADGEVADDEAAEDEGEDHDWLADPSIPIRENEPWFDRFRWF 513

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VP 617
            +S+ YLVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P
Sbjct: 514 HTSDGYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELP 573

Query: 618 PLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
             ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F +RG + +  
Sbjct: 574 ESSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYR 633

Query: 677 PHPLIMGFGL 686
             P+    G+
Sbjct: 634 DTPVGAAVGI 643


>gi|11499620|ref|NP_070862.1| hypothetical protein AF2038 [Archaeoglobus fulgidus DSM 4304]
 gi|2648497|gb|AAB89216.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 627

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 172/348 (49%), Gaps = 33/348 (9%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L  + + E   FE+F+ ALD+++SK  ++  E +    E+    KL K    
Sbjct: 220 YLDVVPMDLLYYSNYEKKYFESFNDALDDYFSKKLAEMDELESMKSEE--LEKLKKRLEI 277

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q+  +   + E +   K+ + I  N + V+  I A R A   R SW+++  +V  + K  
Sbjct: 278 QKESLRKFEDEAESFRKIGDAIYENYQMVEKIIEAFRAA-RERKSWDEIKEIVARDEK-- 334

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
             +  L+  +  E+N + + +  + D             VE+++  S H NA  +YE  K
Sbjct: 335 --LKKLVKAIKPEKNAIVVKV-GDFD-------------VELEIKKSIHENADLYYEKAK 378

Query: 512 KQESKQE---KTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K   K E   + I A  +  +  E+K     L++K V +I   RK  W+E + WF +SE 
Sbjct: 379 KAREKAEGVKRAIEATLREMERVEEK-----LEKKLVTSIKVRRKKEWYENYRWFFTSEG 433

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI GR A+ NE IV +++   D++ H    GA + ++K     Q     ++ +A  F 
Sbjct: 434 FLVIGGRTAEMNEEIVAKHLESLDLFFHTQTPGAPAVILKRG---QEAGEESIREAAEFA 490

Query: 629 VCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             +S  W + K     ++V P QVSK+A  GEYL  GSF I GK+N+L
Sbjct: 491 ATYSALWKEGKHAGEVYYVLPEQVSKSAKAGEYLPKGSFYITGKRNYL 538



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 87/184 (47%), Gaps = 24/184 (13%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           ++++ D+ A V+ L+ L G +   VY   P     ++             KV L++E+G 
Sbjct: 3   QLSSFDIKACVRELKELEGGKVEKVYHHPPDEIRIRIYAGR---------KVDLVIEAGR 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T + +     PS F + LRKH+   R++ + Q  +DR+++ +F        ++ EL
Sbjct: 54  RIHLTKFPKQAPRFPSAFAMLLRKHLEGARIKKIEQYDFDRVVVIEFERFGEIRRIVAEL 113

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT---------EICRVFERTTAS 175
           +++GN++L + E  V+  L+        + +   +R+P          E+ RV   +   
Sbjct: 114 FSKGNVVLLNEENRVIMPLKH------TIKVGELYRFPEQRERKDEDREVVRVLAMSGLG 167

Query: 176 KLHA 179
            L+A
Sbjct: 168 GLYA 171


>gi|126178886|ref|YP_001046851.1| hypothetical protein Memar_0936 [Methanoculleus marisnigri JR1]
 gi|125861680|gb|ABN56869.1| protein of unknown function DUF814 [Methanoculleus marisnigri JR1]
          Length = 632

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 173/362 (47%), Gaps = 41/362 (11%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
           P++L     RE  +F TF  ALD FY K    + E    A       +   I   Q   +
Sbjct: 243 PVVLAGDEVRE--RFATFSEALDAFYPKTVGGKEEA---AAGKPRLSQAEVIRRRQAEAI 297

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
              +++++R+ ++ E+I  N   V   I  +  A  NR SW+++ +++KE     NP A 
Sbjct: 298 KGFEKKIERNQRIVEVIYENYTAVAGIIATLDEASKNR-SWQEIEKILKE--NGDNPAAK 354

Query: 457 LIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK 516
           ++  ++     + + LS               E+V++ +  +   N  R+Y+  KK + K
Sbjct: 355 MVRAIHPADAAVDVDLSG--------------ERVKIYVHETIEQNLGRYYDQIKKFKKK 400

Query: 517 QEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           +   + A  +      +  R   LQ+K            W+ +F WF +S+  LVI GRD
Sbjct: 401 KTGALAAMERTVPEKPRTKRNLPLQKK-----------RWYHRFRWFTTSDGTLVIGGRD 449

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           A QNE +VK+YM  GD++VHAD+HG S  ++K            +++A  F   +S AW 
Sbjct: 450 ASQNEELVKKYMEGGDLFVHADVHGGSVVIVKGTTEH-------MDEAVRFAASYSNAWK 502

Query: 637 SKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           +   T+  +   P QVSKTA +GEY+  G+F++RG++ +    PL +  GL    + + +
Sbjct: 503 AGHFTADVYAARPDQVSKTAESGEYVARGAFIVRGERQYFRNAPLGVAIGLQMAPEVAVI 562

Query: 696 GS 697
           G 
Sbjct: 563 GG 564



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 11/144 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M+  D+ A V      + +    +Y    KT         G+  +GE   K L L+E+G 
Sbjct: 7   MSGVDLRALVAEAADRLPLWVGKIYQFDAKTL--------GIRLNGEDRAKYLFLIETGR 58

Query: 65  RLHTTA-YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
           R H TA +    KN PS F + LRKH+   ++  +RQLG +R +    G     +++I E
Sbjct: 59  RAHFTAEFPVPPKNPPS-FAMLLRKHLEGGKVLGIRQLGLERTMSLDIGKRDTTYHLIFE 117

Query: 124 LYAQGNILLTDSEFTVLTLLRSHR 147
           L+ +GN +L D  +T++  L  HR
Sbjct: 118 LFDEGNAVLCDEGYTIIKPLWHHR 141


>gi|385776519|ref|YP_005649087.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323475267|gb|ADX85873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 609

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 169/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  EK  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLEKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     N    D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNVLKTDIEDKIPGKSKIVKTSI 605


>gi|219852170|ref|YP_002466602.1| hypothetical protein Mpal_1566 [Methanosphaerula palustris E1-9c]
 gi|219546429|gb|ACL16879.1| protein of unknown function DUF814 [Methanosphaerula palustris
           E1-9c]
          Length = 629

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/350 (28%), Positives = 169/350 (48%), Gaps = 44/350 (12%)

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           TF  AL+  Y  +      Q+      A   +  +I + QE  + +  +++  +  + +L
Sbjct: 257 TFSEALEAIYPLVTRHEGPQK-----KAPIPREERIRLQQEAALKSFDKKIVLNKAIVDL 311

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLL 472
           I  N   V   I  +  A +  +SW+++  M+KE   + N VA  I  ++     + LLL
Sbjct: 312 IYENYTLVTDVIKTLDAA-SKTLSWQEIGSMLKE---SDNDVARQIAGVHPAEAAVDLLL 367

Query: 473 SNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF-KAA 531
                           +KV + +  S   N  R+Y   KK + K++  ++A  +   K A
Sbjct: 368 DG--------------KKVLIHVHESIEVNLERYYAQVKKFKKKRDGAVSAMERPVAKKA 413

Query: 532 EKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
             K  L  L+++            W+ +F WF +S+N LV+ GRDA QNE +VKRYM  G
Sbjct: 414 TSKVHLTPLKKR------------WYHRFRWFFTSDNCLVLGGRDAGQNEELVKRYMEGG 461

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQ 650
           D +VHAD+HGAS  ++K  + EQ      +++   F   +S AW S   ++  + V P Q
Sbjct: 462 DTFVHADVHGASVVIVKG-KTEQ------MDEVAQFAASYSGAWRSGHFSADVYAVRPDQ 514

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLN 700
           VSKT   GE+++ GSF++RG++ +    PL +  G     + + +G  +N
Sbjct: 515 VSKTPEAGEFVSRGSFIVRGERTYFKSVPLGVAIGYQTEPNAAVIGGPVN 564



 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 71/148 (47%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+  D+ A    LR  + +  + +Y    K    +L          E  K  LL+ESG R
Sbjct: 7   MSGVDLLAVTAELREHLPLWINKIYQYDNKMLSIRLNGE-------EHAKYHLLIESGRR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H      +    P  F + LRK++   R+ ++RQ G  R++ F  G      ++++EL+
Sbjct: 60  IHLATVLPNPPKNPPSFAMLLRKYLEGGRVLEIRQQGLQRVVTFVIGKRDTTLHLVIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
            +GN++L D + T++  L  HR  D+ V
Sbjct: 120 DEGNVILCDDQMTIIKPLWHHRFKDREV 147


>gi|409721207|ref|ZP_11269418.1| RNA-binding protein, snrnp like protein [Halococcus hamelinensis
           100A6]
 gi|448724851|ref|ZP_21707356.1| RNA-binding protein, snrnp like protein [Halococcus hamelinensis
           100A6]
 gi|445785060|gb|EMA35856.1| RNA-binding protein, snrnp like protein [Halococcus hamelinensis
           100A6]
          Length = 697

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 163/711 (22%), Positives = 273/711 (38%), Gaps = 143/711 (20%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L R  G +    Y         KL +        +  +V L++E 
Sbjct: 4   KRELTSVDLAALVTELGRYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELMVEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  +  +  D    P  F   LR  +         Q G+DR++ F+F       
Sbjct: 57  GETKRAHVVSPDHVPDAPGRPPDFAKMLRNRLSGADFAGASQFGFDRVLTFEFEREDRNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
            ++ EL+ +GN+ + DS   V+  L +                     R+  RT A    
Sbjct: 117 RIVAELFGEGNVAVLDSTGEVVDCLNT--------------------VRLQSRTVAPGAQ 156

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
               SS+     +P  V+ +G                      +    +++ D       
Sbjct: 157 YEFPSSR----FDPLAVDYEG---------------------FAARMEESNTD------- 184

Query: 239 TLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDV 298
            L   L   L +G   +E +    G+     + +  + E +A   L  A+ +  + L D 
Sbjct: 185 -LVRTLATQLNFGGLYAEELCTRAGVEKEQAIEDSGEEEYSA---LFDALTRLSERLSD- 239

Query: 299 ISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAAL 358
             GD  P  Y         +D  P +            P  L +    +   FE+F  AL
Sbjct: 240 --GDFDPRIYR--------EDDEPVD----------VTPFPLEENADLDSEGFESFTEAL 279

Query: 359 DEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
           D ++  +E+   E+   + K   +    +  +I   QE  +   +++ +     AE +  
Sbjct: 280 DAYFVDLETTENEEGGGREKPDFEEEIERQQRIIDQQEGAIQGFEEQAEAERAKAESLYA 339

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNN 475
           N   VD  +  VR A      WE++    +E ++ G P A  +  +      +S+     
Sbjct: 340 NYGLVDEILSTVRTARERDTPWEEIEERFEEGKEQGIPAAEAVAGVEASEGTVSV----- 394

Query: 476 LDEMDDEEKTL-PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK 534
             E+D E  TL P E VE         NA R Y   K+   K+E    A       A+ +
Sbjct: 395 --EVDGETITLDPREGVE--------QNADRLYREAKRVVGKKEGAEEA------IADTR 438

Query: 535 TRLQILQEKTVA------------------------NISHMRKVHWFEKFNWFISSENYL 570
             L+ L+++                           +I       W+E+F WF +S+ +L
Sbjct: 439 AELEALEQRREEWEAGGADATDADDDSEDIDWLDRRSIPIRTNEQWYERFRWFHTSDGFL 498

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-----VPPLTLNQAG 625
           V+ GR+A QNE +VK+Y+ +GD ++H    G   TV+K   P +P     +P  TL++A 
Sbjct: 499 VLGGRNADQNEDLVKKYLDRGDRFLHTQARGGPVTVLKATGPSEPTREIDLPQGTLDEAA 558

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            F V +S  W D +     +   P QVSKT  +GEYL  G+F +RG + + 
Sbjct: 559 KFAVSYSSVWKDGRFAGDVYMADPDQVSKTPESGEYLEKGAFTVRGDRTYF 609


>gi|320100405|ref|YP_004175997.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
 gi|319752757|gb|ADV64515.1| protein of unknown function DUF814 [Desulfurococcus mucosus DSM
           2162]
          Length = 665

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 180/375 (48%), Gaps = 45/375 (12%)

Query: 325 SGSST-QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH 383
           SG +T  IY  + PLL     +    + E  + A+D ++++ E++   Q+   +  AA  
Sbjct: 240 SGENTLDIYTSYNPLLFRDVYNNSVKQVEDINTAIDAYFTEYEAELERQRRLDELAAAVK 299

Query: 384 KLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARM 443
           ++      QE  +   ++EV++  ++ +LI  N   V+ A+   R   A +  WE +A+ 
Sbjct: 300 EIEARIKRQEEVIRGYREEVEKIGRILQLIYGNYASVNEALECARSTRAVK-GWEHIAK- 357

Query: 444 VKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANA 503
                       G++  +Y ++  + L ++  + E+    K L  + VE++         
Sbjct: 358 ---------DCPGVVG-VYKDKGIVVLRVNGEVLELSIR-KGLDKQVVELE--------- 397

Query: 504 RRWYELKKKQE--SKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
                 KK+ E   K E  +    +  +   + +    +++KTV  +S      W+E+F+
Sbjct: 398 ------KKRGELVGKIESAVKVLEEMRRQLNEASSTMSIEDKTVRRLS---PTLWYERFH 448

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN---HRPEQPVPP 618
           W  +   +L I GRD  QNEM+V++Y+   DV++HAD+HG S+ V+K+   H  E  V  
Sbjct: 449 WLFTRNGFLAIGGRDQSQNEMVVRKYLGDNDVFIHADIHGGSAVVLKSRGLHSVEDVV-- 506

Query: 619 LTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
                A     C+S+AW +       +WV   QVSKT P+GEYL  G+FMI G KNFL  
Sbjct: 507 ----DASYLAACYSRAWRAGFSFIEVFWVPGSQVSKTPPSGEYLPRGAFMIYGSKNFLSI 562

Query: 678 HPLIMGFGLLFRLDE 692
            PL +  G  F  D+
Sbjct: 563 -PLRLAVGARFFSDD 576



 Score = 42.7 bits (99), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 69/153 (45%), Gaps = 17/153 (11%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M+K  M+  DV A V +    L      N Y      +I KL   SGVT         L 
Sbjct: 1   MLKKAMDILDVYAWVGRHGASLTSCFVDNAYHCK-SYWILKLRCPSGVTH--------LK 51

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA-- 117
           +E  VR+H +    ++K+   GFT  LR  +R  R+  VRQ  ++RI++ + G       
Sbjct: 52  IEPAVRIHLSQSIPEEKDI-DGFTRFLRSRVRDSRILSVRQPWWERIVVLETGAREKPLR 110

Query: 118 HYVILELYAQGNILLTDSEFTVL--TLLRSHRD 148
           HY+  E+  +G  ++ D    ++  T    +RD
Sbjct: 111 HYI--EVVPRGQWVVADPSDRIIYSTRFTEYRD 141


>gi|159906014|ref|YP_001549676.1| hypothetical protein MmarC6_1632 [Methanococcus maripaludis C6]
 gi|159887507|gb|ABX02444.1| protein of unknown function DUF814 [Methanococcus maripaludis C6]
          Length = 680

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 177/356 (49%), Gaps = 25/356 (7%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q       +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLETKEKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  +++A   +M WE +  ++KE +   +PV   I  + 
Sbjct: 331 SRSNHKRGDLIYANYSFVDEIVSTIKLA-REKMGWEGIKNVIKENK--THPVLSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   + L LS       D    L  + V VDL  +A  NA   Y+  KK ++K +  I 
Sbjct: 388 EKNAELMLKLSA------DYGNGLIEDNVPVDLRKNAFENADIVYQKSKKFKNKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRK----------VHWFEKFNWFISSENYLVI 572
              +A K +EKK      +EK  + +   ++          + W+EK  W +    YL++
Sbjct: 441 ---EALKISEKKLAELKDKEKLDSEVLKEKEENIKKKERKVLKWYEKLKWTVIG-GYLIV 496

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   HS
Sbjct: 497 AGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFASSHS 556

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G+L
Sbjct: 557 RAWKLGVGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGML 612



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 82/166 (49%), Gaps = 8/166 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITITEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P +
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFPPQ 161


>gi|354507679|ref|XP_003515882.1| PREDICTED: nuclear export mediator factor NEMF-like, partial
           [Cricetulus griseus]
          Length = 220

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/277 (35%), Positives = 134/277 (48%), Gaps = 60/277 (21%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKSRFSTVDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+ELY +GNI+LTD E+ +L +LR   D+   V    R RYP +  R      A+K    
Sbjct: 113 IIELYDRGNIVLTDYEYLILNILRFRTDEADDVKFAVRERYPVDHAR------AAKPLLT 166

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
           L    E  A+ P                                           K   L
Sbjct: 167 LERLTEVIASAP-------------------------------------------KGELL 183

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLE 277
           K VL   L YGPAL EH +++ G   N+K+ E  KLE
Sbjct: 184 KRVLNPLLPYGPALIEHCLIENGFSGNVKVDE--KLE 218


>gi|300711181|ref|YP_003736995.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
 gi|448296718|ref|ZP_21486771.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
 gi|299124864|gb|ADJ15203.1| hypothetical protein HacjB3_09100 [Halalkalicoccus jeotgali B3]
 gi|445580850|gb|ELY35220.1| hypothetical protein C497_13578 [Halalkalicoccus jeotgali B3]
          Length = 694

 Score =  140 bits (353), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 172/369 (46%), Gaps = 39/369 (10%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE-DAAFHKLNKI 388
           QI D   P+ L++  + E   ++ F+ ALD+++ ++++   E+   + E D    +  +I
Sbjct: 252 QIVD-VTPIALDEHAALEGDSYDRFNEALDDYFFELDTSEDEETDTSPEFDEEIERKKRI 310

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
              QE  +   +QE     + AEL+  N + VD  +  VR AL     WE++    ++  
Sbjct: 311 IDQQEGAIEGFEQEATEERERAELVYANYDTVDEVLTTVRGALEEGRGWEEIEATFEQGA 370

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           + G   A  +     E   +S+ L          E T+ +E     +      NA R Y 
Sbjct: 371 EQGIDAAERVTGFDPENGMVSVDLG---------EATVSLE-----VRSGVEKNADRIYT 416

Query: 509 LKKKQESKQ---EKTITAHSKAFKAAEKKTRLQILQEKTV--------------ANISHM 551
             K+ E K+   E+ I    +   A  ++ R    +++T               A+I   
Sbjct: 417 EAKRIEEKKAGAEEAIADTREELDALRERKRQWETRDETQDDGGEPEEIDWLSRASIPVR 476

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           +   W+E F WF +S+ YLVI GR+A +NE +VK+Y+ +GD + H   HG   TV+K   
Sbjct: 477 KSEEWYEDFRWFHTSDGYLVIGGRNADENEDLVKKYLDRGDRFFHTQAHGGPVTVLKATG 536

Query: 612 PEQPV-----PPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGS 665
           P +P      P  ++ +A  F V +S  W + +    A+ V P QVSKT  +GEY+  G 
Sbjct: 537 PSEPAKDVEFPESSIQEAAQFAVSYSSVWKEGRFADDAYSVSPDQVSKTPESGEYIEKGG 596

Query: 666 FMIRGKKNF 674
           F+IRG + +
Sbjct: 597 FVIRGDRTY 605



 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 68/164 (41%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSIDLAALVGELNEYAGAKVDKAYLYGEDFLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H  A  +  D    P  F   LR  +       V Q  +DRI+ F+F       
Sbjct: 57  GDVKRAHVAAPEHVPDAPGRPPDFAKMLRNRLSGADFTGVSQYEFDRILSFEFEREDGNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I EL+ +GN+ + D    V+  L + R   + VA  +R+++P
Sbjct: 117 TIIAELFGEGNVAVCDETRHVIDSLETVRLKSRTVAPGARYQFP 160


>gi|336122066|ref|YP_004576841.1| Fibronectin-binding A domain-containing protein
           [Methanothermococcus okinawensis IH1]
 gi|334856587|gb|AEH07063.1| Fibronectin-binding A domain protein [Methanothermococcus
           okinawensis IH1]
          Length = 684

 Score =  140 bits (352), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 108/354 (30%), Positives = 175/354 (49%), Gaps = 33/354 (9%)

Query: 352 ETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAE 411
           E F  ALD+++S+   ++  ++ + K      K  +I  +Q   +   +++   +    +
Sbjct: 279 EEFLTALDDYFSRFILKKEIKKEETKLQKMVKKQERILNNQIESLKKYEKQAKENQIKGD 338

Query: 412 LIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLL 471
           LI  N   VD  I  ++ A   +M W  + ++VKE +   NP+   I  +  +   ++L 
Sbjct: 339 LIYANYALVDEIITTLKSA-REKMDWSSIKKIVKENK--DNPILSKIVYINEKNGEITLK 395

Query: 472 LS----NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA---- 523
           LS    N L E D          V +D+  +A  NA  +Y   KK ++K E   TA    
Sbjct: 396 LSADYGNGLIEKD----------VSLDIRKNAFENADNYYSKSKKFKNKIEGVKTAINLS 445

Query: 524 ----HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
                    K   +   L+  +EKT+      +K  W+EKF W + + NYL+I+G+DA  
Sbjct: 446 KEKLEKLKKKEEIEMESLKEREEKTMEK-KERKKRKWYEKFKWTVIN-NYLIIAGKDATT 503

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNH-----RPEQPVPPLTLNQAGCFTVCHSQA 634
           NEM++KRY  K D+  H  + GA  TVIK +        +      LN+   F   HS+A
Sbjct: 504 NEMLIKRYTEKDDIVFHTLMEGAPFTVIKMNGKNIDELNEDEREFLLNETAKFAASHSKA 563

Query: 635 WDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           W   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+   PL +G G++
Sbjct: 564 WRLGLGSADVYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRSVPLELGIGIV 617



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/168 (25%), Positives = 83/168 (49%), Gaps = 12/168 (7%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLL 58
           +K  +   D+   VK L+++I  +    + +     K  I KL     + E G  E   L
Sbjct: 1   MKTELTNVDIHVAVKELQKIINGKLDKAFLVDSQDGKELILKLH----IPEIGTRE---L 53

Query: 59  LMESGVRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
            + +G   + T   Y+R+K   P  F + LRKH++  ++  + Q  +DRI+ F F  G  
Sbjct: 54  AIGTGKYKYITLTEYSREKPKNPPSFAMLLRKHLKNIKITSIEQHNFDRIVKFTFQWGEI 113

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           ++ +++EL+  GNI+L D+E  ++  L+  +   + +     +++P +
Sbjct: 114 SYKLVVELFGDGNIILLDNEDKIILPLKIEKWSTRRIIPKEIYKFPPQ 161


>gi|449066809|ref|YP_007433891.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
 gi|449069082|ref|YP_007436163.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
           Ron12/I]
 gi|449035317|gb|AGE70743.1| hypothetical protein SacN8_03855 [Sulfolobus acidocaldarius N8]
 gi|449037590|gb|AGE73015.1| hypothetical protein SacRon12I_03840 [Sulfolobus acidocaldarius
           Ron12/I]
          Length = 588

 Score =  139 bits (351), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 74/222 (33%), Positives = 126/222 (56%), Gaps = 23/222 (10%)

Query: 485 TLPVEKVEVDL--ALSAHANARRWYELKKKQESKQEKTITAHSK--------AFKAAEKK 534
           TL +  + +D+   L+ + NA ++Y+L K+   K +K      +         FK  E+K
Sbjct: 312 TLKINNISIDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERK 371

Query: 535 TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
             ++I           +RK  W+EK++W I+   ++VI+GRD+ QNE IV++ + + D++
Sbjct: 372 EEIRI----------SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIF 421

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSK 653
           +HAD+ GA++TV+K +  +  V    +  A     C+S+AW + +     +WVY +QVSK
Sbjct: 422 LHADIQGAAATVLKANSGQ--VSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSK 479

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           + P+GEYL  GSFMI G+KNF+    L +  G++ + DE  L
Sbjct: 480 SPPSGEYLAKGSFMIYGRKNFIKNVKLELAIGIMNQNDEVGL 521



 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 74/147 (50%), Gaps = 18/147 (12%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSP-KTYIFKLMNSSGVTESGESEKVLLLMESG 63
           M+  D+ A +   + +I G R  NVY +S  + Y+FKL        S  ++K  L++E G
Sbjct: 1   MSYIDLLAWITENKSIIEGSRIDNVYKISGIQAYLFKL-------HSKNTDK-FLVVEPG 52

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R+H T Y R+K  +  G    +R+ ++ + ++ +  LG +RI      + +    + +E
Sbjct: 53  KRIHFTKYDREK--SSEGEVRLIRELVKEKIIKSINILGNERIA----KIDLIDRKIYIE 106

Query: 124 LYAQGNILLTDSEFTVL--TLLRSHRD 148
           L  +G +++TD    VL  T  +  RD
Sbjct: 107 LLPRGLLVITDGNNKVLFSTEYKEFRD 133


>gi|70606588|ref|YP_255458.1| hypothetical protein Saci_0795 [Sulfolobus acidocaldarius DSM 639]
 gi|68567236|gb|AAY80165.1| conserved Prokaryal protein [Sulfolobus acidocaldarius DSM 639]
          Length = 594

 Score =  139 bits (351), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 74/222 (33%), Positives = 126/222 (56%), Gaps = 23/222 (10%)

Query: 485 TLPVEKVEVDL--ALSAHANARRWYELKKKQESKQEKTITAHSK--------AFKAAEKK 534
           TL +  + +D+   L+ + NA ++Y+L K+   K +K      +         FK  E+K
Sbjct: 318 TLKINNISIDIDPKLTVYKNASKYYDLAKEYSEKAKKAGEVLEELRKKLSELQFKIDERK 377

Query: 535 TRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVY 594
             ++I           +RK  W+EK++W I+   ++VI+GRD+ QNE IV++ + + D++
Sbjct: 378 EEIRI----------SLRKKEWYEKYHWGITRNGHIVIAGRDSDQNESIVRKLLDEKDIF 427

Query: 595 VHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSK 653
           +HAD+ GA++TV+K +  +  V    +  A     C+S+AW + +     +WVY +QVSK
Sbjct: 428 LHADIQGAAATVLKANSGQ--VSEDDILDAAYIAACYSKAWKTGLGSVDVFWVYGNQVSK 485

Query: 654 TAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           + P+GEYL  GSFMI G+KNF+    L +  G++ + DE  L
Sbjct: 486 SPPSGEYLAKGSFMIYGRKNFIKNVKLELAIGIMNQNDEVGL 527



 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 74/147 (50%), Gaps = 18/147 (12%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSP-KTYIFKLMNSSGVTESGESEKVLLLMESG 63
           M+  D+ A +   + +I G R  NVY +S  + Y+FKL        S  ++K  L++E G
Sbjct: 7   MSYIDLLAWITENKSIIEGSRIDNVYKISGIQAYLFKL-------HSKNTDK-FLVVEPG 58

Query: 64  VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
            R+H T Y R+K  +  G    +R+ ++ + ++ +  LG +RI      + +    + +E
Sbjct: 59  KRIHFTKYDREK--SSEGEVRLIRELVKEKIIKSINILGNERIA----KIDLIDRKIYIE 112

Query: 124 LYAQGNILLTDSEFTVL--TLLRSHRD 148
           L  +G +++TD    VL  T  +  RD
Sbjct: 113 LLPRGLLVITDGNNKVLFSTEYKEFRD 139


>gi|302761992|ref|XP_002964418.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
 gi|300168147|gb|EFJ34751.1| hypothetical protein SELMODRAFT_405643 [Selaginella moellendorffii]
          Length = 382

 Score =  139 bits (350), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 116/221 (52%), Gaps = 24/221 (10%)

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           ++TSAWWVY HQVSK APTGEYLTVGS MIRGKKNFLPP+PL+MGFGL FRLD+SS+ +H
Sbjct: 174 IITSAWWVYDHQVSKNAPTGEYLTVGSLMIRGKKNFLPPYPLVMGFGLFFRLDKSSIPAH 233

Query: 699 LNERRVRG-------EEEGMDD--FEDSGHHKENSDIESEKDDTDEKPVAESLSVPNSAH 749
            NERR+R        E E  DD   +D+       ++   K+  D     E  SV  +  
Sbjct: 234 FNERRIRAKGDNEEPEAEIQDDEEIDDASVEDSQDNVHERKESGDGGSTIEKASVMEAEE 293

Query: 750 PAPSHTNASNVDSHEFPAEDKTISNGIDSKIFDIARNVAAPVTPQLEDLIDRALGLGS-- 807
                  +    + E            ++   D     A      ++ L+D+AL L S  
Sbjct: 294 ARSEEAESEEARALE-----------TENAAMDEHEEQAPQSDSDIDSLLDKALELKSVL 342

Query: 808 -ASISSTKHGIETTQFDLSEEDKHVERTATVRDKPYISKAE 847
            + + + K+G+   Q +    D  V+ T   R+K YISKAE
Sbjct: 343 PSQVDTNKYGLGEVQTE-DHVDDAVQETKVAREKQYISKAE 382



 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 68/144 (47%), Positives = 88/144 (61%), Gaps = 22/144 (15%)

Query: 281 IQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLL 340
           +  L+ A+ +FEDWL+ V +GD +PEGYI          HP   +   T    E      
Sbjct: 17  LHSLLEAIKRFEDWLESVTTGDFMPEGYITF--------HPNKTAKKKTAESAE------ 62

Query: 341 NQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK 400
                    KF+TFDA LDEF+SKIE QR +QQ K +ED+A+ KL KI +DQ +RV +LK
Sbjct: 63  --------EKFDTFDAVLDEFFSKIEGQRLDQQRKTQEDSAYSKLEKIRVDQRSRVESLK 114

Query: 401 QEVDRSVKMAELIEYNLEDVDAAI 424
           +EVD++V  AELIEYNL DVD AI
Sbjct: 115 REVDQAVHTAELIEYNLADVDLAI 138


>gi|161527567|ref|YP_001581393.1| hypothetical protein Nmar_0055 [Nitrosopumilus maritimus SCM1]
 gi|160338868|gb|ABX11955.1| protein of unknown function DUF814 [Nitrosopumilus maritimus SCM1]
          Length = 652

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 169/368 (45%), Gaps = 51/368 (13%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           E  P+ L +    E  K  +F   LD  +++    + +    +  D    +L     +QE
Sbjct: 247 EVLPIQLGKIEG-EITKVNSFIEGLDTVFTQNIVDKGKSIQTSGSDKKIKELETQISEQE 305

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             + T+K+   RS  +  +     + +   IL++  + A  +   + A+++ E+   G P
Sbjct: 306 KAIQTVKE---RSKNITNVANSLYDMISKGILSIEDSSAQEIMTANNAKLISEK---GIP 359

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
           +  + D                             EK++VD   S  + A   +   KKQ
Sbjct: 360 LIVIQD-----------------------------EKIKVDTKASLQSIASALFNEAKKQ 390

Query: 514 ESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN-----ISHMRKVHWFEKFNWFISSEN 568
                      SK  K  EK      LQ KT +      +S +RK +W+E++ WF +S+ 
Sbjct: 391 SGAISSIEEIKSKTLKKLEK------LQNKTESEKDTILVSEIRKKNWYERYRWFYTSDG 444

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI GRDA  N  +V++++ K D   H D+ G+   +IK+    Q VP  ++N+    T
Sbjct: 445 FLVIGGRDAASNSAVVRKHLDKNDKIFHGDIFGSPFFIIKDA---QNVPDTSMNEVSHAT 501

Query: 629 VCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           VC S+AW   M   SA+WV P QV K+AP+GE+L  GSF I G++NF+    L +  G++
Sbjct: 502 VCFSRAWREGMYGVSAYWVNPDQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGII 561

Query: 688 FRLDESSL 695
            + D  +L
Sbjct: 562 PQEDGYAL 569



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 63/121 (52%), Gaps = 11/121 (9%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       +S+  +++  SGV L      +  +  P+    +
Sbjct: 27  VSNIYGITKDSILFKLHHTE------KSDLFMMISTSGVWL---TEVKIDQVEPNKLLKR 77

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +L+ + Q+G +RI  F+F  G    +V++ E +  GNILL ++E  +L L  
Sbjct: 78  LRSDLLRLKLKKIEQIGAERIAYFRFE-GFGKEFVLVGEFFGDGNILLCNNEMKILALQH 136

Query: 145 S 145
           S
Sbjct: 137 S 137


>gi|229581503|ref|YP_002839902.1| hypothetical protein YN1551_0858 [Sulfolobus islandicus Y.N.15.51]
 gi|228012219|gb|ACP47980.1| protein of unknown function DUF814 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 609

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 170/337 (50%), Gaps = 32/337 (9%)

Query: 445 KEERKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHA 501
           K  R+ GN +   A  ID+L L+    S  +  NLD ++          +E+D  LSA  
Sbjct: 296 KSYRQLGNIILSKAYEIDQLLLDNRPKSKKIKLNLDGVE----------IELDTLLSATK 345

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA R+++  K+ + K E+ + +  +  +  +K  + +I ++  +  +  +RK  W+EK+ 
Sbjct: 346 NAMRFFDEAKEYKRKIERALESLDELKEKLKKIEKQEIEKQNEIKLV--LRKKEWYEKYR 403

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           W IS   YL+I+G+DA QNE IVK+Y+   D+++HAD+ GA +T+I        +    +
Sbjct: 404 WSISRNGYLIIAGKDASQNESIVKKYLRDKDIFLHADIAGAPATIIIAQE-NNTILEDDI 462

Query: 622 NQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
             A      +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L
Sbjct: 463 YDAAVIAASYSKAWKVGLASVDVFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFVKNVKL 522

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGEEEGMDDFEDSGHHKENSDIESEKDDTDEKPVAE 740
            +  GL+  L E+S+        + G EE +     S   K  + I +  DD  E+   +
Sbjct: 523 QLAIGLI--LSENSVSV------IVGSEETV-----SAKTKYYAII-APGDDDKERIAQK 568

Query: 741 SLSVPNSAHPAPSHTNASNVDSHE-FPAEDKTISNGI 776
            + V + A P     NA   D  +  P + K +   I
Sbjct: 569 IIKVFSRALPDIKGLNALKTDIEDKIPGKSKIVKTSI 605


>gi|260803886|ref|XP_002596820.1| hypothetical protein BRAFLDRAFT_116214 [Branchiostoma floridae]
 gi|229282080|gb|EEN52832.1| hypothetical protein BRAFLDRAFT_116214 [Branchiostoma floridae]
          Length = 168

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/167 (44%), Positives = 111/167 (66%), Gaps = 10/167 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  ++  ++GMR +NVYD+  KTY+ KL+ +         EK +LL+
Sbjct: 1   MKGRFSTVDLRAILTEIKDSVLGMRVANVYDIDNKTYLIKLVKTD--------EKKMLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG RL+ T++   K   PSGF++KLRKH+RTRRL  ++QLG DRI+  QFG    A+++
Sbjct: 53  ESGTRLYATSFDWPKNMMPSGFSMKLRKHLRTRRLISIQQLGSDRIVDMQFGENEAAYHL 112

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICR 167
           I+ELY +GN++LTD E+T+L LLR+  + D  V    R +YP E+ R
Sbjct: 113 IVELYDRGNLILTDYEYTILNLLRTRTEGD-DVRFAVREKYPLELAR 158


>gi|261403479|ref|YP_003247703.1| fibronectin-binding A domain-containing protein [Methanocaldococcus
           vulcanius M7]
 gi|261370472|gb|ACX73221.1| Fibronectin-binding A domain protein [Methanocaldococcus vulcanius
           M7]
          Length = 670

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 190/360 (52%), Gaps = 17/360 (4%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           YD   P+ L ++   E   +E+F  A+D++++K  +    ++ K+K +    +   I   
Sbjct: 257 YD-VVPVNLKKYEDLEKKYYESFLDAVDDYFAKFLTNVEVKKKKSKIEKEIERQENILKR 315

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   K++ +++    +LI  N + V+  + A++ A   +M W  + ++VKE +   
Sbjct: 316 QLETLERYKKDAEKNQIKGDLIYANYQIVENLLSAIKQA-REKMDWARIKKIVKENK--D 372

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +P+  L++ +    N   +++    D  D   KT+  E++ +D+  +A  NA R+YE  K
Sbjct: 373 HPILDLVEDI--RENIGEIIVRLKADVGD---KTIE-ERIPLDIRKNASENAERFYEKAK 426

Query: 512 KQESKQEKTITA---HSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
           K + K E   TA     K  +  +KK    + +E         ++  W+EKF W + +  
Sbjct: 427 KLKHKVEGIKTAIELTKKKIEELKKKEEKTLGEEIPEMKKKKRKERKWYEKFKWTVIN-G 485

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI+G+DA  NE+++K+Y  K D+  HA++ GA  TVIK     + V   TL +   F+
Sbjct: 486 FLVIAGKDAITNEILIKKYTDKDDIVFHANIQGAPFTVIKTQG--RDVDEETLEEVAKFS 543

Query: 629 VCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRG++++    PL +G G+L
Sbjct: 544 VSHSKAWKLGYGAIDTYWVKPEQISKTAESGEYLKRGAFVIRGERHYYRNTPLELGIGVL 603



 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 82/164 (50%), Gaps = 2/164 (1%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           M+K  M   DV   +  L++L+  R    + L  +    +L+    V E G  E V+ + 
Sbjct: 1   MMKTEMTNVDVCGVILELQKLVNSRLDKAF-LVERDNNRELILKLHVPEGGSRELVISVG 59

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +    +  T Y RDK   P  F + LRK+++  +L  + Q+ +DRI +  F      + +
Sbjct: 60  KYKY-ITLTNYERDKPKIPPSFAMLLRKYLKNAKLVKIEQVNFDRIAILHFETREGIYKL 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
           I+EL+ +GN++  +SE T+++ LR      + +    ++++P +
Sbjct: 119 IVELFGEGNVIFLNSEDTIISPLRVEIWSSRKIVPKEKYQFPPQ 162


>gi|254166596|ref|ZP_04873450.1| conserved domain protein [Aciduliprofundum boonei T469]
 gi|197624206|gb|EDY36767.1| conserved domain protein [Aciduliprofundum boonei T469]
          Length = 593

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 123/204 (60%), Gaps = 6/204 (2%)

Query: 483 EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           E  L  EK+++ +  S   NA  +Y+  KK    +EK   A     KA E+  +++  +E
Sbjct: 325 EIELEGEKIKLYVDKSVGENAGIYYDRSKKM---REKIKGAREALEKAKEELKKVKKKEE 381

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K    I   R+  WFEK+ WFISSE  LVI+GRDA+ NE +VK+++  GD+Y+HAD+HGA
Sbjct: 382 KKKKEIRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGA 441

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYL 661
            S VIK+   E  +   TL +A  F V  S+AW++     SA+WVYP QVSK   +GEY+
Sbjct: 442 PSVVIKSEGKE--IGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYV 499

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFG 685
             G++++ GK+N++   PL +  G
Sbjct: 500 ARGAWVVHGKRNYIHKVPLQLAVG 523



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K  R  I G     +Y +  + ++FK+         GE++ +       V
Sbjct: 5   MLSLDIYAWLKENREFIEGGFFKKIYQVGEREFLFKIYK-------GETKPLY------V 51

Query: 65  RLHTTAYARDKKNT--PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            L    +  D++    PS F + LRK    +++    Q  +DRII+F+     N + +I+
Sbjct: 52  NLRGWLFFDDRETPLEPSMFVMFLRKRFSGKKIVKFYQFNFDRIIIFEVP---NGYSLII 108

Query: 123 ELYAQGNILLT 133
           EL+  GNI++T
Sbjct: 109 ELFGDGNIIVT 119


>gi|289596339|ref|YP_003483035.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
 gi|289534126|gb|ADD08473.1| protein of unknown function DUF814 [Aciduliprofundum boonei T469]
          Length = 589

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 123/204 (60%), Gaps = 6/204 (2%)

Query: 483 EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           E  L  EK+++ +  S   NA  +Y+  KK    +EK   A     KA E+  +++  +E
Sbjct: 321 EIELEGEKIKLYVDKSVGENAGIYYDRSKKM---REKIKGAREALEKAKEELKKVKKKEE 377

Query: 543 KTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGA 602
           K    I   R+  WFEK+ WFISSE  LVI+GRDA+ NE +VK+++  GD+Y+HAD+HGA
Sbjct: 378 KKKKEIRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGA 437

Query: 603 SSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYL 661
            S VIK+   E  +   TL +A  F V  S+AW++     SA+WVYP QVSK   +GEY+
Sbjct: 438 PSVVIKSEGKE--IGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYV 495

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFG 685
             G++++ GK+N++   PL +  G
Sbjct: 496 ARGAWVVHGKRNYIHKVPLQLAVG 519



 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K  R  I G     +Y +  + ++FK+         GE++ +       V
Sbjct: 1   MLSLDIYAWLKENREFIEGGFFKKIYQVGEREFLFKIYK-------GETKPLY------V 47

Query: 65  RLHTTAYARDKKNT--PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            L    +  D++    PS F + LRK    +++    Q  +DRII+F+     N + +I+
Sbjct: 48  NLRGWLFFDDRETPLEPSMFVMFLRKRFSGKKIVKFYQFNFDRIIIFEVP---NGYSLII 104

Query: 123 ELYAQGNILLT 133
           EL+  GNI++T
Sbjct: 105 ELFGDGNIIVT 115


>gi|150399105|ref|YP_001322872.1| hypothetical protein Mevan_0351 [Methanococcus vannielii SB]
 gi|150011808|gb|ABR54260.1| protein of unknown function DUF814 [Methanococcus vannielii SB]
          Length = 680

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 182/356 (51%), Gaps = 33/356 (9%)

Query: 347 EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQ-ENRVHTLKQEVDR 405
           E   +E+F  ALDE++S+   ++  +Q + K +    K  +I   Q E +    KQ V  
Sbjct: 275 EIKNYESFLVALDEYFSRFIIKKEIKQAETKINKLVKKQERILNSQLETKEKYEKQSVLN 334

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLER 465
             K  +LI  N  DVD  +  +R A   +M W  +  ++ + +   + + G I  +  + 
Sbjct: 335 QEK-GDLIYANYMDVDEILSTIRSA-REKMDWNAIKEVINKNK--DHQILGKIISVNEKN 390

Query: 466 NCMSLLLSNNLDEMDDEEKTLPVEK-VEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
             +SL LS  LD  +       +EK V +DL  +A  +A  +Y+  KK ++K    ++  
Sbjct: 391 AEISLKLS--LDYGNG-----IIEKNVVLDLRKNAFESADDFYQKSKKFKNK----VSGV 439

Query: 525 SKAFKAAEKKTRLQILQEKTVANISHMRKVH------------WFEKFNWFISSENYLVI 572
            +A K +EKK  L  L+EK   +   +R+              W+EK  W +  + YL++
Sbjct: 440 IEALKISEKK--LNELKEKEKTDSEVLREKEENIKKKEKKLLKWYEKLKWTLI-DGYLIV 496

Query: 573 SGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHS 632
           +G+DA  NEMI+KRY+ K D+  H  + GA  TVIK    E+     TL +   F   HS
Sbjct: 497 AGKDATTNEMIIKRYVEKNDIVFHTLMDGAPFTVIKMKDSEKAPEEKTLFEVSKFAASHS 556

Query: 633 QAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           +AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G+ 
Sbjct: 557 RAWKLGVGSADVYWVMPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALDLGVGIF 612



 Score = 66.2 bits (160), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 79/164 (48%), Gaps = 8/164 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++  V  L+ LIG +    + LS    K  + K+     + E G S+++ +
Sbjct: 1   MKTEMTNVDISVAVNELQSLIGAKFDKAFLLSGSDGKELVLKV----HLPEVG-SKEIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRK++   ++  + Q  +DRI+LF F      +
Sbjct: 56  GLGKYKYITITEYEREKPKNPPSFAMLLRKNLNNIKITSIEQHNFDRIVLFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ +GN +L D    ++  L+  R   + V     +++P
Sbjct: 116 KLIIELFGEGNAILLDKNDVIILPLKIERWSTRNVVPKEIYKFP 159


>gi|284174391|ref|ZP_06388360.1| hypothetical protein Ssol98_07002 [Sulfolobus solfataricus 98/2]
 gi|384433658|ref|YP_005643016.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|261601812|gb|ACX91415.1| protein of unknown function DUF814 [Sulfolobus solfataricus 98/2]
          Length = 609

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 82/245 (33%), Positives = 133/245 (54%), Gaps = 17/245 (6%)

Query: 448 RKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANAR 504
           R+ GN +   A  ID+L L     S  +  N+D ++          +E+D +LSA  NA 
Sbjct: 299 RQLGNFILSKAYEIDQLLLNNRAKSKKVKLNVDGVE----------IELDTSLSATKNAM 348

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           R+++  K+ + K E+ + +  +  +   K  + +I ++  +     +RK  W+EK+ W I
Sbjct: 349 RFFDEAKEYKRKIERALKSLEELKEKLAKIEKQEIEKQNEIK--LTLRKKEWYEKYRWSI 406

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           S   YL+I GRDA QNE IVK+Y+   D+++HAD+ GA +T+I      + +    +  A
Sbjct: 407 SRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIITQ-DNKTISEEDIYDA 465

Query: 625 GCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
                 +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L + 
Sbjct: 466 AVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFIKNVKLQLA 525

Query: 684 FGLLF 688
            GL+ 
Sbjct: 526 IGLIL 530


>gi|150402208|ref|YP_001329502.1| hypothetical protein MmarC7_0281 [Methanococcus maripaludis C7]
 gi|150033238|gb|ABR65351.1| protein of unknown function DUF814 [Methanococcus maripaludis C7]
          Length = 680

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 178/358 (49%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q       +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLETKEKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  +++A   +M W  +  ++KE +   +PV   I  + 
Sbjct: 331 SILNHKRGDLIYANYSLVDEIVSTIKLA-REKMDWNGIKNVIKENK--THPVLSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK ++K    I 
Sbjct: 388 EKNAELTLNLSA------DYGNGLIEDTVPVDLRKNAFENADIVYQKSKKFKNKVHGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEKTVANISHMRK------------VHWFEKFNWFISSENYL 570
              +A K +EKK  L  L+EK   +   +++            + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKEKEKLDSEVLKEKEENIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFAAS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GE+L  G+F+IRGK+NF+    L +G G+L
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEFLKKGAFVIRGKRNFIRSAALELGIGML 612



 Score = 69.3 bits (168), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 83/166 (50%), Gaps = 8/166 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     + TT Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITTTEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P +
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFPPQ 161


>gi|15897146|ref|NP_341751.1| hypothetical protein SSO0195 [Sulfolobus solfataricus P2]
 gi|13813331|gb|AAK40541.1| Membrane conserved hypothetical protein [Sulfolobus solfataricus
           P2]
          Length = 609

 Score =  137 bits (345), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 82/245 (33%), Positives = 133/245 (54%), Gaps = 17/245 (6%)

Query: 448 RKAGNPV---AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANAR 504
           R+ GN +   A  ID+L L     S  +  N+D ++          +E+D +LSA  NA 
Sbjct: 299 RQLGNFILSKAYEIDQLLLNNRAKSKKVKLNVDGVE----------IELDTSLSATKNAM 348

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFI 564
           R+++  K+ + K E+ + +  +  +   K  + +I ++  +     +RK  W+EK+ W I
Sbjct: 349 RFFDEAKEYKRKIERALKSLEELKEKLAKIEKQEIEKQNEIK--LTLRKKEWYEKYRWSI 406

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           S   YL+I GRDA QNE IVK+Y+   D+++HAD+ GA +T+I      + +    +  A
Sbjct: 407 SRSGYLIILGRDASQNESIVKKYLRDKDIFLHADIIGAPATIIITQ-DNKTISEEDIYDA 465

Query: 625 GCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
                 +S+AW   + +   +WV  +QVSK+ P+GEYL  GSFMI GKKNF+    L + 
Sbjct: 466 AVMAASYSKAWKVGLASVDIFWVLGNQVSKSPPSGEYLNKGSFMIYGKKNFIKNVKLQLA 525

Query: 684 FGLLF 688
            GL+ 
Sbjct: 526 IGLIL 530


>gi|45358591|ref|NP_988148.1| hypothetical protein MMP1028 [Methanococcus maripaludis S2]
 gi|44921349|emb|CAF30584.1| unnamed protein product [Methanococcus maripaludis S2]
          Length = 680

 Score =  137 bits (345), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 176/358 (49%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q +     +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLDTKDKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  ++ A   +M W  +  ++KE +   +P+   I  + 
Sbjct: 331 SVSNHKRGDLIYANYSLVDEIVSTIKDA-REKMDWNGIKNVIKENK--THPILSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK ++K +  I 
Sbjct: 388 EKNAELTLKLSA------DYGNGLIEDSVPVDLRKNAFENADIVYQKSKKFKNKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEK------------TVANISHMRKVHWFEKFNWFISSENYL 570
              +A K +EKK  L  L+EK                    + + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKEKEKLDSEVFKEKEEKIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENILFEVAKFAAS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G++
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGII 612



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/166 (24%), Positives = 81/166 (48%), Gaps = 8/166 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++  V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISVAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYMTLTEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P +
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFPPQ 161


>gi|254167318|ref|ZP_04874170.1| conserved domain protein [Aciduliprofundum boonei T469]
 gi|197623581|gb|EDY36144.1| conserved domain protein [Aciduliprofundum boonei T469]
          Length = 589

 Score =  137 bits (345), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 65/139 (46%), Positives = 93/139 (66%), Gaps = 3/139 (2%)

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           I   R+  WFEK+ WFISSE  LVI+GRDA+ NE +VK+++  GD+Y+HAD+HGA S VI
Sbjct: 383 IRKNRRRFWFEKYRWFISSEGILVIAGRDAKTNEEVVKKHLGNGDLYMHADIHGAPSVVI 442

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           K+   E  +   TL +A  F V  S+AW++     SA+WVYP QVSK   +GEY+  G++
Sbjct: 443 KSEGKE--IGEKTLQEAAQFAVSMSKAWNAGFGNLSAYWVYPSQVSKMGESGEYVARGAW 500

Query: 667 MIRGKKNFLPPHPLIMGFG 685
           ++ GK+N++   PL +  G
Sbjct: 501 VVHGKRNYIHKVPLQLAVG 519



 Score = 47.0 bits (110), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + D+ A +K     I G     +Y +  + ++FK+         GE++ +       V
Sbjct: 1   MLSLDIYAWLKENIEFIEGGFFKKIYQVGEREFLFKIYK-------GETKPLY------V 47

Query: 65  RLHTTAYARDKKNT--PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            L    +  D++    PS F + LRK    +++    QL +DRII+F+     N + +I+
Sbjct: 48  NLRGWLFFDDRETPLEPSMFVMFLRKRFSGKKIVKFYQLNFDRIIIFEVP---NGYSLII 104

Query: 123 ELYAQGNILLT 133
           EL+  GNI++T
Sbjct: 105 ELFGDGNIIVT 115


>gi|300176455|emb|CBK23766.2| unnamed protein product [Blastocystis hominis]
          Length = 159

 Score =  137 bits (344), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 74/162 (45%), Positives = 104/162 (64%), Gaps = 10/162 (6%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K RM   DV A V  L+ ++ G + +NVYD+S K YI KLM            +  L+
Sbjct: 1   MPKTRMTALDVRACVNELKGIVLGAKLANVYDVSNKVYILKLMKGGA--------QYNLV 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +ESGVR+H T Y R+K   P+ F+ KLRKHIR RR+E VRQ+G+DR++   FG G   ++
Sbjct: 53  IESGVRVHLTKYLREKNQFPNTFSQKLRKHIRNRRIEAVRQIGFDRVVDLVFGNGETTYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           VI+ELY+ GNI+LT+ EF V+ LLRS+  +D G  +  +H+Y
Sbjct: 113 VIVELYSGGNIILTNYEFEVMFLLRSYTLND-GTQVDVKHQY 153


>gi|340624350|ref|YP_004742803.1| fibronectin-binding A domain-containing protein [Methanococcus
           maripaludis X1]
 gi|339904618|gb|AEK20060.1| Fibronectin-binding A domain protein [Methanococcus maripaludis X1]
          Length = 680

 Score =  136 bits (343), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 176/358 (49%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q +     +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLDTKDKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  ++ A   +M W  +  ++KE +   +P+   I  + 
Sbjct: 331 SISNHKRGDLIYANYSLVDEIVSTIKDA-REKMDWNGIKNVIKENK--THPILSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK ++K +  I 
Sbjct: 388 EKNAELTLKLSA------DYGNGLIEDSVPVDLRKNAFENADIVYQKSKKFKNKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEK------------TVANISHMRKVHWFEKFNWFISSENYL 570
              +A K +EKK  L  L+EK                    + + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKEKEKLDSEVFKEKEEKIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      + +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENIMFEVAKFAAS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G++
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGII 612



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 82/166 (49%), Gaps = 8/166 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITLTEYEREKPRNPPSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P +
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKELYKFPPQ 161


>gi|159040762|ref|YP_001540014.1| hypothetical protein Cmaq_0175 [Caldivirga maquilingensis IC-167]
 gi|157919597|gb|ABW01024.1| protein of unknown function DUF814 [Caldivirga maquilingensis
           IC-167]
          Length = 650

 Score =  136 bits (343), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 132/437 (30%), Positives = 215/437 (49%), Gaps = 49/437 (11%)

Query: 268 MKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVI-----SGDIVPEGYI--LMQNKHLGKDH 320
           +KL E + L D A   L   +    DW ++V      S  ++  G I  +++  HLG+  
Sbjct: 160 LKLIEDSGLSDEA---LAKGLGLGTDWAREVCTRSGCSDPVLVWGSIRGILEVLHLGRLK 216

Query: 321 PPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQR-AEQQHKAKED 379
           P   +  S        P+ L+  +  EF + E+F+ A+D++++ IE +R AE++ K  ED
Sbjct: 217 PVIYASPSY-----VSPIPLSSIKG-EFKEVESFNKAVDDYFTSIEVERVAEERVKGIED 270

Query: 380 AAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIE---YNLEDVDAAILAVRVALANRMS 436
               +L     + E+ V    +E +   +  ELI    Y   ++  A+L  R  +A++ S
Sbjct: 271 E-IARLESSIKELEDTVGGYLREAENLRRRGELIMGRLYEFSELHEALL--RAYMADKDS 327

Query: 437 WEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLA 496
           ++     VK     G  V   ID   L R  + + ++NN              +VE+ L 
Sbjct: 328 FKA---KVKGIEYGGIKV---IDYDPL-RKTVKVTVNNN--------------EVELTLG 366

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKA-FKAAEKKTRLQILQEKTVANISHMRKVH 555
            S    A +++E  K+ E K +      ++   K  E ++R+    E+T A +  +    
Sbjct: 367 ESPGETAAKYFEEAKRLEKKAKAAEAKLTELRGKVNELRSRVNEATEETRAAVRFVASRE 426

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WFE+F WFI+S    V++G+DA QNE IVKRYM+  D+++HAD+ G   TVIK  R  Q 
Sbjct: 427 WFERFRWFITSGGSPVLAGKDAGQNEAIVKRYMNPWDLFLHADVQGGPVTVIKVTR-GQE 485

Query: 616 VPPLTLNQAGCFTVCHSQAWD-SKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
           V    L +A  +   +S+AW         ++V   QVSK AP+GEYL+ G FMI G++ +
Sbjct: 486 VKQQDLIEAAQYAAAYSKAWKLGANSIDVYYVKGEQVSKKAPSGEYLSKGGFMIYGQRGW 545

Query: 675 LPPHPLIMGFGLLFRLD 691
           +    LI+  GL  R+D
Sbjct: 546 VRGVELIISVGL--RID 560


>gi|134045609|ref|YP_001097095.1| hypothetical protein MmarC5_0566 [Methanococcus maripaludis C5]
 gi|132663234|gb|ABO34880.1| protein of unknown function DUF814 [Methanococcus maripaludis C5]
          Length = 680

 Score =  136 bits (342), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 175/358 (48%), Gaps = 29/358 (8%)

Query: 343 FRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQE 402
            +  E   +E+F  ALDE++S+   ++  +Q ++K      K  +I   Q       +++
Sbjct: 271 LKENEIKHYESFLTALDEYFSRFIMKKEIKQAESKLQKLVKKQERILKSQLETKEKYEKQ 330

Query: 403 VDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + K  +LI  N   VD  +  ++ A   +M W  + +++KE +   +P+   I  + 
Sbjct: 331 SLSNHKRGDLIYANYSLVDEIVGTIKDA-REKMDWNGIKKIIKENK--THPILSKIINVN 387

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
            +   ++L LS       D    L  + V VDL  +A  NA   Y+  KK + K +  I 
Sbjct: 388 EKNAELTLKLSA------DYGNGLIEDTVPVDLRKNAFENADIVYQKSKKFKHKVQGVI- 440

Query: 523 AHSKAFKAAEKKTRLQILQEK------------TVANISHMRKVHWFEKFNWFISSENYL 570
              +A K +EKK  L  L++K                    + + W+EK  W +    YL
Sbjct: 441 ---EALKISEKK--LAELKDKEKLDSEILKEKEEKIKKKERKVLKWYEKLKWTVIG-GYL 494

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           +++G+DA  NEM++KRY+ K D+  H  + GA  T+I+    E+      L +   F   
Sbjct: 495 IVAGKDATTNEMLIKRYVEKNDIVFHTLMEGAPFTIIRTEGSEEIPDENVLFEVAKFASS 554

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           HS+AW   + ++  +WV P Q+SKTA +GEYL  G+F+IRGK+NF+    L +G G+L
Sbjct: 555 HSRAWKLGIGSADVYWVRPDQISKTAESGEYLKKGAFVIRGKRNFIRSAALELGIGML 612



 Score = 67.0 bits (162), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 82/166 (49%), Gaps = 8/166 (4%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLS---PKTYIFKLMNSSGVTESGESEKVLL 58
           +K  M   D++A V  L+++I  +    + ++    K  I K+     + E G S ++ +
Sbjct: 1   MKTEMTNVDISAAVSELQKVINGKLDKAFLVNNQDGKELILKV----HIPEIG-SREIAI 55

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            +     +  T Y R+K   P  F + LRKH++  ++  V Q  +DRI++F F      +
Sbjct: 56  GLGKYKYITITEYEREKPRNPHSFVMLLRKHLKNIKITSVAQHNFDRIVIFNFEWNELKY 115

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +I+EL+  GN +L DSE  ++  L+  R   + +     +++P +
Sbjct: 116 KLIIELFGDGNAILLDSEDKIILPLKIERWSTRKIVPKEIYKFPPQ 161


>gi|193787557|dbj|BAG52763.1| unnamed protein product [Homo sapiens]
          Length = 481

 Score =  135 bits (341), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 56/86 (65%), Positives = 70/86 (81%)

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +C+S AWD++++TSAWWVY HQVSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  
Sbjct: 1   MALCYSAAWDARVITSAWWVYHHQVSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSF 60

Query: 687 LFRLDESSLGSHLNERRVRGEEEGMD 712
           LF++DES +  H  ER+VR ++E M+
Sbjct: 61  LFKVDESCVWRHQGERKVRVQDEDME 86


>gi|257053989|ref|YP_003131822.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
           12940]
 gi|256692752|gb|ACV13089.1| Fibronectin-binding A domain protein [Halorhabdus utahensis DSM
           12940]
          Length = 707

 Score =  135 bits (341), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 111/478 (23%), Positives = 201/478 (42%), Gaps = 69/478 (14%)

Query: 243 VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGD 302
            L   L +G    E +    G+  N  + E     D   + L  AV      L++   GD
Sbjct: 188 TLATQLNFGGLYGEELCSRAGVPYNQAIGETT---DAEFEALYDAVNDLSTRLRE---GD 241

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           + P  Y     +    D                 P+ L ++       F++F+ AL+ ++
Sbjct: 242 LDPRLYFETDEQETPVD---------------VTPVPLVEYEDTPGESFDSFNDALEAYF 286

Query: 363 SKIESQRAEQQ---HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
             +E +  E++   ++   +A   K  +I   QE  +   +++ +   + AEL+  N + 
Sbjct: 287 LGLEQEPDEEETGSNRPDFEAEIEKQKRIIQQQEGAIEDFEEDAEAEREKAELLYANYDL 346

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VD  +  V+ A A    W+++   +   +  G P A  +  +      +++ + ++    
Sbjct: 347 VDEVLSTVQDARAAETPWDEIEATLSAGKDQGIPAAEAVRDVDGSEGTVTVQIDDH---- 402

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK-------------QEKTITAHSK 526
                      +E+D       NA R Y+  K+ E K             Q + +    +
Sbjct: 403 ----------HIELDADTGVEKNADRLYQEAKRIEGKKAGAEEAIANTREQLEAVKQRRE 452

Query: 527 AFKAAEKKTRLQILQEK-----------TVANISHMRKVHWFEKFNWFISSENYLVISGR 575
           A++A++         E            T  +I       W+E+F WF +S+ +LVI GR
Sbjct: 453 AWEASDGDDGGDGSGETHEDDQEDVDWLTRESIPIRTSEEWYERFRWFTTSDGFLVIGGR 512

Query: 576 DAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP-EQP-----VPPLTLNQAGCFTV 629
           +A QNE +VK+Y+ +GD++ H   HGA +T++K   P E P     +P  +  +A  F +
Sbjct: 513 NADQNEELVKKYLDRGDLFFHTQAHGAPATILKATGPSEAPPDDISIPESSREEAAQFAI 572

Query: 630 CHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            +S  W + K     + V P QV+KT  +GEYL  GSF IRG + +    P+ +  G+
Sbjct: 573 SYSTLWKEGKYAGDVYCVGPDQVTKTPESGEYLEKGSFAIRGDRTYYDDTPVGVAVGI 630



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 70/164 (42%), Gaps = 9/164 (5%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSS-GVTESGESEKVLLLME 61
           K  + + D AA    LR  +G      Y         KL   + G  E      +L+ ++
Sbjct: 4   KRELTSVDCAALAGELRAFVGAYHEKSYLYDDDLLRLKLSGPNFGRIE------LLIEVD 57

Query: 62  SGVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
              R+HT    R  +    P  F + LR  +   +LE V Q  +DRI+  +F    +   
Sbjct: 58  DPKRVHTITPDRVPNAPERPPNFAMMLRNRLEGAQLESVEQFEFDRILQLRFERSDDHTT 117

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           +I EL+  GN+ + D   TV+  L + R   + V   S++ +P+
Sbjct: 118 IIAELFGDGNLAVLDETDTVIDSLETVRLQSRTVTPGSQYEFPS 161


>gi|407461558|ref|YP_006772875.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407045180|gb|AFS79933.1| hypothetical protein NKOR_00035 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 651

 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/236 (33%), Positives = 128/236 (54%), Gaps = 21/236 (8%)

Query: 471 LLSNNLDEMDDEEKTLPV-----EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHS 525
           +LSNN  ++  E K +P+     EK+++++     + A   +   KKQ           S
Sbjct: 344 ILSNNNAKLITE-KGIPLIVIQDEKIKINIKAPLQSIASTLFNEAKKQSGAISSIEEIKS 402

Query: 526 KAFKAAEKKTRLQILQEKTVAN-----ISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
           K  K  EK      LQ KT +      +S +RK +W+E++ WF +S+ +LVI GRDA  N
Sbjct: 403 KTLKKLEK------LQNKTDSEKDSVLVSEIRKKNWYERYRWFYTSDGFLVIGGRDAASN 456

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
             +V+++++K D   H D+ G+   +IK+    Q  P  ++N+    TVC S+AW   M 
Sbjct: 457 SAVVRKHLAKNDKIFHGDIFGSPFFIIKDA---QNAPDTSMNEVAHATVCFSRAWREGMY 513

Query: 641 -TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
             SA+WV P QV K+AP+GE+L  GSF I G++NF+    L +  G++ + D  +L
Sbjct: 514 GVSAYWVNPEQVKKSAPSGEFLPKGSFTIEGQRNFIKSGNLKLAVGIIPQEDGYAL 569



 Score = 43.5 bits (101), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 79/164 (48%), Gaps = 19/164 (11%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       +S+  +++  SGV L      +  +  P+    +
Sbjct: 27  VSNIYGITKDSILFKLHHTE------KSDLFMMISTSGVWL---TEVKIDQVEPNKLLKR 77

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +L+ ++Q+G +RI  F F  G    +V++ E +  GNILL + E  +L L  
Sbjct: 78  LRSDLLRLKLKKIKQIGAERIAYFTFE-GFGKEFVLVGEFFGDGNILLCNDEMKILALQH 136

Query: 145 S----HRDDDKGVAIMSRHRYPTEICRV----FERTTASKLHAA 180
           S    HR    G+  ++  +   +I  +    FE    ++L AA
Sbjct: 137 SIDVRHRKLSVGLEYVTPPQSGLDIFNLSESDFEDIKTTELVAA 180


>gi|48478297|ref|YP_024003.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
 gi|48430945|gb|AAT43810.1| hypothetical protein PTO1225 [Picrophilus torridus DSM 9790]
          Length = 611

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 58/130 (44%), Positives = 94/130 (72%), Gaps = 3/130 (2%)

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R  +WFE ++WF SS N++V++GRDA+ NE ++K++M + D+YVHADL+GA ST+IK+  
Sbjct: 409 RPRYWFETYHWFFSSNNFMVLAGRDAKTNESLIKKHMEENDIYVHADLYGAPSTLIKSEG 468

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
               +   T+ +A  F +  S+AW + + + +A+WVYP QVSKT  +GE+++ GS++IRG
Sbjct: 469 --NTIDERTIREACIFAISFSRAWPAGIGSGTAYWVYPSQVSKTPESGEFISKGSWVIRG 526

Query: 671 KKNFLPPHPL 680
           K+N++   PL
Sbjct: 527 KRNYIFDLPL 536


>gi|310752298|gb|ADP09459.1| FbpA and DUF814 domain protein [uncultured marine crenarchaeote
           E48-1C]
          Length = 608

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 178/368 (48%), Gaps = 35/368 (9%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF----HKLNKIHMDQ 392
           P  L  +   E   +E+F+  LDEFY ++ +         +E  +      +L +I   Q
Sbjct: 180 PFRLKCYADFEHKCYESFNETLDEFYVRVGAIEKALTVATEEVGSLKQEMERLKRIIEMQ 239

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
           E    T K  +  + +M ++I  +  +++A +           +W+++   V  E+K G 
Sbjct: 240 EEACATAKTNMQENKRMGDIIHVHAGELEALLHRFLAGREEGKAWDEIVSEVLAEKKTGV 299

Query: 453 PVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKK 512
             +G +    +  +   L++   LD +          +  + L  S   NA R+Y   K+
Sbjct: 300 KSSGFL----VSFDDKHLVVDVCLDGL----------QFGLSLRRSLFDNAARFYRRYKR 345

Query: 513 QESKQEKTITAHSKAFKAAEK-KTRLQILQEKTVANISHM--------RKVH---WFEKF 560
            + K +    A  ++ +  E+ + RL+  + +   ++S +        RK+    WFEKF
Sbjct: 346 AKQKLDGAKIAMEESHRKLEEVEARLE--KAEAAGSVSPVEVIEEVAERKIERKKWFEKF 403

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
            WF+SS+  LV++G+DA  NE++V +Y + GD+  HAD+ GA   V+K +  E+P     
Sbjct: 404 RWFVSSDGVLVVAGKDAVSNEVLVNKYATDGDIVFHADVVGAPFVVVKMN-GEKPSEE-C 461

Query: 621 LNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           L QAG F    S+ W     +   +WV P Q+ K+A +G+Y+  G F++RGK+N++   P
Sbjct: 462 LRQAGVFAASFSRGWREGFASVDVYWVKPDQLDKSAKSGQYVPKGGFVVRGKRNWMRGSP 521

Query: 680 LIMGFGLL 687
           L +  G++
Sbjct: 522 LRLAVGIV 529



 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
           + LRK++R  RL +V Q  ++R+++F F        + LEL+  GN +L D + T+L  L
Sbjct: 1   MGLRKYLRNCRLANVEQSDFERVVIFTFETWAGEMRLYLELFGGGNAILVDEKGTILQAL 60

Query: 144 RSHRDDDKGVAIMSRHRY-------PTEICR 167
              R  D+ +      R+       P  +CR
Sbjct: 61  TYKRMRDRNIIRDQIFRFAPPVGKNPFRVCR 91


>gi|327401161|ref|YP_004342000.1| fibronectin-binding A domain-containing protein [Archaeoglobus
           veneficus SNP6]
 gi|327316669|gb|AEA47285.1| Fibronectin-binding A domain protein [Archaeoglobus veneficus SNP6]
          Length = 637

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/349 (29%), Positives = 174/349 (49%), Gaps = 39/349 (11%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L  +   E   F TF+ ALDE+Y++  S+  +++ +        +L K+   
Sbjct: 236 YIDVLPIELQIYDGLERKYFPTFNEALDEYYARRISEVKQEESE--------ELKKLKAR 287

Query: 392 QENRVHTLKQ---EVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
            E ++ T K+   E++R     + +  N + ++  + A R A   + SW+++ ++V+   
Sbjct: 288 LEKQLETKKEFENEMERYRAAGDAVYENYQLLEQILEAFRQARQQK-SWDEIKKIVR--- 343

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
            A   ++ L+ +++ E+N + + +   ++   D  K LP               A  +YE
Sbjct: 344 -AHPKLSKLVVEIHPEKNSVVVNIGPKIELALD--KNLP-------------QIADVYYE 387

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ-EKTVANISHMRKVHWFEKFNWFISSE 567
             KK   K E  + A  K     E+  R++ L+ +K V  +   RK  WFE+F WFI+S+
Sbjct: 388 RAKKVRQKLEGLLKAIEKT---KEEMQRVEELEAKKYVKGLRVARKREWFERFRWFITSD 444

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LVI GR+A  NE IV +YM   D++ H    GA +TV+K     Q  P  ++ +A  F
Sbjct: 445 GFLVIGGRNAAMNEEIVSKYMEPKDLFFHTQTPGAPATVLKLG---QEAPETSIIEAAQF 501

Query: 628 TVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
              +S  W + K     ++V P QV + A  GEYL  GSF I GK+N+L
Sbjct: 502 AATYSALWKEGKYSGEVYYVKPEQVKRAAKHGEYLARGSFYIEGKRNYL 550



 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 78/139 (56%), Gaps = 10/139 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M++AD+AA V  L++L+G +   +Y   P     K+  + G  +        L++E+G R
Sbjct: 4   MSSADIAACVSELQQLVGGKVEKIYHHPPDEIRVKIY-AGGRKD--------LILEAGRR 54

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T + R+    PS F + LRKH+   R+  + Q  +DR+++ +       +++I+EL+
Sbjct: 55  IHLTKFPRESPRIPSSFAMLLRKHLEGGRVRKIEQHDFDRVVVIEVE-REKRNFIIVELF 113

Query: 126 AQGNILLTDSEFTVLTLLR 144
           ++GN++L D  F ++  L+
Sbjct: 114 SKGNVILADESFRIIMPLK 132


>gi|407465827|ref|YP_006776709.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
 gi|407049015|gb|AFS83767.1| hypothetical protein NSED_09905 [Candidatus Nitrosopumilus sp. AR2]
          Length = 648

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 172/362 (47%), Gaps = 55/362 (15%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           E  P+ L +    E  +  +F   LD  +++   ++ +    +  D    +L     +QE
Sbjct: 244 EVLPIRLGKLEG-EITQVNSFIEGLDTVFTENIIEKGKSVQSSGSDKKIKELQTQISEQE 302

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             + T+K+   RS  +  +     E V   I+++   LA  +  ++ A+++ E+      
Sbjct: 303 KAIETVKE---RSKNITNVANSLFEMVSKGIISIEDNLAQEILAKNNAKLINEK------ 353

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
                         +SL++  +             EK++++      + A   ++  KKQ
Sbjct: 354 -------------GISLIVVQD-------------EKIKINTQSPLQSIASVLFDEAKKQ 387

Query: 514 ESKQEKTITAHSKAFKAAEKKT--RLQILQEKT-----VANISHMRKVHWFEKFNWFISS 566
            S           + KA ++KT  RL+  Q KT     +  +S +RK +W+E++ WF ++
Sbjct: 388 SSA--------IFSIKAIKEKTEKRLEKFQSKTDSEKDLIVVSEIRKKNWYERYRWFFTT 439

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           + +L I GRDA  N  ++++++ K D   H D+ G+   ++K+    Q  P  ++N+   
Sbjct: 440 DGFLTIGGRDAASNSAVIRKHLDKNDKIFHGDIFGSPFFILKDS---QNAPDTSMNEVAH 496

Query: 627 FTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
            TVC S+AW   M   SA+WVYP Q+ K+AP+GE+L  GSF I G++NF+    L +  G
Sbjct: 497 ATVCFSRAWREGMYGVSAYWVYPDQIKKSAPSGEFLPKGSFTIEGQRNFIKSDTLRLAVG 556

Query: 686 LL 687
           ++
Sbjct: 557 IM 558



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 11/124 (8%)

Query: 23  GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGF 82
           G   SN+Y ++  + +FKL ++       +S+  +++  SGV L +    +  +  P+  
Sbjct: 21  GYYVSNIYGITKDSILFKLHHTE------KSDLFMMISTSGVWLTS---VKIDQMEPNRL 71

Query: 83  TLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLT 141
             +LR  +   +L+ + Q+G +RI  F F  G    +V++ E +  GNILL ++E  +L 
Sbjct: 72  LKRLRSDLLRLKLKKIEQIGAERIAYFTFE-GFGKEFVLVGEFFGDGNILLCNNEMKILA 130

Query: 142 LLRS 145
           L  S
Sbjct: 131 LQHS 134


>gi|284161856|ref|YP_003400479.1| fibronectin-binding A domain-containing protein [Archaeoglobus
           profundus DSM 5631]
 gi|284011853|gb|ADB57806.1| Fibronectin-binding A domain protein [Archaeoglobus profundus DSM
           5631]
          Length = 626

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 118/437 (27%), Positives = 192/437 (43%), Gaps = 66/437 (15%)

Query: 243 VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGD 302
           +L    G G   +E   L  G+  N    +++  E   I   ++++       + V  GD
Sbjct: 161 LLAVKCGLGGLFAEETCLRAGIDKNKLGKDLSDEEFERIYRAMMSI------FEPVFKGD 214

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           I P              H   + G     Y +  P+ L  +R  E   FE+F+ ALDEFY
Sbjct: 215 IKP--------------HIVIKDGE----YIDVLPIELEYYRDYEKKYFESFNKALDEFY 256

Query: 363 SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDA 422
           SK  ++  E++          KL K    Q      L++E ++   + + I  N   ++ 
Sbjct: 257 SKTIAETEEEES-----EELKKLRKRLEIQLESKRKLEEEAEKFKSLGDFIYENYATIEK 311

Query: 423 AILAVRVALANRMSWEDL---ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           A+ A R A   +MS+E+    A+ +K  +  G             ++ + ++L+      
Sbjct: 312 ALNAFRQA-KEKMSFEEFKAKAKSLKFVKDVG-------------KDYVVIVLNGK---- 353

Query: 480 DDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI 539
                     ++ +DL    H  A  +YE  KK   K E  + A  K  K  E+  R + 
Sbjct: 354 ----------EIRLDLDKDIHGIAESYYEKAKKAREKLEGLLIAIEKTKKEIEEAERKEK 403

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           L  K  A I  +RK  WFE+F WFI+S+ +L I GR+AQ NE IV +Y+   D++ H   
Sbjct: 404 L--KYTAPIRIVRKREWFERFRWFITSDGFLAIGGRNAQMNEEIVSKYLEPKDLFFHTQT 461

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTG 658
            GA + V+K        P +++ +   F   +S  W   + +   ++V   QV K+A  G
Sbjct: 462 PGAPAVVLKKG---LEAPEISIVETAQFAAIYSSLWKQGLHSGEVYYVTADQVKKSAKAG 518

Query: 659 EYLTVGSFMIRGKKNFL 675
           EYL  GSF I GK+N++
Sbjct: 519 EYLPKGSFYIVGKRNYI 535



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 74/142 (52%), Gaps = 13/142 (9%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M++ D+   V+ L+ LIG +   +Y   P     K      +   G  +   L++E+G R
Sbjct: 1   MSSLDIYVCVRELQELIGGKVEKIYHYPPNEIRIK------IYAKGRKD---LIIEAGRR 51

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +H T + ++    PS F + LRKH+  +R+E + Q  +DR+++  FG       ++ EL+
Sbjct: 52  IHLTIFPKESPKFPSPFAMLLRKHLEGKRIEKIWQHDFDRVVVIDFG----DRKIVAELF 107

Query: 126 AQGNILLTDSEFTVLTLLRSHR 147
           A+GN+ LTD  F V+  +   R
Sbjct: 108 AKGNVALTDENFDVIMDIHGKR 129


>gi|269865041|ref|XP_002651784.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063882|gb|EED42272.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 243

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 68/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKVQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVVCKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164


>gi|389860344|ref|YP_006362583.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
 gi|388525247|gb|AFK50445.1| hypothetical protein TCELL_0020 [Thermogladius cellulolyticus 1633]
          Length = 644

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 186/375 (49%), Gaps = 54/375 (14%)

Query: 331 IYDEFCP-LLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQ--HKAKEDAAFHKLNK 387
           +Y  F P +L+ +++    VK   F+ A+D F+   E + A +    +A E AA  +L K
Sbjct: 241 LYTSFKPSVLIEEYKLS--VKGVDFNTAVDTFFGHYERRVARETTLRRAGEKAA--ELKK 296

Query: 388 IHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
              + + R+   ++++D    +   I  N   V+  +L  +  +     WE +      E
Sbjct: 297 AIDEIQQRISAFQKDLDGYRSILNTIYENYAQVEQVLLCAQ-EVRRAAGWESVP-----E 350

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
           R +G      ++    ++  + + + ++   +D          + +DL        R   
Sbjct: 351 RCSG------VESYQADKGLVLVKVGDSTVWLD----------IRLDLK-------RNVI 387

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRL--QILQEKTVANISHMRKVHWFEKFNWFIS 565
           E+KKK    + K  TA +K  +  E+  ++    L+E  +     +R   W+E+F+W I+
Sbjct: 388 EIKKKIGELERKLETALNKKREMEEELKQIGEASLEEPRLV----IRPREWYERFHWTIT 443

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           S  +L I GRDA QNE I ++YM + D+++HAD+HGA   V+K     + VP   + +A 
Sbjct: 444 SNGFLAIGGRDADQNETIYRKYMEESDIFLHADVHGAPVVVVKTR--GEDVPETDIREAA 501

Query: 626 CFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP-PHPLIMG 683
             T C+S+AW + + +   +WV   QVSK+ P+GEYL+ GSFM+ GK+N+L  P  L +G
Sbjct: 502 YLTACYSRAWKAGLASIEVFWVRGGQVSKSPPSGEYLSKGSFMVYGKRNYLSIPLELALG 561

Query: 684 --------FGLLFRL 690
                   +G+ +RL
Sbjct: 562 VEKVESSVYGVYYRL 576



 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/82 (26%), Positives = 45/82 (54%), Gaps = 3/82 (3%)

Query: 60  MESGVRLH-TTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +E GVR H +     +KK  P    + +RKH+   ++  VRQ+G++R++  +   G   +
Sbjct: 47  LEPGVRFHLSNIVPSEKKVDP--LAIFVRKHLDNVKVLGVRQVGWERVLRVELARGSEKY 104

Query: 119 YVILELYAQGNILLTDSEFTVL 140
            + +EL  +G +++ + E  +L
Sbjct: 105 SMFIELLPRGVVVIANYEERIL 126


>gi|330506586|ref|YP_004383014.1| hypothetical protein MCON_0325 [Methanosaeta concilii GP6]
 gi|328927394|gb|AEB67196.1| protein of unknown function (DUF814) [Methanosaeta concilii GP6]
          Length = 641

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 166/360 (46%), Gaps = 45/360 (12%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           +  P  L  +   E  +F TF  ALD F+ + E +   Q      D   H++      Q 
Sbjct: 247 DVLPRPLKLYSGLEKKRFVTFSEALDAFFVEREKETTRQ------DPLEHRIEL----QR 296

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             +   + +    V+  ELI      V+  +  +  A A   S+  +   +     +G P
Sbjct: 297 KAIEEFRSQEAELVRKGELIYQLYGSVEQILTLMNDARARGFSYNQIWERIS---GSGLP 353

Query: 454 VAGLIDKLYLE-RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE---- 508
            A  I  L L+ R  M + L                E++E++  L+   NA+R+Y+    
Sbjct: 354 QAKTI--LSLDGRGEMRVFLDG--------------EELELNAELAVPQNAQRYYDKAKD 397

Query: 509 -LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
            ++K + ++    IT   KA K A KKTR        V++    RK  W+E+F WF SS+
Sbjct: 398 MVRKARGAQSALAITEELKAGKVAPKKTR-------AVSSYYRRRKPKWYERFRWFYSSD 450

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            +LV+ GRDA  NE I  +Y+ + D+ +H D  GA  T IK    E  VP  TL +A  F
Sbjct: 451 GFLVLGGRDADSNEEIYAKYLERRDLAMHTDAPGAPLTAIKTEGKE--VPESTLQEAAGF 508

Query: 628 TVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            V +S  W S +  +  + V   QVSKT  +GE+L  G+F+IRG++ +    PL +  G+
Sbjct: 509 AVSYSSLWKSGLAAADCYLVKGDQVSKTPESGEFLKKGAFVIRGERRYFRDVPLGIALGI 568



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 74/144 (51%), Gaps = 8/144 (5%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K  M+  DVAA VK L+ R++G      Y  S       +       +S    ++ LL+
Sbjct: 1   MKKAMSNVDVAAMVKELQDRILGGFMGKAYQQSSDRIWLSV-------QSPAEGRLDLLL 53

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E+G R+H T   R    TP  F   LR H+   R+ D+RQ  +DR++  +      A Y+
Sbjct: 54  ETGRRVHITKAERPASKTPPQFPTMLRSHLSGGRIVDIRQHQFDRVLEIKVERSGTARYL 113

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I+EL+ +G+++L D    +L++LR
Sbjct: 114 IVELFPKGSMILLDESRNILSMLR 137


>gi|374635672|ref|ZP_09707266.1| Fibronectin-binding A domain protein [Methanotorris formicicus
           Mc-S-70]
 gi|373561525|gb|EHP87758.1| Fibronectin-binding A domain protein [Methanotorris formicicus
           Mc-S-70]
          Length = 673

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/377 (28%), Positives = 191/377 (50%), Gaps = 23/377 (6%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y +  P+ L ++   E  ++  F  ALD+++++   +   ++ ++K      K  +I   
Sbjct: 256 YVDVVPINLKKYGDFEKKEYGEFLEALDDYFAQFMVKVEVKKEESKLQKLIKKQERILKT 315

Query: 392 QENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAG 451
           Q   +   ++++  + +  +LI  N   VD  +  +R A   +M W  + +++KE +   
Sbjct: 316 QWETLEKYEKDMQENQEKGDLIYANYMLVDEILNTLRNA-REKMDWYKIKKIIKEHK--D 372

Query: 452 NPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKK 511
           +PV GLI  +  +   + + LS +  +   E+       V +D+  +A  NA  +Y   K
Sbjct: 373 HPVLGLIQNINEKNGEIVIKLSADYGDRKIEKN------VSLDIRKNAFENAETYYTKSK 426

Query: 512 KQESKQEKT-----ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISS 566
           K + K E       +T         +++  L+ L+EK        ++  W+EKF W + +
Sbjct: 427 KLKGKLEGIKEAIKLTEKKIEELKEKEEIELKELKEKEKIKKKERKERKWYEKFKWTVIN 486

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
             +LVI+G+DA  NE+++K+Y    D+  HA + GA  TVIK ++  + V   TLN+   
Sbjct: 487 -GFLVIAGKDAVTNELLIKKYTEDDDIVFHAQIEGAPFTVIKTNK--RIVDEETLNEVAK 543

Query: 627 FTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFG 685
           F+V HS+AW         +WV P Q+SKTA +GEYL  G+F+IRGK+NF+   PL +G G
Sbjct: 544 FSVAHSRAWKLGWGALDTYWVKPEQISKTAESGEYLKKGAFVIRGKRNFIRNVPLELGIG 603

Query: 686 LL-----FRLDESSLGS 697
           ++      RL  S L +
Sbjct: 604 VIEYDDALRLTTSPLNT 620



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/99 (24%), Positives = 52/99 (52%)

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           +  T Y R K   P  F + LRK+++  ++  + Q+ +DRI++  F      + +++EL+
Sbjct: 64  ITMTNYERKKPKNPPSFAMLLRKYLKNIKITKIEQVDFDRIVIITFEWNETVYKLVVELF 123

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
             GN++L D E  ++  L+  R   + +     +++P +
Sbjct: 124 GDGNVVLLDKEDRIIMPLKMGRWSTRNIIPKEFYKFPPQ 162


>gi|269863594|ref|XP_002651278.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064823|gb|EED42778.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 262

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 68/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVVCKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164


>gi|269867209|ref|XP_002652521.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220062310|gb|EED41535.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 265

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 24  KAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 83

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 84  NKYMEDRDLYFHCDVKGASSVICKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 137

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 138 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 184


>gi|452206612|ref|YP_007486734.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
 gi|452082712|emb|CCQ35980.1| conserved hypothetical protein [Natronomonas moolapensis 8.8.11]
          Length = 703

 Score =  134 bits (337), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 170/372 (45%), Gaps = 52/372 (13%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQ-RAEQQHKAKED-----AAFHKLNKIHM 390
           PL  ++    +   FE+F+ A+DE++ ++E++   E+Q  A  D     +   K  +I  
Sbjct: 261 PLREHETEGYDATAFESFNGAIDEYFYRLETESETEEQAGAGTDRPDFESEIEKYERIIE 320

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
            QE  + +  ++ D   + AE +  N + +D     VR A  + + W +    ++E  +A
Sbjct: 321 QQEGAIESYDEQADEEQRKAESLYGNYDLIDEICSTVRAAREDGVPWAE----IEETFEA 376

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRW 506
           G            ER   +   +  +  +D  E T+ V+     +E+D  +    NA R 
Sbjct: 377 G-----------AERGIEA---AEAVVSVDGAEGTVTVDLGDGPIELDPTVGVERNADRL 422

Query: 507 YELKKKQESKQE---KTITAHSKAFKAAEKKTRLQILQEK---------------TVANI 548
           Y   K+   K+E     I    +   A E++      ++                 V+++
Sbjct: 423 YTEAKRVRGKKEGAQAAIEDTREDLAAVERRREAWEAEDADEGEDEDDDAETDYLAVSSV 482

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  W E+F WF +S+ +LVI GR+A QNE +VK+YM   D + HA  HGA  T++K
Sbjct: 483 PVRYDEKWHERFRWFRTSDGFLVIGGRNADQNEELVKKYMDPSDRFFHAQAHGAPVTILK 542

Query: 609 NHRPEQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLT 662
              P++P     +P  +  +A  F V +S  W D K     + V   QVSKT  +GEY+ 
Sbjct: 543 ATEPDEPARDVDIPETSKREAARFAVSYSSVWKDGKFEGDVYEVDADQVSKTPESGEYVE 602

Query: 663 VGSFMIRGKKNF 674
            GSF+IRG + +
Sbjct: 603 KGSFVIRGDREY 614



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 70/163 (42%), Gaps = 11/163 (6%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-- 63
           M + D+AA V  LR   G      Y         K+ +        +  ++ L++E+G  
Sbjct: 7   MTSVDLAALVGELREYTGAVVDKAYLYGDDFVRLKMRDY-------DRGRIELVVETGDP 59

Query: 64  VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            R H     +  D    P  F + LR  I     E V Q G+DRI+ F+F        V+
Sbjct: 60  KRAHVAVPDHVADAPGRPPNFAMMLRNRIAGANFEGVEQYGFDRILTFRFEREDATTLVV 119

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            EL+  GN+ + + +  V+  L + R   + VA  S++ YP E
Sbjct: 120 AELFGDGNVAVMNEDREVIDSLDTVRLTARTVAPGSQYGYPDE 162


>gi|20093528|ref|NP_613375.1| RNA-binding protein snRNP [Methanopyrus kandleri AV19]
 gi|19886366|gb|AAM01305.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Methanopyrus kandleri AV19]
          Length = 671

 Score =  134 bits (337), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 162/684 (23%), Positives = 279/684 (40%), Gaps = 100/684 (14%)

Query: 6   MNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M + DV A  + L  L+ G     +Y +  +    K ++  GV          L+ E G+
Sbjct: 9   MTSFDVRATARELDSLLEGALIDKIYQVGERELKVK-VHVPGVGSH------YLVWEPGM 61

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T   +   + P+  +  LR  +   R+E V QLG+DRI+ F    G   H   +EL
Sbjct: 62  RVHLTWRPKPSPDQPTSVSQALRNTLSGDRIERVTQLGFDRILRFDLRSGRRVH---VEL 118

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSS 184
             +G + +TD    +     + R  ++ V        P E+                   
Sbjct: 119 LPKGTLAVTDENNVIERAFPARRFRNRAVV-------PGEVY------------------ 153

Query: 185 KEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
            EP    PD    D +                   +L   ++++           L   L
Sbjct: 154 -EPPEGPPDPYELDRDAF----------------LELLLEADRD-----------LVRTL 185

Query: 245 GEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIV 304
              +G G   +E ++L  GL    +    +   +     L        D L+ +  GD+ 
Sbjct: 186 AVDVGLGGLYAEEVLLRAGLYERRE----SHASEFEEDELEELYETLRDLLEQISEGDLR 241

Query: 305 PEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY-S 363
           P  Y   +  ++     P E  S     DE            E  + +TF  ALDE+Y +
Sbjct: 242 PTLYRTTERDYVDVTPVPLERYS-----DEL-----------EMEEQDTFQRALDEYYVT 285

Query: 364 KIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAA 423
           K  +++  +  +  E         I   Q + +  L+ + ++    A  +  N   VD  
Sbjct: 286 KFLAEKEREVREEWEREKRRLERTIER-QRSSIEQLRTKAEKLRGRANALYLNYNLVDGI 344

Query: 424 ILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEE 483
           +  +R A     S +++ R ++E + +G      I  + +E   + L L       ++ E
Sbjct: 345 LSELRKAERKGYSLDEIKRRIQEAKGSGIEEVERIADIDVENRRVILRLPG-----ENGE 399

Query: 484 KTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEK 543
            T+PV  ++ D+  +A     R  EL++K E  QE  +    +  +   ++   ++  E+
Sbjct: 400 VTVPV-PIDSDVHSTASKLFDRAKELERKAERAQE-VLREQERELEKLLEEGPPEVELEE 457

Query: 544 TVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGAS 603
               ++  RK  W+E+F WFISS+ ++VI G DA  NE+I++RY+ + D+ VHA +HGA 
Sbjct: 458 LTVELTKRRKKDWYERFRWFISSDGFVVIGGSDAHTNEIILRRYLEEHDILVHAHVHGAP 517

Query: 604 STVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLT 662
             VIK    E  VP  TL +A  F   +S+AW   +     +WV   QV K+A       
Sbjct: 518 HVVIKTEGEE--VPETTLREAAIFAASYSRAWRWGLKAADVYWVTADQVDKSAEAPH--- 572

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            G  +IRGK+N+     L +  G+
Sbjct: 573 -GGAIIRGKRNWFRRTELKVAIGV 595


>gi|124027973|ref|YP_001013293.1| hypothetical protein Hbut_1105 [Hyperthermus butylicus DSM 5456]
 gi|123978667|gb|ABM80948.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
          Length = 672

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 173/356 (48%), Gaps = 42/356 (11%)

Query: 331 IYDEFCPLLLNQFRSR--------EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
           +YD+  PL +  F  R        E+  F     A DE++  +  + A     A E  A 
Sbjct: 239 VYDKGVPLTVTCFEPRGLAARYGFEYRAFNDPSTAYDEYFLTVAREAAGASTVAAEIEAE 298

Query: 383 HK--LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDL 440
            K  L  +   + N  H L++++    ++AE++  N+ DV  A+   R  +     WE +
Sbjct: 299 RKKLLASLEAARRNLEH-LRKKLRELEELAEIVSTNIADVYDAVECAR-KMRETAGWEQI 356

Query: 441 ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAH 500
                     GN   G++D +   +  + + +  N+  +D     +   ++ VDL     
Sbjct: 357 P---------GN-CPGVVD-VEPNKGIIKISIVGNIVPIDIR---MEPGRLVVDLY---- 398

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKF 560
              +R  E++ K E + EK +          E+K R ++L+ + +     +R+  W+EK+
Sbjct: 399 ---KRIGEVRAKIE-RGEKAVKDIEARLAELEEKVRQRLLRARAM-----VRRKEWYEKY 449

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
           +W I+S  YL I GRDA QNE +VKRY++   +++HAD+HGA + V       Q  P   
Sbjct: 450 HWVITSHGYLAIGGRDASQNESVVKRYLNDKRIFMHADIHGAPAVVFFAE--GQTPPEQD 507

Query: 621 LNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           L +A      +S+AW + + +   +WV+  QVSK AP GEYL  G+FM+ GK+N++
Sbjct: 508 LREAAAIAAAYSKAWKAGIGSVDVYWVWGSQVSKAAPAGEYLAKGAFMVYGKRNYI 563



 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 71/138 (51%), Gaps = 10/138 (7%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  M   DVAA V+ L  L G R +N+Y            N   +     +E   +++  
Sbjct: 5   KTSMTAFDVAAVVRQLSGLQGSRLANIYA----------YNGGFLLRFKGAEDARVVVVP 54

Query: 63  GVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
            VRLH T Y   ++ TP    + LRK+IR  RLE V Q G+DRI +F+F  G  ++ ++ 
Sbjct: 55  AVRLHATRYEPAERGTPPPLVMGLRKYIRGARLESVEQHGFDRIAVFRFSRGNGSYVLVT 114

Query: 123 ELYAQGNILLTDSEFTVL 140
           EL  +G ++L DS + +L
Sbjct: 115 ELLPRGVVVLADSSWKIL 132


>gi|76801680|ref|YP_326688.1| hypothetical protein NP2070A [Natronomonas pharaonis DSM 2160]
 gi|76557545|emb|CAI49126.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160]
          Length = 699

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 97/391 (24%), Positives = 177/391 (45%), Gaps = 44/391 (11%)

Query: 337 PLLLNQFRSREF--VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHM 390
           PL L +  +  +    FE F+ A+D ++ +++++   +     +   F     K  +I  
Sbjct: 258 PLPLEEHTAEGYDATAFEHFNGAIDAYFHRLQAEAETETDTGDDKPDFESEIAKFERIIE 317

Query: 391 DQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
            Q+  +   +++ +   + AEL+  N + VD     V+ A    + W+++    +E  + 
Sbjct: 318 QQQGAIEEYEKQAEVEQQKAELLYGNYDLVDEICSTVQSAREEGVPWDEIETTFEEGAER 377

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
           G   A  +  +      +++       ++DD+E       +++D  +    NA R Y+  
Sbjct: 378 GIDAAAAVVGVDAAEGTVTI-------DLDDKE-------IDLDPTMGVEKNADRLYQEA 423

Query: 511 KKQESKQEKTITAHSKAFKAAE--KKTRLQILQEK----------------TVANISHMR 552
           K+   K+E    A     +  E  K+ R Q   +                 ++A++    
Sbjct: 424 KRVRGKKEGAQAAIEDTREDLEDVKERRRQWEADDDEDDDADEESPDRDYLSMASVPVRY 483

Query: 553 KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP 612
              W+E+F WF +S+++LVI GRDA QNE +VK+YM   D + HA  HG   T++K   P
Sbjct: 484 DEKWYEQFRWFRTSDDFLVIGGRDADQNEALVKKYMDPSDRFFHAQAHGGPVTILKATAP 543

Query: 613 EQP-----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSF 666
           ++P     +P  +  +A  F V +S  W D K     + V P QVSKT  +GEY+  G F
Sbjct: 544 DEPAREVDIPDTSKREAAQFAVSYSSVWKDGKFEGDVYEVDPDQVSKTPESGEYIEKGGF 603

Query: 667 MIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           +IRG +N+     + +  G+    D   +G 
Sbjct: 604 VIRGDRNYYRDMQVGVAVGIKCEPDTRVIGG 634



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 11/163 (6%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-- 63
           M + D+AA V  LR   G      Y         K+ +        +  ++ LL+E G  
Sbjct: 7   MTSVDLAALVGELRDYTGAVVDKAYLYGDDFVRLKMRDY-------DRGRIELLIEVGDP 59

Query: 64  VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
            R H     +  D    P  F + LR  I     E V Q G+DRI+ F+F        ++
Sbjct: 60  KRAHVAVPEHVPDAPGRPPNFAMMLRNRIAGANFEGVEQYGFDRILTFRFEREDQTTLIV 119

Query: 122 LELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            EL+  GNI + + +  V+  L + R   + VA  +++ YP E
Sbjct: 120 AELFGDGNIAVLNEDHEVIDCLDTVRLSARTVAPGAQYGYPDE 162


>gi|15920412|ref|NP_376081.1| hypothetical protein ST0231 [Sulfolobus tokodaii str. 7]
 gi|15621194|dbj|BAB65190.1| hypothetical protein STK_02310 [Sulfolobus tokodaii str. 7]
          Length = 595

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 112/200 (56%), Gaps = 11/200 (5%)

Query: 491 VEVDLALSAHANARRWYELKKK---QESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN 547
           +E+D  LS + NA +++++ K+   +  K E+T+    +  K  +K+     ++E+T   
Sbjct: 326 IELDPKLSVYKNASKYFDIAKEYAEKRKKAEETLNNLKQKLKELDKQ-----IEERTEEI 380

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
              +RK  W+EK+ W  +   YLVI+GRD  QNE +V++ +   D+++HAD+ GA +T+I
Sbjct: 381 RISLRKREWYEKYRWSFTRNGYLVIAGRDIDQNESLVRKLLEPKDIFLHADIQGAPATII 440

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSF 666
           K       V    +  A     C+S+AW   M     +WV   QVSK+ P+GEYL  GSF
Sbjct: 441 KTQG--NNVTEDDIRDAAVIAACYSKAWKVGMGAIDVFWVNGDQVSKSPPSGEYLKKGSF 498

Query: 667 MIRGKKNFLPPHPLIMGFGL 686
           MI GKKNF+    + +  GL
Sbjct: 499 MIYGKKNFINNVKMQLFLGL 518



 Score = 41.2 bits (95), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 56/115 (48%), Gaps = 15/115 (13%)

Query: 21  LIGMRCSNVYDLS-PKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           +I  R  NVY +S  + Y  KL           S+K L++ E G R+H T Y R K+   
Sbjct: 23  IISCRVDNVYKISGTQAYFLKL-------HCKNSDKNLVI-EPGKRIHFTKYDRQKE--I 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTD 134
           S     +R HI+ + + ++  LG +RII   F        + +EL  +G +++TD
Sbjct: 73  SNEVSLIRAHIKDKIINNIELLGKERIIKLTFM----DRLMYIELLPRGLLVITD 123


>gi|146302942|ref|YP_001190258.1| hypothetical protein Msed_0157 [Metallosphaera sedula DSM 5348]
 gi|145701192|gb|ABP94334.1| protein of unknown function DUF814 [Metallosphaera sedula DSM 5348]
          Length = 601

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 76/204 (37%), Positives = 119/204 (58%), Gaps = 18/204 (8%)

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANIS 549
           K+E+D  +SA  NA +++E  K+ ++K  +T     +  +  EKK   Q ++ K+   I 
Sbjct: 322 KIEIDPKISASKNASQYFEKAKELDAKIRRT----RETIEELEKKK--QEIKAKSKETIE 375

Query: 550 H----MRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
                +RK  W+E+++W I+S  ++VI+GRD  QNE IV++ +   D+++HAD+ GA +T
Sbjct: 376 GSKILVRKKEWYERYHWTITSNGFIVIAGRDIDQNESIVRKMLEDKDIFLHADIQGAPAT 435

Query: 606 VIKNHRPEQPV--PPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLT 662
           VIKN     PV      L  A     C+S+AW   + +   +WVY  QVSK+ P+GEYL 
Sbjct: 436 VIKN-----PVGIGEQDLMDAAVLAGCYSKAWKLGLASIDVFWVYGEQVSKSPPSGEYLP 490

Query: 663 VGSFMIRGKKNFLPPHPLIMGFGL 686
            GSFMI GKKN++    L +  G+
Sbjct: 491 KGSFMIYGKKNYIKNVKLELTIGV 514


>gi|269864527|ref|XP_002651604.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064216|gb|EED42452.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 257

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVICKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164


>gi|150400994|ref|YP_001324760.1| hypothetical protein Maeo_0563 [Methanococcus aeolicus Nankai-3]
 gi|150013697|gb|ABR56148.1| protein of unknown function DUF814 [Methanococcus aeolicus
           Nankai-3]
          Length = 686

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 198/375 (52%), Gaps = 30/375 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMD 391
           Y    P+ L ++ + +   ++ F  A+D+++S    +   ++ + K     ++  +I   
Sbjct: 257 YFSISPIELLKYANYDKKYYDNFLTAMDDYFSIFILKTEIKKQETKIQKMVNRQERILNS 316

Query: 392 Q-ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKA 450
           Q E+     KQ+++  +K  +LI  N   VD  IL   ++   ++ W+++ ++VK+ +  
Sbjct: 317 QIESLKKYEKQDIENKLK-GDLIYANYAMVDE-ILNTIISAREKLEWKEIKKIVKQNK-- 372

Query: 451 GNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEK-VEVDLALSAHANARRWYEL 509
            NP+ G I  +  E+N   ++L+  +D  D      P+ K V +D+  +A  NA  +Y  
Sbjct: 373 DNPILGKIVSIN-EKNG-EIILNLTVDYGDGA----PITKNVILDIRKNAFENADNYYGK 426

Query: 510 KKKQESKQEKTITAHSKAFKAAEK-----KTRLQILQEK--TVANISHMRKVHWFEKFNW 562
            KK + K +   TA   + K  +K     ++ ++ L+EK  T       +K  W+EKF W
Sbjct: 427 SKKFKHKIKGVHTAIEISEKKLKKLKIQEESEMETLKEKEETTMVKKERKKRKWYEKFKW 486

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK--NHRPEQPVPPLT 620
            + ++ YLVI+G+DA  NE ++KRY  K D+  H  + GA  TVIK    +  + +  L+
Sbjct: 487 TVIND-YLVIAGKDASTNESLIKRYTEKDDIVFHTQMAGAPFTVIKVDKSKGNKTIEELS 545

Query: 621 -------LNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKK 672
                  +++   + V HS+AW   + ++  +WV P Q+SKTA +GEYL+ G+FM+RGK+
Sbjct: 546 EEERNHLISETAKYAVSHSKAWKLGLGSADVYWVKPDQISKTAESGEYLSKGAFMVRGKR 605

Query: 673 NFLPPHPLIMGFGLL 687
           NF+    L +G G++
Sbjct: 606 NFIRSAILDLGIGII 620



 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/170 (25%), Positives = 82/170 (48%), Gaps = 16/170 (9%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSP---KTYIFKLMNSSGVTESGESEKVLL 58
           +K  +   D+   V+ L+++I  +    + ++    K  I K+     + E G  E V+ 
Sbjct: 1   MKTELTNVDIFVAVQELQQIINGKLDKAFLVNSQQGKELILKI----HIPEIGTREIVV- 55

Query: 59  LMESGVRLH----TTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
               GV  H     T Y+RDK   P  F + LRKH++  ++  V Q  +DRII  +F   
Sbjct: 56  ----GVGKHKYITLTEYSRDKPRNPPSFAMLLRKHLKNIKIVSVEQHNFDRIIKIKFQWN 111

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
              + +++EL+  GN++L D E T++  L+  R   + +     +++P +
Sbjct: 112 EIEYILVIELFGDGNVILLDKENTIILPLKIERWSTRKIVPKEIYKFPPQ 161


>gi|269864419|ref|XP_002651566.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064286|gb|EED42490.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 290

 Score =  132 bits (333), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 102/167 (61%), Gaps = 10/167 (5%)

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
           KA + K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 4   KAEKTKIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENNCVIIGGKNAQQNDQIV 63

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS + K            +  A  F + +S+AWD +++   +
Sbjct: 64  NKYMEDRDLYFHCDVKGASSVICKGSADR------NIEDATYFALVYSKAWDEQVIKDVF 117

Query: 645 WVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
           +V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 118 YVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 164


>gi|257077022|ref|ZP_05571383.1| hypothetical protein Faci_08161 [Ferroplasma acidarmanus fer1]
          Length = 615

 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 60/130 (46%), Positives = 95/130 (73%), Gaps = 3/130 (2%)

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
           R  +WFE ++WFISS   ++++GRDA+ NE +VK++MS  D+YVHADL+GA STVIK+  
Sbjct: 409 RVKYWFESYHWFISSSGNMIMAGRDAKTNEKLVKKHMSDDDIYVHADLYGAPSTVIKHEG 468

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            E  +   T+ +A  F++  S+AW + + + +A+WVYP QVSKT  +GE+++ GS+++RG
Sbjct: 469 IE--ITEETIKEACIFSISLSRAWPAGIGSGTAYWVYPSQVSKTPESGEFVSKGSWIVRG 526

Query: 671 KKNFLPPHPL 680
           K+N++   PL
Sbjct: 527 KRNYVLNIPL 536



 Score = 41.6 bits (96), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 4/107 (3%)

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNI 130
           Y  +K    +  ++  RK +  +R+  + Q+ +DR++      G     +ILEL+  GN+
Sbjct: 62  YDAEKPEEATQLSMLFRKQLSEKRIVGIEQINFDRVVRITLHTGQE---IILELFGGGNL 118

Query: 131 LLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKL 177
           +LTD+   V   +  H    + V I   +  P  I  V +  T S +
Sbjct: 119 ILTDNGKIVFA-MEQHVYKTRKVQIGEEYIPPAVINPVADLETFSGI 164


>gi|397781041|ref|YP_006545514.1| hypothetical protein BN140_1875 [Methanoculleus bourgensis MS2]
 gi|396939543|emb|CCJ36798.1| putative protein MJ1625 [Methanoculleus bourgensis MS2]
          Length = 659

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 171/351 (48%), Gaps = 41/351 (11%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
           P++L     RE  +FETF  ALD FY K+  ++ E   +        +   I   Q   +
Sbjct: 270 PVVLAGEEVRE--RFETFSEALDAFYPKVAGEKEEAAAEKPR---LSREEVIRQRQAEAI 324

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
              ++++ R  ++ E++  N   V   I  +  A  +R SW+++ +++K    + N  A 
Sbjct: 325 KGFEKKIRRYERVVEVLYENYTAVTGVITTLDAASRDR-SWQEIEQILKS--NSDNAAAK 381

Query: 457 LIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK 516
           +I  ++     + L L+               E+V+V +  +   N  R+Y+  KK + K
Sbjct: 382 MIRAVHPAEAAVELDLAG--------------ERVKVYVHETIEQNIGRYYDQIKKFKKK 427

Query: 517 QEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           +   + A  +A     ++ +  + Q+K            W+ +F WF +S+  LVI GRD
Sbjct: 428 KAGALAAMERAITVKPRRKQHLVFQKK-----------RWYHRFRWFSTSDGVLVIGGRD 476

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           A QNE +VK+YM  GD+++HAD+HG S  ++K            L++A  F   +S AW 
Sbjct: 477 ASQNEELVKKYMEGGDLFIHADVHGGSVVIVKGATEH-------LDEAAQFAASYSNAWK 529

Query: 637 SKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +   ++  +   P QVSKTA +GEY+  G+F++RG++ +    P+ +  GL
Sbjct: 530 AGHFSADVYAARPDQVSKTAESGEYVARGAFIVRGERQYFRNVPVGVAIGL 580



 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/150 (32%), Positives = 79/150 (52%), Gaps = 11/150 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M+  D+ A V      + +    +Y    KT         G+  +GE   K LLL+E+G 
Sbjct: 34  MSGVDLRALVAEAADRLPLWVGKIYQFDAKTL--------GIRLNGEDRAKYLLLVETGR 85

Query: 65  RLHTTA-YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILE 123
           R+H TA + +  KN PS F + LRKH+   ++ D+RQLG +R +    G     +++I E
Sbjct: 86  RIHFTAEFPKPPKNPPS-FAMLLRKHLEGGKVLDIRQLGIERTMSIDIGKRDTTYHLIFE 144

Query: 124 LYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           L+ +GN +L D E+T++  L  HR  ++ V
Sbjct: 145 LFDEGNAILCDEEYTIIKPLWHHRFKNRDV 174


>gi|374630447|ref|ZP_09702832.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
           2279]
 gi|373908560|gb|EHQ36664.1| Fibronectin-binding A domain protein [Methanoplanus limicola DSM
           2279]
          Length = 629

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 170/357 (47%), Gaps = 52/357 (14%)

Query: 324 ESGSSTQIYDEFC-PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAF 382
           ES  S  I    C PL+       E   FET+  ALD ++   E   AE + K       
Sbjct: 229 ESKKSPAITKSGCWPLIFEGEIPEE--TFETYSQALDSYFGLPEVSEAEVKKK------L 280

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLAR 442
            K   I   Q+  +   ++++  + +  E+I  N + + A I+      + +MSW+++  
Sbjct: 281 SKAEIIRKRQQEAIVKFEEKITLASEKVEIIYANYQTI-ADIVKTLSDASLKMSWQEIED 339

Query: 443 MVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHAN 502
           ++K    A NP+A +I ++Y     + +LL           KT+ +   E         N
Sbjct: 340 ILK---NADNPMAKMIKRVYPSEAAVDILLDG---------KTIKLYASE-----GVEGN 382

Query: 503 ARRWYELKKKQESKQEKTITAHSKAFKAAE----KKTRLQILQEKTVANISHMRKVHWFE 558
           A R+Y   KK + K+   + A  + FK  E    K+T ++ ++ K            W+ 
Sbjct: 383 AGRYYSEIKKFKKKKAGALVAMER-FKVTERPERKRTDIKFIKPK------------WYH 429

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KF WF +S++ LVI GRDA  NE IV++Y+   D ++HAD+HG S+  +K          
Sbjct: 430 KFRWFYTSDDVLVIGGRDAGTNEDIVRKYLEGKDTFLHADIHGGSAVAVKGETE------ 483

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPH-QVSKTAPTGEYLTVGSFMIRGKKNF 674
             +++A  F V +S AW S   ++  +  P  QVSKTA +GE L  G+F+IRG++ +
Sbjct: 484 -CMDEAAVFAVSYSNAWKSGFYSADVYAVPRDQVSKTAESGESLKRGAFVIRGERKY 539



 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 80/153 (52%), Gaps = 9/153 (5%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGES-EKVLLLM 60
           VK  M+  D+ A +  L  L+ +    +Y      + F+L        +GE  +K  ++ 
Sbjct: 3   VKKGMSGLDLRAVIAELNGLMPLWIGKIYQYDQNAFGFRL--------NGEDRQKFSIIA 54

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESGVR+H T         PSG+++ LRK++   R+ ++ Q G  R++    G   + +++
Sbjct: 55  ESGVRVHLTKKLPKSPENPSGYSMYLRKYLSGGRILEINQPGIQRVLDLTIGKSESIYHL 114

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           I E + +GN +L DSE+T+L  L+ HR  D+ +
Sbjct: 115 IFEFFDEGNAILCDSEYTILNALKRHRFKDRDI 147


>gi|14601515|ref|NP_148055.1| hypothetical protein APE_1611 [Aeropyrum pernix K1]
 gi|5105298|dbj|BAA80611.1| conserved hypothetical protein [Aeropyrum pernix K1]
          Length = 650

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/192 (38%), Positives = 109/192 (56%), Gaps = 13/192 (6%)

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ----ILQEKTVANISHMRKVHWFEKF 560
           R Y    + E+K E+      KAF  AE ++RL+      + +++  I   RK  WFEK+
Sbjct: 368 RLYREAGELEAKAERA----EKAF--AEARSRLEEAVRRARLRSLRRIIEGRKRFWFEKY 421

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLT 620
           +W I+   +L I GRDA QNE +VKRY+   D+++HAD+HGA +TV+   R  QP     
Sbjct: 422 HWTITRNGFLAIGGRDAGQNESVVKRYLGDDDIFLHADIHGAPATVLLTRR-LQPGDD-D 479

Query: 621 LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHP 679
           +  A      +S+AW +     S +WVY  QVSK+ P GEYL  G+FM+ GK+N++   P
Sbjct: 480 IYDAAVLAAAYSRAWKAGAGGVSVYWVYGSQVSKSPPAGEYLARGAFMVYGKRNYIHHVP 539

Query: 680 LIMGFGLLFRLD 691
           L +  G++   D
Sbjct: 540 LKLALGIVMHKD 551



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 80/152 (52%), Gaps = 14/152 (9%)

Query: 1   MVKVRMNTADV-AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M +  MN+ DV  A ++    L G R  N+Y    K  +  LM   G T +     V ++
Sbjct: 1   MARKSMNSLDVHIAAIQLDNMLRGARLDNIYWPPEKKGV--LMKFKGPTGT-----VNVI 53

Query: 60  MESGVRLHTTA-YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
            E  VR+H T+  A  ++  P+GF   LRK +R  RLE VRQLG+DRI+   F  G   H
Sbjct: 54  AEPSVRIHATSRTAALREVVPTGFVAILRKRVRGSRLEGVRQLGFDRIVELSFSTG---H 110

Query: 119 YVILELYAQGNILLTDSEFTV--LTLLRSHRD 148
            + +E+  +G+++L +SE  +   T++   RD
Sbjct: 111 RLYVEIMPRGSLVLVNSEGVIEATTVVAEFRD 142


>gi|335438854|ref|ZP_08561586.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
 gi|334890357|gb|EGM28628.1| Fibronectin-binding A domain protein [Halorhabdus tiamatea SARL4B]
          Length = 707

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 114/482 (23%), Positives = 206/482 (42%), Gaps = 77/482 (15%)

Query: 243 VLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGD 302
            L   L +G    E +    G+  N  + E   +E    + L  AV+   + L++   GD
Sbjct: 188 TLATQLNFGGLYGEELCSRAGVSYNQAIEETTDVE---FEALYDAVSDLSERLRE---GD 241

Query: 303 IVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY 362
           + P  Y+   ++    D                 P+ L ++  +    F++F+ AL+E++
Sbjct: 242 LDPRLYVEADDQETPVD---------------VTPVPLVEYEDKPSEAFDSFNDALEEYF 286

Query: 363 SKIESQRAEQQ---HKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLED 419
             +E +  E++   ++   +A   K  +I   QE  +   ++E     + AEL+  N + 
Sbjct: 287 LGLEQEPDEEETGSNRPGFEAEIEKQKRIIAQQEGAIEDFEEEAAAEREKAELLYANYDL 346

Query: 420 VDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEM 479
           VD  +  ++ A A    W ++   +   +  G P A                    + ++
Sbjct: 347 VDEVLSTIQDARAADTPWAEIEETLSAGKDQGIPAA------------------EAVSDV 388

Query: 480 DDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESK-------------QEKTIT 522
           D  E T+ V+    ++E+D       NA R Y+  K+ E K             Q + + 
Sbjct: 389 DGSEGTVTVQIDDHRIELDADTGVEKNADRLYQEAKRIEDKKAGAKEAIENTREQLEAVK 448

Query: 523 AHSKAFKAAEKKTRLQILQEKTVA-----------NISHMRKVHWFEKFNWFISSENYLV 571
              +A++A++         +               +I       W+E F WF +S+ +LV
Sbjct: 449 QRREAWEASDGNDGGDGSGDTDEDDQEDIDWLARESIPIRTSEEWYEHFRWFHTSDGFLV 508

Query: 572 ISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP-EQP-----VPPLTLNQAG 625
           I GR+A QNE +VK+Y+ +GD++ H   HGA +T++K   P E P     +P  +  +A 
Sbjct: 509 IGGRNADQNEELVKKYLDRGDLFFHTQAHGAPATILKATGPSEAPPDDISIPESSREEAA 568

Query: 626 CFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            F + +S  W D K     + V   QV+KT  +GEYL  GSF IRG++ +    P+ +  
Sbjct: 569 QFAISYSTLWKDGKYAGDVYCVEHDQVTKTPESGEYLEKGSFAIRGERTYYDDTPVGVAV 628

Query: 685 GL 686
           G+
Sbjct: 629 GI 630



 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 9/164 (5%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSS-GVTESGESEKVLLLME 61
           K  + + D AA    LR  +G      Y         KL   + G  E      +L+ ++
Sbjct: 4   KRELTSVDCAALAGELRAFVGAYHEKSYLYDDDLLRLKLSGPNFGRIE------LLIEVD 57

Query: 62  SGVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
              R+HT A  R  D    P  F + LR  +   +L  V Q  +DRI+  +F    +   
Sbjct: 58  DPKRVHTVAPERVPDAPERPPNFAMMLRNRLEGAQLASVEQFEFDRILQLRFERSDDHTT 117

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
           +I EL+  GN+ + D   TV+  L + R   + V   SR+ +P+
Sbjct: 118 IIAELFGDGNLAVLDETDTVIDSLETVRLQSRTVTPGSRYEFPS 161


>gi|374632982|ref|ZP_09705349.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
           yellowstonensis MK1]
 gi|373524466|gb|EHP69343.1| putative RNA-binding protein, snRNP like protein [Metallosphaera
           yellowstonensis MK1]
          Length = 602

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 12/208 (5%)

Query: 483 EKTLPVEKVEVDLALSAHANARRWYELKKKQESKQ---EKTITAHSKAFKAAEKKTRLQI 539
           E TL    VE+D  LS    A  ++E  K+ ESK    E+TI    K  +  + K R + 
Sbjct: 316 EVTLGEVTVEIDPNLSLTRVASSYFERAKELESKARRAEETIAELKKKVEELKLKLR-ET 374

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
            + K++     +RK  W+EK+ W  +  NYLVI+GRD  QNE +VK+ + + ++++HAD+
Sbjct: 375 EESKSLV----IRKKEWYEKYRWSFTRNNYLVIAGRDVDQNESLVKKMLGEEEIFLHADI 430

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTG 658
            GA +T+IK+ +  Q      +  A     C+S+AW   +     +WVY  QVSK+ P+G
Sbjct: 431 QGAPATIIKDSKGVQEG---DIYDAAVVAACYSKAWKLGLGSVDVFWVYGSQVSKSPPSG 487

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           EYL  GSFMI GKKNF+    L +  GL
Sbjct: 488 EYLPKGSFMIYGKKNFIKNVRLELAIGL 515



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 58/118 (49%), Gaps = 14/118 (11%)

Query: 20  RLIGMRCSNVYD-LSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           +++G R  N+Y  L  + Y+F L    G  E+        ++E   R+H T Y R++   
Sbjct: 26  KIVGCRVDNIYSILKGRGYLFLLHCRDGDKET--------ILEPSRRIHFTRYQRER--V 75

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
                  LR+ +R   + +V  +  +RI++F      N H + LEL  +G +++TDS+
Sbjct: 76  LDNKAKMLRELVRGAVIREVDVVPGERIVVFSLS---NDHKIYLELLPKGVLVVTDSQ 130


>gi|330835774|ref|YP_004410502.1| hypothetical protein Mcup_1916 [Metallosphaera cuprina Ar-4]
 gi|329567913|gb|AEB96018.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
          Length = 508

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 77/216 (35%), Positives = 120/216 (55%), Gaps = 16/216 (7%)

Query: 490 KVEVDLALSAHANARRWYELKKKQE---SKQEKTITAHSKAFKAAEKKTRLQILQEKTVA 546
           K+E+D + S   NA  +++  K+ E    K E+TI    +  +    KT+ +I   K + 
Sbjct: 236 KIEIDPSKSIAKNAALYFDKAKELEEKIKKTEETIVELERKKQDLLSKTKEEIESSKVL- 294

Query: 547 NISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
               +RK  WFEK++W I+   Y+VI+GRD  QNE +VK+++   D+++HAD+ GA +TV
Sbjct: 295 ----IRKREWFEKYHWTITKNGYIVIAGRDIDQNESLVKKFLGDDDIFLHADIQGAPATV 350

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGS 665
           IK+      +    L  A      +S+AW   + +   +WVY  QVSK+ P+GEYL  GS
Sbjct: 351 IKSP---NSISDEDLLDAATLAASYSKAWKLGLGSIDVFWVYGKQVSKSPPSGEYLPKGS 407

Query: 666 FMIRGKKNFLPPHPLIMGFGL----LFRLDESSLGS 697
           FMI GKKNF+    L +  G+     FR++  S  +
Sbjct: 408 FMIYGKKNFIKNVKLELTVGINTKEGFRIEVGSFNT 443


>gi|448377770|ref|ZP_21560466.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
           14624]
 gi|445655714|gb|ELZ08559.1| Fibronectin-binding A domain protein [Halovivax asiaticus JCM
           14624]
          Length = 736

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 169/737 (22%), Positives = 282/737 (38%), Gaps = 134/737 (18%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L  L G +    Y         KL +     + G  E  + + E+
Sbjct: 4   KRELSSVDLAAVVGELSDLEGAKVDKAYLYGDDLVRLKLRD----FDRGRVELFIEVSET 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R+HT A  R  D    P  F   LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RVHTVAQERVPDAPGRPPHFAKMLRNRLSGADFAGVSQYEFDRILEFVFEREDANTRL 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAA 180
           I+EL+ +GN+ +TD E+ V+  L                    E  R+  RT A      
Sbjct: 119 IVELFGEGNVAVTDGEYEVVDSL--------------------ETIRLKSRTVAPGARYE 158

Query: 181 LTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTL 240
              S+               N    S+E         +FD  +  +++  D  R    TL
Sbjct: 159 FPESR--------------VNPLTVSRE---------AFD--RQMDESDTDVVR----TL 189

Query: 241 KTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVIS 300
            T     L +G   +E +    G+   + + +  + E    + L  A+ +      DV +
Sbjct: 190 AT----QLNFGGLYAEELCTRAGVEKTIDIEDAGESE---YERLYGAIERL---AIDVRN 239

Query: 301 GDIVPEGYILMQNKHL------GKDHPPTESGSSTQIYDEF-----------CPLLLNQF 343
           G   P  Y+  +++        G D    E+G + +  DE             PL  +Q 
Sbjct: 240 GAFDPRLYLEHEDEEGETEGDSGTDD---EAGPTAETDDETEASGTPVDVTPFPLDEHQQ 296

Query: 344 RSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE----DAAFHKLNKIHMDQENRVHTL 399
              E   F++F  ALDE++ ++E    E    A +    +A   K  +I   QE  +   
Sbjct: 297 AGLEPEAFDSFTDALDEYFYRLELADEEPADAASQRPDFEAEIAKQQRIIEQQEGAIEEF 356

Query: 400 KQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           ++E +   + AEL+  N   VD  +  VR A      W ++    +   + G   A  + 
Sbjct: 357 EREAEAERERAELLYANYGFVDEILSTVRDARTEGTPWAEIEERFEAGAEQGIDAAEAVV 416

Query: 460 KLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY----ELKKKQES 515
            +      +++ L                E++ +D       NA R Y     + +K+E 
Sbjct: 417 DVDGANGRVTIELDG--------------ERIGLDADDGVEKNADRLYTEGKRIAEKKEG 462

Query: 516 KQEKTITAHSKAFKAAEKKTRLQILQEKT-------------------VANISHMRKVHW 556
            Q+       +     E+K   +   E +                    ++I       W
Sbjct: 463 AQQAIENTREELADVRERKAAWEADDEGSDETGGDDSDEDEPDIDWLARSSIPIRENEPW 522

Query: 557 FEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK------NH 610
           F++F W  +S+ +LVI GR+A QNE +V +Y+  GD   H   HG   TV+K      + 
Sbjct: 523 FDRFRWVQTSDGFLVIGGRNADQNEELVNKYLEPGDRVFHTQAHGGPVTVLKATDPSESS 582

Query: 611 RPEQPVPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIR 669
           RP+   P  ++ QA  F V ++  W D +     + V   QV+KT  +GEYL  G F IR
Sbjct: 583 RPDMEFPEASIEQAAQFAVSYASVWKDGRYAGDVYAVDADQVTKTPESGEYLEKGGFAIR 642

Query: 670 GKKNFLPPHPLIMGFGL 686
           G + +    P+ +  G+
Sbjct: 643 GDRTYHRDTPVDVAVGI 659


>gi|296241940|ref|YP_003649427.1| hypothetical protein Tagg_0195 [Thermosphaera aggregans DSM 11486]
 gi|296094524|gb|ADG90475.1| protein of unknown function DUF814 [Thermosphaera aggregans DSM
           11486]
          Length = 666

 Score =  130 bits (327), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 193/393 (49%), Gaps = 55/393 (13%)

Query: 306 EGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI 365
           +GY+++Q ++              Q++  + P+L  +    E  + E+ D  +D +++++
Sbjct: 236 KGYLVLQEEN-------------PQLFTAYYPVLFKEEYGFEVKELESIDEVIDIYFTRL 282

Query: 366 ESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR-SVKMAELIEYNLEDVDAAI 424
           E        +++  A    LN+  + Q+  +   ++++D  S K++ +  Y   D+ +A+
Sbjct: 283 ELSLELAGKQSEMKAKLDSLNERILRQKEIISNYQRQLDEISNKLSSIYTY-FTDISSAL 341

Query: 425 LAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEK 484
                         D AR  +EE+             Y+ +NC  ++   N+ + D  E 
Sbjct: 342 --------------DCARKTREEQGWE----------YIVKNCPGII---NIHK-DKGEV 373

Query: 485 TLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKK---TRLQILQ 541
            L V    + L++      ++  E++K +   + K  TA + + K  EK+   T+++ L 
Sbjct: 374 ELSVGGRTITLSIRIPLE-KQIIEMEKIKGEVKRKIDTALN-SLKEIEKEYDATKME-LD 430

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + + + +  ++   W+EKF+W  +   +LV+ GRDA QNE IVK+Y+   D+++HA++HG
Sbjct: 431 KFSASKMISIKPRSWYEKFHWLFTRNGFLVVGGRDASQNEAIVKKYLRDKDIFLHAEIHG 490

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGE 659
            S+ V+  +  E   P L+ +  A     C+S+AW + M     +W     VS + P+GE
Sbjct: 491 GSAAVLLTNGKE---PSLSDIEDAALIPACYSKAWKTGMGFIEVFWTMGSSVSLSPPSGE 547

Query: 660 YLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
           YL  G+ M+ GKKN+L   PL +G GL    DE
Sbjct: 548 YLPKGAIMVYGKKNYL-KTPLRLGLGLDVVCDE 579


>gi|119719655|ref|YP_920150.1| hypothetical protein Tpen_0745 [Thermofilum pendens Hrk 5]
 gi|119524775|gb|ABL78147.1| protein of unknown function DUF814 [Thermofilum pendens Hrk 5]
          Length = 610

 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 86/280 (30%), Positives = 141/280 (50%), Gaps = 39/280 (13%)

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
             +++ V+   + AEL+  +   VD  + A R  +A+R+ W     +V+   K   P+  
Sbjct: 283 EAIRRAVEELSRKAELLSRHSATVDEVLAAYRGLVASRLQWS----LVEARLKEAYPIVK 338

Query: 457 LIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK 516
            +D     R+ + L       E++  E       VEVD + SA +NA  ++E   K +S 
Sbjct: 339 SVDP---ARSRLVL-------ELEGVE-------VEVDASRSALSNAASYFE---KAKSA 378

Query: 517 QEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           + K   A +             + +    A     +   W+ +F +F +S  +LV++GR 
Sbjct: 379 KRKLAEASA------------AVERSAEPAPARPAKPAAWYAQFRFFFTSNGFLVVAGRS 426

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           A QNE++V+RYM  GD+++HAD+HGA++ V+K    +QP     + +A  F  C S AW 
Sbjct: 427 AGQNELLVRRYMEPGDIFLHADIHGAAAVVLKTG-GKQP-GEADIAEAAQFAACFSSAWK 484

Query: 637 SKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             +     +WV   QVSK  P+GEYL  GSFM+ GKKN++
Sbjct: 485 GGLYAVDVFWVPAEQVSKKPPSGEYLAKGSFMVYGKKNYV 524


>gi|403216659|emb|CCK71155.1| hypothetical protein KNAG_0G00970 [Kazachstania naganishii CBS
           8797]
          Length = 1006

 Score =  129 bits (325), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 180/385 (46%), Gaps = 51/385 (13%)

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAEL 412
           T++  +D+F+S +ES +   + + +E  A  KL +   +   R+  L     ++ +   L
Sbjct: 319 TYNRTVDKFFSTLESSKYAMKIQNQETLAGKKLEEARSENGKRIQALIDVQSQNEQKGHL 378

Query: 413 IEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLERN--CMS 469
           I  + E V+ A  AV+  L  ++ W  + +++  E+K GN +A  I   L L++N   + 
Sbjct: 379 IITHAELVEDAKGAVQGLLDQQLDWNIIEKLIITEQKKGNKIAKAIKLPLKLKKNTIVLE 438

Query: 470 LLLSNNLDEMDDEE------------------------------------KTLPVEKVEV 493
           L L +N D  DD E                                    + L    V V
Sbjct: 439 LPLEDNNDTEDDTELSEEVDSSDISSSELSSDEESDQGSTQHQHRKSNRIRALKPTTVSV 498

Query: 494 D--LALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVANI 548
           D  L LS +ANA  ++ +KK    KQ+K      KA K  E K   Q+   L+E     +
Sbjct: 499 DIKLDLSTYANASEYFMVKKHTVEKQKKVEQNLDKAMKNIETKVNKQLNSKLKESHKV-L 557

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  ++FEK+NWFISSE +LV+ G+   + + +  +Y++  D+YV  +    S   IK
Sbjct: 558 KRLRTPYFFEKYNWFISSEGHLVLMGKSDIETDQLYSKYITPDDIYVSNEF--GSHVWIK 615

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSFM 667
           N +  + VPP T+ QAG F +  S AW  K+ +S ++     VSK +A     L  G + 
Sbjct: 616 NPKKTE-VPPNTIMQAGIFAMAASVAWSKKLSSSPYFCSASNVSKFSANDNTVLPQGCYR 674

Query: 668 I--RGKKNFLPPHPLIMGFGLLFRL 690
           +    +K  LPP  L+MG G  +++
Sbjct: 675 LIDEREKVVLPPAQLVMGLGFFWKV 699



 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 83/146 (56%), Gaps = 13/146 (8%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+   D+   V  L   L   R +N+Y++  S + ++ K         +    K+ +
Sbjct: 1   MKQRLGALDIQLLVPELSTALESYRLNNIYNVADSSRQFLLKF--------NKPDSKINV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G++++ T ++RD    PSGF +KLRKH++ +RL  +RQ+  DRII+ QF  G N  
Sbjct: 53  VVDCGLKIYMTEFSRDIPPVPSGFVVKLRKHLKAKRLTALRQVLDDRIIVLQFADGKN-- 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR 144
           Y++LE ++ GN++L D    +L + R
Sbjct: 111 YLVLEFFSAGNVILLDETRKILLVQR 136



 Score = 40.8 bits (94), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 5/62 (8%)

Query: 867 EKERGK---DASSQPESIVRKTKIEGG--KISRGQKGKLKKMKEKYGDQDEEERNIRMAL 921
           +KE G     + S PE++     I G   K  RG+KGKLKKM+ KY DQDE ER +++  
Sbjct: 771 DKEEGSATGSSGSIPENMSVAETIVGDIKKNVRGKKGKLKKMQRKYRDQDENERLLKLEA 830

Query: 922 LA 923
           L 
Sbjct: 831 LG 832


>gi|429217609|ref|YP_007175599.1| RNA-binding protein [Caldisphaera lagunensis DSM 15908]
 gi|429134138|gb|AFZ71150.1| putative RNA-binding protein, snRNP like protein [Caldisphaera
           lagunensis DSM 15908]
          Length = 669

 Score =  129 bits (324), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 93/143 (65%), Gaps = 3/143 (2%)

Query: 545 VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
           V NI   RK  W+EK++W ++  N+L I GRDA QNE +VK+Y+S+ D+Y+HAD+HG+ S
Sbjct: 437 VKNIIRSRKREWYEKYHWILTRNNFLAIGGRDADQNESVVKKYLSEKDIYIHADIHGSPS 496

Query: 605 TVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTV 663
            V+  +  +  V    +N A    + +S+AW + M    A+WV  +QVSK+ P+GEYL  
Sbjct: 497 VVLFANNKD--VGEEDINDAAIIAIAYSKAWKAGMGSVGAYWVLGNQVSKSPPSGEYLAK 554

Query: 664 GSFMIRGKKNFLPPHPLIMGFGL 686
           GSFMI GKKNFL P  + +  G+
Sbjct: 555 GSFMIYGKKNFLKPINMELYLGI 577



 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 27/110 (24%), Positives = 55/110 (50%), Gaps = 5/110 (4%)

Query: 57  LLLMESGVRLHTTAYAR-DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
           +LL+E  +R+H +   +   +     F L LRK+IR +++  V Q+G+DR+I   F    
Sbjct: 71  ILLIEPSLRIHFSNRIKPSSEFVDKQFALLLRKYIRDQKITSVEQIGFDRLIKITF---F 127

Query: 116 NAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEI 165
           N     +E+  +G + L D    ++   +  +  D+ +    ++++P  I
Sbjct: 128 NIK-TFVEILPKGVVALVDENDQIIGATKYLKFKDREIKPKIKYKFPKII 176


>gi|355571923|ref|ZP_09043131.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
 gi|354825019|gb|EHF09254.1| protein of unknown function DUF814 [Methanolinea tarda NOBI-1]
          Length = 633

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 91/316 (28%), Positives = 150/316 (47%), Gaps = 44/316 (13%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           ++ KA+ D      + I   Q+  V   ++++    +  E +  +   V   + A+R A 
Sbjct: 281 KEEKARRD------DHIRSRQQEAVKKFEEKIAACERAVEALYSHYTLVSEILEALRKAR 334

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKV 491
             R SW+++  +V   R A +  A  I  +Y  R  + + L                E+V
Sbjct: 335 ETR-SWQEIEALV---RGAKSGPATRIVAVYPGRGAVDIDLG---------------ERV 375

Query: 492 EVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHM 551
            + +  S  ANA  +YE  KK   K      A  +A +  E++T      +K        
Sbjct: 376 TLTVGESIEANAAAYYEEIKKYRRKIAGAQAAMERAVQKKERRTVRAAAGKK-------- 427

Query: 552 RKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR 611
               W+ +F WFI+S+  LV+ GRDA QNE +VK+YM   D++VHAD+HGAS  ++K   
Sbjct: 428 ---RWYHRFRWFITSDGVLVVGGRDASQNEELVKKYMEGSDLFVHADVHGASVVIVKGKT 484

Query: 612 PEQPVPPLTLNQAGCFTVCHSQAWDSK-MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
            +       +++   F   +S AW S  +    + V P QVSKT  +GEY++ GSF++RG
Sbjct: 485 GK-------MDEVATFAASYSGAWKSGHLAADVYCVAPSQVSKTPESGEYVSRGSFIVRG 537

Query: 671 KKNFLPPHPLIMGFGL 686
           ++ +    PL +  GL
Sbjct: 538 ERRYFRNVPLGIAIGL 553



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 80/159 (50%), Gaps = 7/159 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           ++  DV A V    RL+ +     Y+++P T + +           E  +  L++E  VR
Sbjct: 7   LSGIDVRALVTEWERLLPLWVDKAYEVAPGTILLRFKGK-------EHGRHALVIEPPVR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H T +      TPS F + LRK++   R+  VRQ G  RI++F  G G   +++++EL+
Sbjct: 60  AHLTWHEVAVPKTPSAFAMLLRKYLSGGRVLSVRQHGIQRIVIFDIGKGDRLYHLVIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTE 164
            +GNI+L  S++T++   R     ++ +   + +  P E
Sbjct: 120 DRGNIVLCASDWTIIQPFRRLHFREREIVAGAAYTLPPE 158


>gi|124485365|ref|YP_001029981.1| hypothetical protein Mlab_0540 [Methanocorpusculum labreanum Z]
 gi|124362906|gb|ABN06714.1| protein of unknown function DUF814 [Methanocorpusculum labreanum Z]
          Length = 642

 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 103/341 (30%), Positives = 166/341 (48%), Gaps = 48/341 (14%)

Query: 351 FETFDAALDEFYSKIESQRA-EQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           F TF  AL+ FY K  +++  EQ+ K        K  +I   QE  V    +++  + ++
Sbjct: 258 FATFSQALEAFYPKPVAEKVIEQKIK------LSKEERIRKQQEAAVVNFDKKIAEATEI 311

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
           +E+I  +  +V   I  V  A + ++SW+D+A ++K   K+  P A         +  +S
Sbjct: 312 SEIIYSHYGEVQETI-DVLAAASQKLSWQDIAAVIK---KSDLPAA---------KRIIS 358

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           +   N    +D +EK     KV + +  S  AN  R++ + KK  +K+   + A      
Sbjct: 359 VDPKNASVVIDLQEK----HKVTIFVHESLEANVGRYFAVVKKFRAKKAGALRAMEAGIV 414

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
            AEKK           A      K  W+ +F W  +S+  LVI GR+A QNE +VK+YM 
Sbjct: 415 HAEKKK----------AAGPGRLKPKWYHRFRWMETSDGVLVIGGRNADQNEELVKKYME 464

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW----DSKMVTSAWW 645
             D ++HAD+ GAS+ ++K            ++QA  F   +S+AW     S  V +A  
Sbjct: 465 GKDTFLHADVFGASAVIVKGVTER-------MDQAVQFAASYSRAWAGGGASVDVIAA-- 515

Query: 646 VYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             P+QVSKT  +GEY+  GSF+IRG++      PL +  G+
Sbjct: 516 -SPNQVSKTPESGEYVAHGSFVIRGERKIYKDVPLEIAIGV 555



 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 75/148 (50%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+ ADV A    L  L+ +    +Y     +  F+L          E  + LL +  G+R
Sbjct: 7   MSGADVKAMTAELAALLPLWIGKIYQYDNASLGFRLNGE-------EKARHLLYVVRGIR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H  +        PSGF++ LRK+I   ++ ++ Q   +R+I+   G G + + +I+EL+
Sbjct: 60  AHLVSELPPAPKNPSGFSMYLRKYIEGGKVLNIEQKAIERVIIITIGKGPSEYKLIIELF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
            +GN++LTD +FT++  L   R  D+ +
Sbjct: 120 DEGNLILTDEKFTIINALAQRRFRDRDI 147


>gi|302347972|ref|YP_003815610.1| fibronectin-binding protein [Acidilobus saccharovorans 345-15]
 gi|302328384|gb|ADL18579.1| Predicted fibronectin-binding protein [Acidilobus saccharovorans
           345-15]
          Length = 647

 Score =  126 bits (317), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 59/149 (39%), Positives = 91/149 (61%), Gaps = 5/149 (3%)

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           +SH R+  W+E+++W ++S   L + GRDA QNE +V++ +   DV++HAD+HGA + ++
Sbjct: 417 VSHRRRA-WYERYHWLVTSSGVLAVGGRDADQNESLVRKMLGPNDVFLHADIHGAPAVIL 475

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
                        +++A   T  +S+AW   M + S +W Y  QVSK+ P+GEYLT GSF
Sbjct: 476 MAA-AAGGFTETDVSEAAVLTAAYSRAWKEGMASVSVYWAYGSQVSKSPPSGEYLTKGSF 534

Query: 667 MIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           M+ GKKN+L P  L +  G+   LDE  L
Sbjct: 535 MVYGKKNYLRPLRLELYLGIA--LDEEGL 561


>gi|448730186|ref|ZP_21712496.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
           5350]
 gi|445793917|gb|EMA44482.1| hypothetical protein C449_10386 [Halococcus saccharolyticus DSM
           5350]
          Length = 699

 Score =  126 bits (317), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/372 (25%), Positives = 157/372 (42%), Gaps = 55/372 (14%)

Query: 351 FETFDAALDEFYSKIESQRAEQ---QHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSV 407
           F++F AALD ++  +++   E+   + +   +    +  +I   QE  +   + + D   
Sbjct: 272 FDSFTAALDAYFVALDTTEDEEGGGRERPDFEDDIERQQRIIEQQEGAIEDFEDQADAER 331

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
             AE +  + + VD  +  VR A      W+D+     E    G   A  +D +      
Sbjct: 332 AKAESLYAHYDLVDEILSTVRNAREQGTGWDDIEERFAEGADRGIAAAEAVDGVTPSEGT 391

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           +++ +     E+D      P + VE         NA R Y+  K+   K+E    A    
Sbjct: 392 VTVDIDGRSVELD------PRDGVE--------QNADRLYKEAKRVVGKKEGAEEA---- 433

Query: 528 FKAAEKKTRLQILQEK--------------------------TVANISHMRKVHWFEKFN 561
              AE +  L+ LQ +                          T  +I   +   W+E+F 
Sbjct: 434 --VAETRAELEALQRRRDEWESADENETESTDTDEDEDIDWLTRRSIPVRQNEQWYERFR 491

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL-- 619
           WF +S+ +LV+ GR A QNE +VK+Y+ +GD + H    G   TV+K   P +P   +  
Sbjct: 492 WFRTSDGFLVLGGRSADQNEDLVKKYLERGDRFFHTQARGGPVTVLKATGPSEPTEEVEF 551

Query: 620 ---TLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
              TL +   F V +S  W + +    A+   P QVSKT  +GEYL  G F IRG + + 
Sbjct: 552 SESTLEETAQFAVSYSSVWKNGRFAGDAYMASPDQVSKTPESGEYLEKGGFAIRGDRTYF 611

Query: 676 PPHPLIMGFGLL 687
               + +  G++
Sbjct: 612 RDTAVGVAVGIV 623



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 11/165 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         KL +        +  +V LL+E 
Sbjct: 4   KRELTSVDLAALVTELGTYAGAKLDKAYLYGDDLLRLKLRDF-------DRGRVELLIEV 56

Query: 63  G--VRLHTTA--YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R H        D    P GF   LR  +       V Q G+DR++ F+F       
Sbjct: 57  GETKRAHVVDPDNVPDAPGRPPGFAKMLRNRLSGADFAGVSQFGFDRVLTFEFEREDQNT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT 163
            ++ EL+ +GN+ + D+   V+  L + R   + VA  + + +P+
Sbjct: 117 KIVAELFGEGNVAVLDANDEVVDCLNTVRLQSRTVAPGATYEFPS 161


>gi|429192346|ref|YP_007178024.1| RNA-binding protein [Natronobacterium gregoryi SP2]
 gi|448325749|ref|ZP_21515133.1| Fibronectin-binding A domain-containing protein [Natronobacterium
           gregoryi SP2]
 gi|429136564|gb|AFZ73575.1| putative RNA-binding protein, snRNP like protein [Natronobacterium
           gregoryi SP2]
 gi|445614570|gb|ELY68242.1| Fibronectin-binding A domain-containing protein [Natronobacterium
           gregoryi SP2]
          Length = 710

 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 167/371 (45%), Gaps = 53/371 (14%)

Query: 351 FETFDAALDEFYSKIE------SQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +++F A LD+++ ++E      S   EQ+   +E+ A  K  +I   QE  +   +Q+ +
Sbjct: 281 YDSFLAVLDDYFFRLELEEEDDSDPTEQRPDFEEEIA--KYERIIEQQEGAIEGFEQQAE 338

Query: 405 RSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
           +  + AEL+  EY L  VD  +  VR A      W+++    +E ++ G   A  +  + 
Sbjct: 339 QLREKAELLYAEYGL--VDEVLSTVREAREQDRPWDEIEERFEEGKERGIEAAKAVVDVD 396

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTIT 522
                +++              TL  E VE+ +      NA R Y+  K  E K+E  + 
Sbjct: 397 GSEGTVTV--------------TLDGEHVELAVHDGVEQNADRLYKEAKDIEGKKEGALA 442

Query: 523 AHSKAFKAAE--KKTRLQILQEK------------------TVANISHMRKVHWFEKFNW 562
           A     +  E  K+ R Q   +                   ++ ++       W+++F W
Sbjct: 443 AIEDTREDLEEAKRRRDQWEVDDEDDGDDDEIDEADSKDWLSMPSVPIRENEPWYDRFRW 502

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------V 616
           F +S++YLVI GR+A QNE +VK+Y+  GD   H   HG   TV+K   P +       +
Sbjct: 503 FYTSDDYLVIGGRNADQNEELVKKYLEPGDKVFHTQAHGGPVTVLKATDPSEASSHDIDL 562

Query: 617 PPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + + 
Sbjct: 563 PQTSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYY 622

Query: 676 PPHPLIMGFGL 686
              P+ +  G+
Sbjct: 623 DDTPVGVAVGI 633



 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 56/112 (50%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V L++E G   R HT A  R  D    P  F + LR  +      DV Q  +DRI+ F 
Sbjct: 49  RVELILEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFVDVEQYEFDRILEFI 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|422293271|gb|EKU20571.1| hypothetical protein NGA_2069500, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 107

 Score =  125 bits (315), Expect = 8e-26,   Method: Composition-based stats.
 Identities = 55/83 (66%), Positives = 62/83 (74%)

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           V P+ L +AGC  V  S AW +KMVTSAWWV   QVSKTAP GE+L  GSFM+RGKKNFL
Sbjct: 10  VSPVALQEAGCLAVSRSSAWKAKMVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFL 69

Query: 676 PPHPLIMGFGLLFRLDESSLGSH 698
            P PL MG GLLF+LDE S+G H
Sbjct: 70  APQPLEMGLGLLFKLDEGSVGRH 92


>gi|305663918|ref|YP_003860206.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
 gi|304378487|gb|ADM28326.1| protein of unknown function DUF814 [Ignisphaera aggregans DSM
           17230]
          Length = 667

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/185 (36%), Positives = 107/185 (57%), Gaps = 13/185 (7%)

Query: 513 QESKQEKTITAHSKAF-KAAEKKTRL---------QILQEKTVANISHMRKVHWFEKFNW 562
           Q ++  K I+   K+  +A E+K +L         +IL+EK    +    K  W+EK++W
Sbjct: 395 QYNELRKNISDIEKSIERALEEKVKLMQKINEMNNRILEEKQKVKVKLSLKKEWYEKYHW 454

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
            I+   +LVI GRDA QN  +++R++   D+ +HAD+HGAS+ +IK     + V   TL 
Sbjct: 455 TITPTGFLVIGGRDASQNIQLIRRFLEPNDIVLHADIHGASTVIIKTGG--RDVDEETLM 512

Query: 623 QAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLI 681
           +A     C+S+AW S ++    +WVY  Q+S + PTGEYL  GS+M+ GKKN++    L 
Sbjct: 513 EAATIAACYSKAWKSGLLAIDVFWVYGSQISLSPPTGEYLPKGSYMVYGKKNYIKNVSLK 572

Query: 682 MGFGL 686
           +  G+
Sbjct: 573 LALGI 577


>gi|448399812|ref|ZP_21571045.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
           13563]
 gi|445668265|gb|ELZ20895.1| Fibronectin-binding A domain protein [Haloterrigena limicola JCM
           13563]
          Length = 722

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/400 (25%), Positives = 171/400 (42%), Gaps = 61/400 (15%)

Query: 327 SSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE----DAAF 382
           S  Q+ D   P  L +    +   +ETF  ALD+++ ++E    E+    ++    D+  
Sbjct: 267 SEGQVVD-VTPFPLEEHTDLDSEPYETFLEALDDYFFQLELGEDEEPEPTEQRPDFDSEI 325

Query: 383 HKLNKIHMDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDL 440
            K  +I   Q+  +   +QE D   + AEL+  EY L  VD  +  ++ A      W++ 
Sbjct: 326 AKYERIIEQQQGAIEGFEQEADALREQAELLYAEYGL--VDEILSTIQDARVQDRPWDE- 382

Query: 441 ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPV----EKVEVDLA 496
              ++E  +AG                  +  +  + ++D  E T+ V    E++++ + 
Sbjct: 383 ---IRERFEAGAE--------------QGIEAAEAVVDVDGSEGTVTVDLDGERIDLVVE 425

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV----------- 545
                NA R Y   K+ E K+E  + A     +  E   R +   E T            
Sbjct: 426 QGVEQNADRLYTEAKRVEEKKEGALAAIEDTREDLEDAKRRRDEWEATEREDTSEDGEDE 485

Query: 546 ------------ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
                        +I       WF++F WF +S+ YLVI GR+A QNE +VK+Y+  GD 
Sbjct: 486 ADEAEQRDWLAEPSIPIRENEPWFDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDK 545

Query: 594 YVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWV 646
            +H   HG   TV+K   P +       +P  ++ +A  F V +S  W D +     + V
Sbjct: 546 VLHTQAHGGPVTVLKATDPSEASSSDIELPDSSIEEAAQFAVSYSSVWKDGRYAGDVYAV 605

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
              QV+KT  +GEYL  G F IRG + +    P+    G+
Sbjct: 606 DADQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 645



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RIELILEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGNI +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FEREDGTTRIIVELFGQGNIAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|393796641|ref|ZP_10380005.1| hypothetical protein CNitlB_10052 [Candidatus Nitrosoarchaeum
           limnia BG20]
          Length = 638

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 92/152 (60%), Gaps = 4/152 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK     + +RK +W+E++ WF +S+  L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 415 EKESVTFAEIRKKNWYERYRWFFTSDGILAIGGRDAPSNSAVVRKHLGKNDKIFHGDIFG 474

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++K+   E P PP +LN+    TVC S+AW   M   SA+WV P QV K+AP+G++
Sbjct: 475 SPFFILKD--TENP-PPASLNEVAHATVCFSRAWREGMYGVSAFWVNPEQVKKSAPSGQF 531

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
           L  GSF I G++NF+    L +  GL+ + D+
Sbjct: 532 LPKGSFTIEGQRNFVKISTLKLAVGLMPQGDD 563



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 63/121 (52%), Gaps = 11/121 (9%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       + +  +++  SGV L +    + ++  P+    +
Sbjct: 24  VSNIYGVTKDSILFKLHHTE------KPDIYMMISTSGVWLTS---VKIEQMEPNRLLKR 74

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +++ + Q+  +RI  F F  G +  +VI+ E +  GNILL ++E  +L L  
Sbjct: 75  LRSDLLRLKVKKIEQIASERIAYFTFE-GFDKEFVIVGEFFGDGNILLCNNEMKILALQH 133

Query: 145 S 145
           S
Sbjct: 134 S 134


>gi|67624075|ref|XP_668320.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54659500|gb|EAL38073.1| hypothetical protein Chro.50204 [Cryptosporidium hominis]
          Length = 1375

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 92/154 (59%), Gaps = 15/154 (9%)

Query: 1   MVKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           MVK RM + D+ A V  + + L G +  N+YD++ +TY+FK          G  EK  LL
Sbjct: 1   MVKSRMTSVDICAMVHGISKDLKGQKLINIYDINSRTYLFKF---------GGEEKKFLL 51

Query: 60  MESGVRLHTTAYARDKK-----NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLG 114
           +ESG+R HTT + R+ +     ++ S F  KLR++IR ++L+D+ Q+G DRI+   FG G
Sbjct: 52  VESGIRFHTTQWKRENEHKTSVSSISFFNSKLRRYIRNKKLDDISQMGMDRIVKLTFGFG 111

Query: 115 MNAHYVILELYAQGNILLTDSEFTVLTLLRSHRD 148
            N  Y+I E +  GNI+LTD  + +L +LR   D
Sbjct: 112 DNTFYLIFEFFVAGNIILTDCNYKILVILRDTND 145



 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 25/32 (78%)

Query: 892  ISRGQKGKLKKMKEKYGDQDEEERNIRMALLA 923
            + RG+K KLKK+ +KYG+QD+EER I+M L  
Sbjct: 1168 LPRGKKSKLKKVADKYGEQDDEERKIKMMLFG 1199


>gi|329766254|ref|ZP_08257812.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
           SFB1]
 gi|329137313|gb|EGG41591.1| hypothetical protein Nlim_1602 [Candidatus Nitrosoarchaeum limnia
           SFB1]
          Length = 590

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 57/147 (38%), Positives = 90/147 (61%), Gaps = 4/147 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK    ++ +RK +W+E++ WF +S+  L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 367 EKESVTVAEIRKKNWYERYRWFFTSDGILAIGGRDAPSNSAVVRKHLGKNDKIFHGDIFG 426

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++K+   + P PP +LN+    TVC S+AW   M   SA+WV P QV K+AP+G++
Sbjct: 427 SPFFILKD--VDNP-PPASLNEVAHATVCFSRAWREGMYGVSAFWVNPEQVKKSAPSGQF 483

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           L  GSF I G++NF+    L +  GL+
Sbjct: 484 LPKGSFTIEGQRNFVKISTLKLAVGLM 510


>gi|21227916|ref|NP_633838.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
 gi|20906336|gb|AAM31510.1| hypothetical protein MM_1814 [Methanosarcina mazei Go1]
          Length = 343

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 110/207 (53%), Gaps = 5/207 (2%)

Query: 502 NARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFN 561
           NA+ +YE  KK   K++  I A     KA EKK   +  +       S  RK HW+++F 
Sbjct: 4   NAQEYYEKVKKFTKKKDGAIRAIEDTKKAMEKKAATKSAKAGRKLQAS--RKKHWYDRFR 61

Query: 562 WFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTL 621
           WF+SS+ +LV+ GRDA  NE I K+YM K D+  H    GA  TV+K    E  VP  TL
Sbjct: 62  WFVSSDGFLVVGGRDADTNEEIFKKYMEKRDIVFHTQTPGAPLTVVKTGGKE--VPDSTL 119

Query: 622 NQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPL 680
            +   F V +S  W +   +   +W+   QV+KT  +GEYL  G+F+IRG++N+    PL
Sbjct: 120 QEVSQFAVSYSSLWKAGQFSGDCYWIKSEQVTKTPESGEYLKKGAFVIRGERNYFKDVPL 179

Query: 681 IMGFGLLFRLDESSLGSHLNERRVRGE 707
            +  GL  + +   +G   +  R  G+
Sbjct: 180 GIAVGLELKGETRIIGGPASAVRKHGD 206


>gi|154150873|ref|YP_001404491.1| hypothetical protein Mboo_1330 [Methanoregula boonei 6A8]
 gi|153999425|gb|ABS55848.1| protein of unknown function DUF814 [Methanoregula boonei 6A8]
          Length = 631

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 176/362 (48%), Gaps = 42/362 (11%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
           P++L +   ++  +F  F  AL+ FY   ++++ +   + K      +  +I   QE  +
Sbjct: 242 PVVLAENAPQDENQFAGFSDALEVFYPMTKAEKVKVAARPK----LSEGERIRKYQEAAI 297

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
               ++V ++ ++   I  N   +   I ++  A + R+SW+++   +K+          
Sbjct: 298 KKFDEKVAKAEEVVAAIYENYPFISQVITSL-AAASKRLSWQEIEHHLKDTSSTD----- 351

Query: 457 LIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESK 516
                   +   +        E+D  +K      V++ +  +   NA  +Y+  KK + K
Sbjct: 352 -------AKRITAFFPGEAAVEVDIGKK------VKIFVHETVEQNAGHYYDQIKKFKKK 398

Query: 517 QEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
           +E  + A          K R ++++     +I  M+K+ W+ +F WFI+S+  +V+ GRD
Sbjct: 399 KEGALLAMKTV------KPRKKVIRH----DIVPMKKL-WYHRFRWFITSDGVVVLGGRD 447

Query: 577 AQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWD 636
           A QNE +VK+YM+ GD++VHAD+HGAS  ++K    +       +++   F   +S AW 
Sbjct: 448 AGQNEELVKKYMTGGDLFVHADVHGASVVIVKGKTEK-------MDEVAQFAASYSGAWR 500

Query: 637 SKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
           S   T+  +   P QVSKT   GE++  GSF++RG++ +    PL +G GL+     + +
Sbjct: 501 SGHFTADVFSAQPTQVSKTPQAGEFVARGSFIVRGERTYYRDVPLSVGIGLVLEPYAAVI 560

Query: 696 GS 697
           G 
Sbjct: 561 GG 562



 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 64/109 (58%), Gaps = 1/109 (0%)

Query: 46  GVTESGESE-KVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
           G+  +GE+  K LLL+E+G R H    A +    P  F + LRK++   ++  +RQ G +
Sbjct: 39  GIRLNGEAHAKYLLLIEAGRRAHLVKNAPEPPKNPPQFAMFLRKYLTGGKVLAIRQHGLE 98

Query: 105 RIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           RI++F  G G   + +I+EL+ +GN++L D  + ++  LR HR  D+ +
Sbjct: 99  RILIFDIGKGALTYRLIIELFDEGNVILADEAYRIIKPLRHHRFKDRDI 147


>gi|68062538|ref|XP_673276.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56491007|emb|CAH97640.1| hypothetical protein PB000420.02.0 [Plasmodium berghei]
          Length = 423

 Score =  124 bits (311), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 100/156 (64%), Gaps = 9/156 (5%)

Query: 1   MVKVRMNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M K R+   D+ A +  C   +IG   +N+Y++S K Y+ K         S + +K  LL
Sbjct: 1   MGKQRLTALDIRAIITSCKNSIIGSVVTNIYNISNKIYVLKC--------SKKEQKYFLL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E+  R+H T + R+K   PSGFT+KLRKH+R+R++ ++ QLG DR+I  QFG   N ++
Sbjct: 53  VEAEKRVHITEWVREKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVIDIQFGYDDNVYH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAI 155
           +I+ELY  GNI+LT++++ ++ +L+S+ D+ K + I
Sbjct: 113 LIVELYIAGNIILTNNDYKIIFILKSNDDNKKNLKI 148


>gi|50293495|ref|XP_449159.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528472|emb|CAG62129.1| unnamed protein product [Candida glabrata]
          Length = 1031

 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 111/206 (53%), Gaps = 12/206 (5%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--- 547
           V +DL LSA+ANA  ++ +KK    KQ+K      KA K  E K   Q LQ+K   +   
Sbjct: 516 VAIDLGLSAYANASTYFNMKKDHAEKQKKVEKNIEKAMKNIEDKIGKQ-LQKKLKESHDV 574

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
           +  +RK ++FEK+ WF S+E +LV+ G+   + + I  RY+   D+++       +   I
Sbjct: 575 LKKIRKPYFFEKYFWFYSTEGFLVMLGKSNVETDQIYSRYIEDDDIFMSNSFD--TKVWI 632

Query: 608 KNHRPEQ-PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPT-GEYLTVGS 665
           KN  PE+  VPP TL QAG   +  S AW  K+ +S WW +   V+K     G  L  G 
Sbjct: 633 KN--PERVEVPPNTLMQAGILCMSASPAWQKKIASSPWWCFAKNVTKFDDVDGSVLAPGV 690

Query: 666 FMIRGKK--NFLPPHPLIMGFGLLFR 689
           F +R +K  N LPP  L+MG G +++
Sbjct: 691 FRLRNEKQINMLPPAQLVMGVGFMWK 716



 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 120/490 (24%), Positives = 229/490 (46%), Gaps = 63/490 (12%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKV 56
           +K R++  D+   A E+K    L G R SN+Y++  S + ++ K         +    K 
Sbjct: 1   MKQRISALDLQILAVELKS--ALEGFRLSNIYNIADSSRQFLLKF--------NKPDSKA 50

Query: 57  LLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
            ++++ G+R+H T + R    TPSGF +KLRKH++++RL  +RQ+  DRI++ +F  G+ 
Sbjct: 51  NVVVDCGLRIHLTEFNRPVPPTPSGFVVKLRKHLKSKRLTALRQVTGDRILVLEFADGL- 109

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASK 176
             Y++LE ++ GN++L D E  +L L R   + +  V          E+  +F+ TT  +
Sbjct: 110 -FYLVLEFFSAGNVILLDHERKILALQRIVHEHENKVG---------EVYNMFDETTFDE 159

Query: 177 LHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
            +   T  +       + VN   N      K  L          LS+N +KN       K
Sbjct: 160 -NMNDTQDERERTYSLELVNSWMNECETKFKSELSI--------LSQNESKN-------K 203

Query: 237 QPTLKTVLGEALGYGPALSEHIILDT----GLVPNMKLSEVNKLEDNAIQVLVLAVAKFE 292
           +  + ++    L   P LS  ++       G  P+    E    +D  + +L+    ++ 
Sbjct: 204 KVKVMSIHKLLLSKVPHLSSDLLSKNLRIHGFNPSSSCLEYIGKKDEILNLLLETEKEY- 262

Query: 293 DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLL-----LNQFRSR 346
              +++++ D    GYI+ +   L K   P   G   + IY+ F P +      ++ +S+
Sbjct: 263 ---KNLLNAD-EKTGYIIAKKNPLYKIDTP---GYDLEYIYENFHPFIPHIPATDEDKSK 315

Query: 347 EFVKFE-TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDR 405
             +K E  ++  LD+F+S IES +   + + +E  A  K+     + + R+  L+++   
Sbjct: 316 -VIKIEGDYNKTLDDFFSTIESSKYALKIQNQEQQAKQKIEAARQENKKRIDALREQQAS 374

Query: 406 SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLE 464
           +     L+  N++ V+    AV   +  +M W  + ++++ E+  GN +A  +   L L+
Sbjct: 375 NETKGNLLIANVDLVEEVKSAVLGLVNQQMDWNTIEKLIQSEQNKGNKIAKHVSLPLDLK 434

Query: 465 RNCMSLLLSN 474
            N + +LL N
Sbjct: 435 NNKIKILLPN 444


>gi|156844590|ref|XP_001645357.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156116018|gb|EDO17499.1| hypothetical protein Kpol_1058p36 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 1019

 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/210 (33%), Positives = 115/210 (54%), Gaps = 10/210 (4%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--I 548
           V +DL LSA+ANA +++ +KK    KQ+K      KA K  E++   Q+ ++   ++  +
Sbjct: 519 VTIDLGLSAYANASQYFSIKKTSVEKQKKVEKNAEKAMKNIEERVSQQLKKKLKESHEVL 578

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +RK ++FEK+ WFISSE +LV+ G+   + + I  +Y+   DV+    +  A  T + 
Sbjct: 579 KKIRKPYFFEKYFWFISSEGFLVMMGKSELETDQIYSKYIENDDVF----MQNAFGTQVW 634

Query: 609 NHRPEQP-VPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSK-TAPTGEYLTVGSF 666
              P+   +PP TL QAG F +  S+AW  K+  S  W Y   +SK  + T   L  G F
Sbjct: 635 IKNPDMTEIPPNTLMQAGIFCMSASEAWSKKIAASPRWCYARNISKFDSTTNTLLPRGRF 694

Query: 667 MIRGKKNF--LPPHPLIMGFGLLFRLDESS 694
            ++ +K+   LPP  L+MGFG  +++   S
Sbjct: 695 ALKDEKSMIHLPPAQLVMGFGFAWKVKTES 724



 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 117/487 (24%), Positives = 225/487 (46%), Gaps = 57/487 (11%)

Query: 2   VKVRMNTADVAAEVKCLRRLI-GMRCSNVYDL--SPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+      LR+ + G R +NVY++  S + ++ K   S          K+ +
Sbjct: 1   MKQRVSALDILLLGNELRQEVEGYRLTNVYNIAESSRQFLLKFNKSDS--------KINV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R     PSGF +KLRKH++ +RL   RQ+  DRI++ QF  G+  +
Sbjct: 53  VVDCGLRIHKTDFTRPIPPAPSGFVVKLRKHLKAKRLTGFRQVKNDRILVLQFADGL--Y 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           Y++LE ++ GN++L D    +L+L R  ++            Y  ++   +E    S L 
Sbjct: 111 YLVLEFFSAGNVILLDENRKILSLQRIVQE------------YGNKVGEAYEMFDES-LF 157

Query: 179 AALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQP 238
           A + ++ E    E D + E  N +     +    +   +S  L +  NK      + K+ 
Sbjct: 158 AEIGNTTE---KELDYLKEYNNEMVREWIDEALAKFKLESSHLLQEENK-----GQHKKV 209

Query: 239 TLKTVLGEALGYGPALSEHII----LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
            + ++    L   P LS  +I       G+ P+    E +   D+ + +L    ++F++ 
Sbjct: 210 KVMSIAKLLLNKEPHLSSDLISKNLKKNGINPSSSSLEYSDKIDDLVNILNATTSEFKEL 269

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQF-RSREFVKFE 352
           L +         GYIL +     +++ P +    T+ IY+ F P     F  S++  K +
Sbjct: 270 LNNDEKC-----GYILAKK---NENYNPEKHSPDTEFIYETFHP--FEPFVESKDLEKTK 319

Query: 353 T------FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
                  ++  LD+F+S IES +   + + +E  A  KL+   ++ E R+  L      +
Sbjct: 320 IIEIPGDYNKTLDQFFSTIESSKYSLRIQNQELQAKKKLDDAKLENERRIQALVDVQTSN 379

Query: 407 VKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLID-KLYLER 465
            +   LI  +   ++    AV+  +  +M W  +  ++  E+K GN +A  +   L L+ 
Sbjct: 380 EQKGHLIIAHSNLIEEVKFAVQGLIDQQMDWNTIENLIGSEQKKGNKIAQKVKLPLKLKN 439

Query: 466 NCMSLLL 472
           N + ++L
Sbjct: 440 NKIDVIL 446


>gi|410730361|ref|XP_003671360.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
 gi|401780178|emb|CCD26117.2| hypothetical protein NDAI_0G03400 [Naumovozyma dairenensis CBS 421]
          Length = 1037

 Score =  123 bits (308), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 74/226 (32%), Positives = 122/226 (53%), Gaps = 9/226 (3%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVAN--I 548
           V +DL  SA+ANA  ++  KK    KQ++      KA K  E+K   Q+ ++   ++  +
Sbjct: 529 VTIDLGFSAYANASEYFNAKKTSAEKQKRVEKNIEKAMKNIEEKVNTQLKKKLKESHEVL 588

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
             +R  ++FEK++WFISSE YLV+ G++  + + I  +Y+   DV++  +    +   IK
Sbjct: 589 KKIRTPYFFEKYHWFISSEGYLVMMGKNDAETDQIYSKYIEDDDVFMSNNF--GTKVWIK 646

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGE-YLTVGSFM 667
           N    + VPP TL QAG   +  S+AW  K+ +SAWW     V+K     +  L  G F+
Sbjct: 647 NPMKHE-VPPNTLMQAGILCMSSSEAWSKKIASSAWWCNAKNVTKFDKFDKSVLPPGVFV 705

Query: 668 IRGKK--NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGM 711
           ++ +K  N LP   L+MG G L+++  S  G   + +   GE+E +
Sbjct: 706 LKDEKDQNTLPASQLVMGLGFLWKVKTSDNGDE-DVKEFEGEQEEL 750



 Score =  106 bits (264), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 129/509 (25%), Positives = 232/509 (45%), Gaps = 75/509 (14%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R++  D+   AAE+K    L G R +N+Y+ S     F L  +          K+ +
Sbjct: 1   MKQRISALDLQILAAELKT--SLEGYRLNNIYNASDSNRQFLLRFNKP------DSKLNV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R   + PSGF +KLRKH++++RL  +RQ+  DRI++ QF  G+   
Sbjct: 53  IVDCGLRIHLTEFTRPIPSAPSGFVMKLRKHLKSKRLTALRQVKNDRILVLQFADGL--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLH 178
           +++LE ++ GN++L D    +++L R              H +   I   +     S  H
Sbjct: 111 FLVLEFFSAGNVILLDENRKIMSLQR------------IVHEHENIIGETYTMFDESLFH 158

Query: 179 AALTSSKEPDANEPDKVNEDGNN--VSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAK 236
            A       D N  +  N+D +   V N   E         S  L  + N  S+   + K
Sbjct: 159 TA------DDTNATNITNKDFSEGLVKNWLDEVKQKYAVAASTILETSKNDKSHQKKKIK 212

Query: 237 QPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEV----------NKLEDNAIQVLVL 286
             ++  +L   L   P LS  +     L  N+K+S++          N+++D  I++L  
Sbjct: 213 VMSIHKLL---LSKEPHLSSDL-----LSKNLKMSKIDPSTSALDFENRVDD-IIKLLNT 263

Query: 287 AVAKFEDWLQDVISGDIVPEGYIL-MQNKHLGKDHPPTESGSSTQ-IYDEFCPLLLNQFR 344
             +++   L D    +    GYIL  +NK+    +P  +S    + IY+ F P    +  
Sbjct: 264 TESEYHQLLND----NEHRVGYILDHENKNF---NPKIDSNPDLEFIYETFHPF---EPY 313

Query: 345 SREFVKFET--------FDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRV 396
             E  K  +        ++  LD+F+S IES +   + + +E  A  KL++  +D + ++
Sbjct: 314 VEEKDKASSHISEIPGYYNKTLDKFFSTIESSKYALRIQNQELQAKKKLDEAKLDNQKKL 373

Query: 397 HTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAG 456
             L      + +   LI  N + V+ A  A++  +  +M W  + +++K E+K    +A 
Sbjct: 374 QALIDVQSSNEEKGHLIVANADLVEEAKSAIQGLVDQQMDWNTIEKLIKSEQKKHVKIAE 433

Query: 457 LID-KLYLERNCMSLLLSNNLDEMDDEEK 484
           LI   L L+ N   + L   L   DD+E+
Sbjct: 434 LIVLPLNLKENKFKMKLP--LKTFDDDEQ 460


>gi|340345857|ref|ZP_08668989.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
 gi|339520998|gb|EGP94721.1| RNA-binding protein [Candidatus Nitrosoarchaeum koreensis MY1]
          Length = 638

 Score =  123 bits (308), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 59/157 (37%), Positives = 94/157 (59%), Gaps = 4/157 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK   + + +RK +W+E++ WF +S+  L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 415 EKDSISFTEIRKKNWYERYRWFFTSDGILAIGGRDAPSNSAVVRKHLEKNDKIFHGDIFG 474

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++KN   + P P  +LN+    TVC S+AW   M   SA+WV P QV K+AP+G++
Sbjct: 475 SPFFILKN--ADNP-PTASLNEVAHATVCFSRAWREGMYGVSAFWVNPEQVKKSAPSGQF 531

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGS 697
           L  GSF I G++NF+    L +  G++ + D+  L S
Sbjct: 532 LPKGSFTIEGQRNFVKISTLKLAVGIIPQGDDYVLTS 568



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 98/218 (44%), Gaps = 21/218 (9%)

Query: 26  CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLK 85
            SN+Y ++  + +FKL ++       +S+  ++L  SGV L  T+   D+   P+    +
Sbjct: 24  VSNIYGVTKDSILFKLHHTE------KSDLFMMLSTSGVWL--TSVKIDQME-PNRLLKR 74

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL-ELYAQGNILLTDSEFTVLTLLR 144
           LR  +   +++ + Q+  +RI  F F  G +  YVI+ E + +GNILL ++E  +L L  
Sbjct: 75  LRSDLLRLKIKKIEQIASERIAYFTFA-GFDKEYVIVAEFFGEGNILLCNNEMKILALQH 133

Query: 145 S----HRDDDKGVAIMSRHRYPTEICRV----FERTTASKLHAA--LTSSKEPDANEPDK 194
           S    HR    G+          ++ +V    FE    S L AA  L  +        + 
Sbjct: 134 SIDVRHRKLGVGLVYAPPPLNGIDVIKVTENDFEELKTSDLAAAKWLGRTLGLPKKYVEG 193

Query: 195 VNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDG 232
           + E  N  S     NL  ++  K +D +KN   N   G
Sbjct: 194 IFEMSNVDSKCVGTNLTSEQIKKLYDTTKNIVTNVVTG 231


>gi|390937875|ref|YP_006401613.1| putative RNA-binding protein [Desulfurococcus fermentans DSM 16532]
 gi|390190982|gb|AFL66038.1| putative RNA-binding protein, snRNP like protein [Desulfurococcus
           fermentans DSM 16532]
          Length = 659

 Score =  122 bits (307), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 181/370 (48%), Gaps = 58/370 (15%)

Query: 330 QIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIH 389
           +IY  + P L ++   +     +  + A+D ++++ E   A   ++A+ +    KL +I 
Sbjct: 250 EIYTSYEPRLFSEVYDKTVKPLDDINTAIDVYFTEYE---AYLDYQARMEEVTEKLREI- 305

Query: 390 MDQENRVHTLKQEVDRSVKMAELI-EYN--LEDVDAAILAVRVALANRMSWEDLARMVKE 446
              E R+   +QE        E+I EYN  +E++++ +  +    +N    E++    +E
Sbjct: 306 ---EARIK--RQE--------EIIAEYNNEIENIESILQTI---YSNYHVAEEILECARE 349

Query: 447 --ERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHAN-A 503
             E+K    +A           C      N + E+  ++  + V+  E  L LS   + +
Sbjct: 350 TREKKGWEHIA---------EEC------NGVIEVRKDKGVIVVKLGEKTLELSIREDLS 394

Query: 504 RRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQI---LQEKTVANISHMRKVHWFEKF 560
           R+  EL++K+     KT +A     +  ++   + I    +EKT+   S      W+E+F
Sbjct: 395 RQVIELERKRGELVRKTESAKKVLEEMHQQLNTISISMNTEEKTIRKPS---PTFWYERF 451

Query: 561 NWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN---HRPEQPVP 617
           +W  +   +L I GRD  QNE++V++Y+ + DV++HAD+HG S+ V+K+   H  E  V 
Sbjct: 452 HWLFTRNGFLAIGGRDQSQNELVVRKYLGENDVFIHADIHGGSAVVLKSGGAHSLEDVV- 510

Query: 618 PLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLP 676
                 A     C+S+AW +       +WV   QVSKT P GEYL  G+FM+ G KN+L 
Sbjct: 511 -----DASYLAACYSKAWKAGFSYIEVYWVSGRQVSKTPPPGEYLPRGAFMVYGSKNYLQ 565

Query: 677 PHPLIMGFGL 686
             PL +G G+
Sbjct: 566 V-PLRLGIGV 574



 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 67/137 (48%), Gaps = 15/137 (10%)

Query: 1   MVKVRMNTADVAAEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           ++K  M+  D+ + V     +I G    N Y      +I KL    GV         ++ 
Sbjct: 5   LLKKAMDILDIYSWVNKYSSVITGCLIDNAYHYK-SYWILKLRCREGVY--------IVK 55

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGL--GMNA 117
           +E GVR+H +    ++K+   GFT  LR  IR  R+  ++Q  ++RIILF+  +   +  
Sbjct: 56  IEPGVRMHLSQSHPEEKDI-DGFTRFLRSRIRDSRITSIKQPWWERIILFETSIHDKILR 114

Query: 118 HYVILELYAQGNILLTD 134
           HYV  EL  +G  ++TD
Sbjct: 115 HYV--ELLPRGQWIITD 129


>gi|448346455|ref|ZP_21535340.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
           12890]
 gi|445632658|gb|ELY85869.1| Fibronectin-binding A domain protein [Natrinema altunense JCM
           12890]
          Length = 715

 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 97/377 (25%), Positives = 166/377 (44%), Gaps = 52/377 (13%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIH 389
           +  P  L +    E   +++F +ALD ++ ++E    E+     +   F     K  +I 
Sbjct: 266 DVTPFPLEEHDDLEGEPYDSFLSALDAYFFRLELAEEEEPDPTDQRPDFESEIAKHERII 325

Query: 390 MDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
             Q+  +   +QE     + AEL+  EY L  VD  +  ++ A     SW+D+    +E 
Sbjct: 326 EQQQGAIEGFEQEAASLREQAELLYAEYGL--VDDILSTIQGARERERSWDDIRERFEEG 383

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
            + G   A  I  +      +++       E+DDE       ++++D       NA R Y
Sbjct: 384 AEQGIDAAAAIVDIDGSDGTVTV-------EIDDE-------RIDLDAQQGVEQNADRLY 429

Query: 508 ELKKKQESKQEKTITA--HSKAFKAAEKKTRLQILQEKTV-------------------- 545
              K+ E K++  + A   ++   A  K+ R +   +++                     
Sbjct: 430 TEAKRVEEKKDGALAAIEDTRQDLADAKRRRDEWEADESGGGDDDETDEDGDDLPRDWLS 489

Query: 546 -ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASS 604
            ++I       WF++F WF +S+ +LVI GR+A QNE +VK+Y+  GD  +H   HG   
Sbjct: 490 ESSIPIRENEPWFDRFRWFNTSDGFLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPV 549

Query: 605 TVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPT 657
           TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QVSKT  +
Sbjct: 550 TVLKATDPSEASSSDIDLPESSIAEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVSKTPES 609

Query: 658 GEYLTVGSFMIRGKKNF 674
           GEYL  G F IRG + +
Sbjct: 610 GEYLEKGGFAIRGDRTY 626



 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 70/162 (43%), Gaps = 7/162 (4%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V  L    G +    Y         K+ +     + G  E +L + E 
Sbjct: 4   KRELTSVDLAALVGELGAYEGAKVDKAYLYGDDLVRLKMRD----FDRGRMELILEVGEV 59

Query: 63  GVRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
             R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F        +
Sbjct: 60  K-RAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTTRI 118

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 119 IVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|432330923|ref|YP_007249066.1| putative RNA-binding protein, snRNP like protein [Methanoregula
           formicicum SMSP]
 gi|432137632|gb|AGB02559.1| putative RNA-binding protein, snRNP like protein [Methanoregula
           formicicum SMSP]
          Length = 630

 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 96/364 (26%), Positives = 163/364 (44%), Gaps = 53/364 (14%)

Query: 340 LNQFRSREFVKFETFDAALDEFY--SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVH 397
           +N     E   + TF  AL+ FY  +K E +   +   AKED       +I   Q+  + 
Sbjct: 243 INLRTGEETTAYPTFSLALEAFYPMTKAEKKATSRPKIAKED-------RIRSHQQAAI- 294

Query: 398 TLKQEVDRSVKMAELIE---YNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPV 454
              ++ DRS+  AE +    Y      A ++    A +   SW+++ + +   R A +  
Sbjct: 295 ---KKFDRSIAQAEEVVNAIYENYPFIAQVIGTLAAASKTHSWQEIEKRI---RAAPSEE 348

Query: 455 AGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQE 514
              I   +     + + L   +    +E               S   NA  +Y++ KK +
Sbjct: 349 TKKITAFFPGEAAVEIDLGKRIKVFVNE---------------SVEQNAGHYYDVIKKFK 393

Query: 515 SKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISG 574
            K+   +TA        + K R  +  +K            W+ +F WFI+S+  +V+ G
Sbjct: 394 KKKAGAVTAMETVATKKQTKRREFVPLKK-----------QWYHRFRWFITSDGAVVLGG 442

Query: 575 RDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQA 634
           RDA QNE +VK+YM+ GD +VHAD+HGAS  ++K            +++   F   +S A
Sbjct: 443 RDATQNEELVKKYMAGGDTFVHADVHGASVVLVKGKTER-------MDEVARFAASYSGA 495

Query: 635 WDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           W S   ++  +   P QVSKT   GE+++ GSF++RG++ +    PL  G GL+     +
Sbjct: 496 WRSGHFSADVYSALPSQVSKTPEAGEFVSRGSFIVRGERTYYRNIPLSTGIGLMLDPHAA 555

Query: 694 SLGS 697
            +G 
Sbjct: 556 VIGG 559



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 9/149 (6%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M+  DV A    L+  + +    VY    KT         G+  +GE++ K LL +ESG 
Sbjct: 7   MSGIDVRAMTCELQEKLPLWIDKVYQFDTKTL--------GIRLNGENKAKYLLFIESGR 58

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H  A   +    P  F + LRKH+   ++  +RQ G +R+++F  G G     +I+EL
Sbjct: 59  RAHLVADLPEPPKNPPHFAMLLRKHLSGGKVLSIRQHGLERVLIFAIGKGTTVFNLIIEL 118

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
           +  GN++L D   T++  L  HR  D+ V
Sbjct: 119 FDNGNVILADDTMTIIKPLWHHRFKDREV 147


>gi|386874769|ref|ZP_10116995.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
           BD31]
 gi|386807392|gb|EIJ66785.1| hypothetical protein BD31_I0230 [Candidatus Nitrosopumilus salaria
           BD31]
          Length = 539

 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 88/147 (59%), Gaps = 4/147 (2%)

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           EK +  +S +RK +W+E++ WF +S+ +L I GRDA  N  +V++++ K D   H D+ G
Sbjct: 306 EKDLIVVSEIRKKNWYERYRWFFTSDGFLAIGGRDAASNSAVVRKHLVKKDKIFHGDIFG 365

Query: 602 ASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEY 660
           +   ++K        P  ++N+    TVC S+AW   M   SA+WV P QV K+AP+GE+
Sbjct: 366 SPFFILKEA---DNAPDKSMNEVAHATVCFSRAWREGMYGVSAYWVNPEQVKKSAPSGEF 422

Query: 661 LTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           L  GSF I G++NF+    L +  G++
Sbjct: 423 LPKGSFTIEGQRNFIKSDTLRLAVGII 449


>gi|297736764|emb|CBI25965.3| unnamed protein product [Vitis vinifera]
          Length = 1266

 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/173 (45%), Positives = 105/173 (60%), Gaps = 17/173 (9%)

Query: 713 DFEDSGHHKENSDIESEKDDTDEK---------------PVAESLSVPNSAHPAPSHTNA 757
           DFE++   K NSD ESEK++TDEK               P+ E  S  +SAH   + +N 
Sbjct: 28  DFEENESLKGNSDSESEKEETDEKRTAESKSIMDPPTHQPILEGFSEISSAHNELTTSNV 87

Query: 758 SNVDSHEFPAEDKTISNGIDSK-IFDIARNVAAPVTPQLEDLIDRALGLGSASISSTKHG 816
            +++  E P E++ + NG DS+ I DI+    + V PQLEDLID AL LGS + S  K+ 
Sbjct: 88  GSINLPEVPLEERNMLNGNDSEHIDDISGRHVSSVNPQLEDLIDWALELGSNTASGKKYA 147

Query: 817 IETTQFDLSEEDKHVERTATVRDKPYISKAERRKLKKGQGSSVVDPKVEREKE 869
           +ET+Q DL E+  H +R A VR+KPYISKAERRKLKKGQ +S  D   +  KE
Sbjct: 148 LETSQVDL-EDHNHEDRKAKVREKPYISKAERRKLKKGQKTSTSDAGGDHGKE 199


>gi|433590765|ref|YP_007280261.1| putative RNA-binding protein, snRNP like protein [Natrinema
           pellirubrum DSM 15624]
 gi|448331831|ref|ZP_21521081.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
           15624]
 gi|433305545|gb|AGB31357.1| putative RNA-binding protein, snRNP like protein [Natrinema
           pellirubrum DSM 15624]
 gi|445628400|gb|ELY81707.1| Fibronectin-binding A domain protein [Natrinema pellirubrum DSM
           15624]
          Length = 721

 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 162/375 (43%), Gaps = 60/375 (16%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRS 406
           +++F +ALD+++ ++E    E+     +   F     K  +I   Q+  +   +QE ++ 
Sbjct: 291 YDSFLSALDDYFFRLELAEEEEPDPTDQRPDFESEIAKHERIIEQQQGAIEGFEQEAEQL 350

Query: 407 VKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYL 463
            + AEL+  EY L  VD  +  V+ A     +W+++    +E    G   A  +ID    
Sbjct: 351 RERAELLYAEYGL--VDEILSTVQGAREQDRAWDEIRERFEEGADRGIAAAEAVID---- 404

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPV----EKVEVDLALSAHANARRWYELKKKQESKQEK 519
                          +D  E T+ V    E++E+        NA R Y   K+ E K+E 
Sbjct: 405 ---------------VDGSEGTVTVDLDGERIELVADRGVEQNADRLYTEAKRVEDKKEG 449

Query: 520 TITAHSKAFKAAEKKTRLQILQEKTVA---------------------NISHMRKVHWFE 558
            + A     +  E   R +   E   A                     +I       WF+
Sbjct: 450 ALAAIENTREDLEDAKRRRDEWEAKDAASDDEDEADDEGPNRDWLADPSIPIRENEPWFD 509

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP--- 615
           +F WF +S++YLVI GR+A QNE IVK+Y+  GD  +H   HG   TV+K   P +    
Sbjct: 510 RFRWFHTSDDYLVIGGRNADQNEEIVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSS 569

Query: 616 ---VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
              +P  ++ +A  F V ++  W D +     + V   QVSKT  +GEYL  G F IRG 
Sbjct: 570 DIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDADQVSKTPESGEYLEKGGFAIRGD 629

Query: 672 KNFLPPHPLIMGFGL 686
           + +    P+    G+
Sbjct: 630 RTYYRDTPVGAAVGI 644



 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RLELILEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|76156171|gb|AAX27403.2| SJCHGC07504 protein [Schistosoma japonicum]
          Length = 170

 Score =  121 bits (303), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 63/144 (43%), Positives = 94/144 (65%), Gaps = 7/144 (4%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K+   + DV   +  ++ +++G R  NVYD+  KTY+ KL ++         EK +LL+
Sbjct: 11  MKLLFTSYDVMVSISEIKNQILGHRVINVYDVDNKTYLLKLASTK------SDEKTILLL 64

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R+H T Y   K   PSGF++KLRKHIR +++ DV Q+G DR++  Q G   +A+++
Sbjct: 65  ESGSRIHITDYDWPKNMMPSGFSMKLRKHIRNKKIVDVCQIGADRVVDIQIGYESSAYHL 124

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           ILELY +GN+LLTD  FT+L LLR
Sbjct: 125 ILELYDRGNMLLTDDTFTILHLLR 148


>gi|88601740|ref|YP_501918.1| hypothetical protein Mhun_0437 [Methanospirillum hungatei JF-1]
 gi|88187202|gb|ABD40199.1| protein of unknown function DUF814 [Methanospirillum hungatei JF-1]
          Length = 627

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 161/338 (47%), Gaps = 45/338 (13%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMA 410
           F++F+AAL  FY       A    K +E     + ++I   QE  +   ++ + R+ ++A
Sbjct: 254 FDSFNAALAAFYPV-----APPVKKQEEKIRVSREDRIRHQQEEAIVKFEKNITRNEELA 308

Query: 411 ELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSL 470
            L+      V   I  +  A   R SW+++  ++K++                 +  + +
Sbjct: 309 ALLYEEYGFVSEIITTLSKAAETR-SWQEIEAILKKDTSGAG------------KKIIRI 355

Query: 471 LLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKA 530
             +    E+D          V+V +  +   NA R+Y+  KK + K      A  +  + 
Sbjct: 356 FPAEAAVELDLGRP------VKVFVHETIDQNAGRYYDQVKKFKKKLAGAKAAMEREVQQ 409

Query: 531 AEKKTRLQILQEKTVANISHMR-KVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
           A  +TR           + + R K  WF++F WF +S+  LVI GRDA QNE ++++Y+ 
Sbjct: 410 A--RTR----------KVQYQRPKKRWFDRFRWFYTSDQVLVIGGRDAGQNEELIRKYLE 457

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYP 648
            GD +VHAD+HGAS  V+K    +       +++   F   +S AW +   ++  +   P
Sbjct: 458 GGDTFVHADVHGASVVVVKGKTKD-------MDEVARFAAAYSGAWRAGFASADVYAARP 510

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
            QVSKTA +GEYL+ GSF++RG++ +    PL +  GL
Sbjct: 511 DQVSKTAESGEYLSRGSFVVRGERQWFHDVPLEVVIGL 548



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 7/148 (4%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M+  D+      + RL+ +    VY    +  IF+L        S    KV +L+E G R
Sbjct: 7   MSGLDLITVTDEITRLLPLWVHKVYLDENRLCIFRL-------NSKNQGKVNILIEPGRR 59

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
            H  +   +    P  F + LRK++   R++ +RQ G  R ++          ++I+E++
Sbjct: 60  FHCVSTLPEMPQIPPAFAMFLRKYLAGGRVDGIRQQGLQRTVIIDIRKSEQLFHLIVEVF 119

Query: 126 AQGNILLTDSEFTVLTLLRSHRDDDKGV 153
             GNI+L   + T++  L  HR  D+ V
Sbjct: 120 DDGNIILCGEDMTIIQPLTRHRFKDRDV 147


>gi|118577090|ref|YP_876833.1| RNA-binding protein [Cenarchaeum symbiosum A]
 gi|118195611|gb|ABK78529.1| RNA-binding protein [Cenarchaeum symbiosum A]
          Length = 631

 Score =  120 bits (302), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 106/200 (53%), Gaps = 5/200 (2%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           EK+ VD   S H+ A   ++  K+Q           +KA K  +   R    Q  +V + 
Sbjct: 355 EKISVDPRSSIHSAASSLFDEAKRQSGAVPAIEKLRAKAAKELDALRRDSEEQAASV-SF 413

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
           + +R+  W+E++ WF +++  L + GRD+  N  I++R++   D   HAD  G+   ++K
Sbjct: 414 TKVRRKSWYERYRWFFTTDGSLAVGGRDSSSNTSIIRRHLDANDRVFHADTFGSPFFILK 473

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFM 667
           +    +P     L +A   TVC S+AW   M   SA+WV P QV K AP+G++L  GSF+
Sbjct: 474 DGADSRPA---GLEEAAHATVCFSRAWREAMYGLSAYWVLPEQVKKAAPSGQFLPKGSFV 530

Query: 668 IRGKKNFLPPHPLIMGFGLL 687
           I G++NF+    L +  GL+
Sbjct: 531 IEGRRNFVKIPTLRLAVGLV 550



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 69/142 (48%), Gaps = 15/142 (10%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +++R    D+A      +R  G   SN+Y +SP++ +FKL +        E E ++L++ 
Sbjct: 6   IELRYLVDDIA------KRTGGYYVSNIYGISPESLLFKLHHP-------EKEDIMLML- 51

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVI 121
           S   L T++  R ++  P+    +LRK +   RLE V Q G DRI   +F        + 
Sbjct: 52  STFGLWTSS-VRIEQVGPNRLLARLRKELLRSRLESVEQPGMDRIAYLRFEGPRGTRILA 110

Query: 122 LELYAQGNILLTDSEFTVLTLL 143
            E +  GN++L      +L LL
Sbjct: 111 GEFFGGGNMILCGDGMMILALL 132


>gi|156938202|ref|YP_001435998.1| hypothetical protein Igni_1415 [Ignicoccus hospitalis KIN4/I]
 gi|156567186|gb|ABU82591.1| protein of unknown function DUF814 [Ignicoccus hospitalis KIN4/I]
          Length = 644

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 90/148 (60%), Gaps = 3/148 (2%)

Query: 540 LQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADL 599
           ++E+    I+  R+  W+EK++W I+S   L I G+DA QNE +V+RY+   D+++HA++
Sbjct: 400 VKEEIAKEIAKSRRREWYEKYHWLITSSGLLAIGGKDASQNEAVVRRYLEDDDIFMHAEV 459

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTG 658
            GA + V+K    E  V    L +A   T C+S+AW + +     ++V   QVSK+ P G
Sbjct: 460 QGAPAVVLKTEGKE--VTEKDLREAAFLTACYSKAWKEGRGSVDVFYVKGSQVSKSPPPG 517

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +Y+  G+F+I+GK+ ++   PL +  G+
Sbjct: 518 QYVAKGAFIIKGKREYVRDVPLRLALGV 545



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 64/138 (46%), Gaps = 8/138 (5%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  MN  DV A ++    LIG    NVY      ++ KL      T++       L+ E 
Sbjct: 4   KASMNYLDVVAWIRKNEDLIGSTVQNVYYKDGLMWM-KLKGKGSGTKA-------LIAEP 55

Query: 63  GVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVIL 122
           G R+H T    +       F   LRK +++ +L  ++ +GYDR++   F  G   + +++
Sbjct: 56  GRRIHLTPSPPEAPERLHPFAGGLRKFLKSAKLTSIKTVGYDRVVEMNFSKGGEVYKLMI 115

Query: 123 ELYAQGNILLTDSEFTVL 140
           EL  +G I L D E  +L
Sbjct: 116 ELVPRGVIALLDPENKIL 133


>gi|448317278|ref|ZP_21506835.1| fibronectin-binding A domain-containing protein [Natronococcus
           jeotgali DSM 18795]
 gi|445604315|gb|ELY58265.1| fibronectin-binding A domain-containing protein [Natronococcus
           jeotgali DSM 18795]
          Length = 717

 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/343 (26%), Positives = 151/343 (44%), Gaps = 46/343 (13%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRV 429
           Q+   +E+ A H+  +I   Q+  +   +Q+ +   + AEL+  EY L  VD  +  V+ 
Sbjct: 316 QRPDFEEEIAKHE--RIIEQQQGAIEGFEQQAEAQRENAELLYAEYGL--VDDILSTVQE 371

Query: 430 ALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVE 489
           A A    W+++ +  +E ++ G   A  +  +      +++ L                E
Sbjct: 372 ARAQDRPWDEIEQRFEEGKERGIEAAEAVVGVDGTDGIVTVELDG--------------E 417

Query: 490 KVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT----- 544
           K+++D       NA R Y   K+ E K+E  + A     +      R +   E T     
Sbjct: 418 KIDLDAGQGVEQNADRIYTEAKRIEEKKEGALAAIEDTREDLADAKRRRDEWEATDETAD 477

Query: 545 --------------VANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSK 590
                         +A+I       W+++F WF +S+ YLVI GR+A QNE +VK+Y+  
Sbjct: 478 GDEDDEHEETNWLELASIPIRENEPWYDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEP 537

Query: 591 GDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSA 643
           GD  +H   HG   TV+K   P +       +P  ++ +A  F V +S  W D +     
Sbjct: 538 GDTVLHTQAHGGPVTVLKATDPSEASSSDIELPDSSVEEAAQFAVSYSSVWKDGRYAGDV 597

Query: 644 WWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           + V   QV+KT  +GEYL  G F IRG + +    P+    G+
Sbjct: 598 YAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 640



 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLIEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFVGVEQFEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGNI +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNIAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|385805336|ref|YP_005841734.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Fervidicoccus fontis Kam940]
 gi|383795199|gb|AFH42282.1| putative RNA-binding protein, eukaryotic snRNP-like protein
           [Fervidicoccus fontis Kam940]
          Length = 629

 Score =  118 bits (296), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 90/149 (60%), Gaps = 7/149 (4%)

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYM--SKGDVYVHAD 598
           +E+ V  I+  RK  W+EK+ W  +    L+I+GRDAQQNE IVK+Y+  +K  +Y HA+
Sbjct: 409 KEREVKAIA--RKRDWYEKYIWSFTRNRLLIIAGRDAQQNEAIVKKYLMKNKKSLYFHAE 466

Query: 599 LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPT 657
           +HGA ST++      + +    +         +S+AW + + V   +WV+  QVSKT P 
Sbjct: 467 IHGAPSTILLAEN--EDIKEEDIYDTSVIAASYSKAWKASLKVVDVFWVHSDQVSKTPPA 524

Query: 658 GEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           GEYL  GSFMI G+KN++   PL +G GL
Sbjct: 525 GEYLEKGSFMIYGEKNYVRNVPLKLGIGL 553



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/185 (25%), Positives = 89/185 (48%), Gaps = 19/185 (10%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDL-SPKTYIFKLMNSSGVTESGESEKVLLL 59
           +K  M   D+ A ++ L +  I ++ SN+Y +   K  + KL +              L+
Sbjct: 3   IKESMTVIDLIAFLRELEKEKINLKVSNIYHIPQTKRILIKLKDPYFK---------FLV 53

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
            E+  +++ + Y+      PS F L LRK++  R +  ++Q+G+DRI+  +F    N + 
Sbjct: 54  AEASKKIYFSKYSLPTPEKPSIFALSLRKYLNERVITSIKQIGFDRILKLEFD---NDYA 110

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFE-RTTASKLH 178
           + +EL  +G I+LTD    ++      +  D+ +   S++  P     +FE R TA    
Sbjct: 111 LYIELLPRGEIILTDPTERIIHASSFKKMRDRKIERNSQYILPP----IFEKRPTAEMCI 166

Query: 179 AALTS 183
            AL+S
Sbjct: 167 EALSS 171


>gi|288932692|ref|YP_003436752.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
           10642]
 gi|288894940|gb|ADC66477.1| Fibronectin-binding A domain protein [Ferroglobus placidus DSM
           10642]
          Length = 646

 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 95/348 (27%), Positives = 179/348 (51%), Gaps = 31/348 (8%)

Query: 332 YDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLN---KI 388
           Y ++ P+ L ++   E   FE+F+ A+DEFY++  S   E + K K+     KL    KI
Sbjct: 236 YVDYQPIDLKKYEGYEKKYFESFNKAVDEFYTR--SALKEIEVKEKKSEVIEKLENRLKI 293

Query: 389 HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEER 448
            ++ + R    ++E ++  ++ +LI      V+    A++ A+  +  ++++ +++ E++
Sbjct: 294 QLETKER---YERESEKLRRIGDLIYEKYPIVERIHSALKKAVELK-GFDEVKKILAEQK 349

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
           KAG  +  ++D +  E+   +++LS     +DD +  L ++K       + H NA  +Y+
Sbjct: 350 KAGK-LKEILDIIPKEK---AVVLS-----IDDVKFKLFLDK-------NLHENAEYYYD 393

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
             KK + K    + A  K     E +   +I  +K ++    +R+  W+EK+ W+I+SE 
Sbjct: 394 QAKKLKEKVNGIVKAIEKT--REEIRRAEEIEAKKILSEFRVVRRREWYEKYRWYITSEG 451

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
           +LVI GR+A+ NE IV ++    D++ H    G + T++K           ++ +A  F 
Sbjct: 452 FLVIGGRNAEMNEEIVSKHFESKDLFFHTQTPGGAVTILKRG---LEAGEKSIKEAAEFA 508

Query: 629 VCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
             +S  W   M +   ++V   QV + A  GEYL  GSF I GK+N+L
Sbjct: 509 AIYSALWKHGMHSGEVYYVTYEQVKRAAKPGEYLPKGSFYIVGKRNYL 556



 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 10/136 (7%)

Query: 5   RMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           +M++ D+ A +  L+ + GM+   VY   P  +  KL             +V  L+E+G 
Sbjct: 3   QMSSIDIRAVLNELK-IEGMKVDKVYHYPPNEFRIKLRGRG---------RVDFLVEAGK 52

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R+H T + ++    PS   + LRKH+   R+E V Q  +DRI++ +F  G     ++ EL
Sbjct: 53  RIHATEFPKESPKFPSSIAMLLRKHLENARVERVYQHDFDRIVVIEFSRGDEKKIMVAEL 112

Query: 125 YAQGNILLTDSEFTVL 140
           + +GN+LL D +F V+
Sbjct: 113 FGKGNLLLLDEDFKVI 128


>gi|307595006|ref|YP_003901323.1| hypothetical protein Vdis_0882 [Vulcanisaeta distributa DSM 14429]
 gi|307550207|gb|ADN50272.1| protein of unknown function DUF814 [Vulcanisaeta distributa DSM
           14429]
          Length = 668

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 67/182 (36%), Positives = 107/182 (58%), Gaps = 6/182 (3%)

Query: 508 ELKKKQESKQEKT--ITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           EL++K ++ +E    + A  +  +A  +K    I +E ++  I   R+  WFE+F WFI+
Sbjct: 396 ELERKAKTAEESLSQLRARIEELRAESEKIAESI-REGSIRVIYGARE--WFERFRWFIT 452

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           S   LVI+GRDA QNE+IV+ Y+   D++VHAD+ G ++ VI+       V    + +A 
Sbjct: 453 SGGKLVIAGRDATQNEVIVRHYLRPWDIFVHADIPGGAAVVIRLASSGDNVSDDDIKEAA 512

Query: 626 CFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            + V +S+AW   + V  A++V   QV+K AP+GEYL  GSFMI G + ++    L +G 
Sbjct: 513 QYAVSYSRAWVMGLSVLDAFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWVRNAELGLGI 572

Query: 685 GL 686
           G+
Sbjct: 573 GV 574


>gi|448300325|ref|ZP_21490327.1| fibronectin-binding A domain-containing protein [Natronorubrum
           tibetense GA33]
 gi|445586054|gb|ELY40340.1| fibronectin-binding A domain-containing protein [Natronorubrum
           tibetense GA33]
          Length = 726

 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 166/371 (44%), Gaps = 52/371 (14%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFH----KLNKIHMDQENRVHTLKQEVDRS 406
           ++T+ +ALD+++ ++E +   +     +   F     K  +I   Q+  +   +QE D  
Sbjct: 296 YDTYLSALDDYFFRLELEEEGEPDPTDQRPDFEEEIAKQERIIEQQQGAIEGFEQEADML 355

Query: 407 VKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
            + AE +  EY L  VD  +  ++ A A    W+++    +   + G   A  +    ++
Sbjct: 356 REQAESLYAEYGL--VDDILSTIQEARAQDRPWDEIEERFEAGAEQGIEAAEAV----ID 409

Query: 465 RNCMSLLLSNNLD-EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITA 523
            +    +++ ++D E  D E T  VE+           NA R Y   K  E K+E  ++A
Sbjct: 410 VDGSEGVVTVDVDGEYIDLETTQGVEQ-----------NADRLYTEAKAVEDKKEGALSA 458

Query: 524 HSKAFKAAE--KKTRLQILQEK-------------------TVANISHMRKVHWFEKFNW 562
                K  +  K+ R Q   +                    ++ ++       W+++F W
Sbjct: 459 IENTRKDLQEAKRRRDQWEADDGEDEGDDADEEEREDRDWLSMPSVPVRENEPWYDRFRW 518

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------V 616
           F +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +       +
Sbjct: 519 FYTSDGYLVIGGRNADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIEL 578

Query: 617 PPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           P  ++ +A  F V ++  W D +     + V   QV+KT  +GEYL  G F IRG + + 
Sbjct: 579 PETSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYY 638

Query: 676 PPHPLIMGFGL 686
              P+ +  G+
Sbjct: 639 DDTPVGVAVGI 649



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           ++ L++E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RLELIIEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQFEFDRILEFT 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FEREDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|448348947|ref|ZP_21537792.1| fibronectin-binding A domain-containing protein [Natrialba
           taiwanensis DSM 12281]
 gi|445641664|gb|ELY94739.1| fibronectin-binding A domain-containing protein [Natrialba
           taiwanensis DSM 12281]
          Length = 720

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 101/392 (25%), Positives = 171/392 (43%), Gaps = 39/392 (9%)

Query: 324 ESGSSTQIYDEFCPLLLNQFRSREF--VKFETFDAALDEFYSKIESQRAEQQHKAKEDAA 381
           + GS+ ++ D   P  L +    +     ++TF  ALD+++ ++E    E+     +   
Sbjct: 262 DEGSAARVVD-VTPFPLEEHEQDDLDGEPYDTFLEALDDYFFRLELDDEEEPDPTDQRPD 320

Query: 382 FH----KLNKIHMDQENRVHTLKQEVDRSVKMAELI--EYNLEDVDAAILAVRVALANRM 435
           F     K  +I   Q+  +   +QE +   + AE +  EY L  VD  +  ++ A     
Sbjct: 321 FEEEIAKHERIIEQQQGAIEGFEQEAENLRENAESLYAEYGL--VDEILSTIQEAREQDR 378

Query: 436 SWEDLARMVKEERKAGNPVA----------GL----IDKLYLERNCMSLLLSNNLDEMDD 481
            W+++     E  + G   A          GL    ID  Y+E      +   N D +  
Sbjct: 379 PWDEIEERFAEGAEQGIDAAEAVVDVDGSEGLVTVDIDGEYIELVAHDGV-EQNADRLYT 437

Query: 482 EEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQ 541
           E K +  +K   + AL+A  + R   E  K++  + E T    +      ++      L 
Sbjct: 438 EAKRVAEKK---EGALAAIEDTREDLEEAKRRRDEWEATDGEEADDEATEDEGEDHDWLA 494

Query: 542 EKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +   I       WF++F WF +S+ YLVI GRDA QNE +VK+Y+  GD  +H   HG
Sbjct: 495 DPS---IPIRENEPWFDRFRWFHTSDGYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHG 551

Query: 602 ASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKT 654
              TV+K   P +       +P  ++ +A  F V ++  W D +     + V   QV+KT
Sbjct: 552 GPVTVLKATDPSEASSADIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKT 611

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             +GEYL  G F +RG + +    P+    G+
Sbjct: 612 PESGEYLEKGGFAVRGDRTYYRDTPVGAAVGI 643



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGAYEGAKLDKAYLYGDNLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +         Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGASQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|383621605|ref|ZP_09948011.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
 gi|448702236|ref|ZP_21699890.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
 gi|445777606|gb|EMA28567.1| Fibronectin-binding A domain-containing protein [Halobiforma
           lacisalsi AJ5]
          Length = 718

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 169/376 (44%), Gaps = 62/376 (16%)

Query: 351 FETFDAALDEFYSKIESQRAE------QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +++F  ALD+++ ++E    E      Q+   +E+ A H+  +I   QE  +   +Q+ D
Sbjct: 288 YDSFLTALDDYFFRLELDEEEEPDPTEQRPDFEEEIAKHQ--RIIEQQEGAIEGFEQQAD 345

Query: 405 RSVKMAELI--EYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLY 462
              + AE +  EY L  VD  +  +R A      W+++ +  +E ++ G           
Sbjct: 346 ELREQAESLYAEYGL--VDEVLSTIRQARKQDRPWDEIEQRFEEGKERG----------- 392

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVE----KVEVDLALSAHANARRWYELKKKQESKQE 518
                  +  +  + ++D  E T+ VE    ++++ +      NA R Y   K+ E K+E
Sbjct: 393 -------IEAAETVVDLDGSEGTVTVEVDGERIDLVVDDGVEQNADRLYTEAKRVEEKKE 445

Query: 519 KTITAHSKAFKAAE--KKTRLQILQEK-------------------TVANISHMRKVHWF 557
             + A     +  E  K+ R Q   E                    ++ ++       W+
Sbjct: 446 GALAAIEDTREDLEDAKRRRDQWEAEDAAEDDEDDDDEDEEERNWLSMPSVPIRENEPWY 505

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-- 615
           ++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H   HG   TV+K   P +   
Sbjct: 506 DRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDEVLHTQAHGGPVTVLKATDPSEASS 565

Query: 616 ----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
               +P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG
Sbjct: 566 HDIELPESSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRG 625

Query: 671 KKNFLPPHPLIMGFGL 686
            + +    P+ +  G+
Sbjct: 626 DRTYYRDTPVGVAVGI 641



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLLEVGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|336253827|ref|YP_004596934.1| Fibronectin-binding A domain-containing protein [Halopiger
           xanaduensis SH-6]
 gi|335337816|gb|AEH37055.1| Fibronectin-binding A domain protein [Halopiger xanaduensis SH-6]
          Length = 718

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 166/369 (44%), Gaps = 49/369 (13%)

Query: 351 FETFDAALDEFYSKIESQRAE------QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           +++F  ALD+++ ++E +  E      Q+   +E+ A H+  +I   Q+  +   +QE +
Sbjct: 289 YDSFLTALDDYFFRLELEDEEEPDPTEQRPDFEEEIAKHE--RIIEQQQGAIEGFEQEAE 346

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLE 464
           +  + AEL+      VD  +  +R A      W+++    +E ++ G   A  +  +   
Sbjct: 347 QLREKAELLYARYGLVDDILSTIRNAREQDRPWDEIEERFEEGKERGIEAAEAVVGIDGS 406

Query: 465 RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH 524
              +++ +                E+++++       NA R Y   K+ E K+E  + A 
Sbjct: 407 EGIVTVDIDG--------------ERIDLEARQGVEQNADRLYTEAKRVEEKKEGALAAI 452

Query: 525 SKAFKAAE--KKTRLQILQEK------------------TVANISHMRKVHWFEKFNWFI 564
               +  E  K+ R Q   E                   ++ ++       W+++F WF 
Sbjct: 453 EDTREDLEEAKRRREQWEAEDAGEDDADDEDEGEDKDWLSMPSVPIRENEPWYDRFRWFH 512

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPP 618
           +S++YLVI GR+A QNE IVK+Y+  GD  +H   HG   TV+K   P +       +P 
Sbjct: 513 TSDDYLVIGGRNADQNEEIVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPD 572

Query: 619 LTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPP 677
            ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + +   
Sbjct: 573 SSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYDD 632

Query: 678 HPLIMGFGL 686
            P+ +  G+
Sbjct: 633 TPVGVAVGI 641



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 71/165 (43%), Gaps = 13/165 (7%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMN-SSGVTESGESEKVLLLME 61
           K  + + D+AA V+ L    G +    Y         K+ +   G TE        L+ E
Sbjct: 4   KRELTSVDLAALVEELGAYEGAKVDKAYLYGDDLVRLKMRDFDRGRTE--------LIFE 55

Query: 62  SG--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F      
Sbjct: 56  VGEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGT 115

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
             +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 116 TRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|428671809|gb|EKX72724.1| hypothetical protein BEWA_012830 [Babesia equi]
          Length = 1178

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 89/149 (59%), Gaps = 9/149 (6%)

Query: 1   MVKVRMNTADVAAEVKCLRRL-IGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL 59
           M + R+N  DV   V  L+RL +     N+YD++ + ++ K         S   EKV +L
Sbjct: 1   MARERLNAIDVGVVVANLKRLALNYSLVNIYDITNRIFVLKF--------SKNEEKVYVL 52

Query: 60  MESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHY 119
           +E G R+HTT + R   + PS F +KLRKH+R+R+L +V Q+  DR+I F F     AH+
Sbjct: 53  IEIGCRIHTTQFLRSSDSLPSNFNVKLRKHLRSRKLRNVAQMSQDRVIDFTFSSEEYAHH 112

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRD 148
           +I++L+  GNI LTD+ + VLT+L   +D
Sbjct: 113 LIVQLFLPGNIYLTDANYKVLTVLSGEKD 141


>gi|332796292|ref|YP_004457792.1| hypothetical protein Ahos_0606 [Acidianus hospitalis W1]
 gi|332694027|gb|AEE93494.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 566

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 125/221 (56%), Gaps = 16/221 (7%)

Query: 479 MDDEEKTLPVE--KVEVDLALSAHANARRWYELKKK--QESKQEK-TITAHSKAFKAAEK 533
           + ++EK + +E  ++E+D  LS   NA  +++  K+  Q+SK+ K T+    +     E 
Sbjct: 281 IKNKEKKIKLEGKEIEIDPKLSVAKNASLYFDKAKEYVQKSKKAKETLEELKRKLNEIEI 340

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDV 593
           + + +    K       +RK  W+EK+ W  ++  +LVI+G+DA QNE +V++ +   D+
Sbjct: 341 EIKKEEEGRKL-----SIRKKEWYEKYRWSFTTNGFLVIAGKDADQNESLVRKLLEDNDI 395

Query: 594 YVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVS 652
           ++HAD+ GA++T+IKN +    +    +  A      +S+AW   +     +WVY  QVS
Sbjct: 396 FLHADIQGAAATIIKNPK---NITEQDIYDAAAIAASYSKAWKLGLAAVDVFWVYGSQVS 452

Query: 653 KTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDES 693
           K+ P GEYL  GSFMI GKKN++    L +  G  F++++S
Sbjct: 453 KSPPAGEYLPKGSFMIYGKKNYIKSVKLNLAIG--FKINDS 491


>gi|116754828|ref|YP_843946.1| hypothetical protein Mthe_1534 [Methanosaeta thermophila PT]
 gi|116666279|gb|ABK15306.1| protein of unknown function DUF814 [Methanosaeta thermophila PT]
          Length = 641

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 166/379 (43%), Gaps = 49/379 (12%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
           +  P  L  ++  E   FE F  ALDEF+         +    K  A   +L      Q 
Sbjct: 246 DVIPFPLEVYKGLEARSFERFSDALDEFF-------VAEPEMPKLSALERRLEL----QR 294

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNP 453
             +  L+ +  +   M + I     ++D+ + A+  A    +S+ D+   ++   K+   
Sbjct: 295 AAIDELRAKETQLASMGDFIYQRYSEIDSILKAIAGARERGLSYTDIWERIQSSGKSAVK 354

Query: 454 VAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQ 513
                 ++ +E + ++L                     E++  L+   NA R+YE  K+ 
Sbjct: 355 SLDYSGEMIVEIDGVTL---------------------ELNAGLTVPQNAGRYYERAKEA 393

Query: 514 ESKQEKTITAHSKA---FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYL 570
             K      A  +     +  E++ R  +L+ +         K  WFE+F WF SS+++L
Sbjct: 394 AKKAAGAEEALRRTEDLLQRGEERRRSPVLKRR--------HKPRWFERFRWFYSSDDFL 445

Query: 571 VISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVC 630
           VI GRDA  NE I  +Y+ K D+ +H D  GA  TVIK    E  VP  T+ +A  F V 
Sbjct: 446 VIGGRDADGNEEIYLKYLEKRDLALHTDYPGAPLTVIKTEGRE--VPERTVEEAAQFAVS 503

Query: 631 HSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFR 689
           +S  W   + +   + V   QV+KT   GE+L  G+F++RG++ +L   PL +   +   
Sbjct: 504 YSNLWREGVASGDCYVVRGDQVTKTPEHGEFLRKGAFVVRGERRYLRDVPLGVALAI--- 560

Query: 690 LDESSLGSHLNERRVRGEE 708
            D S +G  ++  R +  E
Sbjct: 561 ADGSLIGGPVSAVRSKSSE 579



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 65/146 (44%), Gaps = 11/146 (7%)

Query: 6   MNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           M+  DVAA V  L+ R+ G      Y  S       +    G        ++ +++E+G 
Sbjct: 5   MSNVDVAAIVAELQTRIAGGFFGKAYQSSGDAIWLTIQAREG--------RLDIILEAGR 56

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H T   R    TP  F   LR  +   R+  V Q  +DR++          + +++EL
Sbjct: 57  RAHVTRKERVVGRTPPQFPAMLRSRLSGGRIVSVEQHDFDRVMEICVERSDGRYRLVVEL 116

Query: 125 YAQGNILLTDSEFTVLTLLR--SHRD 148
           + +GN+LL D E  ++  LR  S RD
Sbjct: 117 FPKGNMLLLDDEMRIILPLRPMSFRD 142


>gi|374724028|gb|EHR76108.1| putative RNA-binding protein [uncultured marine group II
           euryarchaeote]
          Length = 723

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/391 (24%), Positives = 177/391 (45%), Gaps = 54/391 (13%)

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAA---------FHK 384
           E  P +L         KF T   A+D +    ++    ++   K D A           +
Sbjct: 272 EATPTILPSHAGMAQAKFATLCEAVDAWKGAHDAGALARREAEKLDIAAPGRGHSTDVER 331

Query: 385 LNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMV 444
           L +  + QE  +    +++++   +   I+ N   V++ ++ V  A+  +  W+++  M 
Sbjct: 332 LERRKVQQEKALSGFSKKIEKQQMIGHTIQNNWTHVESLLIQVTEAIEAK-GWKEVKSMA 390

Query: 445 KEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANAR 504
           K        +  ++     ER+ +S+L   N  E    + TL +++       S H NA+
Sbjct: 391 KS-------IPWIVSLNPAERSFLSVLPDEN-GEPKGPQATLSIDE-------SVHQNAQ 435

Query: 505 RWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKT--VANISHMRKVHWFEKFNW 562
           R++   +KQ+ K +  + A        ++  + +  Q+ T  +  I   +++ WFE   W
Sbjct: 436 RFFTAARKQKDKTKGAVDALEDTLLQLQRAQKKEAKQQATGKLNKIKRSKRL-WFEHHRW 494

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST--------VIKNHRPEQ 614
            + +  +L++ G+DA+ N+ IVK+++S  D Y+HADLHGA S         V+  H+P  
Sbjct: 495 SMITGGHLLVGGKDAKGNDSIVKKHLSGQDRYLHADLHGAPSCSLRATQGFVVDQHKPAH 554

Query: 615 ---PVPPL--------------TLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAP 656
               VP                 L +A    +C S+AW       + + V P QVSKTA 
Sbjct: 555 IPADVPAFRIVDKLGDERITEEKLLEAATMALCWSRAWAGGGAHGTVYSVKPAQVSKTAQ 614

Query: 657 TGEYLTVGSFMIRGKKNFLPPHPLIMGFGLL 687
           TGE++  GSF++RG++ +     + +G G++
Sbjct: 615 TGEFVGKGSFIVRGQRQWFKDLDVQIGIGIV 645



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 56/92 (60%)

Query: 52  ESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
           E ++  L++  G R++T+   R    TP  F + LRKH++  R+  VRQLG+DR++ F F
Sbjct: 44  EQDQFDLVLVRGSRIYTSQRDRPMPMTPPPFAMVLRKHLKNARMTGVRQLGFDRVLGFDF 103

Query: 112 GLGMNAHYVILELYAQGNILLTDSEFTVLTLL 143
                ++++ +E++  GNI+LTD E  ++  L
Sbjct: 104 DTKHGSYHLYVEVFRDGNIILTDQEGVIIQPL 135


>gi|435848081|ref|YP_007310331.1| putative RNA-binding protein, snRNP like protein [Natronococcus
           occultus SP4]
 gi|433674349|gb|AGB38541.1| putative RNA-binding protein, snRNP like protein [Natronococcus
           occultus SP4]
          Length = 712

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 154/335 (45%), Gaps = 31/335 (9%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           Q+   +E+ A H+  +I   Q+  +   +Q+     + AEL+    E VD  +  ++ A 
Sbjct: 312 QRPDFEEEIAKHE--RIIEQQQGAIEGFEQQAQAQRENAELLYARYELVDDILSTIQEAR 369

Query: 432 ANRMSWEDLARMVKEERKAG----NPVAGL-----IDKLYLERNCMSLL----LSNNLDE 478
                W+++    +E ++ G      V G+     I  + L+   + L+    +  N D 
Sbjct: 370 TQDRPWDEIEERFEEGKERGIEAAEAVVGVDGTEGIVTVELDGEEIDLVADDGVEQNADR 429

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           +  E K +  +K   + AL+A  + R   E  K++  + E T        +  E+K  L+
Sbjct: 430 LYTEAKRIEEKK---EGALAAIEDTREDLEDAKRRRDEWEATDDHEDDDDEEDEEKNWLE 486

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
           +      A++       W+++F WF +S+ YLVI GR A QNE +VK+Y+  GD  +H  
Sbjct: 487 M------ASVPIRENEPWYDRFRWFHTSDGYLVIGGRSADQNEELVKKYLEPGDTVLHTQ 540

Query: 599 LHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQV 651
            HG   TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QV
Sbjct: 541 AHGGPVTVLKATDPSEASSSDIELPDSSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQV 600

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +KT  +GEYL  G F IRG + +    P+    G+
Sbjct: 601 TKTPESGEYLEKGGFAIRGDRTYYRDTPVGAAVGI 635



 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLIEVGEIKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQFEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGNI +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FERDDGTTRIIVELFGQGNIAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|448353444|ref|ZP_21542220.1| fibronectin-binding A domain-containing protein [Natrialba
           hulunbeirensis JCM 10989]
 gi|445640304|gb|ELY93393.1| fibronectin-binding A domain-containing protein [Natrialba
           hulunbeirensis JCM 10989]
          Length = 736

 Score =  113 bits (283), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 164/364 (45%), Gaps = 34/364 (9%)

Query: 351 FETFDAALDEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           ++TF  ALD+++       + E     Q+   +E+ A H+  +I   Q+  +   +QE +
Sbjct: 302 YDTFLDALDDYFFHLELEDEEEPDPTSQRPDFEEEIAKHE--RIIEQQQGAIEGFEQEAE 359

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA--------- 455
              + AEL+  N   VD  +  ++ A A    WE +    +E  + G   A         
Sbjct: 360 NLRENAELLYANYGLVDDILSTIQEARAQDRPWEAIEARFEEGAEQGIEAAEAVIDVDGS 419

Query: 456 -GLI----DKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELK 510
            G++    D  Y+E      +   N D +  E K +  +K   + AL+A  + R   E  
Sbjct: 420 EGIVTVDVDGEYIELVAHDGV-EQNADRLYTEAKRVAEKK---EGALAAIEDTREDLEDA 475

Query: 511 KKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVH-WFEKFNWFISSENY 569
           K++  + E++           ++       ++    +   +R+   WF++F WF +S+ Y
Sbjct: 476 KRRRDEWEESDGESGAGSGGGDEDEGEDEDRDWLAESSIPIRENEPWFDRFRWFHTSDGY 535

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 536 LVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPESSIEE 595

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           A  F V +S  W D +     + V   QV+KT  +GEYL  G F +RG + +    P+  
Sbjct: 596 AAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRGDRTYYRDTPVGA 655

Query: 683 GFGL 686
             G+
Sbjct: 656 AVGI 659



 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGTYEGAKVDKAYRYGDDLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|448359396|ref|ZP_21548054.1| fibronectin-binding A domain-containing protein [Natrialba
           chahannaoensis JCM 10990]
 gi|445643534|gb|ELY96581.1| fibronectin-binding A domain-containing protein [Natrialba
           chahannaoensis JCM 10990]
          Length = 727

 Score =  113 bits (283), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 158/376 (42%), Gaps = 61/376 (16%)

Query: 351 FETFDAALDEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           ++TF  ALD+++       + E     Q+    E+ A H+  +I   Q+  +   +QE +
Sbjct: 296 YDTFLNALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHE--RIIEQQQGAIEGFEQEAE 353

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA-GLIDKLYL 463
              + AEL+  N   VD  +  ++ A A    W+D+    +E  + G   A  +ID    
Sbjct: 354 NLRENAELLYANYGLVDDILSTIQEARAQDRPWDDIEARFEEGAEQGIEAAEAVID---- 409

Query: 464 ERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAH----ANARRWYELKKKQESKQEK 519
                          +D  E  + V+     + L AH     NA R Y   K+   K+E 
Sbjct: 410 ---------------VDGSEGIVTVDVNGEYIELVAHDGVEQNADRLYTEAKRVAEKKEG 454

Query: 520 TITAHSKAFKAAEKKTRLQILQEK----------------------TVANISHMRKVHWF 557
            + A     +  E   R +   E+                        ++I       WF
Sbjct: 455 ALVAIEDTREDLEDAKRRRDEWEEQDGEPGAGEEDEDDEDDDRDWLAESSIPIRENEPWF 514

Query: 558 EKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP-- 615
           ++F WF +S+ YLVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +   
Sbjct: 515 DRFRWFHTSDGYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASS 574

Query: 616 ----VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRG 670
               +P  ++ +A  F V +S  W D +     + V   QV+KT  +GEYL  G F +RG
Sbjct: 575 SDIELPESSIEEAAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAVRG 634

Query: 671 KKNFLPPHPLIMGFGL 686
            + +    P+    G+
Sbjct: 635 DRTYYRDTPVGAAVGI 650



 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGTYEGAKVDKAYRYGDDLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|433638964|ref|YP_007284724.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
           XH-70]
 gi|433290768|gb|AGB16591.1| putative RNA-binding protein, snRNP like protein [Halovivax ruber
           XH-70]
          Length = 847

 Score =  112 bits (281), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 98/385 (25%), Positives = 164/385 (42%), Gaps = 50/385 (12%)

Query: 337 PLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKE----DAAFHKLNKIHMDQ 392
           PL  +Q    E   F++F  ALDE++ ++E    E    A +    +A   K  +I   Q
Sbjct: 401 PLEEHQQAGLEPEAFDSFTEALDEYFYQLELAEEEPADSASQRPDFEAEIAKQQRIIEQQ 460

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGN 452
           E  +   ++E +   + AEL+  N   VD  +  VR A      W+++     EER A  
Sbjct: 461 EGAIEEFEREAEAERERAELLYANYGFVDEILTTVRDARTEGTPWDEI-----EERFAAG 515

Query: 453 PVAGL-IDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY---- 507
              G+   +  ++ +  +  ++  LD+          E++ +D       NA R Y    
Sbjct: 516 AEQGIDAAEAVVDVDGANGRVTIELDD----------ERIPLDADDGVEKNADRLYTEAK 565

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV-------------------ANI 548
            + +K+E  Q+       +     E+K   +   E                      ++I
Sbjct: 566 RIAEKKEGAQQAIENTREELADVRERKAAWEADDEGGDDIGGDDSDEDEPDIDWLARSSI 625

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  WF++F W  +S+ +LVI GR+A QNE +V +Y+  GD   H   HG   TV+K
Sbjct: 626 PIRENEPWFDRFRWVQTSDGFLVIGGRNADQNEELVSKYLEPGDRVFHTQAHGGPVTVLK 685

Query: 609 ------NHRPEQPVPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYL 661
                 + RP+   P  ++ QA  F V ++  W D +     + V   QV+KT  +GEYL
Sbjct: 686 ATDPSESSRPDMEFPETSIEQAAQFAVSYASVWKDGRYAGDVYSVDADQVTKTPESGEYL 745

Query: 662 TVGSFMIRGKKNFLPPHPLIMGFGL 686
             G F IRG + +    P+ +  G+
Sbjct: 746 EKGGFAIRGDRTYHRDTPVGVAVGI 770



 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 73/164 (44%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  +++ D+AA V  L  L G +    Y         K+ +        +  +V L +E 
Sbjct: 114 KRELSSVDLAAVVGELSDLEGAKVDKAYLYGDDLVRLKMRDF-------DRGRVELFIEV 166

Query: 63  G--VRLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R+HT A  R  D    P  F   LR  +       V Q  +DRI+ F F       
Sbjct: 167 GETKRVHTVAQERVPDAPGRPPHFAKMLRNRLSGADFAGVSQYEFDRILEFVFEREDANT 226

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            VI+EL+ +GN+ +TD E+ V+  L + R   + VA  +R+ +P
Sbjct: 227 RVIVELFGEGNVAVTDGEYEVVDSLETIRLKSRTVAPGARYEFP 270


>gi|289580546|ref|YP_003479012.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
 gi|448284209|ref|ZP_21475471.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
 gi|289530099|gb|ADD04450.1| Fibronectin-binding A domain protein [Natrialba magadii ATCC 43099]
 gi|445571291|gb|ELY25845.1| fibronectin-binding A domain-containing protein [Natrialba magadii
           ATCC 43099]
          Length = 727

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 165/364 (45%), Gaps = 37/364 (10%)

Query: 351 FETFDAALDEFY------SKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVD 404
           ++TF  ALD+++       + E     Q+    E+ A H+  +I   Q+  +   +QE +
Sbjct: 296 YDTFLDALDDYFFHLELEDEEEPDPTSQRPDFGEEIAKHE--RIIEQQQGAIEGFEQEAE 353

Query: 405 RSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVA--------- 455
              + AEL+  N   VD  +  ++ A A    W+++    ++  + G   A         
Sbjct: 354 NLRENAELLYANYGLVDDILSTIQEARAQDRPWDEIEARFEDGAEQGIEAAEAVIDVDGS 413

Query: 456 -GLI----DKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR-WYEL 509
            G++    D  Y+E      +   N D +  E K +  +K   + AL+A  + R    + 
Sbjct: 414 EGIVTVDVDGEYIELVAHDGV-EQNADRLYTEAKRVAEKK---EGALAAIEDTREDLKDA 469

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
           K++++  +E+     +      ++      L E   ++I       WF++F WF +S+ Y
Sbjct: 470 KRRRDEWEEQDGKPGAGDEDEDDEDDDRDWLAE---SSIPIRENEPWFDRFRWFHTSDGY 526

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP------VPPLTLNQ 623
           LVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P +       +P  ++ +
Sbjct: 527 LVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEASSSDIELPESSIEE 586

Query: 624 AGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIM 682
           A  F V +S  W D +     + V   QV+KT  +GEYL  G F IRG + +    P+  
Sbjct: 587 AAQFAVSYSSVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAIRGDRTYYRDTPVGA 646

Query: 683 GFGL 686
             G+
Sbjct: 647 AVGI 650



 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         K+ +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGTYEGAKVDKAYRYGDDLVRLKMRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVAQERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFTFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RLIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|167043365|gb|ABZ08068.1| putative domain of unknown function (DUF814) [uncultured marine
           crenarchaeote HF4000_ANIW141O9]
          Length = 632

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 108/202 (53%), Gaps = 10/202 (4%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           EK+++DL  S    A   +   KKQ++     I +  K     E +    I + ++  ++
Sbjct: 361 EKIKIDLNSSLPTTASTLFNESKKQKA----AIGSIEKLLIKTENELEKVIEKGESAKSV 416

Query: 549 S--HMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           S   +RK +WFE++ WF +++  L + GRD+  N  I+++++ K D   HA++ G+   +
Sbjct: 417 SFTQVRKKNWFERYRWFYTTDGVLAVGGRDSSSNSAIIRKHLDKNDKVFHAEISGSPFFL 476

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGS 665
           +K++    P    +L +    TVC S+ W      +SA+WV P QV K AP+G+ +  GS
Sbjct: 477 LKDNATSTPA---SLTEVAHATVCFSKVWKEAFYGSSAYWVNPDQVKKGAPSGQSMAKGS 533

Query: 666 FMIRGKKNFLPPHPLIMGFGLL 687
           FMI G++NF+    L M   ++
Sbjct: 534 FMIEGQRNFVKISTLKMCVAII 555



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 72/147 (48%), Gaps = 28/147 (19%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKN 77
           +R+ G   SN+Y ++    +FK  +        E   +LL++ + G+ +      + + N
Sbjct: 17  KRIDGYYLSNIYGITKDGLLFKFHHP-------EKPDILLMLSTFGIWITNVKIEQIEPN 69

Query: 78  TPSGFTLKLRKHIRTR----RLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLT 133
                  KL KH+R+     +L++V+Q+G +RI+            +++EL++ GNI++ 
Sbjct: 70  -------KLLKHLRSNILRFKLKEVKQIGTERIVYLTLSYFEKEFVIVVELFSDGNIIIC 122

Query: 134 DSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           ++E  +L L  SH       +I  RHR
Sbjct: 123 NNEMKILAL--SH-------SINVRHR 140


>gi|408405775|ref|YP_006863758.1| hypothetical protein Ngar_c31850 [Candidatus Nitrososphaera
           gargensis Ga9.2]
 gi|408366371|gb|AFU60101.1| hypothetical protein with domain of unknown function DUF814 and
           fibronectin-binding A protein [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 661

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 85/136 (62%), Gaps = 5/136 (3%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP--- 612
           W+E++ WFI+++  L I GRDA  N  ++++++++ D+  HA++HG+   ++KN      
Sbjct: 436 WYERYRWFITTDGLLAIGGRDASSNSALIRKHLTEDDIVFHAEVHGSPFFIVKNAAAPAK 495

Query: 613 EQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWVYPHQVSKTAPTGEYLTVGSFMIRGK 671
           E  + P +L Q    TV  S+AW   + ++ A+WV P QV K APTG++L  GSF+I GK
Sbjct: 496 EGRIDP-SLLQVAKATVSFSRAWKDGLSSADAYWVMPEQVKKGAPTGQFLPKGSFVIEGK 554

Query: 672 KNFLPPHPLIMGFGLL 687
           +N+L    + +  G++
Sbjct: 555 RNYLKGVEIRLAIGIV 570


>gi|170291097|ref|YP_001737913.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
           cryptofilum OPF8]
 gi|170175177|gb|ACB08230.1| RNA-binding protein, snRNP-like protein [Candidatus Korarchaeum
           cryptofilum OPF8]
          Length = 624

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 150/337 (44%), Gaps = 48/337 (14%)

Query: 347 EFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRS 406
           E V++  F        S +E  RA +   +     ++K   I  + E R+ +L++E++R 
Sbjct: 238 EIVEYSAFP------LSHLEYDRARRDLLSDAIEDYYKSKGISFEDE-RISSLRREIERQ 290

Query: 407 VKMAELIEYN---LEDVDAAILA----VRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           + + E  E     L  +   IL+    V  AL    S E+ A + + + K+G  +     
Sbjct: 291 ISLKEEYERTYAQLRRIGDTILSNIHEVEEALGRARSGEEHALVKRVDWKSGKVII---- 346

Query: 460 KLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
                                    +L  E++++D+  SA  NA  +Y+  KK   K  +
Sbjct: 347 -------------------------SLEGEEIQLDIRRSASENASEYYDKAKKAREKALR 381

Query: 520 TITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
              A S   +   K+    + + K   +    ++  W+EKF WF +S   LVI GRDAQ 
Sbjct: 382 IDKALSNIMERL-KQIESSLEERKLELSPKPRKRERWYEKFRWFYTSSGNLVICGRDAQT 440

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           N  IV +YM   D++ H D+ G +  V+K    E+ V   ++ QA       S+AW   +
Sbjct: 441 NSEIVSKYMDDKDLFFHVDMPGGAVVVLKV---EREVDQRSIEQAAVAAASFSRAWKEGL 497

Query: 640 -VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
                ++V   QVSK AP G YL  GSF I GK+N+L
Sbjct: 498 SYADVYYVKGEQVSKHAPPGMYLPKGSFYITGKRNYL 534



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 72/141 (51%), Gaps = 10/141 (7%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVR 65
           M   +++  +  LRRL G     +Y++   +  F L+    V   G  E +++ +   + 
Sbjct: 6   MTGIEISHTINELRRLEGGFIKKIYNIDGNS--FSLLFHPEV--DGRRE-IVIDLRGFIF 60

Query: 66  LHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELY 125
           L    +A  K  TPS F + LRKH+   R+E + QLG +RII F+F  GM    +I+EL+
Sbjct: 61  LTKLKWA--KPQTPSSFVMTLRKHLENARIESISQLGLERIISFEFPRGMR---LIVELF 115

Query: 126 AQGNILLTDSEFTVLTLLRSH 146
             GN++L   +  V +  R+ 
Sbjct: 116 GGGNLILLSGDEIVASQRRAE 136


>gi|448321837|ref|ZP_21511312.1| fibronectin-binding A domain-containing protein [Natronococcus
           amylolyticus DSM 10524]
 gi|445602889|gb|ELY56860.1| fibronectin-binding A domain-containing protein [Natronococcus
           amylolyticus DSM 10524]
          Length = 717

 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 155/335 (46%), Gaps = 30/335 (8%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           Q+   +E+ A H+  +I   QE  +   +Q+     + AEL+      VD  +  V+ A 
Sbjct: 316 QRPDFEEEIAKHE--RIIEQQEGAIEGFEQQAQSQRENAELLYAEYGVVDDILSTVQEAR 373

Query: 432 ANRMSWEDLARMVKEERKAG----NPVAGL-----IDKLYLERNCMSLL----LSNNLDE 478
           A    W+++    +E ++ G      V G+     I  + L+   + LL    +  N D 
Sbjct: 374 AQDRPWDEIEERFEEGKERGIEAAEAVVGVDGTEGIVTVELDGEEIDLLARQGVEQNADR 433

Query: 479 MDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQ 538
           +  E K +  +K   + AL+A  + R   +L+  +  + E   T  +      E +    
Sbjct: 434 LYTEAKRIAEKK---EGALAAIEDTRE--DLEDAKRRRDEWEATDETDDDDEDEAQEETN 488

Query: 539 ILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHAD 598
            L+   +A++       W+++F WF +S+ YLVI GR+A QNE +VK+Y+  GD  +H  
Sbjct: 489 WLE---LASVPIRENEPWYDRFRWFHTSDGYLVIGGRNADQNEELVKKYLEPGDTVLHTQ 545

Query: 599 LHGASSTVIKNHRPEQP------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQV 651
            HG   TV+K   P +       +P  ++ +A  F V +S  W D +     + V   QV
Sbjct: 546 AHGGPVTVLKATDPSEASSSDIELPDSSIEEAAQFAVTYSSVWKDGRYAGDVYAVDSDQV 605

Query: 652 SKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
           +KT  +GEYL  G F IRG + +    P+ +  G+
Sbjct: 606 TKTPESGEYLEKGGFAIRGDRTYHRDTPVGVAVGI 640



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 55/112 (49%), Gaps = 4/112 (3%)

Query: 55  KVLLLMESGV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           +V LL+E G   R HT A  R  D    P  F + LR  +       V Q  +DRI+ F 
Sbjct: 49  RVELLIEVGEIKRAHTVAPERVPDAPGRPPQFAMMLRNRLSGADFVGVEQFEFDRILEFV 108

Query: 111 FGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F        +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 109 FDRDDGTTRIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|424812620|ref|ZP_18237860.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
 gi|339756842|gb|EGQ40425.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
          Length = 628

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 98/348 (28%), Positives = 144/348 (41%), Gaps = 43/348 (12%)

Query: 336 CPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
            P  L  +   E  +FETF  ALDE + +   Q+ E +   K       + +    QE +
Sbjct: 231 APFPLQTYSEHEEERFETFSRALDELFHRRRQQKLESKRMDKYRERREGIERQLHQQEQK 290

Query: 396 VHTLKQEVDRSVKMAELIEYNLE-------DVDAAILAVRVALANRMSWEDLARMVKEER 448
              L+Q   +  + AE I  N +        VD+ I       A ++   DL  +  +ER
Sbjct: 291 AEGLEQAARQRRQAAETIYENYQVFHDLKQKVDSVIHEEGWESAEQLEVSDLESVNHQER 350

Query: 449 KAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYE 508
                + G                         E K  P E +E        A A R Y+
Sbjct: 351 FYRVAIDGA------------------------EVKLSPDESLE--------AAASRMYD 378

Query: 509 LKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSEN 568
             K++E K E T  A        E+    +   E+        R   WFEK+ WF + E 
Sbjct: 379 EAKEREQKAENTREALQNTRGKLEELEEDEFEVEEDSMERDESRSKRWFEKYRWFHTPEG 438

Query: 569 YLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFT 628
            LVI GR  Q NE +VK ++   D+Y+HAD  GA S  +K+    Q      + QA    
Sbjct: 439 RLVICGRGPQTNESLVKNHLEGDDLYLHADFDGAPSVALKDG---QDASEEEIRQAAKAA 495

Query: 629 VCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           V  S+AW S +     ++V P QV+K   +GEYL  G+F+IRG + +L
Sbjct: 496 VTFSKAWKSGIGADDVYYVEPSQVTKNPESGEYLEKGAFVIRGDRTYL 543



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 7/94 (7%)

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           + Y RD    P GF ++LRKH+    ++ +RQ G+DRI+  + G   +  +V  EL+ +G
Sbjct: 55  SEYKRDNPERPPGFCMELRKHLGG--VDRIRQRGFDRILEIRSG---DVRFVA-ELFGKG 108

Query: 129 NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           N  L     T+   LR     D+   +     YP
Sbjct: 109 NAALVKDGKTI-GALRQEEWSDRRTVVGEEFGYP 141


>gi|325969240|ref|YP_004245432.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
 gi|323708443|gb|ADY01930.1| hypothetical protein VMUT_1728 [Vulcanisaeta moutnovskia 768-28]
          Length = 668

 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 111/190 (58%), Gaps = 8/190 (4%)

Query: 508 ELKKKQESKQE--KTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFIS 565
           EL++K +S +E    + A  +  +A  +K  ++ ++E ++  I   R+  WFE+F WFI+
Sbjct: 396 ELERKAKSAEEVMSQLRARIEELRAEGEKV-IESIREGSIHVIYGARE--WFERFRWFIT 452

Query: 566 SENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAG 625
           S   LVI+GRDA QNE+IV+ Y+   D++VHAD+ GA+  VI+   P        + +A 
Sbjct: 453 SGGKLVIAGRDAAQNEVIVRHYLRPWDIFVHADIPGAAVVVIRLSNPSDNASNSDIYEAA 512

Query: 626 CFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGF 684
            +   +S+AW   + V   ++V   QV+K AP+GEYL  GSFMI G + ++    L +G 
Sbjct: 513 QYAAAYSRAWVMGLSVLDVFYVRGEQVTKKAPSGEYLGKGSFMIYGTRGWIRNVELRLGI 572

Query: 685 GLLFRLDESS 694
           GL  R+D  S
Sbjct: 573 GL--RIDNLS 580



 Score = 40.8 bits (94), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 61/127 (48%), Gaps = 8/127 (6%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLED 97
           ++ + NS  +    ESEK  ++  S  R   T+Y  +  +   G T  LR+ I   RL  
Sbjct: 31  VYTMSNSLLLRFRKESEKYFVIANSH-RFGLTSYVLE--HGAEGVT-PLRRLIEGMRLRS 86

Query: 98  VRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMS 157
           +  L +DRI+   F  G    Y+++EL    N +   ++  +  +LR++R  D+ + I  
Sbjct: 87  IELLNFDRIVKLVFSDG----YLVIELLEPWNAIYMSNDNVIRWVLRAYRSRDRVINIGL 142

Query: 158 RHRYPTE 164
            ++ P +
Sbjct: 143 EYKPPPQ 149


>gi|76156132|gb|AAX27365.2| SJCHGC07862 protein [Schistosoma japonicum]
          Length = 241

 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/238 (31%), Positives = 123/238 (51%), Gaps = 7/238 (2%)

Query: 237 QPTLKTVLGEALGYGPALSEHII-LDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWL 295
           +P +   L   L YG  + EH + +    V   K     +LE +   ++ L V  F   L
Sbjct: 8   KPYVNKTLSLELPYGNVVIEHCMRIAQKEVKQAKTINDFQLESSETYLMKLYVKHFAVAL 67

Query: 296 QDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFD 355
           +D++ G    +    ++    GK H  TE G   Q Y+EF P +  Q+R +  + F++F+
Sbjct: 68  RDILLGPYSIDHQSSLKGYIFGKPHQSTEKG--LQSYEEFHPFMFEQYREKPHLAFDSFN 125

Query: 356 AALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEY 415
            A+D F+SKIESQ+   Q    E  A  K+  I  DQE R+  LK E +  ++ A LIE 
Sbjct: 126 RAVDAFFSKIESQKTLGQISRNEQKANRKVENIKKDQERRIMLLKTEQELDMQKAYLIEA 185

Query: 416 NLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLS 473
           N + VD  I+ +  AL+N++ W++L  +++E ++  +P++  I    +E NC  + LS
Sbjct: 186 NRQLVDNIIILINHALSNQIDWKELELIIEEAKQRNDPLSCHI----VELNCKRVRLS 239


>gi|307354208|ref|YP_003895259.1| Fibronectin-binding A domain-containing protein [Methanoplanus
           petrolearius DSM 11571]
 gi|307157441|gb|ADN36821.1| Fibronectin-binding A domain protein [Methanoplanus petrolearius
           DSM 11571]
          Length = 636

 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 162/350 (46%), Gaps = 46/350 (13%)

Query: 351 FETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNK---IHMDQENRVHTLKQEVDRSV 407
           F +F+ AL  ++   +S        A +DA   KL K   I   Q+  +   ++++    
Sbjct: 252 FSSFNDALSAYFPLPQS--------AAKDAKKEKLPKSEIIRRRQQEAIVNFEKKIAELQ 303

Query: 408 KMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
           +  + I  N +D+   I  +R A ++++SW+++   +K    +  P A  I ++Y   + 
Sbjct: 304 EKVDAIYENYQDISGIIDTLRDA-SSKLSWQEIEETLK---NSSLPAAKSIVRIYPSESA 359

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKA 527
           + ++                 +KV++ +  +  ANA R+Y   KK + K+   + A  K 
Sbjct: 360 VDVMAGG--------------KKVKIFINENPEANANRYYGEIKKYKKKKAGALVAMEK- 404

Query: 528 FKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRY 587
           F   EK       Q K   +    +K  W+ K+ WF++S+  LVI G+DA  NE I K+Y
Sbjct: 405 FMPKEK-------QAKKRQDYKPQKK-KWYHKYRWFVTSDGVLVIGGQDAGSNEDIGKKY 456

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTS-AWWV 646
           +   D +VHAD+HG S  V+K              +   F   +S AW +       +  
Sbjct: 457 LEGRDYFVHADVHGGSVVVVKGETE-------NWEEVAEFAASYSNAWKAGHFNCDVYAA 509

Query: 647 YPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLG 696
            P QVSKTA +GE++  G+F+IRG++ +     L +  GL    + + +G
Sbjct: 510 KPEQVSKTAESGEFVKRGAFIIRGERRYFRNIGLKVAIGLQLEPELAVIG 559



 Score = 73.2 bits (178), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 42/158 (26%), Positives = 78/158 (49%), Gaps = 9/158 (5%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESE-KVLLLMESGV 64
           M++ D+   +  +R  + +    +Y  +  ++ F+L        +GE + K   L+E G 
Sbjct: 7   MSSIDIRTMLYEIRERLPLWIGKIYQYNTNSFGFRL--------NGEDKSKYNFLVECGR 58

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
           R H T    D    PSG+++ LRK+I   R+ D++Q G  RI + + G     + +I EL
Sbjct: 59  RAHLTDNLPDAPQNPSGYSMFLRKYISGGRVLDIKQYGLQRIFIIKIGKTEKEYNLIFEL 118

Query: 125 YAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           + +GN +L D  F V+  L+     D+ +   + + +P
Sbjct: 119 FNEGNAVLCDENFIVINPLKRLHFRDREIVSGTEYIFP 156


>gi|448368844|ref|ZP_21555611.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
           DSM 13077]
 gi|445651387|gb|ELZ04295.1| fibronectin-binding A domain-containing protein [Natrialba aegyptia
           DSM 13077]
          Length = 722

 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 75/138 (54%), Gaps = 7/138 (5%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WF++F WF +S+ YLVI GRDA QNE +VK+Y+  GD  +H   HG   TV+K   P + 
Sbjct: 508 WFDRFRWFHTSDGYLVIGGRDADQNEELVKKYLEPGDKVLHTQAHGGPVTVLKATDPSEA 567

Query: 616 ------VPPLTLNQAGCFTVCHSQAW-DSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI 668
                 +P  ++ +A  F V ++  W D +     + V   QV+KT  +GEYL  G F +
Sbjct: 568 SSSDIELPESSIEEAAQFAVSYASVWKDGRYAGDVYAVDSDQVTKTPESGEYLEKGGFAV 627

Query: 669 RGKKNFLPPHPLIMGFGL 686
           RG + +    P+    G+
Sbjct: 628 RGDRTYYRDTPVGAAVGI 645



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 3   KVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES 62
           K  + + D+AA V+      G +    Y         KL +        +  ++ LL+E 
Sbjct: 4   KRELTSVDLAALVREFGAYEGAKLDKAYLYGDDLVRLKLRDF-------DRGRIELLLEV 56

Query: 63  GV--RLHTTAYAR--DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           G   R HT    R  D    P  F + LR  +       V Q  +DRI+ F F       
Sbjct: 57  GEVKRAHTVTPERVPDAPGRPPQFAMMLRNRLSGADFAGVEQYEFDRILEFVFERDDGTT 116

Query: 119 YVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
            +I+EL+ QGN+ +TD E+ V+  L + R   + V   SR+ +P
Sbjct: 117 RIIVELFGQGNVAVTDGEYEVIDCLETVRLKSRTVVPGSRYEFP 160


>gi|56753953|gb|AAW25169.1| SJCHGC08981 protein [Schistosoma japonicum]
          Length = 414

 Score =  107 bits (268), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 46/78 (58%), Positives = 57/78 (73%)

Query: 627 FTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             V  S AW S ++T AWWV+  QVSKTAP+GEYLT GSF+IRGKKN+LPP P   GFG+
Sbjct: 1   MAVVLSSAWQSHVLTRAWWVHHDQVSKTAPSGEYLTSGSFIIRGKKNYLPPCPFDYGFGI 60

Query: 687 LFRLDESSLGSHLNERRV 704
           +F+L E S+  H  ERR+
Sbjct: 61  MFKLHEDSVFKHKGERRI 78


>gi|352682802|ref|YP_004893326.1| putative RNA-binding protein [Thermoproteus tenax Kra 1]
 gi|350275601|emb|CCC82248.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Thermoproteus tenax Kra 1]
          Length = 624

 Score =  106 bits (264), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 113/459 (24%), Positives = 200/459 (43%), Gaps = 81/459 (17%)

Query: 233 ARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFE 292
           A A+   L+  L   LG GP ++E +                +   NA +    A+A  E
Sbjct: 157 ALAEGKDLRRALSRELGLGPEVAEEV--------------YQRSSGNADR----ALAVLE 198

Query: 293 DWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFE 352
           + +++V  G + P  Y+L              +G    +     P+      +    +F+
Sbjct: 199 ELIREVTLGQLRPTLYVL--------------NGVPVTV----TPIRFISINADATEEFD 240

Query: 353 TFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLK---QEVDRSVKM 409
           TF  ALD+++ +IE ++A ++  A   +   KL +     E  +   +   +E+ R  + 
Sbjct: 241 TFWKALDKYFIEIELRKAVEKKTANITSRRQKLEQTIKSLEVEIEEYRRKGEELRRIAQT 300

Query: 410 AELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMS 469
              I+Y LED     L  R+  A  +  E + R++  +RK    V        LE + + 
Sbjct: 301 MMNIKYELED-----LMGRLNTATDVENESI-RIIDVDRKRREAV--------LETSGIK 346

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
            ++   LD        LPV K    +   A        E  +K E  +E      ++  +
Sbjct: 347 FVV--KLD--------LPVGKQISSMFEKAK-------EYLRKAEKAEETLRRLRAELER 389

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
             E++  L+   ++ V  ++      WFE++ W  +S    V+ GRDA QNE++VK+Y+ 
Sbjct: 390 LEEQRAELERSIKEGVVRVAER---SWFERYRWTATSRKTPVLGGRDASQNEILVKKYLR 446

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVP-PLTLNQAGCFTVCHSQAWDSKM-VTSAWWVY 647
              ++ HAD+ GAS  + +      P+   L L +   F   +S+AW + +     ++V+
Sbjct: 447 DNYLFFHADIPGASVVITR------PIEDQLELLEVAQFAASYSKAWKAGIHSIDVFYVF 500

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGL 686
             QVSK  P+GEYL  GSFMI G +N++    L +  G+
Sbjct: 501 GSQVSKQPPSGEYLARGSFMIYGTRNYIRHVRLELAIGV 539



 Score = 43.9 bits (102), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 51/111 (45%), Gaps = 16/111 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME 61
           +K  +   D+ A  + +R LIG R  N+Y  +P  Y+FK    S           L++ E
Sbjct: 1   MKTSLTIVDLYASAREMRNLIGRRVENIYK-TPSGYLFKFAGGS----------YLIIDE 49

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
           +   L      RD +   +     LR  +R  +L+DV    +D+I++ +FG
Sbjct: 50  TRASLTGVLGERDYRGAET-----LRGLLRDEKLDDVTVPRFDKILVLKFG 95


>gi|18313944|ref|NP_560611.1| hypothetical protein PAE3259 [Pyrobaculum aerophilum str. IM2]
 gi|18161516|gb|AAL64793.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
          Length = 614

 Score =  105 bits (263), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 140/598 (23%), Positives = 236/598 (39%), Gaps = 154/598 (25%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLRS 145
           LR   R  RL +V    +DRI    FG G     +I+EL    N++    +  V+ LL S
Sbjct: 68  LRGLFRDDRLAEVVMPRFDRIAELVFGSGK----IIVELLEPFNMVAV-RDGKVVWLLHS 122

Query: 146 HRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNA 205
           +R  D+ ++  + + YP                A      + D +E  K  + G+     
Sbjct: 123 YRGKDRVISPGAMYAYPP---------------AVFVDVLKADVDELQKAIDPGD----- 162

Query: 206 SKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLV 265
                                             L+  L   LG GP L++ +I+  G  
Sbjct: 163 ----------------------------------LRRSLIRRLGTGPELADELIVRAGTS 188

Query: 266 PNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTES 325
           P                                    I  E   L++   LGK  P    
Sbjct: 189 PRA----------------------------------IAEEFKALVEKVRLGKIEPTVCV 214

Query: 326 GSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKL 385
                I     P+     +  E+ +F  F  ALD +++ +E + A  Q   +      +L
Sbjct: 215 KDGVPI--TVMPIKPLSLKCDEYKQFNAFWEALDFYFAPMELESAAIQTTQELAQRRKRL 272

Query: 386 NKIHMDQENRVHTLKQEVDRSVKMA-ELIEYNLE------DVDAAILAVRVALANRMSWE 438
                + EN++   ++E  +   +A +L+ Y LE       ++ +I  V V  A R+  E
Sbjct: 273 EASIRELENKIPEYREEAAKLKTLAHKLLMYKLEIEEALKGMETSIRVVNVD-ATRIKIE 331

Query: 439 DLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALS 498
            L    + E + G  +   I +L+ E        +  L+E   +   + +EK++ DL+  
Sbjct: 332 -LPEGEQVELRKGVSIGKQISQLFDE--------AKELEEKAQKAAQV-LEKLKKDLS-- 379

Query: 499 AHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFE 558
                    +L ++Q   +EK              K+ ++I  +K+           WFE
Sbjct: 380 ---------KLDEEQRRAEEKL-------------KSSVKIATKKS-----------WFE 406

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           KF+W +++    VI GRDA QNE++VK+Y+ +  ++ HAD+ GAS+ V     P +   P
Sbjct: 407 KFHWTVTTGRKPVIGGRDASQNEVVVKKYLKEHYLFFHADIPGASAVVAP---PSE--DP 461

Query: 619 LTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           L L Q   F   +S+AW   +     ++V   QV+K  P+G+YL  GSFMI GK+ ++
Sbjct: 462 LELLQIAQFAAAYSKAWKIGIHAVDVYYVKGVQVTKQPPSGQYLARGSFMIYGKREYV 519


>gi|359415829|ref|ZP_09208221.1| hypothetical protein HRED_04719, partial [Candidatus Haloredivivus
           sp. G17]
 gi|358033813|gb|EHK02326.1| hypothetical protein HRED_04719 [Candidatus Haloredivivus sp. G17]
          Length = 194

 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/150 (39%), Positives = 82/150 (54%), Gaps = 3/150 (2%)

Query: 486 LPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTV 545
           L  + +++DL     A A ++Y+  K+ ESK E    A  K     E      I  E+ +
Sbjct: 47  LEEDSIKIDLHQDLEATASQYYDKAKESESKMENAEKALEKTEDEIESLGEEDIELEEVM 106

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
            + S  R   WFEK+ WF SS+ YLV  GRDAQ NEM+VK++    D+Y+HAD  GA ST
Sbjct: 107 EDKSEKRSKKWFEKYRWFYSSDGYLVCLGRDAQTNEMLVKKHTDSEDLYLHADFDGAPST 166

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAW 635
           VIK+    Q  P  TL +A   +V  ++AW
Sbjct: 167 VIKDG---QEAPESTLEEAAKASVSFTKAW 193


>gi|387219995|gb|AFJ69706.1| hypothetical protein NGATSA_2054800, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 94

 Score =  103 bits (257), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 46/84 (54%), Positives = 67/84 (79%)

Query: 526 KAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVK 585
           KA KAAE++    + +++    +S +RK +WFEKF+WFI+S+N+LV+SGRDAQQNE++VK
Sbjct: 3   KAVKAAERQAAASLSKQQRKRTLSVVRKPYWFEKFHWFITSDNHLVVSGRDAQQNELLVK 62

Query: 586 RYMSKGDVYVHADLHGASSTVIKN 609
           RY+  GD YVHADL GA+S V+++
Sbjct: 63  RYLRVGDAYVHADLPGAASCVVRH 86


>gi|154304166|ref|XP_001552488.1| hypothetical protein BC1G_08353 [Botryotinia fuckeliana B05.10]
          Length = 288

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/144 (42%), Positives = 80/144 (55%), Gaps = 11/144 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R SNVYDLS K ++ K              K  +L+
Sbjct: 1   MKQRFSSIDVKVIAHELSNALVTLRVSNVYDLSSKIFLIKFAKPDN--------KQQILI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           +SG R H T ++R     PS F  +LRK ++TRR+  V Q+G DRII FQF  G    Y 
Sbjct: 53  DSGFRCHLTDFSRATAAAPSVFVQRLRKFLKTRRVTQVSQVGTDRIIEFQFSDGQYRLY- 111

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
            LE YA GNI+LTD E  +LTLLR
Sbjct: 112 -LEFYAGGNIILTDKELNILTLLR 134


>gi|408392777|gb|EKJ72097.1| hypothetical protein FPSE_07722 [Fusarium pseudograminearum CS3096]
          Length = 1078

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 56/145 (38%), Positives = 83/145 (57%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L+ RL+ +R SNVYDLS K  + K              K  L++
Sbjct: 1   MKQRFSSLDVKIIAHELQERLVTLRLSNVYDLSSKILLLKFAKPDN--------KKQLVI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ++G R H T +AR     PS F  +LRK ++TRRL  VRQ+G DR++ F+F  G   + +
Sbjct: 53  DTGFRCHLTKFARTTAAAPSIFVARLRKFLKTRRLTAVRQVGTDRVLEFEFSDGQ--YRM 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE +A GNI+LTD++  +L L R+
Sbjct: 111 FLEFFASGNIILTDADLNILALART 135


>gi|327311796|ref|YP_004338693.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
 gi|326948275|gb|AEA13381.1| hypothetical protein TUZN_1922 [Thermoproteus uzoniensis 768-20]
          Length = 623

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 150/337 (44%), Gaps = 58/337 (17%)

Query: 350 KFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKM 409
           +++ F  ALD +++ +E ++A +   A+  A   KL +        +   ++  +    +
Sbjct: 238 EYDAFWKALDRYFADVELRKAVELKTAELKAKKAKLEQSIAKLRGEIQEYRKRSEELYSL 297

Query: 410 AEL---IEYNLEDVDAAIL-------AVRVALANRMSWEDLARMVKEERKAGNPVAGLID 459
           A+    ++Y LE+   AIL       ++R+   NR S E +              +GL  
Sbjct: 298 AKTMLSLKYELEEAMQAILRNEEIGASIRILDVNRTSKEAVLEH-----------SGLRF 346

Query: 460 KLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEK 519
           KL L+R          ++E+ +E K       + + AL      R   EL + +  + E 
Sbjct: 347 KLRLDRPV-----GRQIEEVFEEAKDYARRAAKAEEALK-----RLEEELARVESERAEA 396

Query: 520 TITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQ 579
                 +  KAAE+                      WFEKF WF++      I GRDA Q
Sbjct: 397 ERAVAERVRKAAERA---------------------WFEKFRWFLALGRVPAIGGRDASQ 435

Query: 580 NEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM 639
           NE  V+RY+    ++ HAD+ GAS+ V K  + E  +  L L Q   F   +S+AW + +
Sbjct: 436 NEAAVRRYLKDDYLFFHADVPGASAVVAKPTQDEAAL--LELAQ---FAASYSRAWRAGI 490

Query: 640 -VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
                ++V   QVSK  P+GEYL  GSFMI G KN++
Sbjct: 491 HAVDVFYVPGRQVSKQPPSGEYLARGSFMIYGSKNYI 527


>gi|119872023|ref|YP_930030.1| hypothetical protein Pisl_0509 [Pyrobaculum islandicum DSM 4184]
 gi|119673431|gb|ABL87687.1| protein of unknown function DUF814 [Pyrobaculum islandicum DSM
           4184]
          Length = 613

 Score =  102 bits (255), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 80/136 (58%), Gaps = 6/136 (4%)

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           +EK  +++  + K  WFEKF W I++    +I GRDA QNE IV++Y+ +  ++ HAD+ 
Sbjct: 388 EEKVKSSVKIVVKRAWFEKFRWSITTGKRPIIGGRDASQNETIVRKYLREHYLFFHADIP 447

Query: 601 GASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGE 659
           GAS  V+    P +   PL L Q   F   +S+AW   +     ++V   QVSK AP G+
Sbjct: 448 GASVVVMP---PSE--DPLELLQTAQFAAAYSKAWKIGIHSIDVYYVRGEQVSKHAPAGQ 502

Query: 660 YLTVGSFMIRGKKNFL 675
           YL  GSFMI GK+ ++
Sbjct: 503 YLARGSFMIYGKREYI 518


>gi|269865204|ref|XP_002651842.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063777|gb|EED42214.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 323

 Score =  102 bits (254), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 79/304 (25%), Positives = 135/304 (44%), Gaps = 46/304 (15%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 59  MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 109

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 110 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 169

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 170 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 207

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSENYLVISGRDAQQNEMIV 584
                K  ++ +Q K      H+    R  +WFEKF++FIS  N ++I G++AQQN+ IV
Sbjct: 208 -----KIAMRDIQAKLKPRKEHIKIQDRVSYWFEKFHFFISENNCVIIGGKNAQQNDQIV 262

Query: 585 KRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            +YM   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   +
Sbjct: 263 NKYMEDRDLYFHCDVKGASSVVCKGS------ADRNIEDATYFALVYSKAWDEQVIKDVF 316

Query: 645 WVYP 648
           +V P
Sbjct: 317 YVSP 320


>gi|424812621|ref|ZP_18237861.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
 gi|339756843|gb|EGQ40426.1| putative RNA-binding protein, eukaryotic snRNP family [Candidatus
           Nanosalinarum sp. J07AB56]
          Length = 361

 Score =  102 bits (254), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 68/207 (32%), Positives = 104/207 (50%), Gaps = 8/207 (3%)

Query: 474 NNLDEMDDEEK--TLPVEKVEVDLAL--SAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           NNL+ ++ +E+   + ++  EV L+   S  A A R Y+  K++E K E    A      
Sbjct: 77  NNLESVNHQERFYRVAIDGAEVKLSPDESLEAAASRMYDEAKEREQKAENAREALQNTQG 136

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
             E+    +   E+        R   WFEK+ WF + E  LVI GR  Q NE +V  ++ 
Sbjct: 137 KLEELEEDEFEVEEESMERDESRSKRWFEKYRWFHTPEGRLVICGRGPQTNESLVNNHLE 196

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYP 648
           + D+Y+HAD  GA S  +K+    Q      + QA    V  S+AW S +     ++V P
Sbjct: 197 RDDLYLHADFDGAPSVALKDG---QNASKDEIRQAAKAAVTFSKAWKSGIGADDVYYVGP 253

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFL 675
            QV+K+  +GEYL  G+F IRG + +L
Sbjct: 254 AQVTKSPESGEYLERGAFAIRGDRTYL 280


>gi|440491782|gb|ELQ74392.1| putative RNA-binding protein [Trachipleistophora hominis]
          Length = 886

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 118/240 (49%), Gaps = 40/240 (16%)

Query: 497 LSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRK--V 554
           LS   N   +Y   K ++ K+EK I  + ++  A        I+++K V     ++K  +
Sbjct: 576 LSIDKNMNYYYNQMKNKKIKREK-IRNNLESILA-------NIVEKKAVVKPQEIKKRVL 627

Query: 555 HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR--- 611
            WFEKFN+ I+S  +LV+ G++A QNE++ KR   K  ++ HAD+ G S+  I   R   
Sbjct: 628 FWFEKFNFTITSNGFLVLGGKNASQNEVLNKR---KFLLFFHADIKGGSAVTIDGTRINI 684

Query: 612 ----------PEQPVPPLT-------------LNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
                      E  +  +              +  A    + +S  W  ++V+ +++V  
Sbjct: 685 LGRCAKHESSSETSIKRIVASSDNAYGLKEEDITDASQMCMVYSNCWKDRIVSDSYYVNE 744

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
            QVSK+AP+GE+L+ G FM++GKKN++    L     LLF L E +L   +    V G++
Sbjct: 745 DQVSKSAPSGEFLSKGGFMVKGKKNYVHNVRLEYAIALLFAL-EKNLEQQIENMHVGGDK 803



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 65/112 (58%), Gaps = 11/112 (9%)

Query: 29  VYDLSPK---TYIFKLMNSSGVTESGESEKVLLLMESGVRLH-TTAYARDKKNTPSGFTL 84
           V +L PK   TYI  + +S   T    + K + L+E+G+R+H T  Y  D+    S F  
Sbjct: 14  VNELHPKIESTYIQNIYSSGQRTFYLRTNKNIFLIEAGLRIHLTNTYPSDE---ISFFAK 70

Query: 85  KLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSE 136
           +LR ++R +++  VRQ+G+DR ++ Q G       V++E+++ GN+++ + E
Sbjct: 71  RLRTYLRRKKVGGVRQVGFDRAVVVQIG----EFLVVIEMFSAGNLIVLEKE 118


>gi|85091915|ref|XP_959135.1| hypothetical protein NCU09191 [Neurospora crassa OR74A]
 gi|28920536|gb|EAA29899.1| conserved hypothetical protein [Neurospora crassa OR74A]
 gi|29150083|emb|CAD79644.1| conserved hypothetical protein [Neurospora crassa]
          Length = 1097

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/145 (38%), Positives = 80/145 (55%), Gaps = 11/145 (7%)

Query: 2   VKVRMNTADVAAEVKCLRR-LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R ++ DV      L   L+ +R +N+YDL+ K  + K        +        LL+
Sbjct: 1   MKQRFSSLDVRVVAHELSEALVSLRLANIYDLNSKILLLKFAKPDTRQQ--------LLI 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG R H T + R     PS F  +LRK+++TRR   V Q+G DRII FQF  G  A  +
Sbjct: 53  ESGFRCHLTDFVRTASPAPSQFVARLRKYLKTRRCTSVSQIGTDRIIEFQFSDG--AFRL 110

Query: 121 ILELYAQGNILLTDSEFTVLTLLRS 145
            LE +A GNI+LTD++  +L LLR+
Sbjct: 111 YLEFFASGNIILTDADLKILALLRN 135


>gi|387220185|gb|AFJ69801.1| hypothetical protein NGATSA_2069500, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 75

 Score =  100 bits (249), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 44/60 (73%), Positives = 48/60 (80%)

Query: 639 MVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSH 698
           MVTSAWWV   QVSKTAP GE+L  GSFM+RGKKNFL P PL MG GLLF+LDE S+G H
Sbjct: 1   MVTSAWWVGAGQVSKTAPAGEFLPTGSFMVRGKKNFLAPQPLEMGLGLLFKLDEGSVGRH 60


>gi|429964304|gb|ELA46302.1| hypothetical protein VCUG_02190 [Vavraia culicis 'floridensis']
          Length = 943

 Score = 99.4 bits (246), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 65/224 (29%), Positives = 107/224 (47%), Gaps = 38/224 (16%)

Query: 497 LSAHANARRWYELKKKQESKQEKTIT-AHSKAFKAAEKKTRLQILQEKTVANISHMRKVH 555
           LS   N   +Y   K +++K+EK      S     +EKK  ++  + K        R++ 
Sbjct: 587 LSIDKNVNYYYNQMKSKKTKREKIRNNLESILANISEKKATVKQREYKK-------RELF 639

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHR---- 611
           WFEKFN+ ++   +LV+ G++A QNE + KR   K  ++ HAD+ G S   +   +    
Sbjct: 640 WFEKFNFTVTQNGFLVLGGKNATQNETLNKR---KFKLFFHADVKGGSVVTVDGTKLNIL 696

Query: 612 ---------------------PEQ--PVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYP 648
                                PE    +    +  A    + +S  W  ++V  +++V  
Sbjct: 697 RRNTGYAESSSVTSIKRLQTNPENVYGLKEEDITDASQMCMVNSNCWKDRIVCDSYYVNE 756

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDE 692
            QVSK+AP+GE+LT G FM++GKKN++    L    GLLF L++
Sbjct: 757 EQVSKSAPSGEFLTKGGFMVKGKKNYVHNVRLEYAVGLLFALEK 800



 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 75/137 (54%), Gaps = 12/137 (8%)

Query: 29  VYDLSPK---TYIFKLMNSSGVTESGESEKVLLLMESGVRLHTT-AYARDKKNTPSGFTL 84
           V +L PK   TYI  + +S   T    + K + L+E+G+R+H T  Y     N  S F  
Sbjct: 14  VNELHPKIESTYIQNIYSSGQRTFYVRTNKNIFLIEAGLRIHLTDTYP---SNEISFFCK 70

Query: 85  KLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR 144
           +LR  +R +++  V+Q+G+DR+++ Q G       V++E++A GN+++ + E +V +   
Sbjct: 71  RLRTCLRRKKIGGVKQVGFDRVVVVQAG----EFLVVVEMFAAGNLIVLEKE-SVASERN 125

Query: 145 SHRDDDKGVAIMSRHRY 161
           S  +D+K    + R  Y
Sbjct: 126 SGEEDEKDRNGLERTEY 142


>gi|351707265|gb|EHB10184.1| Serologically defined colon cancer antigen 1 [Heterocephalus
           glaber]
          Length = 208

 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 77/125 (61%), Gaps = 9/125 (7%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R +T D+ A +  L   L+GMR +NVYD+  KTY+ +L             K  LL+
Sbjct: 1   MKTRFSTIDLRAVLAELNASLLGMRVNNVYDVDNKTYLIRLQKPDF--------KATLLL 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           ESG+R+HTT +   K   PS F +K RKH+++RRL   +QLG DRI+ FQFG    A+++
Sbjct: 53  ESGIRIHTTEFEWPKNMMPSSFAMKCRKHLKSRRLVSAKQLGVDRIVDFQFGSDEAAYHL 112

Query: 121 ILELY 125
           I+ELY
Sbjct: 113 IIELY 117



 Score = 42.7 bits (99), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 4/74 (5%)

Query: 234 RAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFED 293
           R  +  LKT+    + YGPAL EH +++ G   N+K+ E  KLE   I+ ++  + K ED
Sbjct: 119 RCYRKILKTISSAFVAYGPALLEHCLIENGFSGNVKVDE--KLESKDIEKVLDCMQKAED 176

Query: 294 WLQDV--ISGDIVP 305
           +++      G + P
Sbjct: 177 YMKTTSNFHGKVTP 190


>gi|379003409|ref|YP_005259081.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
 gi|375158862|gb|AFA38474.1| putative RNA-binding protein [Pyrobaculum oguniense TE7]
          Length = 614

 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 91/170 (53%), Gaps = 11/170 (6%)

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           EL++K   K E+ +    K   A E++ R    +E   A+   + K  WFEKF+W +++ 
Sbjct: 359 ELEEKAR-KAEQVLEKLRKELSALEEQQRRA--EEALKASAKVVAKRSWFEKFHWTVTTG 415

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-PPLTLNQAGC 626
              VI GRDA QNE +V+RY+     + HAD+ GAS+          P+  PL + Q   
Sbjct: 416 RRPVIGGRDASQNEAVVRRYLKDHYFFFHADIPGASAVA------APPMDDPLEILQVAQ 469

Query: 627 FTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           F   +S+AW   +     ++V   QVSK  P+G+YL  GSFM+ GK+ ++
Sbjct: 470 FAAAYSRAWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYV 519


>gi|145591891|ref|YP_001153893.1| hypothetical protein Pars_1690 [Pyrobaculum arsenaticum DSM 13514]
 gi|145283659|gb|ABP51241.1| protein of unknown function DUF814 [Pyrobaculum arsenaticum DSM
           13514]
          Length = 614

 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 11/170 (6%)

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           EL++K   K E+ +    K   A E++ R    +E   A+   + K  WFEKF+W +++ 
Sbjct: 359 ELEEKAR-KAEQVLEKLRKELSALEEQQRRA--EEALKASAKVVAKRSWFEKFHWTVTTG 415

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPV-PPLTLNQAGC 626
              VI GRDA QNE +V++Y+     + HAD+ GAS+          P+  PL + Q   
Sbjct: 416 RRPVIGGRDASQNEAVVRKYLKDHYFFFHADIPGASAVA------APPMDDPLEILQVAQ 469

Query: 627 FTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFL 675
           F   +S+AW   +     ++V   QVSK  P+G+YL  GSFM+ GK+ ++
Sbjct: 470 FAAAYSRAWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKGSFMVYGKREYV 519


>gi|374326819|ref|YP_005085019.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
 gi|356642088|gb|AET32767.1| hypothetical protein P186_1339 [Pyrobaculum sp. 1860]
          Length = 621

 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 49/131 (37%), Positives = 73/131 (55%), Gaps = 6/131 (4%)

Query: 546 ANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASST 605
           A+   + K  WFEKF+W +++    VI GRDA QNE +V++Y+    ++ HAD+ GAS+ 
Sbjct: 401 ASARAVAKKSWFEKFHWTVTTGKRPVIGGRDASQNESVVRKYLKDHYLFFHADIPGASAV 460

Query: 606 VIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVG 664
                       PL L Q   F   +S+AW   +     ++V   QVSK  P+G+YL  G
Sbjct: 461 AAPPME-----DPLELLQVAQFAAAYSKAWKIGIHAVDVYYVRGEQVSKQPPSGQYLAKG 515

Query: 665 SFMIRGKKNFL 675
           SFMI GK+ ++
Sbjct: 516 SFMIYGKREYV 526


>gi|78395025|gb|AAI07765.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 458

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 42/62 (67%), Positives = 50/62 (80%)

Query: 651 VSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEG 710
           VSKTAPTGEYLT GSFMIRGKKNFLPP  L+MGF  LF++DES +  H  ER+VR ++E 
Sbjct: 2   VSKTAPTGEYLTTGSFMIRGKKNFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDED 61

Query: 711 MD 712
           M+
Sbjct: 62  ME 63


>gi|126460385|ref|YP_001056663.1| hypothetical protein Pcal_1780 [Pyrobaculum calidifontis JCM 11548]
 gi|126250106|gb|ABO09197.1| protein of unknown function DUF814 [Pyrobaculum calidifontis JCM
           11548]
          Length = 616

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 78/137 (56%), Gaps = 8/137 (5%)

Query: 541 QEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLH 600
           +EK  +++  + +  WFEK++W +++    V+ GRDA QNE IV++Y+    ++ HAD+ 
Sbjct: 391 EEKVKSSVKAVVEREWFEKYHWTVTTGKRPVLGGRDASQNESIVRKYLKDHYLFFHADIP 450

Query: 601 GASSTVIKNHRPEQPVP-PLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTG 658
           GAS  +        P+  PL ++Q   F   +S+AW   +     ++    QVSK  P G
Sbjct: 451 GASVVI------APPIEDPLEVHQVAQFAAAYSRAWKIGIHAIDVYYARGEQVSKQPPAG 504

Query: 659 EYLTVGSFMIRGKKNFL 675
           +YL  GSFM+ GK+ ++
Sbjct: 505 QYLARGSFMVYGKREYV 521


>gi|41615287|ref|NP_963785.1| hypothetical protein NEQ506 [Nanoarchaeum equitans Kin4-M]
 gi|40069011|gb|AAR39346.1| NEQ506 [Nanoarchaeum equitans Kin4-M]
          Length = 255

 Score = 93.2 bits (230), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 73/131 (55%), Gaps = 11/131 (8%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP--- 612
           WF K+ +  +   +LVI G+DA QNE I+K Y   GD+  HAD+HGA   ++  + P   
Sbjct: 54  WFMKYRFTFTESGFLVIGGKDANQNERIMKVYRKDGDLVFHADIHGAPFALMLLNNPNAD 113

Query: 613 -------EQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVG 664
                  +  +    L QA   +  +S+AW   + +   ++V   Q+SK AP+GEYL  G
Sbjct: 114 SVEEVIEKYKITETDLMQAAGLSAVYSKAWQEGLASIDVFYVLGKQISKKAPSGEYLKHG 173

Query: 665 SFMIRGKKNFL 675
           SFM+ GKK+++
Sbjct: 174 SFMVYGKKHYI 184


>gi|440301762|gb|ELP94148.1| serologically defined colon cancer antigen 1, putative, partial
           [Entamoeba invadens IP1]
          Length = 144

 Score = 92.0 bits (227), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 47/121 (38%), Positives = 75/121 (61%), Gaps = 8/121 (6%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP 79
           RL+ M  + VYD++ + Y+ KL        S    K  +++ESGVR+H T Y RDK +TP
Sbjct: 27  RLLDMNVNTVYDINRRLYVIKL--------SKTDLKEFIVIESGVRVHLTQYNRDKSDTP 78

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTV 139
           + FT +LRK++  +RL  V Q+G DR+I    G     + +I++LY+ GNI LTD+++ +
Sbjct: 79  NNFTSRLRKYLNKKRLLRVNQIGNDRVIEIVIGNATEKYNLIIDLYSNGNICLTDADYKI 138

Query: 140 L 140
           +
Sbjct: 139 V 139


>gi|171186042|ref|YP_001794961.1| hypothetical protein Tneu_1592 [Pyrobaculum neutrophilum V24Sta]
 gi|170935254|gb|ACB40515.1| protein of unknown function DUF814 [Pyrobaculum neutrophilum
           V24Sta]
          Length = 613

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 73/121 (60%), Gaps = 6/121 (4%)

Query: 556 WFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQP 615
           WFEKF+W I++    VI GRDA QNE +V++Y+    ++ HAD+ GAS+  +    P + 
Sbjct: 403 WFEKFHWTITTGRRPVIGGRDASQNETVVRKYLKDSYLFFHADIPGASAVAMP---PAE- 458

Query: 616 VPPLTLNQAGCFTVCHSQAWDSKM-VTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
             PL L QA  F   +S+AW   +     ++V   QV+K AP G+YL  GSFMI GK+ +
Sbjct: 459 -DPLELLQAAQFAAAYSKAWKIGIHAVDVYYVRGEQVTKQAPAGQYLARGSFMIYGKREY 517

Query: 675 L 675
           +
Sbjct: 518 V 518


>gi|290559894|gb|EFD93216.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
           acidophilus ARMAN-5]
          Length = 587

 Score = 90.5 bits (223), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 83/161 (51%), Gaps = 11/161 (6%)

Query: 542 EKTVANISHMRKV------HWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +K   N+  +R++       W+ KF +F +S N L I G+D  QNE +++++  KGD+  
Sbjct: 367 DKIKTNVIKVRRLKVITGNEWYSKFRFFSTSLNKLCIIGKDVNQNESLIQKHAEKGDIVG 426

Query: 596 HADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKT 654
           HAD+ G+   VIK    E     + L +       +S AW +       ++V P QV+KT
Sbjct: 427 HADVFGSPFGVIKTGNAE--TKEVELEEMATMIASYSSAWRAGATNLDVYFVNPEQVTKT 484

Query: 655 APTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSL 695
            P+GE L  G+F I GK+ ++    L  G  L F + E S+
Sbjct: 485 PPSGESLKKGAFYIEGKRKYIKNSSL--GIYLSFDIREDSV 523


>gi|269986196|gb|EEZ92508.1| protein of unknown function DUF814 [Candidatus Parvarchaeum
           acidiphilum ARMAN-4]
          Length = 587

 Score = 90.1 bits (222), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 96/188 (51%), Gaps = 19/188 (10%)

Query: 489 EKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           +++ +D+  + + N    Y+  K+ ++   + ITA +K  +      R+++  E      
Sbjct: 336 QQLNIDITQNLNYNLALMYQKAKRLKNIDTEAITAKTKMIR------RIKVKNEN----- 384

Query: 549 SHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIK 608
                  W+ KF  FI+SE  LVI G+D  QNE +++++M K D+  HAD+ G+   +IK
Sbjct: 385 ------QWYSKFRHFITSEGNLVIIGKDVNQNESLIEKHMEKEDIVGHADVFGSPFGIIK 438

Query: 609 NHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSFM 667
             +  + +    + +       +S AW         +++ P QV+KT P+GE L  G+F 
Sbjct: 439 -PKEGKSISKKEIEETAIMIASYSSAWRVGATNLDVYFIKPEQVTKTPPSGESLKKGAFY 497

Query: 668 IRGKKNFL 675
           I GK++++
Sbjct: 498 IEGKRDYI 505


>gi|42733496|dbj|BAD11345.1| BRI1-KD interacting protein 117 [Oryza sativa Japonica Group]
          Length = 360

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/139 (51%), Positives = 88/139 (63%), Gaps = 7/139 (5%)

Query: 791 VTPQLEDLIDRALGLGSASISSTKHGIETTQFDLSEE-DKHVERTATVRDKPYISKAERR 849
           V+ QLEDL+D+ LGLG   +      + +    ++++ D    +  +VRDKPYISKA+RR
Sbjct: 17  VSSQLEDLLDKNLGLGPTKVLGRSSLLSSNSASVADDIDDLDTKKTSVRDKPYISKADRR 76

Query: 850 KLKKGQ--GSSVVD-PKVEREKERGKDASSQPESIVRKTKIEGGKISRGQKGKLKKMKEK 906
           KLKKGQ  G S  D P  E  K   K  +SQ E      K    K+SRGQKGKLKK+KEK
Sbjct: 77  KLKKGQNVGDSTSDSPNGEAAK---KPVNSQQEKGKTIEKPANPKVSRGQKGKLKKIKEK 133

Query: 907 YGDQDEEERNIRMALLAVS 925
           YG+QDEEER IRMALLA S
Sbjct: 134 YGEQDEEEREIRMALLASS 152


>gi|269862884|ref|XP_002651013.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065270|gb|EED43045.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 191

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/104 (41%), Positives = 63/104 (60%), Gaps = 6/104 (5%)

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           M   D+Y H D+ GASS V K            +  A  F + +S+AWD +++   ++V 
Sbjct: 1   MEDRDLYFHCDVIGASSVVCKGSADR------IIEDATYFALVYSKAWDEQVIKDVFYVS 54

Query: 648 PHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
             QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 55  SDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 98


>gi|255514115|gb|EET90378.1| protein of unknown function DUF814 [Candidatus Micrarchaeum
           acidiphilum ARMAN-2]
          Length = 260

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 97/194 (50%), Gaps = 12/194 (6%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFKAAEKKTRLQILQEKTVAN 547
           V +D   SA  NA  +Y+  KK   K E   K +T   +   + E +   Q  + KT+  
Sbjct: 3   VSIDFTKSAQENANSYYQNAKKYHKKSEGAAKAMTQMEEKLNSIESEHVQQAAKTKTL-- 60

Query: 548 ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVI 607
             H++K  W+EKF+WF +S   L I GRDAQQNE++  ++  + D++ HAD+ GAS  ++
Sbjct: 61  --HLQKKEWYEKFHWFFTSHGSLAIGGRDAQQNELLNSKHFDENDLFFHADIFGASVVIL 118

Query: 608 KNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVT-SAWWVYPHQVSKTAPTGEYLTVGSF 666
           K            +         +S AW   +V+   + +   Q+SK+   G  L  GSF
Sbjct: 119 KGGAGADKEEKAEVAAF---AASYSSAWKKMLVSVDVYAMRRDQISKSTNKGS-LGQGSF 174

Query: 667 MIRGKKNFLPPHPL 680
           +++G++ +    PL
Sbjct: 175 LMKGEREWYRNTPL 188


>gi|366991987|ref|XP_003675759.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
 gi|342301624|emb|CCC69395.1| hypothetical protein NCAS_0C04050 [Naumovozyma castellii CBS 4309]
          Length = 1020

 Score = 85.9 bits (211), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 87/146 (59%), Gaps = 13/146 (8%)

Query: 2   VKVRMNTADV---AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL 58
           +K R+++ D+   A E+K    L   R +N+Y++S  T  F L  +          K+ +
Sbjct: 1   MKQRISSLDLQILAGELK--NSLESYRLNNIYNVSDSTRQFLLRFNKP------DSKLNV 52

Query: 59  LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAH 118
           +++ G+R+H T + R     PSGF +KLRKH++ +RL  +RQ+  DRI++ QF  G+   
Sbjct: 53  IVDCGLRIHLTDFNRPIPPAPSGFVVKLRKHLKGKRLTALRQVQNDRILVLQFADGL--F 110

Query: 119 YVILELYAQGNILLTDSEFTVLTLLR 144
           Y++LE ++ GN++L + + T+L+L R
Sbjct: 111 YLVLEFFSAGNVILLNEDRTILSLQR 136


>gi|347828081|emb|CCD43778.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 430

 Score = 83.2 bits (204), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 40/77 (51%), Positives = 51/77 (66%), Gaps = 7/77 (9%)

Query: 642 SAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNE 701
           SAWWV   QVSK+APTGE+L  GSF   GKKNFLPP  L++GFG+LF++ + S   H N+
Sbjct: 2   SAWWVTADQVSKSAPTGEFLPAGSFNTHGKKNFLPPAQLLLGFGVLFQISDESKARH-NK 60

Query: 702 RRVRGEEEGMDDFEDSG 718
            R++      DD   SG
Sbjct: 61  HRLQ------DDSPSSG 71


>gi|302509578|ref|XP_003016749.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
 gi|291180319|gb|EFE36104.1| DUF814 domain protein, putative [Arthroderma benhamiae CBS 112371]
          Length = 1073

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 65/110 (59%), Gaps = 10/110 (9%)

Query: 35  KTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRR 94
           +T++FKL        +    K  L++ +G   H T  +R   + PS F  +LRK ++TRR
Sbjct: 12  RTFLFKL--------ALPDIKKQLIINAGFHCHLTESSRTTADAPSHFVSRLRKLLKTRR 63

Query: 95  LEDVRQLGYDRIILFQFGLGMNAHYVILELYAQGNILLTDSEFTVLTLLR 144
           +  VRQ+G DRII F+   G+   Y  LE +A GN++LTD+++ ++ LLR
Sbjct: 64  ITGVRQIGTDRIIEFEISDGLFRLY--LEFFAAGNLILTDAKYGIVALLR 111


>gi|269862032|ref|XP_002650678.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220065783|gb|EED43376.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 166

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 49/68 (72%)

Query: 624 AGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMG 683
           A  F + +S+AWD +++   ++V   QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G
Sbjct: 6   ATYFALVYSKAWDEQVIKDVFYVSSDQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYG 65

Query: 684 FGLLFRLD 691
            G++FR++
Sbjct: 66  VGVVFRIN 73


>gi|374850433|dbj|BAL53422.1| hypothetical conserved protein [uncultured crenarchaeote]
          Length = 530

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 62/116 (53%), Gaps = 4/116 (3%)

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F  FI+S  +  + GRDA+ N M++KR++ + D+ +H ++ G+ + V+ N          
Sbjct: 332 FREFITSGGFRALLGRDARSNIMLLKRHLGENDLVLHTEIPGSPAAVLINGVKASET--- 388

Query: 620 TLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            + +      C+S+AW       S + V   QVS T P+G+YL  GSFM+ G K F
Sbjct: 389 DVEECAQMVGCYSRAWRENFSNVSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKF 444


>gi|315427275|dbj|BAJ48887.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
 gi|343485854|dbj|BAJ51508.1| conserved hypothetical protein [Candidatus Caldiarchaeum
           subterraneum]
          Length = 628

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 62/116 (53%), Gaps = 4/116 (3%)

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           F  F++S  +  + GRDA+ N M++KR++ + D+ +H ++ G+ + V+ N          
Sbjct: 430 FREFVTSGGFRALLGRDARSNIMLLKRHLGENDLVLHTEIPGSPAAVLINGVKASET--- 486

Query: 620 TLNQAGCFTVCHSQAWDSKMV-TSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNF 674
            + +      C+S+AW       S + V   QVS T P+G+YL  GSFM+ G K F
Sbjct: 487 DVQECAQMVGCYSRAWRENFSNVSVYAVKAEQVSFTPPSGQYLPKGSFMVYGSKKF 542



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 67/128 (52%), Gaps = 12/128 (9%)

Query: 6   MNTADVAAEV-KCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV 64
           +NT ++   V +C  R++     NVY    +  + K+   S    SGE     L + +G 
Sbjct: 4   LNTYEIGVLVAECRDRVLDSYVRNVYGFGSRAILLKVWKPS--IGSGE-----LWLTAGY 56

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILEL 124
            +     + +K++TPS   L+LR+ +  +R+ D++Q+G +R++     LG++   +++E 
Sbjct: 57  SVFYIDQSVEKESTPSTHVLQLRRKVVGKRITDIKQVGGERLVT----LGLDGFELVVEC 112

Query: 125 YAQGNILL 132
              GNI+L
Sbjct: 113 MPPGNIVL 120


>gi|402470262|gb|EJW04606.1| hypothetical protein EDEG_01190 [Edhazardia aedis USNM 41457]
          Length = 393

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 46/72 (63%)

Query: 619 LTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMIRGKKNFLPPH 678
           L++ +     +C S+ W  K+  + ++V   QVSK A +GEYL  GSFMIRGKKN++  +
Sbjct: 131 LSIEETASMALCLSKFWKEKVTGNVYYVKSDQVSKKAQSGEYLKAGSFMIRGKKNYVDVY 190

Query: 679 PLIMGFGLLFRL 690
            L  G G++F++
Sbjct: 191 RLEYGIGIVFKI 202



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 34/40 (85%)

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKN 609
           LVI+GR AQ+N+++VK+++S  D++ HAD+ GA++ ++KN
Sbjct: 2   LVIAGRSAQENDLLVKKHLSNDDLFFHADVAGAATVILKN 41


>gi|402470263|gb|EJW04607.1| hypothetical protein EDEG_01191 [Edhazardia aedis USNM 41457]
          Length = 499

 Score = 69.7 bits (169), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 21/148 (14%)

Query: 2   VKVRMNTADVAAEVKCLRRL-IGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L+ +       NVY ++ KTY+FKL           S K  +L+
Sbjct: 1   MKQRFTFLDIRAVVNELQTIPTNTYIQNVYSINNKTYVFKL-----------SSKHFILV 49

Query: 61  ESGVRLHTTAYARDKKNTPSG----FTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
           E GVRLH  + + D  N  SG    F  K+R+ ++ ++L  ++Q+G+DRI++F+    ++
Sbjct: 50  EIGVRLHLISQS-DFDNLNSGELTFFCTKIRQLLKRQQLAQIKQVGFDRIVVFE----LS 104

Query: 117 AHYVILELYAQGNILLTDSEFTVLTLLR 144
              +  E +A GN+++ D ++ V  + R
Sbjct: 105 NVCIYFEFFAAGNLVICDKDYVVKLVYR 132


>gi|313242815|emb|CBY39580.1| unnamed protein product [Oikopleura dioica]
          Length = 96

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 57/104 (54%), Gaps = 9/104 (8%)

Query: 2   VKVRMNTADVAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A +  +R  L+     N+YD+  KTY+ KL   +         K +LL 
Sbjct: 1   MKTRFTVLDIKAALAEIRDNLLHHYVLNIYDIDSKTYLLKLRKCAS--------KHVLLF 52

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
           ESG R+H T     K   PSGF++KLRKH++ +RL +  QLG+D
Sbjct: 53  ESGNRVHPTEMEWPKNTAPSGFSMKLRKHLKGKRLINATQLGFD 96


>gi|70913606|ref|XP_731580.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56501553|emb|CAH83949.1| hypothetical protein PC300777.00.0 [Plasmodium chabaudi chabaudi]
          Length = 56

 Score = 66.6 bits (161), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 28/56 (50%), Positives = 42/56 (75%)

Query: 73  RDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           R+K   PSGFT+KLRKH+R+R++ ++ QLG DR++  QFG   N +++I+ELY  G
Sbjct: 1   REKDVMPSGFTMKLRKHLRSRKITNISQLGGDRVVDIQFGYDDNVYHLIVELYIAG 56


>gi|70918391|ref|XP_733179.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56504739|emb|CAH85243.1| hypothetical protein PC301461.00.0 [Plasmodium chabaudi chabaudi]
          Length = 169

 Score = 66.2 bits (160), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 74/135 (54%), Gaps = 7/135 (5%)

Query: 328 STQIYDEFCPLLLNQFRSR------EFVKFETFDAALDEFYSKIESQRAEQ-QHKAKEDA 380
           + +++ EF P+LL    ++      E +KF  F+  +D ++SK+E  + ++ Q   K   
Sbjct: 14  NDRLFVEFIPILLKNHINKIDEKKIELIKFNDFNMCVDTYFSKMELTKYDKHQEMNKRKN 73

Query: 381 AFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDL 440
           A  K++KI +D E R+  L++EV+   K   LI+ N E V  AI  +R A++   +WE +
Sbjct: 74  ALTKIDKIKLDHERRIEALEKEVNILKKKILLIQANDEFVGEAIKLMRAAISTSANWEKI 133

Query: 441 ARMVKEERKAGNPVA 455
              VK  +K  +PVA
Sbjct: 134 WDHVKLFKKRNHPVA 148


>gi|21227915|ref|NP_633837.1| hypothetical protein MM_1813 [Methanosarcina mazei Go1]
 gi|20906335|gb|AAM31509.1| conserved protein [Methanosarcina mazei Go1]
          Length = 407

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 70/143 (48%), Gaps = 11/143 (7%)

Query: 2   VKVRMNTADVAAEVKCL----RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVL 57
           +K  M++ADVAA V  L    R +I  +   +Y  + +     L     V   G      
Sbjct: 1   MKQDMSSADVAAVVAELSAGPRSIIDAKIGKIYQPASEEIRINLY----VFHQGRDN--- 53

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
           L++E+G RLH T + R     P  F + LRK++   R+  V Q  +DRI+          
Sbjct: 54  LVIEAGKRLHMTKHIRPSPTLPQAFPMLLRKYLMGGRIVSVEQHDFDRIVKIGIERAGVR 113

Query: 118 HYVILELYAQGNILLTDSEFTVL 140
             +I+EL+A+GN+L+ DSE  ++
Sbjct: 114 STLIVELFARGNVLIVDSENKII 136



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/135 (24%), Positives = 65/135 (48%), Gaps = 2/135 (1%)

Query: 316 LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKIE-SQRAEQQH 374
           L   H   E     + +D   P  LN++   E   F++F+ ALDEF+ K    Q AE + 
Sbjct: 269 LRPQHIKQEINGKMETFD-VVPFDLNRYSEYEKEYFDSFNTALDEFFGKKALEQVAEVKE 327

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANR 434
             K++       +  M QE  +   ++E++++  +AE +  N + ++     +  A A  
Sbjct: 328 AEKKEKTLGVFERRLMQQEESLAKFEKEIEKNNALAETVYANYQIIEELFSVLNGARAKG 387

Query: 435 MSWEDLARMVKEERK 449
            SW+++  ++K+ +K
Sbjct: 388 YSWDEIRSILKQAKK 402


>gi|430813961|emb|CCJ28738.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 441

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/50 (52%), Positives = 35/50 (70%)

Query: 659 EYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEE 708
           EY TVG+FMI+GKKNFLPP  LI+G+G+L+ +DE S    L  +  +  E
Sbjct: 3   EYSTVGTFMIQGKKNFLPPSQLILGYGILWTIDEVSKARRLENKLSKNNE 52


>gi|410667776|ref|YP_006920147.1| fibronectin-binding A domain-containing protein [Thermacetogenium
           phaeum DSM 12270]
 gi|409105523|gb|AFV11648.1| fibronectin-binding A domain-containing protein [Thermacetogenium
           phaeum DSM 12270]
          Length = 587

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 96/422 (22%), Positives = 166/422 (39%), Gaps = 107/422 (25%)

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
           G G +++  ++   GL P ++L    + E +A+         F+  +  ++ G+  PE  
Sbjct: 204 GIGRSMAREVVYRAGLDPELRLEFCGEYELHAL------FQSFQKTVIPLLRGN-KPEPV 256

Query: 309 ILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYS-KIES 367
           I+ Q               +T +  ++ PL L  +R  + +  ET +  LD +Y+ K ES
Sbjct: 257 IIFQG--------------TTAV--DYAPLPLTHYRGLKSIPCETVNEMLDRYYAAKAES 300

Query: 368 QRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAV 427
            R +Q              K H++       ++Q +DR  K   L E   +D   A  A+
Sbjct: 301 NRLKQ-------------IKTHLET-----VIRQNMDRCSKKLTLQE---KDEAEAREAL 339

Query: 428 RVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLP 487
           ++ L   M +  L  +    R+   P                     NL + D      P
Sbjct: 340 KLRLLGEMIFAHLHLIRPGSREVELP---------------------NLYQPDA-----P 373

Query: 488 VEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEK--------KTRLQI 539
             K+E+D +LSA  NA+R +    ++  K   TI A  K  K+ ++        KT L+ 
Sbjct: 374 SLKIELDPSLSAVQNAQRLF----RRYDKARDTIKALEKQIKSTKEEIQYLNSIKTALE- 428

Query: 540 LQEKTVANISHM-----------------RKVHWFEK----FNWFISSENYLVISGRDAQ 578
            Q + +A+   +                 R+    +K       F S + Y ++ G++ Q
Sbjct: 429 -QAECLADYQEIHEELEDAGYIRSDGKKSRRSKGTKKAPPQIMRFTSRDGYQILVGKNNQ 487

Query: 579 QNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSK 638
           QN+ I  R     D ++H     A + VI   +P Q +PP TL +A       S+A  S 
Sbjct: 488 QNDYITMRLARDEDYWLHVK-DSAGAHVIVKSKPGQEIPPSTLEEAAGLAAHFSEARYSS 546

Query: 639 MV 640
            V
Sbjct: 547 KV 548


>gi|269863395|ref|XP_002651206.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064951|gb|EED42851.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 150

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269865201|ref|XP_002651841.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063780|gb|EED42216.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 142

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 34/43 (79%)

Query: 649 HQVSKTAPTGEYLTVGSFMIRGKKNFLPPHPLIMGFGLLFRLD 691
            QVSKTAP+GE+L  GSFMI+GKKN + P+ L  G G++FR++
Sbjct: 7   RQVSKTAPSGEFLAKGSFMIKGKKNMVYPYRLEYGVGVVFRIN 49


>gi|269863970|ref|XP_002651409.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064579|gb|EED42648.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 185

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269863464|ref|XP_002651232.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064908|gb|EED42825.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 164

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269863903|ref|XP_002651387.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064622|gb|EED42668.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 172

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269864916|ref|XP_002651741.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063963|gb|EED42314.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 184

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269865384|ref|XP_002651904.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063549|gb|EED42152.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 224

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269867274|ref|XP_002652541.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220062265|gb|EED41515.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 246

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269866242|ref|XP_002652204.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220062960|gb|EED41852.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 240

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|269865392|ref|XP_002651907.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220063544|gb|EED42149.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 275

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCLRRLIGMR-CSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K + +  DVAA    LR ++  +   N Y    + + FK            S K +L +
Sbjct: 1   MKQKFSVLDVAAVTNELRVILKNKYVVNFYSHKQRLFYFKF-----------SSKDILAI 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E GVRL+ T    D ++  + F  KLR+  R  R+ D+ QLG+DRI++    + +  + +
Sbjct: 50  EPGVRLNLTL---DHESEINHFCKKLRETCRNLRVVDIYQLGFDRIVM----VDLYRYRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           +LE Y+ GNI++ D    ++ + R
Sbjct: 103 VLEFYSLGNIIILDRNDMIVEIQR 126


>gi|381179596|ref|ZP_09888446.1| Fibronectin-binding A domain protein [Treponema saccharophilum DSM
           2985]
 gi|380768543|gb|EIC02532.1| Fibronectin-binding A domain protein [Treponema saccharophilum DSM
           2985]
          Length = 511

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 83/174 (47%), Gaps = 19/174 (10%)

Query: 474 NNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQES---KQEKTIT-------- 522
           +N  E DD E    V ++ +D +LSAH NA+ +YE  +K ES   + E+ I+        
Sbjct: 287 SNFIEADDWESGEKV-RIRIDPSLSAHENAQSYYEKYRKSESGIAELERDISIAEGELEK 345

Query: 523 --AHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQN 580
             A      A +   +L+ +  KT       +K H   +F    S + + +I GRDA +N
Sbjct: 346 LDAQYAEMVAEKNPIKLEQVLRKTQRPKQLEKKTHPGLEF----SVDGWTIIVGRDADEN 401

Query: 581 EMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQA 634
           + +++  +   D+++H   +      IKN RP + VP   L  AG   V +S+A
Sbjct: 402 DELLRHNVKGQDMWLHVRDYSGGYVFIKN-RPGKTVPLEILLYAGNLAVFYSKA 454


>gi|385810177|ref|YP_005846573.1| RNA-binding protein [Ignavibacterium album JCM 16511]
 gi|383802225|gb|AFH49305.1| Putative RNA-binding protein [Ignavibacterium album JCM 16511]
          Length = 538

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/218 (21%), Positives = 98/218 (44%), Gaps = 32/218 (14%)

Query: 446 EERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARR 505
           E  + GN +   I+K++   N + L      D++ + +KT+   ++++D  L+   N  R
Sbjct: 294 EYNRLGNILLININKIHSGMNSIIL------DDIYESDKTI---EIKLDPKLTPKENVNR 344

Query: 506 WYELKKKQESKQEKTI-------------------TAHSKAFKAAEKKTRLQILQEKTVA 546
           ++E  K+ +++  K I                   T++S   K  E+  +   ++ KT  
Sbjct: 345 YFEKAKESKTQYHKAIELIEIVSREKDRLIEFKNRTSNSSTVKELEQIAKGLKIKMKTEK 404

Query: 547 NISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTV 606
           NI         EKF  ++    Y V  G+D++ N+M+  ++  + D++ HA     S  V
Sbjct: 405 NIQESIS----EKFKQYLVDGKYKVYVGKDSKSNDMLTLKFAKQNDLWFHARAVPGSHVV 460

Query: 607 IKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
           ++    ++P+P   L +       HS+A  + +V  ++
Sbjct: 461 LRIENTKEPIPKSVLKKVASLAAYHSKAKTAGLVPVSY 498


>gi|345892116|ref|ZP_08842940.1| hypothetical protein HMPREF1022_01600 [Desulfovibrio sp.
           6_1_46AFAA]
 gi|345047527|gb|EGW51391.1| hypothetical protein HMPREF1022_01600 [Desulfovibrio sp.
           6_1_46AFAA]
          Length = 534

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FIS + + ++ GRDA+ N +  ++  +  D+++HAD    S  +I+     QPVP  TL+
Sbjct: 412 FISEDGFALLRGRDAKGN-LAARKLAAPHDIWLHADNGPGSHVIIRRAHGGQPVPERTLD 470

Query: 623 QAGCFTVCHSQAWDSKMV 640
           QAG    C S   D+ + 
Sbjct: 471 QAGGLAACKSWQRDAAVA 488


>gi|303326372|ref|ZP_07356815.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302864288|gb|EFL87219.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 556

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FIS + + ++ GRDA+ N +  ++  +  D+++HAD    S  +I+     QPVP  TL+
Sbjct: 434 FISEDGFALLRGRDAKGN-LAARKLAAPHDIWLHADNGPGSHVIIRRAHGGQPVPERTLD 492

Query: 623 QAGCFTVCHSQAWDSKMV 640
           QAG    C S   D+ + 
Sbjct: 493 QAGGLAACKSWQRDAAVA 510


>gi|312143921|ref|YP_003995367.1| fibronectin-binding A domain-containing protein [Halanaerobium
           hydrogeniformans]
 gi|311904572|gb|ADQ15013.1| Fibronectin-binding A domain protein [Halanaerobium
           hydrogeniformans]
          Length = 582

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 45/78 (57%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           F+SS  Y ++ GR+ +QN+ + K+  +KGD+++H      S  +IK    ++ +P  TLN
Sbjct: 462 FVSSNGYQILIGRNNKQNDKLTKKIANKGDIWLHTKTIAGSHVIIKRDTSKE-IPDTTLN 520

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       S+A +SK V
Sbjct: 521 EAASLAAYFSKARNSKNV 538


>gi|397906011|ref|ZP_10506838.1| Fibronectin/fibrinogen-binding protein [Caloramator australicus
           RC3]
 gi|397160925|emb|CCJ34173.1| Fibronectin/fibrinogen-binding protein [Caloramator australicus
           RC3]
          Length = 574

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 58/102 (56%), Gaps = 12/102 (11%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNA-HY 119
           R+  T   ++   T   F + LRK+++  RLED++Q+ +DRI+  +F     LG ++ +Y
Sbjct: 57  RIQITNINKENPQTAPNFVMVLRKYLQNSRLEDIKQINFDRIVEIKFEGKDELGYSSYYY 116

Query: 120 VILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           +I+E+  +  NI+L D ++ ++  ++    D      M+R+R
Sbjct: 117 IIIEIMGKHSNIILLDEKYKIIDAIKHLGSD------MNRYR 152


>gi|310779110|ref|YP_003967443.1| fibronectin-binding A domain-containing protein [Ilyobacter
           polytropus DSM 2926]
 gi|309748433|gb|ADO83095.1| Fibronectin-binding A domain protein [Ilyobacter polytropus DSM
           2926]
          Length = 539

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 54/98 (55%), Gaps = 10/98 (10%)

Query: 69  TAYARDKKN----TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHYV 120
             Y +D K     TP  F+L LRKH+    + +V QLGYDRI++F+F     LG    Y+
Sbjct: 55  VCYLKDNKENAPETPMSFSLNLRKHLLNSIITEVSQLGYDRILVFKFRKLNELGQYKDYI 114

Query: 121 I-LELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIM 156
           +  E+  +  N++LTD +  +L L++    ++  + ++
Sbjct: 115 LYFEIMGKHSNLILTDKDGGILDLMKKFSLEENKLRVL 152


>gi|212704765|ref|ZP_03312893.1| hypothetical protein DESPIG_02829 [Desulfovibrio piger ATCC 29098]
 gi|212671828|gb|EEB32311.1| hypothetical protein DESPIG_02829 [Desulfovibrio piger ATCC 29098]
          Length = 604

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 56/125 (44%), Gaps = 8/125 (6%)

Query: 508 ELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSE 567
           EL   Q ++QE  +     A   A K  R       TVA +  +R          F+S +
Sbjct: 423 ELATVQAARQEALLGGIGHAAGEAGKPDR------STVA-LGALRGAALPRNVQLFVSDD 475

Query: 568 NYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCF 627
            + ++ GRDA+ N +  ++  +  D+++H D    S  +I+     Q VP  TL+QAG  
Sbjct: 476 GFALLRGRDAKGN-IAARKLAAAHDIWLHTDGGPGSHVIIRRAHAGQEVPERTLDQAGAL 534

Query: 628 TVCHS 632
             C S
Sbjct: 535 AACKS 539


>gi|110456080|gb|ABG74581.1| RNA-binding protein-like protein [Musa acuminata AAA Group]
          Length = 53

 Score = 51.2 bits (121), Expect = 0.002,   Method: Composition-based stats.
 Identities = 20/24 (83%), Positives = 24/24 (100%)

Query: 12 AAEVKCLRRLIGMRCSNVYDLSPK 35
          AAE+KCLR+LIGMRC+NVYD+SPK
Sbjct: 1  AAELKCLRKLIGMRCANVYDISPK 24


>gi|255513711|gb|EET89976.1| Predicted fibronectin-binding protein [Candidatus Micrarchaeum
           acidiphilum ARMAN-2]
          Length = 374

 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 67/156 (42%), Gaps = 15/156 (9%)

Query: 1   MVKVRMNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKV---L 57
           M   +++T ++A+  K LR L G      Y +    +  K         S + EKV   +
Sbjct: 1   MASRQVSTLEIASLSKELRFLEGFHIDKFYQVDESRFRIK--------ASSKGEKVNLGI 52

Query: 58  LLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNA 117
            L     R  T   A    + P+ F++ +R+ I    ++ V  L  DRII  +   G   
Sbjct: 53  WLCRYIGRTETITIA----DKPTNFSIAVRRRISGFVVDSVVMLNSDRIIEIKCSKGQET 108

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGV 153
             VI E++ +GNI+L D  +T+      H   D+ V
Sbjct: 109 KSVIFEMFGRGNIILCDGSYTIELAYAPHTFKDRAV 144


>gi|385799646|ref|YP_005836050.1| fibronectin-binding A domain-containing protein [Halanaerobium
           praevalens DSM 2228]
 gi|309389010|gb|ADO76890.1| Fibronectin-binding A domain protein [Halanaerobium praevalens DSM
           2228]
          Length = 583

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS  Y ++ GR+ +QN+ + K+  + GD+++H  +   S  +IK    E  VP  TL 
Sbjct: 462 FISSNGYQILVGRNNKQNDRLSKKIANNGDIWLHTKVIAGSHVIIK-RDTEVEVPEQTLT 520

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       SQA +S  V
Sbjct: 521 EAAAIAAYFSQARESTNV 538


>gi|39992427|gb|AAH64364.1| SDCCAG1 protein, partial [Homo sapiens]
          Length = 435

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 29/40 (72%)

Query: 673 NFLPPHPLIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           NFLPP  L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 1   NFLPPSYLMMGFSFLFKVDESCVWRHQGERKVRVQDEDME 40


>gi|344304197|gb|EGW34446.1| hypothetical protein SPAPADRAFT_70556 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 865

 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/172 (23%), Positives = 84/172 (48%), Gaps = 11/172 (6%)

Query: 280 AIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQ-IYDEFCPL 338
            +Q +  A+   ED    ++       G+I+       K +  +++ SS + IYDEF P 
Sbjct: 106 GLQSVANALGACEDAYLSLVDSKNENTGFIV------AKRNKASDTNSSFEFIYDEFHPF 159

Query: 339 L---LNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENR 395
                NQ    ++ +   ++  LD F+S +ES + E + +  +  A  +L+K   +++ +
Sbjct: 160 KPYKANQ-EDYQYTEVSGYNKTLDRFFSTLESSKFELKVEQLKQTAAKRLDKAKSERDKQ 218

Query: 396 VHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
           + +L ++ D + K  ELI+Y+ + VD     ++  L   M W ++  +++ E
Sbjct: 219 IQSLLEQQDLNAKKGELIQYHADLVDDCRAYIQSFLDQSMDWTNIETVLELE 270



 Score = 43.1 bits (100), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 24/33 (72%)

Query: 892 ISRGQKGKLKKMKEKYGDQDEEERNIRMALLAV 924
           +SRG++ KLKK+  KY DQDEEER +RM  L  
Sbjct: 656 LSRGKRSKLKKIAAKYADQDEEERRLRMDALGT 688


>gi|429961216|gb|ELA40761.1| hypothetical protein VICG_02202, partial [Vittaforma corneae ATCC
           50505]
          Length = 147

 Score = 49.7 bits (117), Expect = 0.008,   Method: Composition-based stats.
 Identities = 41/144 (28%), Positives = 63/144 (43%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L  RL      N Y    +    K  N           K  LL+
Sbjct: 1   MKQRFTLLDLRATVNELNERLTNTFIQNFYSTQQRFIYIKFSN-----------KDTLLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E G R H T   ++  +  S F  KLR+  R  R+  + Q G+DRI +    + +    +
Sbjct: 50  EPGFRFHLT---QNADSEISHFCKKLREKCRHARVHRIYQFGFDRIAI----IDLQRVRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           ++E ++ GN+L+ D    +L LLR
Sbjct: 103 VIEFFSAGNMLVLDENDQILELLR 126


>gi|397691486|ref|YP_006528740.1| RNA-binding protein snRNP [Melioribacter roseus P3M]
 gi|395812978|gb|AFN75727.1| RNA-binding protein snRNP [Melioribacter roseus P3M]
          Length = 363

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 41/182 (22%), Positives = 78/182 (42%), Gaps = 25/182 (13%)

Query: 491 VEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRLQILQEK------- 543
           +++D  LS   N  R++E  K ++ + EK+I  ++      E K +  IL+E        
Sbjct: 158 IKLDPKLSPQKNIDRYFEKAKSEKIEYEKSIELYN------ELKNKYDILKELDEKLNKE 211

Query: 544 -TVANISHMRKVHWFEK-----------FNWFISSENYLVISGRDAQQNEMIVKRYMSKG 591
            T+  +  + K    +K           F  FI    Y V  G+D++ N+ +  R+  + 
Sbjct: 212 LTLEELQTIEKQLGIKKKMEMQDKSRPNFRHFIIDGKYNVYVGKDSKNNDELTLRFAKQN 271

Query: 592 DVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVYPHQV 651
           D + HA     S  V++   P++ VP   L +A      +S+A  + +   ++    + V
Sbjct: 272 DYWFHARSVSGSHVVLRTDNPKEVVPKSVLKKAASIAAFYSKAKTAGLAPVSYTFKKYVV 331

Query: 652 SK 653
            K
Sbjct: 332 KK 333


>gi|407477620|ref|YP_006791497.1| hypothetical protein Eab7_1781 [Exiguobacterium antarcticum B7]
 gi|407061699|gb|AFS70889.1| Hypothetical protein Eab7_1781 [Exiguobacterium antarcticum B7]
          Length = 564

 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 66/139 (47%), Gaps = 17/139 (12%)

Query: 15  VKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV---RLHTTAY 71
           V+ L+ L+G R + ++       IF +          E + V+LL  +     RLH T+ 
Sbjct: 12  VRELQPLVGARINKIHQPYALDLIFSV--------RAERKNVMLLASANAMYARLHLTSE 63

Query: 72  ARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA 126
                + P  F + LRKH+    +E + QLG DRIIL +      LG   A  + +EL  
Sbjct: 64  TTTNPSEPPMFCMMLRKHLEGGFIESIEQLGRDRIILMRVRSRNELGDEEAKKLYIELMG 123

Query: 127 Q-GNILLTDSEFTVLTLLR 144
           +  NILLTD +  +L  ++
Sbjct: 124 RHSNILLTDGQDKILDAIK 142


>gi|417002378|ref|ZP_11941767.1| putative fibronectin-binding protein [Anaerococcus prevotii
           ACS-065-V-Col13]
 gi|325479519|gb|EGC82615.1| putative fibronectin-binding protein [Anaerococcus prevotii
           ACS-065-V-Col13]
          Length = 582

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 15/148 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  +  E+K   +L+G +   +   S    +F       +   G S K+LL   +   R+
Sbjct: 8   TRKITNELK--EKLLGGKIQKISQPSKNDIVF------NIYSMGNSYKLLLSANNNEARV 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYV 120
           + T    +  + P  F + LRKHI   ++ D+ Q G DR+I+F      + G   +   +
Sbjct: 60  NITNIKYENPDVPPNFCMVLRKHINQGKIVDINQKGLDRVIIFSISSIDEMGYDTSKKLI 119

Query: 121 ILELYAQGNILLTDSEFTVLTLLRSHRD 148
           I  +    NI+L D +F ++  ++   D
Sbjct: 120 IEIMGKYSNIILVDDDFKIIDSIKRVND 147


>gi|429961917|gb|ELA41461.1| hypothetical protein VICG_01445 [Vittaforma corneae ATCC 50505]
          Length = 179

 Score = 49.3 bits (116), Expect = 0.010,   Method: Composition-based stats.
 Identities = 41/144 (28%), Positives = 63/144 (43%), Gaps = 19/144 (13%)

Query: 2   VKVRMNTADVAAEVKCL-RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLM 60
           +K R    D+ A V  L  RL      N Y    +    K  N           K  LL+
Sbjct: 1   MKQRFTLLDLRATVNELNERLTNTFIQNFYSTQQRFIYIKFSN-----------KDTLLV 49

Query: 61  ESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV 120
           E G R H T   ++  +  S F  KLR+  R  R+  + Q G+DRI +    + +    +
Sbjct: 50  EPGFRFHLT---QNADSEISHFCKKLREKCRHARVHRIYQFGFDRIAI----IDLQRVRI 102

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           ++E ++ GN+L+ D    +L LLR
Sbjct: 103 VIEFFSAGNMLVLDENDQILELLR 126


>gi|374711077|ref|ZP_09715511.1| fibronectin-binding A domain-containing protein, partial
           [Sporolactobacillus inulinus CASD]
          Length = 306

 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 62/131 (47%), Gaps = 13/131 (9%)

Query: 13  AEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYA 72
           A V+ L+   G R + +Y  +P   IF L      +     + ++ +  +  R+H T  +
Sbjct: 10  AAVEELQDFTGGRIAKIYQPTPTDLIFHLR-----SRHARGKLLISINAAFARMHLTEQS 64

Query: 73  RDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYA 126
            D    P  F + LRKH+    ++ + QLG++RI+        +FG  +    +I+EL  
Sbjct: 65  ADNPQEPPMFCMLLRKHLEGSVIQRIEQLGFERIVHIDARSRNEFG-DLTEKQLIIELMG 123

Query: 127 Q-GNILLTDSE 136
           +  N++L D E
Sbjct: 124 RHSNVILIDKE 134


>gi|172057940|ref|YP_001814400.1| fibronectin-binding A domain-containing protein [Exiguobacterium
           sibiricum 255-15]
 gi|171990461|gb|ACB61383.1| Fibronectin-binding A domain protein [Exiguobacterium sibiricum
           255-15]
          Length = 564

 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 67/139 (48%), Gaps = 17/139 (12%)

Query: 15  VKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV---RLHTTAY 71
           V+ L+ L+G R + ++       IF +          E + V+LL  +     RLH T+ 
Sbjct: 12  VQELQPLVGARINKIHQPYALDLIFSV--------RAERKNVMLLASANAMYARLHLTSE 63

Query: 72  ARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA 126
           +    + P  F + LRKH+    +E + QLG DR+IL +      LG   A  + +EL  
Sbjct: 64  STSNPSEPPMFCMMLRKHLEGGFIESIEQLGRDRVILMRVRSRNELGDEEAKKLYIELMG 123

Query: 127 Q-GNILLTDSEFTVLTLLR 144
           +  NILLTD +  +L  ++
Sbjct: 124 RHSNILLTDGQDKILDAIK 142


>gi|398310663|ref|ZP_10514137.1| hypothetical protein BmojR_14603 [Bacillus mojavensis RO-H-1]
          Length = 570

 Score = 48.5 bits (114), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 59/127 (46%), Gaps = 19/127 (14%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +   G+++K+LL    S  R+H TA   +  + 
Sbjct: 18  RITGGRITKVHQPYKHDVIFH------IRADGKNQKLLLSAHPSYSRVHITAQTYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYVILELYAQ-----GN 129
           P  F + LRKHI    +E + Q G DRI++F       +G   H    +LY +      N
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETH---RKLYVEIMGRHSN 128

Query: 130 ILLTDSE 136
           I+LTD E
Sbjct: 129 IILTDGE 135


>gi|403234858|ref|ZP_10913444.1| Fibronectin-binding A domain-containing protein [Bacillus sp.
           10403023]
          Length = 569

 Score = 48.5 bits (114), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 68/136 (50%), Gaps = 15/136 (11%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRL 66
           T  +A E+K  + L   R S +Y       IF+      +  +G++ K+LL    S  R+
Sbjct: 8   THAIANELK--QTLESGRISKIYQPYKNELIFQ------IRSNGKNHKLLLSAHPSYARI 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVI 121
           H T    D  + P  F + LRKH+    +E +RQ+  DRII+F       LG ++   +I
Sbjct: 60  HLTNELYDNPHEPPMFCMLLRKHLEGSIIEAIRQVDKDRIIIFDIKGRNELGDVSYKQLI 119

Query: 122 LELYAQ-GNILLTDSE 136
           +E+  +  NI+L D+E
Sbjct: 120 IEIMGRHSNIILVDTE 135


>gi|220929383|ref|YP_002506292.1| fibronectin-binding A domain-containing protein [Clostridium
           cellulolyticum H10]
 gi|219999711|gb|ACL76312.1| Fibronectin-binding A domain protein [Clostridium cellulolyticum
           H10]
          Length = 592

 Score = 48.1 bits (113), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 124/588 (21%), Positives = 226/588 (38%), Gaps = 121/588 (20%)

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MN 116
           S  RLH T   ++    P  F + +RKH+   RL ++    Y+RII         LG + 
Sbjct: 55  SNPRLHLTTLQKENPAAPPVFCMLMRKHVAGGRLLNISFHDYERIITLNIESVNELGDLT 114

Query: 117 AHYVILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTAS 175
              +++E+  +  NI+L +SE  ++  ++ H D D              I  V E   A 
Sbjct: 115 VKRLVVEIMGKYSNIILLNSENKIIDSVK-HVDSD--------------ISSVREIMPAR 159

Query: 176 KLHAALTSSKE-PDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGAR 234
                   +KE P+  E DK+          ++EN+ G                      
Sbjct: 160 TYLLPPAQNKELPENTEVDKI---------FNEENIKG---------------------- 188

Query: 235 AKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDW 294
           AK P    +L    G+ P     I    G+     + E+N  +   I+V   A+AK+ D 
Sbjct: 189 AKHPE-GLILNTVKGFSPYTCRDICASAGVPSKTPIGELNDSDKEKIKV---ALAKYIDK 244

Query: 295 LQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETF 354
           ++   S +  P   I+ ++K            S  +  D +C       +   +  +E  
Sbjct: 245 IK---SNNFSP--CIVYEDK------------SMLRPIDFYC---FEPSKEVFYKSYELL 284

Query: 355 DAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK------ 408
             ALD++Y      R   +   ++     K+ K  +++  +  T+  E  R V       
Sbjct: 285 STALDQYYM----LRDTNERLGQKMGDVLKVVKNGIERCQKKTTMFNEKLREVSERDKLQ 340

Query: 409 -MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNC 467
              ELI  N+  +     + RV L    + E+   +   E K+    A    K Y +   
Sbjct: 341 LYGELITANIYCIAEGAKSARV-LNYYSANEEYVDIPLNEYKSAQDNAQKYFKKYSKAKS 399

Query: 468 MSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAH-SK 526
             L ++  L E   E     +E ++  L +  + N+R+  +     E +QE     +  +
Sbjct: 400 THLNVTKQLAETLSE-----LEYLQSVLTMLGNCNSRQEID-----EIRQELIDQGYIRQ 449

Query: 527 AFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKR 586
           ++K A+ K      Q+K  + +              FISS+ + ++ G++ +QN+++  +
Sbjct: 450 SYKNAKNK------QDKPSSPLE-------------FISSDGFQILVGKNNKQNDLLTLK 490

Query: 587 YMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQA 634
             +  D+++H      S  +I+  R    VP  TL +A      HS A
Sbjct: 491 TAASNDLWLHTKNIPGSHVIIRTER--NTVPDSTLLEAATLAAYHSSA 536


>gi|317057453|ref|YP_004105920.1| fibronectin-binding A domain-containing protein [Ruminococcus albus
           7]
 gi|315449722|gb|ADU23286.1| Fibronectin-binding A domain protein [Ruminococcus albus 7]
          Length = 594

 Score = 47.8 bits (112), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKK 76
           L  LIG R   ++  S    +  L    G+      +K+L+   +G  RLH TA   +  
Sbjct: 15  LMPLIGGRVDKIHQPSKGELLIALRTYDGI------KKLLINTVAGTARLHLTAAEIENP 68

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA-QGNI 130
             P  F + +RKH+   +L D+RQ  ++R+I+  F     LG M    V +EL   + N+
Sbjct: 69  KQPPMFCMLMRKHLSGAKLADIRQPEHERVIMLDFDATNELGDMVRLTVTVELMGRRANL 128

Query: 131 LLTD 134
           LLTD
Sbjct: 129 LLTD 132


>gi|385264690|ref|ZP_10042777.1| hypothetical protein MY7_1447 [Bacillus sp. 5B6]
 gi|385149186|gb|EIF13123.1| hypothetical protein MY7_1447 [Bacillus sp. 5B6]
          Length = 568

 Score = 47.8 bits (112), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+HTT  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHTTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|255528127|ref|ZP_05394955.1| Fibronectin-binding A domain protein [Clostridium carboxidivorans
           P7]
 gi|255508168|gb|EET84580.1| Fibronectin-binding A domain protein [Clostridium carboxidivorans
           P7]
          Length = 541

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 57  LLLMESGV--RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF--- 111
           LL+  S V  ++H T  ++     P  F + LRKHI T RL ++RQL  DR+I   F   
Sbjct: 11  LLISASSVYPKIHLTQLSKTNPMQPPLFCMVLRKHINTGRLVNIRQLDTDRVIFLDFEST 70

Query: 112 -GLGMNAHYVI 121
             LG N+ Y +
Sbjct: 71  DELGFNSIYTL 81


>gi|435853658|ref|YP_007314977.1| putative RNA-binding protein, snRNP like protein [Halobacteroides
           halobius DSM 5150]
 gi|433670069|gb|AGB40884.1| putative RNA-binding protein, snRNP like protein [Halobacteroides
           halobius DSM 5150]
          Length = 584

 Score = 47.4 bits (111), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 80/345 (23%), Positives = 157/345 (45%), Gaps = 67/345 (19%)

Query: 345 SREFVKFETFD---AALDEFYSKIESQR------AEQQHKAKEDAAFHKLNKIHMDQE-- 393
            +E +K +  D   +A ++ ++KI++++       ++++  KE  AF KL +  + QE  
Sbjct: 221 QQELIKPKEIDNLWSAFNDIFNKIKNEKFNPTLVLDKENNLKEYEAF-KLKQFDLPQESF 279

Query: 394 ------------NRVHTLKQEVDR-SVKMAELIEYNLEDVDAAILAVRVALANRMSWEDL 440
                       NR+  ++++V+R + KM  +I  N+E++      VR  L         
Sbjct: 280 TSVNQLLDYYFTNRI--IQKKVNRLTNKMNNIIRDNIENIKKKYSKVRGQLKG------- 330

Query: 441 ARMVKEERKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAH 500
           A+   + +  G  +   I +L   +N ++L    N    +++E T     +E+D  L+  
Sbjct: 331 AKNADKHQLKGELITANIYQLEKGQNKVTLQNYYN----NNQEVT-----IELDPELTPA 381

Query: 501 ANARRWYELKKKQESKQEKTITAHSKAFKAA---EKKTRLQILQEKTVANISHMRKVHWF 557
            NA+R++E K ++  K  K +   +K  KA     ++  + I Q +T+A +  + K    
Sbjct: 382 ENAQRYFE-KYEKAKKSVKYLRREAKKAKAEFEYLQQVEVNINQSETLAELQEIEKELVQ 440

Query: 558 EKFNW-----------------FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA-DL 599
           E +                   F S+  Y ++ GR+ +QN+ + K+  +  D +VH  DL
Sbjct: 441 EGYIKEQKQNNNKQNDKLPPLKFASTAGYDILVGRNNRQNDGLTKKIANNQDTWVHVKDL 500

Query: 600 HGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            G S T+I+NH  ++ +P  TL +A      +S+   S  V   +
Sbjct: 501 PG-SHTIIRNHTGKK-IPEETLLEAAQIAAFYSKGRKSSNVPVDY 543



 Score = 46.6 bits (109), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 68/140 (48%), Gaps = 13/140 (9%)

Query: 12  AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTA 70
           A + +   +LIG R   +Y   PK  +  +     + + GE+ K+L+       R+H T 
Sbjct: 10  AIKTELQNKLIGGRVDKIY--QPKENLLTIR----IRQPGENIKLLISANPQNPRIHITE 63

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI--ILFQF----GLGMNAHYVILEL 124
              D    P  F + LRKH+++ R++++ Q  ++RI  I+ Q+    G  ++   VI  +
Sbjct: 64  QDFDNPYQPPTFCMLLRKHLQSGRIKEINQPNFERILEIIIQYKNNQGELVDKKLVIELM 123

Query: 125 YAQGNILLTDSEFTVLTLLR 144
               NI+LT  +  +L  ++
Sbjct: 124 GRHSNIILTKPDEQILDCIK 143


>gi|317059002|ref|ZP_07923487.1| fibronectin-binding protein [Fusobacterium sp. 3_1_5R]
 gi|313684678|gb|EFS21513.1| fibronectin-binding protein [Fusobacterium sp. 3_1_5R]
          Length = 541

 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 59/118 (50%), Gaps = 14/118 (11%)

Query: 55  KVLLLMESGVRLHTTAYARDKKN----TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           K +L++    +L       DK+     + S F   LRKH+    L  V Q+G+DR ++F+
Sbjct: 50  KQVLVLSCNPQLPICYVTEDKETVLEESVSSFLNTLRKHLMNSFLYQVEQVGWDRTLIFR 109

Query: 111 FG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F     LG    +++I EL  +  N+ L D ++ +L LL+    D+    + +R+ +P
Sbjct: 110 FSKLTELGDYKQYFLIFELMGRNSNLFLCDQDYKILDLLKRFSLDE----VQTRNLFP 163


>gi|428279159|ref|YP_005560894.1| hypothetical protein BSNT_02575 [Bacillus subtilis subsp. natto
           BEST195]
 gi|291484116|dbj|BAI85191.1| hypothetical protein BSNT_02575 [Bacillus subtilis subsp. natto
           BEST195]
          Length = 570

 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKVHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|394993902|ref|ZP_10386641.1| YloA [Bacillus sp. 916]
 gi|393805226|gb|EJD66606.1| YloA [Bacillus sp. 916]
          Length = 568

 Score = 47.4 bits (111), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 65/133 (48%), Gaps = 15/133 (11%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG---MNAHYVILELYAQGNIL 131
           P  F + LRKHI    +E + Q G DRI++F+      +G   + A YV + +    NI+
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRALYVEI-MGRHSNII 130

Query: 132 LTDSEFTVLTLLR 144
           LTD E  ++  L+
Sbjct: 131 LTDGEGAIIDGLK 143


>gi|433446087|ref|ZP_20410218.1| fibrinogen binding protein [Anoxybacillus flavithermus TNO-09.006]
 gi|432000832|gb|ELK21724.1| fibrinogen binding protein [Anoxybacillus flavithermus TNO-09.006]
          Length = 569

 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 13/125 (10%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKN 77
           R L+G R S +Y   P   +        +   G + K+LL    +  R+H T    D  +
Sbjct: 17  RTLVGGRISKIYQPFPHELVLH------IRSYGNNYKLLLSAHPTYARIHLTNEVYDHPS 70

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQGNIL 131
            P  F + LRKHI    +E + Q+ +DRII+       + G       +I  +    NI+
Sbjct: 71  EPPMFCMLLRKHIEGGVIEAITQVDFDRIIIIHVKARNELGDVCTKQLIIEMMGRHSNII 130

Query: 132 LTDSE 136
           L D++
Sbjct: 131 LVDAQ 135


>gi|384430804|ref|YP_005640164.1| fibronectin-binding A domain-containing protein [Thermus
           thermophilus SG0.5JP17-16]
 gi|333966272|gb|AEG33037.1| Fibronectin-binding A domain protein [Thermus thermophilus
           SG0.5JP17-16]
          Length = 512

 Score = 47.0 bits (110), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 91/201 (45%), Gaps = 20/201 (9%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLRTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKIGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMVTSAWWVYPH--QVSKTAPTG 658
            S  ++K    E   PPL  L  A      HS+A   + V   +    H  +  K AP G
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQVPVDYTRKKHVWRPRKAAP-G 493

Query: 659 E--YLTVGSFMIRGKKNFLPP 677
           +  Y    +  + G    LPP
Sbjct: 494 QVLYTKAKTLFVEG----LPP 510


>gi|386360884|ref|YP_006059129.1| RNA-binding protein [Thermus thermophilus JL-18]
 gi|383509911|gb|AFH39343.1| putative RNA-binding protein, snRNP like protein [Thermus
           thermophilus JL-18]
          Length = 512

 Score = 47.0 bits (110), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 91/201 (45%), Gaps = 20/201 (9%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLKTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKIGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMVTSAWWVYPH--QVSKTAPTG 658
            S  ++K    E   PPL  L  A      HS+A   + V   +    H  +  K AP G
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQVPVDYTRKKHVWRPRKAAP-G 493

Query: 659 E--YLTVGSFMIRGKKNFLPP 677
           +  Y    +  + G    LPP
Sbjct: 494 QVLYTKAKTLFVEG----LPP 510


>gi|300854261|ref|YP_003779245.1| RNA-binding protein [Clostridium ljungdahlii DSM 13528]
 gi|300434376|gb|ADK14143.1| putative RNA binding protein [Clostridium ljungdahlii DSM 13528]
          Length = 578

 Score = 47.0 bits (110), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 6/80 (7%)

Query: 49  ESGESEKVLLLMESGV--RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI 106
           ++G     LLL  S V  ++H T  ++     P  F + LRKH+   +L D+RQL  DRI
Sbjct: 40  KNGRKNYKLLLSASPVYPKMHITVKSKQNPLQPPMFCMVLRKHLSPSKLVDIRQLDTDRI 99

Query: 107 ILFQF----GLGMNAHYVIL 122
           +   F     LG N+ Y ++
Sbjct: 100 VFLDFESSDELGFNSIYTLV 119


>gi|430759013|ref|YP_007209734.1| Fibronectin-binding protein YloA [Bacillus subtilis subsp. subtilis
           str. BSP1]
 gi|430023533|gb|AGA24139.1| Fibronectin-binding protein YloA [Bacillus subtilis subsp. subtilis
           str. BSP1]
          Length = 572

 Score = 47.0 bits (110), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 20  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 73

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 74  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 133

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 134 TDAAENVI 141


>gi|321315330|ref|YP_004207617.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           BSn5]
 gi|320021604|gb|ADV96590.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           BSn5]
          Length = 570

 Score = 47.0 bits (110), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|315924518|ref|ZP_07920739.1| fibronectin-binding protein [Pseudoramibacter alactolyticus ATCC
           23263]
 gi|315622222|gb|EFV02182.1| fibronectin-binding protein [Pseudoramibacter alactolyticus ATCC
           23263]
          Length = 595

 Score = 47.0 bits (110), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 18/106 (16%)

Query: 51  GESEKVLLLMESG--VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           G++  VLL+  +    R+H T   +   NTP  F + LRKH+   R+E +RQ   DR+IL
Sbjct: 46  GKTNYVLLMSANANQPRVHLTNKKKKNPNTPPSFCMALRKHLINGRIEAIRQHESDRVIL 105

Query: 109 F------QFGLGMNAHYVILELYAQ-----GNILLTDSEFTVLTLL 143
                  +FG       VI  L A+      NI+LT +E   L ++
Sbjct: 106 LDIATKNEFGDP-----VIKSLIAEITGRHANIILTKTEADALVII 146


>gi|147678138|ref|YP_001212353.1| RNA-binding protein [Pelotomaculum thermopropionicum SI]
 gi|146274235|dbj|BAF59984.1| hypothetical RNA-binding protein [Pelotomaculum thermopropionicum
           SI]
          Length = 290

 Score = 47.0 bits (110), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 21/115 (18%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTL 621
           F+S++ + +  GR+ +QN+ + ++     D+++HA D+ GA   +IK    E  VPP TL
Sbjct: 163 FVSTDGFQIFIGRNNKQNDYLTQKIARDNDIWLHARDIPGA-HVIIKTEGKE--VPPATL 219

Query: 622 NQAGCFTVCHSQAWDSKMVTSAWW----------------VYPHQVS-KTAPTGE 659
            +A       S+  +SK+V   +                 VY HQ +   AP GE
Sbjct: 220 EEAAGLAAYFSKGRNSKIVPVDYTFKKHVRKPKGARPGMVVYDHQKTIMAAPAGE 274


>gi|16078628|ref|NP_389447.1| persistent RNA/DNA binding protein [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|402775809|ref|YP_006629753.1| persistent RNA/DNA binding protein [Bacillus subtilis QB928]
 gi|81637590|sp|O34693.1|YLOA_BACSU RecName: Full=Uncharacterized protein YloA
 gi|2462963|emb|CAA04416.1| putative fibronectin-binding protein [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|2633937|emb|CAB13438.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. subtilis str. 168]
 gi|402480991|gb|AFQ57500.1| Putative persistent RNA/DNA binding protein [Bacillus subtilis
           QB928]
          Length = 572

 Score = 47.0 bits (110), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 20  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 73

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 74  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 133

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 134 TDAAENVI 141


>gi|221309439|ref|ZP_03591286.1| hypothetical protein Bsubs1_08641 [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|221313764|ref|ZP_03595569.1| hypothetical protein BsubsN3_08577 [Bacillus subtilis subsp.
           subtilis str. NCIB 3610]
 gi|221318688|ref|ZP_03599982.1| hypothetical protein BsubsJ_08511 [Bacillus subtilis subsp.
           subtilis str. JH642]
 gi|221322959|ref|ZP_03604253.1| hypothetical protein BsubsS_08617 [Bacillus subtilis subsp.
           subtilis str. SMY]
 gi|418033289|ref|ZP_12671766.1| hypothetical protein BSSC8_27100 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|452914213|ref|ZP_21962840.1| fibronectin-binding A family protein [Bacillus subtilis MB73/2]
 gi|351469437|gb|EHA29613.1| hypothetical protein BSSC8_27100 [Bacillus subtilis subsp. subtilis
           str. SC-8]
 gi|407958971|dbj|BAM52211.1| persistent RNA/DNA binding protein [Bacillus subtilis BEST7613]
 gi|407964548|dbj|BAM57787.1| persistent RNA/DNA binding protein [Bacillus subtilis BEST7003]
 gi|452116633|gb|EME07028.1| fibronectin-binding A family protein [Bacillus subtilis MB73/2]
          Length = 570

 Score = 47.0 bits (110), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|257066456|ref|YP_003152712.1| fibronectin-binding A domain-containing protein [Anaerococcus
           prevotii DSM 20548]
 gi|256798336|gb|ACV28991.1| Fibronectin-binding A domain protein [Anaerococcus prevotii DSM
           20548]
          Length = 582

 Score = 47.0 bits (110), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 30/132 (22%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRLHTTAYARDKKNT 78
           +L+G +   V   S    +F       V   G++ K+LL   +   R++ T    +  + 
Sbjct: 18  KLLGGKIQKVTQPSKNDIVF------NVYSMGKNYKLLLSANNNEARINITNKKYENPDV 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQGNILL 132
           P  F + LRKHI   ++ D+ Q G DR+++F      + G   +   ++  +    NI+L
Sbjct: 72  PPNFCMVLRKHINQGKIIDISQRGLDRVVIFSISSIDEMGFDTSKKLIVEIMGKYSNIIL 131

Query: 133 TDSEFTVLTLLR 144
            D  + ++  ++
Sbjct: 132 VDDNYKIIDAIK 143


>gi|312898711|ref|ZP_07758100.1| fibronectin-binding protein A [Megasphaera micronuciformis F0359]
 gi|310620142|gb|EFQ03713.1| fibronectin-binding protein A [Megasphaera micronuciformis F0359]
          Length = 574

 Score = 46.6 bits (109), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 33/156 (21%), Positives = 68/156 (43%), Gaps = 14/156 (8%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L G + + +Y L+ +   F++ N   +        +++ ++   RL  +       + P+
Sbjct: 19  LTGGQITKIYQLNGRGLYFRVFNDKSLYH------LIITLDGSPRLFLSDNQPPTPDVPT 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-LGMNAHYVILELYAQ-----GNILLTD 134
           G  + LRK+    R+  + QL  DRII      L M+   V  +++ +      N++ T+
Sbjct: 73  GLAMFLRKYYENGRIASITQLHLDRIIDVNIDVLNMSGQLVTRKMHVELMGKYSNVIFTE 132

Query: 135 SEFTVLTLLRSHRDDDKGVAIMSRHRY--PTEICRV 168
               +  L+++H+D      I  +H Y  P    R+
Sbjct: 133 DGMILEALIKTHKDKQALRTIYPKHPYEFPPNFMRM 168


>gi|384175306|ref|YP_005556691.1| fibronectin-binding protein [Bacillus subtilis subsp. subtilis str.
           RO-NN-1]
 gi|349594530|gb|AEP90717.1| fibronectin-binding protein [Bacillus subtilis subsp. subtilis str.
           RO-NN-1]
          Length = 570

 Score = 46.6 bits (109), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 18  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|326790867|ref|YP_004308688.1| fibronectin-binding A domain-containing protein [Clostridium
           lentocellum DSM 5427]
 gi|326541631|gb|ADZ83490.1| Fibronectin-binding A domain protein [Clostridium lentocellum DSM
           5427]
          Length = 586

 Score = 46.6 bits (109), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 79/170 (46%), Gaps = 22/170 (12%)

Query: 9   ADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLH 67
           A++  E+K +  LIG R   +Y +  +  +F + N+      G   K+LL   S   R+H
Sbjct: 9   ANIVHELKDV--LIGGRIDKIYQIEKEDILFTIRNN------GNVYKLLLTANSNYPRVH 60

Query: 68  TTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLG-MNAHYVIL 122
            +  A++    P  F + LRKH+   RL D+ Q   +RI+ F       LG      +I+
Sbjct: 61  LSTLAKNPSQDPPMFCMLLRKHLGGGRLLDIVQPDLERIVEFHIEATNELGDKETKKLII 120

Query: 123 ELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFER 171
           E+  +  NI+LT  +  +L  ++   +D   V    R   P    RV++R
Sbjct: 121 EIMGRHSNIILTKEDHLILDSIKHISNDKSSV----REILPN---RVYQR 163


>gi|46198551|ref|YP_004218.1| fibronectin/fibrinogen-binding protein [Thermus thermophilus HB27]
 gi|46196173|gb|AAS80591.1| fibronectin/fibrinogen-binding protein [Thermus thermophilus HB27]
          Length = 516

 Score = 46.6 bits (109), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLKTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKVGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|55980577|ref|YP_143874.1| RNA-biniding protein [Thermus thermophilus HB8]
 gi|55771990|dbj|BAD70431.1| probable RNA-biniding protein [Thermus thermophilus HB8]
          Length = 516

 Score = 46.6 bits (109), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   EK +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAEKALDLIPKTEARIRELEAEKERLRTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     K    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGLKVGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKT---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|399888866|ref|ZP_10774743.1| RNA-binding protein [Clostridium arbusti SL206]
          Length = 576

 Score = 46.6 bits (109), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 98/235 (41%), Gaps = 33/235 (14%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHY- 119
           ++H T   +    TP  F + LRK++   R+ D+RQ+  DRII+F F     LG N+ Y 
Sbjct: 59  KIHITKNNKTNPLTPPMFCMVLRKYLLNGRIVDIRQVSTDRIIIFDFESVDDLGFNSIYS 118

Query: 120 VILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPT-EICRVFERTTASK-L 177
           +++E+  +          + +TL+R  RD+     IM   ++ T EI R        K +
Sbjct: 119 LVVEIMGRH---------SNITLIR-QRDN----IIMDSIKHITPEINRFRSLYPGIKYV 164

Query: 178 HAALTSSKEP-DANEPDKVNEDGNNVSNASKENLGGQKGGKS--------FDLSKN---S 225
           +   +    P D N+ D  N   +N  +  ++       G S        F LSKN    
Sbjct: 165 YPPKSERLNPFDFNKSDFTNYLTSNAIDIDEKMFSKIFTGVSKPLSKEVFFRLSKNIKMD 224

Query: 226 NKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNA 280
           N NSND           +      Y       II D   +    LS ++K+E N+
Sbjct: 225 NINSNDIYEYIANLFNDIKNYKFSYNAYSENGIIKDFSCIDLTNLSTMDKIEYNS 279


>gi|220903575|ref|YP_002478887.1| hypothetical protein Ddes_0294 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
 gi|219867874|gb|ACL48209.1| protein of unknown function DUF814 [Desulfovibrio desulfuricans
           subsp. desulfuricans str. ATCC 27774]
          Length = 577

 Score = 46.6 bits (109), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + ++ GRDA+ N + V++  +  D+++HA+    S  +I+     Q VP  TL+
Sbjct: 456 FISSDGFALLRGRDARGN-LAVRKLAAPHDIWLHAENGPGSHVIIRRAHGGQEVPARTLD 514

Query: 623 QAGCFTV 629
           +AG    
Sbjct: 515 EAGALAA 521


>gi|452974532|gb|EME74352.1| fibronectin-binding protein YloA [Bacillus sonorensis L12]
          Length = 571

 Score = 46.2 bits (108), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R+H T  A D  + P  F + LRKH+    +E + Q+G DR
Sbjct: 39  IRANGKNRKLLLSAHPSYARVHLTEEAYDNPSAPPMFCMLLRKHLEGGFVEQIEQIGLDR 98

Query: 106 IILF------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           +++F      + G  +    V+  +    NI+LTD E  V+
Sbjct: 99  VMVFHIRSRNEVGDTLIRKLVVEIMGRHSNIVLTDGEKDVI 139


>gi|443632767|ref|ZP_21116946.1| hypothetical protein BSI_20210 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
 gi|443347590|gb|ELS61648.1| hypothetical protein BSI_20210 [Bacillus subtilis subsp.
           inaquosorum KCTC 13429]
          Length = 570

 Score = 46.2 bits (108), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +  +G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KMMGGRITKVHQPYKHDVIFH------IRANGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E++ Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIENIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD    V+
Sbjct: 132 TDGAENVI 139


>gi|375362208|ref|YP_005130247.1| hypothetical protein BACAU_1518 [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
 gi|371568202|emb|CCF05052.1| hypothetical protein BACAU_1518 [Bacillus amyloliquefaciens subsp.
           plantarum CAU B946]
          Length = 568

 Score = 46.2 bits (108), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E +++  L+
Sbjct: 132 TDGEGSIIDGLK 143


>gi|451347065|ref|YP_007445696.1| hypothetical protein KSO_011620 [Bacillus amyloliquefaciens IT-45]
 gi|449850823|gb|AGF27815.1| hypothetical protein KSO_011620 [Bacillus amyloliquefaciens IT-45]
          Length = 568

 Score = 46.2 bits (108), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E +++  L+
Sbjct: 132 TDGEGSIIDGLK 143


>gi|167769343|ref|ZP_02441396.1| hypothetical protein ANACOL_00669 [Anaerotruncus colihominis DSM
           17241]
 gi|167668311|gb|EDS12441.1| fibronectin-binding protein [Anaerotruncus colihominis DSM 17241]
          Length = 590

 Score = 46.2 bits (108), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 82/192 (42%), Gaps = 35/192 (18%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNTP 79
           ++G R   ++  + +T +  +    G      + K+LL    S  R+H T  A+D   +P
Sbjct: 19  VVGGRVDKIHQPARETIVIAMRARVG------NRKLLLSASASNPRVHFTELAQDNPKSP 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHYVILELYAQ-----GNI 130
             F + +RKH+   +L D+ Q G DRI+ F F     LG     V+L L A+      NI
Sbjct: 73  PMFCMLMRKHLTGAKLVDITQAGLDRILHFHFETTNELG---DRVVLTLSAEIMGRHSNI 129

Query: 131 LLTDSEFTVLTLLRSHRDDDKGV-----AIMSRH-----------RYPTEICRVFERTTA 174
           +L   +  ++  ++   D+   V      +M  H             P+EI +    T  
Sbjct: 130 ILVGQDGRIIDAVKRVSDEMSRVRPVLPGMMYTHVPAGSRLDIYKAAPSEIVKRLHDTPE 189

Query: 175 SKLHAALTSSKE 186
             L+ AL S+ E
Sbjct: 190 QPLYKALISALE 201



 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           F+S + + ++ GR+  QN+ +  +   K D+++H      S  VI      Q VP  TL 
Sbjct: 466 FVSDDGFTILCGRNNLQNDRLTLKDSRKNDIWLHTQKIPGSHVVIVTQ--GQEVPDRTLE 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAW------WVYP 648
           QA      HS+A +S  V   +      W +P
Sbjct: 524 QAAVIAAYHSKARESGKVAVDYTQVRNVWKHP 555


>gi|2337794|emb|CAA74268.1| YloA protein [Bacillus subtilis subsp. subtilis str. 168]
          Length = 200

 Score = 46.2 bits (108), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 7/93 (7%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + ++       IF       +   G+++K+LL    S  R+H TA A +  + 
Sbjct: 20  KIMGGRITKIHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITAQAYENPSE 73

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
           P  F + LRKHI    +E + Q G DRI++F  
Sbjct: 74  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHI 106


>gi|384159452|ref|YP_005541525.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens
           TA208]
 gi|328553540|gb|AEB24032.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens
           TA208]
          Length = 568

 Score = 45.8 bits (107), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|242280078|ref|YP_002992207.1| hypothetical protein Desal_2613 [Desulfovibrio salexigens DSM 2638]
 gi|242122972|gb|ACS80668.1| protein of unknown function DUF814 [Desulfovibrio salexigens DSM
           2638]
          Length = 503

 Score = 45.8 bits (107), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 35/70 (50%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ +L+I G++++ N  I+ +  S  D + H      S  V+K   P Q VP  TL 
Sbjct: 381 FISSDGFLMIRGKNSKANHEILSKVSSVFDYWFHVQGGPGSHVVLKRDHPSQEVPEQTLR 440

Query: 623 QAGCFTVCHS 632
           +A       S
Sbjct: 441 EAAVLAALKS 450


>gi|387209294|gb|AFJ69115.1| hypothetical protein NGATSA_3044600, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 106

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 48/87 (55%)

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVAL 431
           Q  +A+E+A   +  ++  + E R+  L+    R +  A L+E + + VD  +L +R A+
Sbjct: 3   QAVRAQEEAVRSRPLRVQRENEARLKELEATEARLLDAARLVECHSDAVDKVLLVLRSAI 62

Query: 432 ANRMSWEDLARMVKEERKAGNPVAGLI 458
           A    W+ L   +++E+  GNP+A +I
Sbjct: 63  ATGADWQTLDEYIRKEQAGGNPLARMI 89


>gi|154685980|ref|YP_001421141.1| hypothetical protein RBAM_015470 [Bacillus amyloliquefaciens FZB42]
 gi|154351831|gb|ABS73910.1| YloA [Bacillus amyloliquefaciens FZB42]
          Length = 568

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|421731766|ref|ZP_16170889.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           M27]
 gi|407073979|gb|EKE46969.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           M27]
          Length = 568

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|429505115|ref|YP_007186299.1| hypothetical protein B938_08035 [Bacillus amyloliquefaciens subsp.
           plantarum AS43.3]
 gi|429486705|gb|AFZ90629.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           AS43.3]
          Length = 568

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|384164113|ref|YP_005545492.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens LL3]
 gi|384168499|ref|YP_005549877.1| uroporphyrin-III C-methyltransferase [Bacillus amyloliquefaciens
           XH7]
 gi|328911668|gb|AEB63264.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens LL3]
 gi|341827778|gb|AEK89029.1| putative uroporphyrin-III C-methyltransferase [Bacillus
           amyloliquefaciens XH7]
          Length = 571

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 21  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 74

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 75  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 134

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 135 TDGEGAIIDGLK 146


>gi|392394834|ref|YP_006431436.1| RNA-binding protein [Desulfitobacterium dehalogenans ATCC 51507]
 gi|390525912|gb|AFM01643.1| putative RNA-binding protein, snRNP like protein
           [Desulfitobacterium dehalogenans ATCC 51507]
          Length = 637

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 55/96 (57%), Gaps = 13/96 (13%)

Query: 51  GESEKVLL-LMESGVRLHTTAYARDKKNTPSG--FTLKLRKHIRTRRLEDVRQLGYDRII 107
           G+S ++LL +  +G RLH +   ++KKN PS   F + LRKHI   ++  + QLG +RI+
Sbjct: 43  GQSYRLLLNISATGARLHLSQ--KNKKNPPSPPMFCMILRKHIEGGKILALEQLGLERIV 100

Query: 108 LF------QFGLGMNAHYVILELYAQ-GNILLTDSE 136
           L       ++G  +   Y+ LE+  +  N++L D +
Sbjct: 101 LLTVQNYNEYG-DLATFYLYLEIMGKHSNLILVDPQ 135


>gi|325679051|ref|ZP_08158645.1| putative fibronectin-binding protein [Ruminococcus albus 8]
 gi|324109175|gb|EGC03397.1| putative fibronectin-binding protein [Ruminococcus albus 8]
          Length = 594

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 63/127 (49%), Gaps = 13/127 (10%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           LIG R   ++  S    +  +    G+      +K+L+   +G  RLH T    +    P
Sbjct: 18  LIGGRVDKIHQPSKGELLIAVRTFDGI------KKLLINTVAGTARLHLTTAEIENPKQP 71

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYA-QGNILLT 133
             F + +RKH+ + +L D+RQ  ++R+I+  F     LG +    V +EL   + N++LT
Sbjct: 72  PMFCMLMRKHLSSAKLVDIRQPAFERVIMLDFDASNELGDIVRLTVTVELMGRRANLMLT 131

Query: 134 DSEFTVL 140
           D++  ++
Sbjct: 132 DADGKII 138


>gi|440781920|ref|ZP_20960148.1| Fibronectin-binding protein [Clostridium pasteurianum DSM 525]
 gi|440220638|gb|ELP59845.1| Fibronectin-binding protein [Clostridium pasteurianum DSM 525]
          Length = 577

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 39/65 (60%), Gaps = 5/65 (7%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHY- 119
           ++H T  ++    TP  F + LRK++   ++ D+RQ+  DRII+F F     LG N+ Y 
Sbjct: 59  KIHITDNSKKNPLTPPMFCMVLRKYLLNSKIVDIRQIETDRIIIFDFQSVDDLGFNSIYS 118

Query: 120 VILEL 124
           +I+E+
Sbjct: 119 LIIEI 123


>gi|449094256|ref|YP_007426747.1| hypothetical protein C663_1608 [Bacillus subtilis XF-1]
 gi|449028171|gb|AGE63410.1| hypothetical protein C663_1608 [Bacillus subtilis XF-1]
          Length = 570

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +   G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KIMGGRITKVHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|52080167|ref|YP_078958.1| fibronectin binding protein [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319646053|ref|ZP_08000283.1| YloA protein [Bacillus sp. BT1B_CT2]
 gi|404489055|ref|YP_006713161.1| fibronectin-binding protein YloA [Bacillus licheniformis DSM 13 =
           ATCC 14580]
 gi|52003378|gb|AAU23320.1| putative fibronectin binding protein [Bacillus licheniformis DSM 13
           = ATCC 14580]
 gi|52348046|gb|AAU40680.1| putative fibronectin-binding protein YloA [Bacillus licheniformis
           DSM 13 = ATCC 14580]
 gi|317391803|gb|EFV72600.1| YloA protein [Bacillus sp. BT1B_CT2]
          Length = 570

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R+H T    D  +TP  F + LRKH+    ++ V Q+G DR
Sbjct: 39  IRANGKNRKLLLSAHPSYARVHLTNETYDNPSTPPMFCMLLRKHLEGGFIDQVEQIGMDR 98

Query: 106 IILF------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           +++F      + G  +    ++  +    NI+LTD E  V+
Sbjct: 99  MMVFHIRSRNEIGDTLTRKLMVEIMGRHSNIVLTDGEKDVI 139


>gi|452855511|ref|YP_007497194.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens subsp. plantarum UCMB5036]
 gi|452079771|emb|CCP21528.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens subsp. plantarum UCMB5036]
          Length = 571

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 21  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 74

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 75  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 134

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 135 TDGEGAIIDGLK 146


>gi|423682109|ref|ZP_17656948.1| fibronectin binding protein [Bacillus licheniformis WX-02]
 gi|383438883|gb|EID46658.1| fibronectin binding protein [Bacillus licheniformis WX-02]
          Length = 570

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R+H T    D  +TP  F + LRKH+    ++ V Q+G DR
Sbjct: 39  IRANGKNRKLLLSAHPSYARVHLTNETYDNPSTPPMFCMLLRKHLEGGFIDQVEQIGMDR 98

Query: 106 IILF------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           +++F      + G  +    ++  +    NI+LTD E  V+
Sbjct: 99  MMVFHIRSRNEIGDTLTRKLMVEIMGRHSNIVLTDGEKDVI 139


>gi|160933821|ref|ZP_02081209.1| hypothetical protein CLOLEP_02682 [Clostridium leptum DSM 753]
 gi|156867698|gb|EDO61070.1| fibronectin-binding protein [Clostridium leptum DSM 753]
          Length = 585

 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 43/185 (23%), Positives = 88/185 (47%), Gaps = 25/185 (13%)

Query: 475 NLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQE---------------- 518
           +L+   DE + +   +V++D AL+A  NA+++Y+  +K ++ Q+                
Sbjct: 363 DLENFYDENRLM---RVKLDPALNATQNAQKYYKEYRKAKTAQQVLGEQIAQAEQELLYV 419

Query: 519 -KTITAHSKAFKAAE-KKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRD 576
                  S+A   +E  + R ++ +E  +  +   RK         F+SSE + ++ GR+
Sbjct: 420 DSVFDCLSRAQSESELNEIRQELREEGYLKAVRDKRKPPAPLAPLEFVSSEGFRILVGRN 479

Query: 577 AQQNEMIVKRYMSKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAW 635
            +QN+ +  +  +  D+++H  ++ G+ + ++   R  QP    TL +A      HS+A 
Sbjct: 480 NRQNDKLTLKQANNNDIWLHTKNIPGSHTIIVTGGR--QP-GDATLKEAAMLAAYHSRAK 536

Query: 636 DSKMV 640
           DS  V
Sbjct: 537 DSSQV 541



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 58/132 (43%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNT 78
           R +G R   +Y  + +  +F L          E+ K+LL   +   R+H T YA +    
Sbjct: 18  RALGARVDKIYQPNKEELVFLLRTRQ------EAFKLLLSARANSPRIHFTQYAPENPKV 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF------GLGMNAHYVILELYAQGNILL 132
           P    + LRK +   +L +VRQ G +R++   F      G  +    VI  +    NI+L
Sbjct: 72  PPMLCMLLRKRLSGAKLVEVRQPGLERLLYLDFDAANELGDKVRLSLVIEIMGKYSNIIL 131

Query: 133 TDSEFTVLTLLR 144
            D +  ++  L+
Sbjct: 132 VDGQGKIVDALK 143


>gi|317132057|ref|YP_004091371.1| fibronectin-binding A domain-containing protein [Ethanoligenens
           harbinense YUAN-3]
 gi|315470036|gb|ADU26640.1| Fibronectin-binding A domain protein [Ethanoligenens harbinense
           YUAN-3]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 43/182 (23%), Positives = 82/182 (45%), Gaps = 23/182 (12%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQE---KTITAHSKAFKAAEK---------- 533
           PVE + +D+ L+   NA+++Y+   K  + +    + I A  +  +  E           
Sbjct: 371 PVE-IALDVRLTPAQNAQKYYKEYHKAAAAERFLTEQIAAGEEELRYLETVLDEIARAGG 429

Query: 534 KTRLQILQEKTVANISHMRKVHWFEKFN-----WFISSENYLVISGRDAQQNEMIVKRYM 588
           ++ L  ++++ V +    R+    EK        F+S + + ++ GR+ +QN+ +  +  
Sbjct: 430 ESELAEIRDELVGSGYLRRRGQKREKLRENAPRRFVSDDGFEILVGRNNKQNDRLTLKTA 489

Query: 589 SKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAWWVY 647
           +K D++ H  ++ GA   V+   R    VP  TL QA      HS+A DS  V   +   
Sbjct: 490 AKTDMWFHTKNIPGAHVIVLAGGR---EVPERTLTQAAVLAATHSKAKDSAQVPVDYAPV 546

Query: 648 PH 649
            H
Sbjct: 547 RH 548


>gi|419841188|ref|ZP_14364565.1| fibronectin-binding protein A [Fusobacterium necrophorum subsp.
           funduliforme ATCC 51357]
 gi|386905940|gb|EIJ70691.1| fibronectin-binding protein A [Fusobacterium necrophorum subsp.
           funduliforme ATCC 51357]
          Length = 533

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 67/136 (49%), Gaps = 16/136 (11%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP-----SGFTLKLRKHIRT 92
           I ++  ++  + S +  K LL++    +L    Y  ++K T      S F   LRKH+  
Sbjct: 25  IHRIFQNTDTSLSLQFGKQLLVLSCNPQL-PICYVTEEKETVLEESVSSFLNSLRKHLMN 83

Query: 93  RRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSH 146
             L  V Q+ +DR ++F+F     LG    +++I EL  +  N+ L D ++ +L LL+  
Sbjct: 84  SLLYQVEQVAWDRTLIFRFSKLTELGEYKQYFLIFELMGRNSNLFLCDRDYKILDLLKRF 143

Query: 147 RDDDKGVAIMSRHRYP 162
             D+    + +R+ +P
Sbjct: 144 SLDE----LPTRNLFP 155


>gi|340756150|ref|ZP_08692781.1| fibronectin-binding protein [Fusobacterium sp. D12]
 gi|421500707|ref|ZP_15947699.1| fibronectin-binding protein A, N-terminal domain protein
           [Fusobacterium necrophorum subsp. funduliforme Fnf 1007]
 gi|313686904|gb|EFS23739.1| fibronectin-binding protein [Fusobacterium sp. D12]
 gi|402267261|gb|EJU16657.1| fibronectin-binding protein A, N-terminal domain protein
           [Fusobacterium necrophorum subsp. funduliforme Fnf 1007]
          Length = 533

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 67/136 (49%), Gaps = 16/136 (11%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP-----SGFTLKLRKHIRT 92
           I ++  ++  + S +  K LL++    +L    Y  ++K T      S F   LRKH+  
Sbjct: 25  IHRIFQNTDTSLSLQFGKQLLVLSCNPQL-PICYVTEEKETVLEESVSSFLNSLRKHLMN 83

Query: 93  RRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSH 146
             L  V Q+ +DR ++F+F     LG    +++I EL  +  N+ L D ++ +L LL+  
Sbjct: 84  SLLYQVEQVAWDRTLIFRFSKLTELGEYKQYFLIFELMGRNSNLFLCDRDYKILDLLKHF 143

Query: 147 RDDDKGVAIMSRHRYP 162
             D+    + +R+ +P
Sbjct: 144 SLDE----LPTRNLFP 155


>gi|373114330|ref|ZP_09528543.1| hypothetical protein HMPREF9466_02576 [Fusobacterium necrophorum
           subsp. funduliforme 1_1_36S]
 gi|371652324|gb|EHO17740.1| hypothetical protein HMPREF9466_02576 [Fusobacterium necrophorum
           subsp. funduliforme 1_1_36S]
          Length = 533

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 67/136 (49%), Gaps = 16/136 (11%)

Query: 38  IFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTP-----SGFTLKLRKHIRT 92
           I ++  ++  + S +  K LL++    +L    Y  ++K T      S F   LRKH+  
Sbjct: 25  IHRIFQNTDTSLSLQFGKQLLVLSCNPQL-PICYVTEEKETVLEESVSSFLNSLRKHLMN 83

Query: 93  RRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSH 146
             L  V Q+ +DR ++F+F     LG    +++I EL  +  N+ L D ++ +L LL+  
Sbjct: 84  SLLYQVEQVAWDRTLIFRFSKLTELGEYKQYFLIFELMGRNSNLFLCDRDYKILDLLKRF 143

Query: 147 RDDDKGVAIMSRHRYP 162
             D+    + +R+ +P
Sbjct: 144 SLDE----LPTRNLFP 155


>gi|308173527|ref|YP_003920232.1| persistent RNA/DNA binding protein [Bacillus amyloliquefaciens DSM
           7]
 gi|307606391|emb|CBI42762.1| putative persistent RNA/DNA binding protein [Bacillus
           amyloliquefaciens DSM 7]
          Length = 568

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + ++       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRIHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|317496576|ref|ZP_07954925.1| fibronectin-binding protein A [Gemella morbillorum M424]
 gi|316913379|gb|EFV34876.1| fibronectin-binding protein A [Gemella morbillorum M424]
          Length = 556

 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 9/109 (8%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGM-NAHY 119
           R   T    +  NTPS F   LRK++    ++++ Q+  DRII+F+      LG    +Y
Sbjct: 57  RFQLTKNTYENPNTPSNFCTVLRKYLIGGIIQNIEQINNDRIIVFKIKNFDELGYEKYYY 116

Query: 120 VILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY---PTE 164
           +I EL  +  NI+LTD    ++  L++    D   + ++   Y   PTE
Sbjct: 117 LIAELMGKHSNIILTDDNKVIIESLKNSYSIDYKRSTIANMNYILPPTE 165


>gi|328957541|ref|YP_004374927.1| putative persistent RNA/DNA binding protein [Carnobacterium sp.
           17-4]
 gi|328673865|gb|AEB29911.1| putative persistent RNA/DNA binding protein [Carnobacterium sp.
           17-4]
          Length = 575

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 74/152 (48%), Gaps = 14/152 (9%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ K+LL    S  R+  T    +  ++P  F + +RKH+    LED++Q+G DR+I 
Sbjct: 48  NGKNHKLLLSAHPSYARIQLTEIPYENPSSPPNFCMIMRKHLEGAILEDIQQVGNDRVIH 107

Query: 109 FQF------GLGMNAHYVILELYAQGNILLT--DSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           F+F      G   N   ++  +    NILL   D++  + T+       +    IM    
Sbjct: 108 FRFKSRDEIGDVQNVILIVELMGRHSNILLIEQDTQRILDTIKHVPTSQNSFRFIMPGAT 167

Query: 161 YPT----EICRVFERTTASKLHAALTSSKEPD 188
           Y +    +    FE T++S+L   +T+ ++PD
Sbjct: 168 YQSPPHQDKLNPFE-TSSSELAELITAFEDPD 198


>gi|381190336|ref|ZP_09897859.1| fibronectin/fibrinogen-binding protein [Thermus sp. RL]
 gi|380451929|gb|EIA39530.1| fibronectin/fibrinogen-binding protein [Thermus sp. RL]
          Length = 516

 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTI----TAHSKAFKAAEKKTRLQILQE 542
           PVE + +D ALS   NAR+ Y+  ++ E   E+ +       ++  +   +K RL+ L  
Sbjct: 320 PVE-IPLDPALSPQENARKLYDRARRLEELAERALDLIPKTEARIRELEAEKERLRTLDL 378

Query: 543 KTVANISHMRKVHWFEKFNW-FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHG 601
           + +  ++   K     +    + S   +LV+ GR+A++N+++ +   S+ D++ HA    
Sbjct: 379 EGLLALAQRPKGEKGPRIGLRYTSPSGFLVLVGRNAKENDLLTRAAHSE-DLWFHAQGVP 437

Query: 602 ASSTVIKNHRPEQPVPPLT-LNQAGCFTVCHSQAWDSKMV 640
            S  ++K    E   PPL  L  A      HS+A   + V
Sbjct: 438 GSHVILKA---EGKNPPLEDLLFAARLAAYHSKARGERQV 474


>gi|392531657|ref|ZP_10278794.1| putative persistent RNA/DNA binding protein [Carnobacterium
           maltaromaticum ATCC 35586]
          Length = 569

 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ KVLL    S  R+  T    +  NTP  F + +RK +    LE++ Q+G DR+I 
Sbjct: 42  NGKNHKVLLSAHPSYARIQITEIPYENPNTPPNFCMMMRKQLEGAILENIEQIGNDRVIH 101

Query: 109 FQF 111
           F F
Sbjct: 102 FTF 104


>gi|317121734|ref|YP_004101737.1| fibronectin-binding A domain-containing protein [Thermaerobacter
           marianensis DSM 12885]
 gi|315591714|gb|ADU51010.1| Fibronectin-binding A domain protein [Thermaerobacter marianensis
           DSM 12885]
          Length = 681

 Score = 44.7 bits (104), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 82/212 (38%), Gaps = 39/212 (18%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV- 64
           MN   +AA V+ L  L+  R   VY   P   + +L        +G    +L+  +  + 
Sbjct: 1   MNGLLLAAVVQELGNLLPARVERVYQPDPHVLVLRLY-------AGRELNLLISADPNLP 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLG-YDRIILFQFGL------GMNA 117
           RLH TA        P  F + LRKH+ + RL   RQ   +DR +   F            
Sbjct: 54  RLHLTARPPANPPAPPAFCMLLRKHLESLRLVGARQGPEFDRWLWLDFAAPGADEPARRL 113

Query: 118 HYVILELYAQGNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY-------------PTE 164
           H  +  L  + N++L D +  +L  LR       G +++    Y             P  
Sbjct: 114 HLAVELLDRRANVVLLDGQGRILDALRRVPGSPGGRSLLPGIPYEPPPPPSPLPQGDPAS 173

Query: 165 I-CRVFERTTASKLHAALTSSKEPDANEPDKV 195
           + CR  E         ALT +  PDA +PD V
Sbjct: 174 LGCRWLE---------ALTGAG-PDAEDPDAV 195


>gi|414083819|ref|YP_006992527.1| fibronectin-binding A N-terminus family protein [Carnobacterium
           maltaromaticum LMA28]
 gi|412997403|emb|CCO11212.1| fibronectin-binding A N-terminus family protein [Carnobacterium
           maltaromaticum LMA28]
          Length = 440

 Score = 44.7 bits (104), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ KVLL    S  R+  T    +  NTP  F + +RK +    LE++ Q+G DR+I 
Sbjct: 42  NGKNHKVLLSAHPSYARIQITEIPYENPNTPPNFCMMMRKQLEGAILENIEQIGNDRVIH 101

Query: 109 FQF 111
           F F
Sbjct: 102 FTF 104


>gi|163790397|ref|ZP_02184828.1| fibronectin/fibrinogen-binding protein, putative [Carnobacterium
           sp. AT7]
 gi|159874301|gb|EDP68374.1| fibronectin/fibrinogen-binding protein, putative [Carnobacterium
           sp. AT7]
          Length = 569

 Score = 44.7 bits (104), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 7/94 (7%)

Query: 50  SGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIIL 108
           +G++ K+LL    S  R+  T    +  ++P  F + +RKH+    LED++Q+G DR+I 
Sbjct: 42  NGKNHKLLLSAHPSYARIQLTEIPYENPSSPPNFCMIMRKHLEGAILEDIQQVGNDRVIH 101

Query: 109 FQF------GLGMNAHYVILELYAQGNILLTDSE 136
           F+F      G   N   ++  +    NILL + +
Sbjct: 102 FRFKSRDEIGDVQNVILIVELMGRHSNILLIEQD 135


>gi|315917485|ref|ZP_07913725.1| fibronectin-binding protein [Fusobacterium gonidiaformans ATCC
           25563]
 gi|313691360|gb|EFS28195.1| fibronectin-binding protein [Fusobacterium gonidiaformans ATCC
           25563]
          Length = 541

 Score = 44.7 bits (104), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 58/118 (49%), Gaps = 14/118 (11%)

Query: 55  KVLLLMESGVRLHTTAYARDKKN----TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           K +L++    +L       DK+     + S F   LRKH+    L  V Q+G+DR ++F 
Sbjct: 50  KQVLVLSCNPQLPICYVTEDKETVLEESVSSFLNTLRKHLMNSFLYQVEQVGWDRTLIFC 109

Query: 111 FG----LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYP 162
           F     LG    +++I EL  +  N+ L + ++ +L LL+    D+    + +R+ +P
Sbjct: 110 FSKLTELGDYKQYFLIFELMGRNSNLFLCNQDYKILDLLKRFSLDE----VQTRNLFP 163


>gi|212639624|ref|YP_002316144.1| Fibronectin/fibrinogen-binding protein [Anoxybacillus flavithermus
           WK1]
 gi|212561104|gb|ACJ34159.1| Fibronectin/fibrinogen-binding protein [Anoxybacillus flavithermus
           WK1]
          Length = 653

 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 54/125 (43%), Gaps = 13/125 (10%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKN 77
           R L+G R S +Y   P +Y         V   G + K+LL    +  R+H T    D   
Sbjct: 100 RTLVGGRISKIYQ--PSSYEL----VCHVRSHGRNYKLLLCAHPTYARIHLTNETYDNPP 153

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQGNIL 131
            P  F + LRKH+    +E + Q+ +DRII+       + G       +I  +    NI+
Sbjct: 154 EPPMFCMLLRKHMEGGIIEAITQVDFDRIIIIHVKARNELGDVCTKQLIIEMMGRHSNII 213

Query: 132 LTDSE 136
           L D +
Sbjct: 214 LVDEQ 218


>gi|319649630|ref|ZP_08003786.1| fibronectin/fibrinogen-binding protein [Bacillus sp. 2_A_57_CT2]
 gi|317398792|gb|EFV79474.1| fibronectin/fibrinogen-binding protein [Bacillus sp. 2_A_57_CT2]
          Length = 566

 Score = 44.3 bits (103), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 1/65 (1%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           V  +G + ++LL    S  R+  T  A +  + P  F + LRKH+    LEDV Q+G DR
Sbjct: 39  VRANGRNHRLLLSAHPSYARVQLTNEAHENPSEPPMFCMLLRKHLEGYILEDVHQIGLDR 98

Query: 106 IILFQ 110
           II+F+
Sbjct: 99  IIVFE 103


>gi|387898143|ref|YP_006328439.1| hypothetical protein MUS_1715 [Bacillus amyloliquefaciens Y2]
 gi|387172253|gb|AFJ61714.1| conserved hypothetical protein YloA [Bacillus amyloliquefaciens Y2]
          Length = 563

 Score = 44.3 bits (103), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 13  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 66

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F   LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 67  PPMFCTLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 126

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 127 TDGEGAIIDGLK 138


>gi|350265877|ref|YP_004877184.1| fibronectin-binding protein [Bacillus subtilis subsp. spizizenii
           TU-B-10]
 gi|349598764|gb|AEP86552.1| fibronectin-binding protein [Bacillus subtilis subsp. spizizenii
           TU-B-10]
          Length = 570

 Score = 44.3 bits (103), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           ++ G R + ++       IF       +  +G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KMTGGRITKIHQPYKHDVIFH------IRANGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD    V+
Sbjct: 132 TDGAENVI 139


>gi|384265146|ref|YP_005420853.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           YAU B9601-Y2]
 gi|380498499|emb|CCG49537.1| putative proteinYloA [Bacillus amyloliquefaciens subsp. plantarum
           YAU B9601-Y2]
          Length = 568

 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + V+       IF       +  +G++ K+LL    S  R+H T  A +  + 
Sbjct: 18  RIAGGRITRVHQPFKHDVIFH------IRANGKNHKLLLSAHPSYSRVHMTNQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F   LRKHI    +E + Q G DRI++F+           +  LY +      NI+L
Sbjct: 72  PPMFCTLLRKHIEGGFIEKIEQAGLDRIMIFRIKSRNEIGDETVRTLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVLTLLR 144
           TD E  ++  L+
Sbjct: 132 TDGEGAIIDGLK 143


>gi|365128101|ref|ZP_09340417.1| hypothetical protein HMPREF1032_02181 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363623448|gb|EHL74567.1| hypothetical protein HMPREF1032_02181 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 587

 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 86/189 (45%), Gaps = 30/189 (15%)

Query: 463 LERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY-ELKKKQESKQEKT- 520
           ++R   ++ L+N     D +E T+P+     D+ LS  ANA++++ E KKKQ + +  T 
Sbjct: 358 IQRGAKNVTLTNY---YDGKEVTIPL-----DVRLSPSANAQKYFKEYKKKQTAARMLTE 409

Query: 521 ITAHSKAFKAAEKKTRLQILQEKTVANISHMR---KVHWFEK-------------FNWFI 564
           + A S A        + ++   +  A ++ +R   K   + K             F  ++
Sbjct: 410 LIAESDAEAEYLATVQYEVETAEGEAALAEIRAELKSQGYLKYYKAKDKKQKPADFLRYV 469

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA-DLHGASSTVIKNHRPEQPVPPLTLNQ 623
           SS+ + ++ GR+  QN+ +  +     DV+ H  +  G+ + V+      QPVP  T  +
Sbjct: 470 SSDGFPILVGRNNAQNDRLTLKTARGRDVWFHVKNAPGSHAVVLSGG---QPVPDTTKTE 526

Query: 624 AGCFTVCHS 632
           A      HS
Sbjct: 527 AAVLAAVHS 535


>gi|329767576|ref|ZP_08259097.1| hypothetical protein HMPREF0428_00794 [Gemella haemolysans M341]
 gi|328839203|gb|EGF88787.1| hypothetical protein HMPREF0428_00794 [Gemella haemolysans M341]
          Length = 555

 Score = 44.3 bits (103), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 43/169 (25%), Positives = 80/169 (47%), Gaps = 32/169 (18%)

Query: 25  RCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFT 83
           R + V +LS   ++F +         G++ K+ L    S  R+  T  + +  +TPS F 
Sbjct: 23  RINKVNNLSTDEFVFSI-------RKGKNLKLFLSANPSASRIQLTNNSYENPSTPSNFC 75

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGM-NAHYVILELYAQ-GNILLTDSEF 137
             LRK++    +++++Q+  DR+++F+      LG    +Y+I EL  +  NI+LT+ + 
Sbjct: 76  SVLRKYLTGGIIQEIKQVNNDRVLVFKIKNFDDLGYEKYYYLITELMGKHSNIILTNEDN 135

Query: 138 TVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKE 186
            +L  L              ++ Y  E    F+R+T S +   L  +KE
Sbjct: 136 IILESL--------------KNSYSLE----FKRSTISNMAYTLPPTKE 166


>gi|260890517|ref|ZP_05901780.1| hypothetical protein GCWU000323_01695 [Leptotrichia hofstadii
           F0254]
 gi|260859759|gb|EEX74259.1| hypothetical protein GCWU000323_01695 [Leptotrichia hofstadii
           F0254]
          Length = 322

 Score = 44.3 bits (103), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 55/105 (52%), Gaps = 13/105 (12%)

Query: 68  TTAYARDKK--NT--PSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNA 117
           T  Y +D+K  NT   S F L L+KH++   L ++RQ G+DRI+ F      QFG  +  
Sbjct: 54  TIFYLKDEKDPNTDFQSKFLLSLKKHLQNSILINIRQEGFDRIVYFDFEKLNQFG-DVEK 112

Query: 118 HYVILELYAQG-NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
           + +I+E+  +  NI LT  +  +L+ L     D     IM+  RY
Sbjct: 113 YTLIIEIMGKASNIFLTSKD-KILSALYFTSIDVGNRVIMTGARY 156


>gi|452992516|emb|CCQ96047.1| Fibronectin-binding protein A [Clostridium ultunense Esp]
          Length = 590

 Score = 44.3 bits (103), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 121/615 (19%), Positives = 236/615 (38%), Gaps = 121/615 (19%)

Query: 46  GVTESGESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
            +   G++ K+L+   S   R+H T   +   ++P  F + LRKH+    + ++ Q   D
Sbjct: 38  NIYNRGKNRKLLISASSNNPRIHLTNCGKSNPSSPPMFCMLLRKHLTGGIILNIEQFHMD 97

Query: 105 RIILF------QFGLGMNAHYVILELYAQGNILLTDS-EFTVLTLLRSHRDDDKGVAIMS 157
           RII        + G  +    ++  +    NI+L D   F V+  ++    D      MS
Sbjct: 98  RIIFIDISSLDELGQPIEKRLIVEIMGKYSNIILIDKISFRVIDSIKRVTPD------MS 151

Query: 158 RHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGK 217
           R R                    L   +    ++ +K+N    +++      L GQ  G 
Sbjct: 152 RIR------------------QVLPGVEYKYPHQNNKINPL--DLAEDQFFQLIGQDNGN 191

Query: 218 SFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLE 277
                              +P  +      +G GP +S+ I   + +  +  L+ +   E
Sbjct: 192 -------------------RPIYRFFYTNYIGLGPLISKEICFQSNIDMDRPLASITFEE 232

Query: 278 DNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLGKDHPPTESGSSTQIYDEFCP 337
              I  + +A+ K       +   +  P   IL++N H G++            Y  F  
Sbjct: 233 KKKIFSIFMAIVK------RIRDNNFKP---ILIKNNH-GRN------------YKAFYA 270

Query: 338 LLLNQFRSREFVKFETFDAALDEFYSKIES-----QRAEQQHKAKEDAAFHKLNKIHMDQ 392
           L + QF + + +   +    LDE+Y K ++     Q+A+   K+ +      LNK+   +
Sbjct: 271 LDIEQFGNNKIL-LASISQVLDEYYIKNDTLDRVNQKAQSLRKSVQTKLERSLNKLAKQK 329

Query: 393 ENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVK--EERKA 450
           +  + +  +E  +    A+LI  NL  +D  +   +V L N  S E++ +++   +ER +
Sbjct: 330 QELLDSKNRE--KFKIYADLISANLYRIDKGL--SQVELENFYS-ENMEKIIVPLDERYS 384

Query: 451 GNPVAGLIDKLYLE-RNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYEL 509
               A    K Y + +N   LLL           + +P  + E+D   +   +     E+
Sbjct: 385 PAENAQKYYKRYSKLKNANQLLL-----------EQIPETEEEIDYLENVLNSIDHCTEV 433

Query: 510 KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENY 569
            +  E K+E     + K                    +I   +K     K   +ISS+ +
Sbjct: 434 LELDEIKEELIKEGYLKG-------------------SIKKKQKKDMVSKPYQYISSDGF 474

Query: 570 LVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTV 629
            +  G++ +QN+ +  +   K D+++H      S  ++K     + V   TL +A     
Sbjct: 475 HIFVGKNNRQNDFLTLKTAHKEDLWLHVQKMPGSHVIVKTE--NRRVSEKTLEEAAILAA 532

Query: 630 CHSQAWDSKMVTSAW 644
            +S+A +S  V   +
Sbjct: 533 YYSKAKNSTNVAVDY 547


>gi|325846551|ref|ZP_08169466.1| putative fibronectin-binding protein [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
 gi|325481309|gb|EGC84350.1| putative fibronectin-binding protein [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
          Length = 582

 Score = 43.9 bits (102), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 67/140 (47%), Gaps = 15/140 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  V  E+K L  L+G +   +   S    I        +   G++ K+LL   +   R+
Sbjct: 8   TRAVTFEIKKL--LLGAKIQKISQPSKNDIIL------NIYSFGKTYKLLLSANNNEARV 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMN-AHYVI 121
           H T    +    P  F + LRKH+   ++  + Q   DR+I+F+      +G + ++ +I
Sbjct: 60  HITEKKYENPEVPPNFCMVLRKHLSQSKIIGIDQYKLDRVIVFKISSVDEMGFDVSNKLI 119

Query: 122 LELYAQ-GNILLTDSEFTVL 140
           +E+  +  NI+LTD ++ ++
Sbjct: 120 VEIMGKYSNIILTDDKYKII 139


>gi|374849978|dbj|BAL52979.1| fibronectin-binding A domain protein [uncultured candidate division
           OP1 bacterium]
 gi|374856393|dbj|BAL59247.1| fibronectin-binding A domain protein [uncultured candidate division
           OP1 bacterium]
          Length = 576

 Score = 43.9 bits (102), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 45/98 (45%), Gaps = 8/98 (8%)

Query: 11  VAAEVKCLR-RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTT 69
           V+A V  LR RL G R   +Y   P T   +L        +GE + +L+      R+H T
Sbjct: 8   VSALVAELRERLCGSRVQQIYHPRPSTITLELW-------AGEEQSLLIETAEQPRVHLT 60

Query: 70  AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
                   TPS F + LRK++R   +  V Q   +RII
Sbjct: 61  QQRFPHPKTPSAFCMLLRKYLRNGIIVGVSQPALERII 98


>gi|256545176|ref|ZP_05472542.1| fibronectin-binding protein [Anaerococcus vaginalis ATCC 51170]
 gi|256399217|gb|EEU12828.1| fibronectin-binding protein [Anaerococcus vaginalis ATCC 51170]
          Length = 582

 Score = 43.9 bits (102), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 26/102 (25%), Positives = 54/102 (52%), Gaps = 7/102 (6%)

Query: 46  GVTESGESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYD 104
            +   G+S K+LL   +   R+H T    +   +P  F + LRK++   ++ ++ Q   D
Sbjct: 38  NIYSVGKSYKLLLSANNNEARVHITEKKYENPISPPNFCMVLRKYLNQSKIVEIEQYKMD 97

Query: 105 RIILFQFG----LGMN-AHYVILELYAQ-GNILLTDSEFTVL 140
           R+I+F       +G + ++ +I+E+  +  NI+LTD  + ++
Sbjct: 98  RVIIFHISSVDEMGFDISNKLIVEIMGKYSNIILTDENYKII 139


>gi|253578854|ref|ZP_04856125.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849797|gb|EES77756.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
          Length = 581

 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 9/103 (8%)

Query: 47  VTESGESEKVLLLMESGVRLHTTAYARDKKNTP---SGFTLKLRKHIRTRRLEDVRQLGY 103
           +T  G + +  LL+ +   L    +    K +P     F + LRKHI + R+ D+RQ G 
Sbjct: 37  ITGKGANGQCRLLLSASASLPLIYFTSKNKPSPMTAPNFCMLLRKHIGSARVSDIRQPGM 96

Query: 104 DRIILFQF----GLGMNAHYV-ILELYAQ-GNILLTDSEFTVL 140
           +R+++F+      LG     V I+EL  +  NI+  D +  +L
Sbjct: 97  ERVVMFELEHLNELGDPCKKVLIMELMGKHSNIIFCDDKGMIL 139


>gi|386758287|ref|YP_006231503.1| hypothetical protein MY9_1710 [Bacillus sp. JS]
 gi|384931569|gb|AFI28247.1| hypothetical protein MY9_1710 [Bacillus sp. JS]
          Length = 570

 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           +++G R + V+       IF       +   G+++K+LL    S  R+H T    +  + 
Sbjct: 18  KIMGGRITKVHQPYKHDVIFH------IRAKGKNQKLLLSAHPSYSRVHITTQTYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD+   V+
Sbjct: 132 TDAAENVI 139


>gi|20807959|ref|NP_623130.1| RNA-binding protein snRNP [Thermoanaerobacter tengcongensis MB4]
 gi|20516530|gb|AAM24734.1| predicted RNA-binding protein homologous to eukaryotic snRNP
           [Thermoanaerobacter tengcongensis MB4]
          Length = 570

 Score = 43.9 bits (102), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 8/97 (8%)

Query: 13  AEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTA 70
           A VK L++ I G R   +Y    +  IF       +   G++ K+LL   +   R+H T 
Sbjct: 10  AIVKELKKEIEGGRIEKIYQPEKEDLIF------TIRSKGKNYKLLLSANANYPRIHLTK 63

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
             R+    P  F + LRKH++  R+ ++RQ+ +DRI+
Sbjct: 64  EDRENPLEPPMFCMLLRKHLQNGRIAEIRQVEFDRIV 100


>gi|291544501|emb|CBL17610.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Ruminococcus champanellensis 18P13]
          Length = 591

 Score = 43.9 bits (102), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 8/101 (7%)

Query: 11  VAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTA 70
           +  E+ CL   +  R   VY  S ++ I         T+ G  + ++    S  R+H T 
Sbjct: 11  IQGELDCL---LEGRIDKVYQPSRESVILGFR-----TKQGARKLLISAAPSSARVHMTQ 62

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
            A D    P  F + LRKH+   RL  +RQ G +RI+   F
Sbjct: 63  VAVDNPAKPPMFCMLLRKHLTGGRLIAIRQDGLERILFLDF 103



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 39/175 (22%), Positives = 78/175 (44%), Gaps = 26/175 (14%)

Query: 487 PVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSK--------------AFKAAE 532
           P  ++ +D+ L+   NA+R+Y  K ++ S  EK +    +              A     
Sbjct: 372 PTVEIPLDVRLTPSQNAQRYYA-KYRKASTAEKVLVEQIRNGEEELRYIDSVFDALTRCT 430

Query: 533 KKTRLQILQEKTVANISHMRKVHWFEKFN------WFISSENYLVISGRDAQQNEMIVKR 586
            +T + +L+E+ +A   ++R      K         F SS+ + ++ GR+ +QN+ +  +
Sbjct: 431 SETDIAVLREE-LAGEGYLRAARRGTKPARSQPPLVFRSSDGFQILVGRNNRQNDQLTLK 489

Query: 587 YMSKGDVYVHAD-LHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
             +K D+++H   + G+   V+   R    +P  T+ +A      HS+  DS  V
Sbjct: 490 QAAKQDLWLHTQGIPGSHVIVVSQGR---EIPESTIYEAALLAAHHSKGRDSAQV 541


>gi|295706340|ref|YP_003599415.1| fibronectin-binding protein [Bacillus megaterium DSM 319]
 gi|294803999|gb|ADF41065.1| fibronectin-binding protein [Bacillus megaterium DSM 319]
          Length = 573

 Score = 43.5 bits (101), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           L+  R S +Y   P   I +      V   GE+ K+L+       R+H T    +  + P
Sbjct: 22  LVSGRISKIYQPFPNELILQ------VRAKGENRKLLISAHPNYSRVHFTNEPYENPSEP 75

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
             F + LRKH+    +E V QLG DRI++ +
Sbjct: 76  PMFCMLLRKHLEGSIIEQVYQLGLDRILVME 106


>gi|227499520|ref|ZP_03929627.1| fibrinogen-binding protein [Anaerococcus tetradius ATCC 35098]
 gi|227218399|gb|EEI83650.1| fibrinogen-binding protein [Anaerococcus tetradius ATCC 35098]
          Length = 582

 Score = 43.5 bits (101), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 32/144 (22%), Positives = 66/144 (45%), Gaps = 15/144 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  +  E+K   +L+G +   +   S    +F L +       G+S K+LL   +   R+
Sbjct: 8   TRKIVNELK--EKLLGAKIQKISQPSKNDIVFNLYSM------GKSYKLLLSANNNEARI 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYV 120
           + T    +  +    F + LRKHI   ++ +++Q G DR+++F      + G   +   +
Sbjct: 60  NITKRKFENPDIAPNFCMVLRKHINQGKIIEIKQKGLDRVVIFSIASIDEMGFDTSKKLI 119

Query: 121 ILELYAQGNILLTDSEFTVLTLLR 144
           I  +    NI+L D  + ++  ++
Sbjct: 120 IEIMGKYSNIVLVDDNYKIIDAIK 143


>gi|138894684|ref|YP_001125137.1| fibronectin-binding protein [Geobacillus thermodenitrificans
           NG80-2]
 gi|196247697|ref|ZP_03146399.1| Fibronectin-binding A domain protein [Geobacillus sp. G11MC16]
 gi|134266197|gb|ABO66392.1| Fibronectin-binding protein / Fibrinogen-bindingprotein
           [Geobacillus thermodenitrificans NG80-2]
 gi|196212481|gb|EDY07238.1| Fibronectin-binding A domain protein [Geobacillus sp. G11MC16]
          Length = 579

 Score = 43.5 bits (101), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 55/122 (45%), Gaps = 13/122 (10%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNTP 79
           L G R + ++   P  Y   ++    V   G + KV+L    +  R+H T    D    P
Sbjct: 19  LAGGRITKIH--QPSAYEIVML----VRARGHNHKVMLSAHPTYARVHLTNETYDNPPEP 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQGNILLT 133
             F ++LRKH+    +E +RQ+ +DRII+       + G       +I  +    NI+L 
Sbjct: 73  PMFCMRLRKHLEGSIVEAIRQVEFDRIIVIDTKGRDELGDVQTKQLIIEVMGRHSNIILV 132

Query: 134 DS 135
           D+
Sbjct: 133 DA 134


>gi|384045157|ref|YP_005493174.1| Fibronectin-binding A-like protein [Bacillus megaterium WSH-002]
 gi|345442848|gb|AEN87865.1| Fibronectin-binding A-like protein [Bacillus megaterium WSH-002]
          Length = 570

 Score = 43.5 bits (101), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           L+  R S +Y   P   I +      V   GE+ K+L+       R+H T    +  + P
Sbjct: 19  LVSGRISKIYQPFPNELILQ------VRAKGENRKLLISAHPNYSRVHFTNEPYENPSEP 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
             F + LRKH+    +E V QLG DRI++ +
Sbjct: 73  PMFCMLLRKHLEGSIIEQVYQLGLDRILVIE 103


>gi|218290470|ref|ZP_03494590.1| Fibronectin-binding A domain protein [Alicyclobacillus
           acidocaldarius LAA1]
 gi|218239491|gb|EED06686.1| Fibronectin-binding A domain protein [Alicyclobacillus
           acidocaldarius LAA1]
          Length = 594

 Score = 43.5 bits (101), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 43/178 (24%), Positives = 80/178 (44%), Gaps = 34/178 (19%)

Query: 490 KVEVDLALSAHANARRWYELKKKQ-------ESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           ++E+D AL A ANA+R + +  K+       E+++E T+    +  +  E    LQ L +
Sbjct: 369 RIELDPALDAIANAQRLFRMAAKRKRARQWIEAERENTL----RDLRYLEDV--LQALAD 422

Query: 543 KTVANISHMRKVHWFEKF-NW-------------------FISSENYLVISGRDAQQNEM 582
            ++ N+  +R+    + F  W                   F SS+ +++  GR+  QN+ 
Sbjct: 423 TSLENLEEVRRELEAQGFLAWAARRGTGGKRRSGETEPHAFRSSDGFVIRVGRNNVQNDR 482

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           +  R   K D+++H      S  VI+  + E+ +P  T+ +A       S+  DS  V
Sbjct: 483 LTFRKADKRDLWLHVKDAPGSHVVIERGQAEE-IPERTIEEAAVLAAYFSRMRDSANV 539


>gi|294500991|ref|YP_003564691.1| fibronectin-binding protein [Bacillus megaterium QM B1551]
 gi|294350928|gb|ADE71257.1| fibronectin-binding protein [Bacillus megaterium QM B1551]
          Length = 573

 Score = 43.5 bits (101), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 7/91 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           L+  R S +Y   P   I +      V   GE+ K+L+       R+H T    +  + P
Sbjct: 22  LVSGRISKIYQPFPNELILQ------VRAKGENRKLLISAHPNYSRVHFTNEPYENPSEP 75

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
             F + LRKH+    +E V QLG DRI++ +
Sbjct: 76  PMFCMLLRKHLEGSIIEQVYQLGLDRILVIE 106


>gi|296331140|ref|ZP_06873614.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. spizizenii ATCC 6633]
 gi|305674295|ref|YP_003865967.1| persistent RNA/DNA binding protein [Bacillus subtilis subsp.
           spizizenii str. W23]
 gi|296151784|gb|EFG92659.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. spizizenii ATCC 6633]
 gi|305412539|gb|ADM37658.1| putative persistent RNA/DNA binding protein [Bacillus subtilis
           subsp. spizizenii str. W23]
          Length = 570

 Score = 43.5 bits (101), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 59/128 (46%), Gaps = 13/128 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           ++ G R + ++       IF       +  +G+++K+LL    S  R+H T  A +  + 
Sbjct: 18  KMTGGRITKIHQPYKHDVIFH------IRVNGKNQKLLLSAHPSYSRVHITTQAYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TDSEFTVL 140
           TD    V+
Sbjct: 132 TDGAENVI 139


>gi|398304107|ref|ZP_10507693.1| persistent RNA/DNA binding protein [Bacillus vallismortis DV1-F-3]
          Length = 570

 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 7/95 (7%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G+++K+LL    S  R+H T  A +  + P  F + LRKHI    +E + Q G DR
Sbjct: 39  IRANGKNQKLLLSAHPSYSRVHITTQAYENPSEPPMFCMLLRKHIEGGFIEKIEQAGLDR 98

Query: 106 IILFQF-GLGMNAHYVILELYAQ-----GNILLTD 134
           I++F            + +LY +      NI+LTD
Sbjct: 99  IMIFHIKSRNEIGDETVRKLYVEIMGRHSNIILTD 133


>gi|83589737|ref|YP_429746.1| hypothetical protein Moth_0886 [Moorella thermoacetica ATCC 39073]
 gi|83572651|gb|ABC19203.1| conserved hypothetical protein [Moorella thermoacetica ATCC 39073]
          Length = 584

 Score = 43.5 bits (101), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 79/341 (23%), Positives = 123/341 (36%), Gaps = 74/341 (21%)

Query: 9   ADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLL-MESGVRLH 67
           A ++AE   L  L G R   ++    +T I  L          ++ K+LL  +    R+H
Sbjct: 9   AAISAE---LSGLTGSRVDRIFQPEKETVILHLRKGR------DTRKLLLCSLSDQARVH 59

Query: 68  TTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYVIL- 122
            T  +     TP  F + LRKH+    L  V Q G +R++   F     LG  A  ++L 
Sbjct: 60  LTTASFTNPPTPPLFCMVLRKHLEGGILTAVEQPGLERVLKLHFNTTDELGRQAPRLLLI 119

Query: 123 ELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAAL 181
           E+  +  NI+L + E +++   R +         +SRHR   E+                
Sbjct: 120 EIMGKHSNIILLNPEGSIIDAARRY------THAVSRHR---EVL--------------- 155

Query: 182 TSSKEPDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLK 241
                P    P +   D   + + +   L                 N  D      P  +
Sbjct: 156 --PGRPYVPPPAQDKADPRKLDDEAFTRL-------------LYEGNWGD------PLER 194

Query: 242 TVLGEALGYGPALSEHIILDTGLVPNMKLSEVNKLEDNAI-QVL--VLAVAKFEDWLQDV 298
            ++    G GP  +  II   GL     L      E N + Q L  VLA      W  +V
Sbjct: 195 LLVNRLAGVGPETAREIIHRAGLPAGTTLEGCGAYEVNRLYQALGEVLAATGPAAWKPEV 254

Query: 299 ISGDIVPEG-------YILMQNKHLGKDHPPTESGSSTQIY 332
           I   + PEG       + L Q + L ++HP T   +    Y
Sbjct: 255 I---LRPEGEPLAFASFELHQYQGLPREHPATPGAACDYFY 292


>gi|410583545|ref|ZP_11320651.1| putative RNA-binding protein, snRNP like protein [Thermaerobacter
           subterraneus DSM 13965]
 gi|410506365|gb|EKP95874.1| putative RNA-binding protein, snRNP like protein [Thermaerobacter
           subterraneus DSM 13965]
          Length = 696

 Score = 43.5 bits (101), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 15/151 (9%)

Query: 6   MNTADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV- 64
           MN   +AA ++ L  L+  R   +Y   P   + +L        +G    +L+  +  + 
Sbjct: 1   MNGLVLAAVLQELSSLLPARVERIYQPEPHLLVLRLY-------AGREVHLLIGADPSLP 53

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQ-LGYDRIILFQF---GLGMNA--H 118
           RLH TA        P  F + LRKH+ + RL    Q   +DR +   F   G    A   
Sbjct: 54  RLHLTARPPANPPAPPAFCMLLRKHLESLRLVAAHQGPAFDRWVQLAFVAPGPDEPARRR 113

Query: 119 YVILELYA-QGNILLTDSEFTVLTLLRSHRD 148
           Y+I+EL   + N++LTD E  +L  LR   D
Sbjct: 114 YLIVELLERRANVVLTDGEGRILDALRRTPD 144


>gi|373497493|ref|ZP_09588017.1| hypothetical protein HMPREF0402_01890 [Fusobacterium sp. 12_1B]
 gi|371963247|gb|EHO80817.1| hypothetical protein HMPREF0402_01890 [Fusobacterium sp. 12_1B]
          Length = 541

 Score = 43.5 bits (101), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 6/70 (8%)

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYVI-LELYAQ-GNILLTD 134
           G    +RKH+    L DV+QLG+DRI+ F+F     LG   +Y I  E+  +  N + TD
Sbjct: 71  GLAANMRKHLLNAMLTDVQQLGFDRILCFKFAKINELGEVKNYSIYFEIMGKYSNFIFTD 130

Query: 135 SEFTVLTLLR 144
            +  ++ LL+
Sbjct: 131 EDDRIIDLLK 140


>gi|328948692|ref|YP_004366029.1| fibronectin-binding A domain-containing protein [Treponema
           succinifaciens DSM 2489]
 gi|328449016|gb|AEB14732.1| Fibronectin-binding A domain protein [Treponema succinifaciens DSM
           2489]
          Length = 482

 Score = 43.5 bits (101), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 3/112 (2%)

Query: 56  VLLLMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGM 115
           V+       R++ T     K   P  F   L+  ++  R+   +QLG DRI+ F      
Sbjct: 47  VICTSPQSCRINKTNSKSPKNEKPLRFNEFLKSRVQGMRINSCKQLGLDRIVKFDVSTWK 106

Query: 116 NAHYVILELYAQ-GNILLTDSEFTVLTLL--RSHRDDDKGVAIMSRHRYPTE 164
           +  ++   L++   NI++TD    +L  L  R  +D+  G   + + + PTE
Sbjct: 107 DRLFIYARLWSNAANIIVTDENGKILDCLYRRPAKDEITGGVFVPQEKIPTE 158


>gi|312111736|ref|YP_003990052.1| Fibronectin-binding A domain-containing protein [Geobacillus sp.
           Y4.1MC1]
 gi|423720651|ref|ZP_17694833.1| fibronectin-binding A domain-containing protein [Geobacillus
           thermoglucosidans TNO-09.020]
 gi|311216837|gb|ADP75441.1| Fibronectin-binding A domain protein [Geobacillus sp. Y4.1MC1]
 gi|383366004|gb|EID43295.1| fibronectin-binding A domain-containing protein [Geobacillus
           thermoglucosidans TNO-09.020]
          Length = 571

 Score = 43.5 bits (101), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 51  GESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G + K+LL       R+H T    D    P  F + LRKH+    +E +RQ+ +DRII+ 
Sbjct: 43  GRNYKLLLSAHPNYARVHLTNETYDNPAEPPMFCMLLRKHLEGSIIEAIRQVDFDRIIII 102

Query: 110 QFG----LG-MNAHYVILELYAQ-GNILLTDSE 136
           +      +G ++A  +I+E+  +  NI+L D E
Sbjct: 103 ETKGRDEIGDIHAKQLIIEIMGRHSNIILVDEE 135


>gi|154500200|ref|ZP_02038238.1| hypothetical protein BACCAP_03864 [Bacteroides capillosus ATCC
           29799]
 gi|150270932|gb|EDM98206.1| fibronectin-binding protein A domain protein [Pseudoflavonifractor
           capillosus ATCC 29799]
          Length = 588

 Score = 43.1 bits (100), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 57/125 (45%), Gaps = 12/125 (9%)

Query: 51  GESEKVLLLMESGV---RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
           G  E V LL+ +     R+  T   R+  +TP  F + LRKH+   R+ D+ Q   +R++
Sbjct: 41  GSRENVKLLLSASPNHPRVQLTRITRENPDTPPMFCMLLRKHLTGARILDITQPPVERLV 100

Query: 108 LFQF----GLGMNA-HYVILELYAQ-GNILLTDSEFTVLTLLRSHRDDDKGVAIMSRHRY 161
            F+      LG      ++LEL  +  N++L DSE  +   +R    D   +A   R   
Sbjct: 101 EFRLECLDELGDRVERRLVLELMGRSANLILLDSEGRITDCVRRVEGD---LATGKRQLL 157

Query: 162 PTEIC 166
           P   C
Sbjct: 158 PGLFC 162


>gi|336236110|ref|YP_004588726.1| fibronectin-binding A domain-containing protein [Geobacillus
           thermoglucosidasius C56-YS93]
 gi|335362965|gb|AEH48645.1| Fibronectin-binding A domain protein [Geobacillus
           thermoglucosidasius C56-YS93]
          Length = 571

 Score = 43.1 bits (100), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 51  GESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G + K+LL       R+H T    D    P  F + LRKH+    +E +RQ+ +DRII+ 
Sbjct: 43  GRNYKLLLSAHPNYARVHLTNETYDNPAEPPMFCMLLRKHLEGSIIEAIRQVDFDRIIII 102

Query: 110 QFG----LG-MNAHYVILELYAQ-GNILLTDSE 136
           +      +G ++A  +I+E+  +  NI+L D E
Sbjct: 103 ETKGRDEIGDIHAKQLIIEIMGRHSNIILVDEE 135


>gi|269864365|ref|XP_002651547.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
 gi|220064321|gb|EED42509.1| RNA-binding protein, predicted [Enterocytozoon bieneusi H348]
          Length = 322

 Score = 43.1 bits (100), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 51/224 (22%), Positives = 90/224 (40%), Gaps = 40/224 (17%)

Query: 349 VKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVK 408
           ++F +F+  +  F+      R E+  K K      K  +I   Q   ++ L+++     K
Sbjct: 129 MRFNSFNQTVFSFF------RVEKVAKTK---IISKEERIQESQRKYINELEEKTCTMEK 179

Query: 409 MAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLIDKLYLERNCM 468
            A L+E   E V   +   +     ++ W   A   K E++ GNP A  I+   L+    
Sbjct: 180 TACLLEEEREFVSQILSIFQKVYEEKLDWSGFAEFYKTEKERGNPYAVGIEGYDLKSGEA 239

Query: 469 SLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAF 528
            + L +              E +++DL  +   N    Y+ +++   K EKT        
Sbjct: 240 IIKLGD--------------ENIKLDLRKTIDRNIEDIYKTRRRMREKAEKT-------- 277

Query: 529 KAAEKKTRLQILQEKTVANISHM----RKVHWFEKFNWFISSEN 568
                K  ++ +Q K      H+    R  +WFEKF++FIS  N
Sbjct: 278 -----KIAMRDIQAKLKPRKEHIKVQDRVNYWFEKFHFFISENN 316


>gi|404366578|ref|ZP_10971960.1| hypothetical protein FUAG_01772 [Fusobacterium ulcerans ATCC 49185]
 gi|313689422|gb|EFS26257.1| hypothetical protein FUAG_01772 [Fusobacterium ulcerans ATCC 49185]
          Length = 541

 Score = 43.1 bits (100), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 6/70 (8%)

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYVI-LELYAQ-GNILLTD 134
           G    +RKH+    L DV+QLG+DRI+ F+F     LG   +Y I  E+  +  N + TD
Sbjct: 71  GLAANMRKHLLNAMLTDVQQLGFDRILCFKFAKINELGEIKNYSIYFEIMGKYSNFIFTD 130

Query: 135 SEFTVLTLLR 144
            +  ++ LL+
Sbjct: 131 EDDRIIDLLK 140


>gi|254479575|ref|ZP_05092888.1| Fibronectin-binding protein A domain protein [Carboxydibrachium
           pacificum DSM 12653]
 gi|214034487|gb|EEB75248.1| Fibronectin-binding protein A domain protein [Carboxydibrachium
           pacificum DSM 12653]
          Length = 469

 Score = 43.1 bits (100), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 8/97 (8%)

Query: 13  AEVKCLRRLI-GMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTA 70
           A VK L++ I G R   +Y    +  IF       +   G++ K+LL   +   R+H T 
Sbjct: 12  AIVKELKKEIEGGRIEKIYQPEKEDLIF------TIRSKGKNYKLLLSANANYPRIHLTK 65

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
             R+    P  F + LRKH++  R+ ++RQ+ +DRI+
Sbjct: 66  EDRENPLEPPMFCMLLRKHLQNGRIAEIRQVEFDRIV 102


>gi|302391733|ref|YP_003827553.1| fibronectin-binding A domain protein [Acetohalobium arabaticum DSM
           5501]
 gi|302203810|gb|ADL12488.1| Fibronectin-binding A domain protein [Acetohalobium arabaticum DSM
           5501]
          Length = 589

 Score = 43.1 bits (100), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 1/87 (1%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           F SS+ + +  GR+  QN+ +VK   S  D+++HA     S  +IKNH  ++ VP  T+ 
Sbjct: 469 FKSSDGFDIRVGRNNHQNDKLVKYESSDQDLWLHAKDIPGSHVIIKNHTRDE-VPQNTIE 527

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPH 649
           +A      +S+  +S  V   + +  H
Sbjct: 528 EAAHLAAYYSKGKNSSNVPVDYALAKH 554



 Score = 43.1 bits (100), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 67/156 (42%), Gaps = 19/156 (12%)

Query: 12  AAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTA 70
           A +++    LIG R   +Y   PK  +  L       + GE+ K+LL       R+H T 
Sbjct: 10  AIKIELEEELIGGRLDKIY--QPKENLLTLR----FRQPGENIKLLLSASPQNPRIHITD 63

Query: 71  YARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYV---ILELYAQ 127
              +    P  F + LRKH+   RL  + Q  ++RI+        N   +   IL +   
Sbjct: 64  SDHENPLRPPTFCMLLRKHLEHGRLRKIEQPDFERILKIYIDSKNNQGEIETKILLIEVM 123

Query: 128 G---NILLTDSEFTVLTLLRSHRDDDKGVAIMSRHR 160
           G   NI+L D++  +L  ++    D      MSRHR
Sbjct: 124 GRHSNIILIDNKNQILDSIKRVTSD------MSRHR 153


>gi|225019375|ref|ZP_03708567.1| hypothetical protein CLOSTMETH_03328 [Clostridium methylpentosum
           DSM 5476]
 gi|224948006|gb|EEG29215.1| hypothetical protein CLOSTMETH_03328 [Clostridium methylpentosum
           DSM 5476]
          Length = 582

 Score = 43.1 bits (100), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 51  GESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G S ++LL    S  R+H T++  +   TP  F + LRKH+ + +L  VRQL  DR++  
Sbjct: 42  GGSGRLLLSASASNARIHFTSFPPENPKTPPMFCMLLRKHLGSGKLIAVRQLELDRVLCL 101

Query: 110 QF 111
            F
Sbjct: 102 DF 103


>gi|427439102|ref|ZP_18923844.1| fibronectin-binding protein [Pediococcus lolii NGRI 0510Q]
 gi|425788480|dbj|GAC44632.1| fibronectin-binding protein [Pediococcus lolii NGRI 0510Q]
          Length = 571

 Score = 43.1 bits (100), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 11/103 (10%)

Query: 13  AEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG---VRLHTT 69
           A V  LR L   R + +Y    + Y  +L+    +T     + V LL+ S     R+  T
Sbjct: 10  AMVNELRSLESGRVAKIY----QPYQSELV----LTIRANRKNVPLLISSHPNYARVQVT 61

Query: 70  AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             A     TPS F + LRKH+    L+ V+QL  DRII F F 
Sbjct: 62  NQALSNPATPSNFVMSLRKHLEGAILKSVKQLDNDRIINFYFS 104


>gi|333978737|ref|YP_004516682.1| fibronectin-binding A domain-containing protein [Desulfotomaculum
           kuznetsovii DSM 6115]
 gi|333822218|gb|AEG14881.1| Fibronectin-binding A domain protein [Desulfotomaculum kuznetsovii
           DSM 6115]
          Length = 585

 Score = 43.1 bits (100), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 5/90 (5%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L+  R   +Y  SP   I  L++  G      +  +L       R+H T   R+   +P 
Sbjct: 19  LLDGRIDRIYQPSP-LEIHLLIHRPGT----RARLLLSAHPENARVHLTGRVRENPPSPP 73

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
            F + LRKH+   R+  ++Q G DR+++FQ
Sbjct: 74  VFCMVLRKHLEGGRIRGIQQRGLDRVLVFQ 103


>gi|304384925|ref|ZP_07367271.1| fibronectin-binding protein [Pediococcus acidilactici DSM 20284]
 gi|418069136|ref|ZP_12706416.1| fibronectin-binding protein [Pediococcus acidilactici MA18/5M]
 gi|304329119|gb|EFL96339.1| fibronectin-binding protein [Pediococcus acidilactici DSM 20284]
 gi|357537869|gb|EHJ21892.1| fibronectin-binding protein [Pediococcus acidilactici MA18/5M]
          Length = 571

 Score = 43.1 bits (100), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 11/103 (10%)

Query: 13  AEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG---VRLHTT 69
           A V  LR L   R + +Y    + Y  +L+    +T     + V LL+ S     R+  T
Sbjct: 10  AMVNELRSLESGRVAKIY----QPYQSELV----LTIRANRKNVPLLISSHPNYARVQVT 61

Query: 70  AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             A     TPS F + LRKH+    L+ V+QL  DRII F F 
Sbjct: 62  NQALSNPATPSNFVMSLRKHLEGAILKSVKQLDNDRIINFYFS 104


>gi|270290261|ref|ZP_06196486.1| fibronectin-binding protein [Pediococcus acidilactici 7_4]
 gi|270281042|gb|EFA26875.1| fibronectin-binding protein [Pediococcus acidilactici 7_4]
          Length = 571

 Score = 43.1 bits (100), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 11/103 (10%)

Query: 13  AEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG---VRLHTT 69
           A V  LR L   R + +Y    + Y  +L+    +T     + V LL+ S     R+  T
Sbjct: 10  AMVNELRSLESGRVAKIY----QPYQSELV----LTIRANRKNVPLLISSHPNYARVQVT 61

Query: 70  AYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG 112
             A     TPS F + LRKH+    L+ V+QL  DRII F F 
Sbjct: 62  NQALSNPATPSNFVMSLRKHLEGAILKSVKQLDNDRIINFYFS 104


>gi|126649682|ref|ZP_01721918.1| hypothetical protein BB14905_15830 [Bacillus sp. B14905]
 gi|126593401|gb|EAZ87346.1| hypothetical protein BB14905_15830 [Bacillus sp. B14905]
          Length = 591

 Score = 43.1 bits (100), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 62/126 (49%), Gaps = 13/126 (10%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRLHTTAYARDKK 76
           L++L+  R + ++  + +  I        V  +G++ K+L  + S   R+H T  + +  
Sbjct: 42  LQQLVTGRITKIHQPNAQEVILH------VRANGKNHKLLFSIHSSYARVHLTEQSIENP 95

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNI 130
             P  F + LRKH+    +  V+QLG+DRII+ +          ++ +L+A+      N+
Sbjct: 96  AEPPMFCMLLRKHLEGGFISSVKQLGFDRIIIVEIESKNEIGDPIVRQLHAEIMGRHSNL 155

Query: 131 LLTDSE 136
           LL D E
Sbjct: 156 LLIDKE 161


>gi|359417662|ref|ZP_09209759.1| RNA-binding protein, partial [Candidatus Haloredivivus sp. G17]
 gi|358031981|gb|EHK00788.1| RNA-binding protein [Candidatus Haloredivivus sp. G17]
          Length = 101

 Score = 42.7 bits (99), Expect = 0.88,   Method: Composition-based stats.
 Identities = 22/64 (34%), Positives = 38/64 (59%), Gaps = 6/64 (9%)

Query: 69  TAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFGLGMNAHYVILELYAQG 128
           + Y RD    P GF ++LRKH+   +++ + Q G+DRI++ + G       +I EL+ +G
Sbjct: 35  SKYKRDNPMKPPGFCMELRKHL--GKVDRIEQKGFDRILVIESG----DTKLICELFGRG 88

Query: 129 NILL 132
           N +L
Sbjct: 89  NYIL 92


>gi|212696157|ref|ZP_03304285.1| hypothetical protein ANHYDRO_00693 [Anaerococcus hydrogenalis DSM
           7454]
 gi|212676786|gb|EEB36393.1| hypothetical protein ANHYDRO_00693 [Anaerococcus hydrogenalis DSM
           7454]
          Length = 326

 Score = 42.7 bits (99), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 67/140 (47%), Gaps = 15/140 (10%)

Query: 8   TADVAAEVKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRL 66
           T  V  E+K L  L+G +   +   S    I        +   G++ K+LL   +   R+
Sbjct: 8   TRAVTFEIKKL--LLGAKIQKISQPSKNDIIL------NIYSFGKTYKLLLSANNNEARV 59

Query: 67  HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMN-AHYVI 121
           H T    +    P  F + LRKH+   ++  + Q   DR+I+F+      +G + ++ +I
Sbjct: 60  HITEKKYENPEVPPNFCMVLRKHLSQSKIIGIDQYKLDRVIVFKISSVDEMGFDVSNKLI 119

Query: 122 LELYAQ-GNILLTDSEFTVL 140
           +E+  +  NI+LTD ++ ++
Sbjct: 120 VEIMGKYSNIILTDDKYKII 139


>gi|268610540|ref|ZP_06144267.1| fibronectin-binding A-like protein [Ruminococcus flavefaciens FD-1]
          Length = 597

 Score = 42.7 bits (99), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESG-VRLHTTAYARDKKNTP 79
           LIG R   ++  S +  +  +   +G      S+K+ +   +G  R+H T  + D   TP
Sbjct: 31  LIGGRVEKIHQPSREEIVISIRTRNG------SKKLYISANAGSARVHLTEKSVDNPQTP 84

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRK + + +L D+RQ G +RI+   F
Sbjct: 85  PMFCMLLRKRLGSGKLIDIRQDGLERILFLDF 116


>gi|89895496|ref|YP_518983.1| hypothetical protein DSY2750 [Desulfitobacterium hafniense Y51]
 gi|89334944|dbj|BAE84539.1| hypothetical protein [Desulfitobacterium hafniense Y51]
          Length = 648

 Score = 42.7 bits (99), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 25/93 (26%), Positives = 47/93 (50%), Gaps = 7/93 (7%)

Query: 51  GESEKVLL-LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G+S ++LL +  +  RLH +  ++    +P  F + LRKHI   ++  + QLG +RI+L 
Sbjct: 54  GQSYRLLLNISATAARLHLSQTSKKNPASPPMFCMILRKHIEGGKILSLEQLGLERIVLL 113

Query: 110 ------QFGLGMNAHYVILELYAQGNILLTDSE 136
                 ++G     H  +  +    N++L D +
Sbjct: 114 TVQNYNEYGDLATLHLYLEIMGKHSNLILVDPQ 146


>gi|329768836|ref|ZP_08260265.1| hypothetical protein HMPREF0433_00029 [Gemella sanguinis M325]
 gi|328838229|gb|EGF87842.1| hypothetical protein HMPREF0433_00029 [Gemella sanguinis M325]
          Length = 555

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 6/90 (6%)

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNA 117
           S  R+  T  + +   TPS F   LRK++    +E++ Q+  DRII F+      LG   
Sbjct: 54  SASRIQLTNNSYENPQTPSNFCSVLRKYLMGGIIEEINQINNDRIIKFKIKNFDELGYEK 113

Query: 118 HY-VILELYAQ-GNILLTDSEFTVLTLLRS 145
           +Y +I EL  +  NI+LT+S+  ++  L++
Sbjct: 114 YYFLITELMGKHSNIILTNSDNIIIESLKN 143


>gi|395213235|ref|ZP_10400120.1| hypothetical protein O71_05359 [Pontibacter sp. BAB1700]
 gi|394456814|gb|EJF11060.1| hypothetical protein O71_05359 [Pontibacter sp. BAB1700]
          Length = 523

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 47/100 (47%), Gaps = 8/100 (8%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
              +E + ++ G+ AQ N+++ +R+  K D+++HA     S  VIK H+  + VP   L 
Sbjct: 406 LFETEGFKILVGKSAQNNDLLTQRHTYKEDIWLHAKDVSGSHVVIK-HQAGKTVPATVLE 464

Query: 623 QAGCFTVCHSQAWDSKMV----TSAWWVYPHQVSKTAPTG 658
           +A      +S+     +     T   WV   +  K AP G
Sbjct: 465 KAAQLAAYYSKRKSDTLCPVLYTPKKWV---RKPKGAPAG 501


>gi|373857258|ref|ZP_09600000.1| Fibronectin-binding A domain protein [Bacillus sp. 1NLA3E]
 gi|372452908|gb|EHP26377.1| Fibronectin-binding A domain protein [Bacillus sp. 1NLA3E]
          Length = 567

 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 53/97 (54%), Gaps = 7/97 (7%)

Query: 47  VTESGESEKVLL-LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ ++LL +  S  R+  T    +  N P  F + +RKH+    LED+ Q+  DR
Sbjct: 39  IRANGKNHRLLLSIHPSYARVQLTNEVYENPNEPPMFCMLMRKHLEGFILEDISQISLDR 98

Query: 106 IILFQFG----LG-MNAHYVILELYAQ-GNILLTDSE 136
           +I+F+      LG ++   +I+E+  +  NI+L D +
Sbjct: 99  MIVFEVKGRNELGDISCKQLIVEIMGRHSNIILVDKD 135


>gi|258511297|ref|YP_003184731.1| fibronectin-binding A domain-containing protein [Alicyclobacillus
           acidocaldarius subsp. acidocaldarius DSM 446]
 gi|257478023|gb|ACV58342.1| Fibronectin-binding A domain protein [Alicyclobacillus
           acidocaldarius subsp. acidocaldarius DSM 446]
          Length = 594

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 41/178 (23%), Positives = 80/178 (44%), Gaps = 34/178 (19%)

Query: 490 KVEVDLALSAHANARRWYELKKKQ-------ESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           ++E+D AL A ANA+R + +  K+       E+++E T+    +  +  E    LQ L +
Sbjct: 369 RIELDPALDAIANAQRLFRMAAKRKRARQWIEAERENTL----RDLRYLEDV--LQALAD 422

Query: 543 KTVANISHMRKVHWFEKF--------------------NWFISSENYLVISGRDAQQNEM 582
            ++ N+  +R+    + F                    + F SS+ +++  GR+  QN+ 
Sbjct: 423 TSLENLEEVRRELQAQGFLARADRRGTGGKRRAAESEPHAFRSSDGFVIRVGRNNVQNDR 482

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           +  R   K D+++H      S  VI+  + ++ +P  T+ +A       S+  DS  V
Sbjct: 483 LTFRRADKRDLWLHVKDAPGSHVVIERGQADE-IPERTIEEAAALAAYFSRMRDSANV 539


>gi|256828054|ref|YP_003156782.1| hypothetical protein Dbac_0239 [Desulfomicrobium baculatum DSM
           4028]
 gi|256577230|gb|ACU88366.1| protein of unknown function DUF814 [Desulfomicrobium baculatum DSM
           4028]
          Length = 498

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 34/65 (52%)

Query: 559 KFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPP 618
           K   + SS+ +L++ GR AQ N  ++ +  S  D ++HA     +  ++K   P Q VP 
Sbjct: 369 KVQAYRSSDGFLIVRGRSAQANHQLLTQAASPFDYWLHAQDGPGAHVIVKRDFPAQEVPE 428

Query: 619 LTLNQ 623
            T+ Q
Sbjct: 429 RTIQQ 433


>gi|300813244|ref|ZP_07093609.1| putative fibronectin-binding protein [Peptoniphilus sp. oral taxon
           836 str. F0141]
 gi|300512651|gb|EFK39786.1| putative fibronectin-binding protein [Peptoniphilus sp. oral taxon
           836 str. F0141]
          Length = 584

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 8/96 (8%)

Query: 57  LLLMESG--VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-- 112
           LLL  SG   R+H T    D  + P  F + LRKH+    L  + Q   DRII F F   
Sbjct: 48  LLLSASGNYPRVHLTENIIDNPSNPPAFCMLLRKHLEGSILNQITQYKMDRIIKFDFSSK 107

Query: 113 --LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLR 144
             LG +    +ILE+  +  NI+L + +  +L  L+
Sbjct: 108 DELGLLEDKSLILEIMGKYSNIILVNKDSKILDSLK 143


>gi|326201884|ref|ZP_08191754.1| Fibronectin-binding A domain protein [Clostridium papyrosolvens DSM
           2782]
 gi|325987679|gb|EGD48505.1| Fibronectin-binding A domain protein [Clostridium papyrosolvens DSM
           2782]
          Length = 592

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 49/95 (51%), Gaps = 7/95 (7%)

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLG-MN 116
           S  RLH T   ++  +TP  F + +RKH+   RL D+    Y+R+I         LG + 
Sbjct: 55  SNPRLHLTTLQKENPSTPPVFCMLMRKHVAGGRLLDISFHDYERVITLNIESVNELGDLT 114

Query: 117 AHYVILELYAQ-GNILLTDSEFTVLTLLRSHRDDD 150
              +++E+  +  NI+L +SE  ++  ++ H D D
Sbjct: 115 VKKLVVEIMGKYSNIILLNSENKIIDSVK-HVDSD 148


>gi|205373309|ref|ZP_03226113.1| fibronectin-binding protein / fibrinogen-binding protein [Bacillus
           coahuilensis m4-4]
          Length = 545

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 13/129 (10%)

Query: 15  VKCLRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYAR 73
           V  L+ LIG R + V+   P    FKL     +  +G+++K+LL    S  R+  T  + 
Sbjct: 12  VNELQPLIGGRINKVH--QP----FKLEILLNIRANGKNQKLLLSSHPSYARVQLTEQSY 65

Query: 74  DKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ- 127
           D   TP  F + LRKH+    +E++ Q   +R+I+ +      +G ++   +I+E+  + 
Sbjct: 66  DNPTTPPMFCMLLRKHLEGYIIENIYQKDLERMIIMEVKGRNEIGDISYKQLIIEIMGRH 125

Query: 128 GNILLTDSE 136
            NI+L D E
Sbjct: 126 SNIILVDKE 134


>gi|347750555|ref|YP_004858120.1| fibronectin-binding A domain-containing protein [Bacillus coagulans
           36D1]
 gi|347583073|gb|AEO99339.1| Fibronectin-binding A domain protein [Bacillus coagulans 36D1]
          Length = 572

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 45/98 (45%), Gaps = 9/98 (9%)

Query: 47  VTESGESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           V  +G++ ++LL       R+  T    D    P  F + LRKH+    +ED+RQ G DR
Sbjct: 39  VRANGKNRRLLLSAHPAYARIQLTDEPFDNPQDPPMFCMLLRKHLEGSVIEDIRQAGLDR 98

Query: 106 IILFQF-------GLGMNAHYVILELYAQGNILLTDSE 136
           +++F          +     YV + +    NI+L D E
Sbjct: 99  VVVFDIKNRNEIGDISFKQLYVEI-MGRHSNIILVDKE 135


>gi|302872268|ref|YP_003840904.1| fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor obsidiansis OB47]
 gi|302575127|gb|ADL42918.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           obsidiansis OB47]
          Length = 585

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTLRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
           +A       S+A  S  V   +    Y  +  KT P
Sbjct: 524 EAALLASYFSKAKHSTKVPVDYTFVKYVKKPPKTKP 559


>gi|158320452|ref|YP_001512959.1| fibronectin-binding A domain-containing protein [Alkaliphilus
           oremlandii OhILAs]
 gi|158140651|gb|ABW18963.1| Fibronectin-binding A domain protein [Alkaliphilus oremlandii
           OhILAs]
          Length = 593

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 51/97 (52%), Gaps = 7/97 (7%)

Query: 47  VTESGESEKVLLLMESGV-RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           V  +G++ K+LL  +S   ++H T   ++  ++P  F + LRKH+   R+ D+ Q  ++R
Sbjct: 39  VRSNGKNHKILLSADSNYPKIHFTTSNKENPSSPPNFCMVLRKHLMGGRIVDIVQPQFER 98

Query: 106 II------LFQFGLGMNAHYVILELYAQGNILLTDSE 136
           I+      L +  +  +   +I  +    NI+L DSE
Sbjct: 99  IVKIIIESLDELNILKSKELMIEIMGKHSNIILVDSE 135


>gi|336400749|ref|ZP_08581522.1| hypothetical protein HMPREF0404_00813 [Fusobacterium sp. 21_1A]
 gi|423136512|ref|ZP_17124155.1| hypothetical protein HMPREF9942_00293 [Fusobacterium nucleatum
           subsp. animalis F0419]
 gi|336161774|gb|EGN64765.1| hypothetical protein HMPREF0404_00813 [Fusobacterium sp. 21_1A]
 gi|371961666|gb|EHO79290.1| hypothetical protein HMPREF9942_00293 [Fusobacterium nucleatum
           subsp. animalis F0419]
          Length = 541

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  NI+ TD E  +
Sbjct: 77  LRKHLMNAMLTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNIIFTDEENKI 136

Query: 140 LTLLR 144
           L  L+
Sbjct: 137 LDTLK 141


>gi|282881987|ref|ZP_06290628.1| fibronectin-binding protein [Peptoniphilus lacrimalis 315-B]
 gi|281298017|gb|EFA90472.1| fibronectin-binding protein [Peptoniphilus lacrimalis 315-B]
          Length = 584

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 47/96 (48%), Gaps = 8/96 (8%)

Query: 57  LLLMESG--VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-- 112
           LLL  SG   R+H T    D  + P  F + LRKH+    L  + Q   DRII F F   
Sbjct: 48  LLLSASGNYPRVHLTENLIDNPSNPPAFCMLLRKHLEGSILNKITQYKMDRIIKFDFSSK 107

Query: 113 --LG-MNAHYVILELYAQ-GNILLTDSEFTVLTLLR 144
             LG +    +ILE+  +  NI+L + +  +L  L+
Sbjct: 108 DELGLLEDKSLILEIMGKYSNIILVNKDSKILDSLK 143


>gi|336113750|ref|YP_004568517.1| fibronectin-binding A domain-containing protein [Bacillus coagulans
           2-6]
 gi|335367180|gb|AEH53131.1| Fibronectin-binding A domain protein [Bacillus coagulans 2-6]
          Length = 572

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 7/97 (7%)

Query: 47  VTESGESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           V  +G++ ++LL       R+  T    D    P  F + LRKH+    +ED+RQ G DR
Sbjct: 39  VRANGKNRRLLLSAHPAYARIQLTDEPFDNPQDPPMFCMLLRKHLEGSVIEDIRQAGLDR 98

Query: 106 IILFQFG----LG-MNAHYVILELYAQ-GNILLTDSE 136
           +++F       +G ++   + +E+  +  NI+L D E
Sbjct: 99  VVVFDIKSRNEIGDISYKQLYVEIMGRHSNIILVDKE 135


>gi|311068085|ref|YP_003973008.1| persistent RNA/DNA binding protein [Bacillus atrophaeus 1942]
 gi|419823934|ref|ZP_14347467.1| putative persistent RNA/DNA binding protein [Bacillus atrophaeus
           C89]
 gi|310868602|gb|ADP32077.1| putative persistent RNA/DNA binding protein [Bacillus atrophaeus
           1942]
 gi|388471971|gb|EIM08761.1| putative persistent RNA/DNA binding protein [Bacillus atrophaeus
           C89]
          Length = 570

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 56/122 (45%), Gaps = 13/122 (10%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNT 78
           R+ G R + ++       IF       +  +G+++K+LL    S  R+H T    +  + 
Sbjct: 18  RITGGRITKIHQPFKHDVIFH------IRANGKNQKLLLSAHPSYSRVHLTNQTYENPSE 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQ-----GNILL 132
           P  F + LRKHI    +E + Q G DRI++F            + +LY +      NI+L
Sbjct: 72  PPMFCMLLRKHIEGGFIESIEQSGMDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIIL 131

Query: 133 TD 134
           TD
Sbjct: 132 TD 133


>gi|312135575|ref|YP_004002913.1| Fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor owensensis OL]
 gi|311775626|gb|ADQ05113.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           owensensis OL]
          Length = 585

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTLRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
           +A       S+A  S  V   +    Y  +  KT P
Sbjct: 524 EAALLASYFSKAKHSTKVPVDYTFVKYVKKPPKTKP 559


>gi|169827056|ref|YP_001697214.1| hypothetical protein Bsph_1482 [Lysinibacillus sphaericus C3-41]
 gi|168991544|gb|ACA39084.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41]
          Length = 587

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 36/65 (55%), Gaps = 1/65 (1%)

Query: 47  VTESGESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           V  +G++ K+L  + S   R+H T    +    P  F + LRKH+    +  V+QLG+DR
Sbjct: 61  VRANGKNHKLLFSIHSSYARVHLTEQTIENPAEPPMFCMLLRKHLEGGFISSVKQLGFDR 120

Query: 106 IILFQ 110
           II+ +
Sbjct: 121 IIIVE 125


>gi|227485000|ref|ZP_03915316.1| fibrinogen-binding protein [Anaerococcus lactolyticus ATCC 51172]
 gi|227236997|gb|EEI87012.1| fibrinogen-binding protein [Anaerococcus lactolyticus ATCC 51172]
          Length = 580

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/82 (24%), Positives = 41/82 (50%), Gaps = 6/82 (7%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRII------LFQFGLGMNAH 118
           R++ T    +    P  F + LRKHI   ++ D++Q G DR++      + + G   +  
Sbjct: 58  RINFTEKKYENPEKPDNFCMVLRKHINQGKIIDIKQYGLDRVVELSIVSIDEMGFDTSKK 117

Query: 119 YVILELYAQGNILLTDSEFTVL 140
            +I  +    N++LTD+ + ++
Sbjct: 118 LIIEIMGKHSNVILTDTNYKII 139


>gi|348027019|ref|YP_004766824.1| fibronectin-binding A [Megasphaera elsdenii DSM 20460]
 gi|341823073|emb|CCC73997.1| fibronectin-binding A [Megasphaera elsdenii DSM 20460]
          Length = 567

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 30/156 (19%), Positives = 70/156 (44%), Gaps = 14/156 (8%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPS 80
           L G + S +Y L  ++  F++ N +G+        +++ ++   RL+         + P+
Sbjct: 19  LKGGQISKIYQLDARSLYFRIFNDAGI------HHLVITLDDSPRLYIAETMPPTPDVPT 72

Query: 81  GFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG-LGMNAHYVILELYAQ-----GNILLTD 134
           G  + LRK+    R+  + QL  DR+I      L M+   V  +++ +      N++ T+
Sbjct: 73  GLCMFLRKYYENGRIAAIAQLHLDRLIDIDIDVLDMSGRLVTRKIHVELMGKYSNVIFTE 132

Query: 135 SEFTVLTLLRSHRDDD--KGVAIMSRHRYPTEICRV 168
               +  L+++ ++    + +A    + +P    R+
Sbjct: 133 DGTIIEALIKTGKNKQALRTIAPHEPYAFPPNFMRM 168


>gi|295696031|ref|YP_003589269.1| fibronectin-binding A domain-containing protein [Kyrpidia tusciae
           DSM 2912]
 gi|295411633|gb|ADG06125.1| Fibronectin-binding A domain protein [Kyrpidia tusciae DSM 2912]
          Length = 599

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 58/121 (47%), Gaps = 12/121 (9%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISSE   +  G++ +QN+ +  +   K D ++HA     S  VI++    + VPP TL 
Sbjct: 478 FISSEGIDIFVGKNNRQNDELTTKTAHKQDTWLHAQNIPGSHVVIRS----REVPPKTLE 533

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWVYPHQVSKTAPTGEYLTVGSFMI--RGKKNFLPPHPL 680
           +A      +S+A  +  V   + +  H V K  PTG       F++    K  F+PP P 
Sbjct: 534 EAARLAAYYSKARHAGTVAVDYTLVKH-VWK--PTG---ARPGFVLYDHQKTVFVPPDPA 587

Query: 681 I 681
           +
Sbjct: 588 L 588


>gi|89098716|ref|ZP_01171598.1| fibronectin/fibrinogen-binding protein, putative [Bacillus sp. NRRL
           B-14911]
 gi|89086678|gb|EAR65797.1| fibronectin/fibrinogen-binding protein, putative [Bacillus sp. NRRL
           B-14911]
          Length = 570

 Score = 41.6 bits (96), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G++ K+LL    S  R   T  A +  + P  F + LRKH+    LED+RQ+G DR
Sbjct: 39  IRANGKNHKLLLSAHPSYARAQLTHEAYENPSEPPMFCMLLRKHLEGYILEDIRQVGLDR 98

Query: 106 IILF 109
           I++ 
Sbjct: 99  ILIL 102


>gi|406981505|gb|EKE02969.1| hypothetical protein ACD_20C00301G0015 [uncultured bacterium]
          Length = 587

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 39/177 (22%), Positives = 77/177 (43%), Gaps = 23/177 (12%)

Query: 491 VEVDLALSAHANARRWYEL--KKKQESKQEKTITAHSKAFKAAEKKTRLQILQEKTVANI 548
           +++D   S +ANA+R+Y+L  K K  S+  K I    +      +     I Q  ++A++
Sbjct: 373 IQLDPVKSPNANAQRYYKLYNKAKTASRISKDIVRQVQEELDYLESIETFINQSDSLADL 432

Query: 549 SHMR--------------KVHWFEKF-------NWFISSENYLVISGRDAQQNEMIVKRY 587
             ++              ++   EK        + + S++ Y +  G++ +QNE ++ + 
Sbjct: 433 KQIKDELISQNLLKTTGKQIKSPEKLKKEGISLSEYTSTDGYKIYVGKNNRQNEYLISKI 492

Query: 588 MSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMVTSAW 644
            S  D+++H      S  +IK +     VP  T+ +A       SQA +S  V   +
Sbjct: 493 ASPNDIWLHTQNIPGSHVLIKINDENVEVPASTIEEAASIAAYFSQAKNSANVAVIY 549


>gi|386714204|ref|YP_006180527.1| fibronectin/fibrinogen-binding protein [Halobacillus halophilus DSM
           2266]
 gi|384073760|emb|CCG45253.1| fibronectin/fibrinogen-binding protein, putative [Halobacillus
           halophilus DSM 2266]
          Length = 578

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 30/50 (60%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRP 612
           F+SS+  L+  GR+ +QNE +  R  +K D+++HA     S  VI+N  P
Sbjct: 453 FLSSDGTLIYVGRNNKQNEYLTNRMANKSDIWLHAKDIPGSHVVIRNEDP 502


>gi|312793063|ref|YP_004025986.1| Fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor kristjanssonii 177R1B]
 gi|312180203|gb|ADQ40373.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           kristjanssonii 177R1B]
          Length = 585

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTLRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
           +A       S+A  S  V   +    Y  +  KT P
Sbjct: 524 EAALLASYFSKAKHSTKVPVDYTFVKYVKKPPKTKP 559


>gi|134299547|ref|YP_001113043.1| fibronectin-binding A domain-containing protein [Desulfotomaculum
           reducens MI-1]
 gi|134052247|gb|ABO50218.1| Fibronectin-binding A domain protein [Desulfotomaculum reducens
           MI-1]
          Length = 592

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 7/76 (9%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           F+SSE + ++ G++ +QN+ +  R  S+ D+++H      S  +I+N  P   VP  TL 
Sbjct: 471 FLSSEGFTILVGKNNKQNDYLSLRLASEDDIWLHTKDIPGSHVIIRN--PAGEVPNQTLL 528

Query: 623 QAGCFTVCHSQAWDSK 638
           +A         AW SK
Sbjct: 529 EAATLA-----AWFSK 539


>gi|384134988|ref|YP_005517702.1| fibronectin-binding A domain-containing protein [Alicyclobacillus
           acidocaldarius subsp. acidocaldarius Tc-4-1]
 gi|339289073|gb|AEJ43183.1| Fibronectin-binding A domain protein [Alicyclobacillus
           acidocaldarius subsp. acidocaldarius Tc-4-1]
          Length = 472

 Score = 41.6 bits (96), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 41/178 (23%), Positives = 80/178 (44%), Gaps = 34/178 (19%)

Query: 490 KVEVDLALSAHANARRWYELKKKQ-------ESKQEKTITAHSKAFKAAEKKTRLQILQE 542
           ++E+D AL A ANA+R + +  K+       E+++E T+    +  +  E    LQ L +
Sbjct: 247 RIELDPALDAIANAQRLFRMAAKRKRARQWIEAERENTL----RDLRYLEDV--LQALGD 300

Query: 543 KTVANISHMRKVHWFEKF--------------------NWFISSENYLVISGRDAQQNEM 582
            ++ N+  +R+    + F                    + F SS+ +++  GR+  QN+ 
Sbjct: 301 TSLENLEEVRRELEAQGFLARTNRRGTGGKRRATESEPHAFRSSDGFVIRVGRNNVQNDR 360

Query: 583 IVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
           +  R   K D+++H      S  VI+  + ++ +P  T+ +A       S+  DS  V
Sbjct: 361 LTFRKADKRDLWLHVKDAPGSHVVIERGQADE-IPERTIEEAATLAAYFSRMRDSANV 417


>gi|323489530|ref|ZP_08094757.1| hypothetical protein GPDM_09295 [Planococcus donghaensis MPA1U2]
 gi|323396661|gb|EGA89480.1| hypothetical protein GPDM_09295 [Planococcus donghaensis MPA1U2]
          Length = 554

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 7/97 (7%)

Query: 51  GESEKVLL-LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G++ K+L+ +  S  R+H TA A    + P  F + LRKHI    + ++ Q G DR+I+ 
Sbjct: 42  GKNHKLLISIHPSYSRIHLTATANVNPSEPPMFCMLLRKHIEGGVITEISQYGMDRLIML 101

Query: 110 ------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
                 + G  +     +  +    N++L D+E T++
Sbjct: 102 KIKAKNEIGDDIERELHVEMMGRHSNVILIDAERTMI 138


>gi|389815947|ref|ZP_10207184.1| hypothetical protein A1A1_03947 [Planococcus antarcticus DSM 14505]
 gi|388465441|gb|EIM07758.1| hypothetical protein A1A1_03947 [Planococcus antarcticus DSM 14505]
          Length = 554

 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 7/97 (7%)

Query: 51  GESEKVLL-LMESGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILF 109
           G++ K+L+ +  S  R+H TA A    + P  F + LRKHI    + ++ Q G DR+I+ 
Sbjct: 42  GKNHKLLISIHPSYSRIHLTAAANVNPSEPPMFCMLLRKHIEGGVITEISQYGMDRLIML 101

Query: 110 ------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
                 + G  +     +  +    N++L D+E T++
Sbjct: 102 KIKAKNEIGDDIERELHVEMMGRHSNVILIDAERTMI 138


>gi|376261569|ref|YP_005148289.1| putative RNA-binding protein, snRNP like protein [Clostridium sp.
           BNL1100]
 gi|373945563|gb|AEY66484.1| putative RNA-binding protein, snRNP like protein [Clostridium sp.
           BNL1100]
          Length = 592

 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 48/95 (50%), Gaps = 7/95 (7%)

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLG-MN 116
           S  RLH T   ++   TP  F + +RKH+   RL D+    Y+R+I         LG + 
Sbjct: 55  SNPRLHLTTLQKENPATPPVFCMLMRKHVAGGRLLDISFHDYERVITLNIESVNELGDLT 114

Query: 117 AHYVILELYAQ-GNILLTDSEFTVLTLLRSHRDDD 150
              +++E+  +  NI+L +SE  ++  ++ H D D
Sbjct: 115 VKKLVVEIMGKYSNIILLNSENKIIDSVK-HVDSD 148


>gi|332297231|ref|YP_004439153.1| fibronectin-binding A domain-containing protein [Treponema
           brennaborense DSM 12168]
 gi|332180334|gb|AEE16022.1| Fibronectin-binding A domain protein [Treponema brennaborense DSM
           12168]
          Length = 504

 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/68 (29%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           + + + +GR A +N+ +++R++   D+++HA  +      +KN RP + VP   L  A  
Sbjct: 385 DGWTIFAGRTAAENDELLRRHVKGQDMWLHARDYAGGYVFVKN-RPGKSVPLEVLLCAAN 443

Query: 627 FTVCHSQA 634
             V HS+A
Sbjct: 444 VAVYHSKA 451


>gi|237744728|ref|ZP_04575209.1| fibronectin-binding protein [Fusobacterium sp. 7_1]
 gi|229431957|gb|EEO42169.1| fibronectin-binding protein [Fusobacterium sp. 7_1]
          Length = 541

 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 32/174 (18%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  N++ TD E  +
Sbjct: 77  LRKHLMNAMLTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKI 136

Query: 140 L-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFER------TTASKLHAALTSSKEPDAN 190
           L TL + H  +  D+ + +   +  P      FE+       T S+ +  L  +K P  N
Sbjct: 137 LDTLKKFHISENFDRTLFLGETYTRPK-----FEKKLLPIDITESEFNRIL-ENKIPLTN 190

Query: 191 EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           E + V +  NN+           K  K F    NS+  +    + K+  L TVL
Sbjct: 191 EIEGVGKFLNNI-----------KSFKDFKNILNSDVKAKIYFKDKKIKLATVL 233


>gi|167751125|ref|ZP_02423252.1| hypothetical protein EUBSIR_02110 [Eubacterium siraeum DSM 15702]
 gi|167655840|gb|EDR99969.1| fibronectin-binding protein A domain protein [Eubacterium siraeum
           DSM 15702]
          Length = 587

 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKNTP 79
           L+G R   +Y  S +  I        +  +G+  K+L+   S   R+  T  A +  + P
Sbjct: 20  LVGGRIDKIYQPSREEIII------SIRSAGKHNKILISSNSMSARVCMTERAAENPSAP 73

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRKH+   +L D+ Q G +RII F F
Sbjct: 74  PMFCMLLRKHLSGGKLLDITQDGLERIINFDF 105


>gi|295101410|emb|CBK98955.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Faecalibacterium prausnitzii L2-6]
          Length = 587

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 52  ESEKVLLLMESG-VRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ 110
           E++ +LL   SG  R+  T  + +   TP  F + +RKH+   RL DVR    DRI+ F+
Sbjct: 44  ETDSLLLSARSGSARVCLTEESFENPETPPSFCMLMRKHLTGGRLLDVRMEPGDRIVYFE 103

Query: 111 F 111
           F
Sbjct: 104 F 104


>gi|291531786|emb|CBK97371.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Eubacterium siraeum 70/3]
          Length = 587

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKNTP 79
           L+G R   +Y  S +  I        +  +G+  K+L+   S   R+  T  A +  + P
Sbjct: 20  LVGGRIDKIYQPSREEIII------SIRSAGKHNKILISSNSMSARVCMTERAAENPSAP 73

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRKH+   +L D+ Q G +RII F F
Sbjct: 74  PMFCMLLRKHLSGGKLLDITQDGLERIINFDF 105


>gi|340751990|ref|ZP_08688800.1| fibronectin-binding protein [Fusobacterium mortiferum ATCC 9817]
 gi|229420957|gb|EEO36004.1| fibronectin-binding protein [Fusobacterium mortiferum ATCC 9817]
          Length = 540

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 10/75 (13%)

Query: 78  TPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG----MNAHYVILELYAQGN 129
           + +G    ++KH+    L DV QLG+DR++ F+F     LG     N ++ I+  Y+  N
Sbjct: 68  SSAGLAANMKKHLLNAMLVDVVQLGFDRVLQFKFSRINELGEVKNYNIYFEIMGKYS--N 125

Query: 130 ILLTDSEFTVLTLLR 144
            + TD E  ++ LL+
Sbjct: 126 FVFTDGENKIIDLLK 140


>gi|34763685|ref|ZP_00144610.1| Fibronectin-binding protein; Fibrinogen-binding protein
           [Fusobacterium nucleatum subsp. vincentii ATCC 49256]
 gi|27886575|gb|EAA23790.1| Fibronectin-binding protein [Fusobacterium nucleatum subsp.
           vincentii ATCC 49256]
          Length = 541

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L D+ QLG+DRI++F F     LG +  + +  E   +  N++ TD E  V
Sbjct: 77  LRKHLMNAMLTDIEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKV 136

Query: 140 LTLLR 144
           L  L+
Sbjct: 137 LDTLK 141


>gi|291556680|emb|CBL33797.1| Predicted RNA-binding protein homologous to eukaryotic snRNP
           [Eubacterium siraeum V10Sc8a]
          Length = 587

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMES-GVRLHTTAYARDKKNTP 79
           L+G R   +Y  S +  I        +  +G+  K+L+   S   R+  T  A +  + P
Sbjct: 20  LVGGRIDKIYQPSREEIII------SIRSAGKHNKILISSNSMSARVCMTERAAENPSAP 73

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF 111
             F + LRKH+   +L D+ Q G +RII F F
Sbjct: 74  PMFCMLLRKHLSGGKLLDITQDGLERIINFDF 105


>gi|260494593|ref|ZP_05814723.1| fibronectin-binding protein A [Fusobacterium sp. 3_1_33]
 gi|260197755|gb|EEW95272.1| fibronectin-binding protein A [Fusobacterium sp. 3_1_33]
          Length = 357

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 32/174 (18%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  N++ TD E  +
Sbjct: 77  LRKHLMNAMLTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKI 136

Query: 140 L-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFER------TTASKLHAALTSSKEPDAN 190
           L TL + H  +  D+ + +   +  P      FE+       T S+ +  L  +K P  N
Sbjct: 137 LDTLKKFHISENFDRTLFLGETYTRPK-----FEKKLLPIDITESEFNRIL-ENKIPLTN 190

Query: 191 EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           E + V +  NN+           K  K F    NS+  +    + K+  L TVL
Sbjct: 191 EIEGVGKFLNNI-----------KSFKDFKNILNSDVKAKIYFKDKKIKLATVL 233


>gi|354558691|ref|ZP_08977945.1| Fibronectin-binding A domain protein [Desulfitobacterium
           metallireducens DSM 15288]
 gi|353545753|gb|EHC15203.1| Fibronectin-binding A domain protein [Desulfitobacterium
           metallireducens DSM 15288]
          Length = 619

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 125/651 (19%), Positives = 239/651 (36%), Gaps = 128/651 (19%)

Query: 20  RLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLL-LMESGVRLHTTAYARDKKNT 78
           +LIG R   V     +     L N       G+S ++LL +  +  RLH T   +    +
Sbjct: 18  QLIGARIDKVVQPEKEEIHLYLRNQ------GKSLRLLLNISATAARLHLTQENKKNPTS 71

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-GLGMNAHYVILELYAQGNILLTDSEF 137
           P  F + LRKH+   ++ ++ Q+G +RI+L              L+LY +  I+   S  
Sbjct: 72  PPMFCMILRKHLEGGKILNLEQIGLERIVLITVQNYNEYGDLATLQLYLE--IMGKHSNL 129

Query: 138 TVLTLLRSHRDDDKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEPDANEPDKVNE 197
            ++       D  K V +    RY   + R  E      +    T    P   + D ++E
Sbjct: 130 ILV-------DPVKQVILDGIKRYSHAVSRHRE------VLPGRTYIIPPSQGKWDPISE 176

Query: 198 DGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGEALGYGPALSEH 257
                  +    L  +  GK  DL                     ++    G  P L+  
Sbjct: 177 SEEETFRSVL--LKDEISGKLIDL---------------------LVKHFNGISPELARE 213

Query: 258 IILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILMQNKHLG 317
           +++  GL   ++L +   ++ + +         F+ +L  + + D +P+    ++     
Sbjct: 214 VVVRAGLSLTIRLEQCGDIDLSRV---------FQGYL-TLANPDTLPQ----IEPCLYY 259

Query: 318 KDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFY------SKIESQRAE 371
           +   P++ G  T     F P    Q++      F + DAA++ FY      + +E++R  
Sbjct: 260 QSDAPSKKGLPTAF--TFVPF--QQYQGLTAETFFSLDAAIERFYHSKASNNNLEAKRGS 315

Query: 372 QQHKAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRV-- 429
            +   +E+   H +NK     E  +   ++    S +  ELI  NL  +   +  + V  
Sbjct: 316 LRKIVQEN--LHHMNKKLSIYEETMENAEKSFKYS-RWGELITANLYRITPGMTEITVED 372

Query: 430 ------------------ALANRMSWEDLARMVKEERKAGNPV--AGLIDKLYLERNCMS 469
                              + N   +  L    K   +   P+  A L +  YLE   +S
Sbjct: 373 YNEETLPQITIQLDPQLSGIDNSQRYYRLYNKAKVTLQKTEPLKAASLNEVNYLESVLLS 432

Query: 470 LLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFK 529
           L  ++   E+++  K L    ++ +     H N           +S  +K     +KA  
Sbjct: 433 LEQASTPSEIEEVHKEL----IDQEYLAGKHIN-----------KSNPKKASKYPNKANA 477

Query: 530 AAEKKTRLQILQEKTVANISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMS 589
             ++  + +  Q KT                  ++SSE  +++ G++ +QN+ +  +   
Sbjct: 478 KKKQGNKPEAPQPKT------------------YLSSEGRMILVGKNNRQNDWLTLKKGR 519

Query: 590 KGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDSKMV 640
             D+++H      S  +I     E+     TL +A    +  SQA  S  V
Sbjct: 520 PQDLWLHVKNIPGSHVLIPLEDGEEFPDDTTLEEAAALAIHFSQAKGSSQV 570


>gi|421145721|ref|ZP_15605566.1| fibronectin-binding protein-like protein A [Fusobacterium nucleatum
           subsp. fusiforme ATCC 51190]
 gi|395487876|gb|EJG08786.1| fibronectin-binding protein-like protein A [Fusobacterium nucleatum
           subsp. fusiforme ATCC 51190]
          Length = 541

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L D+ QLG+DRI++F F     LG +  + +  E   +  N++ TD E  V
Sbjct: 77  LRKHLMNAMLTDIEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKV 136

Query: 140 LTLLR 144
           L  L+
Sbjct: 137 LDTLK 141


>gi|336418046|ref|ZP_08598325.1| hypothetical protein HMPREF0401_00343 [Fusobacterium sp. 11_3_2]
 gi|336160505|gb|EGN63550.1| hypothetical protein HMPREF0401_00343 [Fusobacterium sp. 11_3_2]
          Length = 541

 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 32/174 (18%)

Query: 86  LRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNILLTDSEFTV 139
           LRKH+    L DV QLG+DRI++F F     LG +  + +  E   +  N++ TD E  +
Sbjct: 77  LRKHLMNAILTDVEQLGFDRILVFHFSRINELGEIKKYKIYFECIGKLSNVIFTDEENKI 136

Query: 140 L-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFER------TTASKLHAALTSSKEPDAN 190
           L TL + H  +  D+ + +   +  P      FE+       T S+ +  L  +K P  N
Sbjct: 137 LDTLKKFHISENFDRTLFLGETYTRPK-----FEKKLLPIDITESEFNRIL-ENKIPLTN 190

Query: 191 EPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVL 244
           E + V +  NN+           K  K F    NS+  +    + K+  L TVL
Sbjct: 191 EIEGVGKFLNNI-----------KSFKDFKNILNSDVKAKIYFKDKKIKLATVL 233


>gi|168031469|ref|XP_001768243.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680421|gb|EDQ66857.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 779

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 82/383 (21%), Positives = 159/383 (41%), Gaps = 66/383 (17%)

Query: 249 GYGPALSEHIILDTGLVPNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGY 308
           G GP L+  +I  +GL P+M  + + + E  ++ V+ L      DWL+ V+       G 
Sbjct: 376 GVGPGLAVELISRSGLSPSMDPAAMTEDEWFSLHVVWL------DWLR-VLEESTFKPGL 428

Query: 309 ILMQNKH--LGKDHPPTESGSSTQIYDEFCPLLLNQFRSREFVKFETFDAALDEFYSKI- 365
           +     +  LG D                 P +L+  +  E        A LD++Y+++ 
Sbjct: 429 VRSTGSYSVLGGDG----------------PYILSTDQDSEDAATGIL-AMLDDYYTRVY 471

Query: 366 ---ESQRAEQQHKAKEDAAFHKL-NKIHMDQENRVHTLKQEVDRSVKMAELIEYNLEDVD 421
              + Q+  QQ  AK  AA  K  +K+++ ++    ++  E  +  KMA+L+  NL   +
Sbjct: 472 ETEKFQQLRQQLVAKVSAATKKAQSKVNLFEDQIKASM--EYSKISKMADLLMANLHVCE 529

Query: 422 AAILAVRVALANRMSWEDLARMVKEERKAGNPVAGLI----DKLYLERNCMSLLLSNNLD 477
              L++ +        E+   +  + R+     A  +     KL      ++ LL+   D
Sbjct: 530 PGALSITLP---DFETEEPTTIALDPRQTALVTAQKLYKRSQKLKKSEKAVAPLLAEARD 586

Query: 478 EMDDEEKTLPVEKVEVDLALSAHANARRWYELKKKQESKQEKTITAHSKAFKAAEKKTRL 537
           E+            +V+++L       R  +L+  +E + E    A+ K   A       
Sbjct: 587 ELTYLS--------QVEVSLQQLDRYTRSTDLRSLEEVRDELVEGAYLKPIIAGTPPPSS 638

Query: 538 QILQEKTVAN--ISHMRKVHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  ++ +  +   ++MR+         F S   Y V+ GR+ +QN+++  R  ++ D++ 
Sbjct: 639 KRKKKSSPLDNFAANMRR---------FTSPSGYEVLVGRNNRQNDVLANRVATEYDLWF 689

Query: 596 HADLHGASSTVIKNHRPEQPVPP 618
           HA     S TV++       VPP
Sbjct: 690 HARNIPGSHTVLR-------VPP 705


>gi|302874785|ref|YP_003843418.1| fibronectin-binding A domain-containing protein [Clostridium
           cellulovorans 743B]
 gi|302577642|gb|ADL51654.1| Fibronectin-binding A domain protein [Clostridium cellulovorans
           743B]
          Length = 586

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 5/80 (6%)

Query: 48  TESGESEKVLLLMESGV-RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI 106
           TE  E+ K+LL   S   ++H T   ++    P  F + LRK++   ++ DV QL  DR+
Sbjct: 41  TEDNENIKLLLSANSTYPKIHVTKITKENPMNPPMFCMILRKYLSGSKILDVSQLENDRL 100

Query: 107 ILFQFG----LGMNAHYVIL 122
            + +F      G N+ Y ++
Sbjct: 101 AIIKFKSTDEFGFNSEYSLI 120


>gi|410455461|ref|ZP_11309341.1| Fibronectin-binding A domain-containing protein [Bacillus
           bataviensis LMG 21833]
 gi|409929288|gb|EKN66373.1| Fibronectin-binding A domain-containing protein [Bacillus
           bataviensis LMG 21833]
          Length = 570

 Score = 40.8 bits (94), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 49/101 (48%), Gaps = 7/101 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +  +G ++K+LL    S  R+  T  + D  + P  F + LRKHI    LED+ Q+  DR
Sbjct: 39  IRANGVNQKLLLSAHPSYARVQLTNESYDNPSEPPMFCMLLRKHIEGHILEDLYQVENDR 98

Query: 106 IILF------QFGLGMNAHYVILELYAQGNILLTDSEFTVL 140
           +I+F      + G   N   +I  +    NI++ D    V+
Sbjct: 99  MIIFEIKGRNEIGDISNKQLIIEIMGRHSNIVIVDKTRNVI 139


>gi|312127148|ref|YP_003992022.1| Fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor hydrothermalis 108]
 gi|311777167|gb|ADQ06653.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           hydrothermalis 108]
          Length = 585

 Score = 40.8 bits (94), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 2/82 (2%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTIRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAW 644
           +A       S+A  S  V   +
Sbjct: 524 EAALLASYFSKAKHSTKVPVDY 545


>gi|312621969|ref|YP_004023582.1| Fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor kronotskyensis 2002]
 gi|312202436|gb|ADQ45763.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           kronotskyensis 2002]
          Length = 585

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTIRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLV 523

Query: 623 QAGCFTVCHSQAWDSKMV 640
           +A       S+A  S  V
Sbjct: 524 EAALLASYFSKAKHSTKV 541


>gi|222529807|ref|YP_002573689.1| fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor bescii DSM 6725]
 gi|222456654|gb|ACM60916.1| Fibronectin-binding A domain protein [Caldicellulosiruptor bescii
           DSM 6725]
          Length = 585

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 2/82 (2%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  R+ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTIRFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLV 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAW 644
           +A       S+A  S  V   +
Sbjct: 524 EAALLASYFSKAKHSTKVPVDY 545


>gi|160881031|ref|YP_001559999.1| fibronectin-binding A domain-containing protein [Clostridium
           phytofermentans ISDg]
 gi|160429697|gb|ABX43260.1| Fibronectin-binding A domain protein [Clostridium phytofermentans
           ISDg]
          Length = 594

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 55  KVLLLMESGVRL-HTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF-- 111
           K+ L   +G+ L + T  ++        F + LRKH+ + R+  + Q  ++RII F+   
Sbjct: 46  KLFLSASAGLPLIYLTEQSKQNPLNAPNFCMLLRKHLNSARILSITQPDFERIIQFEIEH 105

Query: 112 --GLG-MNAHYVILELYAQ-GNILLTDSEFTVL 140
              +G +   Y+I+E+  +  NI+  D EFTV+
Sbjct: 106 LDEMGDLRKKYLIVEIMGKHSNIIFCDEEFTVI 138


>gi|241889368|ref|ZP_04776669.1| fibronectin-binding A domain-containing protein [Gemella
           haemolysans ATCC 10379]
 gi|241863911|gb|EER68292.1| fibronectin-binding A domain-containing protein [Gemella
           haemolysans ATCC 10379]
          Length = 552

 Score = 40.8 bits (94), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 33/132 (25%), Positives = 67/132 (50%), Gaps = 14/132 (10%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           ++  R + + +LS   ++F +         G++ K+ L   S   R+  T  + +  +TP
Sbjct: 19  ILNGRINKINNLSTDEFVFSV-------RKGKNLKLFLSANSSASRIQLTNNSFENPSTP 71

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRIILFQF----GLGMNAHYVIL-ELYAQ-GNILLT 133
           S F   LRK++    + ++ Q+  DRI++F+      LG   +Y ++ EL  +  NI+LT
Sbjct: 72  SNFCSVLRKYLTGGIILEINQVNNDRIVIFKIKNFDDLGYEKYYYLISELMGKHSNIILT 131

Query: 134 DSEFTVLTLLRS 145
           + +  +L  L++
Sbjct: 132 NEDNIILESLKN 143


>gi|373451686|ref|ZP_09543605.1| hypothetical protein HMPREF0984_00647 [Eubacterium sp. 3_1_31]
 gi|371967907|gb|EHO85374.1| hypothetical protein HMPREF0984_00647 [Eubacterium sp. 3_1_31]
          Length = 553

 Score = 40.8 bits (94), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 162/403 (40%), Gaps = 83/403 (20%)

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDI--VPEGYILMQNKHLGKDHPPTESGSS-TQIYD 333
           ED  I   +  +  FE+  + ++ G +  +PE +    NK     H P ++  S ++ + 
Sbjct: 133 EDGRIVDALKRIPPFENSKRTILPGAVFTLPEPH---SNKQDPYHHGPFDAEESFSKQFH 189

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
            F PLL  + + R   K E FD  L     KI           K+   FH +   H+   
Sbjct: 190 GFSPLLSKEVQYR-MHKGEAFDDIL----KKIHDSNTLYISDVKDQVYFHCIPLTHLTDT 244

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE-RKAGN 452
            R + L   +D       ++ Y  E+       VR+    +   +DL R VK E  K  +
Sbjct: 245 YRQYPLMHGMD-------ILFYEKEE------KVRI----KQQSQDLYRSVKRELHKNTS 287

Query: 453 PVAGLIDKLYLERNCMSL-----LLSNNLDEMDDEEK-TLPVEK------VEVDLALSAH 500
            +  L   L    +C        LL   + E++ +   TLP  +      + +D+     
Sbjct: 288 KLPKLKQSLAESMDCDKYREYGDLLFAYMHEIEKQPIITLPSFETGEEIAIPIDMRFDIK 347

Query: 501 ANARRWYELKKKQESKQEKTITAHSKA--------FKAAEKK-----------TRLQILQ 541
            NA RWY+  K  +SK+ ++I     A        F+A E +            R ++++
Sbjct: 348 GNANRWYQ--KYHKSKRAQSILKEQIALCEKEIAYFEAMETQLSQAGVQDAIEIREELVK 405

Query: 542 EKTV-ANISHMRK-----VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  + A  S +RK     +  +E F +    ++Y +  G++  QN+ +  +   K D ++
Sbjct: 406 QGYLRAQKSRIRKKKKQELPHYETFLF----DDYRIYVGKNNLQNDYVTWKLARKKDTWL 461

Query: 596 HA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
           HA DLHGA   +      EQP      N+A   T     AW S
Sbjct: 462 HAKDLHGAHVILTL----EQP------NEAALRTAAMLAAWYS 494


>gi|293400918|ref|ZP_06645063.1| putative fibronectin-binding protein [Erysipelotrichaceae bacterium
           5_2_54FAA]
 gi|291305944|gb|EFE47188.1| putative fibronectin-binding protein [Erysipelotrichaceae bacterium
           5_2_54FAA]
          Length = 556

 Score = 40.4 bits (93), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 162/403 (40%), Gaps = 83/403 (20%)

Query: 277 EDNAIQVLVLAVAKFEDWLQDVISGDI--VPEGYILMQNKHLGKDHPPTESGSS-TQIYD 333
           ED  I   +  +  FE+  + ++ G +  +PE +    NK     H P ++  S ++ + 
Sbjct: 136 EDGRIVDALKRIPPFENSKRTILPGAVFTLPEPH---SNKQDPYHHGPFDAEESFSKQFH 192

Query: 334 EFCPLLLNQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKIHMDQE 393
            F PLL  + + R   K E FD  L     KI           K+   FH +   H+   
Sbjct: 193 GFSPLLSKEVQYR-MHKGEAFDDIL----KKIHDSNTLYISDVKDQVYFHCIPLTHLTDT 247

Query: 394 NRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE-RKAGN 452
            R + L   +D       ++ Y  E+       VR+    +   +DL R VK E  K  +
Sbjct: 248 YRQYPLMHGMD-------ILFYEKEE------KVRI----KQQSQDLYRSVKRELHKNTS 290

Query: 453 PVAGLIDKLYLERNCMSL-----LLSNNLDEMDDEEK-TLPVEK------VEVDLALSAH 500
            +  L   L    +C        LL   + E++ +   TLP  +      + +D+     
Sbjct: 291 KLPKLKQSLAESMDCDKYREYGDLLFAYMHEIEKQPIITLPSFETGEEIAIPIDMRFDIK 350

Query: 501 ANARRWYELKKKQESKQEKTITAHSKA--------FKAAEKK-----------TRLQILQ 541
            NA RWY+  K  +SK+ ++I     A        F+A E +            R ++++
Sbjct: 351 GNANRWYQ--KYHKSKRAQSILKEQIALCEKEIAYFEAMETQLSQAGVQDAIEIREELVK 408

Query: 542 EKTV-ANISHMRK-----VHWFEKFNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYV 595
           +  + A  S +RK     +  +E F +    ++Y +  G++  QN+ +  +   K D ++
Sbjct: 409 QGYLRAQKSRIRKKKKQELPHYETFLF----DDYRIYVGKNNLQNDYVTWKLARKKDTWL 464

Query: 596 HA-DLHGASSTVIKNHRPEQPVPPLTLNQAGCFTVCHSQAWDS 637
           HA DLHGA   +      EQP      N+A   T     AW S
Sbjct: 465 HAKDLHGAHVILTL----EQP------NEAALRTAAMLAAWYS 497


>gi|56964090|ref|YP_175821.1| fibronectin/fibrinogen-binding protein [Bacillus clausii KSM-K16]
 gi|56910333|dbj|BAD64860.1| fibronectin/fibrinogen-binding protein [Bacillus clausii KSM-K16]
          Length = 568

 Score = 40.4 bits (93), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 58/125 (46%), Gaps = 15/125 (12%)

Query: 18  LRRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV--RLHTTAYARDK 75
           L++L+G R + ++       +F  + + G T      + LL   + V  RLH T    D 
Sbjct: 15  LQQLVGGRINKIHQPFKTELVFT-VRAKGKT------RALLASANAVFARLHLTTEKYDN 67

Query: 76  KNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GN 129
              P  F + LRKH+    +  + Q GYDRII+ +      LG      +I+E+  +  N
Sbjct: 68  PAEPPMFCMLLRKHLEGGIITSITQHGYDRIIVLKVANKDELGDTTEKTLIVEIMGRHSN 127

Query: 130 ILLTD 134
           I+L D
Sbjct: 128 IILVD 132


>gi|257875727|ref|ZP_05655380.1| conserved hypothetical protein [Enterococcus casseliflavus EC20]
 gi|257809893|gb|EEV38713.1| conserved hypothetical protein [Enterococcus casseliflavus EC20]
          Length = 556

 Score = 40.4 bits (93), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +   G++ K+LL    +  R+  T  A     TP  F + LRK++    LE + Q+  DR
Sbjct: 39  IRSQGKNHKLLLSAHPTYARVQCTTIAYSNPETPPNFLMMLRKYLEGAILESIEQIDNDR 98

Query: 106 IILFQFG----LG-MNAHYVILELYAQGNILL 132
           +I F F     LG + A  +I+EL  + + +L
Sbjct: 99  VIHFHFTRRDELGDLQALVLIVELMGRHSTIL 130


>gi|344996726|ref|YP_004799069.1| fibronectin-binding A domain-containing protein
           [Caldicellulosiruptor lactoaceticus 6A]
 gi|343964945|gb|AEM74092.1| Fibronectin-binding A domain protein [Caldicellulosiruptor
           lactoaceticus 6A]
          Length = 585

 Score = 40.4 bits (93), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 4/96 (4%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           FISS+ + +  GR+  QN+ +  ++ S  D+++H      S  +I+ +  E  VP  TL 
Sbjct: 466 FISSDGFDIYVGRNNLQNDFLTLKFASSHDIWLHTQKIPGSHVIIRTNNKE--VPQTTLI 523

Query: 623 QAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
           +A       S+A  S  V   +    Y  +  KT P
Sbjct: 524 EAALLASYFSKAKHSTKVPVDYTFVKYVKKPPKTKP 559


>gi|256004603|ref|ZP_05429581.1| protein of unknown function DUF814 [Clostridium thermocellum DSM
           2360]
 gi|255991475|gb|EEU01579.1| protein of unknown function DUF814 [Clostridium thermocellum DSM
           2360]
          Length = 330

 Score = 40.4 bits (93), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 563 FISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLN 622
           + S++ + +  G++  QN+ +  ++ S  D+++H      S  +I+  R E  +P  TL 
Sbjct: 209 YKSTDGFYIYVGKNNVQNDFLTLKFASSNDIWLHTKNIPGSHVIIRKDRGE--IPDSTLF 266

Query: 623 QAGCFTVCHSQAWDSKMV 640
           QA      HS+A +S  V
Sbjct: 267 QAAMLAAYHSKAKNSSHV 284


>gi|373456230|ref|ZP_09547997.1| protein of unknown function DUF814 [Caldithrix abyssi DSM 13497]
 gi|371717894|gb|EHO39665.1| protein of unknown function DUF814 [Caldithrix abyssi DSM 13497]
          Length = 555

 Score = 40.4 bits (93), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 2/99 (2%)

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPL 619
           FN  I    + V  GRD + N+++   + +K D+++HA     +  +I+     Q  P  
Sbjct: 438 FNRLIVDGKWEVYIGRDGKTNDLLTFHFANKWDIWLHAQGVSGAHVIIRVPNRNQNPPAH 497

Query: 620 TLNQAGCFTVCHSQAWDSKMVTSAWWV--YPHQVSKTAP 656
            + QA      HS+A  S  V   +    Y  ++ K  P
Sbjct: 498 IIEQAARIAAAHSKARTSSTVPVIYTQVRYVSRIRKAPP 536


>gi|307690598|ref|ZP_07633044.1| Fibronectin-binding A domain-containing protein [Clostridium
           cellulovorans 743B]
          Length = 314

 Score = 40.4 bits (93), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 5/80 (6%)

Query: 48  TESGESEKVLLLMESGV-RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRI 106
           TE  E+ K+LL   S   ++H T   ++    P  F + LRK++   ++ DV QL  DR+
Sbjct: 41  TEDNENIKLLLSANSTYPKIHVTKITKENPMNPPMFCMILRKYLSGSKILDVSQLENDRL 100

Query: 107 ILFQFG----LGMNAHYVIL 122
            + +F      G N+ Y ++
Sbjct: 101 AIIKFKSTDEFGFNSEYSLI 120


>gi|392940833|ref|ZP_10306477.1| LOW QUALITY PROTEIN: putative RNA-binding protein, snRNP like
           protein [Thermoanaerobacter siderophilus SR4]
 gi|392292583|gb|EIW01027.1| LOW QUALITY PROTEIN: putative RNA-binding protein, snRNP like
           protein [Thermoanaerobacter siderophilus SR4]
          Length = 570

 Score = 40.4 bits (93), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 46/88 (52%), Gaps = 7/88 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           +IG R   +Y    +  IF + N       G++ K+LL   +   R++ T   ++    P
Sbjct: 19  IIGGRIDKIYQPEKEELIFIIRNK------GKNYKLLLSANANYPRIYLTEENKENPAEP 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRII 107
             F + LRK +++ R+ D++Q+ +DRI+
Sbjct: 73  PMFCMLLRKFLQSGRIIDIKQVEFDRIV 100


>gi|238854976|ref|ZP_04645305.1| fibronectin/fibrinogen binding protein [Lactobacillus jensenii
           269-3]
 gi|260664462|ref|ZP_05865314.1| fibronectin-binding protein [Lactobacillus jensenii SJ-7A-US]
 gi|238832347|gb|EEQ24655.1| fibronectin/fibrinogen binding protein [Lactobacillus jensenii
           269-3]
 gi|260561527|gb|EEX27499.1| fibronectin-binding protein [Lactobacillus jensenii SJ-7A-US]
          Length = 566

 Score = 40.4 bits (93), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 8/105 (7%)

Query: 65  RLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LGMNAHYV 120
           R + T  + D       F + LRK++    L+D++Q+G DRII F F     LG     V
Sbjct: 58  RFYLTKNSLDNPKVAPTFVMVLRKYLEGSILQDIKQVGQDRIINFSFSNRNELGDEVELV 117

Query: 121 I-LELYAQGN--ILLTDSEFTVLTLL-RSHRDDDKGVAIMSRHRY 161
           + LEL  + +  IL    +  ++ LL R + D+++   ++ + +Y
Sbjct: 118 LSLELMGRHSNVILYNKQDGKIIDLLKRVNPDENRARLLLPKAKY 162


>gi|339499630|ref|YP_004697665.1| Fibronectin-binding A domain-containing protein [Spirochaeta
           caldaria DSM 7334]
 gi|338833979|gb|AEJ19157.1| Fibronectin-binding A domain protein [Spirochaeta caldaria DSM
           7334]
          Length = 468

 Score = 40.4 bits (93), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 567 ENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQAGC 626
           + + +I GRDA +N+ +++ Y+   D+++HA  +  S   IK  RP + VP   L  AG 
Sbjct: 348 KGWTLIVGRDATENDDLLRHYVKGSDLWLHARDYPGSYVFIKA-RPGKTVPLDILLDAGN 406

Query: 627 FTVCHSQA 634
             + +S+ 
Sbjct: 407 LALFYSKG 414


>gi|297544796|ref|YP_003677098.1| Fibronectin-binding A domain-containing protein [Thermoanaerobacter
           mathranii subsp. mathranii str. A3]
 gi|296842571|gb|ADH61087.1| Fibronectin-binding A domain protein [Thermoanaerobacter mathranii
           subsp. mathranii str. A3]
          Length = 570

 Score = 40.4 bits (93), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 46/88 (52%), Gaps = 7/88 (7%)

Query: 21  LIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGV-RLHTTAYARDKKNTP 79
           +IG R   +Y    +  IF + N       G++ K+LL   +   R++ T   ++    P
Sbjct: 19  IIGGRIDKIYQPEKEELIFIIRNK------GKNYKLLLSANANYPRIYLTEENKENPAEP 72

Query: 80  SGFTLKLRKHIRTRRLEDVRQLGYDRII 107
             F + LRK +++ R+ D++Q+ +DRI+
Sbjct: 73  PMFCMLLRKFLQSGRIIDIKQVEFDRIV 100


>gi|290968771|ref|ZP_06560308.1| fibronectin-binding A, N-terminal domain protein [Megasphaera
           genomosp. type_1 str. 28L]
 gi|335049115|ref|ZP_08542125.1| fibronectin-binding protein A [Megasphaera sp. UPII 199-6]
 gi|290781067|gb|EFD93658.1| fibronectin-binding A, N-terminal domain protein [Megasphaera
           genomosp. type_1 str. 28L]
 gi|333764227|gb|EGL41627.1| fibronectin-binding protein A [Megasphaera sp. UPII 199-6]
          Length = 573

 Score = 40.0 bits (92), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 20/89 (22%), Positives = 44/89 (49%), Gaps = 6/89 (6%)

Query: 19  RRLIGMRCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNT 78
           + L G + + +Y    +T  F++ +++G+        V++ ++   R++         +T
Sbjct: 17  KELTGGQITKIYQPRARTLYFRIFSATGL------HHVIITLDESPRIYIAEKMPPMPDT 70

Query: 79  PSGFTLKLRKHIRTRRLEDVRQLGYDRII 107
           PS   + LRK+    R+  +RQL  DR++
Sbjct: 71  PSALCMFLRKYYENGRISSLRQLHLDRLL 99


>gi|257866094|ref|ZP_05645747.1| conserved hypothetical protein [Enterococcus casseliflavus EC30]
 gi|257872425|ref|ZP_05652078.1| conserved hypothetical protein [Enterococcus casseliflavus EC10]
 gi|257800028|gb|EEV29080.1| conserved hypothetical protein [Enterococcus casseliflavus EC30]
 gi|257806589|gb|EEV35411.1| conserved hypothetical protein [Enterococcus casseliflavus EC10]
          Length = 566

 Score = 40.0 bits (92), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 47  VTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDR 105
           +   G++ K+LL    +  R+  T  A     TP  F + LRK++    LE + Q+  DR
Sbjct: 39  IRSQGKNHKLLLSAHPTYARVQCTTIAYSNPETPPNFLMMLRKYLEGAILESIEQIDNDR 98

Query: 106 IILFQFG----LG-MNAHYVILELYAQGNILL 132
           +I F F     LG + A  +I+EL  + + +L
Sbjct: 99  VIHFHFTRRDELGDLQALVLIVELMGRHSTIL 130


>gi|422317084|ref|ZP_16398451.1| hypothetical protein FPOG_01910 [Fusobacterium periodonticum D10]
 gi|404590236|gb|EKA92690.1| hypothetical protein FPOG_01910 [Fusobacterium periodonticum D10]
          Length = 541

 Score = 40.0 bits (92), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNI 130
           +  S     LRKH+    L D+ QLG+DRI+ F F     LG +  + +  E   +  N+
Sbjct: 68  DISSSLISNLRKHLMNAMLTDIEQLGFDRILAFHFSRINELGEIKKYKIYFECLGKLSNV 127

Query: 131 LLTDSEFTVLTLLR 144
           + TD E  VL  L+
Sbjct: 128 IFTDEEDKVLDTLK 141


>gi|294783159|ref|ZP_06748483.1| fibronectin-binding protein A (FbpA) [Fusobacterium sp. 1_1_41FAA]
 gi|294480037|gb|EFG27814.1| fibronectin-binding protein A (FbpA) [Fusobacterium sp. 1_1_41FAA]
          Length = 541

 Score = 40.0 bits (92), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNI 130
           +  S     LRKH+    L D+ QLG+DRI+ F F     LG +  + +  E   +  N+
Sbjct: 68  DISSSLISNLRKHLMNAMLTDIEQLGFDRILAFHFSRINELGEIKKYKIYFECLGKLSNV 127

Query: 131 LLTDSEFTVLTLLR 144
           + TD E  VL  L+
Sbjct: 128 IFTDEEDKVLDTLK 141


>gi|125973100|ref|YP_001037010.1| fibronectin-binding A-like protein [Clostridium thermocellum ATCC
           27405]
 gi|281417295|ref|ZP_06248315.1| Fibronectin-binding A domain protein [Clostridium thermocellum
           JW20]
 gi|385778999|ref|YP_005688164.1| fibronectin-binding A domain-containing protein [Clostridium
           thermocellum DSM 1313]
 gi|419721494|ref|ZP_14248657.1| Fibronectin-binding A domain protein [Clostridium thermocellum AD2]
 gi|419725118|ref|ZP_14252171.1| Fibronectin-binding A domain protein [Clostridium thermocellum YS]
 gi|125713325|gb|ABN51817.1| Fibronectin-binding A domain protein [Clostridium thermocellum ATCC
           27405]
 gi|281408697|gb|EFB38955.1| Fibronectin-binding A domain protein [Clostridium thermocellum
           JW20]
 gi|316940679|gb|ADU74713.1| Fibronectin-binding A domain protein [Clostridium thermocellum DSM
           1313]
 gi|380771439|gb|EIC05306.1| Fibronectin-binding A domain protein [Clostridium thermocellum YS]
 gi|380782434|gb|EIC12069.1| Fibronectin-binding A domain protein [Clostridium thermocellum AD2]
          Length = 588

 Score = 40.0 bits (92), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 565 SSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQA 624
           S++ + +  G++  QN+ +  ++ S  D+++H      S  +I+  R E  +P  TL QA
Sbjct: 469 STDGFYIYVGKNNVQNDFLTLKFASSNDIWLHTKNIPGSHVIIRKDRGE--IPDSTLFQA 526

Query: 625 GCFTVCHSQAWDSKMV 640
                 HS+A +S  V
Sbjct: 527 AMLAAYHSKAKNSSHV 542


>gi|340753767|ref|ZP_08690541.1| fibronectin-binding protein [Fusobacterium sp. 2_1_31]
 gi|229423321|gb|EEO38368.1| fibronectin-binding protein [Fusobacterium sp. 2_1_31]
          Length = 541

 Score = 40.0 bits (92), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNI 130
           +  S     LRKH+    L D+ QLG+DRI+ F F     LG +  + +  E   +  N+
Sbjct: 68  DISSSLISNLRKHLMNAMLTDIEQLGFDRILAFHFSRINELGEIKKYKIYFECLGKLSNV 127

Query: 131 LLTDSEFTVLTLLR 144
           + TD E  VL  L+
Sbjct: 128 IFTDEEDKVLDTLK 141


>gi|436837729|ref|YP_007322945.1| protein of unknown function DUF814 [Fibrella aestuarina BUZ 2]
 gi|384069142|emb|CCH02352.1| protein of unknown function DUF814 [Fibrella aestuarina BUZ 2]
          Length = 543

 Score = 40.0 bits (92), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 23/80 (28%), Positives = 41/80 (51%), Gaps = 6/80 (7%)

Query: 564 ISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHADLHGASSTVIKNHRPEQPVPPLTLNQ 623
           +  + + +  GR+A+ N+++ +RY  K D+++HA     S  VIK +R  +P P   + +
Sbjct: 433 VQVDGFTIRIGRNAKNNDLLTQRYTYKEDLWLHARDVSGSHVVIK-YRAGKPFPKTVIER 491

Query: 624 AGCFTVCHSQAWDSKMVTSA 643
           A         AW SK  T +
Sbjct: 492 AAELA-----AWYSKRRTDS 506


>gi|358467879|ref|ZP_09177544.1| fibronectin-binding protein A [Fusobacterium sp. oral taxon 370
           str. F0437]
 gi|357066553|gb|EHI76702.1| fibronectin-binding protein A [Fusobacterium sp. oral taxon 370
           str. F0437]
          Length = 538

 Score = 40.0 bits (92), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 9/135 (6%)

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNI 130
           +  S     LRKH+    L D+ QLG+DRI+ F F     LG +  + +  E   +  N+
Sbjct: 65  DISSSLISNLRKHLMNAMLTDIEQLGFDRILAFHFSKINELGEIKKYKIYFECLGKLSNV 124

Query: 131 LLTDSEFTVL-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEP 187
           + TD E  +L TL + H  +  D+ + +   +  P    ++     +     +L +S   
Sbjct: 125 IFTDEEDKILDTLKKFHISENIDRTLFLGETYSRPKYDKKILPTELSKDKFDSLLASGNV 184

Query: 188 DANEPDKVNEDGNNV 202
            +NE + V +  NN+
Sbjct: 185 FSNEVEGVGKYLNNI 199


>gi|291460975|ref|ZP_06026217.2| fibronectin-binding protein A [Fusobacterium periodonticum ATCC
           33693]
 gi|291379669|gb|EFE87187.1| fibronectin-binding protein A [Fusobacterium periodonticum ATCC
           33693]
          Length = 538

 Score = 40.0 bits (92), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 9/135 (6%)

Query: 77  NTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQFG----LG-MNAHYVILELYAQ-GNI 130
           +  S     LRKH+    L D+ QLG+DRI+ F F     LG +  + +  E   +  N+
Sbjct: 65  DISSSLISNLRKHLMNAMLTDIEQLGFDRILAFHFSKINELGEIKKYKIYFECLGKLSNV 124

Query: 131 LLTDSEFTVL-TLLRSHRDD--DKGVAIMSRHRYPTEICRVFERTTASKLHAALTSSKEP 187
           + TD E  +L TL + H  +  D+ + +   +  P    ++     +     +L +S   
Sbjct: 125 IFTDEEDKILDTLKKFHISENIDRTLFLGETYSRPKYNKKILPTELSKDKFDSLLASGNV 184

Query: 188 DANEPDKVNEDGNNV 202
            +NE + V +  NN+
Sbjct: 185 LSNEVEGVGKYLNNI 199


>gi|350270514|ref|YP_004881822.1| fibronectin-binding protein [Oscillibacter valericigenes Sjm18-20]
 gi|348595356|dbj|BAK99316.1| fibronectin-binding protein [Oscillibacter valericigenes Sjm18-20]
          Length = 581

 Score = 40.0 bits (92), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 40/89 (44%), Gaps = 6/89 (6%)

Query: 62  SGVRLHTTAYARDKKNTPSGFTLKLRKHIRTRRLEDVRQLGYDRIILFQ------FGLGM 115
           S  R+H T   RD    P  F + LRKH+   R+  + Q G +R++  +      FG   
Sbjct: 52  SNPRIHLTGQLRDNPAEPPMFCMLLRKHLVGARVAALTQPGLERLVEMELDVTDDFGQPG 111

Query: 116 NAHYVILELYAQGNILLTDSEFTVLTLLR 144
               V+  +    N++L D E  V+  LR
Sbjct: 112 KRTLVLEAMGRHSNLILLDGERRVIECLR 140


>gi|384486273|gb|EIE78453.1| hypothetical protein RO3G_03157 [Rhizopus delemar RA 99-880]
          Length = 2524

 Score = 40.0 bits (92), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 36/159 (22%), Positives = 77/159 (48%), Gaps = 13/159 (8%)

Query: 266 PNMKLSEVNKLEDNAIQVLVLAVAKFEDWLQDVISGDIVPEGYILM-------QNKHLGK 318
           PN  ++ VN+   + + +L+       +W+++++  D+ P  Y ++         K+ G 
Sbjct: 776 PNESVAMVNRFVVDMVDLLLCDNIIIREWVKEILGTDLSPALYSMLFRYMETVLAKYFGP 835

Query: 319 DHPPTESGSSTQIYDE---FCPLLLNQFR-SREFVKFETFDAALDEFYSKIESQRAEQQH 374
           D  P  + S+T   ++      L+L++   S E +    F + +D+ Y+K  ++    Q 
Sbjct: 836 DGDPICNASNTLFVEQAISVLKLILDRMEGSFENLLTVDFSSLIDQ-YAKYLNKLGNGQS 894

Query: 375 KAKEDAAFHKLNKIHMDQENRVHTLKQEVDRSVKMAELI 413
             K    F +L ++ M ++++V TL+QE     K+ E+I
Sbjct: 895 ALKIKIKFCQLTEVLMMKKDKV-TLRQEFRLRNKLLEII 932


>gi|410460737|ref|ZP_11314410.1| Fibronectin-binding A domain-containing protein [Bacillus
           azotoformans LMG 9581]
 gi|409926667|gb|EKN63823.1| Fibronectin-binding A domain-containing protein [Bacillus
           azotoformans LMG 9581]
          Length = 570

 Score = 40.0 bits (92), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 70/145 (48%), Gaps = 22/145 (15%)

Query: 25  RCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLME-SGVRLHTTAYARDKKNTPSGFT 83
           R S +Y    + Y + L+ +  +  +G+++++L+    S  RLH T    D    P  F 
Sbjct: 23  RISRIY----QPYKYDLIFT--IRANGKNQQLLISANPSYARLHITKETYDNPKEPPMFC 76

Query: 84  LKLRKHIRTRRLEDVRQLGYDRIILF------QFGLGMNAHYVILELYAQ-GNILLTDSE 136
           + LRKH+    +E + Q G +RII F      + G   +   +I+E+  +  NILL D E
Sbjct: 77  MLLRKHLEGSFIEKIEQDGLERIIKFYVRTKNEIG-DESIKILIVEVMGRHSNILLVDQE 135

Query: 137 FTVLTLLRSHRDDDKGVA-IMSRHR 160
             ++       D  K V+  ++RHR
Sbjct: 136 KNIIM------DSIKHVSPAVNRHR 154


>gi|170571420|ref|XP_001891721.1| hypothetical protein Bm1_01020 [Brugia malayi]
 gi|158603618|gb|EDP39478.1| hypothetical protein Bm1_01020 [Brugia malayi]
          Length = 1770

 Score = 39.7 bits (91), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 44/81 (54%), Gaps = 7/81 (8%)

Query: 187  PDANEPDKVNEDGNNVSNASKENLGGQKGGKSFDLSKNSNKNSNDGARAKQPTLKTVLGE 246
            P++N P  V+ +G  V  ++KE +  +K   SF  +K + +N +D +R +    KTV+  
Sbjct: 1334 PNSNSPKNVSVNG--VCKSTKEKINAEKA--SFLKAKETKENRSDSSRLRP---KTVVDA 1386

Query: 247  ALGYGPALSEHIILDTGLVPN 267
            A+    +     + DTGLVPN
Sbjct: 1387 AVSLPSSTPNGCVKDTGLVPN 1407


>gi|319937179|ref|ZP_08011586.1| fibronectin-binding protein/fibrinogen-binding protein
           [Coprobacillus sp. 29_1]
 gi|319807545|gb|EFW04138.1| fibronectin-binding protein/fibrinogen-binding protein
           [Coprobacillus sp. 29_1]
          Length = 547

 Score = 39.7 bits (91), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 25  RCSNVYDLSPKTYIFKLMNSSGVTESGESEKVLLLMESGVRLHTTAYARDKKNTPSGFTL 84
           R + +Y +S    +F +      ++S   + ++ +     RL  T+ +     +P+ FT+
Sbjct: 23  RINKIYQISQYELLFHMR-----SQSSNMKLLMSIHPMYARLQLTSLSFPTPPSPNAFTM 77

Query: 85  KLRKHIRTRRLEDVRQLGYDRIILFQFGLGMN 116
            LRKH+    LE V+Q+  DRI+   F +G N
Sbjct: 78  LLRKHLEGAYLESVKQIQLDRIVDMTF-IGTN 108


>gi|60422786|gb|AAH89999.1| Sdccag1 protein, partial [Rattus norvegicus]
          Length = 419

 Score = 39.7 bits (91), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 24/33 (72%)

Query: 680 LIMGFGLLFRLDESSLGSHLNERRVRGEEEGMD 712
           L+MGF  LF++DES +  H  ER+VR ++E M+
Sbjct: 1   LMMGFSFLFKVDESCVWRHRGERKVRVQDEDME 33


>gi|297617030|ref|YP_003702189.1| Fibronectin-binding A domain-containing protein [Syntrophothermus
           lipocalidus DSM 12680]
 gi|297144867|gb|ADI01624.1| Fibronectin-binding A domain protein [Syntrophothermus lipocalidus
           DSM 12680]
          Length = 602

 Score = 39.3 bits (90), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 71/324 (21%), Positives = 129/324 (39%), Gaps = 55/324 (16%)

Query: 334 EFCPLLL----NQFRSREFVKFETFDAALDEFYSKIESQRAEQQHKAKEDAAFHKLNKI- 388
           EF P  L    +Q    E + F + + A+D ++                   +HKL+++ 
Sbjct: 264 EFSPFSLLPMASQEAGEEVLTFASVNQAVDYYF-------------------YHKLSQLR 304

Query: 389 -HMDQENRVHTLKQEVDRSVKMAELIEYNLEDVDAAILAVRVALANRMSWEDLARMVKEE 447
            +  + N + TLK  ++++ + A L E +L   +              +W +L      +
Sbjct: 305 AYSYKTNLLRTLKAHLEKAYRKALLQEGDLVQAEKTF--------PYRTWGELLTAYGHQ 356

Query: 448 RKAGNPVAGLIDKLYLERNCMSLLLSNNLDEMDDEEKTLPVEKVEVDLALSAHANARRWY 507
            + G     LID    E   + LL               P+E  +    L A   A   +
Sbjct: 357 IEKGQTEVELIDFYTGESVTVGLL-----------PHLTPIENAQRYFKLYAKGKAAALH 405

Query: 508 ELKKKQESKQE-KTITAHSKAFKAAEKKTRLQILQEKT----VANISHMRKVHWFE---K 559
             K+ +E++QE   + +   A + AE    ++ + E+       N    RK    E   +
Sbjct: 406 AEKRLRETRQEIAYLESVQFALEQAETMDEIEEIAEELDREGYINKDKKRKARVKEERLQ 465

Query: 560 FNWFISSENYLVISGRDAQQNEMIVKRYMSKGDVYVHA-DLHGASSTV--IKNHRPEQPV 616
              F+SS+ Y ++ GR+  QNE +  +     D+++HA D+ G+   V   KN +    V
Sbjct: 466 PRMFLSSDGYKILVGRNNLQNEQLTLKASGHNDLWLHAKDVPGSHVIVRLSKNIQSIHEV 525

Query: 617 PPLTLNQAGCFTVCHSQAWDSKMV 640
           P  TL +A       S++ +S  V
Sbjct: 526 PDHTLEEAALLAAYFSKSRESDKV 549


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.130    0.368 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,288,968,295
Number of Sequences: 23463169
Number of extensions: 617686232
Number of successful extensions: 1843575
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 753
Number of HSP's successfully gapped in prelim test: 838
Number of HSP's that attempted gapping in prelim test: 1836044
Number of HSP's gapped (non-prelim): 5383
length of query: 934
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 782
effective length of database: 8,792,793,679
effective search space: 6875964656978
effective search space used: 6875964656978
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 82 (36.2 bits)